Title: | Fetch Datasets from the Official Tunisian Data Catalog |
---|---|
Description: | Simplifies access to Tunisian government open data from <https://data.gov.tn/fr/>. Queries datasets by theme, author, or keywords, retrieves metadata, and gets structured results ready for analysis; all through the official 'CKAN' API. |
Authors: | Aymen Nasri [aut, cre, cph] |
Maintainer: | Aymen Nasri <[email protected]> |
License: | Apache License (>= 2) |
Version: | 0.1.0 |
Built: | 2025-03-23 15:22:41 UTC |
Source: | https://github.com/aymennasri/tndata |
Downloads a dataset from the Tunisian data catalog API (data.gov.tn).
download_dataset(title, download_dir, format = NULL)
download_dataset(title, download_dir, format = NULL)
title |
Character. Display name of the dataset to download. |
download_dir |
Character. Directory to save the downloaded dataset at. |
format |
Character. Format of the dataset to download. |
The demanded dataset in the demanded path.
try({ download_dataset( "Ressources en eau- Gouvernorat de Kasserine", format = "xls", download_dir = tempdir() ) })
try({ download_dataset( "Ressources en eau- Gouvernorat de Kasserine", format = "xls", download_dir = tempdir() ) })
Retrieves a list of all authors who have contributed datasets to the catalog by fetching datasets and extracting unique author information.
get_authors(max_datasets = 1000)
get_authors(max_datasets = 1000)
max_datasets |
Numeric. Maximum number of datasets to retrieve for author extraction, defaults to 1000. |
A tibble (data frame) with the following columns:
Character. Name of the author.
Numeric. Number of datasets contributed by this author.
try({ authors <- get_authors(max_datasets = 100) head(authors) })
try({ authors <- get_authors(max_datasets = 100) head(authors) })
Fetches datasets by keyword, author and/or organization.
get_datasets( keyword = NULL, author = NULL, organization = NULL, max_results = 100 )
get_datasets( keyword = NULL, author = NULL, organization = NULL, max_results = 100 )
keyword |
Character. Keyword to search for in dataset titles. |
author |
Character. Author name to filter datasets by. |
organization |
Character. Organization name to filter datasets by. |
max_results |
Numeric. Maximum number of datasets to return, defaults to 100. |
A tibble (data frame) with the following columns:
Character. Title of the dataset.
Character. ID of the dataset.
List. A list of tibbles with the following columns:
Character. Name of the dataset.
Character. Format of the dataset.
Character. URL of the dataset.
Date. Date the dataset was created.
try({ datasets <- get_datasets("agriculture") head(datasets) })
try({ datasets <- get_datasets("agriculture") head(datasets) })
Retrieves a list of unique keywords/tags from the Tunisian data catalog API.
get_keywords(limit = 10, query = NULL)
get_keywords(limit = 10, query = NULL)
limit |
Integer. Maximum number of tags to return (default: 10). |
query |
Character. Optional search string to filter tags. |
A data frame of keywords/tags with counts.
try({ get_keywords(limit = 10) })
try({ get_keywords(limit = 10) })
Retrieves organizations data from the Tunisian data catalog API (data.gov.tn) using faceted search. This function returns organizations that have published datasets.
get_organizations(min_count = 1)
get_organizations(min_count = 1)
min_count |
Integer. Minimum number of datasets an organization must have to be included in results. Default is 1, meaning only organizations with at least one dataset are returned. |
A tibble (data frame) with the following columns:
Character. Machine-readable name/identifier of the organization.
Character. Human-readable name of the organization.
Integer. Number of datasets published by the organization.
try({ # Get all organizations with at least 5 datasets orgs <- get_organizations(min_count = 5) head(orgs) })
try({ # Get all organizations with at least 5 datasets orgs <- get_organizations(min_count = 5) head(orgs) })
Fetches and summarizes themes (groups) alongside the number of datasets in each theme from the Tunisian data catalog API (data.gov.tn).
get_themes()
get_themes()
A tibble (data frame) with two columns:
Character. Name of the theme/group.
Numeric. Number of datasets in the theme.
try({ themes_summary <- get_themes() head(themes_summary) })
try({ themes_summary <- get_themes() head(themes_summary) })