This vignette illustrates a basic workflow how to get data from Hlídač státu API using hlidacr
package.
For accessing data from the API, you need to obtain API token at the website of Hlídač státu. To get a token, you need to register.
I store the token in the environment variable HLIDAC_TOKEN
.
For the purpose of the illustration, the following lines show how to get data from the dataset on Czech ministers’ days in office which are stored in the dataset with id ministri
. To get the data, you need to call the function get_dataset_data
which returns a list with three elements: Total, Page, and Results. Total indicates the total number of records, Page indicates the current page queried from the API and Results contain data.frame with the data. Therefore, you need to iterate over all of the pages which I do using purrr::map_df
.
library(dplyr)
library(hlidacr)
<- Sys.getenv("HLIDAC_TOKEN")
TOKEN
<- get_dataset_data("ministri", token = TOKEN)
ministers <- ministers$Total
total_records <- nrow(ministers$Results)
n_rows
<- ceiling(total_records / n_rows)
total_pages
::map_df(1:total_pages, function(x) {
purrrget_dataset_data("ministri", page = x, token = TOKEN)$Results
-> ministers_all
})
%>%
ministers_all mutate(start_date = as.Date(zacatek, format = "%Y-%m-%dT%H:%M:%S"),
end_date = as.Date(konec, format = "%Y-%m-%dT%H:%M:%S"),
term_days = end_date - start_date) -> ministers_terms
# Descriptive statistics of days in office
summary(as.numeric(ministers_terms$term_days))