This R Markdown notebook describes how to access and visualize Infodengue’s datasets using the R language. For more information about the data and how to cite them, visit here.

To execute this notebook, the following libraries are necessary:

suppressPackageStartupMessages(library(tidyverse))

Data query

The tables generated by Infodengue contain data aggregated by week, drawn from different sources. They can be queried through a form or directly from R, by means of an API request.

This functionality is available at the URL https://info.dengue.mat.br/api/alertcity?PARAMS, where PARAMS must contain the following parameters:

geocode: the city's IBGE code
disease: disease to be queried (str:dengue|chikungunya|zika)
format: file extension/format (str:json|csv)
ew_start: first epidemiological week of the query (int:1-53)
ew_end: last epidemiological week of the query (int:1-53)
ey_start: first year of the query (int:0-9999)
ey_end: last year of the query (int:0-9999)

All of the parameters above are required. The following example requests the dengue records for epidemiological weeks 1 to 52 of 2020 in Rio de Janeiro (geocode = 3304557), in CSV format:

https://info.dengue.mat.br/api/alertcity?geocode=3304557&disease=dengue&format=csv&ew_start=1&ew_end=52&ey_start=2020&ey_end=2020

How to do it using R?

1. Define the parameters

Set the parameters and check that the resulting query URL is correct:

url <- "https://info.dengue.mat.br/api/alertcity?"
geocode <- 3304557
disease <- "dengue"
format <- "csv"
ew_start <- 1
ew_end <- 52
ey_start <- 2021
ey_end <- 2021

# Build the query URL from the parameters above (do not change)
cons1 <- paste0(url,"geocode=",geocode,"&disease=",disease,"&format=",format,"&ew_start=",ew_start,"&ew_end=",ew_end,"&ey_start=",ey_start,"&ey_end=",ey_end)
cons1
[1] "https://info.dengue.mat.br/api/alertcity?geocode=3304557&disease=dengue&format=csv&ew_start=1&ew_end=52&ey_start=2021&ey_end=2021"
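The same URL can be assembled by a small function that also checks the parameters against the ranges listed earlier. This is only a sketch: `build_alertcity_url` is a hypothetical helper written for this notebook, not part of Infodengue or of any package, and it relies on base R alone.

```r
# Hypothetical helper (an assumption for illustration, not an Infodengue function):
# builds the alertcity query URL from the seven required parameters.
build_alertcity_url <- function(geocode, disease, ew_start, ew_end,
                                ey_start, ey_end, format = "csv") {
  # Accept only the values listed in the parameter description
  disease <- match.arg(disease, c("dengue", "chikungunya", "zika"))
  format  <- match.arg(format, c("json", "csv"))
  # Epidemiological weeks must lie between 1 and 53
  stopifnot(ew_start %in% 1:53, ew_end %in% 1:53)

  # sprintf("%d", ...) avoids scientific notation for round numbers
  params <- c(
    geocode  = sprintf("%d", as.integer(geocode)),
    disease  = disease,
    format   = format,
    ew_start = sprintf("%d", as.integer(ew_start)),
    ew_end   = sprintf("%d", as.integer(ew_end)),
    ey_start = sprintf("%d", as.integer(ey_start)),
    ey_end   = sprintf("%d", as.integer(ey_end))
  )
  paste0("https://info.dengue.mat.br/api/alertcity?",
         paste0(names(params), "=", params, collapse = "&"))
}

# Same query as above: dengue in Rio de Janeiro, weeks 1 to 52 of 2021
cons_check <- build_alertcity_url(3304557, "dengue", 1, 52, 2021, 2021)
cons_check
```

Passing a disease outside the three listed values, or a week outside 1-53, stops with an error before any request is sent.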

2. Run the query

dados <- read_csv(cons1, show_col_types=FALSE) %>% arrange(data_iniSE)
glimpse(dados)
Rows: 40
Columns: 21
$ data_iniSE       <date> 2021-01-03, 2021-01-10, 2021-01-17, 2021-01-24, 2021-01-31, 202…
$ SE               <dbl> 202101, 202102, 202103, 202104, 202105, 202106, 202107, 202108, …
$ casos_est        <dbl> 9, 11, 14, 30, 18, 17, 17, 25, 23, 32, 36, 43, 47, 74, 64, 41, 4…
$ casos_est_min    <dbl> 9, 11, 14, 30, 18, 17, 17, 25, 23, 32, 36, 43, 47, 74, 64, 41, 4…
$ casos_est_max    <dbl> 9, 11, 14, 30, 18, 17, 17, 25, 23, 32, 36, 43, 47, 74, 64, 41, 4…
$ casos            <dbl> 9, 11, 14, 30, 18, 17, 17, 25, 23, 32, 36, 43, 47, 74, 64, 41, 4…
$ p_rt1            <dbl> 0.64216300, 0.73637200, 0.82569900, 0.99771600, 0.56761300, 0.24…
$ p_inc100k        <dbl> 0.133377, 0.163016, 0.207475, 0.444588, 0.266753, 0.251933, 0.25…
$ Localidade_id    <dbl> 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0…
$ nivel            <dbl> 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 1, 1, 1, 1, 1, 1, 1, 1…
$ id               <dbl> 3.304557e+17, 3.304557e+17, 3.304557e+17, 3.304557e+17, 3.304557…
$ versao_modelo    <date> 2021-10-13, 2021-10-13, 2021-10-13, 2021-10-13, 2021-10-13, 202…
$ Rt               <dbl> 1, 1, 1, 2, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 1, 1, 1, 1, 1, 1, 1, 1…
$ pop              <dbl> 6747815, 6747815, 6747815, 6747815, 6747815, 6747815, 6747815, 6…
$ tempmin          <dbl> 24, 24, 24, 25, 24, 22, 24, 23, 24, 23, 24, 24, 23, 21, 20, 20, …
$ umidmax          <dbl> 90, 89, 84, 80, 85, 90, 95, 95, 90, 96, 93, 91, 93, 94, 92, 92, …
$ receptivo        <dbl> 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0…
$ transmissao      <dbl> 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0…
$ nivel_inc        <dbl> 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0…
$ notif_accum_year <dbl> 1032, 1032, 1032, 1032, 1032, 1032, 1032, 1032, 1032, 1032, 1032…
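Two relationships can be read off this output, and the sketch below checks them on the first five rows copied from it. Both are assumptions inferred from the values shown, not from Infodengue's documentation: that SE packs the year and the epidemiological week into one number (year * 100 + week), and that p_inc100k is the estimated case count per 100,000 inhabitants. The data frame `sample_rows` and the new column names are made up for this illustration.

```r
# First five rows copied from the glimpse() output above
sample_rows <- data.frame(
  SE        = c(202101, 202102, 202103, 202104, 202105),
  casos_est = c(9, 11, 14, 30, 18),
  pop       = 6747815
)

# Assumption: SE = year * 100 + epidemiological week
sample_rows$year <- sample_rows$SE %/% 100
sample_rows$week <- sample_rows$SE %% 100

# Assumption: incidence per 100,000 inhabitants = cases / population * 1e5
sample_rows$inc100k <- sample_rows$casos_est / sample_rows$pop * 1e5

sample_rows
```

For these rows the computed incidence agrees with the p_inc100k values shown above (0.133377, 0.163016, ...) to the printed precision.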

The data table

The available variables are:

3. Plotting examples

suppressPackageStartupMessages(library(ggTimeSeries))
library(ggplot2)

# Weekly count of reported dengue cases in the city of Rio de Janeiro. Arrows indicate the weekly variation.
p1 <- ggplot_waterfall(dados, 'SE', 'casos', nArrowSize = 0.8)
p1 + scale_fill_manual(
     values = c("forestgreen", "blue", "darkred")) +
     xlab("Epidemiological Week")+
     ylab("Reported Cases")+
     scale_x_continuous(breaks=dados$SE)+
     theme_light() +
     theme(legend.position="none",axis.text.x = element_text(angle = 45,size=23, hjust = 1), 
           axis.text.y = element_text(size=23),axis.title= element_text(size=30))
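The weekly variation that the arrows depict can also be computed directly as the difference between consecutive weeks. A minimal sketch in base R, using the first five values of casos shown earlier; the variable names are made up for this illustration:

```r
# First five weekly counts of `casos` from the query output
weekly_cases <- c(9, 11, 14, 30, 18)

# Week-over-week change: positive values are increases, negative values decreases
weekly_change <- diff(weekly_cases)
weekly_change
# [1]   2   3  16 -12
```

On the full table, the same result would come from `diff(dados$casos)`, assuming the rows are ordered by week as in the query above.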

suppressPackageStartupMessages(library(plotly))

#Reported cases and estimated cases
p2 <- plot_ly(dados,x = ~as.factor(SE))
p2 <- p2 %>% add_bars(y = ~casos, type = 'bar',
                      name = "Reported cases",
                      text = "Reported cases",
                      marker = list(color = 'lightblue'),
                      hovertemplate = paste("Date:",dados$data_iniSE,"<br>",
                                            "%{xaxis.title.text}: %{x}<br>",
                                            "%{text}: %{y}<br>"))%>%
             add_lines(y = ~casos_est, name = "Estimated cases",text = "Estimated cases",line =list(dash = "solid", color="blue"),
                       hovertemplate = paste("Date:",dados$data_iniSE,"<br>",
                                             "%{xaxis.title.text}: %{x}<br>",
                                             "%{text}: %{y}<br>")) %>%
             add_lines(y = ~casos_est_min, name = "Lower bound of the estimated cases",text = "Lower bound of the estimated cases", line=list(dash = "dot", color="black"),
                       hovertemplate = paste("Date:",dados$data_iniSE,"<br>",
                                             "%{xaxis.title.text}: %{x}<br>",
                                             "%{text}: %{y}<br>")) %>%
             add_lines(y = ~casos_est_max,name = "Upper bound of the estimated cases",text = "Upper bound of the estimated cases", line =list(dash ="dot", color="black"),
                       hovertemplate = paste("Date:",dados$data_iniSE,"<br>",
                                             "%{xaxis.title.text}: %{x}<br>",
                                             "%{text}: %{y}<br>")) %>%
            layout(title = list(text = "Total reported and estimated cases for Rio de Janeiro", x = 0),
                   xaxis = list(title = "Epidemiological Week",
                                tick0=202101, dtick=1,tickangle=315,tickfont = list( size=12)),
                   yaxis = list(title = "Number of cases"))

# Horizontal legend with a transparent background (plotly uses bgcolor, not ggplot2 theme elements)
p2 %>% layout(legend = list(orientation="h", x = 0.5, y = 1,
                            bgcolor = "rgba(0,0,0,0)"))
