Abstract
Background
Dengue is a public health problem that presents complexity in its dissemination. The physical means of spreading and the dynamics of the spread between municipalities need to be analyzed to guide effective public policies to combat this problem.
Methods
This study uses timing varying graph methods (TVG) to construct a correlation network between occurrences of reported cases of dengue between cities in the state of Bahia-Brazil. The topological network indices of all cities were correlated with dengue incidence using Spearman correlation. A randomization test was used to estimate the significance value of the correlation.
Results
The correlation network presented a complex behavior with a heavy-tail distribution of the network edges weight. The randomization test exhibit a significant correlation (P < 0.0001) between the degree of each municipality in the network and the incidence of dengue in each municipality.
Conclusions
The hypothesis of the existence of a correlation between the occurrences of reported cases of dengue between different municipalities in the state of Bahia was validated. The significant correlation between the node degree and incidence, indicates that municipalities with high incidence are also responsible for the spread of the disease in the state. The method proposed suggests a new tool in epidemiological control strategy.
Keywords: Dengue, Correlation, Transport, Randomization, Bahia
Background
Dengue is a tropical disease of viral origin transmitted through the bite of the Aedes aegypti mosquito. Because dengue is an important arboviral disease, it is important to clearly define which control objectives can in fact be achieved and which preventive measures are required to do so. Dengue has a higher incidence in tropical countries where the climate is favorable to the proliferation of the A aegypti mosquito. In 2012, there were approximately 2.5 billion people worldwide at risk of infection. As a result, dengue is considered one of the most serious public health problems among reemerging diseases [1].
Two-fifths of the world' population is at risk of dengue infection. The lack of effective drugs and vaccines makes vector control the sole tool for primary intervention. Understanding the dengue virus, the transmitting agent, and its interactions with the host is essential for the development of epidemiological control strategies [2, 3].
Many factors are responsible for the resurgence of epidemics of classic and hemorrhagic dengue in the last years of the 20th century. Demographic and social changes, such as population growth, urbanization, and modern transport, contribute to the increased incidence and geographic spread of dengue. The prevalence of this public health problem is greater in tropical areas of Africa, Asia and the Americas because the vector does not survive at low temperatures. The epidemiological situation in Latin America is similar to the situation in Southeast Asia, where multiple serotypes circulate, thus leading to an increased number of cases of classic and hemorrhagic dengue. In 2002, Latin American countries reported more than one million cases of dengue, among which approximately 17,000 cases were of hemorrhagic dengue, which resulted in 225 deaths [4, 5]. Dengue is a major cause of morbidity in the tropics, especially prior to 1999 [6, 7].
Despite being a disease with significant impact on public health policies, there are unanswered questions about dengue’s spreading dynamics, such as the importance of the means of transport or the network of disease spreading across municipalities.
This work aims to study the mechanisms for the spread of dengue across municipalities in the state of Bahia - Brazil. We used an adapted version of the correlation network method proposed by Eguiluz et al. [8] to build the relationships between municipalities.
Correlation networks have been used to characterize various dissemination processes in complex systems, such as seismic events in California [9], rainfall indices [10], firing rates in neurons [11, 12], brain activity [8, 13], climatic factors [14], and other factors related to dengue [15, 16]. In all of them, the generated networks provided insight into new properties due to the complex structure of the interactions among network elements.
In this study, we evaluated the hypothesis that there are correlations between the occurrences of cases of dengue between municipalities in the state of Bahia and that the network generated from these correlation is related to the information on the mechanism of disease spread in the state.
Methods
To assess temporal trends in connectivity between cases of dengue, we used the mathematical basis of the time-varying graph (TVG) formalism [17–19].
Time-varying graphs (TVG)
According to the conventional formalism of graph theory, a graph is defined as G (E, V), where V represents the set of vertices or nodes and E the set of edges ei,j, with i and j ∈V such that E ⊆ V × V, i.e., each edge of the set E is defined by the ordered pair (Vi, Vj). Because a TVG is a dynamic system, the relationship between its elements is considered in a defined time interval T ⊆ N, where T represents the lifetime of the system, and Ν is the set of natural numbers. Thus, the TVG formalism in its simplified form can be defined as a graph G (V, E, F), where F represents the edge presence function, F: E × T ➔ {0, 1}. This function indicates whether there is an edge ei,j ∈E at a given time t ∈T.
Time-varying correlation networks (TVCN)
The use of the correlation between time series to build networks was originally proposed by Eguiluz et al. [8] in studies on brain activity performed with a functional magnetic resonance imaging device. In that study, the different brain regions (voxels) represented the vertices of the network, and the occurrence of significant correlations between the time series of activity represented the edges of the network. A generalization of the method developed by Eguiluz et al. [8] was proposed by Silva et al. [12]. In the proposed method, the correlations between nodes are calculated only for a time window smaller than the size of the series. Based on this modification, the authors estimated the temporal evolution of neuronal activity networks in mice by sliding the window along the neuronal firing time series.
The method proposed in this paper adds the concept of static aggregated networks (SAN) to the method proposed by Silva et al. [9]. Using the formalism of the time-varying graph, we defined a graph as G(C, M), where M is the set of vertices of the network, and C is the set of edges that represents the existence of significant correlation between the time series assessed between each vertex i and j, with i and j ∈M and C ⊆ M × M. The TVCN is a dynamic system with a range defined by T and can thus be formalized as a time-varying graph G(M, C, F), where F represents the edge presence function, F: C × T ➔ {0,1}, representing the existence (1) or non-existence (0) of correlation between the time series of a pair of vertices in a given time. Another way to understand the presence function F is that it indicates the existence of an edge Ci,j at a given time t [12].
For the TVCN, this process can be formally defined as follows:
1 |
where ci,j(t) represents the correlation between cases of dengue in the municipalities i and j within a time window of size J centered at time t. Accordingly, this definition implies that two vertices are connected in the network only if the assessed correlation ci,j(t) is high enough (). Without loss of generality, this study assumes to be constant over time for simplicity.
Once the set of TVG networks is created, we can integrate the networks in time so that
2 |
where wi,j defines the weight of the edge between vertices i and j of the static aggregated network (SAN). Thus, the edges of the SAN will represent the frequency of occurrence of the edge over period T of the TVG.
Time-varying correlation networks of cases of dengue (TVCND)
The method described in Section TVCN was applied to the daily time series of cases of dengue in 417 municipalities of the state of Bahia in Brazil for the period between 01/02/2000 and 04/16/2009. Data were collected from the database of the Notifiable Diseases Information System (Sistema de Informação de Agravos de Notificação - SINAN), an entity of the Federal Government. The SINAN database is fed by the reporting and investigation of cases of diseases and clinical conditions that appear on the national list of diseases of compulsory notification. In the creation of TVCND, the municipalities represent the vertices of the network, and their correlations in time represent the edges. The window size used was 10 days because this time is the average time for occurrence of the disease symptoms. The correlation used was the Pearson correlation, and the correlation index used was the p-value. When p-values are used to measure the correlation, the equation (1) inverts its logical expression i.e. an edge are added when .
In Figure 1, we illustrate the method applied to the dengue data. Figure 1(a) shows the time series for each municipality; Figure 1(b) shows the correlation matrix between all pairs of municipalities for the time window J. The grayscale matrix illustrates the different Pearson correlation values (p-value). The edge is considered for values of correlation below a critical value and is represented by the value 1 in the adjacency matrix. It is important to remember that small p-values indicate high correlations. The network is built from the adjacency matrix in Figure 1(c).
Once a network has been built for a time t, the window is slid by a one-day increment, and a new network is calculated. The set of all networks over time forms the TVCND.
The search for the critical value of the threshold correlation that best represents the network shows that when it is very high (represents low correlation), the noises result in the creation of many edges. Conversely, when the threshold is very low (indicating high correlations), the restrictions increase, and much information is removed from the network.
Results and discussion
Static aggregated networks of dengue in Bahia (SAN)
The criterion used to find the optimal value of was to adopt the value where the total number of edges of the one-day subgraph best correlates with the sum of all cases of municipalities of that day. In Figure 2, we show the scatterplot of the network generated for a presence function with the critical correlation that exhibited the best correlation ().
This correlation shows a coupling between the number of cases in a day and network connectivity on the same day, which indicates that the connection mechanism between the municipalities is an important factor in the spread of the disease.
The TVCN method was applied to cases of dengue in Bahia, generating 3393 subgraphs, where each subgraph represents a correlation network for a 10-day window of data. Its respective SAN was calculated so that a single weighted network is generated where the weights of the edges represent the number of days during which there was a correlation between adjacent vertices. Even assuming a high level of correlation (), many edges with low weight (<100 days) occur in the network. In Figure 3, we show the network for weights above 100 days. The figure shows that few nodes govern the pattern of correlation between cases of dengue in the municipalities. If the 100 days threshold filter is not used, the correlation between cases of dengue in municipalities present a great number of edges, blurring completely the figure, showing no information about the network connectivity.
We recall that if we do not use weights the correlation between cases of dengue in municipalities presents a great number of nodes featuring a highly connected graph.
For a better evaluation of the SAN, we calculated the cumulative distribution of the weights of the edges (Figure 4). The figure shows a heavy-tailed distribution without a defined mean. The central region of the curve, between 10 and 365 days, exhibits a power-law decrease with an exponent equal to −1.90.
Correlation between the degree of the correlation network of cases of dengue and the incidence of dengue
To seek an epidemiological interpretation of time-varying correlation networks of dengue (TVCND), we calculated the correlation between the degree of each municipality in the SAN and the incidence of dengue in each municipality. Using the randomization method [20–22], we applied Spearman correlation to 100,000 randomizations of the original data. The correlation was greater than or equal to the original correlation in only 0.001% of the set of randomizations, which leads to a p-value <0.00001. Figure 5 shows a comparison between the distribution of Spearman correlation indices of the random outputs with the correlation value obtained from the original data. This comparison indicates that, although low, the correlation is significant.
The existence of a significant correlation between the degree of correlation in SAN and the incidence of dengue in the municipality indicates that municipalities with high incidence are also responsible for the spread of the disease in the state.
The correlations between the occurrences of reported cases of dengue in different municipalities in the state of Bahia can aid in creating more efficient campaigns for prevention and the fight against dengue. The weights of the edges of the correlation network identify the most connected municipalities in the context of dengue in this state. In Table 1, in the case of an outbreak in the municipality of Abaíra, the municipalities of Jequié and Salvador should prioritize actions for preventing dengue.
Table 1.
Municipality N | Municipality M | Weight |
---|---|---|
Abaíra | Jequié | 27 |
Abaíra | Salvador | 12 |
Abaíra | Vitória da Conquista | 7 |
Abaíra | Ibotirama | 7 |
Abaíra | Fátima | 7 |
Abaíra | Itabuna | 6 |
Abaíra | Teixeira de Freitas | 3 |
Abaíra | Mairi | 3 |
Abaíra | Buerarema | 1 |
Abaíra | Caravelas | 1 |
Abaíra | Coaraci | 1 |
Abaíra | Conceição do Almeida | 1 |
Abaíra | Itajuípe | 1 |
Abaíra | Juazeiro | 1 |
Abaíra | Quijingue | 1 |
Conclusions
The correlation network of cases of dengue in Bahia shows how one municipality follows the behavior of a different municipality. The correlation between the incidence data and the correlation network itself validates the hypothesis of the existence of a correlation between the occurrences of reported cases of dengue between different municipalities in the state of Bahia.
The information generated shows municipalities with high correlation, when cases of dengue increase in a municipality, there is also an increase of cases in the other correlated municipality and vice versa. This information can guide the attention of public authorities so that when diagnosing a growth in the number of cases or even an epidemic in a municipality, campaigns can be launched in the municipalities that have the highest correlations with the municipality where the outbreak occurred. Thus, epidemics that spread through several municipalities would be minimized with regard to their spread, which would reduce its period of existence.
Acknowledgments
This work received financial support from CNPq (grant numbers 306571/2011-0 and 308785/2011-8) and Coordination of Improvement of Higher Education Personnel (CAPES).
Abbreviations
- TVG
Timing Varying Graph
- A aegypti
Aedes Aegypti
- TVCN
Time-Varying Correlation Networks
- TVCND
Time-Varying Correlation Networks of Cases of Dengue
- SINAN
Sistema de Informação de Agravos de Notificação
- SAN
Static Aggregated Networks of dengue in Bahia.
Footnotes
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
HS proposed the research question, collected and analyzed the data. VCV was responsible for the theoretical background in epidemiology and dengue. MAM did the statistical analyzes. JGVM developed the computational tools, assisted in data analysis and coordinated the entire process of research development. All authors took an active part in discussions and writing of the manuscript. All authors read and approved the final manuscript.
Contributor Information
Hugo Saba, Email: hcardoso@uneb.br.
Vera C Vale, Email: vcostavale@gmail.com.
Marcelo A Moret, Email: mamoret@gmail.com.
José Garcia V Miranda, Email: vivasm@gmail.com.
References
- 1.Braga IA, Valle D. Epidemiol Serv Saúde Brasília. 2007. Aedes aegypti: histórico do controle no Brasil [Aedes aegypti: history of control in Brazil] [Google Scholar]
- 2.Guo X, Xu Y, Bian G, Pike AD, Xie Y, Xi Z. Response of the mosquito protein interaction network to dengue infection. BMC Genomics. 2010;11:380. doi: 10.1186/1471-2164-11-380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Kappagoda S, Ioannidis J. Prevention and control of neglected tropical diseases: overview of randomized trials, systematic reviews and meta-analyses. Bull World Health Organ. 2014;92(5):356–366C. doi: 10.2471/BLT.13.129601. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Huy R, Buchy P, Conan A, Ngan C, Ong S, Ali R, Duong V, Yit S, Ung S, Te V, Chroeung N, Pheaktra NC, Uok V, Vong S. National dengue surveillance in Cambodia, 1980−2008: insights on epidemiological and virological trends and impact of vector control interventions. Bull World Health Organ. 2010;88:650–657. doi: 10.2471/BLT.09.073908. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Nogueira R, Schatzmayr H, Santos A, Cunha F, Coelho R, Souza J, Guimarães L, Araújo F, Simone E, Baran T, Teixeira M, Miagostovich G, Pereira M. Emerging Infectious Diseases. 11. Atlanta: EUA; 2005. pp. 1376–1381. [Google Scholar]
- 6.Gubler DJ, Meltzer M. Impact of dengue/dengue haemorrhagic fever on the developing world. Adv Virus Res. 1999;53:35–70. doi: 10.1016/S0065-3527(08)60342-5. [DOI] [PubMed] [Google Scholar]
- 7.Roriz-Cruz M, Sprinz E, Rosset I, Goldani L, Texeira MA. Dengue and primary care: a tale of two cities. Bull World Health Organ. 2010;88:244–245. doi: 10.2471/BLT.10.076935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Eguiluz VM, Chialvo D, Cecchi GA, Baliki M, Apkarian AV. Scale-free brain functional networks. Phys Rev Lett. 2005;94:018102. doi: 10.1103/PhysRevLett.94.018102. [DOI] [PubMed] [Google Scholar]
- 9.Abe S, Suzuki N. Scale-free network of earthquakes. Europhysics Letters. 2004;65:581–586. doi: 10.1209/epl/i2003-10108-1. [DOI] [Google Scholar]
- 10.Santana CN, Fontes AS, dos S Cidreira MA, Almeida RB, González AP, Andrade RFS, Miranda JGV. Graph theory defining non-local dependency of rainfall in Northeast Brazil. Ecol Complex. 2009;6:272–277. doi: 10.1016/j.ecocom.2009.05.011. [DOI] [Google Scholar]
- 11.Mutlu AY, Bernat E, Aviyente S. A signal-processing-based approach to time-varying graph analysis for dynamic brain network identification. Comput Math Methods Med. 2012;2012:451516. doi: 10.1155/2012/451516. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Silva BBM, Miranda JGV, Corso G, Copelli M, Vasconcelos N, Ribeiro S, Andrade RFS. Statistical characterization of an ensemble of functional neural networks. Eur Phys J E Condensed Matter Complex Systems. 2012;85:358. doi: 10.1140/epjb/e2012-30481-7. [DOI] [Google Scholar]
- 13.Quaak I, Brouns MR, van de Bor M. The dynamics of autism spectrum disorders: how neurotoxic compounds and neurotransmitters interact. revista internacional da investigação ambiental e de saúde pública. 2013;10:3384–3408. doi: 10.3390/ijerph10083384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Nakapan S, Tripathi NK, Tipdecho T, Souris M. Spatial diffusion of influenza outbreak-related climatic factors in Chiang Mai Province, Thailand. revista internacional da investigação ambiental e de saúde pública. 2012;9(11):3824–3842. doi: 10.3390/ijerph9113824. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Lin C-H, Wen T-H. Using geographically weighted regression (GWR) to explore the different spatial varying relationships of immature mosquitos and human densities with the incidence of dengue. revista internacional da investigação ambiental e de saúde pública. 2011;8(7):2798–2815. doi: 10.3390/ijerph8072798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lozano-Fuentes S, Elizondo-Quiroga D, Farfan-Ale JA, Loroño-Pino MA, Garcia-Rejon J, Gomez-Carro S, Lira-Zumbardo V, Najera-Vazquez R, Fernandez-Salas I, Calderon-Martinez J, Dominguez-Galera M, Mis-Avila P, Morris N, Coleman M, Moore CG, Beaty BJ, Eisen L. Use of Google Earth to strengthen public health capacity and facilitate management of vector-borne diseases in resource-poor environments. Bull World Health Organ. 2008;86(9):718–725. doi: 10.2471/BLT.07.045880. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Casteigts A, Flocchini P, Quattrociocchi W, Santoro N. Time-Varying Graphs and Dynamic Networks. 2010. p. 20. [Google Scholar]
- 18.Flocchini P, Mans B, Santoro N. Proc. 20th Intl. Symposium on Algorithms and Computation (ISAAC) 2009. Exploration of periodically varying graphs; p. 534543. [Google Scholar]
- 19.Tang J, Scellato S, Musolesi M, Mascolo C, Latora V. Small-world behavior in time-varying graphs. Phys Rev E Stat Nonlinear Soft Matter Phys. 2010;81(5 Pt 2):055101. doi: 10.1103/PhysRevE.81.055101. [DOI] [PubMed] [Google Scholar]
- 20.Manly BFJ. Randomization, Bootstrap and Monte Carlo Methods in Biology. Flórida: Chapman & Hall; 2006. p. 460. [Google Scholar]
- 21.Viola DN. Detecção e modelagem de padrão eários e de contagem. Piracicaba: USP: Escola Superior de Agricultura “Luiz de Queiroz”; 2007. Tese de Doutorado em Estatística e Experimentação Agronômica. [Google Scholar]
- 22.Saba H, Miranda JGV, Moret MA. Self-organized critical phenomenon as a q-exponential decay: avalanche epidemiology of dengue. Physica A Stat Mech Appl. 2014;413:205–211. doi: 10.1016/j.physa.2014.06.045. [DOI] [Google Scholar]
Pre-publication history
- The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2458/14/1085/prepub