Abstract
Objectives
In epidemiological research, major studies have focused on theoretical models; however, few methods of visual analysis have been used to display the patterns of disease distribution.
Design
For this study, a method combining the space-time cube (STC) with space-time scan statistics (STSS) was used to analyze the pattern of incidence of hand-foot-mouth disease (HFMD) in Guangdong Province from May 2008 to March 2009. In this research, STC was used to display the spatiotemporal pattern of incidence of HFMD, and STSS were used to detect the local aggregations of the disease.
Setting
The hand-foot-mouth disease data were obtained from Guangdong Province from May 2008 to March 2009, with a total of 68,130 cases.
Results
The STC analysis revealed a differential pattern of HFMD incidence among different months and cities and also showed that the population density and average precipitation are correlated with the incidence of HFMD. The STSS analysis revealed that the most likely aggregation includes the Shenzhen, Foshan and Dongguan populations, which are the most developed regions in Guangdong Province.
Conclusion
Both STC and STSS are efficient tools for the exploratory data analysis of disease transmission. STC clearly displays the spatiotemporal patterns of disease. Using the maximum likelihood ratio, the STSS model precisely locates the most likely aggregation.
Background
In epidemiological research, we want to detect the aggregation of a disease, and the effective visualization of the distribution of a disease is important to this end. To date, various theoretical models have been established to detect disease outbreaks in time or space [1–11]. Although precise local aggregations are obtained using these analytical, theoretical models, the results of these models are abstract and not easily represented. The use of visual analysis technologies in epidemiological studies can contribute to an understanding of the distribution of disease. Currently, most data in epidemiology are spatiotemporal data, and the visualization of spatial and temporal information is an important problem.
Some visual analysis methods have been developed to display data concerning disease distribution [12–17]. Parallel coordinates [15] can be used to visualize multi-dimensional data, but this method exhibits several shortcomings when expressing spatial relationships. Self-organizing maps (SOM) [16] are good visual tools for the analysis of clustering in multi-dimensional data, but this method cannot sufficiently represent the spatial and temporal properties. The Star map visualization technology [17] displays discrepancies in spatial properties, but this analysis needs to improve for the visualization effect in the time dimension. In the early 1970s, Hägerstrand developed a model called the “space-time cube” [18]. In this model, he used the base of a cube to denote the geographic space and the vertical axis to represent the temporal dimension. Thus, the changes in the spatial patterns in the geography could be represented by the vertical axis. Each frame, which reflects the spatial pattern at a particular time, could be obtained by slicing a particular location in the time dimension. The “space-time cube” model had been used in various fields [19], and the experimental results have demonstrated that the “space-time cube” could be effectively applied to geography, the environment and other domains.
This study changed the coordinate layouts in the traditional space-time cube (STC). In the traditional STC, the cube’s base represents the geography (along the x- and y- axes), while the cube’s height denotes time (z-axis) [20]. However, in our research, we used the x- and y-axes to represent the month and day, respectively, and the z-axis to denote geographical space. The reason for this coordinate layout was to facilitate the observation of the total incidence of disease in a given location or time.
In recent years, HFMD has rampantly spread throughout the western region of the Pacific Ocean, including China, Japan, Singapore, and Malaysia [21–23]. The incidence of HFMD, particularly among children, is increasing in these regions [24]. Every year, a large number of children are infected with HFMD. Thus, it is necessary to study the spatiotemporal pattern of HFMD to prevent the further spread of this disease.
This study combines the methods of the space-time cube [18] and space-time scan statistics [4] to analyze the spatiotemporal pattern of HFMD incidence in Guangdong, a southern province in China. The space-time scan statistic method is used for geographical disease surveillance as well as for the evaluation of geographical disease cluster alarms [25]. The spatiotemporal distribution of this disease could be visualized through the space-time cube, and disease aggregation could be achieved using space-time scan statistics. The remaining sections of this paper are organized in the following manner: the hand-foot-mouth disease data that was obtained from May 2008 to March 2009 in Guangdong Province are provided in section DATA. In section METHODS, we introduce the experimental method that was used in this research, combining the space-time cube with space-time scan statistics. Section RESULTS presents the results of the visual analysis and scan statistics, with a discussion about interesting information concerning HFMD in Guangdong as obtained from these results. Finally, we summarize our work, discuss the advantages and disadvantages of this method, and provide a description of future directions in Section DISCUSSION.
Data
The HFMD dataset that was used in this research was collected at the Chinese Center for Disease Control and Prevention (CDC) from May 1, 2008, to March 27, 2009, in Guangdong Province. Located in the southern region of China, Guangdong Province is China’s most economically developed region and has the largest population density. In recent years, the incidence of HFMD among children under 5 years of age has increased in Guangdong Province. The total number of HFMD cases in our study was 68,130. Fig 1 illustrates the number of HFMD cases from May 2008 to March 2009. The number of HFMD cases decreased over this time period, with most cases occurring in May and June 2008.
In our research, the minimum geographical unit is a prefecture-level city containing a total of 21 cities in Guangdong Province; the time resolution is one day; and the number of days is 331. The demographic characteristics of the prefecture-level cities are summarized in Table 1. As the incidence of HFMD was primarily detected among children, the calculation of HFMD incidence was based on HFMD cases in children aged 0–9 years as the numerator, and the total population aged 0–9 years was used as the denominator [26]. Population data were collected from the China Statistical Yearbook for Cities [27], and precipitation data were collected from the China Meteorological Data Sharing Service System.
Table 1. Demographic characteristics of the prefecture-level cities of Guangdong Province.
CODE | City | East longitude | Northlatitude | Area(km 2) | Population(10,000 person) | GDP(10000 Yuan) |
---|---|---|---|---|---|---|
4401 | Guangzhou | 113.54 | 23.37 | 7434 | 784.23 | 82,158,151 |
4402 | Shaoguan | 113.77 | 24.82 | 18,493 | 322.78 | 5,252,835 |
4403 | Shenzhen | 114.14 | 22.66 | 1953 | 228.07 | 78,065,387 |
4404 | Zhuhai | 113.31 | 22.21 | 1701 | 99.48 | 9,920,616 |
4405 | Shantou | 116.55 | 23.33 | 2068 | 506.60 | 9,747,835 |
4406 | Foshan | 112.95 | 23.01 | 3848 | 364.34 | 43,333,044 |
4407 | Jiangmen | 112.67 | 22.30 | 9554 | 389.78 | 11,720,185 |
4408 | Zhangjiang | 110.16 | 21.10 | 12,941 | 727.36 | 10,405,935 |
4409 | Maoming | 110.95 | 22.02 | 11,458 | 725.81 | 10,388,505 |
4412 | Zhaoqing | 112.20 | 23.54 | 15,230 | 420.27 | 7,639,416 |
4413 | Huizhou | 114.50 | 23.25 | 11,195 | 318.75 | 11,696,057 |
4414 | Meizhou | 116.08 | 24.20 | 15,899 | 504.49 | 5,299,749 |
4415 | Shanwei | 115.52 | 23.02 | 4831 | 337.82 | 2,691,674 |
4416 | Heyuan | 114.96 | 24.04 | 15,808 | 344.44 | 4,849,781 |
4417 | Yangjiang | 111.77 | 22.04 | 8091 | 273.21 | 4,839,429 |
4418 | Qingyuan | 112.87 | 24.31 | 19,194 | 405.74 | 8,000,265 |
4419 | Dongguan | 113.88 | 22.94 | 2465 | 174.87 | 37,025,344 |
4420 | Zhongshan | 113.38 | 22.52 | 1800 | 146.43 | 14,085,194 |
4451 | Chaozhou | 116.77 | 23.80 | 3113 | 256.16 | 4,428,755 |
4452 | Jieyang | 116.12 | 23.34 | 5284 | 641.21 | 7,102,884 |
4453 | Yunfu | 111.80 | 22.82 | 7980 | 272.72 | 2,912,282 |
Methods
Space-Time Cube
Generally, spatiotemporal data include three main components: Space (where), Time (When) and Attribute (What) [28–29]. Although these components could be represented using a map, time sequence or STC model, their differences are apparent. Both space and attribute information can be represented using a static map, but it is difficult to reflect the change in the attribute across the time dimension. A time series can be used to reflect the trend of an attribute across the time dimension, particularly when the attribute exhibits a specific trend. However, it is difficult to observe the distribution of the data in the space dimension. Compared with the pure map and time series, the STC model has many advantages. The STC model could simultaneously reflect the changes in attribute information in both the time and space dimensions. Therefore, we used the STC model to characterize the changes in the spatiotemporal patterns of HFMD incidence.
The coordinate layout that was used in this study was different from the traditional space-time cube. We used one horizontal axis to represent the month, another horizontal axis to denote the day, and the vertical axis to display the geographical space (Fig 2). The reason for this coordinate layout was to facilitate the observation of the total HFMD incidence in a given time or space. Along the vertical axis, geographical units could be arranged according to the environmental factors; through this arrangement, the impact between these factors and the incidence of HFMD could be observed. For example, if the geographical units are arranged in accordance with the population density, we could observe whether the population density impacts the incidence of HFMD; if the geographical units are arranged according to the average temperature, then the impact of the average temperature on the incidence of HFMD could be observed.
We observed the incidence of HFMD from the visible faces of the space-time cube (Fig 2). For example, on the front face where the month dimension is fixed, the entire phenomenon occurring in a particular month could be observed from the plane. Every face is divided into crisscross grids, and the color of each grid represents the incidence of HFMD. We use red to denote high incidence and blue to denote low incidence. The operations of the space-time cube include slice, dice, drill-down, roll-up and pivot [30].
Space-Time Scan Statistics
Kulldorff proposed the spatial scan statistic model in 1997. In this study, Kulldorff used a series of variable scanning windows to detect likely clusters in space [3]. Subsequently, to evaluate space-time cluster alarms, Kulldorff proposed space-time scan statistics and used brain cancer cases in Los Alamos to confirm this model [4]. The software SaTScan facilitates the application of these methods [32–33].
The entire research region G is divided into a series of sub-regions {z i | i = 1, 2, …, k}. There are no gaps between these sub-regions, and each sub-region is not overlapped or intersected by other sub-regions. To calculate the relative risk coefficients (p i) of the disease in all of the sub-regions, this model assumes null and alternative hypotheses. The null hypothesis is that the relative risk coefficients (p i) of the disease in all of the sub-regions are identical. The alternative hypothesis is that there is always an aggregation; within the aggregation, the relative risk coefficients (p i) of the disease are identical; and outside of the aggregation, the relative risk coefficients (q i) of the disease are also identical, while p i > q i [25].
In space-time permutation scan statistic model, the scanning windows were defined as thousands or millions of overlapping cylinders, each being a possible candidate for an outbreak [32]. The base of the cylinder denotes the geographical space of the potential outbreak. Generally, we first iterate over a finite number geographical grid points and then gradually increase the radius of circle from zero to some maximum value defined by the user [32]. The height of the cylinder denotes the number of days. If cases obey a certain distribution (e.g., Poisson distribution), then the disease cases and population inside and outside of the cylinder are obtained, together with the expected number of cases and the population at risk. For each cylinder, the likelihood is calculated using these numbers. The cylinder, maximum likelihood, and expected number of cases represent the most likely cluster [3].
Building the HFMD database
To facilitate the visualization and analysis of the data, the data format should be normalized. In this study, we constructed the HFMD database with three sub-databases. Although the database construction might appear to be application-specific, the properties in disease data are similar. Therefore, these sub-datasets are typically used for other disease studies, and even for other types of data.
The first dataset is prepared for the visualization of the space-time cube and includes 35 fields (Table 2). The field CODE represents the administrative code of the prefecture-level city, which is usually a six-digit number in China. The field NAME shows the name of the prefecture-level city, and the field POPU denotes the population aged 0–9 years in each prefecture-level city. The field MONTH denotes the month when the HFMD data were collected. The remaining fields from DAY1 to DAY31 denote the number of HFMD cases from the first day to the last day of each month. If a month is less than 31 days, then the missing days are assigned a value of -1.
Table 2. Data fields that were prepared for the space-time cube.
Field name | Type | Description |
---|---|---|
CODE | Numeric | The administrative code of the city |
NAME | Text | The name of the city |
POPU | Numeric | The population aged 0–9 years in each city |
MONTH | Numeric | The month when the HFMD cases were recorded |
DAY1-DAY31 | Numeric | The count of HFMD cases that were collected from the first day to the last day of one month |
The second dataset is used for space-time scan statistics and is organized using Microsoft Excel spreadsheets. The dataset involves seven fields, among which the fields CODE, NAME and POPU have the same meanings as in the first dataset. The field TIME denotes the day when the HFMD data were collected. In accordance with the SaTScan software, the TIME field is presented as “year/month/day”. The fields LONGITUDE and LATITUDE denote the longitude and latitude of the city, respectively.
The last dataset is an ESRI shapefile that stores the city boundaries in Guangdong Province. The most important fields are SHAPE, CODE and NAME. The fields CODE and NAME have the same meanings as in the first dataset. The field SHAPE denotes the administrative boundaries of the 21 prefecture-level cities in Guangdong Province.
The integration of the three datasets is required to build the HFMD database. First, records will be extracted from the first dataset to generate the space-time cube; then, the space-time clusters will be achieved from the SaTScan files; and finally, the aggregation will be displayed on the map.
Results
Space-Time Cube of the HFMD incidence in Guangdong Province
The space-time cube model visualizes the incidence of HFMD in Guangdong Province. To reflect all of the information for HFMD, the target needs to accurately observe the incidence in each city and on each day. There are a total of twenty-one prefecture-level cities; therefore, the number of lines on the vertical axis is twenty-one. In this study, we arranged the vertical axis according to the administrative code. As shown in Fig 3, the horizontal axis to the right represents the months and includes 11 months, from May 2008 to March 2009; the horizontal axis to the left shows the day of the month. To construct an integration cube, every month was assumed to contain 31 days. If the number of days in some months is less than 31 (for example, June has only 30 days), the missing days are filled with black. The total 21 prefecture-level cities are displayed on the vertical axis. The color from blue to red in each grid cell denotes the incidence from low to high, respectively.
The incidence of HFMD in all of the cities during a given period of time could be represented in the STC model. The left plane of the STC (Fig 3) displays the incidence of HFMD in May 2008. As observed from this plane, most grids are red or yellow, indicating the serious incidence of HFMD in May 2008. After slicing the STC along the direction parallel with the left plane, we obtained another plane. This plane displays the HFMD incidence of another month. From the various colors of the grid cells, we can easily decipher the incidence of HFMD.
We could also display the HFMD incidence in one city using the STC model. For example, the upper plane represents the incidence of HFMD in Guangzhou, which is the capital of Guangdong. From this plane, it is easy to observe that there are more blue grids than red grids, indicating that Guangzhou had a mild incidence of HFMD from May 2008 to March 2009.
The STC provides all of the information about the HFMD incidence, and users can obtain the information concerning HFMD for a given period of time or sub-region through the “slice” and “dice” operations. The “slice” operation also facilitates the exploration of the relationship between the incidence of the HFMD and the geographic space or time.
Patterns of the HFMD incidence by time
The relationships between the incidence of HFMD and months could be observed using the STC model. This method compares two different planes, which are sliced along the month axis. In this section, we selected two months, May and October, in 2008.
Fig 4A shows the HFMD incidence in Guangdong Province in May 2008, where the horizontal axis denotes the days from 1–31, and the vertical axis denotes the cities, with 21 cities in total. Fig 4B presents the same information that is depicted in Fig 4A, except for the month of October. Fig 4C shows the difference in results between May and October.
Some interesting results can be obtained through the comparison operation. Fig 4A indicates the high HFMD incidence in May 2008, as most of the cells are red. From this sub figure, we know that only four cities have red cells before May 5, while others have blue or green, and the HFMD incidence is not serious at this stage. However, the number of red cells increases after May 5. Another 5 cities become red on May 5, and these cities remain red through the next week, indicating that the HFMD incidence is serious across the Guangdong Province. From May 5, most cities become yellow or green, except for Zhuhai, which is a southern city in Guangdong. Throughout May, Zhuhai has the most serious HFMD incidence, as all 25 cells are red. There are no red cells in Jieyang, Shanwei or Shantou. The HFMD incidence was not serious in these cities, which are all located in the southeast of Guangdong. Most of the cities that have red cells, such as Guangzhou and Shenzhen, are located in the south of Guangdong. All of these cities have a strong economy, and the population mobility is relatively high.
As shown in Fig 4B, most of the cells in October 2008 were green, except for Shantou. The HFMD incidence in Shantou remained high for the entire month. There were no blue cells in Shantou, and most of the cells in the last two weeks were red. Fig 4C depicts the difference in incidence between May and October 2008. From the subfigure, we could easily observe that the incidence in May was heavier than in October of 2008. Some other months were also extracted from the STC model (not shown).
Previous studies found that HFMD will massively break out in late spring and early summer [34]. Meteorological factors, including temperature, relative humidity, and rainfall, have a strong influence on the spread of the disease [16,26,34].Because of the rise of the temperature and the increase of precipitation in late spring, the HFMD will spread very quickly. Most previous researches disclosed the discrepancy of the HFMD incidence among seasons through digital data. However, in this study, Fig 4A and 4B clearly displayed the difference between May and October by means of the color of the graphical visualization.
Patterns of HFMD incidence by geography
The relationships between the incidence of HFMD and geography can be observed using the STC model. When slicing the STC along the vertical axis, we can visualize the HFMD incidence during eleven months in one city.
Fig 5 shows the comparison of HFMD incidence between Guangzhou and Zhanjiang from May 2008 to March 2009. As shown in Fig 5A and 5C, the HFMD outbreaks primarily occurred in May and June 2008, indicating that HFMD typically occurs in the spring and summer. However, Guangzhou experienced a serious HFMD incidence in November 2008 and March 2009, as the green cells show, perhaps reflecting the large population density and rapid population mobility of Guangzhou. Fig 5C displays the difference between the two cities, showing that the HFMD incidence was more serious in Guangzhou than that in Zhanjiang.
Previous studies found that socio-economic factors have a strong influence on the spread of HFMD incidence [16]. Among the socio-economic factors, the population density, population mobility and industrial structure have a serious impact [34]. In this study, the HFMD incidence was more serious in Guangzhou than that in Zhanjiang by means of graphical visualization. Some possible explanations are as follows: (1) Guangzhou is the capital of Guangdong province, while Zhanjiang is a relatively remote city. The population density and mobility in Guangzhou is more than that in Zhanjiang. This provides the population conditions for the spread of HFMD. (2) The tertiary industry occupied a larger proportion in Guangzhou, while the first industry was largest in Zhanjiang. Through the STC model, we found the HFMD incidence concentrated in the Pearl River Delta region, including Guangzhou, Shenzhen, Foshan and Dongguan. The tertiary industry was activity in these regions, and this provided a favorable condition for the spread of HFMD.
Patterns of HFMD incidence by environmental factors
The STC model not only compares discrepancies in time and space but also observes the determinants of the incidence of HFMD. When the order of the vertical axis is arranged in accordance with the order of a determinant, the STC model can be used to observe the influence of the factor on the incidence of HFMD. We have conducted many experiments, including socio-economic and meteorological studies, and observed some phenomena between these factors and the incidence of HFMD. The order of the vertical axis in Fig 6A is arranged according to the size of the population density. As this picture shows, when the population density is relatively small, the outbreak of HFMD (the red units) occurs relatively late, and when the population density gradually increases, the outbreak of HFMD occurs earlier. This phenomenon might reflect a pattern between population density and the incidence of HFMD.
The order of the vertical axis in Fig 6B is arranged according to the amount of precipitation. As this picture shows, the incidence of HFMD is not serious in the three cities where the monthly precipitation is the least. When the precipitation increases, the number of red cells increases, particularly when the precipitation is within a certain range. This phenomenon might indicate that precipitation is related to the incidence of HFMD.
In this research, the association between the HFMD incidence with season, geography and rainfall was discovered through graphical visualization. Because the STC model does not assume that the disease data should obey some statistical distribution, avoiding the error that is caused by an a priori assumption. A further analysis could be conducted by means of the result of the STC model.
Space-time scan statistics of HFMD’s incidence in Guangdong Province
In the present study, we used a circle as the scanning window for the geographic space. The population of a cylinder is less than 10% of the total population, and the disease case model is a discrete Poisson model. The scanning results are shown in Table 3:
Table 3. The results of space-time scan statistics.
Clusters | Regions | Time | No.Cases | No.Expected | LLR | P-Value |
---|---|---|---|---|---|---|
1 | Shenzhen | 2008/5/3–5/16 | 4683 | 196.37 | 10,517.9 | 1.00E-17 |
2 | Foshan | 2008/5/3–5/16 | 3071 | 169.56 | 6056.72 | 1.00E-17 |
3 | Dongguan | 2008/5/4–5/17 | 1866 | 173.68 | 2759.46 | 1.00E-17 |
4 | Shaoguan, Qingyuan, Heyuan | 2008/5/4–7/10 | 4555 | 1300.39 | 2535.94 | 1.00E-17 |
5 | Maoming, Yangjiang | 2008/5/4–5/17 | 2022 | 253.75 | 2451.65 | 1.00E-17 |
To effectively visualize the results of the space-time scan statistics, we used colorful cylinders to display these aggregations (Fig 7). In Fig 7, the deep red cylinders denote the first three high aggregations, and the yellow cylinders represent the fourth and fifth high aggregations. The cylinders are located in clusters, where the circle of the base of the cylinder is proportional to the largest likely ratio (LLR), the height of the cylinder denotes the time period of the clusters, and the base of the cylinder denotes the geographical space of the HFMD outbreak.
As shown in the Fig 7, the most likely cluster is Shenzhen, and the cluster time is from May 3 to May 16, 2008. The largest likelihood ratio is 10517.9, and the p-value equals 1e-17, indicating that the result is highly significant. The second and third most likely clusters are Foshan and Dongguan, respectively, both with high significance. These three aggregations are located in the middle of Guangdong Province, representing the most economically developed areas of Guangdong. The population density and mobility are high. These factors might reflect the high degree of aggregation of the incidence of Hand- foot-mouth disease.
Regarding time, all of the clusters occurred in May 2008, lasting for half of the month, while some periods lasted until July (e.g., Shaoguan), when the temperature gradually increased and an increased amount of precipitation was observed.
Discussion
This article applied a method combining the spatiotemporal cube with space-time scan statistics to explore the incidence of HFMD from May 2008 to March 2009 in Guangdong Province. Through the STC model and STSS model, we found that high HFMD incidences were located in the most economically developed areas of Guangdong, and serious HFMD incidence were focused in May 2008, especially in the first half of the month.
Using the STC, the HFMD incidence across the regions and different times was easily observed. The differences between the geography and seasons were also easily explored through the STC. The STC reflects more detailed information, and these details are difficult to observe through digital data. The space-time scan statistics accurately calculated the maximum likely cluster and tested the cluster significance through a Monte Carlo simulation. This article used the STSS model to detect HFMD in Guangdong Province and found that the maximum possible aggregations were located in Shenzhen, Foshan and Dongguan. These areas are located in the southern coastal area of Guangdong Province and have the most developed economies, with large population densities.
Previous studies have shown that population density have a strong influence on the spread of HFMD [22, 23, 26, 34, 35]. Cao et al. found that urban areas have a higher incidence of HFMD than rural areas, and cities with a higher population density and stronger population mobility suffer more from HFMD [36]. Huang et al. found that tertiary industry had a greater impact on HFMD than first industry. This research further confirmed that serious HFMD incidence was located in the most economically developed areas of Guangdong [34]. To prevent a large-scale outbreak of hand, foot and mouth disease, it is necessary to monitor the HFMD incidence in these regions.
We found that high HFMD incidences were focused in May 2008, especially in the first half of the month. Previous studies have shown a clear seasonal pattern for HFMD outbreaks, with a peak in late spring and early summer [16,21,37]. Compared with the previous results, on the one hand, we confirmed the seasonal pattern of HFMD incidence through the visualization analysis; on the other hand, it is easy to detect HFMD incidence across all cities at the same time through the STC analysis model. Our study also showed that precipitation has a certain effect on the incidence of HFMD. Hii et al. found that precipitation has a 1–2-week lag effect on the incidence of HFMD [38]. Wang et al. used the S-BME spatial-temporal model and found that there is a strong relationship between the incidence of HFMD and monthly rainfall [16]. Compared with the previous research, we described the relationship between HFMD incidence with rainfall through visualization, and this result was easy to understand.
Both the STC and STSS reflect the spatiotemporal clusters of this epidemic, but there are some differences between them. Because the STC only provides relatively rough clusters, which rely on the colors of the plane or grids, and because the STSS uses the maximum likelihood ratio statistical technique, precise aggregations can be calculated. Generally, it is better to obtain information for precise aggregations; however, information from other regions is ignored in this process.
The STC has some distinguishing features. First, the STC displays comprehensive information concerning the epidemic. The STC is a three-dimensional visual tool, through which the information for any location or time can easily be displayed. In addition, the STC provides many functions that have been previously illustrated. These functions can solve many spatiotemporal problems, such as the relationship of the epidemic incidence between two months. Thus, the STC is not only a visualization tool but also a powerful analytical tool. Moreover, the STC does not assume that the disease data should obey some statistical distribution, thereby avoiding the error that is caused by an a priori assumption.
The STSS also has distinguishing features. The most important feature of STSS is accuracy, as this method uses the maximum likelihood ratio statistical method. Because the maximum likelihood ratio reflects the information both inside and outside of the clusters, the probability of the occurrence of the disease can objectively be explained. Due to the distinctive features of these two models, it is necessary to analyze the disease data in stages. First, we constructed a database of disease data, and an STC model was constructed to observe the overview of the disease distribution. Subsequently, the STSS model was used to precisely calculate aggregations.
There are some shortcomings of STC, among which color grading is the biggest deficiency. Different colors greatly influence the visual effects. Different colors are assigned to some data, which only display small differences. Even using color, it is difficult to visualize the actual differences between these data. Classifying the data to obtain a better visual effect is the next step in developing future applications with this technology. Another area of concern is the arrangement on the vertical axis. The regions in geographic area have spatial autocorrelation; when they are projected on the vertical axis, these attributes are lost. Thus, the arrangement of these geographic units is also an important study target.
Conclusion
This article applied a method combining the spatiotemporal cube with space-time scan statistics to explore the incidence of HFMD from May 2008 to March 2009 in Guangdong Province. Through the STC model, we observed that high HFMD incidences were focused in May 2008. Most of the cities that had serious HFMD incidence are located in the south of Guangdong. All of these cities have strong economies. We also found that the HFMD incidence varied among different months and cities. Using the STSS model, we found that the most likely cluster was Shenzhen, and the second and third most likely clusters were Foshan and Dongguan, respectively. All of these three cities are the most economically developed areas of Guangdong.
Supporting Information
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
This study was supported by National Natural Science Foundation of China (No.41501426), Open Fund of State Key Laboratory of Remote Sensing Science (No. OFSLRSS201513) and MOST (2012CB955503; 201202006).
References
- 1. Openshaw S, Charlton M, Wymer C, Craft AW. A mark 1 analysis machine for the automated analysis of point data sets. International Journal of Geographical Information Systems. 1987;1:335–358. [Google Scholar]
- 2. Turnbull B, Iwano E, Burnett W, Howe H, Clark L. Monitoring for clusters of disease: application to leukemia incidence in upstate New York. American Journal of Epidemiology.1990;132:136–143. [DOI] [PubMed] [Google Scholar]
- 3. Kulldorff M. A spatial scan statistic. Communications in Statistics: Theory and Methods. 1997; 26:1481–1496. [Google Scholar]
- 4. Kulldorff M, Athas W, Feuer E, Miller B, Key C. Evaluating cluster alarms: A space-time scan statistic and brain cancer in Los Alamos, New Mexico. American Journal of Public Health. 1998; 88:1377–1380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Reis B, Mand K. Time series modeling for syndromic surveillance. BMC Med Inform Decis Mark. 2003; 3:2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Li M, Sun G, Zhang J, Jin Z, Sun X, Wang Y, et al. Transmission dynamics and control for a brucellosis model in Hingan League of Inner Mongolia, China. Math Biosci Eng. 2014;11(5):1115–1137. 10.3934/mbe.2014.11.1115 [DOI] [PubMed] [Google Scholar]
- 7. Li MT, Sun GQ, Wu YF, Zhang J, Jin Zhen. Transmission dynamics of a multi-group brucellosis model with mixed cross infection in public farm. Applied Mathematics and Computation. 2014;237:582–594. [Google Scholar]
- 8. Nie J, Sun GQ, Sun XD, Zhang J, Wang N, Wang YM, et al. Modeling the transmission dynamics of dairy cattle brucellosis in Jilin Province, China. Journal of Biological Systems. 2014;22:533. [Google Scholar]
- 9. Sun GQ, Zhang ZK. Global stability for a sheep brucellosis model with immigration. Applied Mathematics and Computation. 2014;246:336–345. [Google Scholar]
- 10. Zhang J, Sun GQ, Sun XD, Hou Q, Li M, Huang B, et al. Prediction and control of brucellosis transmission of dairy cattle in Zhejiang Province,China. PLos ONE. 2014;9(11):e108592 10.1371/journal.pone.0108592 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Zhang J, Jin Z, Sun GQ, Sun XD, Wang YM, Huang B. Determination of original infection source of H7N9 avian influenza by dynamical model. Scientific reports. 2014;4:4846 10.1038/srep04846 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Boscoe FP, Mclaughlin C, Schymurab MJ, and Kielb CL. Visualization of the spatial scan statistics using nested circles. Health & Place. 2003; 9:273–277. [DOI] [PubMed] [Google Scholar]
- 13. MacEachren AM, DiBiase DW. Animated maps of aggregated data: Conceptual and practical problems. Cartographic Perspectives. 1990; 4:32. [Google Scholar]
- 14. MacEachren A, Brewer C, Pickle L. Visualizing georeferenced data: representing reliability of health statistics. Environment and planning A. 1998; 9:1547–1561. [Google Scholar]
- 15. Zhou H, Yuan X, Qu H, Cui W, Chen B. Visual clustering in parallel coordinates. Computer Graphics Forum. 2008; 27(3):1047–1054. [Google Scholar]
- 16. Wang JF, Guo YS, Christakos G, Yang WZ, Liao YL, Li ZJ, et al. Hand, foot and mouth disease: spatiotemporal transmission and climate. International Journal of Health Geographics. 2011;10:25 10.1186/1476-072X-10-25 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Hardisty F, Robinson A. The GeoViz Toolkit: Using component-oriented coordination methods for geographic visualization and analysis. International Journal of Geographical Information Science. 2011; 25(2):191–210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Hagerstrand T. What about people in regional science? Papers, Regional Science Association. 1970; 24:7–21. [Google Scholar]
- 19.Kraak M. The space-time-cube revisited from a geovisuzlization perspective. In Proceeding of the 21st International Cartographic Conference (ICC). “Cartographic Renaissance”, Durban, South Africa, 2003; 1988–1996.
- 20. Kraak M, Koussoulakou A. A Visualization Environment for the Space-Time-Cube. Developments in Spatial Data Handing; 2005. pp.189–200. [Google Scholar]
- 21. Ho M, Chen ER, Hsu KH, Twu SJ, Chen KT, Tsai SF, et al. An epidemic of enterovirus 71 infection in Taiwan. New England Journal of Medicine. 1999; 341:929–935. [DOI] [PubMed] [Google Scholar]
- 22. Ang LW, Koh BK, Chan KP, Chua LT, James L, Goh KT. Epidemiology and control of hand, foot and mouth disease in Singapore,2001–2007. Annals Academy of Medicine Singapore. 2009; 38:106–112. [PubMed] [Google Scholar]
- 23. Liu MY, Liu W, Luo J, Liu Y, Zhu Y, Berman H, et al. Characterization of an outbreak of hand, foot, and mouth disease in Nanchang, China in 2010. PLoS ONE. 2011; 6:e25287 10.1371/journal.pone.0025287 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.WHO. hand, foot and mouth disease a rising menace in Asia. Available: http://www.wpro.who.int/mediacentre/news/2009/20090713/en/index.html. Accessed 29 March 2014.
- 25. Li XZ, Wang JF, Yang WZ, Li ZJ, Lai SJ. A spatial scan statistic for nonisotropic two-level risk cluster. Statistics in Medicine. 2011; 31(2):177–187. 10.1002/sim.4341 [DOI] [PubMed] [Google Scholar]
- 26. Hu M, Li Z, Wang J, Jia L, Liao Y, Lai S, et al. Determinants of the incidence of hand, foot and mouth disease in china using geographically weighted regression models. PLoS ONE. 2012; 6:e38978. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. National Bureau of Statistics of China. China City Statistical Yearbooks-2009. China Statistics; 2009. [Google Scholar]
- 28. Peuquet DJ. It’s about time: a conceptual framework for the representation of temporal dynamics in geographic information systems, Annals of the Association of American Geographers. 1994; 3:441–461. [Google Scholar]
- 29.Gatalsky P, Andrienko N, Andrienko G. Interactive analysis of event data using space-time cube. Proceeding of the Eighth International Conference on Information Visualization. 2004; 145–152.
- 30.OLAP Council: OLAP and OLAP server definitions (1995).
- 31.Shekhar S, Lu CT, Liu R, Zhou C. CubeView: a system for traffic data visualization. Intelligent transportation systems. In: Proceedings of the Fifth IEEE International Conference on Intelligent Transportation Systems. 2002; 674–679.
- 32. Kulldorff M, Heffernan R, Hartman J. Space-time permutation scan statistic for the early detection of disease outbreaks. PLoS Medicine. 2005; 216–224 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Jung I, Kulldorff M, and Richard OJ. A spatial scan statistic for multinomial data. Statistics in Medicine. 2010; 29:1910–1918. 10.1002/sim.3951 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Huang JX, Wang JF, Bo YC, Xu CD, Hu MG, Huang DC. Identification of health risks of Hand, food and mouth disease in China using the geographical detector technique. Int.J.Environ.Res.Public Health. 2014;11:3407–3423. 10.3390/ijerph110303407 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Wang JF, Xu CD, Tong SL, Chen HY, Yang WZ. Spatial dynamic patterns of hand-foot-mouth disease in the People’s Republic of China.Geospatial Health. 2013;7: 381–390. [DOI] [PubMed] [Google Scholar]
- 36. Cao ZD, Zeng DJ, Wang QY, Zheng XL, Wang FY. An epidemiological analysis of the Beijing 2008 hand-foot-mouth epidemic.Chin. Sci. Bulletin. 2010; 55:1142–1149. [Google Scholar]
- 37. Tseng FC, Huang HC, Chi CY, Lin TL, Liu CC, Jian JW, et al. Epidemiological survey of enterovirus infections occurring in Taiwan between 2000 and 2005. Analysis of sentinel physician surveillance data. J. Med. Virol. 2007;79:1850–1860. [DOI] [PubMed] [Google Scholar]
- 38. Hii YL, Rocklov J, Ng N. Short term effects of weather on hand, footand mouth disease. PLoS One.2011; 6: 10.1371/journal.pone.0016796 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.