Abstract
The differentiation of dengue virus (DENV) infection, a major cause of acute febrile illness in tropical regions, from other etiologies, may help prioritize laboratory testing and limit the inappropriate use of antibiotics. While traditional clinical prediction models focus on individual patient-level parameters, we hypothesize that for infectious diseases, population-level data sources may improve predictive ability. To create a clinical prediction model that integrates patient-extrinsic data for identifying DENV among febrile patients presenting to a hospital in Thailand, we fit random forest classifiers combining clinical data with climate and population-level epidemiologic data. In cross-validation, compared to a parsimonious model with the top clinical predictors, a model with the addition of climate data, reconstructed susceptibility estimates, force of infection estimates, and a recent case clustering metric significantly improved model performance.
Addition of population-level data to clinical data improved performance of a prediction model for dengue test positivity.
INTRODUCTION
Acute febrile illness (AFI) is a common reason for seeking health care in low- and middle-income countries (LMICs) (1). Determination of AFI etiology is often limited by diagnostic testing capacity, given the wide spectrum of potential infectious agents. Inappropriate use of testing and treatment resources may result in poor outcomes, such as the high case fatality rates seen in admitted AFI patients (5 to 20%) (2–7). Dengue virus (DENV) is a major cause of AFI in LMICs, accounting for an estimated 390 million infections, 96 million illnesses, 2 million severe cases, and 21,000 deaths per year (8). The differentiation between dengue and other common causes of febrile illness is important to avoid misdiagnosis, which can lead to delays in initiation of effective treatment and inappropriate use of antibiotics (9). Because of the lack of pathognomonic clinical features that reliably distinguish dengue from other febrile illnesses, virological or serological laboratory confirmation is required for definitive diagnosis. While multiplexed tests that can quickly identify the causative pathogen are ideal, they are often unavailable in LMICs due to cost and insufficient laboratory infrastructure. Even rapid, point-of-care tests may be cost-prohibitive in LMICs (10). Accurate and cost-effective tools to better determine etiology of fever at the point of care are greatly needed to guide the use of diagnostics and therapeutics, conserving scarce health care resources.
Clinical decision support systems (CDSS) incorporating prediction models may offer a solution to better management of infectious diseases in low resource settings. CDSSs, such as applications on smartphone devices, can gather data from a range of online sources and implement sophisticated clinical prediction models that would be impractical for clinicians to calculate manually. CDSS have proven effective at improving therapeutic management and reducing unnecessary diagnostic tests in both high-income countries (11) and LMICs (12–14). In Bangladesh, an electronic CDSS was shown to improve clinical dehydration assessment and World Health Organization (WHO) diarrhea guideline adherence, as well as reduce nonindicated antibiotic use in children under five by 29% (12). Traditional predictive models generally incorporate clinical information that is obtained solely from the presenting patient. Predictive models that incorporate additional information—such as seasonal or climate predictors, location-specific historical prevalence, and characteristics of prior patients—have been shown to increase diagnostic accuracy and limit inappropriate antibiotic use (14–16).
The underlying probability of being infected by DENV varies by both space and time. The risk of DENV transmission depends on conditions that promote mosquito breeding, including when temperatures are warmer (17–19), and the risk of infection is influenced by local population immunity, as large outbreak years are typically followed by periods of low transmission (20–22). As most DENV transmission is highly focal, it means that population susceptibility profiles can be spatially heterogeneous at any time (21, 23–25). Thus, our objective is to develop an improved clinical prediction model for dengue by integrating temporal and spatial (location-specific) parameters including climate data, clustering of recent cases, and population susceptibility estimates derived from seroprevalence or hospital data in the surrounding community. We demonstrate the potential for integrating location- and population-specific data sources into clinical prediction models. This approach has the potential to inform the development of improved tools to aid clinicians in diagnostic and therapeutic decision making for patients presenting with suspected dengue.
RESULTS
Of the 12,833 participants in the clinical dataset, 5731 (45%) were confirmed to have DENV infection by polymerase chain reaction (PCR). DENV-positive patients were significantly younger (18 versus 22 years, P < 0.001; Table 1). Nearly all cases (97.8%) came from the 11 districts within Kamphaeng Phet province (Table 1). There was no significant difference between the probability of testing positive for males and females (P = 0.07); no other genders were reported. The probability of testing positive differed substantially by age, ranging from 26% for those <4 years to 58% for those 15 to 19 years of age (Table 2). Patients between the ages of 10 and 14 years, 15 and 19 years, and 5 and 9 years comprised the largest proportion of cases (23, 18, and 16%, respectively), while older patients comprised a much smaller proportion of cases (30 to 34 years, 5%; 35 to 39 years, 4%).
Table 1. Age, gender, and top discriminative symptoms by DENV positivity.
Overall (N = 12,833)* | DENV negative (N = 7,102)* | DENV positive (N = 5,731)* | P value† | |
---|---|---|---|---|
Age (mean, SD) | 21 (15) | 22 (18) | 18 (11) | <0.001 |
Female | 6,401 (50) | 3,491 (49) | 2,910 (51) | 0.068 |
Symptoms | ||||
Cough | 4,741 (37) | 3,057 (43) | 1,684 (29) | <0.001 |
Nausea | 6,227 (49) | 3,051 (43) | 3,176 (55) | <0.001 |
Fever | 11,467 (89) | 6,129 (86) | 5,338 (93) | <0.001 |
Headache | 9,146 (71) | 4,797 (68) | 4,349 (76) | <0.001 |
Rhinitis | 2,165 (17) | 1,455 (20) | 710 (12) | <0.001 |
Pharyngitis | 3,534 (28) | 2,113 (30) | 1,421 (25) | <0.001 |
Location | ||||
District | <0.001 | |||
Bueng Samakkhi | 226 (1.8) | 166 (2.3) | 60 (1.0) | |
Khanu Woralaksaburi | 910 (7.1) | 522 (7.4) | 388 (6.8) | |
Khlong Khlung | 733 (5.7) | 397 (5.6) | 336 (5.9) | |
Khlong Lan | 945 (7.4) | 645 (9.1) | 300 (5.2) | |
Kosamphi Nakhon | 750 (5.8) | 407 (5.7) | 343 (6.0) | |
Lan Krabue | 556 (4.3) | 333 (4.7) | 223 (3.9) | |
Mueang Kamphaeng Phet | 5,780 (45) | 2,910 (41) | 2,870 (50) | |
Pang Sila Thong | 571 (4.4) | 324 (4.6) | 247 (4.3) | |
Phran Kratai | 1,186 (9.2) | 684 (9.6) | 502 (8.8) | |
Sai Ngam | 609 (4.7) | 363 (5.1) | 246 (4.3) | |
Sai Thong Watthana | 288 (2.2) | 178 (2.5) | 110 (1.9) | |
Province | ||||
Kamphaeng Phet | 12,554 (97.8) | 6,929 (97.5) | 5,625 (98.2) |
*Mean (SD); n (%).
†Wilcoxon rank sum test; Pearson’s chi-square test.
Table 2. The AUCs and CIs by base model, compared to base model plus inclusion of additional data sources.
Model | AUC (%) | 95% CI |
---|---|---|
Clinical*Climate*RS*FoI*Cluster | 70.0 | 67.9–71.9 |
Clinical*Climate*RS*Cluster | 69.5 | 67.5–71.5 |
Clinical*Climate*FoI*Cluster | 69.2 | 67.2–71.2 |
Clinical*Climate*Cluster | 68.8 | 66.8–70.8 |
Clinical*Climate*RS*FoI | 68.7 | 66.7–70.7 |
Clinical*Cluster | 68.7 | 66.7–70.7 |
Clinical*FoI*Cluster | 68.5 | 66.5–70.6 |
Clinical*Climate*RS | 68.4 | 66.4–70.5 |
Clinical*RS*FoI*Cluster | 68.4 | 66.4–70.4 |
Clinical*RS*Cluster | 68.2 | 66.1–70.2 |
Clinical*Climate*FoI | 68.1 | 66.1–70.1 |
Clinical*FoI | 67.7 | 65.7–69.8 |
Clinical*RS*FoI | 67.6 | 65.5–69.6 |
Climate*RS*FoI*Cluster | 67.5 | 65.5–69.6 |
Clinical*RS | 67.5 | 65.4–69.5 |
Clinical*Climate | 67.2 | 65.2–69.3 |
Clinical | 67.0 | 65–69.1 |
Climate*RS*Cluster | 66.8 | 64.8–68.9 |
Climate*RS | 65.7 | 63.6–67.8 |
RS*Cluster | 65.7 | 63.6–67.7 |
RS | 65.6 | 63.5–67.7 |
Climate*FoI*Cluster | 64.7 | 62.6–66.8 |
Climate*Cluster | 60.5 | 58.3–62.7 |
Climate | 58.7 | 56.5–60.9 |
Cluster | 56.4 | 54.2–58.6 |
FoI | 57.0 | 54.8–59.2 |
We found that there were significant differences in many of the clinical symptoms reported by DENV-positive and DENV-negative patients. Table 1 lists the top discriminative symptoms between the groups based on random forest and logistic regression. The most common symptom reported was fever, followed by headache. In univariate analysis, we found that individuals with fever, chills, malaise, retro-orbital pain, nausea, headache, and vomiting were significantly more likely to test positive for DENV, and individuals with cough, rhinitis, and pharyngitis were significantly less likely to test positive for DENV (table S2).
When we examined the proportion of positive cases to total cases by year and month, we found that both total and positive cases significantly increased in the months between June and September (P < 0.001, χ2 test). The proportion of positive cases differed substantially by year (P < 0.001, χ2 test), ranging from 19% in 2016 to 90% in 2017. The period of lowest test positivity in 2016 and 2017 coincided with the Zika virus epidemic in the country (Fig. 1).
Model performance evaluation using only clinical predictors and parsimonious variable selection
We first assessed the performance of the model using a traditional clinical prediction model, which only includes the presenting patient’s information. A random forest classifier using all 23 clinical features resulted in an average area under the receiver operator characteristic curve (AUC) of 69.5% [95% confidence interval (CI): 67.5 to 71.5] from repeated cross-validation. To determine the optimal number of variables for a parsimonious prediction model, we used a random forest classifier to analyze the improvement in model performance with each additional clinical variable included. Figure 2 shows the improvement in AUC with each additional variable using two random forest classifiers—one with all other predictors and the other using only clinical data—as well as a logistic regression model using only clinical variables. Performance leveled off with three clinical variables: age, cough, and nausea. Using a model with only these three predictors, we achieve an average AUC of 67.0% (95% CI: 65.0 to 69.1). Table S3 shows the relative frequency of these variables by age group. We demonstrate the direction and magnitude of the effect of the top predictors by generating partial dependence plots from random forest and logistic regression classifiers (fig. S1).
Addition of climate data to the clinical parameter model resulted in an improved AUC
Next, we fit models using climate data. To appropriately adjust lag time for each climate variable, we fit a random forest classifier using only climate variables and assessed the variables of importance by AUC. A random forest model with recent and lagged aggregated climate data without clinical predictors resulted in an AUC of 58.7% (95% CI: 56.5 to 60.9). We found that the best performing climate variables were visibility, relative humidity, wind speed, and precipitation, all lagged by 3 months. We examined the relationship between the top two performing climate predictors—visibility and relative humidity—with the proportion of positive cases each month (Fig. 3). For each climate predictor, table S4 lists the odds ratio and compares the mean of each predictor by DENV-positive or DENV-negative groups. When combined with the top three clinical variables, climate data performed similarly (AUC of 67.2%, 95% CI: 65.2 to 69.3) as clinical data alone (AUC of 67.0%, 95% CI: 65.0 to 69.1) (median P = 0.60, 2% P < 0.05). However, when climate data were combined with all other predictors, model performance improved from an AUC of 68.4% (95% CI: 66.4 to 70.4) to an AUC of 70.0% (95% CI: 67.9 to 71.0; median P = 0.07, 45% P < 0.05). To assess whether integrating more location-specific climate data would improve performance, we fit models using climate data from each case’s home district; however, model performance did not noticeably change. Table 2 shows the AUCs for the clinical base model, compared to the base model plus the inclusion of additional data sources.
Addition of RS estimates to the clinical parameter model resulted in an improved AUC
Using historical hospital case data from the province, we obtained estimates of the size of the susceptible population by age for each year (across all subdistricts in the province). In our predictive model, we used the prior year’s reconstructed susceptibility (RS) estimates. Using logistic regression, we found that secondary RS estimates performed better than primary RS estimates [60.7% (95% CI: 58.6 to 62.9) versus 52.3% (95% CI: 50.1 to 54.6)]. When added to a random forest classifier with climate and/or clinical predictors, the inclusion of RS estimates consistently resulted in higher AUCs (Table 2). When added to the top three clinical parameters alone, RS estimates nonsignificantly improved model performance from an AUC of 67.0% (95% CI: 65.0 to 68.8) to an AUC of 67.5% (95% CI: 65.4 to 69.5) (median P = 0.40, 9% P < 0.05). Last, a model including all predictors resulted in higher AUCs than a model without RS (median P = 0.09, 32% P < 0.05).
Addition of subdistrict-specific FoI estimates to the clinical parameter model resulted in an improved AUC
We incorporated force of infection (FoI) estimates for each age by subdistrict using data from a local cohort study. This assumes that the underlying differences in the FoI are constant in time. Using logistic regression, FoI estimates had an AUC of 57.0% (95% CI: 54.8 to 59.2). The inclusion of FoI estimates leads to increases in AUC when added to the top clinical predictors, when added to clinical predictors and climate data, and when added to clinical predictors, climate predictors, and RS estimates (Table 2). When included with all other predictors, a model with FoI estimates nonsignificantly improved performance compared to a model without FoI estimates (median P = 0.30, 23% P < 0.05).
Addition of the case clustering metric to the clinical parameter model resulted in an improved AUC
Last, we fit a model that assessed for clustering of recent cases based on prior patients presenting to the Kamphaeng Phet Provincial Hospital (KPPH). Using logistic regression, we found that the case clustering metric (the number of positive cases in the subdistrict over the past 30 days divided by the total number of cases from that subdistrict in the study period) had an AUC of 56.4% (95% CI: 54.2 to 58.6). We found that the use of the case clustering metric consistently improved model performance. Stratifying by the finer spatial size of subdistrict consistently outperformed models with prior patients stratified by province. When added to the top performing clinical variables, model performance significantly improved (median P = 0.02, 60% of P < 0.05). When compared to a model with all predictors except cluster of recent cases, the inclusion of this predictor significantly improved model performance (median P = 0.007, 79% P < 0.05).
Last, when comparing a model including all predictors with a model including only the top clinical predictors, model performance improved from an AUC of 67.0% (95% CI: 65.0 to 69.1) to an AUC of 70.0% (95% CI: 67.9 to 71.9) (median P = 0.006, 87% P < 0.05). Our model had a sensitivity of 55.3%, a specificity of 70.2%, a positive predictive value (PPV) of 60.0%, and a negative predictive value (NPV) of 66.1%.
DISCUSSION
Insufficient diagnostic testing capacity in LMICs necessitates innovative approaches to support clinical decision-making. Here, we present a predictive model for DENV infection that integrates multiple sources of information both intrinsic and extrinsic to the patient, including climate data, clinical data, seroprevalence-based susceptibility estimates, and historical information from prior patients, which results in improved predictive performance. While the model with all predictors included did significantly outperform the base parsimonious model with only clinical predictors (median P = 0.006, 87% P < 0.05), whether the additional 3.0% improvement in AUC is clinically useful may be case and clinician dependent. Certain components of our model require data from sero-surveillance, which may not be accessible in all communities. However, simplifying the model by including only the top clinical predictors and the case cluster metric alone results in an AUC decrease of only 1.3%. These metrics are more readily obtainable and, notably, do not necessitate laboratory resources. Nevertheless, we believe that the results demonstrate a proof of concept that seroprevalence-based susceptibility estimates and climate data can be used to improve predictive performance and may be useful to augment prediction in other communicable diseases.
There is a lack of information on the deficiency of testing capacity both in Thailand and globally in LMICs. Accurately quantifying the true extent of diagnostic testing deficiencies is challenging as LMICs often lack robust national surveillance systems. In a Brazilian study between the years 2010 and 2019, where every suspected case of dengue was recorded in a national surveillance database, only 11% of the 350,000 cases of suspected dengue infection were ultimately tested (26). If we extrapolate the results from Brazil to other LMICs, as much as 90% of dengue-like illness may go undiagnosed, highlighting the need for tools to bridge the diagnostic gap.
In contrast to most dengue diagnostic models, which rely in part on laboratory data, our model relies solely on clinical indicators, making it accessible to clinicians without laboratory resources. For reference, when compared to a multiple regression model from Honduras that used only clinical predictors, our model had a lower sensitivity (55% versus 86%) and PPV (60% versus 75%) and a higher specificity (70% versus 27%) and NPV (66% versus 44%) (27). Models that integrate laboratory values, such as complete blood count and hepatic function tests, tend to perform better than models using only clinical predictors, such as a Bayesian network model from Thailand (sensitivity, 74%; specificity, 79%; PPV, 75%; NPV 79%) (28), a multiple regression model from Sri Lanka (sensitivity, 49%; specificity, 85%; PPV, 70%; NPV, 70%) (29), and a multiple regression model from Brazil (sensitivity, 80%; specificity, 71%) (30).
DENV transmission can exhibit temporal and geographical heterogeneity even at fine spatial scales, with variations observed even among neighboring villages (31–33). We thus used patient-extrinsic (location-specific) data sources in our models. The improvement in AUC with finer spatial units suggests that population-level spatial heterogeneity exists at the district level and can be applied to individual-level clinical prediction. We expect further improvements in predictive performance if finer-scale location became routinely available for case data, such as to the community level. The improvement with the use of either the province or district level case clustering metric highlights the utility of temporal predictors in clinical prediction DENV models.
Spatial heterogeneity in dengue incidence may be explained in part by micro-climates, which can modify transmission dynamics at small scales. For example, within urban heat islands, temperature variations of up 10°C compared to other city areas may create conditions more conducive to dengue transmission in cooler temperatures (34). We collected all climate data from the provincial weather station in Kamphaeng Phet. We attempted to integrate climate data at a more localized level; however, several subdistricts do not have weather stations or weather station data were incomplete. When fitting models using data from all districts in Kamphaeng Phet, however, we found similar results.
Transmission of DENV occurs in a seasonal pattern, and several climate variables have been found to increase DENV transmission and/or vector populations (17–19, 35, 36). While prior studies have demonstrated associations between climate variables like average precipitation, relative humidity, temperature, and wind speed, with varying lag times between 0 and 3 months, and dengue incidence (37–40), our predictive-based analytic framework is not intended to examine causal or associative relationships between climate variables and the outcome of dengue incidence. Our findings suggest that site-specific climate variables aid in site-specific models to predict DENV infection. While visibility has not been found to be associated with dengue incidence, we found that it was the most important climate predictor. It is plausible that visibility serves as a proxy indicator for an underlying factor that affects dengue incidence, such as air pollution, which has been postulated as a contributing factor (41, 42). Appropriate lag times would need to be tuned to different sites. For use in a clinical decision support tool, the most recent climate variables could be gathered from online weather sources based on smartphone-based detection of GPS location. An optimal utilization of this model would be through a smartphone application, as there is a scarcity of electronic medical record availability in LMICs. This would necessitate access to a smart phone device and internet connection; however, clinicians and frontline health care workers increasingly have access to smartphone devices, even in remote areas of LMICs (43).
There were significant differences between DENV-positive and DENV-negative patients in 16 of the 22 clinical symptoms collected on presentation, consistent with features known to distinguish dengue from other illnesses (44, 45). To minimize clinician input requirements (46), we used random forest regression to identify the optimal variables to derive a parsimonious model. We were able to achieve near-optimal performance with only three clinical variables—age, nausea, and cough. Numerous multivariable models based on clinical presentation have been developed to identify dengue infection in patients with AFI. In a review of published logistic regression prediction models, rash and/or petechiae was the most frequently identified predictor (four of seven models) to discriminate between DENV-positive and DENV-negative patients. When evaluated, the absence of cough was found to be a predictor in 33% of models. Nausea, which was evaluated in four logistic regression models, did not achieve significance in any model. Our results differ from those found in many logistic regression models and align with more intricate models for DENV diagnosis. Models using deep neural networks (47), random forest (48), and gradient boosting (XGBoost) (49) noted that age was the best clinical discriminative predictor. These models did not include cough or nausea as variables for assessment. We found that with the input of as little as one clinical variable—age—along with other predictors can provide useful clinical information (AUC: 67.9%, 95% CI: 65.6 to 70.0), especially in cases where other symptoms cannot be easily obtained, such as in nonverbal or comatose patients.
We show that RS estimates, which reflect the transmission dynamics of disease and the susceptible proportion of a population, improve individual-level clinical prediction on their own. However, there are several factors that make the use of RS estimates problematic, and we favor the use of other location-specific predictors. First, RS estimates may be more difficult to obtain across different settings. Moreover, RS estimates may not serve as a reliable indicator of protection against DENV, as they represent a mixed concept—immunity may reflect protection due to herd immunity or may indicate increased risk of dengue infection, as higher levels of immunity may reflect higher viral circulation of the multiple DENV serotypes with substantial immunologic cross-reactivity. Last, RS estimates are themselves derived from a model and so should be considered with caution.
Our study has several limitations. First, our model was constructed using data from a single center and testing was limited to patients suspected of having dengue infection, potentially hindering the model’s generalizability to a broader population. Similarly, as there was inherent heuristic bias in the patients selected for testing, the clinical components of the model reflect this specific population, meaning that other important predictors of dengue infection, such as fever, were already included in the clinician’s decision-making. Our results were limited to internal cross-validation; further studies for external validation are necessary. Last, our assessment of the use of spatial dynamics in DENV transmission was limited as cases were only matched to each district rather than subdistrict or village. In the future, models that integrate cases based on a finer spatial scale may better assess the role of a patient’s residing location in prediction. Despite these limitations, we demonstrate that predictive models that include patient-extrinsic location-specific elements can improve prediction and allow for parsimonious models that minimize clinician input and should be considered in future work on clinical prediction and decision support tools.
METHODS
Location
Kamphaeng Phet is a province in north-central Thailand, which is located 350 km north of Bangkok and has a population of 725,000 people in a mostly rural and semirural setting (33, 50). We used data collected from patients presenting to KPPH, a large, tertiary care hospital in the province to identify clinical predictors that could discriminate between DENV-infected and uninfected patients (33, 50).
Hospital-based suspected dengue patient data
We used data on over 12,000 patients presenting to KPPH with suspected dengue between August 2007 and December 2021. The data were collected by the U.S. Army Medical Directorate–Armed Forces Research Institute of Medical Sciences. As DENV testing in this hospital is provided free of charge and this is a highly DENV-endemic region, individuals will be tested for DENV infection if there is any suspicion of dengue, however minor. This provides an excellent test case to understand whether individual or location-specific risk factors are associated with testing positive for DENV.
For all suspected dengue cases, we used demographic and clinical information including patient age, sex, home village, admission diagnosis, date of admission, presenting symptoms, and DENV PCR status. The following signs and symptom were recorded as binary variables: fever, chills, malaise, rhinitis, rash, sore throat, seizure, cough, nuchal rigidity, eye pain, nausea, headaches, vomiting, joint pain, abnormal movements, anorexia, myalgias, diarrhea, dark urine, abdominal pain, and bleeding. DENV infection was evaluated using reverse transcription PCR. We recorded the residence of each patient to the district (Amphoe) level using detailed base maps of the region.
Climate variables using NOAA data
Climate and seasonal factors such as temperature, precipitation, and humidity influence vector populations and DENV transmission (17–19, 35). We used the R package GSODR to gather climate data from the central most National Oceanic and Atmospheric Administration (NOAA) weather station in the province of Kamphaeng Phet, Thailand, which included mean daily temperature, precipitation, dew point, relative humidity, sea level pressure, visibility, and wind speed. To better reflect seasonal trends, we aggregated data in 14-day increments before the day of the DENV infection prediction. As climate can alter vector feeding behavior (19, 36), we used aggregated climate predictors in the 2 weeks before case presentation. In addition, climate in the months before outbreaks can influence both vector population dynamics and viral replication (19, 35). To determine the appropriate lag time for each climate variable, we constructed a random forest classifier with climate variables lagged at 1, 2, and 3 months. Using the R package “vip,” we calculated each variable of importance by AUC and used the best performing lag time for each climate variable.
Estimates of temporal changes in population susceptibility using national surveillance system data
We estimate population susceptibility data using age-specific case data from the national surveillance system using data from Kamphaeng Phet province only. We note that most of the cases in this dataset are suspected DENV cases (i.e., without confirmatory testing). We have previously developed models to explicitly link underlying infection risks to the observed age distribution of cases by age and year to estimate annual age-specific FoI in provinces of Thailand up until 2017 (51). The estimates can be used to reconstruct the buildup of immunity in populations by age. Here, we reconstruct population susceptibilities in Kamphaeng Phet going into each year, using only data before the year, to mimic the real-world use, where only prior years’ data are available. As dengue disease severity is greatest for secondary infections, we consider two alternative formulations to define susceptibility to disease. First, we consider complete susceptibility, where we use the estimates of the proportion of individuals of an age group and year that are completely seronaive. Second, we consider the proportion of individuals of an age group and year that have experience one prior infection and are therefore at risk of increased risk of severe disease.
Estimates of spatial differences in the underlying FoI using seroprevalence data from a cohort study
To estimate underlying spatial differences in the FoI in the province, we make use of a DENV cohort study in the region, where healthy individuals of all ages from throughout Kamphaeng Phet province have provided blood (52). The cohort is ongoing. We use data from samples collected during baseline blood draws that occurred between 2015 and 2021. Hemagglutination inhibition assays were used to characterize immunity to the four DENV serotypes; individuals were considered seropositive if they had a titer of 10 or greater to any serotype. We have previously used these seroprevalence data to estimate the underlying mean FoI and the proportion of the population that are susceptible to DENV infection in different subdistricts in the province (53). Here, we use these subdistrict-specific estimates to characterize underlying heterogeneity in the FoI in the province. As the cohort data come from 2015 to 2021, however, much of the hospital case data we are working with come from before the cohort, we are assuming that the FoI is stable in time within any location.
Spatial clustering of positive cases based on prior patients presenting to the hospital
The local clustering of positive cases from a single area may signal local ongoing transmission. To assess for a temporal and spatial relationship between cases, we stratified cases that presented to KPPH by both district and province and then summed the number of positive cases in the 30 days before presentation divided by the total cases over the study period from that area.
Statistical analysis and modeling
We fit random forest classifiers to predict DENV infection. Random forests are a machine learning method that constructs a multitude of decision trees and averages over them to obtain a prediction robust to nonlinearities and interactions between covariates and has been widely applied to biomedical sciences for both classification and regression (54, 55).
We initially identified the subset of clinical symptoms that were most informative of true infection status. To do this, we fit random forest models using only clinical predictors and then used the R package vip to calculate the variable of importance by AUC for each clinical variable. We determined a variable’s importance by calculating the change in AUC after permuting, or randomly shuffling each predictor. To attempt to achieve the most parsimonious prediction rule (i.e., the best predictive model requiring the fewest variables to be input by clinicians), we fit random forest and logistic regression models using training data with consecutively increasing clinical predictor set sizes based on the order of importance and applied this to the test set to determine the smallest model with the best performance. Next, we incorporated the patient-extrinsic factors. We fit each random forest classifier using 1000 decision trees and used the default number of variables to be randomly considered at each node split (mtry = square root of number of candidate variables). In the construction of our predictive models, we input climate predictors, age, susceptibility estimates, and the case clustering metric as continuous variables, and we input the optimized clinical predictors as binary presence or absence categorical variables. Missing predictor data were imputed using the R package “RandomForest.”
We used logistic regression for each predictor to create a univariate comparison between DENV-positive and DENV-negative cases. We fit multiple logistic regression models to compare the performance of parsimonious models with a random forest classifier using the same number of predictors.
To assess predictive performance for both random forest and logistic regression models, we used repeated cross-validation using 80% training/20% testing splits with 100 iterations. No testing data were used when training the model. In each iteration, predictions on the test set were produced and corresponding measures of performance were obtained. To determine overall model performance, we averaged the AUC and CIs for the 100 iterations. To determine statistical significance between models, we used a bootstrap method over 100 iterations, which involves resampling the data with replacement multiple times, creating bootstrap samples. For each bootstrap sample, receiver operating characteristic (ROC) curves were generated and the differences between the curves were computed. All analyses were completed using R version 4.2.0, and model development/validation was completed in accordance with the TRIPOD checklist (table S1).
Ethical considerations
This study was approved by the institutional review boards of the Thai Ministry of Public Health and Walter Reed Army Institute of Research (no. 2119) and the University of Utah (IRB_00150106).
Acknowledgments
Material has been reviewed by the Walter Reed Army Institute of Research. There is no objection to its presentation and/or publication. The opinions or assertions contained herein are the private views of the authors and are not to be construed as official or as reflecting true views of the Department of the Army or the Department of Defense. The investigators have adhered to the policies for protection of human subjects as prescribed in AR 70-25.
Funding: Research reported in this publication was supported by the U.S. National Institutes of Health under award numbers R01AI135114 (to D.T.L.), K24AI166087 (to D.T.L.), and P01AI034533 (to A.L.R. and K.B.A.); the Military Infectious Disease Research Program (MIDRP); and the European Research Council (no. 804744, to H.S.). R.J.W. is funded by the National Institute of Heart, Lung, and Blood by the Utah Stimulating Access to Research in Residency Transition Scholar (StARRTS) under award R38HL143605. D.A.T.C. reports contracts from Merck and from Pfizer (to the University of Florida) for research unrelated to this article.
Author contributions: Conceptualization: D.A.T.C., D.B., S.T., R.J.W., H.S., A.F., K.B.A., and D.T.L. Data curation: G.R.D.S., A.T.H., R.J.W., and S.I. Formal analysis: B.J.B. and R.J.W. Funding acquisition: S.F., S.T., A.L.R., H.S., A.F., K.B.A., and D.T.L. Methodology: S.F., S.T., A.L.R., H.S., A.F., K.B.A., and D.T.L. Project administration: S.F., D.B., S.T., S.K., H.S., and D.T.L. Software: R.J.W. and B.J.B. Supervision: D.A.T.C., D.B., S.T., S.K., A.F., and D.T.L. Visualization: G.R.D.S., B.J.B., A.T.H., R.J.W., and S.K. Writing—original draft: D.B., A.T.H., R.J.W., H.S., and D.T.L. Writing—reviewing and editing: S.F., D.A.T.C., B.J.B., D.B., S.T., A.T.H., R.J.W., A.L.R., A.F., K.B.A., and D.T.L.
Competing interests: The authors declare that they have no competing interests.
Data and materials availability: All data needed to evaluate the conclusions in the paper are present in the paper and/or the Supplementary Materials. The data and code for this study are available at https://doi.org/10.5281/zenodo.8407814. Because of privacy concerns, in our publicly available data, we have converted exact dates to month/year and we grouped ages into age categories.
Supplementary Materials
This PDF file includes:
REFERENCES AND NOTES
- 1.Osborn J., Roberts T., Guillen E., Bernal O., Roddy P., Ongarello S., Sprecher A., Page A.-L., Ribeiro I., Piriou E., Tamrat A., de la Tour R., Rao V. B., Flevaud L., Jensen T., McIver L., Kelly C., Dittrich S., Prioritising pathogens for the management of severe febrile patients to improve clinical care in low- and middle-income countries. BMC Infect. Dis. 20, 117 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Prasad N., Murdoch D. R., Reyburn H., Crump J. A., Etiology of severe febrile illness in low- and middle-income countries: A systematic review. PLOS ONE 10, –e0127962 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Feikin D. R., Olack B., Bigogo G. M., Audi A., Cosmas L., Aura B., Burke H., Njenga M. K., Williamson J., Breiman R. F., The burden of common infectious disease syndromes at the clinic and household level from population-based surveillance in rural and urban Kenya. PLOS ONE 6, e16085 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Archibald L. K., den Dulk M. O., Pallangyo K. J., Reller L. B., Fatal Mycobacterium tuberculosis bloodstream infections in febrile hospitalized adults in Dar es Salaam, Tanzania. Clin. Infect. Dis. 26, 290–296 (1998). [DOI] [PubMed] [Google Scholar]
- 5.Chheng K., Carter M. J., Emary K., Chanpheaktra N., Moore C. E., Stoesser N., Putchhat H., Sona S., Reaksmey S., Kitsutani P., Sar B., van Doorn H. R., Uyen N. H., Van Tan L., Paris D., Blacksell S. D., Amornchai P., Wuthiekanun V., Parry C. M., Day N. P. J., Kumar V., A prospective study of the causes of febrile illness requiring hospitalization in children in Cambodia. PLOS ONE 8, e60634 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Crump J. A., Morrissey A. B., Nicholson W. L., Massung R. F., Stoddard R. A., Galloway R. L., Ooi E. E., Maro V. P., Saganda W., Kinabo G. D., Muiruri C., Bartlett J. A., Etiology of severe non-malaria febrile illness in Northern Tanzania: A prospective cohort study. PLOS Negl. Trop. Dis. 7, e2324 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Ssali F. N., Kamya M. R., Wabwire-Mangen F., Kasasa S., Joloba M., Williams D., Mugerwa R. D., Ellner J. J., Johnson J. L., A prospective study of community-acquired bloodstream infections among febrile adults admitted to Mulago Hospital in Kampala, Uganda. J. Acquir. Immune Defic. Syndr. Hum. Retrovirol. 19, 484–489 (1998). [DOI] [PubMed] [Google Scholar]
- 8.Bhatt S., Gething P. W., Brady O. J., Messina J. P., Farlow A. W., Moyes C. L., Drake J. M., Brownstein J. S., Hoen A. G., Sankoh O., Myers M. F., George D. B., Jaenisch T., Wint G. R. W., Simmons C. P., Scott T. W., Farrar J. J., Hay S. I., The global distribution and burden of dengue. Nature 496, 504–507 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Crump J. A., Gove S., Parry C. M., Management of adolescents and adults with febrile illness in resource limited areas. BMJ 343, d4847 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Yager P., Domingo G. J., Gerdes J., Point-of-care diagnostics for global health. Annu. Rev. Biomed. Eng. 10, 107–144 (2008). [DOI] [PubMed] [Google Scholar]
- 11.Bright T. J., Wong A., Dhurjati R., Bristow E., Bastian L., Coeytaux R. R., Samsa G., Hasselblad V., Williams J. W., Musty M. D., Wing L., Kendrick A. S., Sanders G. D., Lobach D., Effect of clinical decision-support systems. Ann. Intern. Med. 157, 29–43 (2012). [DOI] [PubMed] [Google Scholar]
- 12.Bilal S., Nelson E., Meisner L., Alam M., Al Amin S., Ashenafi Y., Teegala S., Khan A. F., Alam N., Levine A., Evaluation of standard and mobile health-supported clinical diagnostic tools for assessing dehydration in patients with diarrhea in rural bangladesh. Am. J. Trop. Med. Hyg. 99, 171–179 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Tuon F. F., Gasparetto J., Wollmann L. C., Moraes T. P., Mobile health application to assist doctors in antibiotic prescription - An approach for antibiotic stewardship. Braz. J. Infect. Dis. 21, 660–664 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Garbern S. C., Nelson E. J., Nasrin S., Keita A. M., Brintz B. J., Gainey M., Badji H., Nasrin D., Howard J., Taniuchi M., Platts-Mills J. A., Kotloff K. L., Haque R., Levine A. C., Sow S. O., Alam N. H., Leung D. T., External validation of a mobile clinical decision support system for diarrhea etiology prediction in children: A multicenter study in Bangladesh and Mali. eLife 11, e72294 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Fine A. M., Brownstein J. S., Nigrovic L. E., Kimia A. A., Olson K. L., Thompson A. D., Mandl K. D., Integrating spatial epidemiology into a decision model for evaluation of facial palsy in children. Arch. Pediatr. Adolesc. Med. 165, 61–67 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Nelson E. J., Khan A. I., Keita A. M., Brintz B. J., Keita Y., Sanogo D., Islam M. T., Khan Z. H., Rashid M. M., Nasrin D., Watt M. H., Ahmed S. M., Haaland B., Pavia A. T., Levine A. C., Chao D. L., Kotloff K. L., Qadri F., Sow S. O., Leung D. T., Improving antibiotic stewardship for diarrheal disease with probability-based electronic clinical decision support: A randomized crossover trial. JAMA Pediatr. 176, 973–979 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Chan M., Johansson M. A., The incubation periods of dengue viruses. PLOS ONE 7, e50972 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Watts D. M., Burke D. S., Harrison B. A., Whitmire R. E., Nisalak A., Effect of temperature on the vector efficiency of Aedes aegypti for dengue 2 virus. Am. J. Trop. Med. Hyg. 36, 143–152 (1987). [DOI] [PubMed] [Google Scholar]
- 19.Barrera R., Amador M., MacKay A. J., Population dynamics of Aedes aegypti and dengue as influenced by weather and human behavior in San Juan, Puerto Rico. PLOS Negl. Trop. Dis. 5, e1378 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Ribeiro G. S., Hamer G. L., Diallo M., Kitron U., Ko A. I., Weaver S. C., Influence of herd immunity in the cyclical nature of arboviruses. Curr. Opin. Virol. 40, 1–10 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Romeo-Aznar V., Picinini Freitas L., Gonçalves Cruz O., King A. A., Pascual M., Fine-scale heterogeneity in population density predicts wave dynamics in dengue epidemics. Nat. Commun. 13, 996 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Lourenço J., Recker M., Natural, persistent oscillations in a spatial multi-strain disease system with application to dengue. PLOS Comput. Biol. 9, e1003308 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Lai W. T., Chen C. H., Hung H., Chen R. B., Shete S., Wu C. C., Recognizing spatial and temporal clustering patterns of dengue outbreaks in Taiwan. BMC Infect. Dis. 18, 256 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Estupiñán Cárdenas M. I., Herrera V. M., Miranda Montoya M. C., Lozano Parra A., Zaraza Moncayo Z. M., Flórez García J. P., Rodríguez Barraquer I., Villar Centeno L., Heterogeneity of dengue transmission in an endemic area of Colombia. PLOS Negl. Trop. Dis. 14, e0008122 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Thai K. T., Nagelkerke N., Phuong H. L., Nga T. T., Giao P. T., Hung L. Q., Binh T. Q., Nam N. V., De Vries P. J., Geographical heterogeneity of dengue transmission in two villages in southern Vietnam. Epidemiol. Infect. 138, 585–591 (2010). [DOI] [PubMed] [Google Scholar]
- 26.Ribeiro Dos Santos G., Durovni B., Saraceni V., Souza Riback T. I., Pinto S. B., Anders K. L., Moreira L. A., Salje H., Estimating the effect of the wMel release programme on the incidence of dengue and chikungunya in Rio de Janeiro, Brazil: A spatiotemporal modelling study. Lancet Infect. Dis. 22, 1587–1595 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Fernández E., Smieja M., Walter S. D., Loeb M., A predictive model to differentiate dengue from other febrile illness. BMC Infect. Dis. 16, 694 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Sa-Ngamuang C., Haddawy P., Luvira V., Piyaphanee W., Iamsirithaworn S., Lawpoolsri S., Accuracy of dengue clinical diagnosis with and without NS1 antigen rapid test: Comparison between human and Bayesian network model decision. PLOS Negl. Trop. Dis. 12, e0006573 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Bodinayake C. K., Tillekeratne L. G., Nagahawatte A., Devasiri V., Kodikara Arachchi W., Strouse J. J., Sessions O. M., Kurukulasooriya R., Uehara A., Howe S., Ong X. M., Tan S., Chow A., Tummalapalli P., De Silva A. D., Østbye T., Woods C. W., Gubler D. J., Reller M. E., Evaluation of the WHO 2009 classification for diagnosis of acute dengue in a large cohort of adults and children in Sri Lanka during a dengue-1 epidemic. PLOS Negl. Trop. Dis. 12, e0006258 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Daumas R. P., Passos S. R., Oliveira R. V., Nogueira R. M., Georg I., Marzochi K. B., Brasil P., Clinical and laboratory features that discriminate dengue from other febrile illnesses: A diagnostic accuracy study in Rio de Janeiro, Brazil. BMC Infect. Dis. 13, 77 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Restrepo A. C., Baker P., Clements A. C., National spatial and temporal patterns of notified dengue cases, Colombia 2007-2010. Trop. Med. Int. Health 19, 863–871 (2014). [DOI] [PubMed] [Google Scholar]
- 32.Yoon I. K., Getis A., Aldstadt J., Rothman A. L., Tannitisupawong D., Koenraadt C. J., Fansiri T., Jones J. W., Morrison A. C., Jarman R. G., Nisalak A., Mammen M. P. Jr., Thammapalo S., Srikiatkhachorn A., Green S., Libraty D. H., Gibbons R. V., Endy T., Pimgate C., Scott T. W., Fine scale spatiotemporal clustering of dengue virus transmission in children and Aedes aegypti in rural Thai villages. PLOS Negl. Trop. Dis. 6, e1730 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Bhoomiboonchoo P., Gibbons R. V., Huang A., Yoon I. K., Buddhari D., Nisalak A., Chansatiporn N., Thipayamongkolgul M., Kalanarooj S., Endy T., Rothman A. L., Srikiatkhachorn A., Green S., Mammen M. P., Cummings D. A., Salje H., The spatial dynamics of dengue virus in Kamphaeng Phet, Thailand. PLOS Negl. Trop. Dis. 8, e3138 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Misslin R., Telle O., Daudé E., Vaguet A., Paul R. E., Urban climate versus global climate change-what makes the difference for dengue? Ann. N. Y. Acad. Sci. 1382, 56–72 (2016). [DOI] [PubMed] [Google Scholar]
- 35.Flores Ruiz S., Cabrera Romo S., Castillo Vera A., Dor A., Effect of the rural and urban microclimate on mosquito richness and abundance in Yucatan State, Mexico. Vector Borne Zoonotic Dis. 22, 281–288 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Scott T. W., Amerasinghe P. H., Morrison A. C., Lorenz L. H., Clark G. G., Strickman D., Kittayapong P., Edman J. D., Longitudinal studies of Aedes aegypti (Diptera: Culicidae) in Thailand and Puerto Rico: Blood feeding frequency. J. Med. Entomol. 37, 89–101 (2000). [DOI] [PubMed] [Google Scholar]
- 37.Abdullah N., Dom N. C., Salleh S. A., Salim H., Precha N., The association between dengue case and climate: A systematic review and meta-analysis. One Health 15, 100452 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Hossain S., Islam M. M., Hasan M. A., Chowdhury P. B., Easty I. A., Tusar M. K., Rashid M. B., Bashar K., Association of climate factors with dengue incidence in Bangladesh, Dhaka City: A count regression approach. Heliyon 9, e16053 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Ouattara C. A., Traore T. I., Traore S., Sangare I., Meda C. Z., Savadogo L. G. B., Climate factors and dengue fever in Burkina Faso from 2017 to 2019. J. Public Health Afr. 13, 2145 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Singh N., Mall R. K., Banerjee T., Gupta A., Association between climate and infectious diseases among children in Varanasi city, India: A prospective cohort study. Sci. Total Environ. 796, 148769 (2021). [DOI] [PubMed] [Google Scholar]
- 41.Lu H. C., Lin F. Y., Huang Y. H., Kao Y. T., Loh E. W., Role of air pollutants in dengue fever incidence: Evidence from two southern cities in Taiwan. Pathog. Glob. Health 117, 596–604 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Carneiro M. A. F., Alves B., Gehrke F. S., Domingues J. N., Sá N., Paixão S., Figueiredo J., Ferreira A., Almeida C., Machi A., Savóia E., Nascimento V., Fonseca F., Environmental factors can influence dengue reported cases. Rev. Assoc. Med. Bras. 63, 957–961 (2017). [DOI] [PubMed] [Google Scholar]
- 43.Betjeman T. J., Soghoian S. E., Foran M. P., mHealth in Sub-Saharan Africa. Int. J. Telemed. Appl. 2013, 482324 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Gubler D. J., Dengue and dengue hemorrhagic fever. Clin. Microbiol. Rev. 11, 480–496 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Tissera H., Samaraweera P., de Boer M., Gandhi S., Malvaux L., Mehta S., Palihawadana P., Vantomme V., Paris R., Schmidt A., The burden of acute febrile illness attributable to dengue virus infection in Sri Lanka: A single-center 2-year prospective cohort study (2016-2019). Am. J. Trop. Med. Hyg. 106, 160–167 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Richardson S., Dauber-Decker K. L., McGinn T., Barnaby D. P., Cattamanchi A., Pekmezaris R., Barriers to the use of clinical decision support for the evaluation of pulmonary embolism: Qualitative interview study. JMIR Hum. Factors 8, e25046 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ho T. S., Weng T. C., Wang J. D., Han H. C., Cheng H. C., Yang C. C., Yu C. H., Liu Y. J., Hu C. H., Huang C. Y., Chen M. H., King C. C., Oyang Y. J., Liu C. C., Comparing machine learning with case-control models to identify confirmed dengue cases. PLOS Negl. Trop. Dis. 14, e0008843 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Fathima A. S., Manimeglai D., Analysis of significant factors for dengue infection prognosis using the random forest classifier. Int. J. Adv. Comput. Sci. Appl. 6, 240–245 (2015). [Google Scholar]
- 49.Ming D. K., Tuan N. M., Hernandez B., Sangkaew S., Vuong N. L., Chanh H. Q., Chau N. V., Simmons C. P., Wills B., Georgiou P., The diagnosis of dengue in patients presenting with acute febrile illness using supervised machine learning and impact of seasonality. Front. Digit. Health 4, 849641 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Kerdpanich P., Kongkiatngam S., Buddhari D., Simasathien S., Klungthong C., Rodpradit P., Thaisomboonsuk B., Wongstitwilairoong T., Hunsawong T., Anderson K. B., Fernandez S., Jones A. R., Comparative analyses of historical trends in confirmed dengue illnesses detected at public hospitals in Bangkok and Northern Thailand, 2002-2018. Am. J. Trop. Med. Hyg. 104, 1058–1066 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Huang A. T., Takahashi S., Salje H., Wang L., Garcia-Carreras B., Anderson K., Endy T., Thomas S., Rothman A. L., Klungthong C., Jones A. R., Fernandez S., Iamsirithaworn S., Doung-Ngern P., Rodriguez-Barraquer I., Cummings D. A. T., Assessing the role of multiple mechanisms increasing the age of dengue cases in Thailand. Proc. Natl. Acad. Sci. U.S.A. 119, e2115790119 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Anderson K. B., Buddhari D., Srikiatkhachorn A., Gromowski G. D., Iamsirithaworn S., Weg A. L., Ellison D. W., Macareo L., Cummings D. A. T., Yoon I.-K., Nisalak A., Ponlawat A., Thomas S. J., Fernandez S., Jarman R. G., Rothman A. L., Endy T. P., An Innovative, prospective, hybrid cohort-cluster study design to characterize dengue virus transmission in multigenerational Households in Kamphaeng Phet, Thailand. Am. J. Epidemiol. 189, 648–659 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Ribeiro Dos Santos G., Buddhari D., Iamsirithaworn S., Khampaen D., Ponlawat A., Fansiri T., Farmer A., Fernandez S., Thomas S., Rodriguez Barraquer I., Srikiatkhachorn A., Huang A. T., Cummings D. A. T., Endy T., Rothman A. L., Salje H. A.-O., Anderson K. A.-O., Individual, household, and community drivers of dengue virus infection risk in Kamphaeng Phet Province, Thailand. J. Infect. Dis. 226, 1348–1356 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Sarica A., Cerasa A., Quattrone A., Random forest algorithm for the classification of neuroimaging data in alzheimer's disease: A systematic review. Front. Aging Neurosci. 9, 329 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Peng S. Y., Chuang Y. C., Kang T. W., Tseng K. H., Random forest can predict 30-day mortality of spontaneous intracerebral hemorrhage with remarkable discrimination. Eur. J. Neurol. 17, 945–950 (2010). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.