Skip to main content
PLOS Neglected Tropical Diseases logoLink to PLOS Neglected Tropical Diseases
. 2025 Feb 7;19(2):e0012843. doi: 10.1371/journal.pntd.0012843

Assessing the ecological resilience of Ebola virus in Africa and potential influencing factors based on a synthesized model

Li Shen 1,#, Jiawei Song 1,#, Yibo Zhou 1,#, Xiaojie Yuan 2, Samuel Seery 3, Ting Fu 2, Xihao Liu 2, Yihong Liu 4, Zhongjun Shao 2,*, Rui Li 2,*, Kun Liu 2,*
Editor: Mabel Carabali5
PMCID: PMC11805440  PMID: 39919104

Abstract

Background

The Ebola epidemic has persisted in Africa since it was firstly identified in 1976. However, few studies have focused on spatiotemporally assessing the ecological adaptability of this virus and the influence of multiple factors on outbreaks. This study quantitatively explores the ecological adaptability of Ebola virus and its response to different potential natural and anthropogenic factors from a spatiotemporal perspective.

Methodology

Based on historical Ebola cases and relevant environmental factors collected from 2014 to 2022 in Africa, the spatiotemporal distribution of Ebola adaptability is characterized by integrating four distinct ecological models into one synthesized spatiotemporal framework. Maxent and Generalized Additive Models were applied to further reveal the potential responses of the Ebola virus niche to its ever-changing environments.

Findings

Ebola habitats appear to aggregate across the sub-Saharan region and in north Zambia and Angola, covering approximately 16% of the African continent. Countries presently unaffected by Ebola but at increasing risk include Ethiopia, Tanzania, Côte d’Ivoire, Ghana, Cameroon, and Rwanda. In addition, among the thirteen key influencing factors, temperature seasonality and population density were identified as significantly influencing the ecological adaptability of Ebola. Specifically, those regions were prone to minimal seasonal variations in temperature. Both the potential anthropogenic influence and vegetation coverage demonstrate a rise-to-decline impact on the outbreaks of Ebola virus across Africa.

Conclusions

Our findings suggest new ways to effectively respond to potential Ebola outbreaks in Sub-Saharan Africa. We believe that this integrated modeling approach and response analysis provide a framework that can be extended to predict risk of other worldwide diseases from a similar epidemic study perspective.

Author summary

As one of the most lethal viral neglected tropical diseases in human history, Ebola virus disease typically causes organ damage, high fever, internal bleeding, and diarrhea with a high transmission rate and a case fatality rate between 50 and 90%. Traditional mathematical models neglect to incorporate the spatial variation of Ebola pathogens to reveal its potential outbreak source and sink areas. In addition, previous studies mainly regard EVD as an infectious disease to predict its distribution change, but rarely consider specific transmission process of the virus, resulting in sudden and rapid predictions with relatively low accuracy. In this study, we conducted a comprehensive analysis to spatiotemporally investigate the ecological adaptability of Ebola virus and its response to different environmental factors by proposing an integrated model. We found that Ebola habitats show significant differences in regional distribution. Temperature seasonality and population density might significantly influence the ecological adaptability of Ebola. Our findings can provide scientific support for practical prevention and control of future Ebola outbreaks in potential risk area.

1. Introduction

Ebola virus disease (EVD) is an acute infectious disease caused by the Ebola virus, a member of the Filoviridae family [1,2]. It is one of the most lethal viral diseases, characterized by high fever, organ damage, internal bleeding, and a case fatality rate of 50-90% worldwide [3]. EVD was firstly discovered in Sudan and the Democratic Republic of Congo in Africa in 1976 [4], and has since emerged in Central and West African nations [5], affecting Uganda, Democratic Republic of Congo (DRC), and Nigeria [6]. However, over the past decade, three major pandemics have occurred, including the 2014 Ebola outbreak in West Africa, the 2018 outbreak in DRC, and the 2022 outbreak in Uganda, with the first one recorded as the largest outbreak to date [7,8].

Previous scholars have been dedicated to researching virus spread through interactions between the susceptible agents and infected animals or humans. A variety of models have been developed or improved by continuously optimizing parameters [9]. For example, ecological niche models, incorporating environmental factors as covariates and using presence-only data, can be used to identify diverse habitats suitable for the spillover of different ebolaviruses [10]. From a zoonotic perspective, the Susceptible-Infectious-Recovered (SIR) compartmental model has also been deployed to explore the dynamics of Ebola transmission between species and simulate Ebola trends [1114]. Also, the Geographic Information System-based multicriteria evaluation (GIS-MCE) model can generates maps of risk area for Ebola spillover by taking account the spatial and temporal variability [15]. In addition, epidemiological compartmental models are used to forecast the spatial and temporal risks associated with Ebola Virus Disease [16]. And time-series regression models have significant potential to extract the filovirus distribution for improving the overall accuracy of traditional models [1719].

However, the majority of conventional mathematical models paid little attention to incorporate the spatial variability of Ebola pathogens, limiting their ability to accurately identify potential outbreak source and sink areas. Especially, reliance on a single model may compromise both the robustness and accuracy of the results, as well as result in a loss of information, due to the model’s inherent limitations [20]. Besides, the conventional models have paid little attention to simultaneously examining the impacts of both natural and anthropogenic factors on the host movement from a spatiotemporal perspective. Moreover, most relevant studies fail to use a comprehensive spatiotemporal dataset on potential influencing factors and the latest case data, such as from the 2022 Uganda outbreak based on a unified spatiotemporal scale.

Therefore, this study is purposed to propose a synthesized spatiotemporal framework to quantitatively explore the dynamic distribution of Ebola outbreaks in Africa, as well as the driving impact from different factors. A comprehensive and robust analysis can be provided by strategically integrating four distinct ecological models and incorporating an extensive dataset of multi-facet environmental factors and the most current case information. There were three specific objectives: 1) to extract the spatiotemporal distribution of Ebola by establishing a synthesized prediction model; 2) to investigating the pivotal factors that can affect the suitable niches of Ebola virus; and 3) to reveal the ecological adaptability of Ebola virus by characterizing its response to the influencing factors.

2. Methods

2.1. Study area and status

Since the first recorded outbreaks in Sudan and the DRC in 1976, the transmission of the EVD has predominantly been observed within the African continent. This study focused on Western and Central African nations where the most recent outbreaks occurred. According to the World Health Organization (WHO), two major outbreaks have been recorded since 2014. The first occurred in West Africa with over 28,600 infections and 11,325 fatalities in 2014 [7,21]. The second was in 2018 in the DRC, and had a total of 3,481 cases with 2299 fatalities [22]. Since then, there have been sporadic, relatively small outbreaks across Africa, in the DRC from 2018 to 2022, Guinea in February 2021, and Uganda in September 2022 as shown in Fig 1. Due to the sporadic nature of Ebola outbreaks prior to 2014 [23], we focused our investigation on the ecological adaptability of the Ebola virus based on EVD outbreaks occurring after the 2014 West Africa epidemic.

Fig 1. Study area and spatial-temporal distribution of EVD in Africa since 2014 (A) Spatial distribution of EVD cases, and (B) Temporal variations of EVD cases and fatality ratio in different countries.

Fig 1

The base layers of the map were obtained from the openly available source via the Natural Earth (https://www.naturalearthdata.com/downloads/50m-cultural-vectors/).

2.2. Data preparation

2.2.1. Surveyed EVD cases.

Long-term EVD surveillance data was used in this study and included cases from the 2014-2016 outbreaks (https://data.humdata.org/dataset/evd-cases-by-district), the 2018 DRC outbreak (https://data.humdata.org/dataset/ebola-cases-and-deaths-drc-north-kivu), and the 2022 outbreak in Uganda (https://global.health/). The first two groups of data were obtained from The Humanitarian Data Exchange (HDX) website, which serves as an open platform for facilitating easily accessible and useable data for this analysis. Epidemic data were primarily based on information provided by the WHO, along with real-time reporting by national health authorities of different countries. Data from the 2022 Uganda outbreak were acquired from the Global Health website, which is purported to provide accurate real-time disease data in the early stages of an outbreak, tracking cases for the first 100 days. Geographical points of each case were collected from the aforementioned sources and then abstracted by locating them in the center of a specific region or in proximity to the medical facilities. This pre-process helped to avoid potential conflicts or errors without affecting the reliability and accuracy of the final results.

2.2.2. Environmental data.

Previous research has demonstrated that the transmission of Ebola is influenced by multiple environmental factors such as climate conditions (e.g., precipitation and temperature) and vegetation abundance (e.g., deforestation and the consumption of forest prey) [24]. In this study, climate data were primarily obtained from WorldClim (https://www.worldclim.org/), a database that offers high-resolution weather and climate data at a global scale.

Surface coverage data were obtained from the Copernicus Climate Change Service (https://climate.copernicus.eu/), which combines climate system observations with scientific research to produce authoritative and quality-assured information on European and global climate. In addition, vegetation information was acquired from the global map data archive established by the Global Map Transfer Program and long-term time series normalized difference vegetation index (NDVI) data were retrieved from NOAA’s Advanced Very High Resolution Radiometer (AVHRR) sensor.

2.2.3. Human impact data.

Demographic data were extracted from Gridded Population of the World, version 4 data (GPW.v4) which is based on national censuses and population registers. We also utilized Human Influence Index grids data set which contains information such as population density raster, human land cover raster, and constructed roads. In addition, night time light remote sensing data were also utilized due to its strong correlation with socioeconomic factors, which was obtained from the Operational LineScan System (OLS) of the U.S. Defense Meteorological Satellite Program (DMSP) and the VIIRS Plus DMSP light change dataset.

All the aforementioned data was carefully examined and preprocessed. We cleaned the datasets by removing elements outside normal geographic boundaries and then standardizing raster resolutions to 2.45 km using Python and ArcGIS 10.8. To improve the spatial precision of Ebola-impacted provinces, we utilized the Random Point Generation tool in ArcGIS 10.8 to randomly generate additional points within a 0.3 arc minute rectangular frame around the center of each province, based on the proportion of cases relative to the total epidemic counts. Subsequently, a geospatial database was established in ArcGIS 10.8, integrating epidemic case data, remote sensing imagery, surveyed attributes, and spatial vector boundaries into three distinct feature datasets (Ebola case points, multisource raster, and national boundaries). Finally, topological rules were implemented and quality checks was performed to ensure the integrity and accuracy of database, providing reliable inputs for further in-depth analysis.

2.2.4. Variable filter.

To prevent overfitting and enhance the interpretability of the models, variable filtering was conducted to select significant factors. Based on JackKnife method implemented via Maxent software V3.4.3, and XGboost algorithm in Python, we assessed the significance of each factor [25,26]. In order to solve the strong interrelationships among those variables and improve the accuracy, we utilized Spearman’s correlation coefficient in Python to remove the variables exhibiting substantial correlation [27,28]. This process can help identify the more representative and significant factors that were used for subsequent modeling and predicting.

2.3. Methods

Spatial-temporal distribution of EVD in Africa from 2014 to 2023 were visualized by ArcGIS 10.8. In order to accurately estimate the current spatiotemporal distribution of EVD as well as its potential risk, we established a synthesized prediction model by integrating four distinct ecological niche approaches which are Maxent, Bioclim, Domain and GARP models. Table A in S1 Appendix shows the advantages and disadvantages of those four models and they complement each other in characteristics. Specifically, the Bioclim model can provide essential boundary conditions based on climatic extremes [29], serving as a foundational layer to define species’ ecological limits. This can be offset by the Maxent model[30], which incorporates the complex, nonlinear ecological interactions that the Bioclim model may overlook, thus enriching the predictive model with greater ecological depth. On the other hand, simplistic the Domain model[31], can offer valuable preliminary insights especially in data-limited situations, establishing a baseline distribution hypothesis that can be refined. Finally, the iterative optimization processes of the GARP model can better refine the initial predictions by systematically improving accuracy through complex rule handling [32]. Therefore, the integrated model by combining these approaches can not only consolidate basic climatic thresholds and ecological interactions but also increase the precision and adaptability in predicting specie distribution under varying environmental conditions.

The Maxent model, implemented using Maxent software V3.4.3, provides probabilities ranging from 0 to 1 which indicates the likelihood of Ebola presence. The Bioclim and Domain models, executed through DIVA-GIS software, yield larger values for regions more prone to Ebola outbreaks. Meanwhile, the GARP model, realized through DesktopGARP 1.1.3 software, categorizes the study area into suitable and unsuitable regions for Ebola occurrence. The description of those four models can be found in the supplementary materials (S2 Appendix). Our proposed synthesized model is given by

yj^i=fiXj,Θi#1

where i represents the type of ecological niche model, j represents the prediction unit, f denotes the model mapping, and X is the feature vector used to describe multidimensional influencing factors. After training the model, input features Xj are applied to each model for prediction, resulting in predicted values yj^i. By adjusting the model parameters Θi, we aimed to minimize the difference between the predicted values fiXj,Θi and the actual values yj, which is expressed as:

minΘiyjfiXj,Θi2#2

Since the Maxnet, Bioclim, and Domain models yield specific values while the GARP model provides logistic variables, it is necessary to standardize these different results to enable comparison. We reclassified the outputs from the first three models into five uniform intervals ranging from 0 to 1 with each interval at 0.2, to align with the logistic variables from the GARP model. Finally, the predicted values of the four models are weighted and summed to obtain the ultimate ecological niche prediction y^jinter. The weights Wi were derived through the Analytic Hierarchy Process (AHP), comprehensively considering factors such as model accuracy, nonlinear data handling capabilities, complexity, data volume requirements, and generalizability [33]. The summation is conducted as follows:

y^jinter=i=14Wiy^ji#3

In our study, the dataset was randomly partitioned into a training set (75% of the data) and a test set (25% of the data) for model validation. Each model was constructed using the training set and subsequently evaluated based on the test set. Based on the predictive accuracy of each model, the final outcome was retrieved by incorporating different weights according to the following standards: (1) the plotted Receiver Operating Characteristic curves (ROC); (2) the calculated Area Under the Curve (AUC) values, a widely used metric for evaluating the classification performance of heavily imbalanced data; (3) normalizing the different suitability representations of into five levels (1 for Not Suitable, 2 for Less Suitable, 3 for Moderate, 4 for More Suitable, and 5 for Suitable); (4) assigning weights to each model based on the corresponding AUC values, which were then standardized using a grid-based calculation method to obtain the final result. A schematic representation of the entire process was depicted in Fig 2.

Fig 2. Predicted ecologically suitable regions of Ebola virus based on Maxent, Bioclim, Domain, and GARP models with 13 independent, influential variables, i.e., bio3, bio4, bio6, bio7, bio11, bio12, bio14, NDVI, GPP, GPW, HII, LCCS, light.

Fig 2

The base layers of the map were obtained from the openly available source via the Natural Earth (https://www.naturalearthdata.com/downloads/50m-cultural-vectors/).

We applied the Maxent model and Jackknife testing for repeated experiments (100 iterations) to identify the key environmental variables influencing the prediction of Ebola-suitable habitats. To further explore these variables, we performed the Maxent model and GAM (generalized additive model) (see S2 Appendix) to assess how each environmental factor impacts the predicted risk. For the Maxent model, during the modeling process, all environmental variables were held constant at their mean values, except for one specific variable, which was allowed to vary. The resulting Environmental Marginal Response Curves derived from the Maxent model illustrate the relationship between the predicted probability of suitability and variations in this specific environmental factor. These curves provide insight into how ecological viral adaptability corresponds to specific environmental variables.

For the GAM model, we first generated 1,000 random spatial points within the study area using the Create Random Points tool in ArcGIS. Environmental data for these points, along with the predicted suitability values from the Maxent model, were then extracted. The predicted suitability values from the Maxent model served as the response variable in the subsequent GAM analysis, enabling the assessment of the influence of environmental predictors on the modeled suitability.

3. Results

3.1. Ecological adaptability assessment of Ebola

We ultimately selected 13 variables that are most likely to influence Ebola distribution, namely Isothermality (bio3), Temperature Seasonality (bio4), Min Temperature of Coldest Month (bio6), Temperature Annual Range (bio7), Mean Temperature of Coldest Quarter (bio11), Annual Precipitation (bio12), Precipitation of Driest Month (bio14), Gross Primary Productivity (GPP), Gridded Population of the World (GPW), Human Influence Index (HII), Land Cover Classification System (LCCS), Night Lights (light) and NDVI. Details regarding the units, ranges, standard deviations (STD), means, and data sources of these variables are provided in Table 1. The comprehensive definitions and descriptions of each variable can be referred to the official resource (https://www.worldclim.org/data/bioclim.html).

Table 1. Description of the selected 13 variables for this study, including their units, ranges, STD, means, and data sources.

Variables Minimum Maximum Mean STD Source
Isothermality (bio3) 9.1 100.0 34.4 18.8 https://www.worldclim.org/data/worldclim21.html
Temperature Seasonality (×100; bio4) 0 2377.6 888.5 467.8
Min Temperature of Coldest Month (°C; bio6) -72.5 26.5 -20.2 26.0
Temperature Annual Range (°C; bio7) 1 72.7 34.1 12.0
Mean Temperature of Coldest Quarter (°C; bio11) -66.4 29.3 -14.3 26.7
Annual Precipitation (mm; bio12) 0 11246.0 537.2 636.5
Precipitation of Driest Month(mm; bio14) 0 507.0 14.7 27.6
Gross Primary Productivity(GPP) 0 715.2 44.2 85.7 https://essd.copernicus.org/articles/14/1063/2022/essd-14-1063-2022-assets.html
Gridded Population of the World(GPW) 0 357341.3 46.2 423.7 https://sedac.ciesin.columbia.edu/data/set/gpw-v4-population-count-rev11
Human Influence Index(HII) 0 64 https://sedac.ciesin.columbia.edu/data/set/wildareas-v2-human-influence-index-geographic
Land Cover Classification System(LCCS) 10 220 https://maps.elie.ucl.ac.be/CCI/viewer/
Night Lights(light) 0 255.0 4.1 20.0 https://eogdata.mines.edu/products/dmsp/
Normalized Difference Vegetation Index(NDVI) -32768.0 9938.0 188.9 6370.3 https://www.ncei.noaa.gov/products/climate-data-records/normalized-difference-vegetation-index

Abbreviation: STD (Standard Deviation), Minimum (The smallest value in the raster), Maximum (The largest value in the raster), Mean (The average value of the raster).

Each of the four models demonstrate distinctive characteristics for predicting Ebola occurrences across different regions. The reclassified values derived from the first three models are depicted in Fig 3A, 3B and 3C, with darker colors indicating higher adaptability of Ebola. Results of the GARP model in the format of logistic variables are shown in Fig 3D.

Fig 3. Potential distribution of Ebola outbreak occurrences based on (A) Maxent(B) Bioclim(C) Domain(D) GARP models plus ROC curves for (E) Maxent(F) Boclim(G) Domain, and (H) GARP.

Fig 3

The base layers of the map were obtained from the openly available source via the Natural Earth (https://www.naturalearthdata.com/downloads/50m-cultural-vectors/).

The Maxent and Bioclim models show similar predictions for favorable virus habitats, primarily concentrating in West Africa, sub-Saharan nations, along the edge of the Congolese basin, around Lake Victoria, and on the Ethiopian plateau. These ecologically suitable zones are generally situated near the equator. In contrast, the Domain and GARP models utilize more stringent criteria to identify the non-viable areas, and predict a broader range of potential zones distributed across the African continent, spanning from sub-Saharan regions to the Kalahari Desert in the South African plateau.

According to the accuracy assessment of ROC curves illustrated by Fig 3E, 3F, 3G and 3H, the AUC values derived from these four prediction models are all relatively high with Maxent (0.99), Bioclim (0.92), Domain (0.93) and GARP (0.85), respectively. In order to synthesize the predictions of the four ecological niche models, we assigned 30%, 30%, 30%, and 10% weights to the four models mentioned, and obtained the integrated danger levels of Ebola outbreaks as well as the potential habitat of EVD in Africa as shown by Fig 4A, while higher levels indicate a greater risk of an Ebola outbreak.

Fig 4. Map of potentially suitable habitats for Ebola virus in Africa and a closer look at four high-risk regions with (A) Integrated risk levels of Ebola based on results derived from four models(B) Western Africa region(C) Coastal regions of the Gulf of Guinea(D) Central Africa region, and (E) Eastern African region.

Fig 4

The base layers of the map were obtained from the openly available source via the Natural Earth (https://www.naturalearthdata.com/downloads/50m-cultural-vectors/).

The areas at most risk predominantly lie in the western, central, and eastern regions of Africa, situated south of the Sahara Desert but close to the equator as illustrated by Fig 4B, 4C, 4D and 4E. Specific countries include Guinea, Liberia, Sierra Leone, Nigeria, DRC, Uganda, Ethiopia, among other. Using the Albers Equal Area Conic projection within corresponding coordinate system, the calculated total area occupied by these high-risk regions (areas above level mmoderately high in the map) are approximately 3, 509, 623.11 km2, accounting for nearly 11.61% of Africa’s entire land area.

By integrating population density grids, we estimated the total population affected in each country located within the high-risk areas depicted in Fig 5. The red bars in Fig 5C represents countries without reported Ebola outbreaks but with a substantial population at risk of infection. The blue bars signify countries with a history of Ebola outbreaks and documented cases. Based on the current predictive results, countries at higher risk yet without a history of Ebola include Ethiopia, Tanzania, Côte d’Ivoire, Ghana, Cameroon, Rwanda, Burundi, and others.

Fig 5. Populations in Ebola risk areas within each country with (A) Spatial distribution of countries within the Ebola risk zone with reported Ebola outbreak (B) The spatial distribution of countries within the Ebola risk zone without any reported Ebola outbreaks, and (C) Histogram of populations at high risk of Ebola infection.

Fig 5

The base layers of the map were obtained from the openly available source via the Natural Earth (https://www.naturalearthdata.com/downloads/50m-cultural-vectors/).

Côte d’Ivoire has reported one single Ebola case in 2021 but this did not lead to a further outbreak. Countries that have experienced recent Ebola outbreaks (e.g., Guinea, the DRC, and Liberia), as well as those that had outbreaks in earlier periods (e.g., Gabon, South Africa, and the Sudan), all matched with the predicted Ebola-adapted zones, providing a good validation for the reliability.

3.2. Significance analysis of impact factors

Given the superior performance of the Maxent model in accuracy assessment owing to a high AUC value of 0.99, this study conducted an impact factor analysis to examine the crucial environmental variables for predicting Ebola-suitable habitats. Table 2 presents the contribution and permutation importance of 13 environmental variables to the ecological adaptability prediction of EVD, which was derived from Maxent modeling. Averages of 100 experiments reveal that bio4, GPW, bio12, LCCS, and bio6 act in rank as pivotal factors that can affect the suitable niches of Ebola. Those key variables account for a cumulative percentage contribution of 97.8% and permutation importance of 98.5%.

Table 2. Contribution and permutation importance of variables derived through Maxent modeling (average of 100 experiments).

Variable Contribution (%) Permutation Importance
bio4(Temperature Seasonality) 44.89 48.19
GPW(Gridded Population of the World) 33.43 32.69
bio12(Annual Precipitation) 9.76 9.09
LCCS(Land Cover Classification System) 7.21 5.67
bio6(Min Temperature of Coldest Month) 2.47 2.82
HII(Human Influence Index) 1.55 0.59
bio3(Isothermality) 0.52 0.44
GPP(Gross Primary Productivity) 0.14 0.43
NDVI(Normalized Difference Vegetation Index) 0 0.06
Light(Night Lights) 0 0
bio7(Temperature Annual Range) 0 0
bio14(Precipitation of Driest Month) 0 0
bio11(Mean Temperature of Coldest Quarter) 0 0

Jackknife testing (Fig 6) based on Maxent software 3.4.3, applied two methods to evaluate the impact of environmental variables on Ebola virus models. The first method involves using only one variable at a time, highlighting the direct impact of variables like bio3, bio4, bio11, bio12, GPP, and GPW on model performance metrics. The second one excludes one variable to assess its unique contribution, revealing that the exclusion of GPW, LCCS, and bio12 significantly reduced model accuracy.

Fig 6. Jackknife test of the importance of environment variables in Maxent, including Jackknife of regularized training gain, test gain and AUC.

Fig 6

After combining those findings, it became evident that bio4, bio12 and GPW exert a significant impact on the suitable niches of Ebola.

3.3. Response of impact factors

To generate response curves for different variables, we utilized individual variables to construct the Maxent model. Consequently, we obtained the model’s predicted probabilities as a function of increasing the values of each environmental factor (Fig 7). The x-axis represents values of the environmental variables, while the y-axis corresponds to the consequent Ebola suitability probability. The line graph demonstrates the dependency of predicted suitability on the selected variable, as well as the dependency arising from the correlation between the chosen variable and other variables. Since HII and LCCS are categorical variables, the results have been illustrated using a standard bar chart.

Fig 7. Response curves of environmental factors and the probability of suitability for Ebola based on Maxent model.

Fig 7

Each curve (A-M) represents model-predicted probabilities associated with bio3, bio4, bio6, bio7, bio11, bio12, bio14, GPP, GPW, HII, LCCS, light, and NDVI. Curves display the mean response derived from 10 replicate Maxent runs (shown in red) with means and standard deviations (represented in blue, with two shades for categorical variables).

The GAM model-based impact factors and their response to Ebola’s ecological adaptability are shown in Fig 8. Each scatter plot in the figure represents a random point, with the x-axis representing the variable value and the y-axis showing its impact on Ebola’s ecological adaptability. Vertical lines on the x-axis depict data distribution and density, while the light purple background signifies the confidence interval. These lines can represent the fitting of the data points.

Fig 8. GAM model-based impact factors and ecological adaptability response of Ebola (A-M), final fitness classes with bio3, bio4, bio6, bio7, bio11, bio12, bio14, GPP, GPW, HII, LCCS, light, and NDVI.

Fig 8

Combining the outcomes yielded by the Maxent model and GAM, we revealed the distinct patterns in Ebola distribution:

  • (1)

    The ecological adaptability of the Ebola virus is inversely related to temperature seasonality (bio4) but positively correlates with higher winter average temperatures and annual precipitation (bio6, bio12).

  • (2)

    Greater population densities enhance the ecological adaptability of the Ebola virus (GPW). However, this adaptability exhibits a bell-shaped curve in relation to economic development, increasing initially before diminishing in more economically advanced regions (HII).

  • (3)

    The ecological adaptability of the Ebola virus shows an initial increase followed by a decline in response to rising vegetation levels, peaking in areas with moderate vegetation coverage.

  • (4)

    The Ebola virus adaptability responds positively to various land covers, with specific codes representing rainfed cropland (10), mosaic cropland (30), broadleaf deciduous forests (60), flooded forests (170), and urban zones (190) (LCCS).

4. Discussion

As an interdisciplinary piece of research, this study attempts to investigate suitable ecological niches for Ebola virus as well as its potential responses to different environmental factors. This study provides a practical example of how to apply geostatistical approaches to quantitatively explore the dynamic distribution, variation, and influential determinants of an epidemic, not only demonstrating an opportunity to address certain research gaps in the existing literature, but also presenting some potential to support public health decision-makers in formulating and implementing effective strategies.

Regarding spatiotemporal predictions for Ebola habitats at a macroscopic scale, the forecasted outcomes of the integrated model mainly aggregate to the south of the Sahara Desert and north of Zambia and Angola, covering approximately 16% of the entire African continent. This finding relatively matches with the zoonotic niche map of Ebola virus revealed by Pigott et al. (2014) [14], but in comparison, our high-risk areas are more dispersed and better correspond to the previous outbreak patterns [34]. This may be attributed to the utilization of the latest infection case data, including the 2018 Congolese outbreak and the 2022 Uganda outbreak. Additionally, we incorporated a more comprehensive set of potential influencing factors, which improved identifying the most significant ecological determinants that possibly affect the adaptability of Ebola virus in Africa, thereby further increasing the accuracy of final prediction.

Our research also compiled the demographic statistics within Ebola outbreak-prone regions across different African countries, estimating the total population at risk in these high-risk areas to be 20,504,885. Among this population, 11,464,743 individuals reside in regions with reported Ebola outbreaks, while 9,040,142 individuals live in areas with no reported outbreaks. Notably, Ethiopia, one of the countries with the highest at-risk population, although not previously documented with Ebola cases, has been identified as having the potential risk for an Ebola outbreak in our study, similar to the findings from Redding et al. (2019) [16]. Both researches highlighted environmental factors in Ethiopia are conducive to a potential Ebola outbreak, warranting serious concern from local governments and global public health institutions. Other countries at high risk of Ebola without previous reports include Tanzania, Cote d’Ivoire, Ghana, Cameroon, and Rwanda, which are urged to promptly implement effective prevention measures against potential Ebola epidemics.

Furthermore, this study scrutinized the responses of various influencing factors and explained the driving effect on Ebola outbreak from a spatiotemporal perspective. Utilizing the Jackknife test and the Maxent model, we identified the key factors closely associated with Ebola’s ecological adaptability, notably including bio4, GPW, bio12 and LCCS. In order to further investigate how these influencing factors affect Ebola ecological adaptability, we analyzed the different distribution patterns of the virus derived from the Maxent model and GAM, and provide the following elaborations:

  • (1)

    The ecological adaptability of Ebola virus decreases with a rise in temperature seasonality, indicating preference for regions with minimal temperature variation between seasons. This matches closely with the conclusions by Olivero et al. (2017) that the observation of Ebola outbreaks predominantly occurring in low-latitude areas near the equator [35]. Its adaptability also increases with higher winter average temperatures and annual precipitation, similar to the findings on environment temperature and precipitation trends [36].

  • (2)

    The ecological adaptability of Ebola virus escalates with greater population density, likely due to the rapid human-to-human virus transmission promoted by frequent interpersonal contacts [37,38]. Additionally, the adaptability increases initially and then decreases, becoming lower in more economically developed regions, which can be explained by the fact that the areas less impacted by anthropogenic activities especially rapid urbanization are almost not susceptible to large-scale outbreaks of Ebola [39], whereas the economically developed regions with better healthcare resources are able to effectively implement quarantine measures to reduce the risk of infection [40].

  • (3)

    The ecological adaptability of Ebola virus follows a pattern of growth to decline as vegetation increases, peaking within regions of moderate vegetation coverage. For equatorial area with more vegetation abundance, the reduction in vegetation possibly accelerates the risk of Ebola virus transmission. Other studies have also indicated that deforestation substantially elevates human interaction with infected wildlife, thereby favoring viral spread [41]. This perspective effectively corroborates our findings.

  • (4)

    The ecological adaptability of Ebola virus responds positively to land covers with values of 10, 30, 60, 170, and 190 respectively representing rainfed cropland, mosaic cropland, broadleaf deciduous forests, flooded forests and urban zones. This is consistent with the findings of Redding et al. (2019) who claimed that the land-use/land-cover determines distributions across the different reservoir host species [16]. Nevertheless, we took a step further by incorporating a quantitative analysis, particularly focusing on the nuances within land cover types.

Nonetheless, several limitations should be also taken into consideration. Firstly, the accessed Ebola case data used in this study were limited to the provincial-level records without precise geographical coordinates (e.g., longitude and latitude), which hindered the more accurate spatial analysis. Secondly, bats are known as critical hosts of Ebola virus, our evaluation did not take bats into consideration due to the difficulty in obtaining such information. This may lead to some incomplete understanding of how those host factors drive the risk of Ebola outbreaks. Thirdly, from a model perspective, our synthesized model focuses on the static ecological adaptability and does not incorporate spatial interaction dynamics, thereby incapable to simulate the mechanisms of Ebola virus spread across regions. The proposed framework also lacks a hierarchical multi-scale analytical design, weak in disentangling complex interactions at varying spatial and temporal scales. In addition, utilizing multi-source raster datasets at varying resolutions in one model involved normalization and resampling, and these steps may introduce uncertainties, such as the loss of fine-scale details or interpolation-induced artifacts, which could slightly affect the accuracy of subsequent analyses. Therefore, in further research efforts need to be devoted in optimizing the model function and enlarging the dataset, to provide more insights into the spread route of Ebola disease once spillover has occurred.

5. Conclusions

This study integrated different ecological niche models to effectively predict the spatial distribution of suitable habitats for the Ebola virus. Our findings indicate that those suitable habitats are primarily concentrated in regions near the equator, south of the Sahara Desert. Additionally, our research delved into the response patterns of key influencing factors related to Ebola, elucidating specific associations between Ebola outbreaks with variables such as climate, socio-economic conditions, vegetation, and land cover types. These insights offer valuable directions for future research in the field of epidemic prevention and control.

Supporting information

S1 Appendix. Supplemental table and figures.

Fig A. Multiple data sources, including Ebola epidemic case data, surveyed attribute data, remote sensing data, and spatial vector boundaries, and establishment of a geodatabase. The base layers of the map were obtained from the openly available source via the Natural Earth (https://www.naturalearthdata.com/downloads/50m-cultural-vectors/). Fig B. Heat map of spearman coefficients for each variable illustrating the correlations among variables. Table A. The advantages and disadvantages of Maxent, Bioclim, Domain, and GARP models.

(DOCX)

pntd.0012843.s001.docx (5.1MB, docx)
S2 Appendix. Characteristics and introduction of the models used in this study.

Table A. The format requirements of the input data for models used in this study.

(DOCX)

pntd.0012843.s002.docx (20.9KB, docx)

Acknowledgments

The project team would like to thank all the participants who consented to participate in the study. We thank our colleagues for their careful reading and editing of this manuscript. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication.

Data Availability

The authors confirm that all data underlying the findings are fully available without restriction. All relevant data are within the paper and its Supporting Information files. Publicly available data (CC BY 4.0) has been uploaded to the Researchgate (https://www.researchgate.net/profile/Jiawei-Song-14/publication/386178115_Assessing_the_ecological_resilience_of_Ebola_virus_in_Africa_and_potential_influencing_factors_based_on_a_synthesized_model/data/6747e1be790d154bf9afacef/DataforManuscriptPLOSntd.zip?_tp=eyJjb250ZXh0Ijp7ImZpcnN0UGFnZSI6InB1YmxpY2F0aW9uIiwicGFnZSI6InB1YmxpY2F0aW9uIiwicG9zaXRpb24iOiJwYWdlQ29udGVudCJ9fQ).

Funding Statement

This work was partly supported by grants from the National Natural Science Foundation of China [grant number 42201448 to Li Shen and 82273689 to Zhongjun Shao], Fundamental Research Funds for the Central Universities [grant number 2042024kf0023 to Li Shen], Education Reform Project of Wuhan University and Three Aspects of Education Reform Projects in School of Remote Sensing and Information Engineering [grant number YGJY202305 to Li Shen]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Feldmann H, Jones S, Klenk H-D, Schnittler H-J. Ebola virus: from discovery to vaccine. Nat Rev Immunol. 2003;3(8):677–85. doi: 10.1038/nri1154 [DOI] [PubMed] [Google Scholar]
  • 2.Malvy D, McElroy AK, de Clerck H, Günther S, van Griensven J. Ebola virus disease. Lancet. 2019;393(10174):936–48. doi: 10.1016/S0140-6736(18)33132-5 [DOI] [PubMed] [Google Scholar]
  • 3.Ansari AA. Clinical features and pathobiology of Ebolavirus infection. J Autoimmun. 2014;55:1–9. doi: 10.1016/j.jaut.2014.09.001 [DOI] [PubMed] [Google Scholar]
  • 4.Colebunders R, Borchert M. Ebola haemorrhagic fever–a review. J Infect. 2000;40(1):16–20. doi: 10.1053/jinf.1999.0603 [DOI] [PubMed] [Google Scholar]
  • 5.Pattyn S, van der Groen G, Jacob W, Piot P, Courteille G. Isolation of Marburg-like virus from a case of haemorrhagic fever in Zaire. Lancet (London, England). 1977;1(8011):573–4. doi: 10.1016/s0140-6736(77)92002-5 [DOI] [PubMed] [Google Scholar]
  • 6.Jacob ST, Crozier I, Fischer WA II, Hewlett A, Kraft CS, Vega M-A de L, et al. Ebola virus disease. Nat Rev Dis Primers. 2020;6(1):13. doi: 10.1038/s41572-020-0147-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kalenga OI, Moeti M, Sparrow A, Vinh-Kim N, Lucey D, Ghebreyesus TA. The ongoing Ebola epidemic in the Democratic Republic of Congo, 2018-2019. N Engl J Med. 2019;381(4):373–83. [DOI] [PubMed] [Google Scholar]
  • 8.Samarasekera U. Solidarity against Ebola: an update. Lancet Microbe. 2023;4(3):e139. doi: 10.1016/S2666-5247(23)00035-6 [DOI] [PubMed] [Google Scholar]
  • 9.Abdalla SJ, Chirove F, Govinder KS. A systematic review of mathematical models of the Ebola virus disease. Int J Model Simul. 2022;42(5):814–30. [Google Scholar]
  • 10.Judson SD, Fischer R, Judson A, Munster VJ. Ecological contexts of index cases and spillover events of different Ebolaviruses. PLoS Pathog. 2016;12(8):e1005780. doi: 10.1371/journal.ppat.1005780 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Area I, Batarfi H, Losada J, Nieto JJ, Shammakh W, Torres Á. On a fractional order Ebola epidemic model. Adv Differ Equ. 2015. doi: 10.1186/s13662-015-0613-5 [DOI] [Google Scholar]
  • 12.Ristic B, Dawson P, IEEE, editors. Real-time forecasting of an epidemic outbreak: Ebola 2014/2015 case study. 19th International Conference on Information Fusion (FUSION). Heidelberg, Germany; 2016. [Google Scholar]
  • 13.Buceta J, Johnson K. Modeling the Ebola zoonotic dynamics: interplay between enviroclimatic factors and bat ecology. PLoS One. 2017;12(6):e0179559. doi: 10.1371/journal.pone.0179559 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Pigott DM, Golding N, Mylne A, Huang Z, Henry AJ, Weiss DJ. Mapping the zoonotic niche of Ebola virus disease in Africa. Elife. 2014;3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Lee-Cruz L, Lenormand M, Cappelle J, Caron A, De Nys H, Peeters M, et al. Mapping of Ebola virus spillover: suitability and seasonal variability at the landscape scale. PLoS Negl Trop Dis. 2021;15(8):e0009683. doi: 10.1371/journal.pntd.0009683 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Redding DW, Atkinson PM, Cunningham AA, Lo Iacono G, Moses LM, Wood JLN, et al. Impacts of environmental and socio-economic factors on emergence and epidemic potential of Ebola in Africa. Nat Commun. 2019;10(1):4531. doi: 10.1038/s41467-019-12499-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Bhaskaran K, Gasparrini A, Hajat S, Smeeth L, Armstrong B. Time series regression studies in environmental epidemiology. Int J Epidemiol. 2013;42(4):1187–95. doi: 10.1093/ije/dyt092 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Kim H, Lee J-T, Fong KC, Bell ML. Alternative adjustment for seasonality and long-term time-trend in time-series analysis for long-term environmental exposures and disease counts. BMC Med Res Methodol. 2021;21(1). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Valero KCW, Isokpehi RD, Douglas NE, Sivasundaram S, Johnson B, Wootson K, et al. Plant phenology supports the multi-emergence hypothesis for Ebola spillover events. Ecohealth. 2018;15(3):497–508. doi: 10.1007/s10393-017-1288-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Bonissone PP, Xue F, Subbu R. Fast meta-models for local fusion of multiple predictive models. Appl Soft Comput. 2011;11(2):1529–39. doi: 10.1016/j.asoc.2008.03.006 [DOI] [Google Scholar]
  • 21.Coltart CEM, Lindsey B, Ghinai I, Johnson AM, Heymann DL. The Ebola outbreak, 2013-2016: old lessons for new epidemics. Philos Trans R Soc Lond B Biol Sci. 2017;372(1721):20160297. doi: 10.1098/rstb.2016.0297 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.WHO (World Health Organization) [Internet]. Ebola outbreak in Democratic Republic of the Congo declared over. 2024 [cited 2024 Aug 8]. Office of International Medicine. Available from: https://www.who.int/southeastasia/publications/i/item/9789290211495.
  • 23.Weyer J, Grobbelaar A, Blumberg L. Ebola virus disease: history, epidemiology and outbreaks. Curr Infect Dis Rep. 2015;17(5):480. doi: 10.1007/s11908-015-0480-y [DOI] [PubMed] [Google Scholar]
  • 24.Fang L-Q, Yang Y, Jiang J-F, Yao H-W, Kargbo D, Li X-L, et al. Transmission dynamics of Ebola virus disease and intervention effectiveness in Sierra Leone. Proc Natl Acad Sci. 2016. Apr 19;113(16):4488–93. doi: 10.1073/pnas.1518587113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Guo Y, Zhang S, Tang S, Pan J, Ren L, Tian X, et al. Analysis of the prediction of the suitable distribution of Polygonatum kingianum under different climatic conditions based on the MaxEnt model. Front Earth Sci. 2023;11. doi: 10.3389/feart.2023.1111878 [DOI] [Google Scholar]
  • 26.Snider B, McBean EA, Yawney J, Gadsden SA, Patel B. Identification of variable importance for predictions of mortality from COVID-19 using AI models for Ontario, Canada. J Med Internet Res. 2021;9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Kumar S, Stohlgren TJ, Chong GW. Spatial heterogeneity influences native and nonnative plant species richness. Ecology. 2006;87(12):3186–99. doi: 10.1890/0012-9658(2006)87[3186:shinan]2.0.co;2 [DOI] [PubMed] [Google Scholar]
  • 28.Hauke J, Kossowski T. Comparison of values of Pearson’s and Spearman’s correlation coefficients on the same sets of data. Quaestiones Geographicae. 2011;30:87–93. [Google Scholar]
  • 29.Booth TH, Nix H, Busby JR, Hutchinson MFJD. Bioclim: the first species distribution modelling package, its early applications and relevance to most current MaxEnt studies. Distributions 2014;20. [Google Scholar]
  • 30.Phillips SJ, Anderson RP, Schapire RE. Maximum entropy modeling of species geographic distributions. Ecol Model. 2006;190(3–4):231–59. doi: 10.1016/j.ecolmodel.2005.03.026 [DOI] [Google Scholar]
  • 31.Carpenter G, Gillison AN, Winter J. DOMAIN: a flexible modelling procedure for mapping potential distributions of plants and animals. Biodivers Conserv. 1993;2(6):667–80. doi: 10.1007/bf00051966 [DOI] [Google Scholar]
  • 32.Townsend Peterson A, Papeş M, Eaton M. Transferability and model evaluation in ecological niche modeling: a comparison of GARP and Maxent. Ecography. 2007;30(4):550–60. doi: 10.1111/j.0906-7590.2007.05102.x [DOI] [Google Scholar]
  • 33.Ge Z, Liu Y. Analytic hierarchy process based fuzzy decision fusion system for model prioritization and process monitoring application. IEEE Trans Ind Inf. 2019;15(1):357–65. doi: 10.1109/tii.2018.2836153 [DOI] [Google Scholar]
  • 34.Feldmann H, Sprecher A, Geisbert T. Ebola. N Engl J Med. 2020;382(19):1832–42. [DOI] [PubMed] [Google Scholar]
  • 35.Olivero J, Fa JE, Real R, Farfán MÁ, Márquez AL, Vargas JM, et al. Mammalian biogeography and the Ebola virus in Africa. Mamm Rev. 2016;47(1):24–37. doi: 10.1111/mam.12074 [DOI] [Google Scholar]
  • 36.Schmidt JP, Park AW, Kramer AM, Han BA, Alexander LW, Drake JM. Spatiotemporal fluctuations and triggers of Ebola virus spillover. Emerg Infect Dis. 2017;23(3):415–22. doi: 10.3201/eid2303.160101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Matua GA, Van der Wal DM, Locsin RC. Ebola hemorrhagic fever outbreaks: strategies for effective epidemic management, containment and control. Braz J Infect Dis. 2015;19(3):308–13. doi: 10.1016/j.bjid.2015.02.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Longley JL, Danquah LO, Massa MS, Ross DA, Weiss HA. The impact of case and contact characteristics on contact tracing during the West Africa Ebola epidemic. J Infect. 2021;83(4):501–4. doi: 10.1016/j.jinf.2021.07.002 [DOI] [PubMed] [Google Scholar]
  • 39.Ali H, Dumbuya B, Hynie M, Idahosa P, Keil R, Perkins PJCC. The social and political dimensions of the Ebola response: global inequality, climate change, and infectious disease. Climate change and health: improving resilience and reducing risks. 2016. p.151–69. [Google Scholar]
  • 40.Grillet G, Marjanovic N, Diverrez J-M, Tattevin P, Tadié J-M, L’Her E. Intensive care medical procedures are more complicated, more stressful, and less comfortable with Ebola personal protective equipment: a simulation study. J Infect. 2015;71(6):703–6. doi: 10.1016/j.jinf.2015.09.003 [DOI] [PubMed] [Google Scholar]
  • 41.Olivero J, Fa JE, Real R, Márquez AL, Farfán MA, Vargas JM, et al. Recent loss of closed forests is associated with Ebola virus disease outbreaks. Sci Rep. 2017;7(1):14291. doi: 10.1038/s41598-017-14727-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Negl Trop Dis. doi: 10.1371/journal.pntd.0012843.r002

Decision Letter 0

Mabel Carabali

6 Aug 2024

Dear Mr Li,

Thank you very much for submitting your manuscript "Assessing the ecological resilience of Ebola virus in Africa and potential influencing factors based on a synthesized model" for consideration at PLOS Neglected Tropical Diseases. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Mabel Carabali, M.D., M.Sc., Ph.D.,

Section Editor

PLOS Neglected Tropical Diseases

Mabel Carabali

Section Editor

PLOS Neglected Tropical Diseases

***********************

Reviewer's Responses to Questions

Key Review Criteria Required for Acceptance?

As you describe the new analyses required for acceptance, please consider the following:

Methods

-Are the objectives of the study clearly articulated with a clear testable hypothesis stated?

-Is the study design appropriate to address the stated objectives?

-Is the population clearly described and appropriate for the hypothesis being tested?

-Is the sample size sufficient to ensure adequate power to address the hypothesis being tested?

-Were correct statistical analysis used to support conclusions?

-Are there concerns about ethical or regulatory requirements being met?

Reviewer #1: None

Reviewer #2: The objectives are clearly stated, however the methodology and data are not clearly explained.

Line 108: The final number of cases and deaths according to WHO was 3481 and 2291 respectively. Please update and provide a citation

Line 112: Please provide a citation to support this statement: I find it very surprisingly that there would be an established epidemiological link between the 2014 West African outbreak and the 2022 Uganda outbreak given that the former was the Zaire subtype whilst the latter was the Sudan subtype.

Line 120: Please provide a link and citation to the actual dataset used, not just the platform. What variables did you extract: case numbers, deaths, or death? What temporal resolution (if any) did you use?

Line 126: Again, please provide a link to the exact data source, not just the data platform

Line 136: This is not the correct reference

Line 137: Please link to the exact database and provide more details on the variable used, their meanings and the resolution of the data. Please do the same for the CCCS, NDVI and AVHRR data. Did you consider any variables related to bat habitats, given they are a reservoir for Ebola?

Section 2.2.3: Please provide citations and links for GPW, HII, DMSP and VIIRS data, as well as details of the variables extracted and their resolution. How was data cleaning done? What software did you use? What variables had to be cleaned? How did you establish the geospatial database?

Section 2.2.4: Please provide references for the methods and tools used? Did you use pre-existing software packages or write your own? What language did you use? What what the correlation threshold used to remove variables? How sensitive are the results to the threshold used? The 13 variables listed here that are likely to influence Ebola transmission really should be part of the results, not methods. But it would be helpful to understand the full list of variables and what they mean. For example, how are temperature seasonality and isothermality defined? How is the coldest quarter defined, given that Sub-Saharan Africa cannot be divided into four distinct seasons?

Section 2.3: Details on the model are way too sparse. I understand there might not be space to provide a huge amount of detail in the main text, but I think a brief description of the models, their differences and why one would want to combine them rather than perform a model selection is necessary. The details in the appendix are also much too sparse. A general outline of the models is provided, but is too generic. How do you use the models with your data specifically?

How is the model trained? What formula do you use to calculate the weights?

--------------------

Results

-Does the analysis presented match the analysis plan?

-Are the results clearly and completely presented?

-Are the figures (Tables, Images) of sufficient quality for clarity?

Reviewer #1: (No Response)

Reviewer #2: It is very hard to assess the results given that the figures provided do not generally correspond with the figures referred to in the text.

Section 3.1: The first paragraph on reclassification should really be part of the methods section, and more details on how this reclassification is done are needed. Figure 2 does not have subfigures labelled a, b and c.

Figures 3a, b and c are never referred to - maybe rethink whether they need to be included?

How were the weights chosen calculated?

Please label your figures carefully - I'm guessing figures s2 and s3 should actually be figures 5 and 6? Are the countries in red bars actually at risk of Ebola outbreaks? Have you taken into account bat habitats and whether bats live there?

Section 3.2: How was the impact factor analysis done? More detail on how the variable interact with risk is needed. The Jackknife results are in figure 4, not figure 3. Reference 20 is probably not correct.

Section 3.3: I think figure S4 should be figure S1 and figure 4 should be figure 7? Please describe the relationship between the variables and outbreak risk? Are they significant? It is interesting that risk is high for low and high LCCS but not intermediate values - do you have any insight into why? Also, please replace bio3, bio4, bio6, bio7, bio11, bio12, GPP, GPW, HII, LCCS, light and NDVI with more descriptive names. What do the range of these variables represent? What are their units?

--------------------

Conclusions

-Are the conclusions supported by the data presented?

-Are the limitations of analysis clearly described?

-Do the authors discuss how these data can be helpful to advance our understanding of the topic under study?

-Is public health relevance addressed?

Reviewer #1: (No Response)

Reviewer #2: The papers by Pigott and Redding should really be discussed in the introduction.

Patterns 1-4 should really be described in the results, not in the discussion. Their significance and relevance can be assessed in the discussion.

--------------------

Editorial and Data Presentation Modifications?

<br/>

Use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity. If the only modifications needed are minor and/or editorial, you may wish to recommend “Minor Revision” or “Accept”.

Reviewer #1: (No Response)

Reviewer #2: (No Response)

--------------------

Summary and General Comments

Use this section to provide overall comments, discuss strengths/weaknesses of the study, novelty, significance, general execution and scholarship. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. If requesting major revision, please articulate the new experiments that are needed.

Reviewer #1: Shen L et al. have assessed the ecological resilience of Ebola virus in Africa as well as its potential influencing factors based on developing a synthesized model. It is of much scientific significance to reveal the spatiotemporal distribution pattern of Ebola and how it responds to different environmental factors. I’m very glad to review the paper in great depth because the study is very interesting. The submission is worth of publication, and following are some minor comments:

1. It is advisable to simplify the description of EVD in the first paragraph (Line 59-63) to make it more succinct.

2. The literature review of different models in the application of EVD research needs to be shortened and succinct (Line 70-86).

3. Some of the figures need to be improved by using high resolution. Especially when it comes to zooming in those figures, a part of information looks really blur.

4. References need to be given to the four ecological niche models in Line 180.

5. Since the full names of different variables have been mentioned in Line 169-174, there is no need to repeat the full name for some variables in Line 279-281.

6. It is suitable to add a Data availability statement at the end of the research part to provide support for researchers who have an interest on this study.

7. One or more pertinent/related scholarly references could be cited in the current reference list.

8. Please double check the grammar mistake and carefully revise the English writing.

Reviewer #2: This paper presents a model combining 4 different ecological models in order to assess and predict the Ebola virus' ecological niche in Sub-Saharan Africa. I personally find this topic very interesting. As the authors point out, a lot of mathematical models of Ebola focus on transmission dynamics predicated on a spill over event happening, however are unsuitable to model the actual spill over event itself. Given land use, climate and population changes over the next few decades, it will be important to understand the spatiotemporal risk of Ebola epidemics in order to prepare and respond to them.

However, whilst I am interested in and consider this topic to be important, this paper is not currently ready for publication. In addition to the comments above, there are several issues that I would like the authors to address:

1. The introduction fails to motivate the study and demonstrate its novelty. The authors discuss the current state of Ebola modelling by solely discussing transmission modelling (and citing a very limited subset of methods and papers at that) - however, they do not really mention previous ecological models in this section. Based on this, the reader may think that this is the first time an ecological niche modelling (ENM) approach has been used in the context of Ebola outbreaks. In the discussion section, the authors do refer to two other ecological models which agree with their findings. However, these models really need to be discussed in the introduction section. What has been done before? What approaches have been used? What were these studies' limitations and how does your model specifically advance our understanding of the Ebola virus' ecological niche. In addition to Redding et al and Piggott et al, other papers to consider include: Judson et al (PLOS Pathogens 2016), Lee-Cruz et al (PLOS NTDs 2021), Buceta et al (PLOS One 2017), to name just a few.

2. More care needs to be taken with citations. There are some statements made in the paper that really need to be backed up with citations. Other papers are miscited. For example, a paper by Merler at el is cited to show that vegetation and climate both impact Ebola transmission; however this paper is an agent-based model on Ebola transmission that assesses the impact of non-pharmaceutical interventions and makes no mention of climate. Similarly, Guo et al is cited to suggest that specific variables are important for Ebola transmission, but is really about the spatial distribution of a plant species.

3. Much more care needs to be taken with figures. The submitted article contained two lists of figure legends: one for figures 1-4 and S1-S3, and one for figures 1-7 and S1. However, none of these seem to match up with the figures accompanying the paper (labelled 1 to 7 and S1). For example, the first list of figure legends says that figure 3 shows the results of a Jackknife test, while the second set of figure legends says that figure 6 shows the Jackknife test. Having looked at the figures, it is actually figure 4 that shows the results of the Jackknife test. It is virtually impossible for the reader to understand whether the results support the conclusions when some figures are never referred to, some figures that are referred to are not shown, and figures are generally mislabelled.

4. More care also needs to be taken with the paper organisation: the methods section contains some results, the results section contains some methods, and results are also presented in the discussion.

I would be amenable to reading a revised version of this paper as I believe it is an important topic. However, it is not ready for publication in its current form.

--------------------

PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy .

Reviewer #1: No

Reviewer #2: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

PLoS Negl Trop Dis. doi: 10.1371/journal.pntd.0012843.r004

Decision Letter 1

Mabel Carabali

13 Nov 2024

PNTD-D-24-00256R1Assessing the ecological resilience of Ebola virus in Africa and potential influencing factors based on a synthesized modelPLOS Neglected Tropical Diseases Dear Dr. Li, Thank you for submitting your manuscript to PLOS Neglected Tropical Diseases. After careful consideration, we feel that it has merit but does not fully meet PLOS Neglected Tropical Diseases's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript within 60 days Dec 13 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosntds@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pntd/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:* A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers '. This file does not need to include responses to any formatting updates and technical items listed in the 'Journal Requirements' section below.* A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes '.* An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript '. If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. We look forward to receiving your revised manuscript. Kind regards, Mabel Carabali, M.D., M.Sc., Ph.D.,Section EditorPLOS Neglected Tropical Diseases Mabel CarabaliSection EditorPLOS Neglected Tropical Diseases

Shaden Kamhawi

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

orcid.org/0000-0003-4304-636XX

Paul Brindley

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

orcid.org/0000-0003-1765-0002

 Additional Editor Comments (if provided): Thank you for submitting your manuscript. Although it has been read with great interest, there are several aspects related to the methodology that are deserving special attention. For instance, aspects of the data set and the splitting data for training and validation are not clear, as it is the case with mismatching results and analysis.  Journal Requirements:   [Note: HTML markup is below. Please do not edit.] Reviewers' Comments: Reviewer's Responses to Questions

Key Review Criteria Required for Acceptance?

As you describe the new analyses required for acceptance, please consider the following:

Methods

-Are the objectives of the study clearly articulated with a clear testable hypothesis stated?

-Is the study design appropriate to address the stated objectives?

-Is the population clearly described and appropriate for the hypothesis being tested?

-Is the sample size sufficient to ensure adequate power to address the hypothesis being tested?

-Were correct statistical analysis used to support conclusions?

-Are there concerns about ethical or regulatory requirements being met?

Reviewer #1: Yes

Reviewer #2: The objectives of the study are clearly explained. I'd like to thank the authors for updating the methods sections to provide further details, but I would still like a bit of clarification on a couple of points:

Line 120: the data set referenced here doesn't list Ebola cases, rather it lists the health facilities in DRC. Please can you check this and provide the correct data reference.

Line 124: I don't think the description of HDX really adds anything here as you don't use data from over 250 countries and the resolution varies by data set.

Line 160: I'm unclear about the purpose and method behind generating extra points via random sampling.

Line 209: You mention training the model. This would imply a training data set and a testing data set. Please can you provide further details of this?

Figure 1: I think more thought needs to be put into this figure and what you want it to show. The time scale is kind of confusing, as well as the way that total case numbers are assigned to the month of outbreak in a country. It also gives the impression that Guinea, Liberia and Sierra Leone are separate outbreaks whereas they are all part of the same outbreak. Two separate outbreaks in DRC have been conflated (one was declared in May 2018, one in August 2018). I'm not sure that CFR (should be ratio rather than rate!) is helpful or relevant to the study. The CFR for the single case in Senegal (which is again, really part of the West Africa outbreak) should either be 0 or 1, not 0.4!

General: Thank you for providing more information on the data and the models (both in the text and Appendix B). However, for anyone hoping to repeat your analysis, I think it is still hard to understand what the models need in terms of data and what the model parameters are. I understand that the models that previously been published in Ecology journals (thank you for adding references), but they will likely be new to readers of PLOS NTDs. Given the data described so far, it is unclear to me how the input data requirements for the Maxent model (with influential factors x and species occurrence y) corresponds to the points A and B in p-dimensional space for the bioclim model? Do the models require the same or different data processing steps. How many parameters does each model have?

**********

Results

-Does the analysis presented match the analysis plan?

-Are the results clearly and completely presented?

-Are the figures (Tables, Images) of sufficient quality for clarity?

Reviewer #1: Yes

Reviewer #2: The results generally match the analysis plan. However, given that in the introduction, one of the main motivations for combining four distinct models was that relying on a single specific model may cause a decrease in accuracy due to inherent limitations, I would have expected to see some kind of analysis showing the benefit (or lack thereof) or synthesizing models as oppose to relying on one.

Line 302: What are the 100 experiments and how are they conducted? Did I miss something in the methods section?

Line 327: How where the 1000 points generated?

I also think the image quality needs to be improved (see Editorial and Data presentation modifications?)

**********

Conclusions

-Are the conclusions supported by the data presented?

-Are the limitations of analysis clearly described?

-Do the authors discuss how these data can be helpful to advance our understanding of the topic under study?

-Is public health relevance addressed?

Reviewer #1: (No Response)

Reviewer #2: In the abstract and conclusions, the authors mention that their findings suggest new way to respond to small outbreaks, but nothing about this and public health is really discussed.

The limitations focus on the limitations of data availability and don't really think about limitations of the models, what they can or can't do and what errors or biases different stages of the data processing might introduce.

I also think the study is conflating two 'ecological' factors: one is where the zoonotic spillovers might occur, and one is where the disease might spread when the spillover has already occurred. I think this issue needs a more nuanced response.

**********

Editorial and Data Presentation Modifications?

Use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity. If the only modifications needed are minor and/or editorial, you may wish to recommend “Minor Revision” or “Accept”.

Reviewer #1: Accept

Reviewer #2: Many thanks to the authors for updating the figure numbers. However, the figure resolution is pretty low and it is hard to make out the names of the countries in figures 1, 4, and 5, and the scales in figures 7 and 8. These scales should also have units and y axis labels (at least for the leftmost figures.)

I'm grateful to the authors that they have included a key for the variable names in table 1. However, if would also be helpful to have the proper variable names in the figures and table 2 so that the reader doesn't keep have to referring back to table 1 in order to understand the results.

For table 1, a couple of the data sources are missing, and I would like some clarity of the meaning and relevance of the means, maximums, minimums, etc. The area chosen for the study is Africa, but these values can't belong to Africa. Please can you check and amend these?

**********

Summary and General Comments

Use this section to provide overall comments, discuss strengths/weaknesses of the study, novelty, significance, general execution and scholarship. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. If requesting major revision, please articulate the new experiments that are needed.

Reviewer #1: None

Reviewer #2: I think there is still some confusion over what the aim of this paper is and what it does that is not already in the literature. The authors claim that other studies do not take into account spatiotemporal trends in EBOV adaptability, whereas I believe some of them do. I think there is a bit of confusion over modelling zoonotic spillover, versus modelling human to human transmission that needs to be more carefully though about. The authors claim that synthesizing 4 ecological models provide a better understanding of Ebola's niche, but don't really demonstrate it. I suggest that they think a little bit more about where they want to position this paper within the literature and focus on achieving those aims first and foremost.

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy .

Reviewer #1: No

Reviewer #2: No

 [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] Figure resubmission: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. If there are other versions of figure files still present in your submission file inventory at resubmission, please replace them with the PACE-processed versions. Reproducibility: To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

PLoS Negl Trop Dis. doi: 10.1371/journal.pntd.0012843.r006

Decision Letter 2

Mabel Carabali

15 Jan 2025

Dear Mr Li,

We are pleased to inform you that your manuscript 'Assessing the ecological resilience of Ebola virus in Africa and potential influencing factors based on a synthesized model' has been provisionally accepted for publication in PLOS Neglected Tropical Diseases.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Neglected Tropical Diseases.

Best regards,

Mabel Carabali, M.D., M.Sc., Ph.D.,

Section Editor

PLOS Neglected Tropical Diseases

Shaden Kamhawi

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

orcid.org/0000-0003-4304-636XX

Paul Brindley

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

orcid.org/0000-0003-1765-0002

***********************************************************

PLoS Negl Trop Dis. doi: 10.1371/journal.pntd.0012843.r007

Acceptance letter

Mabel Carabali

Dear Mr Li,

We are delighted to inform you that your manuscript, "Assessing the ecological resilience of Ebola virus in Africa and potential influencing factors based on a synthesized model," has been formally accepted for publication in PLOS Neglected Tropical Diseases.

We have now passed your article onto the PLOS Production Department who will complete the rest of the publication process. All authors will receive a confirmation email upon publication.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any scientific or type-setting errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Note: Proofs for Front Matter articles (Editorial, Viewpoint, Symposium, Review, etc...) are generated on a different schedule and may not be made available as quickly.

Soon after your final files are uploaded, the early version of your manuscript will be published online unless you opted out of this process. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Neglected Tropical Diseases.

Best regards,

Shaden Kamhawi

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

Paul Brindley

co-Editor-in-Chief

PLOS Neglected Tropical Diseases

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Appendix. Supplemental table and figures.

    Fig A. Multiple data sources, including Ebola epidemic case data, surveyed attribute data, remote sensing data, and spatial vector boundaries, and establishment of a geodatabase. The base layers of the map were obtained from the openly available source via the Natural Earth (https://www.naturalearthdata.com/downloads/50m-cultural-vectors/). Fig B. Heat map of spearman coefficients for each variable illustrating the correlations among variables. Table A. The advantages and disadvantages of Maxent, Bioclim, Domain, and GARP models.

    (DOCX)

    pntd.0012843.s001.docx (5.1MB, docx)
    S2 Appendix. Characteristics and introduction of the models used in this study.

    Table A. The format requirements of the input data for models used in this study.

    (DOCX)

    pntd.0012843.s002.docx (20.9KB, docx)
    Attachment

    Submitted filename: Response to Reviewers - 03.docx

    pntd.0012843.s004.docx (732KB, docx)
    Attachment

    Submitted filename: Response to Reviewers.docx

    pntd.0012843.s005.docx (1.3MB, docx)

    Data Availability Statement

    The authors confirm that all data underlying the findings are fully available without restriction. All relevant data are within the paper and its Supporting Information files. Publicly available data (CC BY 4.0) has been uploaded to the Researchgate (https://www.researchgate.net/profile/Jiawei-Song-14/publication/386178115_Assessing_the_ecological_resilience_of_Ebola_virus_in_Africa_and_potential_influencing_factors_based_on_a_synthesized_model/data/6747e1be790d154bf9afacef/DataforManuscriptPLOSntd.zip?_tp=eyJjb250ZXh0Ijp7ImZpcnN0UGFnZSI6InB1YmxpY2F0aW9uIiwicGFnZSI6InB1YmxpY2F0aW9uIiwicG9zaXRpb24iOiJwYWdlQ29udGVudCJ9fQ).


    Articles from PLOS Neglected Tropical Diseases are provided here courtesy of PLOS

    RESOURCES