Skip to main content
PLOS One logoLink to PLOS One
. 2025 Jan 30;20(1):e0317577. doi: 10.1371/journal.pone.0317577

Habitat suitability mapping and landscape connectivity analysis to predict African swine fever spread in wild boar populations: A focus on Northern Italy

Giulia Faustini 1,*, Marie Soret 2,3,4, Alexandre Defossez 2,3, Jaime Bosch 5,6, Annamaria Conte 7, Annelise Tran 3,8
Editor: Francesco Bisi9
PMCID: PMC11781678  PMID: 39883672

Abstract

African swine fever (ASF) is a highly contagious disease affecting wild and domestic pigs, characterised by severe haemorrhagic symptoms and high mortality rates. Originally confined to Sub-Saharan Africa, ASF virus genotype II has spread to Europe since 2014, mainly affecting Eastern Europe, and progressing through wild boar migrations and human action. In January 2022, the first case of ASF, due to genotype II, was reported in North-western Italy, in a wild boar carcass. Thereafter, numerous positive wild boars were identified, indicating an expanding wild epidemic, severely threatening Italian pig farming and trade. This study focused on the mapping of the suitable habitats for wild boars and their potential dispersal corridors in Northern Italy, using species distribution models and landscape connectivity analysis. The resulting maps identified areas with higher likelihood of wild boar presence, highlighting their preferential pathways crossing Northern Italy. The distribution of ASF positive wild boars along the major corridors predicted by the model suggests the obtained maps as valuable support to decision-makers to improve ASF surveillance and carcass early detection, aiming for eradication. The applied framework can be easily replicated in other regions and countries.

Introduction

African swine fever (ASF) is one of the most devastating diseases of swine, caused by a large, enveloped, double-stranded DNA virus (generally referred as ASFV), which is the only member of the Asfarviridae family, genus Asfivirus [1]. Over the years, ASF has evolved from a localised disease restricted in Sub-Saharan Africa [2], to a major threat to global pig populations [3]. Because of its significant health and socioeconomic impact [4], and the importance of early intervention to control new outbreaks, ASF is listed among the notifiable diseases (i.e., disease of public health importance, mandatory to be reported to related authorities) by the World Organization for Animal Health (WOAH).

ASF clinical manifestations may vary from a severe haemorrhagic disease, with hyperacute clinical signs and nearly 100% mortality, to asymptomatic and chronic forms, which turn animals into silent virus carriers contributing to the persistence and dissemination of the disease [5]. The lack of clinical signs is more frequent in African wild suids such as warthogs (genus Phacochoerus), bush pigs (Potamochoerus porcus and P. larvatus) and giant forest hogs (Hylochoerus meinertzhageni) [6], while domestic pigs and wild boars more frequently develop the haemorrhagic form. Although seroconversion is observed in the few animals that survive the disease [79], the neutralising activity of the antibody response is highly uncertain, making future reinfections possible [1012]. ASFV is a highly contagious and stable pathogen, able to persist from weeks to months in infected materials (e.g., blood, excretions, tissues), and even more than a year in contaminated animal products (e.g., frozen meat, cured ham) [5]. Transmission and spread of the virus occur mainly through direct contact with infected animals or carcasses, or indirectly through contaminated food, water, and semen, as well as people (e.g., hunters, workers in pig sectors, veterinarians, etc.) or fomites (i.e., any inanimate object/material which is contaminated, carrying and spreading the pathogen) acting as mechanical vectors. The transmission cycle of ASFV can also be sustained by the presence of soft ticks of the genus Ornithodoros, in which the virus can persist for more than 5 years [13]. Infection through Ornithodoros moubata plays an important role in the sylvatic cycle in Africa [14]. In contrast, in the current European epidemiological situation, despite the presence of some Ornithodoros species (e.g., O. erraticus, O. maritimus), their involvement appears marginal [5].

ASFV exhibits high genetic and antigenic variability, leading to the emergence of 24 known genotypes, five of which adapted to members of the Suidae family, without zoonotic potential [1, 15]. Only ASFV genotypes I and II are currently present in European countries, with two distinct epidemiological scenarios. Genotype I, introduced during a first wave in several European countries in the second half of the 1900s [16], is now circulating only in Sardinia, an Italian Island in the Mediterranean Sea. This region is characterised by a peculiar situation of endemicity, maintained by repeated interchanges between free-ranging domestic pigs and wild boars [17]. Genotype II is currently affecting continental Europe since its introduction in Georgia in 2007 via contaminated waste from ships [18, 19]. The virus began its spread to Eastern Europe in 2014, with Lithuania recording the first cases. From there, ASFV genotype II quickly expanded to neighbouring countries, particularly Poland, Latvia, and Estonia, often emerging near their borders with Belarus and the Russian Federation. By 2019, the ASFV genotype II epidemic had spread like wildfire, affecting most of the Baltic states (Lithuania, Latvia, and Estonia). The virus continued its westward expansion, reaching Germany through Poland by late 2019 [1820]. Notably, in the Baltic states and Poland, the virus has shown remarkable persistence within wild boar populations, with self-sustaining cycles over several years, while causing minimal outbreaks among domestic pigs [1921]. On a separate front, ASFV genotype II has also reached Romania, following cases recorded in wild boars in neighbouring countries (i.e., Moldova and Ukraine) [22].

While wild boar movements and territorial wild boar density played a key role in local spread [19, 21, 23], long-distance jumps often resulted from human activities. For instance, ASFV genotype II occurred into European countries such as Czech Republic (2017) [24], and Belgium (2018) [25], appearing as isolated cases, likely due to improper disposal of contaminated food by humans [20]. Less clear is the origin of first outbreaks reported in Hungary, Bulgaria, Slovakia and Serbia in both domestic pigs and wild boars [19].

African swine fever (ASF) spread in wild boar populations of Eastern Europe has been governed by host factors (population structure), viral characteristics (strain virulence), and environmental conditions (geographical barriers) [26, 27], with many dynamics still poorly understood [28]. Network analysis in affected European countries showed infection velocities of 2.9–11.7 km/year, excluding human-mediated transmission, with seasonal acceleration during summer [19]. This slow but persistent spread has caused substantial wild boar population declines in Eastern Europe [29, 30] while disrupting regional livestock economics through reduced pigmeat production and exports [31].

The continued geographic expansion of ASFV genotype II reached Italy in 2022, following an unexpected introduction likely human-mediated [32]. While ASFV genotype II incursion was expected from the eastern side, in January 2022, the National Reference Center for the Study of Pestivirus and Asfivirus Diseases (CEREP) at the Istituto Zooprofilattico Sperimentale dell’Umbria e delle Marche (IZSUM) confirmed the first case of ASFV genotype II in Italy, in the Northwest, in the municipality of Ovada, Piedmont [3234]. Following this first finding, several carcasses positive to ASFV genotype II were detected in Liguria. Over a year later, ASFV genotype II-positive wild boars were found in other regions such as Lombardy, and Emilia-Romagna (Fig 1). So far, in Northern Italy, only nine genotype II outbreaks have been reported in domestic pigs, all in Lombardy region (Italian national epidemiological bulletin, https://storymaps.arcgis.com/stories/9fe6aa3980ca438cb9c7e8d656358f35, accessed on 10th July 2024).

Fig 1. Map of Northern Italy, showing regions affected by African swine fever virus (ASFV) genotype II on 18th April 2024.

Fig 1

ASFV genotype II outbreak point locations in wild boars are also displayed (World Animal Health Information System (WAHIS)). Base map made with Natural Earth. North Italian regions based on the 2024 regional administrative boundaries from the Italian National Institute of Statistics (https://www.istat.it/).

Despite the detection of ASFV genotype II in isolated outbreaks in regions of Central and Southern Italy, and in Sardinia [33, 35], the epidemic front currently expanding is the one affecting wild boars in Northern Italy, severely threatening the socio-economic system related to Italian pig husbandry. Northern regions are indeed where most Italian intensive pig farming is concentrated (https://www.istat.it/), primarily aimed at the production of Italian cured ham (e.g., Parma ham). Any further involvement of domestic swine facilities in the ASFV genotype II epidemic due to negligence in biosecurity measures and transmission from the external environment would escalate to deaths or culling of large numbers of animals, as well as bans on the pork trade.

The persistence and spread of ASFV genotype II in Eastern Europe, strongly driven by the wild boar population, along with its eradication through effective hunting campaigns and early detection of carcasses, highlight the crucial need to better understand the distribution and mobility of the wild boar population to halt or at least mitigate the dispersal of the disease [19, 24, 25]. ASFV genotype II spread may vary greatly among the different habitats found in Northern Italy (Alps, pre-Alps, plains) [34, 36]. Wild boar suitable habitats and distribution have been largely estimated at European scale, through the application of different models, cumulatively named as species distribution models (SDM) [37]. SDM are based on the use of species occurrence/sighting data, which are related to explanatory variables describing the environments, to infer the related likelihood of presence [38]. In literature, presence/absence data for wild boar differs considerably, although often in relation with a common/similar set of environmental variables describing topography, climate, human disturbance and land cover [3946]. Briefly, SDM can be applied with presence-only data (e.g. BIOCLIM model) [41], presence-background data (e.g. MaxEnt model) [43, 47], presence-absence data (e.g. Random Forest model) [41], or following a Bayesian framework (e.g. INLA model) [44]. Since the best modelling technique remains a topic of debate in the scientific community [4850], a common and accepted approach is to choose the model by comparing performance metrics of multiple models [5153]. This approach has been suggested also by ENET wild consortium (i.e. an international network of wildlife professionals supported by the European Food Safety Agency) for wild boar abundance predictions to mitigate model-specific biases and potential artifacts which may otherwise impact estimations [37].

At European level, a helpful overview on high-risk routes for wild boar migration corridors and ASF spreading is currently available [54]. However, to effectively act at the local level, public authorities would benefit from detailed maps of hotspots for wild boar presence and movements. These maps would allow to target hunting and ASF surveillance activities, such as carcass detection. To fill this knowledge gap, the present study aimed to map suitable habitats for wild boars and their potential corridors of dispersal at high resolution in Northern Italy.

Material and methods

Data

Study area

The study area included the geographical region of Northern Italy, comprising the following administrative regions: Aosta-Valley, Piedmont, Lombardy, Trentino-Sud Tyrol, Veneto, Friuli-Venezia Giulia, Liguria and Emilia-Romagna (Fig 1). Geographically, Northern Italy features a diverse landscape. The region is bordered by vast mountainous areas and diverse orography: the Alps to the north and west, and the Northern Apennines to the south. In between lies the Po Valley (Fig 1), the main Italian plain, traversed by the Po River, the longest river in the country. Northern Italy is a densely populated area, with a total surface area of 120,325.67 km2, which concentrates 46.6% (27,490,042 inhabitants out of a total of 58,989,749 inhabitants) of the Italian population (June 5th, 2024, https://www.istat.it/). From a production perspective, the Po Valley is highly industrialised, characterised by intensive agriculture and livestock farming, as well as numerous urban centres of significant industrial and tourist importance.

Wild boar presence data

Wild boar presence data largely included occurrences available on https://www.gbif.org, the Global Biodiversity Information Facility (accessed on 15th February 2024, GBIF Occurrence Download https://doi.org/10.15468/dl.cgjtsj). GBIF is a free, open-access database providing data about different taxa, collected by many institutions and associations during surveys and/or citizen science programs. GBIF records located in the study area were selected and processed following the pipeline described by GBIF portal (https://docs.gbif.org/course-data-use/en/data-processing-pipeline.html). In addition, available field records from Veneto and Lombardy, obtained from official routine culling campaigns, aimed at controlling wild boar population expansion, were included in the study [55].

Only records with a known coordinate uncertainty of less than 100 m were considered. The available occurrences were also filtered for the time period 2014–2024, because of the limited data available in previous years. To meet the assumption of independence of records, reduce sampling bias and point clustering, preventing pseudoreplication (samples taken from the same experimental unit are treated as independent replicates) and spatial autocorrelation issues (neighbouring coordinates tend to have similar values for environmental variables) [5659], the density of occurrences was reduced to a minimum distance of 100 m. Duplicated records were also removed. Data cleaning and processing were performed using R statistical software [60].

Environmental variables

Based on models applied in previous studies, and accounting for wild boar ecology, a variety of environmental predictors were selected to depict different aspects of the habitat, such as topography (altitude, slope, topological aspect, topological diversity, topological position), climate (precipitation, temperature), human disturbance (population density, distance from highways, distance from urban areas, road density), land cover and vegetation (Normalized Difference Vegetation Index (NDVI), bare coverage, herbaceous coverage, tree coverage (all intended as percentage 0–100 per cell), distance from crop areas, distance from forest, distance from lakes, distance from rivers, distance from Natura 2000 parks) [37, 61]. Variable description, sources, and processing details are reported in S2 Table. Briefly, all variables were obtained from public databases, reprojected on the same coordinate reference system (RDN2008 / Italy zone (N-E)), and standardised as rasters with a pixel size of 100 m x 100 m (bilinear resampling method), over an extent equal to the study area. Categorical variables (land cover classes) and vector layers (showing lakes, rivers, parks, highways, and roads) were converted to continuous variables, considering the distance from the described feature, or the feature density. Variables with a temporal dimension (NDVI, temperature, precipitation), were processed to account for seasonal variations in habitat use and resource availability for wild boars. Specifically, a single spatial layer was obtained by calculating over the years (period 2014–2023, depending on available data) the overall mean (overall scenario), and the seasonal mean (seasonal scenarios). This approach was chosen to provide a comprehensive view of environmental conditions over time, aligning with the time frame of occurrence data, while simultaneously smoothing out short-term fluctuations and variability [41, 42, 62].

Based on the meteorological convention for the Northern hemisphere, the considered months for seasonal means were December, January, February for winter; March, April, May for spring; June, July, August for summer; September, October, November for autumn [63]. Spatial layer processing was performed on QGIS Geographic Information System [64] and Python [65].

To reduce collinearity between environmental predictors [66], the selected variables were tested for autocorrelation, via a correlation tree (or cluster dendrogram) obtained with the stats library in R [60]. In the obtained correlation tree, a minimum cut-off of 0.5 was considered for variable selection. Based on ecological and spatial criteria, the original resolution, and interpretive ease, only one variable was selected in each cluster of variables with node < 0.5. The selected variables were further tested for multicollinearity with HH library [67], calculating the variance inflation factor (VIF) [68], and selecting variables with a maximum value of 5 [43]. Collinearity and multicollinearity assessment was conducted for the full set of variables for the overall analysis, while for seasonal analysis the variables with a temporal dimension covered only the relative months.

The variable included in the study for all scenarios were 13 (S1 Table). Temperature, NDVI, herbaceous coverage, distance from urban areas, and topological diversity were excluded in all scenarios. Precipitation variable was excluded in the overall and summer scenarios; slope variable was excluded only in the summer scenario; tree coverage was excluded in the winter, spring and autumn scenarios, while distance from crop areas was excluded in all scenarios except for summer (S1 Table).

Modelling workflow

To capture broad, long-term trends as well as seasonal variations in wild boar habitat use and resource availability, both overall and seasonal suitability were modelled. The modelling workflow was structured in two main phases (Fig 2). Firstly, the wild boar occurrence records, and the set of environmental predictors for wild boar distribution, were used as input to model the potential distribution of wild boars in the study area (habitat suitability) for the overall and seasonal scenarios using SDM (Fig 2A) [38, 69]. Then, the obtained overall suitability map was used to perform the landscape connectivity analysis of the study area (Fig 2B).

Fig 2. Habitat suitability mapping and landscape connectivity analysis workflows.

Fig 2

(A) Modelling workflow to predict suitable habitats for wild boar presence from occurrence records and environmental predictors, (B) and to estimate the landscape connectivity of the study area considering the most suitable areas. “WB” is used as abbreviation of “wild boar”. Environmental variables made with Natural Earth for illustrative purpose only. Base map of Italy in the suitability maps based on the 2024 regional administrative boundaries from the Italian National Institute of Statistics (https://www.istat.it/).

Species distribution modelling (SDM)

For the overall scenario, all the occurrences were included in the suitability modelling, using an overall average for variables with time dimension. For the seasonal scenarios, only the related monthly records were used, with mean seasonal layer for precipitation variable.

Since for wild boars, real absence data are hardly available, SDM techniques rely on a sample of points from the study area, called “pseudoabsences” (PsA), which are used to sample the background, providing information about the environmental factors potentially driving the distribution of presence records [70]. Occurrences available on naturalistic databases (such as GBIF) are often spatially biased toward areas easily accessible by humans [70]. If PsA are randomly generated, the final model will show that the distribution of the species correlates higher with human facilities (e.g., roads, towns), rather than with real driving factors [70]. To reduce the sample selection bias and target the modelling on species distribution, rather than on survey effort, PsA were selected using the same underlying bias as occurrence data [71, 72]. With this approach, the model will capture any differentiation between the distribution of presence records and that of PsA, better highlighting preferential habitats, rather than area accessibility. A kernel density surface with higher probabilities in areas with more presence data was used as sampling bias surface, to generate a number of PsA equal to the number of occurrences data [73, 74].

The habitat suitability modelling was performed using the algorithms available in the package Biomod2 in R [75], with default options: Artificial Neural Network (ANN), Classification Tree Analysis (CTA), Flexible Discriminant Analysis (FDA), Generalized Additive Model (GAM, or BAM), Gradient Boosted Machine (GBM), Generalized Linear Model (GLM), Multiple Adaptive Regression Splines (MARS), Maximum Entropy (MAXNET), Random Forest (RF), Surface Range Envelop / BIOCLIM (SRE), eXtreme Gradient Boosting Training (XGBOOST) (Fig 2A). The protocol consisted of 50 runs per algorithm, with the same weight for presence and PsA data, using 70% of the datasets for training and 30% for testing [74], splitting randomly the data between the train and test datasets. The best algorithm was chosen considering True Skills Statistics (TSS) and AUC (Area Under ROC (Relative Operating Characteristic) Curve) as evaluation metrics, and excluding the algorithms displaying overfitting. For the selected algorithm, an ensemble model was computed through the median of all runs with AUC greater than 0.8 [76], and further evaluated with AUC, TSS and Cohen’s Kappa (Kappa) [7779]. A final continuous prediction raster of habitat suitability (suitability map ranging from 0 to 1) was obtained forecasting the ensemble model.

The presence of a significant difference between the suitability maps under the different scenarios was tested with Kruskal-Wallis test, while the pairwise multiple-comparison was performed through the Dunn test. The distribution of the overall suitability value of the presence data included in the model was compared through a Mann-Whitney test with the one of the unused records, removed during the occurrence’s density reduction.

Connectivity analysis

Since carcass search and removal is a process that should ideally be conducted routinely in the restriction zones (i.e., infected areas or surrounding areas under surveillance measures), especially in the epidemic phase [80], connectivity was estimated only for the overall scenario. The landscape connectivity of the study area was modelled on the overall suitability map using Circuitscape, an open-source program implemented in Julia [81]. Circuitscape is based on circuit theory, and can operate on networks of nodes or raster grids. With raster data, Circuitscape converts raster grids into electrical networks, in which each cell becomes a node that is connected to the neighbouring cells by resistors. According to settings, from a current source, Circuitscape estimates the flow between focal nodes, simulating animal movements. Input files in this study included a focal nodes location file, a resistance map, and a raster mask file. All input files were obtained using QGIS and Python.

Due to high computational and storage demands of Circuitscape, focal nodes were obtained using the following workflow (Fig 2B). The continuous overall suitability prediction was converted into a binary map considering as threshold the 10th percentile of the suitability score distribution of presence data, assuming that areas with higher suitability score better reflect wild boars’ habitat. This criterion has been largely adopted in the ecological field, showing good performances in obtaining suitable patches [74, 8284]. The regions with suitability scores greater than the threshold (suitable areas) illustrated in the binary map, were converted into vector patches, excluding those smaller than 4.4 km2, the mean population home range size for wild boars [54, 85, 86]. Ten sets of random points were generated within the obtained patches, taking into account wild boar animal density within patches [42], and a minimum distance of 12 km, the mean population maximum dispersal for wild boars (Fig 2B) [86]. Once converted to raster format, the obtained random points served as focal nodes.

The resistance map conceptually represents the opposite of habitat suitability: low suitability/permeability areas correspond to higher resistance, and vice versa. The resistance map was computed applying a negative exponential function, such that resistance = e[(ln(0.001)÷thresholdHS]×103, which enhances the barrier effect of less-suitable sites, returning smoother effects [8790].

To exclude cells outside the considered Italian regions, and big lakes from the analysis, a raster mask file was created, setting the corresponding pixels as NODATA values. Thus, NODATA values were excluded from the resistance map, and treated like barriers [81].

The connectivity analysis was conducted for all 10 sets of random points (focal nodes) using the “one-to-all” mode, which iterates across all random points, such that in each new iteration a different random point acts as the current source. Julia 1.10.2 version was used to run Circuitscape program [81].

To test if the 10 connectivity maps estimated were comparable, the presence of a significant difference in terms of connectivity values between maps was evaluated by Kruskal-Wallis test (Fig 2B). Since this test compares the median values of the maps (i.e., non-parametric test), a final connectivity map was obtained calculating the median of the 10 connectivity maps, reducing also the influence of potential outliers.

Model evaluation

The geographical coordinates of Italian ASF outbreaks were downloaded from the World Animal Health Information System (WAHIS), the reference database of the World Organisation for Animal Health (WOAH) (accessed on 18th April 2024). Outbreaks related to domestic pigs were excluded, and only notifications corresponding to ASF genotype II positive wild boars within the study area were included in the validation of the obtained connectivity map. From this point forward, all references to ASF outbreaks/cases/events/findings pertain specifically to ASFV genotype II.

To test if ASF events were associated with higher connectivity values, the distribution of the connectivity values at ASF event locations was compared with the one of random points in a minimum bounding geometry (convex hull) enclosing ASF outbreaks through Mann-Whitney test. A set of random points equal to the number of events were generated 100 times, without overlapping the location of the ASF cases, assuming that the entire territory in which ASF positive wild boars were found was scanned. Random points and ASF events connectivity values were extracted from the median connectivity map. The proportion of significant p-values was calculated.

For each of the 100 sets of random points, a dataset of ASF events and random points locations and relative connectivity values was created. The ROC curve was analysed for all the 100 datasets, using pROC package in R [91]. The average AUC, the lowest threshold corresponding to 0.9 of sensitivity, and the average best closest top left threshold (the point closest to the top-left part of the ROC curve, corresponding to the best compromise between sensitivity and specificity) were calculated. Binary risk maps of the median connectivity maps were obtained applying the calculated thresholds, and their true positive rate (outbreaks correctly classified as carcass location) was calculated. To increase the chance of wild boar detection, the search of wild boars in the infected areas and in those under surveillance are usually more focused in forest-covered areas [80, 92]. To compare the predictive power of the estimated risk maps with that of the land cover classes only, the proportion of correctly classified ASF positive wild boars was also calculated by land cover class (urban, crop, forest, water). The land cover classes were obtained by reclassifying the Corine Land Cover (CLC) dataset produced within the frame of the Copernicus Land Monitoring Service (S2 Table). All the statistical analysis was performed in R.

Results

Presence data and environmental variables

A total of 2496 wild boar occurrences were collected for the period 2000–2024: 2266 downloaded from GBIF data portal, while 230 from official routine culling campaigns. After the data cleaning process to reduce the spatial autocorrelation between occurrences, 1306 presence data were finally included in the overall model, covering the period from 2014 to 2024 (Fig 2A). For the seasonal scenarios, the occurrences included were 293 for winter, 312 for spring, 434 for summer, and 451 for autumn.

After standardisation, all variable layers had a dimension of 26,306,996 grid cells. After the screening for the presence of collinearity and multicollinearity described in material and methods section, out of 20 variables originally considered, 13 variables were selected for the overall and seasonal suitability models, with specific differences according to the scenario (S1 Table).

Suitability maps

Among the algorithms available in Biomod2 package, only RF, XGBOOST, and GBM showed acceptable performances for all the considered scenarios (overall and seasonal estimations). On average, the mean and the standard deviation (sd) of the evaluation metrics were AUC equal to 1.00 (sd = 0.00), TSS equal to 0.99 (sd = 0.00) for both RF and XGBOOST, and AUC equal to 0.95 (sd = 0.00), TSS equal to 0.77 (sd = 0.02) for GBM (details per scenario in S3 Table). The other algorithms (ANN, CTA, FDA, GAM, GLM, MARS, MAXNET, SRE) resulted in lower performances (average values: ROC < 0.89; TSS < 0.62).

For all considered scenarios, GBM model was finally preferred for ensemble modelling computation, because of its balanced combination of spatial generalization across the study area and predictive performances. Compared to other models, GBM provided continuous suitability estimates across the study area, in association with the strongest predictive power in terms of evaluation metrics, except for RF and XGBOOST, which showed perfect performance metrics (S3 Table) [9395]. However, RF and XGBOOST were excluded since the excellent results depicted by the metrics may be related more to an overfitting/overlearning of the occurrences used, than actually to perfect predictive performances (S3 Table) [53, 9699]. The GBM-based ensembled model was evaluated as acceptable for all the considered performance metrics, in all scenarios (Table 1). The mean values obtained were TSS = 0.73 (sd = 0.08), AUC = 0.94 (sd = 0.03), KAPPA = 0.73 (sd = 0.08). All the above evaluation metrics are rounded to the second decimal digit.

Table 1. Evaluation metrics of the ensemble Gradient Boosted Machine (GBM) model and suitability value statistics under the different scenarios.

Scenario Evaluation metric Suitability value
TSSa AUCb KAPPAc mean sd min max
Overall 0.60 0.89 0.60 0.38 0.19 0.12 0.91
Winter 0.81 0.97 0.81 0.33 0.21 0.07 0.93
Spring 0.76 0.95 0.75 0.40 0.19 0.10 0.89
Summer 0.75 0.95 0.74 0.35 0.20 0.08 0.92
Autumn 0.75 0.94 0.74 0.35 0.18 0.08 0.92

a True Skills Statistics

b Area Under ROC (Relative Operating Characteristic) Curve

c Cohen’s Kappa

The first variable for importance was the distance from the forest in all scenarios, while the second one was always altitude, except for the winter scenario which was more influenced by seasonal precipitation. In particular, according to ensemble model response curves (S1 Appendix), for all scenarios, the habitat suitability decreased sharply with altitude and distance to forest, but slightly with bare coverage, slope, distance from lakes, distance from parks. On the contrary, the suitability mildly increased with tree coverage, distance from river, road density. The correlation between the suitability and the variables “precipitation”, “distance from highways”, “topological aspect”, “topological position”, varied according to season and/or range of suitability. The suitability didn’t seem to be particularly influenced by the variability in the human population density and distance from crop. See further response curves details by scenario in S1 Appendix.

Visually, all the suitability maps displayed mountainous and hilly areas with good forest cover as highly suitable areas for the presence of wild boar (Fig 3B and Fig B in S1 Fig). With few exceptions, the lowest suitability was described in the area of the Po Valley, especially the lowlands, and in the northernmost Alps. Cells with suitability value greater than 0.6 were mainly located in the pre-Alps (i.e., groups of mountains and highlands from Lake Maggiore till the border with Slovenia, located on the inner side of the Alps and gradually lower towards the plain), the Northern Apennines, central Piedmont at the province of Asti, the Ticino park area, along the Po River in the Po Valley, and the regional parks of Colli Berici and of Colli Euganei in Veneto (Fig 3B, Fig B in S1 Fig and S2 Fig).

Fig 3. Distribution of wild boar records and estimated suitability map for the overall scenario.

Fig 3

(A) Distribution of presence data in the study area. Occurrence records included in the final model are displayed as red points, while those removed to meet the assumption of independence are shown as yellow points. (B) Map describing habitat suitability for the presence of wild boars in Northern Italy in an overall scenario. Seasonal suitability maps are available in S1 Fig. Original raster files can be downloaded from S2 File. Base map of Italy based on the 2024 regional administrative boundaries from the Italian National Institute of Statistics (https://www.istat.it/).

The mean suitability value of Northern Italy, under the overall scenario, is 0.38 (0.12–0.91). On average, the season with the highest suitability was spring (0.40, 0.10–0.89), followed by autumn (0.35, 0.08–0.92), summer (0.35, 0.08–0.92), and winter (0.33, 0.07–0.93). Here reported suitability values are rounded to the second decimal digit. A significant difference was observed in the suitability of the study area under the different scenarios. Specifically, all the five suitability maps resulted significantly different among each other at the post hoc analysis (p-value < 0.001).

Considering the overall scenario, no significant difference was detected between the suitability value distributions of the presence data included in the model and of the unused records, excluded during the density reduction (p-value = 0.36).

Connectivity map and ASF outbreaks spatial distribution

The threshold corresponding to the 10% percentile of the suitability score distribution for the presence data was 0.45. After excluding the suitable regions smaller than 4.4km2, the final patches covered 32.09% of the total study area. The percentages of occurrences included and excluded from the SDM, falling into the patches were respectively 86.52% and 86.93% [43].

Since the ten connectivity maps obtained from the ten different random points datasets were statistically comparable (p-value ≈ 1), a final connectivity map was calculated through the median formula (Fig 4A). In the median map, the mean connectivity value was 0.53x10-2 (0.00–74.83x10-2). The connectivity value describes the permeability of a grid cell to wild boar movement.

Fig 4. Connectivity map and binary risk maps according to different threshold.

Fig 4

(A) Map describing landscape connectivity in Northern Italy according to wild boar ecology, highlighting the main corridors of dispersal. The connectivity value describes the permeability of a grid cell to wild boar movement. The scale was resized considering the interval 2–98% of the connectivity value distribution, to highlight major corridors. A zoom on ASF affected area is shown. According to different thresholds of the connectivity value, different binary maps can be obtained: (B) binary risk map with a true positive rate of 90.02%, maximizing the sensitivity of the map in correctly classifying African swine fever (ASF) events due to ASF virus genotype II. A zoom on ASF wild boar cases is shown; (C) binary risk map with a true positive rate of 65.41%, respecting the best compromise between sensitivity and specificity in classifying ASF events due to ASF virus genotype II. A zoom on ASF wild boar cases is shown. Original raster files can be downloaded from S2 File. Base map of Italy in the suitability maps based on the 2024 regional administrative boundaries from the Italian National Institute of Statistics (https://www.istat.it/).

In total, 1758 wild boar ASFV genotype II notifications were downloaded from WAHIS. The mean distance between all the ASF events was 40.4 km, between consecutive ones was 20.8 km. Since three records fell outside the connectivity map, 1755 ASF events were finally included in the study. ASF event locations were associated on average with higher connectivity values (2.5x10-2, 0.00–20.0x10-2) than those extracted from random background points (1.5x10-2, 0.00–24.9x10-2), for all the 100 simulations (p-value < 0.001).

Risk maps for ASF outbreak according to the connectivity value

The median connectivity map demonstrated a fair/acceptable ability to binary classify ASF event records as empty (random point) or as positive wild boar location. The average AUC for the 100 datasets of random and ASF event points was 0.69 (0.67–0.71). The lowest threshold of connectivity value corresponding to 0.90 of sensitivity considering all datasets was 0.45x10-2 (min-max). According to this threshold, the derived binary maps resulted in a true positive rate of 90.02% (1580/1755), which is the percentage of ASF events correctly classified as positive wild boar location (Fig 4B). The average closest top left threshold was 1.14x10-2 (min-max), and the corresponding binary map reported a true positive rate of 65.41% (1148/1755) (Fig 4C). The land cover class with the highest true positive rate was “forest” class (55.44%, 973/1755), followed by “crop” (36.98%, 649/1755), “urban” (7.29%, 128/1755), and “water” (0.28%, 5/1755).

Discussion

This study performed a high-resolution estimation of the suitability of Northern Italy for wild boar presence and of the main dispersal corridors, describing the connectivity map as a potential tool for ASF control and surveillance. The suitability and connectivity maps were obtained by taking into account topographic, climatic, land cover and anthropological aspects that influence wild boar behaviour, outlining and summarising the complexity of wild boar ecology in a single index.

Northern Italy suitability for wild boars

Using input data mainly from open-source databases, the developed suitability maps visually resembled the pattern already described in previous studies [40, 100], also by season, showing high-resolution estimations (Fig 3B and Fig B in S1 Fig). In addition to the higher resolution of the maps presented here compared to those found in the literature [37, 40, 42, 100, 101], the influence of each variable on wild boar presence was also detailed (S1 Appendix). The low average suitability values reported for Northern Italy could be primarily influenced by the large area of the Po Valley, a wide plain poorly suited for wild boar presence, except for suitable “islands” at regional parks, and the strip that traces the path of the Po River. The area of the Northern Apennines and the pre-Alps creates a frame of suitable areas that almost continuously encircles the entire Po Valley (Fig 3B and Fig B in S1 Fig and S2 Fig).

In Northern Italy, wild boars seem to prefer moderate altitudes, below 1000 meters (S1 Appendix), in correspondence with forested areas. The strong anthropization, and the low vegetative cover of the Po Valley could therefore be the limiting factors for wild boar distribution in this area. On the contrary, the more flourishing vegetation present in regional parks, as well as in the area of the Apennines and pre-Alps, could provide food and shelter for wild boars, favouring their presence (Fig 3B and Fig B in S1 Fig and S2 Fig). Moreover, these areas are characterised by agricultural and commercial activities that interface with or are interspersed among wooded areas (e.g., vineyards and olive trees, orchards, chestnut groves). Such environments could provide advantage for opportunistic and highly adaptable animals like wild boars [102, 103].

Observed seasonal variations, such as increased suitability for wild boar presence in spring, summer, and autumn (Fig B in S1 Fig), may be linked to difference in precipitations and occurrences’ density due to reproductive cycles, food availability, and seasonal hunting regulations. For instance, a greater wild boar density in spring may be associated to higher precipitations which are linked to more food resources [102, 104106]. Higher wild boar activity in spring and summer may correspond to the birth of piglets, increased movement in search of food, and reduced hunting pressure, which typically occurs in autumn and winter [107, 108]. From a management perspective, these findings could inform targeted strategies, such as timing interventions to optimize hunting schedules, disease surveillance plans, and the implementation of biosecurity measures. However, also human-related reporting biases may have shaped the obtained suitability maps [70, 71, 109].

The observed seasonal differences also seem to be only partially supported by the pattern of confirmed cases of ASF in wild boars (Italian national epidemiological bulletin, https://storymaps.arcgis.com/stories/9fe6aa3980ca438cb9c7e8d656358f35, accessed on 20th June 2024), which so far has been upward from October to May, and downward from May to September. Whether this pattern of cases is explained by actual higher suitability in the intermediate seasons (spring and fall), rather than by other factors also accounting for population turnover and latency times between infection and death, needs further investigation [28].

As any model, suitability maps represent a simplification of reality and are the result of the variables and presence data included in the study, each with its limitations and potential biases. A ten-year period was chosen for NDVI and climate variables to emphasize broader landscape trends in wild boar habitat suitability. This approach offered a stable view of habitat conditions that is less affected by short-term fluctuations and more reflective of long-term environmental patterns [41, 42]. However, it may overlook annual fluctuations, especially for NDVI, which can vary due to climate changes or shifts in land use. The exclusion from the SDM of NDVI and temperature variables due to their collinearity with other variables, such as topographic and land cover characteristics, suggests that their variability and influence on wild boar ecology are likely captured by more static habitat features in the considered overall and seasonal time spans. To provide year-specific prediction, each year of wild boar data should be matched with the corresponding yearly environmental variables when available [62, 110]. Future research could incorporate this temporal variability for more precise seasonal or overall predictions of habitat suitability [110].

The greater importance of topographic and land cover variables, besides being compatible with wild boar ecology, was probably influenced by the scale chosen. Indeed, precipitation is a variable whose effect tends to be more measurable and significant at lower resolutions, and less at local scales [111, 112]. Moreover, in this kind of estimation, despite the minimization of the correlation between the chosen variables, the importance of a variable may be influenced by its correlation with real drivers of wild boar distribution in Northern Italy that were instead not included in the model, rather than by its direct contribution. Additionally, limiting the analysis on Northern Italy, unavoidably distort spatial estimations at the edge (i.e., edge effect), because of the exclusion of further occurrences or variable values potentially located right across the border of the study area, which would have provided related information in the model [113, 114]. To address these considerations, future research could explore multi-resolution approaches to balance local and broader-scale drivers of distribution, potentially enhancing model accuracy and applicability for different ecological and management contexts.

The suitability estimates displayed by the maps were also influenced by the occurrence records available, as they were biased by an unbalanced sampling effort, given by the heterogeneous distribution and activity of data providers (naturalists, researchers, hunters) (Fig 3A and Fig A in S1 Fig) [70, 71, 109].

In Northern Italy, the discontinuous natural parks (S2 Fig), the heterogeneity of the landscape, and the uneven/clustered anthropogenic pressure may have facilitated sampling in areas more easily accessible to humans, affecting the accuracy of estimates in greener, sloping, and border areas (e.g., Northern Apennines, and Alps). In regions featured by more homogeneous landscape and anthropogenic pressure, the estimation of land suitability could return better and more accurate performances in areas where greater wild boar suitability is expected, as smaller differences in accessibility would allow the variability associated with ecological factors to emerge.

Even variations in suitability between seasons were probably dependent more on the different sizes and distribution of occurrences than on the variables included. The variable "precipitation" was in fact the only one with a time dimension included in the final models, and it was among the most important variables only for the winter season. Any explanation of observed seasonal differences thus remains speculative. Increasing the number of occurrence records, collected in a homogeneous, complete, and widespread manner both spatially, over the entire study area, and temporally, could enhance the accuracy of predictions of Northern Italy suitability to the presence of wild boar. Additionally, the incorporation of more variables related to seasonal aspects (e.g., seasonal variability of food resources) is needed to offer a more comprehensive understanding of the wild boar’s spatial behaviour across seasons, which in this study remained obscure. A more dynamic approach adopting also daily or weekly variables, instead of static mean, together with VHF (very high frequency) radio-tracking data could validate here presented findings, and further enhance the knowledge on wild boar habitat preferences across diverse landscapes and seasonal conditions [62, 110, 115].

The evaluation of different models allowed the identification of the best-performing one for the specific dataset and objectives of the study. The analysis demonstrated that not all models yield equivalent results. Despite their strong predictive performance, RF and XGBOOST algorithms were not selected for the final analysis. These algorithms exhibited performance metrics approaching or equal to 1 (S3 Table), which, while mathematically optimal, warrant careful interpretation in ecological contexts [53, 96]. Near-perfect metrics could indicate overfitting, suggesting that the models may have captured noise patterns specific to the training data rather than true ecological relationships. Additionally, RF and XGBOOST introduce greater model complexity compared to GBM, which can increase the risk of overfitting and reduce results interpretability [96]. Such overfitting could ultimately limit model transferability and practical application in novel scenarios [53, 9799]. The minimization of sampling bias through the choice of pseudoabsences as described in [70], the reduction of occurrences density, and the final choice of a model (GBM) with less overfitting behaviour and excellent predictive capabilities even from presence-only data, still resulted in optimal estimation performance for all considered scenarios (Table 1) [9395]. The obtained suitability maps thus contribute to a deeper understanding of the real distribution of wild boar in the territory. The final suitability maps were an ensemble prediction from multiple runs of an individual algorithm (GBM). Future research could explore the potential benefits of an ensemble modelling combining different algorithms [116, 117], leveraging the strengths of each, to potentially provide more robust estimations on wild boar distribution [37, 49].

Northern Italy main corridors of wild boar dispersal

The suitability pattern was largely traced from the connectivity map, highlighting the main corridors of dispersal of the species, and thus the potential preferential routes of spread of ASFV genotype II in Northern Italy (Fig 4A). Compared with the suitability map, the connectivity estimation has the advantage of attributing a higher probability of being crossed/reached to areas accessible by multiple pathways, according to circuit theory [81]. This approach returns a continuous estimate of the main dispersal corridors accounting for the complexity of animal movement, showing the connection between the most suitable patches.

The presence of high-traffic corridors (i.e., routes with continuous high connectivity values that facilitate the movement of wild boars) crossing all of Northern Italy, poses a significant threat for the spread of ASFV genotype II to the wild and domestic pig populations. These corridors are found in both the highlands (mainly the Northern Apennines and pre-Alps) and the lowlands (Po valley and regional parks) (Fig 4A and S2 Fig). The high connectivity of these corridors, both in isolated high and low-lying lands and in areas close to urban settlements, increases the risk of contact among wild boars and between wild boar and humans. This heightens the potential for undetected circulation in areas that are more difficult to monitor for wild boar carcasses, as well as for long-distance spread of ASFV due to human activity [18, 20]. Furthermore, the high connectivity observed within regional parks suggests them as potential receptacles for ASFV. This makes regional parks a primary focus for enhanced surveillance and wild boar population density control efforts.

The connection of estimated corridors with currently infected areas (Fig 4) severely highlights the urgent need to improve control, surveillance and biosecurity strategies [118, 119]. Future research could examine the alignment of these wild boar dispersal corridors with existing farm locations to better understand transmission risks and refine prevention strategies.

Wild boar search is usually focused in wooded areas, and with a good vegetation cover, which may be more intuitive by human perspective. However, the capacity of forest land cover class to correctly classify ASF event locations close to a random classification, might suggest that a different approach should be adopted. Conversely, the connectivity map demonstrated a strong ability in accurately identifying ASF event outbreak locations (Fig 4B and 4C). ASFV genotype II positive wild boars tended to locate at high levels of connectivity (compared with random points), and the true positive rate of the binary risk maps was higher than that achieved using only land cover classes. These findings suggest the high potential of the obtained connectivity map as a significant tool for controlling the spread of ASFV genotype II across the territory. The binary maps provided here are just an example of the possible outcome that can be obtained. A more appropriate threshold can be set as needed, resulting in a new binary risk map that accounts for the available resources and evaluated hazards.

However, it is important to take note that the carcass detection/surveillance activity is likely to have been influenced by previous ASF findings in wild boars, which may have targeted the subsequent search by veterinary/forestry services in similar areas [92], overestimating the classification power of the connectivity map. This poses an unavoidable bias inherent in the data source. On the other hand, the different nature of occurrence input data (based on availability, convenience data) compared to ASFV genotype II positive wild boar coordinates, which instead come from targeted research as described by the ASF control plans [80], may have limited binary maps classification performance. In future studies, if the epidemic front progresses, and more locations of ASF events become available, positive wild boar records could be used as presence data input, while the suitability and/or connectivity maps could be included as predictor variables, to model “deathbeds” of ASFV affected wild boars [92]. Additionally, the integration of the suitability and connectivity maps with other factors driving the disease spreading as model variables, would allow the prediction of the epidemic directionality [120], both within the wild population and from wild boar to domestic pig.

Implication for ASF management and surveillance strategies

Despite the aforementioned limitations, mainly related to the nature of input data, both the suitability map and connectivity map developed in this study could serve as supporting tools to be integrated into current ASF epidemic management strategies. Briefly, based on the approach of regionalisation and according to the Commission Implementing Regulation (EU) 2021/605, when an ASF case is confirmed in a wild boar in the territory, an infected zone (later defined as restriction zone II) and a surrounding surveillance zone (restriction zone I) are defined, following administrative boundaries. In these areas, active search for wild boar carcasses is performed, also with the help of dogs and drones, ideally removing infected carcasses every two weeks [80]. In fact, it is estimated that transmission between wild boar and infected carcass does not occur until 12 to 15 days [121, 122]. However, this pace is hardly maintained, except at the onset of the epidemic when its expansion is still limited.

Currently, there are no defined rules in institutional reports/regulation on where to search for carcasses to increase detection rates. Additionally, people available to actively look for carcasses are often insufficient, resulting in the search being concentrated in sloping areas where high wild boar densities are estimated, and in declivous areas [80]. However, to the best of authors’ knowledge, an up-to-date, local and accurate estimate of wild boar density is often not available [37]. Studies performed in other countries suggest young, broad, moist, and cool forests, or grasslands with significant vegetation as preferred habitats for diseased wild boars, probably due to high fever, or because perceived as safer environments [92, 123]. Similar studies should be performed also in Italy, to assess if moribound wild boars behaviour is the same observed in other countries or if it is determined by local ecological drivers. The evidence on local deathbed preferences could be integrated with the risk maps obtained in this study to improve early detection of wild boar carcasses. Together with the active search for carcasses, physical barriers (natural or artificial) are built/reinforced to contain the progression of the epidemic front. However, this requires time-consuming bureaucracy often incompatible with the rapidity of the phenomenon [33]. Both suitability map and connectivity map could provide valuable support, to efficiently identify areas feature by a higher animal flow and optimize resources to, control wild boar population and disease spread. Indeed, the suitability score could provide an indirect indication of the likely wild boar density, when data are not available. The hunting activity should be focused on highest suitable area, to preventively reduce wild boar population in areas not yet infected [124126]. However, culling strategies should be properly designed, since hunting pressure could favour wild boar migrations, increasing the risk of wild boar dispersal in ASF affected areas, and therefore ASF spreading [127, 128].

In tandem, the connectivity map would recommend on which zones/paths to target and prioritize carcass search and barrier construction of wild boar dispersal corridors. Primary disruption of corridors with higher probability of passage (higher connectivity value) through construction and reinforcement of man-made (e.g., fences, highways) or natural (e.g., rivers) barriers would efficiently contain the disease spread. On the other hand, the impact on other species should not be ignored, as landscape connectivity guarantees their preservation, migration, biodiversity, and gene flow [129, 130]. Veterinary epidemiological services should work synergistically with conservation experts to find a compromise that can preserve both the ecology of local species and animal health.

Conclusion

The maps of suitable habitats and major corridors of wild boar spreading made available in this study represent a significant step forward in understanding and controlling ASF in Northern Italy. Having estimated the suitability and connectivity of the territory from wild boar presence data, not related to disease outbreaks, make the maps obtained in this study transversally applicable in other investigations, related to other transmissible diseases (e.g., Aujeszky disease, Classical swine fever, Tuberculosis, etc.), as well as in other research fields. Similarly, the replication of the framework here adopted in other Italian regions and other countries, necessarily represents a future perspective to act in prevention, especially considering the wild boar population density, and the related food/tourism market in some regions (e.g., Tuscany). The fruitful use of open-source presence data, collected by citizen associations and naturalists, demonstrates how citizen science is a promising approach that even indirectly could improve veterinary surveillance. Additionally, the application of species distribution methods, commonly used in the field of ecology and conservation, for the development of tools aimed at enhancing control and surveillance of diseases of veterinary concern, attests the potential of multidisciplinary approaches in addressing present and future challenges.

Supporting information

S1 Table. Environmental variables included in the species distribution models (SDM) for each scenario, after testing for collinearity and multicollinearity.

Variables taking into account distance from environmental features are shortened with "Dist. from”, while those considering topological patterns with "Top.". Variable references are available in S2 Table.

(PDF)

pone.0317577.s001.pdf (111.3KB, pdf)
S2 Table. Metadata associated to the variables included in the study.

(XLSX)

pone.0317577.s002.xlsx (24.8KB, xlsx)
S3 Table. Performance of algorithms of species distribution modelling (SDM), available in the package Biomod2, evaluated through Area Under ROC (Relative Operating Characteristic) Curve (AUC) and True Skills Statistics (TSS) metrics.

Mean and standard deviation (sd) are reported for each considered scenario.

(XLSX)

pone.0317577.s003.xlsx (19.6KB, xlsx)
S1 Appendix. Response curves of the computed Gradient Boosted Machine ensemble model.

(PDF)

pone.0317577.s004.pdf (650.1KB, pdf)
S1 Fig. Distribution of wild boar records and estimated suitability maps by season.

(A) Distribution of occurrence records included in the final model (red points) by season. (B) Maps describing habitat suitability for the presence of wild boars in Northern Italy referring to environmental conditions by season. Base map of Italy based on the 2024 regional administrative boundaries from the Italian National Institute of Statistics (https://www.istat.it/).

(TIF)

pone.0317577.s005.tif (1.7MB, tif)
S2 Fig. Map of the study area reporting geographical details.

Base map made with Natural Earth. Study area and Asti province based on the 2024 administrative boundaries from the Italian National Institute of Statistics (https://www.istat.it/). Lake Maggiore, Po River, regional parks details made from vector files available on the National geoportal of the Italian Ministry of Environment and Energy Security (https://gn.mase.gov.it/) for illustrative purpose only.

(TIF)

pone.0317577.s006.tif (5.6MB, tif)
S1 File. Main code scripts used in the workflow are available at https://doi.org/10.5281/zenodo.14639171.

(PDF)

pone.0317577.s007.pdf (96.8KB, pdf)
S2 File. Suitability maps, connectivity map and binary risk maps are available at https://doi.org/10.5281/zenodo.14637229.

(PDF)

pone.0317577.s008.pdf (90.9KB, pdf)

Acknowledgments

Special thanks to colleague Martina Ossola (freelance veterinary doctor) for providing epidemiological data on wild boar records from the official routine culling campaigns conducted in the Varese province, in Lombardy, and to the following colleagues for their advice and help during the course of the project: Maxime Lenormand, Jérémy Lavarenne, Renaud Marti (all affiliated with TETIS Unit, INRAE, National Research Institute on Agriculture, Food and the Environment, Montpellier, France), Anas Zakroum (French Agricultural Research Centre for International Development, CIRAD, Montpellier, France), Daria Di Sabatino and Lara Savini (both affiliated with Istituto Zooprofilattico Sperimentale dell’Abruzzo e del Molise ‘G. Caporale’ (IZS-Teramo), Teramo, Italy), Isadora Benvegnù (Veneto Agricoltura), Mario Chiari (Direzione Generale Welfare di Regione Lombardia, Unità Organizzativa Veterinaria, Milan, Italy), Francesca Meriggi (Ente Regionale per i Servizi all’Agricoltura e alle Foreste, Milan, Italy).

Data Availability

Main code scripts used in the workflow are available at https://doi.org/10.5281/zenodo.14639171. Link to Gitlab repository: https://gitlab.com/giuliafaustini1/habitat-suitability-mapping-and-landscape-connectivity-analysisto-predict-african-swine-fever-spread-in-wild-boar-population. Suitability maps, connectivity map and binary risk maps are available at https://doi.org/10.5281/zenodo.14637229.

Funding Statement

The research project was made possible by the financial support provided to G.F. by the Società Italiana di Patologia ed Allevamento dei Suini (SIPAS, https://www.sipas.org/chi-siamo/) and by the Erasmus + for traineeship mobility program (https://erasmus-plus.ec.europa.eu/opportunities/opportunities-for-individuals/students/traineeships-abroad-for-students). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. There was no additional external funding received for this study."

References

  • 1.Alonso C, Borca M, Dixon L, Revilla Y, Rodriguez F, Escribano JM. ICTV Virus Taxonomy Profile: Asfarviridae. J Gen Virol. 2018;99: 613–614. doi: 10.1099/jgv.0.001049 [DOI] [PubMed] [Google Scholar]
  • 2.Njau EP, Machuka EM, Cleaveland S, Shirima GM, Kusiluka LJ, Okoth EA, et al. African Swine Fever Virus (ASFV): Biology, Genomics and Genotypes Circulating in Sub-Saharan Africa. Viruses. 2021;13. doi: 10.3390/V13112285 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Chenais E, Ståhl K, Guberti V, Depner K. Identification of Wild Boar–Habitat Epidemiologic Cycle in African Swine Fever Epizootic. Emerg Infect Dis. 2018;24: 810. doi: 10.3201/eid2404.172127 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Brown VR, Miller RS, McKee SC, Ernst KH, Didero NM, Maison RM, et al. Risks of introduction and economic consequences associated with African swine fever, classical swine fever and foot-and-mouth disease: A review of the literature. Transbound Emerg Dis. 2021;68: 1910–1965. doi: 10.1111/tbed.13919 [DOI] [PubMed] [Google Scholar]
  • 5.Scientific Opinion on African swine fever. EFSA Journal. 2014;12. doi: 10.2903/J.EFSA.2014.3628 [DOI] [Google Scholar]
  • 6.Penrith ML, Kivaria FM. One hundred years of African swine fever in Africa: Where have we been, where are we now, where are we going? Transbound Emerg Dis. 2022;69: e1179–e1200. doi: 10.1111/tbed.14466 [DOI] [PubMed] [Google Scholar]
  • 7.Sehl J, Pikalo J, Schäfer A, Franzke K, Pannhorst K, Elnagar A, et al. Comparative Pathology of Domestic Pigs and Wild Boar Infected with the Moderately Virulent African Swine Fever Virus Strain “Estonia 2014.” Pathogens. 2020;9: 1–23. doi: 10.3390/PATHOGENS9080662 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Nurmoja I, Petrov A, Breidenstein C, Zani L, Forth JH, Beer M, et al. Biological characterization of African swine fever virus genotype II strains from north-eastern Estonia in European wild boar. Transbound Emerg Dis. 2017;64: 2034–2041. doi: 10.1111/tbed.12614 [DOI] [PubMed] [Google Scholar]
  • 9.Mur L, Igolkin A, Varentsova A, Pershin A, Remyga S, Shevchenko I, et al. Detection of African Swine Fever Antibodies in Experimental and Field Samples from the Russian Federation: Implications for Control. Transbound Emerg Dis. 2016;63: e436–e440. doi: 10.1111/tbed.12304 [DOI] [PubMed] [Google Scholar]
  • 10.Penrith ML, Vosloo W. Review of African swine fever: transmission, spread and control. J S Afr Vet Assoc. 2009;80: 58–62. doi: 10.4102/jsava.v80i2.172 [DOI] [PubMed] [Google Scholar]
  • 11.Mebus CA, Dardiri AH. Western hemisphere isolates of African swine fever virus: asymptomatic carriers and resistance to challenge inoculation. Am J Vet Res. 1980;41: 1867–1869. Available: https://europepmc.org/article/med/7212418 [PubMed] [Google Scholar]
  • 12.Gallardo C, Soler A, Nieto R, Cano C, Pelayo V, Sánchez MA, et al. Experimental Infection of Domestic Pigs with African Swine Fever Virus Lithuania 2014 Genotype II Field Isolate. Transbound Emerg Dis. 2017;64: 300–304. doi: 10.1111/tbed.12346 [DOI] [PubMed] [Google Scholar]
  • 13.Pereira de Oliveira R, Hutet E, Paboeuf F, Duhayon M, Boinas F, de Leon AP, et al. Comparative vector competence of the Afrotropical soft tick Ornithodoros moubata and Palearctic species, O. erraticus and O. verrucosus, for African swine fever virus strains circulating in Eurasia. PLoS One. 2019;14: e0225657. doi: 10.1371/journal.pone.0225657 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Wilkinson PJ. Epidemiology of African swine fever. Rev Sci Tech. 1986;5: 487–493. doi: 10.20506/rst.5.2.243 [DOI] [PubMed] [Google Scholar]
  • 15.Boshoff CI, Bastos ADS, Gerber LJ, Vosloo W. Genetic characterisation of African swine fever viruses from outbreaks in southern Africa (1973–1999). Vet Microbiol. 2007;121: 45–55. doi: 10.1016/j.vetmic.2006.11.007 [DOI] [PubMed] [Google Scholar]
  • 16.Gaudreault NN, Madden DW, Wilson WC, Trujillo JD, Richt JA. African Swine Fever Virus: An Emerging DNA Arbovirus. Front Vet Sci. 2020;7. doi: 10.3389/fvets.2020.00215 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Jurado C, Fernández-Carrión E, Mur L, Rolesu S, Laddomada A, Sánchez-Vizcaíno JM. Why is African swine fever still present in Sardinia? Transbound Emerg Dis. 2018;65: 557–566. doi: 10.1111/tbed.12740 [DOI] [PubMed] [Google Scholar]
  • 18.Depner K, Gortazar C, Guberti V, Masiulis M, More S, Olßsevskis E, et al. Epidemiological analyses of African swine fever in the Baltic States and Poland. EFSA Journal. 2017;15: e05068. doi: 10.2903/J.EFSA.2017.5068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Miteva A, Papanikolaou A, Gogin A, Boklund A, Bøtner A, Linden A, et al. Epidemiological analyses of African swine fever in the European Union (November 2018 to October 2019). EFSA Journal. 2020;18. doi: 10.2903/J.EFSA.2020.5996 [DOI] [Google Scholar]
  • 20.Sauter-Louis C, Conraths FJ, Probst C, Blohm U, Schulz K, Sehl J, et al. African Swine Fever in Wild Boar in Europe—A Review. Viruses 2021, Vol 13, Page 1717. 2021;13: 1717. doi: 10.3390/v13091717 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Pepin KM, Golnar AJ, Abdo Z, Podgórski T. Ecological drivers of African swine fever virus persistence in wild boar populations: Insight for control. Ecol Evol. 2020;10: 2846–2859. doi: 10.1002/ece3.6100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Ladoşi I, Ppuc TA, Ladoşi D. The Impact of African Swine Fever (ASF) on Romanian Pig Meat Production: A Review. Acta Vet Brno. 2023;73: 1–12. doi: 10.2478/ACVE-2023-0001 [DOI] [Google Scholar]
  • 23.Baños JV, Boklund A, Gogin A, Gortázar C, Guberti V, Helyes G, et al. Epidemiological analyses of African swine fever in the European Union: (September 2020 to August 2021). EFSA Journal. 2022;20. doi: 10.2903/j.efsa.2022.7290 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Self-declaration of the recovery of freedom from African swine fever in all suids by the Czech Republic. [cited 24 Jun 2024]. Available: http://eagri.cz/public/web/mze/legislativa/pravni-predpisy-mze/tematicky-prehled/Legislativa-MZe_uplna-zneni_zakon- [Google Scholar]
  • 25.Self-declaration of Belgium’s African swine fever-free status in all swine species. [cited 24 Jun 2024]. doi: 10.3201/eid2506.190286 [DOI] [Google Scholar]
  • 26.Podgórski T, Borowik T, Łyjak M, Woźniakowski G. Spatial epidemiology of African swine fever: Host, landscape and anthropogenic drivers of disease occurrence in wild boar. Prev Vet Med. 2020;177. doi: 10.1016/j.prevetmed.2019.104691 [DOI] [PubMed] [Google Scholar]
  • 27.Taylor RA, Podgórski T, Simons RRL, Ip S, Gale P, Kelly LA, et al. Predicting spread and effective control measures for African swine fever—Should we blame the boars? Transbound Emerg Dis. 2021;68: 397–416. doi: 10.1111/tbed.13690 [DOI] [PubMed] [Google Scholar]
  • 28.Schulz K, Conraths FJ, Blome S, Staubach C, Sauter-Louis C. African Swine Fever: Fast and Furious or Slow and Steady? Viruses. 2019;11. doi: 10.3390/v11090866 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ståhl K, Boklund A, Podgórski T, Vergne T, Abrahantes JC, Papanikolaou A, et al. Epidemiological analysis of African swine fever in the European Union during 2022. EFSA Journal. 2023;21. doi: 10.2903/j.efsa.2023.8016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Morelle K, Bubnicki J, Churski M, Gryz J, Podgórski T, Kuijper DPJ. Disease-Induced Mortality Outweighs Hunting in Causing Wild Boar Population Crash After African Swine Fever Outbreak. Front Vet Sci. 2020;7: 515607. doi: 10.3389/fvets.2020.00378 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Niemi JK. Impacts of African Swine Fever on Pigmeat Markets in Europe. Front Vet Sci. 2020;7: 560588. doi: 10.3389/fvets.2020.00634 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Giammarioli M, Alessandro D, Cammà C, Masoero L, Torresi C, Marcacci M, et al. Molecular Characterization of the First African Swine Fever Virus Genotype II Strains Identified from Mainland Italy, 2022. Pathogens. 2023;12. doi: 10.3390/pathogens12030372 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Pavone S, Iscaro C, Dettori A, Feliziani F. African Swine Fever: The State of the Art in Italy. Animals 2023, Vol 13, Page 2998. 2023;13: 2998. doi: 10.3390/ani13192998 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Iscaro C, Dondo A, Ruocco L, Masoero L, Giammarioli M, Zoppi S, et al. January 2022: Index case of new African Swine Fever incursion in mainland Italy. Transbound Emerg Dis. 2022;69: 1707–1711. doi: 10.1111/tbed.14584 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Dei Giudici S, Loi F, Ghisu S, Angioi PP, Zinellu S, Fiori MS, et al. The Long-Jumping of African Swine Fever: First Genotype II Notified in Sardinia, Italy. Viruses. 2023;16. doi: 10.3390/v16010032 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Salazar LG, Rose N, Hayes B, Hammami P, Baubet E, Desvaux S, et al. Effects of habitat fragmentation and hunting activities on African swine fever dynamics among wild boar populations. Prev Vet Med. 2022;208: 105750. doi: 10.1016/j.prevetmed.2022.105750 [DOI] [PubMed] [Google Scholar]
  • 37.ENETwild consortium, Croft S, Smith G, Acevedo P, Vicente J. Wild boar in focus: Review of existing models on spatial distribution and density of wild boar and proposal for next steps. EFSA Supporting Publications. 2018;15. doi: 10.2903/sp.efsa.2018.en-1490 [DOI] [Google Scholar]
  • 38.Elith J, Leathwick JR. Species distribution models: Ecological explanation and prediction across space and time. Annu Rev Ecol Evol Syst. 2009;40: 677–697. doi: 10.1146/ANNUREV.ECOLSYS.110308.120159/1 [DOI] [Google Scholar]
  • 39.Bosch J, Peris S, Fonseca C, Martinez M, de La Torre A, Iglesias I, et al. Distribution, abundance and density of the wild boar on the Iberian Peninsula, based on the CORINE program and hunting statistics. 2012;61: 138–151. doi: 10.25225/FOZO.V61.I2.A7.2012 [DOI] [Google Scholar]
  • 40.Alexander NS, Massei G, Wint W. The European Distribution of Sus Scrofa. Model Outputs from the Project Described within the Poster–Where are All the Boars? An Attempt to Gain a Continental Perspective. Open Health Data. 2016;4. doi: 10.5334/OHD.24 [DOI] [Google Scholar]
  • 41.Croft S, Chauvenet ALM, Smith GC. A systematic approach to estimate the distribution and total abundance of British mammals. PLoS One. 2017;12: e0176339. doi: 10.1371/journal.pone.0176339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Pittiglio C, Khomenko S, Beltran-Alcrudo D. Wild boar mapping using population-density statistics: From polygons to high resolution raster maps. PLoS One. 2018;13. doi: 10.1371/journal.pone.0193295 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Bosch J, Mardones F, Pérez A, De la Torre A, Muñoz MJ. A maximum entropy model for predicting wild boar distribution in Spain. Spanish Journal of Agricultural Research. 2014;12: 984–999. doi: 10.5424/sjar/2014124-5717 [DOI] [Google Scholar]
  • 44.Honda T. Environmental Factors Affecting the Distribution of the Wild Boar, Sika Deer, Asiatic Black Bear and Japanese Macaque in Central Japan, with Implications for Human-Wildlife Conflict. 2009;34: 107–116. doi: 10.3106/041.034.0206 [DOI] [Google Scholar]
  • 45.McClure ML, Burdett CL, Farnsworth ML, Lutman MW, Theobald DM, Riggs PD, et al. Modeling and Mapping the Probability of Occurrence of Invasive Wild Pigs across the Contiguous United States. PLoS One. 2015;10: e0133771. doi: 10.1371/journal.pone.0133771 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Acevedo P, Farfán MÁ, Márquez AL, Delibes-Mateos M, Real R, Vargas JM. Past, present and future of wild ungulates in relation to changes in land use. Landsc Ecol. 2011;26: 19–31. doi: 10.1007/S10980-010-9538-2/TABLES/4 [DOI] [Google Scholar]
  • 47.Phillips SB, Aneja VP, Kang D, Arya SP. Maximum entropy modeling of species geographic distributions. Ecol Modell. 2006;190: 231–259. doi: 10.1016/J.ECOLMODEL.2005.03.026 [DOI] [Google Scholar]
  • 48.Elith J, Graham CH. Do they? How do they? WHY do they differ? on finding reasons for differing performances of species distribution models. Ecography. 2009;32: 66–77. doi: 10.1111/J.1600-0587.2008.05505.X [DOI] [Google Scholar]
  • 49.Hao T, Elith J, Lahoz-Monfort JJ, Guillera-Arroita G. Testing whether ensemble modelling is advantageous for maximising predictive performance of species distribution models. Ecography. 2020;43: 549–558. doi: 10.1111/ecog.04890 [DOI] [Google Scholar]
  • 50.Zhu GP, Peterson AT. Do consensus models outperform individual models? Transferability evaluations of diverse modeling approaches for an invasive moth. Biol Invasions. 2017;19: 2519–2532. doi: 10.1007/S10530-017-1460-Y/FIGURES/5 [DOI] [Google Scholar]
  • 51.Bahn V, McGill BJ. Testing the Predictive Performance of Distribution Models. Oikos. 2012;122: 1. doi: 10.1111/J.1600-0706.2012.00299.X [DOI] [Google Scholar]
  • 52.Li X, Wang Y. Applying various algorithms for species distribution modelling. Integr Zool. 2013;8: 124–135. doi: 10.1111/1749-4877.12000 [DOI] [PubMed] [Google Scholar]
  • 53.Čengić M, Rost J, Remenska D, Janse JH, Huijbregts MAJ, Schipper AM. On the importance of predictor choice, modelling technique, and number of pseudo-absences for bioclimatic envelope model performance. Ecol Evol. 2020;10: 12307–12317. doi: 10.1002/ece3.6859 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Goicolea T, Cisneros-Araújo P, Vega CA, Sánchez-Vizcaíno JM, Mateo-Sánchez Mc, Bosch J. Landscape connectivity for predicting the spread of ASF in the European wild boar population. Scientific Reports 2024 14:1. 2024;14: 1–9. doi: 10.1038/s41598-024-53869-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Franzo G, Tucciarone CM, Drigo M, Cecchinato M, Martini M, Mondin A, et al. First report of wild boar susceptibility to Porcine circovirus type 3: High prevalence in the Colli Euganei Regional Park (Italy) in the absence of clinical signs. Transbound Emerg Dis. 2018;65: 957–962. doi: 10.1111/tbed.12905 [DOI] [PubMed] [Google Scholar]
  • 56.Heffner RA, Butler IV MJ, Reilly CK. Pseudoreplication Revisited. Ecology. 1996;77: 2558–2562. doi: 10.2307/2265754 [DOI] [Google Scholar]
  • 57.Hurlbert SH. Pseudoreplication and the Design of Ecological Field Experiments. Ecol Monogr. 1984;54: 187–211. doi: 10.2307/1942661 [DOI] [Google Scholar]
  • 58.Guisan A, Zimmermann NE. Predictive habitat distribution models in ecology. Ecol Modell. 2000;135: 147–186. doi: 10.1016/S0304-3800(00)00354-9 [DOI] [Google Scholar]
  • 59.Boria RA, Olson LE, Goodman SM, Anderson RP. Spatial filtering to reduce sampling bias can improve the performance of ecological niche models. Ecol Modell. 2014;275: 73–77. doi: 10.1016/J.ECOLMODEL.2013.12.012 [DOI] [Google Scholar]
  • 60.R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2023. [Google Scholar]
  • 61.Acevedo P, Croft S, Smith GC, Blanco‐Aguiar JA, Fernandez‐Lopez J, Scandura M, et al. ENETwild modelling of wild boar distribution and abundance: update of occurrence and hunting data‐based models. EFSA Supporting Publications. 2019;16. doi: 10.2903/sp.efsa.2019.EN-1674 [DOI] [Google Scholar]
  • 62.Roubicek AJ, VanDerWal J, Beaumont LJ, Pitman AJ, Wilson P, Hughes L. Does the choice of climate baseline matter in ecological niche modelling? Ecol Modell. 2010;221: 2280–2286. doi: 10.1016/J.ECOLMODEL.2010.06.021 [DOI] [Google Scholar]
  • 63.Trenberth KE. What are the Seasons? Bull Am Meteorol Soc. 1983;64: 1276–1282. Available: [DOI] [Google Scholar]
  • 64.QGIS Development Team. QGIS Geographic Information System. QGIS Association; 2023. [Google Scholar]
  • 65.Van Rossum G and D, Fred L. Python 3 Reference Manual. Scotts Valley, CA: CreateSpace; 2009. [Google Scholar]
  • 66.Brun P, Thuiller W, Chauvier Y, Pellissier L, Wüest RO, Wang Z, et al. Model complexity affects species distribution projections under climate change. J Biogeogr. 2020;47: 130–142. doi: 10.1111/JBI.13734 [DOI] [Google Scholar]
  • 67.Heiberger R. HH: Statistical Analysis and Data Display: Heiberger and Holland. R package version 3.1–52. 2024. doi: 10.32614/CRAN.PACKAGE.HH [DOI] [Google Scholar]
  • 68.James G, Witten D, Hastie T, Tibshirani R. An introduction to statistical learning. Springer; 2013. [Google Scholar]
  • 69.Guisan A, Thuiller W. Predicting species distribution: Offering more than simple habitat models. Ecol Lett. 2005;8: 993–1009. doi: 10.1111/j.1461-0248.2005.00792.x [DOI] [PubMed] [Google Scholar]
  • 70.Dubos N, Préau C, Lenormand M, Papuga G, Monsarrat S, Denelle P, et al. Assessing the effect of sample bias correction in species distribution models. Ecol Indic. 2022;145. doi: 10.1016/j.ecolind.2022.109487 [DOI] [Google Scholar]
  • 71.Phillips SJ, Dudík M, Elith J, Graham CH, Lehmann A, Leathwick J, et al. Sample selection bias and presence-only distribution models: implications for background and pseudo-absence data. Ecol Appl. 2009;19: 181–197. doi: 10.1890/07-2153.1 [DOI] [PubMed] [Google Scholar]
  • 72.Ferrier S, Drielsma M, Manion G, Watson G. Extended statistical approaches to modelling spatial pattern in biodiversity in northeast New South Wales. II. Community-level modelling. Biodivers Conserv. 2002;11: 2309–2338. doi: 10.1023/A:1021374009951/METRICS [DOI] [Google Scholar]
  • 73.Fitzpatrick MC, Gotelli NJ, Ellison AM. MaxEnt versus MaxLike: empirical comparisons with ant species distributions. Ecosphere. 2013;4: 1–15. doi: 10.1890/ES13-00066.1 [DOI] [Google Scholar]
  • 74.Préau C, Dubos N, Lenormand M, Denelle P, Louarn M Le, Alleaume S, et al. Dispersal-based species pools as sources of connectivity area mismatches. Landsc Ecol. 2021;37: 729–743. doi: 10.1007/s10980-021-01371-y [DOI] [Google Scholar]
  • 75.Thuiller W, Lafourcade B, Engler R, Araújo MB. BIOMOD–a platform for ensemble forecasting of species distributions. Ecography. 2009;32: 369–373. doi: 10.1111/J.1600-0587.2008.05742.X [DOI] [Google Scholar]
  • 76.Stockwell D, Peters D. The GARP modelling system: problems and solutions to automated spatial prediction. International Journal of Geographical Information Science. 1999;13: 143–158. doi: 10.1080/136588199241391 [DOI] [Google Scholar]
  • 77.Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982;143: 29–36. doi: 10.1148/radiology.143.1.7063747 [DOI] [PubMed] [Google Scholar]
  • 78.Allouche O, Tsoar A, Kadmon R. Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS). Journal of Applied Ecology. 2006;43: 1223–1232. doi: 10.1111/J.1365-2664.2006.01214.X [DOI] [Google Scholar]
  • 79.Hirzel AH, Le Lay G, Helfer V, Randin C, Guisan A. Evaluating the ability of habitat suitability models to predict species presences. 2006. doi: 10.1016/j.ecolmodel.2006.05.017 [DOI] [Google Scholar]
  • 80.Guberti Khomenko, Masiulis Kerba. African swine fever in wild boar—Ecology and biosecurity Second edition. FAO Animal Production and Health Manual No 28 Rome, FAO, World Organisation for Animal Health and European Commission. [cited 24 Jun 2024]. Available: doi: 10.4060/cc0785 [DOI] [Google Scholar]
  • 81.Anantharaman R, Hall K, Shah VB, Edelman A. Circuitscape in Julia: High Performance Connectivity Modelling to Support Conservation Decisions. JuliaCon Proceedings. 2020;1: 58. doi: 10.21105/JCON.00058/STATUS.SVG) [DOI] [Google Scholar]
  • 82.Capinha C, Larson ER, Tricarico E, Olden JD, Gherardi F. Effects of climate change, invasive species, and disease on the distribution of native European crayfishes. Conserv Biol. 2013;27: 731–740. doi: 10.1111/cobi.12043 [DOI] [PubMed] [Google Scholar]
  • 83.Pearson RG, Raxworthy CJ, Nakamura M, Townsend Peterson A. Predicting species distributions from small numbers of occurrence records: A test case using cryptic geckos in Madagascar. J Biogeogr. 2007;34: 102–117. doi: 10.1111/J.1365-2699.2006.01594.X [DOI] [Google Scholar]
  • 84.Ramirez-Reyes C, Bateman BL, Radeloff VC. Effects of habitat suitability and minimum patch size thresholds on the assessment of landscape connectivity for jaguars in the Sierra Gorda, Mexico. Biol Conserv. 2016;204: 296–305. doi: 10.1016/J.BIOCON.2016.10.020 [DOI] [Google Scholar]
  • 85.Fattebert J, Baubet E, Slotow R, Fischer C. Landscape effects on wild boar home range size under contrasting harvest regimes in a human-dominated agro-ecosystem. Eur J Wildl Res. 2017;63. doi: 10.1007/S10344-017-1090-9 [DOI] [Google Scholar]
  • 86.ENETWILD-consortium, Pascual-Rico R, Acevedo P, Apollonio M, Blanco-Aguiar J, Body G, et al. Wild boar ecology: a review of wild boar ecological and demographic parameters by bioregion all over Europe. EFSA Supporting Publications. 2022;19. doi: 10.2903/sp.efsa.2022.EN-7211 [DOI] [Google Scholar]
  • 87.Wang YH, Yang KC, Bridgman CL, Lin LK. Habitat suitability modelling to correlate gene flow with landscape connectivity. Landsc Ecol. 2008;23: 989–1000. doi: 10.1007/S10980-008-9262-3/TABLES/3 [DOI] [Google Scholar]
  • 88.Duflot R, Avon C, Roche P, Bergès L. Combining habitat suitability models and spatial graphs for more effective landscape conservation planning: An applied methodological framework and a species case study. J Nat Conserv. 2018;46: 38–47. doi: 10.1016/j.jnc.2018.08.005 [DOI] [Google Scholar]
  • 89.Préau C, Tournebize J, Lenormand M, Alleaume S, Boussada VG, Luque S. Habitat connectivity in agricultural landscapes improving multi-functionality of constructed wetlands as nature-based solutions. Ecol Eng. 2022;182. doi: 10.1016/j.ecoleng.2022.106725 [DOI] [Google Scholar]
  • 90.Keeley ATH, Beier P, Gagnon JW. Estimating landscape resistance from habitat suitability: effects of data source and nonlinearities. Landsc Ecol. 2016;31: 2151–2162. doi: 10.1007/S10980-016-0387-5/FIGURES/3 [DOI] [Google Scholar]
  • 91.Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, et al. pROC: An open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12: 1–8. doi: 10.1186/1471-2105-12-77/TABLES/3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Morelle K, Jezek M, Licoppe A, Podgorski T. Deathbed choice by ASF-infected wild boar can help find carcasses. Transbound Emerg Dis. 2019;66: 1821–1826. doi: 10.1111/tbed.13267 [DOI] [PubMed] [Google Scholar]
  • 93.De’ath G. Boosted trees for ecological modeling and prediction. Ecology. 2007;88: 243–251. doi: 10.1890/0012-9658(2007)88[243:btfema]2.0.co;2 [DOI] [PubMed] [Google Scholar]
  • 94.Effrosynidis D, Tsikliras A, Arampatzis A, Sylaios G. Species Distribution Modelling via Feature Engineering and Machine Learning for Pelagic Fishes in the Mediterranean Sea. Applied Sciences 2020, Vol 10, Page 8900. 2020;10: 8900. doi: 10.3390/APP10248900 [DOI] [Google Scholar]
  • 95.Elith J H. Graham C, P Anderson R, Dudík M, Ferrier S, Guisan A, et al. Novel methods improve prediction of species’ distributions from occurrence data. Ecography. 2006;29: 129–151. doi: [DOI] [Google Scholar]
  • 96.Chollet Ramampiandra E, Scheidegger A, Wydler J, Schuwirth N. A comparison of machine learning and statistical species distribution models: Quantifying overfitting supports model interpretation. Ecol Modell. 2023;481: 110353. doi: 10.1016/J.ECOLMODEL.2023.110353 [DOI] [Google Scholar]
  • 97.Crimmins SM, Dobrowski SZ, Mynsberge AR. Evaluating ensemble forecasts of plant species distributions under climate change. Ecol Modell. 2013;266: 126–130. doi: 10.1016/J.ECOLMODEL.2013.07.006 [DOI] [Google Scholar]
  • 98.Randin CF, Dirnböck T, Dullinger S, Zimmermann NE, Zappa M, Guisan A. Are niche-based species distribution models transferable in space? The Authors Journal compilation a. 2006;33: 1689–1703. doi: 10.1111/j.1365-2699.2006.01466.x [DOI] [Google Scholar]
  • 99.Merow C, Smith MJ, Edwards TC, Guisan A, Mcmahon SM, Normand S, et al. What do we gain from simplicity versus complexity in species distribution models? Ecography. 2014;37: 1267–1281. doi: 10.1111/ECOG.00845 [DOI] [Google Scholar]
  • 100.Bosch J, Iglesias I, Muñoz MJ, de la Torre A. A Cartographic Tool for Managing African Swine Fever in Eurasia: Mapping Wild Boar Distribution Based on the Quality of Available Habitats. Transbound Emerg Dis. 2017;64: 1720–1733. doi: 10.1111/tbed.12559 [DOI] [PubMed] [Google Scholar]
  • 101.Vilaça ST, Biosa D, Zachos F, Iacolina L, Kirschning J, Alves PC, et al. Mitochondrial phylogeography of the European wild boar: the effect of climate on genetic diversity and spatial lineage sorting across Europe. J Biogeogr. 2014;41: 987–998. doi: 10.1111/JBI.12268 [DOI] [Google Scholar]
  • 102.Colomer J, Massei G, Roos D, Rosell C, Rodríguez-Teijeiro JD. What drives wild boar density and population growth in Mediterranean environments? Science of The Total Environment. 2024;931: 172739. doi: 10.1016/j.scitotenv.2024.172739 [DOI] [PubMed] [Google Scholar]
  • 103.Calosi M, Gabbrielli C, Lazzeri L, Fattorini N, Cesaretti G, Burrini L, et al. Seasonal and Ecological Determinants of Wild Boar Rooting on Priority Protected Grasslands. Environ Manage. 2024. [cited 24 Jun 2024]. doi: 10.1007/s00267-024-01952-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Bisi F, Chirichella R, Chianucci F, Von Hardenberg J, Cutini A, Martinoli A, et al. Climate, tree masting and spatial behaviour in wild boar (Sus scrofa L.): insight from a long-term study. Ann For Sci. 2018;75: 1–9. doi: 10.1007/S13595-018-0726-6/FIGURES/3 [DOI] [Google Scholar]
  • 105.Frauendorf M, Gethöffer F, Siebert U, Keuling O. The influence of environmental and physiological factors on the litter size of wild boar (Sus scrofa) in an agriculture dominated area in Germany. Sci Total Environ. 2016;541: 877–882. doi: 10.1016/j.scitotenv.2015.09.128 [DOI] [PubMed] [Google Scholar]
  • 106.Fernández-Llario P, Carranza J. Reproductive performance of the wild boar in a Mediterranean ecosystem under drought conditions. Ethol Ecol Evol. 2000;12: 335–343. doi: 10.1080/08927014.2000.9522791 [DOI] [Google Scholar]
  • 107.Fonseca C, Alves Silva A, Alves J, Vingada J, V M Soares AM, Alves da Silva A. Reproductive performance of wild boar females in Portugal. Eur J Wildl Res. 2010;57: 363–371. doi: 10.1007/s10344-010-0441-6ï [DOI] [Google Scholar]
  • 108.Rosell C, Navàs F, Romero S. Reproduction of wild boar in a cropland and coastal wetland area: implications for management. Anim Biodivers Conserv. 2012;35: 209–217. doi: 10.32800/ABC.2012.35.0209 [DOI] [Google Scholar]
  • 109.Beck J, Böller M, Erhardt A, Schwanghart W. Spatial bias in the GBIF database and its effect on modeling species’ geographic distributions. Ecol Inform. 2014;19: 10–15. doi: 10.1016/J.ECOINF.2013.11.002 [DOI] [Google Scholar]
  • 110.Perez-Navarro MA, Broennimann O, Esteve MA, Moya-Perez JM, Carreño MF, Guisan A, et al. Temporal variability is key to modelling the climatic niche. Divers Distrib. 2021;27: 473–484. doi: 10.1111/DDI.13207 [DOI] [Google Scholar]
  • 111.Soberón J, Nakamura M. Niches and distributional areas: Concepts, methods, and assumptions. Proc Natl Acad Sci U S A. 2009;106: 19644–19650. doi: 10.1073/pnas.0901637106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Ficetola GF, Bonardi A, Mücher CA, Gilissen NLM, Padoa-Schioppa E. How many predictors in species distribution models at the landscape scale? Land use versus LiDAR-derived canopy height. International Journal of Geographical Information Science. 2014;28: 1723–1739. doi: 10.1080/13658816.2014.891222 [DOI] [Google Scholar]
  • 113.Rodeiro CLV, Lawson AB. An evaluation of the edge effects in disease map modelling. Comput Stat Data Anal. 2005;49: 45–62. doi: 10.1016/J.CSDA.2004.05.012 [DOI] [Google Scholar]
  • 114.Ripley BD. Modelling Spatial Patterns. J R Stat Soc Series B Stat Methodol. 1977;39: 172–192. doi: 10.1111/J.2517-6161.1977.TB01615.X [DOI] [Google Scholar]
  • 115.Hemmingmoore H, Aronsson M, Åkesson M, Persson J, Andrén H. Evaluating habitat suitability and connectivity for a recolonizing large carnivore. Biol Conserv. 2020;242: 108352. doi: 10.1016/J.BIOCON.2019.108352 [DOI] [Google Scholar]
  • 116.Araújo MB, New M. Ensemble forecasting of species distributions. Trends Ecol Evol. 2007;22: 42–47. doi: 10.1016/j.tree.2006.09.010 [DOI] [PubMed] [Google Scholar]
  • 117.Hao T, Elith J, Guillera-Arroita G, Lahoz-Monfort JJ. A review of evidence about use and performance of species distribution modelling ensembles like BIOMOD. Divers Distrib. 2019;25: 839–852. doi: 10.1111/DDI.12892 [DOI] [Google Scholar]
  • 118.Cadenas-Fernández E, Sánchez-Vizcaíno JM, Pintore A, Denurra D, Cherchi M, Jurado C, et al. Free-Ranging Pig and Wild Boar Interactions in an Endemic Area of African Swine Fever. Front Vet Sci. 2019;6: 481555. doi: 10.3389/FVETS.2019.00376/BIBTEX [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Costard S, Wieland B, De Glanville W, Jori F, Rowlands R, Vosloo W, et al. African swine fever: how can global spread be prevented? Philosophical Transactions of the Royal Society B: Biological Sciences. 2009;364: 2683. doi: 10.1098/RSTB.2009.0098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Gervasi V, Sordilli M, Loi F, Guberti V. Estimating the Directional Spread of Epidemics in Their Early Stages Using a Simple Regression Approach: A Study on African Swine Fever in Northern Italy. Pathogens. 2023;12. doi: 10.3390/pathogens12060812 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Cukor J, Linda R, Václavek P, Mahlerová K, Šatrán P, Havránek F. Confirmed cannibalism in wild boar and its possible role in African swine fever transmission. Transbound Emerg Dis. 2020;67: 1068–1073. doi: 10.1111/tbed.13468 [DOI] [PubMed] [Google Scholar]
  • 122.Probst C, Gethmann J, Amler S, Globig A, Knoll B, Conraths FJ. The potential role of scavengers in spreading African swine fever among wild boar. Scientific Reports 2019 9:1. 2019;9: 1–13. doi: 10.1038/s41598-019-47623-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Cukor J, Linda R, Václavek P, Šatrán P, Mahlerová K, Vacek Z, et al. Wild boar deathbed choice in relation to ASF: Are there any differences between positive and negative carcasses? Prev Vet Med. 2020;177. doi: 10.1016/j.prevetmed.2020.104943 [DOI] [PubMed] [Google Scholar]
  • 124.Servanty S, Gaillard JM, Ronchi F, Focardi S, Baubet É, Gimenez O. Influence of harvesting pressure on demographic tactics: implications for wildlife management. Journal of Applied Ecology. 2011;48: 835–843. doi: 10.1111/J.1365-2664.2011.02017.X [DOI] [Google Scholar]
  • 125.Keuling O, Lauterbach K, Stier N, Roth M. Hunter feedback of individually marked wild boar Sus scrofa L.: Dispersal and efficiency of hunting in northeastern Germany. Eur J Wildl Res. 2010;56: 159–167. doi: 10.1007/S10344-009-0296-X/TABLES/3 [DOI] [Google Scholar]
  • 126.Keuling O, Baubet E, Duscher A, Ebert C, Fischer C, Monaco A, et al. Mortality rates of wild boar Sus scrofa L. in central Europe. Eur J Wildl Res. 2013;59: 805–814. doi: 10.1007/S10344-013-0733-8/FIGURES/3 [DOI] [Google Scholar]
  • 127.Altizer S, Bartel R, Han BA. Animal migration and infectious disease risk. Science. 2011;331: 296–302. doi: 10.1126/science.1194694 [DOI] [PubMed] [Google Scholar]
  • 128.Gervasi V, Gubertì V. Combining hunting and intensive carcass removal to eradicate African swine fever from wild boar populations. Prev Vet Med. 2022;203. doi: 10.1016/j.prevetmed.2022.105633 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129.Jongman RHG, Jones-Walters L. The Pan European Ecological Network: PEEN. Landsc Ecol. 2011. [cited 24 Jun 2024]. Available: https://www.academia.edu/5697122/The_Pan_European_Ecological_Network_PEEN [Google Scholar]
  • 130.Newbold T, Hudson LN, Hill SLL, Contu S, Lysenko I, Senior RA, et al. Global effects of land use on local terrestrial biodiversity. Nature 2015 520:7545. 2015;520: 45–50. doi: 10.1038/nature14324 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Francesco Bisi

17 Sep 2024

PONE-D-24-33277Habitat suitability mapping and landscape connectivity analysis to predict African swine fever spread in wild boar population: a focus on Northern ItalyPLOS ONE

Dear Dr. Faustini,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Dear Authors, the topic of the manuscript is interesting, and you are working with a valuable dataset. However, as highlighted by the reviewers, all sections require improvements. While ensemble modeling can be a powerful tool, it may also lead to misinterpretations if not applied properly. Simply adding more statistics does not necessarily result in better outcomes. Morevoer, since the main objective of your manuscript focuses on connectivity and the role of corridors, these aspects should be discussed in greater detail. I encourage you to carefully address the reviewers' constructive comments.

Please submit your revised manuscript by Nov 01 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Francesco Bisi, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

1. When submitting your revision, we need you to address these additional requirements.

Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for stating in your Funding Statement: 

The research project was made possible by the financial support provided to G.F. by the Società Italiana di Patologia ed Allevamento dei Suini (SIPAS, https://www.sipas.org/chi-siamo/) and by the Erasmus + for traineeship mobility program (https://erasmus-plus.ec.europa.eu/opportunities/opportunities-for-individuals/students/traineeships-abroad-for-students). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Please provide an amended statement that declares *all* the funding or sources of support (whether external or internal to your organization) received during this study, as detailed online in our guide for authors at http://journals.plos.org/plosone/s/submit-now.  Please also include the statement “There was no additional external funding received for this study.” in your updated Funding Statement. 

Please include your amended Funding Statement within your cover letter. We will change the online submission form on your behalf.

3. Please note that your Data Availability Statement is currently missing the repository name and the DOI/accession number of each dataset OR a direct link to access each database. If your manuscript is accepted for publication, you will be asked to provide these details on a very short timeline. We therefore suggest that you provide this information now, though we will not hold up the peer review process if you are unable.

4. When completing the data availability statement of the submission form, you indicated that you will make your data available on acceptance. We strongly recommend all authors decide on a data sharing plan before acceptance, as the process can be lengthy and hold up publication timelines. Please note that, though access restrictions are acceptable now, your entire data will need to be made freely accessible if your manuscript is accepted for publication. This policy applies to all data except where public deposition would breach compliance with the protocol approved by your research ethics board. If you are unable to adhere to our open data policy, please kindly revise your statement to explain your reasoning and we will seek the editor's input on an exemption. Please be assured that, once you have provided your new statement, the assessment of your exemption will not hold up the peer review process.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I appreciate the effort of the authors to pull together this information, however, evaluating all possible models in the Biomod2 package does not seem like an ideal study design. Even more confusing is the reason for choosing GBM as more preferred. The authors have a suitable dataset to use a few models for their purposes. With their knowledge and data available, a model or even a few could be chosen but certainly 11 models are not needed to be evaluated. Then we have to delve into the evaluation criteria for why the other 8 models are not suitable which just detracts from the focus of your study design..predicting ASF across the landscape.

In my comments below, I tried to assist the authors on how to address some of these issue so their objectives and methods are more clear and support their findings with some specific comments by line:

Title: Habitat suitability mapping and landscape connectivity analysis to predict African swine fever spread in wild boar population: a focus on northern Italy. I believe populations should be plural or place an “a” before “wild boar” if authors are suggesting this is one single wild population throughout their study site where records are documented?

Introduction

Lines 44 and 51: Defining African swine fever as ASF and African swine fever virus as ASFV seems confusing and unnecessary. The authors then go into ASFV genotype I and ASFV genotype II but reference ASF to discuss the disease in general terms. It is difficult for the reader to know which is meant or which is most appropriate each time either are used. I would suggest do not define ASFV and refer to these as genotype I and genotype II each time it is needed. Then use ASF when discussing the disease in general. Furthermore, there is no mention of these in the remainder of the manuscript aside from ASF.

Lines Line 116: “different wild boar presence data types” is very confusing as written. I would suggest “Presence/absence data for wild boar differs considerably, although often in relation to a common/similar set of environmental variables describing topography, climate, human disturbance and land cover [29].” The authors then need to cite more than one manuscript if these types do differ in the literature.

Line 122: ENET needs to be spelled out prior to using an acronym because most won’t be familiar with this term.

Line 128: “..dispose of..” I am not sure what this means? This entire sentence is very confusing and might be better split into 2 separate sentences.

Materials and methods

Line 177: Supplement Table S1. This information is often overlooked and I appreciate this table. This is the most complete and detailed variable table I have ever reviewed, thank you for this great summary table!

Lines 170-188: A few points of clarity would be helpful in this section.

1. Density – are these within a 100 x 100 m raster that lines up with all of your rasters? If not, how do you get percentage assigned to each raster cell for bare, herbaceous, and tree cover?

2. Bilinear sampling of 30x30 m raster to derive 100x100m rasters? If so, this should be presented in your “percentage of forest” in your S1 Table.

3. By “overall mean” do you mean “annual mean” for each year from 2014-2023 or a single static 10 year mean from all years: 2014-2023?

a. A mean for a 10 year window, if that is what was done, is of no value so please be clear here. Each year of wild boar data should be matched to variables when available. Specifically, NDVI could have massive fluctuations from year to year so a 10 year average does not seem acceptable in any way?

4. What are the “seasons” based on? Some citations for seasons in northern Italy would be advisable.

Lines 195-197: What variables were removed due to collinearity and multicollinearity? The authors state in line 213 that “Only non-collinear variables were incorporated” but these should be listed here in the methods if they were not included. Then delete Line 213? I see now these are in Table 1 (better to reference in Methods) but this table can be deleted. Simply include a sentence of which are not included because they don’t meet assumptions of collinearity/multicollinearity (i.e., opposite of Lines 328-332) in the methods not results.

Lines 227-232: In your introduction, the authors discuss that SDMs can be implemented based on data available and aim of the study. However, here the authors appear to go on a habitat suitability modeling exploration with everything in Biomod2 (11 methods) which is inappropriate and unnecessary post hoc analysis that goes against the authors own introduction. Why not select the best model that fits your data and use it instead of evaluating which is best based on some criteria that we really are not clear on? More on this later.

Results

Line 341: Here the authors refer to “annual” not “overall” leading to confusion on just how these were calculated?

Line 347: I am a bit confused here. In the Introduction, the authors were recommending an “ensemble modelling approach” but here they are saying just use one, GBM? So this entire effort of evaluating all of these models was not necessary?

Line 348: See my comments above for Lines 227-232 and Line 347. The authors claim GBM was preferred due to “easier parameterization” and “more realistic performance metrics?” I am not sure how you define this or justify it being a criteria for selecting the model chosen?

Line 347: What is a “privileged area?”

Figure 1-4: These figures appear blurry and may not be up to the quality required by the journal.

Figure 3: It seems to me that the suitability of habitat lines up with where the presence data is located for this region, regardless of season?

Figure 4: It is difficult to determine this figure for panel A. I suggest you zoom in on the connectivity map like in B and C but replace those panels with connectivity values instead of binary risk?

Discussion

Lines 4342-436: I am not sure the authors succeeded here based on my point above that the suitability and connectivity seem to follow the presence data? Using a 100 m resolution also seems too fine a scale considering the size of their study area. There may be too similar characteristics across the study area at the 100 m scale thus masking the variables that were really influencing wild boar distribution across your landscape? A larger scale, with your variables summarized to each raster cell (500m or 1000 m) might provide more value and identify more influential variables in your models?

Lines 442-449: Agreed, does the lack of low suitability in Po Valley make sense for how wild boar select habitat in Europe? Are there any detailed VHF or GPS data collected in this region to determine if wild boar prefer agriculture-forest valleys compared to forested, mountainous terrain?

Lines 484-486: The authors acknowledge the temporal pattern but it was not clear how these were modeled with “time” as stated in the methods? All climate variables are available daily, weekly, monthly correct? However, the authors used a static annual mean or 3-4 months seasonal mean which likely rendered these variables of limited value in their assessment. These climate variables and NDVI could have avoided these pitfalls the author outline if they were evaluated differently here?

Reviewer #2: Faustini et al. presented a ms entitled “Habitat suitability mapping and landscape connectivity analysis to predict African swine fever spread in wild boar population: a focus on Northern Italy” for publication in PLOS ONE. The paper deals with a relevant and interesting topic: a highly contagious disease affecting wild and domestic pigs (ASF) reported in Eastern Europe and, in January 2022, in North-western Italy. Thereafter, the expanding wild epidemic, has posed serious management problems related to the presence of Italian pig farms and trade.

I enjoyed reading this ms and I think that this study is a valuable contribution to improve the knowledge/management of this species.

However, despite the interesting topic and the potential of management implications for the target species (i.e., mapping of suitable habitats for wild boars and their potential dispersal corridors in Northern Italy, could be an important issue in preventing the ASF expansion), I believe that a revision is required (please, see my comments below).

General comments:

1) A first flaw that I found concerns the absence of an accurate description of ASF Genotype II expansion (speed, population crash, corridors) in the Eastern European countries. I believe this is a fundamental point: to better introduce the issue, the authors should describe the situation of the last 10 years in Eastern Europe in detail, also in relation to the economic impact of this epidemic.

2) I think the ms would gain more value with a reorganization of the discussion section. I suggest dealing first with the description of the choices made by wild boar, then the description of the potential expansion corridors (with details on the most important directions and their characteristics) and finally an analysis of the possible management and surveillance interventions. These three parts are already present in the discussion but should be better outlined, especially the part related to connectivity.

3) A detailed explanation/comparison of seasonal models is also missing. How does expansion relate to seasons?

4) The manuscript often reports the names of valleys, rivers, mountain areas with reference to the figures (e.g., Fig. 3 - L. 442-446) but these are not reflected in the maps. For a reader who is not familiar with these areas, it is difficult to interpret the results. Supplementary material with more detailed maps could be added.

5) Also the “discontinuous natural zones” (L. 471) should be represented in a map, according to their importance in connectivity

Minor comments:

Abstract: I suggest to delete the description of ASF Genotype I that is not the target of this ms.

L.439-441: “In addition to the higher resolution of the maps presented here compared to those found in the literature, the influence of each variable on wild boar presence was also detailed.”

Please include the references of the studies compared with the results obtained in this research.

L. 507-510: I think that high-traffic corridors need to be described: characteristics, directions and relationships with existing farms.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2025 Jan 30;20(1):e0317577. doi: 10.1371/journal.pone.0317577.r002

Author response to Decision Letter 0


5 Nov 2024

Response to Academic Editor

Dear Authors,

the topic of the manuscript is interesting, and you are working with a valuable dataset. However, as highlighted by the reviewers, all sections require improvements. While ensemble modeling can be a powerful tool, it may also lead to misinterpretations if not applied properly. Simply adding more statistics does not necessarily result in better outcomes. Morevoer, since the main objective of your manuscript focuses on connectivity and the role of corridors, these aspects should be discussed in greater detail. I encourage you to carefully address the reviewers' constructive comments.

A: We thank the academic editor for the positive feedback and for highlighting key areas for improvement. We have addressed each of the reviewers' comments and incorporated their suggestions into the revised manuscript. Specifically, we have better clarified our approach, clarifying its application and limitations to avoid any potential misinterpretations. Additionally, as recommended, we have enriched the discussion of connectivity and corridor roles, detailing how these features impact wild boar movement and potential disease spread across the landscape.

Moreover, the following adjustments were made in figures, tables based on the reviewers' valuable suggestions:

• Figure 3: We now present only the overall habitat suitability map, with seasonal results provided separately as S1 Fig.

• Figure 4: As suggested by Reviewer 1, we included a zoomed-in view of infected areas in the connectivity map to enhance focus on regions most affected by African swine fever.

• S2 Fig: We added a new figure illustrating detailed geographical context, addressing Reviewer 2’s request.

• Table 1: We clarified the methodology by adding a description of the standardization process for the raster data, as suggested by Reviewer 1.

• Data Availability: Following the suggestions for data transparency, we have made all associated scripts (S1 Files) and original raster files (S2 Files) available via GitLab and Google Drive links, with the files set to be publicly accessible upon acceptance of the manuscript.

• Corresponding Author Contact: We included an additional email for the corresponding author to accommodate a potential upcoming transition to a new institutional affiliation at the University of Padova.

• Funding Statement: the funding acknowledgment now reads: "The research project was made possible by the financial support provided to G.F. by the Società Italiana di Patologia ed Allevamento dei Suini (SIPAS, https://www.sipas.org/chi-siamo/) and by the Erasmus + for traineeship mobility program (https://erasmus-plus.ec.europa.eu/opportunities/opportunities-for-individuals/students/traineeships-abroad-for-students). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. There was no additional external funding received for this study."

We hope to have addressed each of the reviewers' feedback comprehensively, and we thank them for helping us in refining the manuscript. We look forward to your further feedback.

Response to Reviewers

Reviewer #1: I appreciate the effort of the authors to pull together this information, however, evaluating all possible models in the Biomod2 package does not seem like an ideal study design. Even more confusing is the reason for choosing GBM as more preferred. The authors have a suitable dataset to use a few models for their purposes. With their knowledge and data available, a model or even a few could be chosen but certainly 11 models are not needed to be evaluated. Then we have to delve into the evaluation criteria for why the other 8 models are not suitable which just detracts from the focus of your study design..predicting ASF across the landscape.

A: We thank the reviewer for the valuable comments, and hope that our revisions will address his/her concerns effectively. Please see below.

In my comments below, I tried to assist the authors on how to address some of these issue so their objectives and methods are more clear and support their findings with some specific comments by line:

A: We would like to sincerely thank the reviewer for the insightful and constructive comments. We truly appreciate the time and effort taken to provide suggestions on our manuscript.

Title: Habitat suitability mapping and landscape connectivity analysis to predict African swine fever spread in wild boar population: a focus on northern Italy. I believe populations should be plural or place an “a” before “wild boar” if authors are suggesting this is one single wild population throughout their study site where records are documented?

A: We thank the reviewer for the suggestion. We modified the title accordingly, to improve it clarity.

Introduction

Lines 44 and 51: Defining African swine fever as ASF and African swine fever virus as ASFV seems confusing and unnecessary. The authors then go into ASFV genotype I and ASFV genotype II but reference ASF to discuss the disease in general terms. It is difficult for the reader to know which is meant or which is most appropriate each time either are used. I would suggest do not define ASFV and refer to these as genotype I and genotype II each time it is needed. Then use ASF when discussing the disease in general. Furthermore, there is no mention of these in the remainder of the manuscript aside from ASF.

A: We thank the reviewer for the valuable suggestion to improve the clarity of our manuscript. Following reviewer’s suggestion, the introduction was structured presenting at the beginning only the disease as caused by the virus, and only later the virus with its genotypes. We preferred to keep the distinction between genotype I and II, to present the different epidemiological scenarios of Sardinia Island compared to the rest of Europe and Italy (mainland). However, we acknowledge reviewer observation that ASFV genotypes I and II are not referenced in the remainder of the manuscript. In the revised version, we ensured that any reference to ASF cases specifies the genotype II, including a general statement in the “model evaluation” section in line 333-334.

Lines Line 116: “different wild boar presence data types” is very confusing as written. I would suggest “Presence/absence data for wild boar differs considerably, although often in relation to a common/similar set of environmental variables describing topography, climate, human disturbance and land cover [29].” The authors then need to cite more than one manuscript if these types do differ in the literature.

A: We thank the reviewer for the suggestion. We modified the manuscript accordingly, citing also more articles.

Line 122: ENET needs to be spelled out prior to using an acronym because most won’t be familiar with this term.

A: We acknowledge that this term may appear to be an acronym, but it is, in fact, the official name of the project/consortium. Unfortunately, there is no acronym definition provided by the European Food Safety Authority (EFSA). Instead, EFSA defines ENETWILD as an international network of wildlife professionals, which we had described in line 142-143 as "an international network of wildlife professionals supported by the European Food Safety Agency." We apologize for any confusion and hope this clarification resolves the issue.

Line 128: “..dispose of..” I am not sure what this means? This entire sentence is very confusing and might be better split into 2 separate sentences.

A: We apologize for the misleading sentence. The sentence has been revised to enhance its clarity. Specifically, we have replaced "dispose of" with "would benefit from" and have split the sentence into two parts for better readability (see lines 148-151).

Materials and methods

Line 177: Supplement Table S1. This information is often overlooked and I appreciate this table. This is the most complete and detailed variable table I have ever reviewed, thank you for this great summary table!

A: We sincerely thank the reviewer for appreciating our work and effort! We wanted to provide something helpful for improving the reproducibility of our methods.

Lines 170-188: A few points of clarity would be helpful in this section.

1. Density – are these within a 100 x 100 m raster that lines up with all of your rasters? If not, how do you get percentage assigned to each raster cell for bare, herbaceous, and tree cover?

A: Yes, all the rasters were standardized to 100x100m grid cells using a bilinear sampling method , including the coverage layers, as explained in lines 200-203. To avoid confusion, we add this specification also in the description of each variable in S1 Table.

2. Bilinear sampling of 30x30 m raster to derive 100x100m rasters? If so, this should be presented in your “percentage of forest” in your S1 Table.

A: All rasters were standardized to 100x100m grid cells, as explained in lines 200-203 in material and methods section. We are not sure what the reviewer refers to as there aren’t any variables that were standardized from 30x30 m raster to derive 100x100m raster. However, to avoid confusion we add the specification “Bilinear sampling method was applied to standardize the variable as raster with a pixel size of 100 m x 100 m, over an extent equal to the study area.” also in the description of each variable in S1 Table. We thank the reviewer for the opportunity to clarify better our methodology to the reader.

3. By “overall mean” do you mean “annual mean” for each year from 2014-2023 or a single static 10 year mean from all years: 2014-2023?

a. A mean for a 10 year window, if that is what was done, is of no value so please be clear here. Each year of wild boar data should be matched to variables when available. Specifically, NDVI could have massive fluctuations from year to year so a 10 year average does not seem acceptable in any way?

A: We thank the reviewer for highlighting this important point. To clarify, when we refer to "overall mean," we are specifically addressing environmental/climatic variables, which represent long-term environmental patterns rather than short-term weather fluctuations. In niche modeling, the objective is to capture stable climatic conditions that shape habitat suitability over extended periods, rather than ephemeral weather-driven variations. Generally, climatic data spanning even 30 years is standard practice for such analyses.

Therefore, we respectfully disagree with the suggestion that a 10-year mean holds no value in this context. We believe that our approach is well-suited to the scale and aims of our study, focused more on the general suitability of the landscape for wild boars (which is a highly adaptable species), rather than the year-to-year variability that might arise from short-term shifts.

Our approach, which involves calculating an overall mean for the 2014–2023 period, was chosen deliberately to smooth out short-term fluctuations and provide a more stable representation of the environmental/climatic conditions affecting wild boar populations over time. While annual data may indeed capture year-to-year variability, such as NDVI fluctuations, our primary focus was on long-term trends and consistent drivers of wild boar presence. Using a 10-year mean allowed us to generalize the broader environmental context, providing a more robust measure of habitat conditions that is less sensitive to short-term fluctuations. Furthermore, it is not uncommon in ecological and environmental studies to use multi-year averages for environmental/climatic variables, particularly considering data availability and when the goal is to account for general patterns over an extended time frame rather than year-to-year variability (here some examples: https://doi.org/10.1371/journal.pone.0193295, https://doi.org/10.1371/journal.pone.0176339, https://doi.org/10.1007/s10980-021-01371-y).

That said, we acknowledge the limitations of our method, particularly as pointed out by the reviewer and that using year-specific NDVI could enhance temporal precision in future studies focused on annual fluctuations and dynamics. We added this points in the manuscript in line 208-210 and 514-522.

4. What are the “seasons” based on? Some citations for seasons in northern Italy would be advisable.

A: The seasons were defined based on the meteorological convention, which divides the year into four three-month periods: December, January, and February for winter; March, April, and May for spring; June, July, and August for summer; and September, October, and November for autumn. This method does not account for shorter units of time, such as weeks or days, and was used for its simplicity and convenience. Unlike the astronomical definition, which aligns the seasons with solstices and equinoxes, the meteorological definition directly reflects temperature and weather patterns, which are more relevant for understanding seasonal dynamics in wild boar distribution. We clarified this choice in the manuscript and provide references to support the use of this definition (line 211-213).

Lines 195-197: What variables were removed due to collinearity and multicollinearity? The authors state in line 213 that “Only non-collinear variables were incorporated” but these should be listed here in the methods if they were not included. Then delete Line 213? I see now these are in Table 1 (better to reference in Methods) but this table can be deleted. Simply include a sentence of which are not included because they don’t meet assumptions of collinearity/multicollinearity (i.e., opposite of Lines 328-332) in the methods not results.

A: We thank the reviewer for the helpful feedback. We have moved the description of the included variables, as well as the corresponding Table 1, into the methods section to improve clarity (lines 224-230). We acknowledge that Table 1 could seem a repetition with the text. However, we decided to maintain Table 1 as we think it provides a clear and immediate overview of the variables, making it easier for readers to reference at a glance. Keeping the table also allows us to maintain a logical and seamless flow in the result subsection "presence data and environmental variables" referring to the methods, ensuring the text remains easy to follow. We appreciate reviewer’s input in helping to enhance the clarity and linearity of the manuscript.

Lines 227-232: In your introduction, the authors discuss that SDMs can be implemented based on data available and aim of the study. However, here the authors appear to go on a habitat suitability modeling exploration with everything in Biomod2 (11 methods) which is inappropriate and unnecessary post hoc analysis that goes against the authors own introduction. Why not select the best model that fits your data and use it instead of evaluating which is best based on some criteria that we really are not clear on? More on this later.

A: We apologize for the confusion and thank the reviewer for the valuable comment. About evaluating all possible models in the Biomod2 package, we completely understand that this may seem excessive. However, since behind the choice of a model there are many considerations and assumptions, choosing the best performing model by comparing multiple ones is an accepted approach when SDMs are considered (https://doi.org/10.1002/ece3.6859; https://doi.org/10.1111/1749-4877.12000; https://doi.org/10.1007/s10980-021-01371-y; https://doi.org/10.1016/j.ecoleng.2022.106725). This rationale was suggested also by ENETWILD report (https://doi.org/10.2903/sp.efsa.2018.EN-1490), at page 12: “In order to provide a best possible output we would argue that all potential models should be fitted and compared to limit the risk of artefactual outputs as a result of model-specific assumptions”. Comparing a variety of models allowed us to identify the best-performing one for our dataset. We wanted to avoid making assumptions about which models would be most suitable, and instead, allow the data to guide the selection process. We added a specification in the discussion section in line 560-561 to support our rationale.

However, we agree on the confusing introduction. Thus, we modified it accordingly, also with some references, in line 140-145.

Results

Line 341: Here

Attachment

Submitted filename: Response to Reviewers.docx

pone.0317577.s009.docx (57.5KB, docx)

Decision Letter 1

Francesco Bisi

4 Dec 2024

PONE-D-24-33277R1Habitat suitability mapping and landscape connectivity analysis to predict African swine fever spread in wild boar population: a focus on Northern ItalyPLOS ONE

Dear Dr. Faustini,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

ACADEMIC EDITOR: the manuscript requires minor revisions. Please carefully follow my comments below. 

==============================

Please submit your revised manuscript by Jan 18 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Francesco Bisi, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Additional Editor Comments:

Dear Authors,

I would like to thank you for considering and addressing most of the reviewers' comments. I believe the manuscript contributes significantly to the understanding of a very interesting and sensitive subject.

I must say that given the wide range of potential applications for suitability models, there may never be a "perfect" model or an entirely "wrong" one. Furthermore, due to the complexities of wild boar ecology, modeling its distribution will always present challenges. This species in Europe can be found in very different habitats, ranging from urban areas to mountainous regions up to 2000 m a.s.l. Initially, as Reviewer 1, I was skeptical about the use of NDVI data averaged over a ten-year period. However, I now realize that investigating the subject in greater detail would involve a vast array of additional covariates (e.g., beech and chestnut masting, snow cover, predator densities, hunting pressure), making the endeavor endless. Thus, I consider your selection of covariates appropriate.

However I would like to request a few additional improvements:

As suggested by Reviewer 1, please carefully confirm the number of variables used in the modeling process 11 or 13? Additionally, I agree with Reviewer 1 that Table 1 does not add substantial value to the main text and should be moved to the supplementary material.

Please carefully review the references, as suggested by Reviewer 1. For example, while working with grids in R, I currently use the "Terra" package, but I understand that the "Raster" package is still functional for older versions of R. Kindly verify which package was utilized and ensure the references reflect this accurately.

While reading the manuscript, I felt that the seasonal models were not well integrated into the overall narrative. It would be helpful to clarify why you chose to model seasonal distributions and to better explain why the finding of increased suitability in spring could be of any interest. Does this have any management implications?

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I appreciate the effort of the authors to pull together this information, the manuscript has been improved considerably in the Introduction specifically, and I appreciate the authors efforts in addressing reviewer concerns. There were still a few concerns that were not addressed and one that I overlooked that I will go into detail below:

Materials and methods

Lines 215-217: What is reference 67 actually referring to here? There is no mention of a correlation tree or cluster dendrogram or raster package in the document? How can the raster package have been used when it was removed from CRAN for use in program R in December 2023? It appears the authors are citing a review (Anderson; Journal of Experimental Marine Biology and Ecology 289(2):303–305) of the book:

Analysis of Ecological Communities: Bruce McCune and James B. Grace, MjM Software Design, Gleneden Beach, USA, 2002, ISBN 0 9721290 0 6, US$ 35 (Pbk) May 2003.

Regardless, none of these terms appear in the book either as far as I can tell.

Line 224: “The variables included in the study for all scenarios were 11:…” Do the authors mean “overall” here instead of “all” to align with text and table 1 that refers to all seasons combined (Line 207)?

Lines 224-228: In my previous review, I suggested “Simply include a sentence of which are not included because they don’t meet assumptions of collinearity/multicollinearity (i.e., opposite of Lines 328-332) in the methods not results.” The authors chose to not edit with my recommendations and also chose not to delete Table 1. Perhaps this time around I might provide a more compelling argument to deleting (or modifying) Table 1 due to the number of issues with it’s inclusion:

1. Out of 20 variables, 11 were acceptable to include in the overall analysis although Table 1 has x in 13 columns along with the total that confirms the 13 and not the 11?

2. The authors list the 11 here as well as several that were not included in overall or seasonal models which, in turn, is presenting "results" in the Materials and Methods? There is also a section in the Results (Lines 365-368) that says out of the 20 variables that 13 were selected in the overall and seasonal suitability models that references Table 1. It is not clear why some of this analysis is in the Materials and Methods and some is in Results?

3. Lines 226-228 identifies several variables that were excluded in all scenarios (overall and seasonal, I presume), and interestingly, 2 of the variables (Temperature and NDVI) are those averaged over a 10 year span that I mentioned was not appropriate in my previous review (see below comments from my previous review and author response).

4. Table 1 repeats some of what is in text in both sections and I would still suggest that it be deleted. If the authors believe it has value, then why not include mean (SDs) for each variable during each season and overall instead of just an x? An entire page for a table with x’s that could be included in 4 sentences in the Materials and Methods (like Lines 226-228 so one line for overall and one line for each season) does not seem necessary and I don’t see how it helps the “flow” considering it is also presented in the Results?

From my previous review:

Lines 484-486 (original submission): The authors acknowledge the temporal pattern but it was not clear how these were modeled with “time” as stated in the methods? All climate variables are available daily, weekly, monthly correct? However, the authors used a static annual mean or 3-4 months seasonal mean which likely rendered these variables of limited value in their assessment. These climate variables and NDVI could have been match with each week-month-year presence data were recorded?

To which the authors responded:

A: While it is true that climate variables are available at more granular temporal resolutions (daily, weekly, and monthly), our decision to employ annual or seasonal means was driven by the objective of focusing on broader climate trends relevant to wild boar habitat suitability. We added a sentence in the discussion in line 556-559 to recognize that a more dynamic approach, utilizing daily or weekly variables, could provide a richer understanding of how seasonal variability may influence habitat selection over time. We thank the reviewer for highlighting this important aspect, as it encourages us to refine our methodologies and consider more nuanced evaluations in subsequent studies.

Reviewer second response: Again, I suggested the authors assign temperature and NDVI to the monthly or even seasonal means instead of a 10 year mean (overall or seasonal). The authors identified, in their response and in the manuscript that a “more dynamic approach, utilizing daily or weekly variables, could provide a richer understanding…” however the authors chose not to do it in their revision? Instead, they argue how it can be done in the future? Furthermore, do the authors realize that both Temperature and NDVI were not included in any models (Based on Table 1 multicollinearity) and Precipitation was include in 3 of the 5 scenarios only? Regardless of what past research claims, the authors have the ability to match these variables to the annual/seasonal means to match them to the data but chose not to do so. The authors might be better off removing them entirely in the manuscript and models considering they provide no value based on how they were presented in the models (perhaps due to static 10 year averages)? Based on how you present them in Table 1, Temp and NDVI were not included in any models, correct?

Supporting information, S1 Files: This Gitlab repository does not exist or is not accessible to the reviewer. The authors agreed that it should be available “upon acceptance,” however, not available to the reviewer? It seems that accepted or not, a GitLab repo would be valuable to document your work and provide this analysis as an archive for your efforts. Including this link in your revisions has little value if a reviewer is not able to access it?

Reviewer #2: This revised version has improved a lot and accounted for most of the problems I identified when reading the first version.

I think the authors have made a considerable effort to increase the quality of their ms.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2025 Jan 30;20(1):e0317577. doi: 10.1371/journal.pone.0317577.r004

Author response to Decision Letter 1


30 Dec 2024

Response to Editor and Reviewers

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

A: We thank for the reminder to review the reference list. We confirm that the manuscript does not contain any retracted papers. Based on Reviewer 1's comments, we have updated reference 67 (now reference 60) to accurately reflect the correct source. The details of this modification are described in our response to Reviewer 1.

Additional Editor Comments:

Dear Authors,

I would like to thank you for considering and addressing most of the reviewers' comments. I believe the manuscript contributes significantly to the understanding of a very interesting and sensitive subject.

I must say that given the wide range of potential applications for suitability models, there may never be a "perfect" model or an entirely "wrong" one. Furthermore, due to the complexities of wild boar ecology, modeling its distribution will always present challenges. This species in Europe can be found in very different habitats, ranging from urban areas to mountainous regions up to 2000 m a.s.l. Initially, as Reviewer 1, I was skeptical about the use of NDVI data averaged over a ten-year period. However, I now realize that investigating the subject in greater detail would involve a vast array of additional covariates (e.g., beech and chestnut masting, snow cover, predator densities, hunting pressure), making the endeavor endless. Thus, I consider your selection of covariates appropriate.

A: We thank the Editor for the kind words and the encouraging feedback. We appreciate the recognition of the challenges inherent in modeling the distribution of such an adaptable and widely dispersed species as the wild boar.

However I would like to request a few additional improvements:

As suggested by Reviewer 1, please carefully confirm the number of variables used in the modeling process 11 or 13?

A: Thank you for pointing this out. For each scenario, 13 variables were selected for the modeling process, 11 of which were common across all scenarios. We apologize for the misunderstanding and have revised the manuscript accordingly to make this point clearer (lines 225-230). We appreciate editor and reviewers’ attention and hope this resolves the confusion.

Additionally, I agree with Reviewer 1 that Table 1 does not add substantial value to the main text and should be moved to the supplementary material.

A: We now acknowledge that Table 1 does not add substantial value to the main text. Accordingly, we have moved Table 1 to the supplementary material as recommended (now S1 Table). We appreciate Editor and reviewer 1 input, which has helped improve the clarity and focus of the manuscript.

Please carefully review the references, as suggested by Reviewer 1. For example, while working with grids in R, I currently use the "Terra" package, but I understand that the "Raster" package is still functional for older versions of R. Kindly verify which package was utilized and ensure the references reflect this accurately.

A: We thank the editor for highlighting the importance of accurate referencing. Upon review, we realized that there was an oversight in our citation. We sincerely apologize for the confusion.

The functions we utilized for the correlation tree belong to the stats package in R, and not the Raster package as previously cited. We have corrected this in the manuscript and updated the references accordingly. Additionally, we have verified all other references to ensure their accuracy.

We greatly appreciate editor and reviewer 1 attention to detail, which has helped us improve the quality and precision of our work.

While reading the manuscript, I felt that the seasonal models were not well integrated into the overall narrative. It would be helpful to clarify why you chose to model seasonal distributions and to better explain why the finding of increased suitability in spring could be of any interest. Does this have any management implications?

A: We thank the editor for the observation and have revised the manuscript to better explain the rationale for modeling seasonal distributions.

We chose to model seasonal distributions to account for the variations in habitat use and resource availability for wild boars across seasons. We specified this choice in materials and methods section, in the lines 205-207 and lines 233-234. Specifically, the finding of increased suitability in spring suggests a period of heightened activity and resource exploitation, which is likely influenced by factors such as vegetation growth and reproductive cycles (line 494-500).

From a management perspective, these findings could inform targeted strategies, such as timing interventions to optimize hunting schedules, disease surveillance plans, and the implementation of biosecurity measures. To clarify this connection, we have added this specification to line 500-502 in the revised manuscript (discussion section).

We appreciate Editor feedback, which has allowed us to strengthen the narrative and highlight the practical implications of our findings.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: All comments have been addressed

________________________________________

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

________________________________________

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: Yes

________________________________________

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

________________________________________

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

________________________________________

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I appreciate the effort of the authors to pull together this information, the manuscript has been improved considerably in the Introduction specifically, and I appreciate the authors efforts in addressing reviewer concerns.

A: We thank the reviewer for the kind words, and we are pleased to read that the revisions have significantly improved the manuscript. We appreciated reviewer 1's thoughtful comments and constructive suggestions, which were instrumental in enhancing the quality of the manuscript.

There were still a few concerns that were not addressed and one that I overlooked that I will go into detail below:

A: We thank the reviewer for the comments. We carefully considered the points that reviewer 1 has raised, and provided a detailed response to each one below, trying to meet each request.

Materials and methods

Lines 215-217: What is reference 67 actually referring to here? There is no mention of a correlation tree or cluster dendrogram or raster package in the document? How can the raster package have been used when it was removed from CRAN for use in program R in December 2023? It appears the authors are citing a review (Anderson; Journal of Experimental Marine Biology and Ecology 289(2):303–305) of the book:

Analysis of Ecological Communities: Bruce McCune and James B. Grace, MjM Software Design, Gleneden Beach, USA, 2002, ISBN 0 9721290 0 6, US$ 35 (Pbk) May 2003.

Regardless, none of these terms appear in the book either as far as I can tell.

A: We thank the reviewer for pointing out the oversight regarding the reference now on lines 216–218. The reviewer is absolutely correct that there was an error in our citation. We sincerely apologize for the mistake and any confusion it may have caused.

The functions used for the correlation tree actually belong to the stats package in R, and we have corrected the manuscript to reflect this (line 218). As reported by “> citation("stats")” function: “The ‘stats’ package is part of R. To cite R in publications use: R Core Team (2023). _R: A Language and Environment for Statistical Computing_. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.”

Additionally, we have included the appropriate reference for the Variance Inflation Factor (VIF) to enhance reproducibility (line 221).

We are grateful for the reviewer’s careful review and for bringing this to our attention, allowing us to improve the accuracy and clarity of our work.

Line 224: “The variables included in the study for all scenarios were 11:…” Do the authors mean “overall” here instead of “all” to align with text and table 1 that refers to all seasons combined (Line 207)?

A: We apologize for the potential confusion regarding the use of "all" versus "overall". In the previous review, we verified that the term "overall" was consistently used to avoid ambiguity. In this section, we decided instead to retain "all" because it specifically refers to the fact that for each scenario, 13 variables were selected for the modeling process, 11 of which were common across all scenarios. Table 1 (now S1 Table) served instead as a visual summary to complement this explanation.

We apologize for any misunderstanding and have revised the manuscript to avoid further misunderstandings (line 225).

Lines 224-228: In my previous review, I suggested “Simply include a sentence of which are not included because they don’t meet assumptions of collinearity/multicollinearity (i.e., opposite of Lines 328-332) in the methods not results.” The authors chose to not edit with my recommendations and also chose not to delete Table 1. Perhaps this time around I might provide a more compelling argument to deleting (or modifying) Table 1 due to the number of issues with it’s inclusion:

A: We thank the reviewer for the thoughtful feedback and for providing a more detailed explanation regarding his/her previous suggestion. We apologize for not fully addressing the reviewer’s earlier recommendation, and we appreciate his/her patience in this matter.

We now understand the importance of simply including a sentence in the methods section to explain which variables were excluded due to assumptions of collinearity/multicollinearity. We have revised the manuscript accordingly, incorporating this clarification as suggested in lines 225-230.

Regarding Table 1, we have reconsidered your suggestion, and, after further evaluation of editor’s comment, we have decided to move Table 1 to the supplementary material to better align with the focus of the main text.

We are grateful for reviewer constructive comments, which have been instrumental in improving the clarity and quality of our manuscript.

1. Out of 20 variables, 11 were acceptable to include in the overall analysis although Table 1 has x in 13 columns along with the total that confirms the 13 and not the 11?

A: We are sorry for the confusion. To clarify, the 11 variables refer to those common across all models, including both overall and seasonal scenarios. The 13 variables, as indicated in Table 1 (now S1 Table), represent the total number of variables considered for each scenario, with 11 of them being shared across all models.

We modified the manuscript according to reviewer and editor feedbacks (line 225). We hope this clears up the confusion.

2. The authors list the 11 here as well as several that were not included in overall or seasonal models which, in turn, is presenting "results" in the Materials and Methods? There is also a section in the Results (Lines 365-368) that says out of the 20 variables that 13 were selected in the overall and seasonal suitability models that references Table 1. It is not clear why some of this analysis is in the Materials and Methods and some is in Results?

A: We understand the concern regarding the placement of certain details in the Materials and Methods and in the Results sections. To clarify, both sections refer to the same analysis, not to two different analyses. We simply decided to include a brief sentence in the Results section to describe the results of the selection of variables, as we aimed to present the process of variable choice for clarity, transparency and reproducibility. We acknowledge that this might seem repetitive. However, by placing and recalling this information in the Results, we hoped to maintain a logical flow of the modeling process, making it easier for readers to follow the progression of the analysis. We specified that the collinearity/multicollinearity analysis refers to the one described in material and methods section in line 362.

We hope this explanation clarifies our approach.

3. Lines 226-228 identifies several variables that were excluded in all scenarios (overall and seasonal, I presume), and interestingly, 2 of the variables (Temperature and NDVI) are those averaged over a 10 year span that I mentioned was not appropriate in my previous review (see below comments from my previous review and author response).

A: We thank the reviewer for providing additional arguments. We modified the manuscript according to the reviewer and editor’s suggestions. We believe this revision will streamline the manuscript and better align with the flow of the content (see below).

4. Table 1 repeats some of what is in text in both sections and I would still suggest that it be deleted. If the authors believe it has value, then why not include mean (SDs) for each variable during each season and overall instead of just an x? An entire page for a table with x’s that could be included in 4 sentences in the Materials and Methods (like Lines 226-228 so one line for overall and one line for each season) does not seem necessary and I don’t see how it helps the “flow” considering it is also presented in the Results?

A: We really appreciate reviewer effort in providing such detailed arguments. We thank the reviewer for the thoughtful comments and su

Attachment

Submitted filename: Response to Reviewers.docx

pone.0317577.s010.docx (36.9KB, docx)

Decision Letter 2

Francesco Bisi

2 Jan 2025

Habitat suitability mapping and landscape connectivity analysis to predict African swine fever spread in wild boar populations: a focus on Northern Italy

PONE-D-24-33277R2

Dear Dr. Faustini,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Francesco Bisi, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Francesco Bisi

22 Jan 2025

PONE-D-24-33277R2

PLOS ONE

Dear Dr. Faustini,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Francesco Bisi

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Environmental variables included in the species distribution models (SDM) for each scenario, after testing for collinearity and multicollinearity.

    Variables taking into account distance from environmental features are shortened with "Dist. from”, while those considering topological patterns with "Top.". Variable references are available in S2 Table.

    (PDF)

    pone.0317577.s001.pdf (111.3KB, pdf)
    S2 Table. Metadata associated to the variables included in the study.

    (XLSX)

    pone.0317577.s002.xlsx (24.8KB, xlsx)
    S3 Table. Performance of algorithms of species distribution modelling (SDM), available in the package Biomod2, evaluated through Area Under ROC (Relative Operating Characteristic) Curve (AUC) and True Skills Statistics (TSS) metrics.

    Mean and standard deviation (sd) are reported for each considered scenario.

    (XLSX)

    pone.0317577.s003.xlsx (19.6KB, xlsx)
    S1 Appendix. Response curves of the computed Gradient Boosted Machine ensemble model.

    (PDF)

    pone.0317577.s004.pdf (650.1KB, pdf)
    S1 Fig. Distribution of wild boar records and estimated suitability maps by season.

    (A) Distribution of occurrence records included in the final model (red points) by season. (B) Maps describing habitat suitability for the presence of wild boars in Northern Italy referring to environmental conditions by season. Base map of Italy based on the 2024 regional administrative boundaries from the Italian National Institute of Statistics (https://www.istat.it/).

    (TIF)

    pone.0317577.s005.tif (1.7MB, tif)
    S2 Fig. Map of the study area reporting geographical details.

    Base map made with Natural Earth. Study area and Asti province based on the 2024 administrative boundaries from the Italian National Institute of Statistics (https://www.istat.it/). Lake Maggiore, Po River, regional parks details made from vector files available on the National geoportal of the Italian Ministry of Environment and Energy Security (https://gn.mase.gov.it/) for illustrative purpose only.

    (TIF)

    pone.0317577.s006.tif (5.6MB, tif)
    S1 File. Main code scripts used in the workflow are available at https://doi.org/10.5281/zenodo.14639171.

    (PDF)

    pone.0317577.s007.pdf (96.8KB, pdf)
    S2 File. Suitability maps, connectivity map and binary risk maps are available at https://doi.org/10.5281/zenodo.14637229.

    (PDF)

    pone.0317577.s008.pdf (90.9KB, pdf)
    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0317577.s009.docx (57.5KB, docx)
    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0317577.s010.docx (36.9KB, docx)

    Data Availability Statement

    Main code scripts used in the workflow are available at https://doi.org/10.5281/zenodo.14639171. Link to Gitlab repository: https://gitlab.com/giuliafaustini1/habitat-suitability-mapping-and-landscape-connectivity-analysisto-predict-african-swine-fever-spread-in-wild-boar-population. Suitability maps, connectivity map and binary risk maps are available at https://doi.org/10.5281/zenodo.14637229.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES