ABSTRACT
Vibrio spp. comprise ecologically significant bacteria that thrive in warm, moderately saline water, and their incidence and proliferation are strongly influenced by environmental factors. In recent years, Vibrio spp. infections have been reported more frequently and over a greater geographical area along the US eastern seaboard. This study provides an analysis of Vibrio spp. infections, notably caused by Vibrio alginolyticus, Vibrio cholerae non-O1/non-O139, Vibrio fluvialis, Vibrio mimicus, V. parahaemolyticus, and V. vulnificus, extracted from the Centers for Disease Control and Prevention’s Cholera and Other Vibrio Illness Surveillance system, located within 200 km of the eastern US coast, to analyze latitudinal distribution trends between 1990 and 2019. For each Vibrio spp., case incidence (presence/absence) was modeled, including environmental data and employing extreme gradient boosting machine learning algorithms (XGBoost). Environmental parameters associated with the incidence of vibriosis were clustered using k-means clustering. The northern limit of total cases of vibriosis was found to have increased ca. 40 km/year, with V. alginolyticus (ca. 70 km/year), V. fluvialis (ca. 60 km/year), and Vibrio parahaemolyticus (ca. 60 km/year) showing the greatest latitudinal shifts. These changes were found to be linked to environmental parameters that enhance the proliferation of Vibrio spp. The average accuracy of the XGBoost models was 60.9%–71.0%, with temperature and salinity being the most significant predictors. Relationships among other environmental parameters were complex and nonlinear, but phytoplankton and precipitation served to differentiate the models. Clustering using k-means yielded results that supported temperature, salinity, and phytoplankton as important environmental parameters. Research in progress will aid in developing global predictive risk models for Vibrio spp. infections.
IMPORTANCE
Vibrio spp. are ecologically significant bacteria, and their incidence and proliferation are strongly influenced by environmental factors. In recent years, Vibrio spp. infections have been reported more frequently and over a greater geographical area along the US eastern seaboard. This study provides an analysis of latitudinal distribution trends of Vibrio spp. infections, notably caused by Vibrio alginolyticus, Vibrio cholerae non-O1/non-O139, Vibrio fluvialis, Vibrio mimicus, Vibrio parahaemolyticus, and V. vulnificus, within 200 km of the eastern US coast. The northern limit of total cases of vibriosis was found to have increased ca. 40 km/year. These changes were found to be linked to environmental parameters that enhance the proliferation of Vibrio spp. Temperature and salinity were the most significant predictors of vibriosis case presence and absence. Phytoplankton and precipitation changes served to differentiate Vibrio sp. presence. Research in progress will aid in developing global predictive risk models for Vibrio spp. infections.
KEYWORDS: vibriosis, Vibrio mimicus, Vibrio fluvialis, Vibrio cholerae, Vibrio alginolyticus, Vibrio vulnificus, Vibrio parahaemolyticus, Vibrio spp., Vibrio
INTRODUCTION
Vibrio spp. are autochthonous to riverine, estuarine, and coastal environments and also freshwater ecosystems globally (1). Vibrio spp. thrive in warm, moderately saline water, with incidence and proliferation heavily influenced by a variety of environmental factors (1, 2). Furthermore, Vibrio spp. form mutualistic associations with aquatic multicellular hosts, most notably zooplankton (3, 4), corals (5), fish (6), mollusks and bivalves (7), seagrass, sponges, and shrimps (8, 9). In addition, Vibrio spp. play an important role in carbon and nitrogen cycling (10, 11). However, despite their ecological significance in the aquatic ecosystem, Vibrio spp. cause severe infections in humans, notably Vibrio cholerae O1, the causative agent of pandemic cholera. Non-cholera Vibrio spp., notably V. cholerae non-O1/non-O139, Vibrio vulnificus, Vibrio parahaemolyticus, Vibrio alginolyticus, Vibrio fluvialis, and Vibrio mimicus, can cause acute gastroenteritis, septicemia, and other extra-intestinal infections.
In developed countries of the world, the risk of cholera has essentially been eliminated with the availability of safe water and effective sanitation. However, non-cholera Vibrio infections (vibriosis) persist in essentially all countries, notably associated with consumption of raw and undercooked seafood or contact of open wounds with water containing pathogenic agents (12). The United States Centers for Disease Control and Prevention (CDC) estimates that 80,000 cases of vibriosis occur in the United States each year, with approximately 52,000 cases caused by consumption of contaminated seafood (12). Vibriosis can present as a serious infection, with V. vulnificus being fatal in ca. 20% of reported wound infections and greater than 50% for primary septicemia (12). Given the environmental importance of Vibrio spp. in the aquatic ecosystem, namely carbon and nitrogen cycling, eradication of Vibrio spp. and the diseases they cause is not possible. Hence, it is essential to understand the environmental parameters associated with their incidence and proliferation to provide early warning for risk of infection, an essential tool for public health and aquaculture.
Altered geographic abundance of Vibrio spp. (2, 13) and, notably, the incidence of V. vulnificus infections in the eastern United States from 1988 to 2018 (14) show significant northern shifts in the distribution of cases over time. Other investigators have reported similar distributional changes in V. vulnificus infections globally that have been linked to climate change (15). Between 1996 and 2010, the majority of cases in the U.S. were caused by V. parahaemolyticus ca. 45%, followed by V. vulnificus ca. 19%, V. alginolyticus ca. 11%, and V. cholerae non-O1/non-O139 ca. 9% (16). However, infections caused by V. fluvialis and V. mimicus have increased in recent years (17).
Concerningly, the incidence of cholera and non-cholera Vibrio spp. infections has increased in recent decades (18–20), likely linked to changes in the environment, notably temperature and salinity (2, 13), that make regions better suited for the proliferation of Vibrio spp. (15). Sea surface temperature (SST) was found to be associated with outbreaks of cholera (21), and warmer water temperatures were correlated with increased numbers of non-cholera Vibrio spp. (22). Similarly, sea surface salinity (SSS) was found to be associated with the abundance of Vibrio spp., and concerns have been raised with respect to rising sea surface levels and changes in precipitation patterns impacting salinity concentrations, hence the incidence of pathogenic Vibrio spp. (19, 22, 23).
Chlorophyll-a (chl-a) has been found to have an association with both cholera (24–26) and non-cholera Vibrio spp (27–29). Chl-a concentrations have been used to estimate nutrient loads and phytoplankton blooms, with the latter serving as a proxy for the incidence of zooplankton (21, 30, 31). Since zooplankton, notably copepods, provide a nutrient-rich niche to which Vibrio spp. attach (32, 33), chl-a can also serve as a useful indirect indicator of Vibrio incidence, when appropriate lag times are considered (34).
In addition, extreme precipitation events have been linked to both cholera (35–37) and non-cholera Vibrio spp. infections (38). Anomalous precipitation has been shown to alter the salinity of coastal waters (39), and with extreme precipitation events expected to increase in intensity and frequency with climate change (40), it has been proposed that infections caused by Vibrio spp. may further increase in the coming years (18).
Optimal conditions for the growth of Vibrio spp. vary depending on genetic composition (9, 41–44). Thus, identifying environmental parameters associated with the incidence and abundance of pathogenic Vibrio spp. and identifying those factors associated with the incidence and distribution of Vibrio spp. is essential for the development of predictive intelligence and early warning systems. A hypothesis of environmental factors influencing risk of cholera has been proposed (20, 21, 31, 45, 46) and has been employed to mitigate risk of cholera outbreaks in several regions globally (36, 47–50). A few predictive models for non-cholera Vibrio spp. have been proposed for regions of the U.S., namely, V. parahaemolyticus (51, 52) and V. vulnificus (53). Following the proposal of using remote sensing data for investigations of Vibrio spp. (31), earth observation satellite products have been used for some predictive models, e.g., V. parahaemolyticus in Chesapeake Bay (54), non-cholera Vibrio spp. in the Baltic Sea (55), and V. vulnificus in some areas of the U.S. (56). Here, the objective was to identify potential environmental parameters associated with incidence and distribution of vibriosis along the eastern seaboard of the U.S. to facilitate development of predictive models for risk assessment.
MATERIALS AND METHODS
Vibriosis case data
Data for reported vibriosis (non-cholera) infections in the U.S. were provided by the CDC Cholera and Other Vibrio Illness Surveillance system (57) and included the number of reported cases between 1990 and 2019, along with the date patients arrived at the hospital, whether the case was confirmed or probable, causative agent, location of the hospital where the case was reported, travel history, and transmission route (foodborne, likely foodborne, nonfoodborne, likely nonfoodborne, or unknown source). For this study, both confirmed and probable cases of vibriosis were considered, but it should be noted that, of the ca. 10,000 total cases, only ca. 1,000 were categorized as probable cases. Cases in which the patient had traveled recently were omitted from the analysis. Locations of each case were given on varying spatial scales, with some cases reported at the county level, some at the city level, and others at the state level. Hence, state-level cases were not considered because of coarse spatial resolution. City-level cases were converted to the county level, using the county of the identified city for each case. Each case was assigned a latitude and longitude location based on the centroid of each US county where the case was reported. Only cases with assigned latitude and longitude within 200 km of the US east coast (including the Gulf of Mexico) were considered (see Fig. 1 for depiction of the study area).
Fig 1.
Vibrio spp. responsible for most of the vibriosis cases (highest relative abundance) in US counties within 200 km of the US eastern coastline (study area), between 2010 and 2019. The right side of the graph shows changes in the number of cases per year at each latitude point. The base map is from the U.S. Census Bureau and Natural Earth.
Geographical case trend analyses
To observe change in the latitudinal distribution of vibriosis cases over time, an annual trend analysis was conducted as opposed to a smaller time scale because seasonal trends of vibriosis typically show case peaks in the summer months, potentially masking long-term trends. After assigning latitude, the cases were separated by year. The minimum, 25th percentile, median, 75th percentile, and maximum latitude values were computed for the latitudes of all cases in each calendar year. This was done to examine both the general movement of case distributions as well as the extreme extent of case distribution. Once latitude values were computed for each year, Pearson’s correlation coefficient between each percentile and time was computed to determine if there was a significant positive or negative correlation of case latitude over time. The slope of case latitude over time was also computed to determine the magnitude of these changes. These computations were done for all cases of vibriosis, cases of only likely or confirmed foodborne sources, cases of only likely or confirmed nonfoodborne sources, and cases of non-V. cholerae Vibrio spp., as well as individual Vibrio spp., including V. cholerae non-O1 and non-O139, V. alginolyticus, V. fluvialis, V. mimicus, V. parahaemolyticus, and V. vulnificus. For the remaining Vibrio spp., there was an insufficient number of reported cases (ca. 20) over the 30-year period to perform statistical analyses; hence, they were excluded. Cases for the six Vibrio spp. were grouped into 10-year periods (1990–1999, 2000–2009, and 2010–2019), and geographic centroids for each 10-year period were computed for each Vibrio spp.
Environmental parameters
To understand the relationship between long-term geographical trends in vibriosis case data and environmental changes over time, several environmental parameters were selected for analysis. All environmental data were extracted from within 20 km of the US eastern coast (including the Gulf of Mexico) and comprise SST, SSS, chl-a concentration, phytoplankton concentration (separated by size classification), and precipitation. These data were converted to monthly averages prior to subsequent analysis.
SST data were collected from two different sources for 1990–2019. Data for 1990 to July 2002 were collected from the National Oceanic and Atmospheric Administration (NOAA) optimum interpolation sea surface temperature (OISST) data set obtained using the advanced very high resolution radiometer (AVHRR) and visible infrared imaging radiometer suite (VIIRS) (58). Data for August 2002 to 2019 were collected from the Aqua-MODIS daytime 4 km monthly SST data set, using the NASA Ocean Color levels 3 and 4 data browser (59). The Aqua-MODIS data set has a much higher spatial resolution than OISST, but Aqua-MODIS data span only to 2002; hence, two different data sets were used. Data for SSS were from the Ocean Reanalysis System 5 (ORAS5) global ocean reanalysis monthly data set (60). Precipitation data were from ERA5 monthly averaged data sets for total precipitation (61).
Chl-a concentration data were collected from Global Ocean Color (Copernicus-GlobColour), Bio-Geo-Chemical, L4 (monthly and interpolated) from Satellite Observations (1997-ongoing) data set provided by Copernicus Marine Data Store (62). These data were from September 1997 to 2019. Other remote sensing data sources for chl-a concentrations were not readily available prior to this date. This level 4 data set integrates chl-a observations from multiple satellite sensors, including SeaWiFS, MODIS, MERIS, VIIRS (SNPP and JPSS1), and OLCI (Fig. S3A and B), to produce a continuous, blended product improving spatial and temporal accuracy.
Chl-a concentrations can be categorized by phytoplankton size classification. Specifically, microphytoplankton, nanophytoplankton, and picophytoplankton concentrations were calculated from chl-a concentration using several transformation formulas. For this study (63), formulas were used to transform chl-a concentration to the size classification of the phytoplankton. Size classifications of phytoplankton show the highest concentration at different ranges of chl-a, and because chl-a concentration has a non-linear association with Vibrio spp. (13), it is important to explore the possibility of relationships within size classification, as they may explain the optimal chl-a concentration range with respect to Vibrio spp. abundance. Although other sensors and data sets are available for these environmental parameters, the study reported here prioritized continuity and consistent spatial resolution to enhance accuracy and minimize noise introduced by varying algorithms. The exception was SST, for which the data prior to 2002 were obtained from AVHRR-VIIRS at a coarser 0.25° resolution. Post-2002 data from sensors like MODIS offer finer 4 km resolution. However, previous studies have shown that temperature resolution has little impact on identifying environmental associations and model performance (21, 49). Nonetheless, higher resolution is preferred.
Environmental parameter trend analysis
To observe variability in selected environmental parameters for the study area throughout the study period, the data were grouped into five multi-year periods: 1990–1996 (except for phytoplankton due to data availability), 1997–2002, 2003–2008, 2009–2014, and 2015–2019. Data were also grouped by season to determine long-term seasonal variability.
Pearson’s correlation coefficients were calculated between minimum, 25th percentile, median, 75th percentile, and maximum values for each parameter to assess the direction and significance of temporal trends. Slopes were also computed to quantify the rate and magnitude of change over time. In summary, this approach provides a standardized method to evaluate both annual and seasonal shifts in environmental conditions across the eastern and southern U.S. coastal study area.
Extreme gradient boosting modeling
To determine the relative influence and complex relationships of each environmental parameter on reported vibriosis cases, eXtreme Gradient Boosting (XGBoost) Models were used. XGBoost is a machine learning method that uses gradient-boosted decision trees (64). Environmental data were associated with each data point by using the average value from the five data points closest to the location. Environmental data were lagged by 1 month, as there is likely to be a lagged impact of environmental variables on reported cases. Location data points were selected using all counties within 200 km of the U.S. eastern coastline that had reported a case of vibriosis for causative species, including V. cholerae non-O1 and non-O139, V. alginolyticus, V. fluvialis, V. mimicus, V. parahaemolyticus, and V. vulnificus. Each data point was assigned a categorical variable indicating vibriosis case presence or absence. Therefore, each data point included a specific location, environmental data (1-month lag) associated with that location, and an indicator of case presence or absence for that month and location. Data from October 1997 (the earliest available data for chl-a/phytoplankton) to December 2019 were used.
The data had an imbalanced class distribution, with the presence class being a minority and highly imbalanced for all Vibrio spp. (~99 absence data points to every 1 presence data point). This bias in data sets can affect the XGBoost model, potentially leading to ignoring the minority class entirely. To address this problem, two approaches used were to randomly resample the imbalanced data set: undersampling, which involves deleting some data from the majority class (absence), and oversampling, which involves duplicating data from the minority class (presence). Given that our minority class was significantly smaller than the majority class, we opted for random undersampling, ensuring equal representation of each class. This is preferable to random oversampling of the minority class because random oversampling can lead to model overfitting and a misrepresentation of model performance under real predictive scenarios. This preprocessing step was crucial for unbiased model training. The imbalanced-learn () library was used for RandomUnderSampler. As random undersampling can have large differences in the majority class data points selected, impacting model fitting and results, random undersampling was performed 50 times for each Vibrio sp. data set, thus providing data for 50 models for each Vibrio sp. The use of multiple iterations assists in providing clarity on model performance across most of the data set and reduces the majority class selection bias.
In establishing the framework for the XGBoost model, six environmental parameters were selected as input features, with chl-a excluded from the input features. This was to reduce the impact of collinearity and overfitting since phytoplankton data were also included. The binary classification of vibriosis presence served as the response variable. Each data set was partitioned into two subsets using the train_test_split function from the sklearn.model_selection module: 70% allocated for training and 30% for testing. The classification model was constructed using the XGBClassifier function from the XGBoost library. Initial settings for the model parameters included the number of estimators, learning rate, maximum depth of trees, subsample ratio, and column sample by tree fraction. To refine the parameters, GridSearchCV imported from the sklearn.model_selection was employed to conduct hyperparameter tuning, aiming to optimize model settings. The grid search emphasized maximizing accuracy and incorporated fivefold cross-validation to validate model performance across different subsets of data. The best-performing model was determined based on the results of this hyperparameter tuning, ensuring the selected model offered the most effective combination of parameters for generalizability and predictive accuracy.
In the model evaluation phase, the performance of the best model obtained from the GridSearchCV was assessed using several key metrics to determine effectiveness on the testing data set. These metrics included accuracy, precision, recall, F1 score, and area under the curve (AUC) score. SHapley Additive exPlanations (SHAP) values proposed by Lundberg and Lee (65) were used to explain the physical interpretation of model outputs. Utilizing the game-theoretic approach of Shapley values, SHAP provides a detailed decomposition of the predictive contribution of each environmental parameter. This approach allowed both global and local interpretability. Globally, SHAP reveals the overall importance and effect of each feature across all predictions, helping to identify key environmental drivers of vibriosis presence. Locally, it explains the contribution of each feature to individual predictions, offering insight into specific cases that may be critical for targeted environmental or public health intervention. Fifty XGBoost models were created for each species (V. cholerae, non-O1 and non-O139, V. alginolyticus, V. fluvialis, V. mimicus, V. parahaemolyticus, and V. vulnificus), in addition to total Vibrio spp. infections, using three case groupings: foodborne only, nonfoodborne only, and all cases combined.
K-means clustering analysis
As a complementary analysis to better understand the environmental conditions under which vibriosis cases were reported, k-means clustering analysis was conducted. For this analysis, vibriosis cases were divided according to individual Vibrio spp. (including V. cholerae, non-O1 and non-O139, V. alginolyticus, V. fluvialis, V. mimicus, V. parahaemolyticus, and V. vulnificus) and further divided into foodborne and nonfoodborne cases. K-means clustering was performed in Minitab for three centroids in each analysis. Results for different numbers of centroids did not provide greater physical clarity with respect to environmental conditions associated with cases and using the same number of centroids for each group made direct comparison easier, particularly for foodborne and nonfoodborne cases caused by the same Vibrio spp. Environmental data were associated with each case just as they were in the XGBoost models. Monthly cases and environmental data were used, beginning with October 1997 to December 2019. Only data points with vibriosis case presence were used in the analyses.
RESULTS
Geographical case trends
Latitudinal shifts in vibriosis cases over time, including median, 75th percentile, and maximum latitude, are shown in Fig. 2. Correlation coefficient and slope with time for minimum, 25th percentile, median, 75th percentile, and maximum latitude are provided in Table 1. Interestingly, the minimum latitude for the reported causative agent of vibriosis, except V. vulnificus, decreased over time, suggesting a southernmost latitude shift southward. All case types (foodborne, nonfoodborne, cholera, non-cholera, and causative agent) showed positive correlation and slope for the remaining values. All values were statistically significant at 99% confidence level, except the 25th percentile for nonfoodborne, V. alginolyticus, and V. mimicus cases and median for V. cholerae non-O1, non-O139 cases.
Fig 2.
Median, 75th percentile, and maximum latitude of vibriosis cases within 200 km of the eastern US coastline and the Gulf of Mexico, 1990–2019.
TABLE 1.
Correlation coefficients and slopes of minimum, 25th percentile, median, 75th percentile, and maximum latitude of vibriosis cases within 200 km of the eastern US coastline and the Gulf of Mexico, 1990–2019a
| Species/infection type | Total | Foodborne | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Percentile | Minimum | 25th Percentile | Median | 75th Percentile | Maximum | Minimum | 25th Percentile | Median | 75th Percentile | Maximum |
| Correlation | −0.373 | 0.822 | 0.658 | 0.920 | 0.782 | −0.153 | 0.786 | 0.831 | 0.885 | 0.769 |
| Slope (km/year) | −1.73 | 8.86 | 14.98 | 52.28 | 42.19 | −0.89 | 10.61 | 38.69 | 52.93 | 40.95 |
| Species/infection type | Non-V. cholerae | Nonfoodborne | ||||||||
| Percentile | Minimum | 25th Percentile | Median | 75th Percentile | Maximum | Minimum | 25th Percentile | Median | 75th Percentile | Maximum |
| Correlation | −0.373 | 0.802 | 0.784 | 0.925 | 0.790 | −0.440 | 0.408 | 0.585 | 0.910 | 0.852 |
| Slope (km/year) | −1.73 | 8.79 | 22.27 | 53.25 | 42.51 | −2.40 | 3.42 | 8.11 | 49.55 | 61.38 |
| Species/infection type | V. cholerae Non-O1, Non-O139 | V. alginolyticus | ||||||||
| Percentile | Minimum | 25th Percentile | Median | 75th Percentile | Maximum | Minimum | 25th Percentile | Median | 75th Percentile | Maximum |
| Correlation | −0.062 | 0.686 | 0.380 | 0.693 | 0.822 | −0.403 | 0.243 | 0.605 | 0.918 | 0.878 |
| Slope (km/year) | −1.03 | 8.41 | 10.16 | 37.59 | 54.61 | −6.35 | 3.34 | 19.76 | 61.55 | 70.30 |
| Species/infection type | V. fluvialis | V. mimicus | ||||||||
| Percentile | Minimum | 25th Percentile | Median | 75th Percentile | Maximum | Minimum | 25th Percentile | Median | 75th Percentile | Maximum |
| Correlation | −0.074 | 0.659 | 0.716 | 0.891 | 0.823 | −0.196 | 0.145 | 0.478 | 0.477 | 0.727 |
| Slope (km/year) | −1.01 | 14.12 | 35.31 | 52.07 | 60.22 | −3.56 | 1.61 | 3.72 | 14.40 | 42.11 |
| Species/infection type | V. parahaemolyticus | V. vulnificus | ||||||||
| Percentile | Minimum | 25th Percentile | Median | 75th Percentile | Maximum | Minimum | 25th Percentile | Median | 75th Percentile | Maximum |
| Correlation | −0.305 | 0.719 | 0.890 | 0.867 | 0.835 | 0.197 | 0.581 | 0.798 | 0.801 | 0.767 |
| Slope (km/year) | −2.12 | 14.73 | 49.65 | 54.13 | 61.01 | 1.45 | 5.07 | 3.40 | 32.14 | 39.29 |
Bolded correlations are statistically significant (P < 0.01).
For the reported total, foodborne, and non-V. cholerae, the 75th percentile latitude shifted most, with reported foodborne cases greatest, 0.477° (~52.9 km) per year. For others, the northernmost latitude shifted most over time, the largest being 0.633° (~70.3 km) per year for V. alginolyticus and the least being 0.354° (~39.3 km) for V. vulnificus.
Figure 3 presents results for 10-year geographic centroids, with the least total geographic shift over the last three decades for V. mimicus and the most for V. alginolyticus, V. fluvialis, and V. parahaemolyticus. Results show case volume, by color scale, with increases noted for all species, but especially V. parahaemolyticus, V. vulnificus, and V. alginolyticus. For clarity on case distribution, Fig. 4 presents the relative abundance of vibriosis cases for each state along the U.S. eastern seaboard and number of vibriosis cases between 1990 and 2019.
Fig 3.
Ten-year mean geographic centroids of vibriosis cases by Vibrio spp. identified as the causative agent, 1990–2019. The number of cases represents the total number of cases for each 10-year period. Bars on the right are latitude markers for each 10-year period. The base map is from the U.S. Census Bureau and Natural Earth.
Fig 4.
(A) Vibriosis cases over time separated by causative agent (Vibrio sp.). Data obtained from records of counties located within 200 km of the eastern coastline and the Gulf of Mexico, 1990–2019. (B) Composition of vibriosis cases by Vibrio sp. responsible for infection, presented by state using data from counties within 200 km of the eastern coastline and the Gulf of Mexico, 1990–2019. Vibriosis cases caused by other Vibrio spp. are not included.
Environmental parameter trends
Box and whisker plots of environmental parameters for multi-year periods between 1990 and 2019 are provided in the supplemental material. In general, SST and SSS have significantly increased along the U.S. eastern and southern seaboard between 1990 and 2019. Other parameters did not show consistent or statistically significant trends across all seasons. However, some exhibited notable seasonal variability over the past few decades. Notably, picophytoplankton median and 75th percentile values significantly (P < 0.05) decreased during summer and fall (i.e., seasons associated with high vibriosis case reports) from 1997 to 2019. Also, SST slopes for the 25th, median, and 75th percentile values were greatest in the summer. Annual correlation coefficient and slope with time for minimum, 25th percentile, median, 75th percentile, and maximum latitude are provided in Table 2.
TABLE 2.
| Foodborne | Nonfoodborne | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| V. alginolyticus | ||||||||||||||||
| Variable | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observations (N) | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observations (N) |
| Cluster 1 | 25.4 | 23.5 | 16.52 | 16.51 | 0 | 0.02 | 3 | 39 | 28.6 | 26.7 | 16.58 | 16.56 | 0 | 0.02 | 3.3 | 192 |
| Cluster 2 | 31.3 | 18.6 | 9.58 | 9.36 | 0.01 | 0.21 | 3.2 | 59 | 31.2 | 16.5 | 8.91 | 8.64 | 0.02 | 0.25 | 3.3 | 207 |
| Cluster 3 | 35.6 | 26.3 | 3.79 | 3.28 | 0.14 | 0.36 | 2.6 | 58 | 35.5 | 27 | 3.83 | 3.33 | 0.15 | 0.35 | 3 | 433 |
| Grand centroid | 31.4 | 22.7 | 9.16 | 8.89 | 0.06 | 0.22 | 2.9 | 156 | 32.9 | 24.3 | 8.04 | 7.7 | 0.08 | 0.25 | 3.1 | 832 |
| Q1–Q3 | 30.4–35.6 | 19.1–27.8 | 4.13–13.13 | 3.54–12.99 | 0.00–0.09 | 0.03–0.40 | 1.8–4.0 | N/A | 31.4–36.0 | 21.4–28.9 | 3.01–11.69 | 2.36–11.62 | 0.00–0.15 | 0.05–0.41 | 1.5–4.1 | N/A |
| V. cholerae non-O1, non-O139 | ||||||||||||||||
| Variable | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observations (N) | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observations (N) |
| Cluster 1 | 28.3 | 19 | 10.57 | 10.4 | 0 | 0.18 | 3.3 | 130 | 32 | 26.7 | 14.93 | 14.9 | 0 | 0.02 | 3.4 | 40 |
| Cluster 2 | 34.6 | 26.8 | 4.78 | 4.28 | 0.12 | 0.38 | 2.9 | 81 | 33.9 | 26.3 | 5.38 | 4.97 | 0.07 | 0.33 | 3 | 31 |
| Cluster 3 | 20 | 26.6 | 16.34 | 16.32 | 0 | 0.02 | 4.2 | 68 | 18.9 | 27.2 | 14.66 | 14.6 | 0 | 0.06 | 5.1 | 26 |
| Grand centroid | 28.1 | 23.1 | 10.3 | 10.06 | 0.04 | 0.2 | 3.4 | 279 | 29.1 | 26.7 | 11.81 | 11.65 | 0.03 | 0.13 | 3.7 | 97 |
| Q1–Q3 | 22.6–34.3 | 18.4–28.1 | 6.17–13.84 | 5.69–13.84 | 0.00–0.00 | 0.00–0.37 | 1.5–4.4 | N/A | 23.3–34.6 | 23.9–30.3 | 8.15–15.39 | 7.87–15.39 | 0.00–0.00 | 0.00–0.25 | 1.6–4.9 | N/A |
| V. fluvialis | ||||||||||||||||
| Variable | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observations (N) | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observations (N) |
| Cluster 1 | 24.1 | 26.1 | 17.79 | 17.78 | 0 | 0.01 | 3.6 | 81 | 19.9 | 25.8 | 13.09 | 13.01 | 0 | 0.08 | 3.7 | 28 |
| Cluster 2 | 28.6 | 17.3 | 10.01 | 9.82 | 0.01 | 0.18 | 3.1 | 124 | 32.7 | 26.2 | 14.15 | 14.11 | 0 | 0.04 | 3.7 | 31 |
| Cluster 3 | 34.3 | 26.2 | 5.86 | 5.47 | 0.08 | 0.32 | 3.2 | 125 | 33.8 | 21.4 | 5.3 | 4.85 | 0.1 | 0.35 | 3.1 | 47 |
| Grand centroid | 29.7 | 22.8 | 10.35 | 10.13 | 0.03 | 0.19 | 3.2 | 330 | 29.8 | 24 | 9.95 | 9.71 | 0.05 | 0.19 | 3.4 | 106 |
| Q1–Q3 | 25.7–35.0 | 18.7–27.8 | 6.51–13.68 | 6.06–13.65 | 0.00–0.00 | 0.00–0.35 | 1.9–4.4 | N/A | 23.8–35.4 | 21.4–28.8 | 5.90–13.08 | 5.41–13.06 | 0.00–0.01 | 0.00–0.34 | 2.0–4.0 | N/A |
| Foodborne | Nonfoodborne | |||||||||||||||
| V. mimicus | ||||||||||||||||
| Variable | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observations (N) | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observations (N) |
| Cluster 1 | 32.8 | 25.5 | 15.59 | 15.57 | 0 | 0.028 | 3.8 | 20 | 32.1 | 28.5 | 15.18 | 15.17 | 0 | 0.015 | 3.6 | 13 |
| Cluster 2 | 21.1 | 24.9 | 12.82 | 12.74 | 0 | 0.088 | 4.1 | 51 | 20.7 | 24.8 | 11.27 | 11.17 | 0 | 0.101 | 4.7 | 15 |
| Cluster 3 | 33.5 | 21.2 | 5.27 | 4.79 | 0.073 | 0.408 | 3.2 | 47 | 34 | 23.3 | 6.77 | 6.44 | 0.048 | 0.284 | 1.9 | 16 |
| Grand centroid | 28 | 23.5 | 10.29 | 10.05 | 0.029 | 0.205 | 3.7 | 118 | 28.9 | 25.3 | 10.79 | 10.63 | 0.018 | 0.142 | 3.4 | 44 |
| Q1–Q3 | 22.0–34.5 | 20.2–28.6 | 6.01–13.04 | 5.52–12.95 | 0.00–0.00 | 0.00–0.40 | 2.3–4.5 | N/A | 22.7–35.0 | 22.4–29.4 | 8.77–14.23 | 8.54–14.23 | 0.00–0.00 | 0.00–0.21 | 1.1–3.8 | N/A |
| V. parahaemolyticus | ||||||||||||||||
| Variable | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observations (N) | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observat-ions (N) |
| Cluster 1 | 14.9 | 22 | 18.55 | 18.55 | 0 | 0.01 | 3.7 | 192 | 15.2 | 25.5 | 17.84 | 17.83 | 0 | 0.02 | 3.3 | 122 |
| Cluster 2 | 31.6 | 20.9 | 6.93 | 6.56 | 0.05 | 0.31 | 3.3 | 656 | 34.1 | 25 | 4.81 | 4.31 | 0.1 | 0.4 | 2.9 | 312 |
| Cluster 3 | 31.1 | 22.2 | 15.98 | 15.97 | 0 | 0.02 | 3.4 | 396 | 31.2 | 24.1 | 13.22 | 13.15 | 0 | 0.07 | 3.4 | 312 |
| Grand centroid | 28.9 | 21.5 | 11.6 | 11.4 | 0.03 | 0.17 | 3.4 | 1244 | 29.8 | 24.7 | 10.46 | 10.22 | 0.05 | 0.2 | 3.2 | 746 |
| Q1–Q3 | 27.9–33.5 | 18.0–25.5 | 7.12–15.45 | 6.74–15.45 | 0.00–0.00 | 0.00–0.35 | 2.0–4.3 | N/A | 28.4–35.2 | 21.2–29.1 | 5.56–14.38 | 5.05–14.37 | 0.00–0.04 | 0.00–0.39 | 1.7–4.2 | N/A |
| V. vulnificus | ||||||||||||||||
| Variable | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observations (N) | SSS (ppt) | SST (°C) | Chl-a (mg/L) | Micro (mg/L) | Pico (mg/L) | Nano (mg/L) | Precip (mm) | Observations (N) |
| Cluster 1 | 31.9 | 24.8 | 12.85 | 12.78 | 0 | 0.065 | 3.5 | 119 | 28.7 | 28.1 | 12.85 | 12.79 | 0 | 0.06 | 3.9 | 381 |
| Cluster 2 | 34.9 | 26 | 4.2 | 3.68 | 0.13 | 0.392 | 3.1 | 135 | 34.6 | 26.9 | 5.04 | 4.56 | 0.11 | 0.37 | 3.8 | 323 |
| Cluster 3 | 18.2 | 25 | 14.95 | 14.89 | 0 | 0.056 | 4 | 81 | 14.1 | 25.7 | 22.27 | 22.27 | 0 | 0 | 3.2 | 137 |
| Grand centroid | 29.8 | 25.3 | 9.87 | 9.63 | 0.05 | 0.2 | 3.5 | 335 | 28.6 | 27.3 | 11.38 | 11.17 | 0.04 | 0.17 | 3.7 | 841 |
| Q1–Q3 | 25.0–35.3 | 22.3–29.6 | 5.08–13.54 | 4.52–13.54 | 0.00–0.06 | 0.00–0.38 | 1.8–4.6 | N/A | 22.3–35.3 | 25.0–30.5 | 6.61–14.79 | 6.19–14.79 | 0.00–0.01 | 0.00–0.35 | 2.1–4.9 | N/A |
The left side of the table includes clusters for foodborne cases and the right side for nonfoodborne cases. The grand centroid is the overall center for each dataset and Q1–Q3 the range of values between the 25th percentile to the 75th percentile.
“N/A” indicates the number of observations that are not provided for the interquartile range (Q1-Q3) because the values are a range, not a single value.
Extreme gradient boosting model results
Model performance results for the test samples are shown in Fig. 5A. For some Vibrio spp., models using only nonfoodborne cases showed modest improvements compared to models using all cases. However, these improvements were inconsistent across species, and performance metrics became more variable, likely due to the reduced number of data points in the subset models. As a result, only models trained using total case data are evaluated in the main text. A detailed comparison of model performance for foodborne, nonfoodborne, and total cases is provided in the supplemental material.
Fig 5.
(A) Box and whisker plots of performance metrics for XGBoost models. Medians are inclusive. Outliers are not excluded. (B) SHAP values for XGBoost models with the highest accuracy for each Vibrio sp. Environmental variables are listed in order of importance for model results, with the top variable being the highest relative importance and the bottom variable the least relative importance. Ranges of environmental values for the testing data sets are provided below each variable.
For models exploring the total incidence of reported vibriosis, the average and median accuracy for all 50 iterations of the models were between 60.9% and 71.0%. The range of accuracies was largest for V. mimicus (0.53–0.72), likely because V. mimicus had fewest reported cases, therefore least data points for random undersampling, leading to greater variety in majority class data selection. V. mimicus models showed the largest range of results for other performance metrics, as did the V. cholerae non-O1, non-O139 and V. fluvialis models, the two other species showing relatively lower reported cases. In contrast, models for V. alginolyticus, V. parahaemolyticus, and V. vulnificus had smaller ranges of performance statistics, likely due to larger minority classes. The best-performing model was V. vulnificus by all performance metrics, followed by V. alginolyticus. The other models yielded mixed rankings among the performance metrics, but V. mimicus models had relatively low recall and F1 score, while V. fluvialis models showed both low AUC scores and low precision. Most models had fewer false negatives than false positives (recall was greater than precision) apart from V. mimicus. This is beneficial, as cases are very likely underreported, and the presence of Vibrio spp. in the aquatic environment does not necessarily lead to infection. Thus, slightly overpredicting vibriosis case presence and having more false positives should be expected and is preferable to false negatives, which could be more dangerous to public health if the model is used for future prediction and mitigation.
Figure 5B depicts SHAP values for the models with the highest accuracy for each Vibrio sp., along with environmental parameters listed in order of relative importance to the model output. Variation was noted among SHAP values, relationships between environmental variables, and ranking of variable importance for each iteration of the models, but the best performing were selected as they were best able to model results between environmental parameters and vibriosis, warranting the closest examination compared to other model results. Unfortunately, it was not possible to compile the model iterations into one result with SHAP values, as they are based on the testing data sets used, which is dependent on random undersampling. However, trends among model iterations are captured in Fig. 5B and explored further.
A positive SHAP value indicated a feature had a positive impact on case presence in the model results, while a negative SHAP value indicates a feature had a negative impact on case presence. Higher absolute SHAP values indicate more influence on model outputs. A high feature value indicated a high value for that variable within the context of the data set, while a low feature value indicated a low value for that variable. For example, an SSS point with a positive SHAP value and low feature value would indicate that low SSS contributed to the presence of a case in the model. The range of environmental values in the testing results is provided in Fig. 5B for clarity on high and low feature values.
SST and SSS were considered the most important variables in the model results, with very few model iterations deviating from this trend. Ranking of relative importance for the remaining variables varied from model to model. SST had a general positive linear relationship with SHAP values, with increasing SST leading to increased prediction of vibriosis. All other variables experienced a nonlinear relationship with SHAP values for all models, with only a few exceptions.
For V. alginolyticus, the highest contribution to vibriosis presence was consistent with high SST (~33°C), high SSS (~36.5 ppt), moderate-to-high picophytoplankton (~0.16–0.33 mg/L), low precipitation (~0.12 mm), low microphytoplankton (~0.01 mg/L), and a large range of nanophytoplankton. The highest contribution to vibriosis absence was consistent, low SST (~1.3°C), low SSS (~9.4 ppt), low picophytoplankton (~0 mg/L), high precipitation (~14.4 mm), a large range of microphytoplankton, and high nanophytoplankton (~0.60 mg/L). Highest SSS values provided the most contribution to presence, while the lowest SST values had the highest contribution to absence.
For V. cholerae non-O1, non-O139, highest contributions to vibriosis occurrence were consistent, with high SST (~33°C), low SSS (~9.4 ppt), and a wide range of precipitation values. Highest contributions to vibriosis absence were low SST values (slightly above ~1.3°C), high SSS (~36.4 ppt), a large range of precipitation values, high microphytoplankton (~31.7 mg/L), and high picophytoplankton (~0.32 mg/L). Other contributions to presence and absence by other variables were either small or for very few data points. High and low SST values had the highest contribution to presence and absence.
For V. fluvialis, the highest contribution to vibriosis occurrence was consistent with high SST (slightly below ~33°C), moderate SSS (~23 ppt), low-to-moderate nanophytoplankton (~0–0.28 mg/L), and low microphytoplankton (~0.01 mg/L). Highest contributions to vibriosis absence were low SST values (~1.3°C), high SSS (~36.4 ppt), high nanophytoplankton (~0.57 mg/L), high microphytoplankton (~31.1 mg/L), and a large range of precipitation values. Other contributions to presence and absence by other variables were either small or for very few data points. Low microphytoplankton values had the highest contribution to occurrence, though for only a few data points; otherwise, high SST values and moderate SSS values and low SST values had the highest contribution to absence.
For V. mimicus, the highest contributions to vibriosis occurrence were consistent with high SST (~36°C), moderate SSS (~23 ppt), low microphytoplankton (~0.04 mg/L), low picophytoplankton (~0 mg/L), and high nanophytoplankton (~0.58 mg/L). Highest contributions to vibriosis absence were low SST values (~1.7°C), high SSS (~36.4 ppt), high microphytoplankton (~24.6 mg/L), high picophytoplankton (~0.33 mg/L), and low nanophytoplankton (~0.00 mg/L). Other contributions to the presence and absence by other variables were either small or had very few data points. High SST values had the highest contribution to occurrence, and low SST values had the highest contribution to absence.
For V. parahaemolyticus, the highest contributions to vibriosis occurrence were consistent with high SST (slightly below ~33°C), moderate SSS (slightly above ~23 ppt and slightly below ~36 ppt), high microphytoplankton (~32.0 mg/L), high precipitation (slightly below ~16.8 mm), and high nanophytoplankton (~0.60 mg/L). Highest contributions to vibriosis absence were low SST values (~1.7°C), high SSS (~36.5 ppt), low microphytoplankton (~0.01 mg/L), low precipitation (~0.06 mm), high nanophytoplankton (slightly below ~0.60 mg/L), and high picophytoplankton (~0.33 mg/L). Other contributions to occurrence and absence by other variables were either small or had very few data points. Moderate SSS values had the highest contribution to presence, and low SST values had the highest contribution to absence.
For V. vulnificus, the highest contributions to vibriosis occurrence were consistent with high SST (~34°C), low SSS (~8.7 ppt), large range of microphytoplankton, and high precipitation (~23.2 mm). Highest contributions to vibriosis absence were low SST values (~1.5°C), high SSS (slightly below ~36.5 ppt), high nanophytoplankton (~0.58 mg/L), large range of microphytoplankton, low precipitation (~0.15 mm), and high picophytoplankton (~0.33 mg/L). Other contributions to presence and absence by other variables were either small or had very few data points. High SST values had the highest contribution to presence, and low SST values had the highest contribution to absence.
K-means clustering results
Table 3 displays results of k-means clustering analysis for all Vibrio spp., separated by foodborne and nonfoodborne. Most had three distinct clusters, as illustrated by Fig. 6. Generally, one with SSS ~30–35 ppt, SST ~24°C–27°C, and chl-a concentration ~4–7 mg/L, and a second cluster with slightly lower SSS, slightly different SST (some higher, some lower), a higher chl-a concentration of ~10–15 mg/L, and higher precipitation. The third generally had SSS of ~15–20 ppt, SST of ~22°C–27 °C, and chl-a concentrations of ~12–22 mg/L. The three clusters were consistent for all Vibrio spp. and both foodborne and nonfoodborne cases. However, the distribution of cases within the clusters (i.e., the number of cases assigned to each cluster) was inconsistent between Vibrio spp. and foodborne and nonfoodborne cases, indicating that, while these are patterns for environmental conditions, cases are reported at different frequencies within these conditions, depending on species and infection mechanism. Additionally, differences in environmental values between foodborne and nonfoodborne clusters, while noted, were small and inconsistent across Vibrio spp., with the only consistency being nonfoodborne infections having generally higher SST (~1°C–3°C for all species) and SSS (~1 ppt in four out of six species) compared to foodborne infections. Therefore, both foodborne and nonfoodborne cases appear to be associated with similar environmental conditions.
TABLE 3.
Correlation coefficients and slopes of minimum, 25th percentile, median, 75th percentile, and maximum values of environmental variables for multi-year periods (1990–1996, 1997–2002, 2003–2008, 2009–2014, and 2015–2019) within 20 km of the eastern and southern US coastlinea,b
| Parameter | SST | ||||
|---|---|---|---|---|---|
| Percentile | Minimum | 25th Percentile | Median | 75th Percentile | Maximum |
| Correlation | −0.547 | 0.010 | 0.919 | 0.947 | 0.864 |
| Slope (°C/period) | −0.272 | 0.002 | 0.183 | 0.471 | 2.638 |
| Parameter | SSS | ||||
| Percentile | Minimum | 25th Percentile | Median | 75th Percentile | Maximum |
| Correlation | 0.891 | 0.985 | 0.951 | 0.965 | 0.929 |
| Slope (ppt/period) | 0.521 | 0.184 | 0.254 | 0.110 | 0.051 |
| Parameter | Precipitation | ||||
| Percentile | Minimum | 25th Percentile | Median | 75th Percentile | Maximum |
| Correlation | −0.757 | 0.371 | 0.577 | 0.771 | 0.547 |
| Slope (dm3/period) | −0.02 | 0.04 | 0.07 | 0.11 | 1.45 |
| Parameter | Microphytoplankton | ||||
| Percentile | Minimum | 25th Percentile | Median | 75th Percentile | Maximum |
| Correlation | −0.007 | 0.906 | 0.834 | 0.751 | 0.855 |
| Slope (mg/L/period) | 0.000 | 0.138 | 0.285 | 0.633 | 1.773 |
| Parameter | Picophytoplankton | ||||
| Percentile | Minimum | 25th Percentile | Median | 75th Percentile | Maximum |
| Correlation | N/A | N/A | −0.831 | −0.884 | −0.908 |
| Slope (mg/L/period) | 0.000 | 0.000 | −0.033 | −0.006 | −1.24E-12 |
| Parameter | Nanophytoplankton | ||||
| Percentile | Minimum | 25th Percentile | Median | 75th Percentile | Maximum |
| Correlation | N/A | −0.702 | −0.470 | −0.332 | 0.528 |
| Slope (mg/L/period) | 0.000 | −0.030 | −0.002 | −0.002 | 5.03E-07 |
Bolded correlations are statistically significant (P < 0.05).
“N/A” indicates the number of observations that are not provided for the interquartile range (Q1-Q3) because the values are a range, not a single value.
Fig 6.
3D scatterplots of SST, SSS, and chl-a from k-means centroids.
DISCUSSION
Observed environmental changes, including increased coastal water temperatures in the Gulf of Maine and Middle Atlantic Bight (66), frequency and severity of marine heatwaves along the northeastern United States (67), salinity variability in the upper ocean (68), precipitation extremes (69), and chlorophyll concentrations in coastal regions (70), coincide with results obtained in the study, namely significant increases to coastal SST and SSS and northern shift of the incidence of Vibrio spp. infections occurring in the population living along the eastern U.S. coastline as reported for 1990–2019. Determination of the latitudinal shift of V. vulnificus supports the findings reported by Archer et al. (14). Interestingly, latitudinal increase in the northern case limit for V. vulnificus over time, while significant, was lowest of all Vibrio spp., at 0.354 degrees (~39.3 km) per year for northernmost reported case and 0.031° (~3.44 km) per year for median latitude of reported cases. Per contra, V. alginolyticus and V. parahaemolyticus showed the largest distributional changes, with shifts of 0.633° (~70.3 km) and 0.550° (~61.1 km) per year to the latitude of their northernmost reported cases and 0.178° (~19.8 km) and 0.447° (~49.6 km) per year to the latitude of their median reported cases, respectively. Therefore, while the changes between climate and vibriosis coincide, not all Vibrio spp. were affected equally.
As vibriosis cases continue to rise in northern latitudes of the eastern U.S., there is growing concern about the potential larger outbreaks, particularly in densely populated coastal states such as New York and New Jersey, which rank among the top five in coastal population (71), as well as other northern states. The potential risk of vibriosis becoming endemic in these areas is therefore significant. While the findings of this study are specific to the U.S. eastern and southern coasts, increasing vibriosis incidence has been documented globally, including Japan and Southeast Asia (72), northern Europe (73), Peru and Alaska (74), and Malaysia (75). Although the models developed here may not directly transfer to other regions, the modeling framework and environmental data sources are broadly applicable and can support global efforts to predict the risk of vibriosis and other waterborne diseases.
Results of this study show a strong connection between environmental conditions and reported vibriosis cases. Specifically, the XGBoost models identified vibriosis accurately for all Vibrio spp. (average accuracy >60%) using only environmental data as model input, especially important given the large spatial extent of the data and large class imbalance. Models for V. vulnificus and V. alginolyticus performed best, likely due to the fact that vibriosis caused by the two species was almost all nonfoodborne infections, compared to V. parahaemolyticus, which is generally associated with foodborne infection. Nonfoodborne infections perhaps may be more easily incorporated into predictive algorithms, since environmental conditions are strongly indicative of environmental conditions where the infection occurred. Foodborne infections, in contrast, involve greater variability, since contamination of seafood can occur during transport and improper storage can result in bacterial contamination.
The importance of SST and SSS as indicators for vibriosis is evident, given that they are the two most important variables in each model and relatively high SHAP values compared to other variables. These variables should be central to any future models, particularly given the observation that these parameters along the coastline and coastal vibriosis cases have both increased significantly and concurrently. While precipitation and phytoplankton had highly complex nonlinear relationships with vibriosis, they did show high relative importance and high SHAP values, indicating that while the relationship is complex, they are important to include in future models. It is useful to include them in models to differentiate Vibrio sp., since most of the differences between models were captured by response to phytoplankton size and precipitation.
Other investigators have reported an increase in Vibrio spp. proliferation related to increasing temperature (22). Ocean warming has contributed to a 0.93°C (0.73°C–1.04°C) increase in ocean temperatures in 2013–2022 compared to 1850–1900 (76). Here, we observed an increase of annual mean SST within 20 km of the U.S. eastern and southern coasts of ca. 1.33°C (1990–1996 compared to 2015–2019), with increases greatest in summer months (between ca. 2.47°C and 2.67°C; see supplemental material). With climate change linked to both of these increased temperatures and SST, the strongest indicator of vibriosis in XGBoost models, it is concluded that ocean warming is a very strong contributor to the observed change in vibriosis case distribution over time.
Salinity is important with respect to vibriosis modeling and clustering. Previous studies reported Vibrio spp. have a preferred salinity range, and for some species, the range is highly specific. V. cholerae, for example, grows optimally at salinities between 15 and 25 ppt (77), while for V. alginolyticus, optimal growth is achieved at salinity concentrations between 30 and 35 ppt (78). V. vulnificus and V. parahaemolyticus isolates from the Chesapeake Bay grow best at 10–15 ppt salinity (13) and V. mimicus at 4 ppt in some aquatic environments, e.g., in Dhaka, Bangladesh, and Okayama, Japan (79). Thus, the salinity results of this study align well with previous reports, except V. mimicus, for which limited data are available with respect to salinity tolerance. Changes in ocean salinity since the mid-20th century have been documented (76) and an increasing frequency of salinity variability predicted (68).
A growing body of evidence has clarified the role of environmental parameters in shaping the occurrence and distribution of Vibrio spp., notably V. cholerae, V. parahaemolyticus, V. vulnificus, and V. alginolyticus, all of which are naturally occurring in aquatic ecosystems (2). These bacteria typically thrive in warm, moderately saline waters. However, some Vibrio spp., notably V. parahaemolyticus and V. alginolyticus, exhibit broad salinity tolerance, and others exist in hypersaline waters (78, 80, 81). In the study reported here, an increased salinity was noted along the coastline, pushing these regions above an optimal range for the proliferation of Vibrio spp. However, in estuarine and brackish environments, where the baseline salinity is lower, such increases may create more favorable conditions for the growth of Vibrio spp. The case reports served as useful proxies for environmental proliferation, and a more pronounced shift in vibriosis distribution in the dynamic coastal regions can be expected.
The contributions of chl-a and phytoplankton to Vibrio spp. growth and proliferation are complex, as determined by the models, with each Vibrio sp. having a specific optimal chl-a relationship with respect to its distribution in the environment. The k-means clustering analysis showed two clusters of chl-a concentration, ca. 10–20 mg/L with differing salinities for the clusters. However, clusters of cases associated with chl-a concentrations of ca. 4–7 mg/L were noted. Clearly, increased nutrient and phytoplankton concentrations have been shown to be associated with increased abundance of Vibrio spp. (21, 30, 82, 83). Nonetheless, the results of the study reported here indicate that the relationship is complex, with multiple factors involved in Vibrio spp. replication and abundance in the environment. For example, the microphytoplankton is composed of diatoms and dinoflagellates, the picophytoplankton pico-eukaryotes, prokaryotes, and Prochlorococcus spp., and the nanophytoplankton green algae and prymnesiophytes (haptophytes) (63). Understanding their thresholds indicative of vibriosis potential for each Vibrio spp. in future studies will provide insight into the complexity of communities with which Vibrio spp. are associated.
Study limitations
Primary limitations of this study derive from the uncertainty of the exact location of case initiation and the proximity of cases to given environmental conditions. That is, cases are reported correctly in the counties of interest, but there is the possibility of lagged reporting of cases, unreported cases, or cases having no association with source environmental conditions, i.e., imported seafood or hospital visits not in the county where the case occurred. Such factors have the potential to introduce spatial and temporal uncertainty, which can reduce the precision of the association between environmental parameter and case occurrence. In particular, foodborne infections linked to transported seafood will obscure the local environmental driver, while nonfoodborne cases (predominantly wound infections) are more likely to reflect exposure at or near the site of reporting.
Additionally, the application of the models is limited by training models using random under-sampling on imbalanced data sets. Applying them to real-world scenarios, where outcomes are highly imbalanced, would provide lower precision and recall than reported here. This methodological tradeoff improves model interpretability but reduces performance when applied to unbalanced population-level data.
However, the results indicate reliable use of environmental parameters to differentiate case presence and absence. Importantly, model outputs should be interpreted as identifying ecological suitability for vibriosis, rather than predicting exact case counts. Future efforts should be able to incorporate improved case attribution, higher-resolution exposure data, and alternative approaches for handling class imbalance to enhance predictive accuracy and generalizability on a global scale.
Conclusions
Incidence of vibriosis has increased over the past 30 years and has also expanded with respect to geographic distribution, notably along the U.S. eastern coast. The northernmost reported case latitude increase for V. parahaemolyticus (61.0 km/year), V. vulnificus (39.3 km/year), V. alginolyticus (70.3 km/year), V. cholerae non-O1, non-O139 (54.6 km/year), V. fluvialis (60.2 km/year), and V. mimicus (42.1 km/year) cases is significant. The total number of cases of vibriosis attributed to these six species is ca. 500 annually. This study provides a foundation for assessing environmental drivers of vibriosis associated with climate change, notably SST, SSS, and chl-a. Predictive risk models for non-cholera Vibrio spp. infections and those caused by less commonly reported species, such as V. alginolyticus, V. fluvialis, and V. mimicus, can now be constructed. In summary, modeling efforts will be critical in addressing future changes in the incidence of vibriosis.
ACKNOWLEDGMENTS
This research was funded by research grants from the National Science Foundation (CBET1751854 ) and the National Aeronautics and Space Administration (80NSSC20K0814 and 80NSSC22K1044). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Contributor Information
Antarpreet S. Jutla, Email: ajutla@ufl.edu.
Julia C. van Kessel, Indiana University Bloomington, Bloomington, Indiana, USA
SUPPLEMENTAL MATERIAL
The following material is available online at https://doi.org/10.1128/aem.00716-25.
Figures S1 to S12.
ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.
REFERENCES
- 1. Singleton FL, Attwell RW, Jangi MS, Colwell RR. 1983. Influence of salinity, nutrient concentration and temperature on growth and survival of Vibrio Choleras in the aquatic environment, p 101–112. In Advances in research on Cholera and related diarrheas. Springer Netherlands, Dordrecht. [Google Scholar]
- 2. Brumfield KD, Usmani M, Chen KM, Gangwar M, Jutla AS, Huq A, Colwell RR. 2021. Environmental parameters associated with incidence and transmission of pathogenic Vibrio spp. Environ Microbiol 23:7314–7340. doi: 10.1111/1462-2920.15716 [DOI] [PubMed] [Google Scholar]
- 3. Tamplin ML, Gauzens AL, Huq A, Sack DA, Colwell RR. 1990. Attachment of Vibrio cholerae serogroup O1 to zooplankton and phytoplankton of Bangladesh waters. Appl Environ Microbiol 56:1977–1980. doi: 10.1128/aem.56.6.1977-1980.1990 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Yang JH, Mok JS, Jung YJ, Lee KJ, Kwon JY, Park K, Moon SY, Kwon SJ, Ryu AR, Lee TS. 2017. Distribution and antimicrobial susceptibility of Vibrio species associated with zooplankton in coastal area of Korea. Mar Pollut Bull 125:39–44. doi: 10.1016/j.marpolbul.2017.07.054 [DOI] [PubMed] [Google Scholar]
- 5. Kemp KM, Westrich JR, Alabady MS, Edwards ML, Lipp EK. 2018. Abundance and multilocus sequence analysis of Vibrio bacteria associated with diseased Elkhorn Coral (Acropora palmata) of the Florida keys. Appl Environ Microbiol 84:e01035-17. doi: 10.1128/AEM.01035-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Halpern M, Izhaki I. 2017. Fish as hosts of Vibrio cholerae. Front Microbiol 8. doi: 10.3389/fmicb.2017.00282 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Silva IP, de Carneiro CS, Saraiva MAF, de Oliveira TAS, de Sousa OV, Evangelista-Barreto NS. 2018. Antimicrobial resistance and potential virulence of Vibrio parahaemolyticus isolated from water and bivalve mollusks from Bahia, Brazil. Mar Pollut Bull 131:757–762. doi: 10.1016/j.marpolbul.2018.05.007 [DOI] [PubMed] [Google Scholar]
- 8. Vandenberghe J, Verdonck L, Robles-Arozarena R, Rivera G, Bolland A, Balladares M, Gomez-Gil B, Calderon J, Sorgeloos P, Swings J. 1999. Vibrios associated with Litopenaeus vannamei larvae, postlarvae, broodstock, and hatchery probionts. Appl Environ Microbiol 65:2592–2597. doi: 10.1128/AEM.65.6.2592-2597.1999 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Thompson JR, Randa MA, Marcelino LA, Tomita-Mitchell A, Lim E, Polz MF. 2004. Diversity and dynamics of a North Atlantic coastal Vibrio community. Appl Environ Microbiol 70:4103–4110. doi: 10.1128/AEM.70.7.4103-4110.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Zhang X, Lin H, Wang X, Austin B. 2018. Significance of Vibrio species in the marine organic carbon cycle—a review. Sci China Earth Sci 61:1357–1368. doi: 10.1007/s11430-017-9229-x [DOI] [Google Scholar]
- 11. Li Y, Wang Y, Fu L, Gao Y, Zhao H, Zhou W. 2017. Aerobic-heterotrophic nitrogen removal through nitrate reduction and ammonium assimilation by marine bacterium Vibrio sp. Y1-5. Bioresour Technol 230:103–111. doi: 10.1016/j.biortech.2017.01.049 [DOI] [PubMed] [Google Scholar]
- 12. CDC . 2024. Vibrio Infection (Vibriosis). Available from: https://www.cdc.gov/vibrio/about/index.html
- 13. Brumfield KD, Chen AJ, Gangwar M, Usmani M, Hasan NA, Jutla AS, Huq A, Colwell RR. 2023. Environmental factors influencing occurrence of Vibrio parahaemolyticus and Vibrio vulnificus. Appl Environ Microbiol 89:e0030723. doi: 10.1128/aem.00307-23 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Archer EJ, Baker-Austin C, Osborn TJ, Jones NR, Martínez-Urtaza J, Trinanes J, Oliver JD, González FJC, Lake IR. 2023. Climate warming and increasing Vibrio vulnificus infections in North America. Sci Rep 13:3893. doi: 10.1038/s41598-023-28247-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Jayakumar JM, Martinez-Urtaza J, Brumfield KD, Jutla AS, Colwell RR, Cordero OX, Almagro-Moreno S. 2024. Climate change and Vibrio vulnificus dynamics: a blueprint for infectious diseases. PLoS Pathog 20:e1012767. doi: 10.1371/journal.ppat.1012767 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Newton A, Kendall M, Vugia DJ, Henao OL, Mahon BE. 2012. Increasing rates of vibriosis in the United States, 1996-2010: review of surveillance data from 2 systems. Clin Infect Dis 54 Suppl 5:S391–S395. doi: 10.1093/cid/cis243 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Morgado ME, Brumfield KD, Mitchell C, Boyle MM, Colwell RR, Sapkota AR. 2024. Increased incidence of vibriosis in Maryland, U.S.A., 2006–2019. Environ Res 244:117940. doi: 10.1016/j.envres.2023.117940 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Froelich BA, Daines DA. 2020. In hot water: effects of climate change on Vibrio-human interactions. Environ Microbiol 22:4101–4111. doi: 10.1111/1462-2920.14967 [DOI] [PubMed] [Google Scholar]
- 19. Vezzulli L, Grande C, Reid PC, Hélaouët P, Edwards M, Höfle MG, Brettar I, Colwell RR, Pruzzo C. 2016. Climate influence on Vibrio and associated human diseases during the past half-century in the coastal North Atlantic. Proc Natl Acad Sci USA 113:E5062–E5071. doi: 10.1073/pnas.1609157113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Jutla AS, Akanda AS, Griffiths JK, Colwell R, Islam S. 2011. Warming oceans, phytoplankton, and river discharge: implications for Cholera outbreaks. The American Society of Tropical Medicine and Hygiene 85:303–308. doi: 10.4269/ajtmh.2011.11-0181 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Lobitz B, Beck L, Huq A, Wood B, Fuchs G, Faruque ASG, Colwell R. 2000. Climate and infectious disease: use of remote sensing for detection of Vibrio cholerae by indirect measurement. Proc Natl Acad Sci USA 97:1438–1443. doi: 10.1073/pnas.97.4.1438 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Vezzulli L, Baker‐Austin C, Kirschner A, Pruzzo C, Martinez‐Urtaza J. 2020. Global emergence of environmental non‐O1 / O139 Vibrio cholerae infections linked with climate change: a neglected research field. Environ Microbiol 22:4342–4355. doi: 10.1111/1462-2920.15040 [DOI] [PubMed] [Google Scholar]
- 23. Baker-Austin C, Trinanes J, Gonzalez-Escalona N, Martinez-Urtaza J. 2017. Non-cholera vibrios: the microbial barometer of climate change. Trends Microbiol 25:76–84. doi: 10.1016/j.tim.2016.09.008 [DOI] [PubMed] [Google Scholar]
- 24. Emch M, Feldacker C, Yunus M, Streatfield PK, DinhThiem V, Canh DG, Ali M. 2008. Local environmental predictors of cholera in Bangladesh and Vietnam. Am J Trop Med Hyg 78:823–832. [PubMed] [Google Scholar]
- 25. Jutla AS, Akanda AS, Islam S. 2013. A framework for predicting endemic cholera using satellite derived environmental determinants. Environ Model Softw 47:148–158. doi: 10.1016/j.envsoft.2013.05.008 [DOI] [Google Scholar]
- 26. Constantin de Magny G, Murtugudde R, Sapiano MRP, Nizam A, Brown CW, Busalacchi AJ, Yunus M, Nair GB, Gil AI, Lanata CF, Calkins J, Manna B, Rajendran K, Bhattacharya MK, Huq A, Sack RB, Colwell RR. 2008. Environmental signatures associated with cholera epidemics. Proc Natl Acad Sci USA 105:17676–17681. doi: 10.1073/pnas.0809654105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Davis BJK, Jacobs JM, Davis MF, Schwab KJ, DePaola A, Curriero FC. 2017. Environmental Determinants of Vibrio parahaemolyticus in the Chesapeake Bay. Appl Environ Microbiol 83:e01147-17. doi: 10.1128/AEM.01147-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Wetz JJ, Blackwood AD, Fries JS, Williams ZF, Noble RT. 2014. Quantification of Vibrio vulnificus in an estuarine environment: a multi-year analysis using QPCR. Estuaries Coast 37:421–435. doi: 10.1007/s12237-013-9682-4 [DOI] [Google Scholar]
- 29. Johnson CN, Bowers JC, Griffitt KJ, Molina V, Clostio RW, Pei S, et al. 2012. Ecology of Vibrio parahaemolyticus and Vibrio vulnificus in the coastal and estuarine waters of Louisiana 78:7249–7257. doi: 10.1128/AEM.01296-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Thickman JD, Gobler CJ. 2017. The ability of algal organic matter and surface runoff to promote the abundance of pathogenic and non-pathogenic strains of Vibrio parahaemolyticus in Long Island Sound, USA. PLoS One 12:e0185994. doi: 10.1371/journal.pone.0185994 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Colwell RR. 1996. Global climate and infectious disease: the Cholera paradigm. Science 274:2025–2031. doi: 10.1126/science.274.5295.2025 [DOI] [PubMed] [Google Scholar]
- 32. Huq A, Small EB, West PA, Huq MI, Rahman R, Colwell RR. 1983. Ecological relationships between Vibrio cholerae and planktonic crustacean copepods. Appl Environ Microbiol 45:275–283. doi: 10.1128/aem.45.1.275-283.1983 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Berk SG, Colwell RR, Small EB. 1976. A study of feeding responses to bacterial prey by Estuarine ciliates. Trans Am Microsc Soc 95:514. doi: 10.2307/3225146 [DOI] [Google Scholar]
- 34. Henson SA, Sarmiento JL, Dunne JP, Bopp L, Lima I, Doney SC, John J, Beaulieu C. 2010. Detection of anthropogenic climate change in satellite records of ocean chlorophyll and productivity. Biogeosciences 7:621–640. doi: 10.5194/bg-7-621-2010 [DOI] [Google Scholar]
- 35. Kirpich A, Weppelmann TA, Yang Y, Ali A, Morris JG, Longini IM. 2015. Cholera transmission in ouest department of Haiti: dynamic modeling and the future of the epidemic. PLoS Negl Trop Dis 9:e0004153. doi: 10.1371/journal.pntd.0004153 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Jutla A, Aldaach H, Billian H, Akanda A, Huq A, Colwell R. 2015. Satellite based assessment of hydroclimatic conditions related to Cholera in Zimbabwe. PLoS ONE 10:e0137828. doi: 10.1371/journal.pone.0137828 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Eisenberg MC, Kujbida G, Tuite AR, Fisman DN, Tien JH. 2013. Examining rainfall and Cholera dynamics in Haiti using statistical and dynamic modeling approaches. Epidemics 5:197–207. doi: 10.1016/j.epidem.2013.09.004 [DOI] [PubMed] [Google Scholar]
- 38. CDC . 2005. Vibrio illnesses after hurricane katrina --- multiple states, August--September 2005. Available from: https://www.cdc.gov/mmWr/preview/mmwrhtml/mm5437a5.htm [PubMed]
- 39. Esteves K, Hervio-Heath D, Mosser T, Rodier C, Tournoud M-G, Jumas-Bilak E, Colwell RR, Monfort P. 2015. Rapid proliferation of Vibrio parahaemolyticus, Vibrio vulnificus, and Vibrio cholerae during freshwater flash floods in French Mediterranean coastal lagoons. Appl Environ Microbiol 81:7600–7609. doi: 10.1128/AEM.01848-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Tabari H. 2020. Climate change impact on flood and extreme precipitation increases with water availability. Sci Rep 10:13768. doi: 10.1038/s41598-020-70816-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Sheikh HI, Najiah M, Fadhlina A, Laith AA, Nor MM, Jalal KCA, Kasan NA. 2022. Temperature upshift mostly but not always enhances the growth of Vibrio species: a systematic review. Front Mar Sci 9:959830. doi: 10.3389/fmars.2022.959830 [DOI] [Google Scholar]
- 42. Farmer JJ, Janda JM, Brenner FW, Cameron DN, Birkhead KM. 2005. Genus Vibrio, p 494–546. In Krieg NR, Staley JT (ed), Bergey’s Manual of Systematic Bacteriology, 2nd edn. Springer, New York. [Google Scholar]
- 43. Urakawa H, Rivera ING. 2014. Aquatic Environment, p 173–189. In The Biology of Vibrios. ASM Press, Washington, DC, USA. [Google Scholar]
- 44. Eiler A, Gonzalez-Rey C, Allen S, Bertilsson S. 2007. Growth response of Vibrio cholerae and other Vibrio spp. to cyanobacterial dissolved organic matter and temperature in brackish water. FEMS Microbiol Ecol 60:411–418. doi: 10.1111/j.1574-6941.2007.00303.x [DOI] [PubMed] [Google Scholar]
- 45. Jutla A, Whitcombe E, Hasan N, Haley B, Akanda A, Huq A, Alam M, Sack RB, Colwell R. 2013. Environmental factors influencing epidemic Cholera. The American Society of Tropical Medicine and Hygiene 89:597–607. doi: 10.4269/ajtmh.12-0721 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Hashizume M, Armstrong B, Hajat S, Wagatsuma Y, Faruque ASG, Hayashi T, Sack DA. 2008. The effect of rainfall on the incidence of cholera in Bangladesh. Epidemiology 19:103–110. doi: 10.1097/EDE.0b013e31815c09ea [DOI] [PubMed] [Google Scholar]
- 47. Khan R, Anwar R, Akanda S, McDonald MD, Huq A, Jutla A, Colwell R. 2017. Assessment of risk of Cholera in haiti following hurricane matthew. Am J Trop Med Hyg 97:896–903. doi: 10.4269/ajtmh.17-0048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Khan R, Nguyen TH, Shisler J, Lin LS, Jutla A, Colwell R. 2018. Evaluation of risk of cholera after a natural disaster: lessons learned from the 2015 Nepal earthquake. J Water Resour Plann Manage 144. doi: 10.1061/(ASCE)WR.1943-5452.0000929 [DOI] [Google Scholar]
- 49. Usmani M, Brumfield KD, Magers BM, Chaves-Gonzalez J, Ticehurst H, Barciela R, McBean F, Colwell RR, Jutla A. 2023. Combating cholera by building predictive capabilities for pathogenic Vibrio cholerae in Yemen. Sci Rep 13:2255. doi: 10.1038/s41598-022-22946-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Usmani M, Brumfield KD, Magers BM, Huq A, Barciela R, Nguyen TH, Colwell RR, Jutla A. 2022. Predictive intelligence for cholera in Ukraine? Geohealth 6:e2022GH000681. doi: 10.1029/2022GH000681 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. FDA . 2005. Pathogenic Vibrio parahaemolyticus in raw oysters
- 52. National Centers for Coastal Ocean Science . 2018. Vibrio predictive models. Available from: https://products.coastalscience.noaa.gov/vibrioforecast/
- 53. National Centers for Coastal Ocean Science . 2019. Water guidance - Vibrio vulnificus in Chesapeake Bay. Available from: https://products.coastalscience.noaa.gov/vibrioforecast/vulnificus/chesapeake/default.aspx
- 54. DeLuca NM, Zaitchik BF, Guikema SD, Jacobs JM, Davis BJK, Curriero FC. 2020. Evaluation of remotely sensed prediction and forecast models for Vibrio parahaemolyticus in the Chesapeake Bay. Remote Sens Environ 250:112016. doi: 10.1016/j.rse.2020.112016 [DOI] [Google Scholar]
- 55. Semenza JC, Trinanes J, Lohr W, Sudre B, Löfdahl M, Martinez-Urtaza J, Nichols GL, Rocklöv J. 2017. Environmental suitability of Vibrio infections in a warming climate: an early warning system. Environ Health Perspect 125:107004. doi: 10.1289/EHP2198 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Campbell AM, Cabrera-Gumbau JM, Trinanes J, Baker-Austin C, Martinez-Urtaza J. 2025. Machine learning potential for identifying and forecasting complex environmental drivers of Vibrio vulnificus infections in the United States. Environ Health Perspect 133:17006. doi: 10.1289/EHP15593 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. CDC. 2024. Cholera and Other Vibrio Illness Surveillance System. Available from: https://www.cdc.gov/vibrio/php/surveillance/index.html
- 58. Huang B, Liu C, Banzon V, Freeman E, Graham G, Hankins B, Smith T, Zhang H-M. 2021. Improvements of the daily optimum interpolation sea surface temperature (DOISST) Version 2.1. J Clim 34:2923–2939. doi: 10.1175/JCLI-D-20-0166.1 [DOI] [Google Scholar]
- 59. NASA . 2024. Ocean Color Level 3&4 Browser. Available from: https://oceancolor.gsfc.nasa.gov/
- 60. Copernicus Climate Change Service CDS. 2021. ORAS5 Global Ocean Reanalysis Monthly Data from 1958 to Present. Copernicus Climate Change Service (C3S) Climate Data Store (CDS. Available from: 10.24381/cds.67e8eeb7 [DOI]
- 61. Hersbach H, Bell B, Berrisford P, Biavati G, Horányi A, Muñoz Sabater J. 2023. ERA5 monthly averaged data on single levels from 1940 to present. Copernicus Climate Change Service (C3S). doi: 10.24381/cds.f17050d7 [DOI]
- 62. E.U. Copernicus Marine Service Information (CMEMS) Marine Data Store (MDS) . 2024. Global ocean colour (copernicus-globcolour), bio-geo-chemical, L4 (monthly and interpolated) from satellite observations (1997-ongoing)
- 63. Hirata T, Hardman-Mountford NJ, Brewin RJW, Aiken J, Barlow R, Suzuki K, Isada T, Howell E, Hashioka T, Noguchi-Aita M, Yamanaka Y. 2011. Synoptic relationships between surface Chlorophyll- a and diagnostic pigments specific to phytoplankton functional types. Biogeosciences 8:311–327. doi: 10.5194/bg-8-311-2011 [DOI] [Google Scholar]
- 64. Chen T, Guestrin C. 2016. XGBoost. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; San Francisco California USA: , p 785–794ACM, New York, NY, USA [Google Scholar]
- 65. Lundberg SM, Lee SI. 2017. A unified approach to interpreting model predictions. Proceedings of the 31st International Conference on Neural Information Processing Systems. , p 4768–4777Curran Associates Inc, Red Hook, NY, USA [Google Scholar]
- 66. Shearman RK, Lentz SJ. 2010. Long-term sea surface temperature variability along the U.S. East Coast. J Phys Oceanogr 40:1004–1017. doi: 10.1175/2009JPO4300.1 [DOI] [Google Scholar]
- 67. Xu T, Newman M, Capotondi A, Stevenson S, Di Lorenzo E, Alexander MA. 2022. An increase in marine heatwaves without significant changes in surface ocean temperature variability. Nat Commun 13:7396. doi: 10.1038/s41467-022-34934-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Cheng L, Trenberth KE, Gruber N, Abraham JP, Fasullo JT, Li G, Mann ME, Zhao X, Zhu J. 2020. Improved estimates of changes in upper ocean salinity and the hydrological cycle. J Clim 33:10357–10381. doi: 10.1175/JCLI-D-20-0366.1 [DOI] [Google Scholar]
- 69. Trenberth K. 2011. Changes in precipitation with climate change. Clim Res 47:123–138. doi: 10.3354/cr00953 [DOI] [Google Scholar]
- 70. Gregg WW, Casey NW, McClain CR. 2005. Recent trends in global ocean chlorophyll. Geophys Res Lett 32. doi: 10.1029/2004GL021808 [DOI] [Google Scholar]
- 71. NOAA . 2025. NOAA Office for Coastal Management. Available from: https://coast.noaa.gov/states/fast-facts/economics-and-demographics.html#:~:text=California%20tops%20the%20coastal%20populations,and%20Texas%20with%206.9%20million
- 72. Okuda J, Ishibashi M, Hayakawa E, Nishino T, Takeda Y, Mukhopadhyay AK, Garg S, Bhattacharya SK, Nair GB, Nishibuchi M. 1997. Emergence of a unique O3:K6 clone of Vibrio parahaemolyticus in Calcutta, India, and isolation of strains from the same clonal group from Southeast Asian travelers arriving in Japan. J Clin Microbiol 35:3150–3155. doi: 10.1128/jcm.35.12.3150-3155.1997 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Amato E, Riess M, Thomas-Lopez D, Linkevicius M, Pitkänen T, Wołkowicz T, Rjabinina J, Jernberg C, Hjertqvist M, MacDonald E, Antony-Samy JK, Dalsgaard Bjerre K, Salmenlinna S, Fuursted K, Hansen A, Naseer U. 2022. Epidemiological and microbiological investigation of a large increase in vibriosis, Northern Europe, 2018. Euro Surveill 27:2101088. doi: 10.2807/1560-7917.ES.2022.27.28.2101088 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Martinez-Urtaza J, Bowers JC, Trinanes J, DePaola A. 2010. Climate anomalies and the increasing risk of Vibrio parahaemolyticus and Vibrio vulnificus illnesses. Food Res Int 43:1780–1790. doi: 10.1016/j.foodres.2010.04.001 [DOI] [Google Scholar]
- 75. Hassan M, Mohd Ali MR, Zamri HF, Nor Amdan NA, Azmai MNA, Maniam S, Mohamed Alipiah N, Hashim R. 2023. Distribution, prevalence, and antibiotic susceptibility profiles of infectious noncholera Vibrio species in Malaysia. J Trop Med 2023:2716789. doi: 10.1155/2023/2716789 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Calvin K, Dasgupta D, Krinner G, Mukherji A, Thorne PW, Trisos C. 2023. Climate change 2023: synthesis report. contribution of working groups I, II and III to the sixth assessment report of the intergovernmental panel on climate change. Edited by Lee H. and Romero J.. Geneva, Switzerland: IPCC [Google Scholar]
- 77. Singleton FL, Attwell R, Jangi S, Colwell RR. 1982. Effects of temperature and salinity on Vibrio cholerae growth. Appl Environ Microbiol 44:1047–1058. doi: 10.1128/aem.44.5.1047-1058.1982 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Norfolk WA, Shue C, Henderson WM, Glinski DA, Lipp EK. 2023. Vibrio alginolyticus growth kinetics and the metabolic effects of iron. Microbiol Spectr 11:e0268023. doi: 10.1128/spectrum.02680-23 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Chowdhury MA, Yamanaka H, Miyoshi S, Aziz KM, Shinoda S. 1989. Ecology of Vibrio mimicus in aquatic environments. Appl Environ Microbiol 55:2073–2078. doi: 10.1128/aem.55.8.2073-2078.1989 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80. Gorrasi S, Pasqualetti M, Franzetti A, Pittino F, Fenice M. 2020. Vibrio communities along a salinity gradient within a marine saltern hypersaline environment (Saline di Tarquinia, Italy). Environ Microbiol 22:4356–4366. doi: 10.1111/1462-2920.15041 [DOI] [PubMed] [Google Scholar]
- 81. Naughton LM, Blumerman SL, Carlberg M, Boyd EF. 2009. Osmoadaptation among Vibrio species and unique genomic features and physiological responses of Vibrio parahaemolyticus. Appl Environ Microbiol 75:2802–2810. doi: 10.1128/AEM.01698-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82. Julie D, Solen L, Antoine V, Jaufrey C, Annick D, Dominique H-H. 2010. Ecology of pathogenic and non-pathogenic Vibrio parahaemolyticus on the French Atlantic coast. Effects of temperature, salinity, turbidity and chlorophyll a. Environ Microbiol 12:929–937. doi: 10.1111/j.1462-2920.2009.02136.x [DOI] [PubMed] [Google Scholar]
- 83. Namadi P, Deng Z. 2023. Optimum environmental conditions controlling prevalence of vibrio parahaemolyticus in marine environment. Mar Environ Res 183:105828. doi: 10.1016/j.marenvres.2022.105828 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Citations
- Hersbach H, Bell B, Berrisford P, Biavati G, Horányi A, Muñoz Sabater J. 2023. ERA5 monthly averaged data on single levels from 1940 to present. Copernicus Climate Change Service (C3S). doi: 10.24381/cds.f17050d7 [DOI]
- E.U. Copernicus Marine Service Information (CMEMS) Marine Data Store (MDS) . 2024. Global ocean colour (copernicus-globcolour), bio-geo-chemical, L4 (monthly and interpolated) from satellite observations (1997-ongoing)
Supplementary Materials
Figures S1 to S12.






