Abstract
Life-history traits have been identified as major indicators of mammals' susceptibility and exposure to viruses due to evolutionary constraints that link life-history speed with species’ ecology and immunity. Nonetheless, it is unclear where along the fast-slow continuum of mammalian life-history lies the greatest diversity of host species. Consequently, life-history patterns that govern host–virus associations remain largely unknown. Here we analyse the virome of 1350 wild mammals and detect the characteristics that drive species' compatibility with different groups of viruses. We highlight that mammals with larger body size and either very rapid or very slow life histories are more likely to carry different groups of viruses, particularly zoonotic ones. While some common life-history patterns emerge across carriers, eco-evolutionary characteristics of viral groups appear to determine association with certain carrier species. Our findings underline the importance of incorporating both mammals’ life-history information and viruses' ecological diversity into surveillance strategies to identify potential zoonotic carriers in wildlife.
Keywords: disease ecology, life-history, viruses, zoonosis
1. Introduction
Two-thirds of emerging human diseases are zoonotic [1], and mammals account for approximately 88% of emerging viral zoonoses [2]. Nonetheless, very little is known of the total diversity of the mammalian virome [3], with many unknown zoonotic viruses predicted to circulate in wildlife [4]. Many studies [4–6] tried to uncover the evolutionary dynamics that link mammalian life-history traits to the diversity of viruses that they host. Understanding the patterns that drive viruses’ associations with mammalian species can advance knowledge on host–virus coevolutionary dynamics and direct viral surveillance towards groups of mammals that are more likely to harbour viruses of concern to human and animal health.
A growing body of research in the fields of ecoimmunology and evolutionary biology suggests a fundamental trade-off between immune defenses against pathogens and life-history traits due to the allocation of limited resources within an organism [7–10]. Fast-lived species—with small body sizes, short lifespans, rapid growth and early reproduction—are hypothesized to allocate relatively little energy to immune defences [8]. Probably, these species rely mainly on non-specific innate immune responses against pathogens [9], as these have little metabolic costs for development and maintenance [8]. Fast-lived species might have a higher predisposition to acquire, maintain and transmit viruses due to their weaker defences and tendency to produce numerous offspring that are naïve to infection, which might play an important role in maintaining the infection within the reservoir population [7,11]. By contrast, slow-lived species tend to be large and long-lived, investing more resources into costly defences provided by adaptive immune responses, which are typically slower and pathogen-specific [9]. A large investment in adaptive immunity may be positively selected in slow-lived species, as they are more likely to come into contact with a wide diversity of pathogens multiple times [9].
Several studies have investigated the association between mammalian life-history traits and susceptibility to viruses, reporting mixed evidence as to which of the two ends of the fast-slow continuum is associated with higher propensity to serve as a host [11–18]. Some studies found an association between faster life histories—particularly high offspring production and early sexual maturity—and greater viral competence in mammals [11–13]. But other works provided support of a relationship with slow pace-of-life instead. As an example, zoonotic viral richness in bats has been found to be positively associated with smaller litter size, larger body mass and greater longevity [14–16]. Longer gestation and larger body mass have also been linked to Ebolavirus and Ross River virus seroprevalence across wild mammals [17,18]. In these cases, it has been hypothesized that the immune systems of slow-lived mammals may provide higher tolerance to viral infection, making them suitable hosts for viruses' replication and shedding [9].
Here, we explore the hitherto known mammalian virome to detect patterns of compatibility between mammal species and different groups of viruses, identifying ecological profiles of viral carriers along the fast-slow continuum of mammalian life-history [19]. Our aim is to unveil commonalities and peculiarities across mammalian viral carriers, highlighting common patterns as well as unique features that drive mammals’ susceptibility to individual viral groups. We focus on viral ‘carriers’, instead of ‘hosts’, as the latter implies the species to be competent to the infection and capable of viral shedding, which cannot be assumed from currently available data. We include in the analyses multiple levels of viral complexity by breaking down viral diversity into groups according to three classification schemes: i) structural diversity (based on viral genomic structure and replication), ii) evolutionary diversity (based on viral taxonomy and evolution), iii) ecological diversity (based on a selection of traits that relate to viruses' relationship with their hosts and the environment). We perform separate analyses for all viruses and for the subset of viruses with known zoonotic potential, to highlight any differences in characteristics of general versus zoonotic mammalian carriers.
2. Methods
2.1. Carrier-virus associations data
We obtained carrier-virus associations from the Global Virome in One Network (VIRION), the most comprehensive database of the global vertebrate virome [20]. We filtered VIRION for unique associations between terrestrial wild mammal species and viruses that allowed for species-level identification of both carriers and viruses (detected by isolation, PCR, or serology). None of the analysed carrier-viral group associations was based on serology alone. Data entries of laboratory and domestic species were removed from our dataset to avoid introducing biased or inaccurate information in our analyses, since experimental infections often represent interactions that do not occur under natural conditions, and domesticated species exhibit life-history traits that have been shaped by artificial selection rather than natural evolutionary dynamics. We identified livestock and domestic species according to FAO's World Watch List for Domestic Animal Diversity [21], while a list of the most common laboratory species used for research purposes worldwide was obtained from the Directive of the European Parliament and of the Council on the protection of animals used for scientific purposes [22] and the Guide for the Care and Use of Laboratory Animals of the United States [23]. The filtering process left us with 9795 unique associations involving 1350 mammal species and 4482 viruses.
We also identified a subset of 3501 associations between 936 zoonotic carriers and 287 zoonotic viruses (figure 1). Zoonotic carriers were defined as those mammals that share at least one viral species (ratified by the International Committee for the Taxonomy of Viruses, ICTV) with humans according to the VIRION database [20]. Since zoonotic carrier status of mammals was defined using information on viral species (i.e. whether they were zoonotic or not), we only included ICTV ratified species to avoid erroneously including unofficially recognized viral strains and quasi-species. We acknowledge that our definition of zoonotic carriers does not strictly imply competence nor capacity of viral transmission to humans. Integrating experimental data on host competence would help ascertain between susceptibility and competence; however, these data are not currently available for most species and interactions [24].
Figure 1.
Taxonomic representation of mammalian species in our dataset. (a) Comparison between the proportional number of species in the six most represented mammalian orders in the dataset versus their overall relative number. We show the proportion of general (light red) and zoonotic (dark red) carriers represented by a given order, and a comparison with the proportion of mammalian species richness represented by the order (blue). The absolute number of species is shown on top of the corresponding bar. (b) Number of unique carrier-virus associations per each mammalian order. (c) Number of unique zoonotic carrier-virus associations per each mammalian order. In the circular plots, radial axes are log scaled for representation purposes.
2.2. Carrier characteristics
For each of the 1350 carrier species in the dataset, we collected a set of covariates potentially correlated to carrier status. We obtained life-history characteristics (body mass, longevity, gestation length, litter size, litters per year, interbirth interval and weaning age) from COMBINE, a harmonized repository of published and imputed data for mammal species [25]. To control for environmental confounding factors, we extracted the mean value of ten bioclimatic variables representing temperatures and precipitations within the carrier species' range (electronic supplementary material, table S1). We selected these variables because of their proven ability to predict mammal species distribution at a global scale [26]. All bioclimatic variables were obtained at a spatial resolution of 5 arcminutes (approximately 10 Km at the equator) from the WorldClim database [27]. Species’ range size (km2) was obtained from the spatial database of the International Union for Conservation of Nature (IUCN [28]) and included as an additional ecological variable. We also controlled for phylogenetic signals in carrier traits by including ten phylogenetic eigenvectors as indicators of species' phylogenetic position within the mammalian tree of life. The eigenvectors were retrieved from PHYLACINE v. 1.2 [29].
We controlled for correlation among covariates via variance inflation factors (VIF) using the usdm package in R [30]. We discarded highly correlated variables (VIF > 9), keeping those that showed relatively higher variable importance during preliminary analyses. Three interrelated bioclimatic variables (i.e. mean annual temperature, temperature of the wettest quarter, and temperature of the driest quarter within the species’ range) were kept as they are usually used to define the essential climatic requirements for mammal species at a global scale [26]. Ultimately, we retained eight ecological variables (range size, mean annual temperature, temperature of the wettest quarter, temperature of the driest quarter, temperature of the warmest quarter, precipitation of the driest quarter, precipitation of the warmest quarter, precipitation of the coldest quarter), and four life-history characteristics (body mass, longevity, gestation length and interbirth interval). A detailed description of all variables is provided in electronic supplementary material, table S1.
2.3. Sampling bias
We checked for the existence of bias in the representation of carrier mammalian species included in our dataset. This kind of bias could arise from an unbalanced sampling effort across mammal species, driven by different aspects such as intrinsic biological traits, geographic location, taxonomy and convenience [31]. We found that the statistical distribution of values for the life-history traits we considered did not differ substantially between mammal species in our sample and all existing terrestrial mammal species, meaning that the composition of our sample was not biased in terms of life-history (electronic supplementary material, figure S3). As for taxonomic representation, we found bats to be overrepresented as general carriers, while carnivores, primates and artiodactyls were slightly overrepresented both as general and zoonotic carriers (figure 1). Instead, rodents were proportionally underrepresented as both general and zoonotic carriers (approx. 30%, compared to them representing approx. 39% of all mammalian species).
To account for the different research efforts in identifying viral associations among different species, we counted the number of virus-related publications of each species. The number of publications was retrieved from Web of Science by querying a string made of the scientific binomial of the carrier species AND the terms ‘virus*’ OR ‘viral’. The search was performed in R using the packages httr [32] and jsonlite [33]. Including this variable in the model allowed us to reduce the effect of differential sampling effort, which might otherwise have influenced our results.
2.4. Viral grouping
To have a more detailed view of the complexity of the mammalian virome, we aggregated viral species (42 families) into coherent groups according to three classification schemes: (i) structural, (ii) evolutionary, (iii) ecological (electronic supplementary material, figure S4). Structural groups were derived from the Baltimore classification [34]. The Baltimore classification groups viruses were based on viral genomes' nature and polarity, splitting viruses into seven classes of replication [35]. All genomic information and replication routes needed for viral grouping were retrieved from ViralZone, an online repository of viral bionformatic data [36]. The evolutionary classification reflected the aggregation of viral families into ten phyla, as represented in the ICTV. As for the ecological classification, we split viral families according to traits that have implications for viral ecology, such as evolvability and interaction with the host and the environment. We chose three binary viral traits recognizable at the family level, and then identified seven unique mutually exclusive ecological groups. The selected traits accounted for (i) genomic organization (i.e. linear or circular), (ii) genomic partition (i.e. monopartite or segmented), (iii) pericapsidic envelop (i.e. present or absent). These viral characteristics were chosen as they proved to be relatively important predictors of virus host-range in previous work [37], and were thus able to capture meaningful elements of the interaction between viruses and their carriers. Family-specific viral traits were obtained from ViralZone and Wardeh et al. [36]. After grouping viral families together according to the three classification schemes, we identified 7 structural groups, 10 evolutionary groups and 5 ecological groups (totalling 22 groups) (electronic supplementary material, figure S4). We ran separate models for each group, to predict the probability of mammal species to be carriers of viruses in the group. We called these the ‘general models’, as they did not distinguish between zoonotic and non-zoonotic viruses. We then repeated the models after limiting prediction to zoonotic carriers only, thereby defining a set of ‘zoonotic models’. Two viral groups (Preplasmiviricota and Cressdnaviricota) were excluded from zoonotic models due to an insufficient number of carrier species on which to train the models, leaving us with 20 zoonotic models. A complete description of viral grouping is available in the electronic supplementary material (electronic supplementary material, methods).
2.5. Definition of non-carrier status
Our analyses relied on confirmed carrier–virus associations, therefore, they lacked any information regarding associations that do not occur in the wild, as no database of unsusceptible associations currently exists [38]. To tackle this issue, we generated pseudo-negatives (i.e. instances of non-carrier status) as substitutes for null interactions for each viral group in both general and zoonotic models. This pseudo-negative design aimed at mimicking virological sampling through an approach based on a combination of taxonomic and spatial cues. Specifically, if the carrier status of a mammal species was unknown, we considered it to be a pseudo-negative whenever the species simultaneously met two criteria: (i) one or more species in the same mammalian family were found susceptible to the viral group (i.e. had a positive carrier status) and (ii) these susceptible species were in the same biogeographical realm as the focal species. This method allowed us to have enough null associations to include in each model, as well as to partially buffer the effect of uneven sampling effort across carrier families. In this respect, we provide estimates of model accuracy for each one of the families included in the analyses in electronic supplementary material, tables S5–S6.
2.6. Modelling carrier status
We predicted whether a species had carrier status for different viral groups as a function of the above-described variables. We used random forest models for analysis, a powerful machine learning technique capable of modelling complex interactions among predictors [39]. We ran 22 classification models to assess general carrier status and 20 classification models to assess zoonotic carrier status, where each model targets a different viral group. As species tended to be unevenly distributed among the two responses' classes (carrier versus non-carrier), we assigned different weights to each observation to avoid bias in predictions towards the overrepresented class. Hence, in every model, we assigned each observation a weight equal to the number of observations of the opposite class.
We used a 10-fold cross-validation and a tuning grid to train each model and find the optimal combination of two hyperparameters: the number of randomly selected variables available for splitting at each tree's node (mtry) and the minimal number of observations required for a node to be split further (n_min). The 10-fold cross-validation divided the training set (75% of total data) into 10 random samples and iteratively held one out to use it for testing. We repeated the iteration process using different combinations of hyperparameters, then kept the combination that yielded the best performance (quantified by the true skill statistic metric, TSS) to obtain the final model. TSS is calculated as follows:
| 2.1 |
where sensitivity represents the proportion of true positives, i.e. carrier species that are correctly classified as such, and specificity is the proportion of true negatives, i.e. non-carrier species that are correctly classified as such. TSS ranges between −1 and +1. A TSS equal to +1 denotes perfect classification performance while a TSS equal to −1 denotes perfect misclassification [40].
The predictive performance of each final model was then evaluated in two distinct ways. First, we calculated the models' TSS by predicting the carrier status of species in the test set (25% of total data) and comparing the predictions with the observed carrier status. Then, we performed a taxonomic block validation at the family level. This validation method works by iteratively holding out a target group of observations (in this case, each mammalian family) from the training set, to use it for model testing. Families with fewer than ten observations per class were not included in the validation pipeline. In addition to making training and testing sets independent, the block validation allowed us to assess the generalizability of our results across different carrier taxa. Based on the cross-validation performance, we excluded from analysis models with a low TSS value (<0.30).
From the models’ outputs, we extracted the estimate of the importance of each variable in predicting carrier status for the different viral groups. We quantified variable importance as the mean decrease accuracy of the model, i.e. the decrement of the accuracy of model predictions caused by the variable's permutation, averaged across all trees. A larger value indicated a relatively more important predictor, as its exclusion led to a bigger loss of model accuracy. Lastly, the variables' importance scores (x) obtained from the models were rescaled within classification schemes to allow comparison between viral groups, according to:
| 2.2 |
We used partial dependence plots to assess the relationship between the predicted probability of a species being a general/zoonotic carrier and selected species’ characteristics, according to each model. To obtain the partial dependence on a selected variable, the outcome of the model was marginalized over the distribution of the remaining variables, so that the resulting function depended only on the targeted predictor [41]. Then, the functions were plotted through locally estimated scatterplot smoothing (LOESS) curves, to display the average trend of carrier status probability given different values of the explanatory trait. We used the R package ranger [42] and the R suite tidymodels to run and validate the models [43].
3. Results
3.1. Model accuracy
We used a set of life-history, ecological, and bioclimatic covariates to assess carrier status of 1350 wild mammalian species, totalling 9795 general carrier–virus associations and 3501 zoonotic carrier-virus associations (figure 1, electronic supplementary material, figures S1–S2). We ran 22 general models (including all viruses) and 20 zoonotic models (including just zoonotic viruses) to identify how mammalian species traits affect the probability of being carrier species of different structural, evolutionary and ecological viral groups. Eight models (five general and three zoonotic) were excluded from further interpretation due to low performance (TSS < 0.30) during cross-validation.
General and zoonotic models showed moderate accuracy during cross-validation, with a TSS of 0.42 (range: 0.31–0.53) and 0.46 (range: 0.32–0.74) respectively. As expected, when assessed with a taxonomic block validation, general and zoonotic models showed a lower average accuracy of 0.22 (mean range: 0.04–0.39) and 0.19 (mean range: 0.00–0.40), respectively. Some carrier families exhibited noticeably low performance during taxonomic block-validation, compared to cross-validation, suggesting that certain family-specific patterns have not been effectively captured by our models. To test whether filtering models based on the taxonomic block validation would have affected the patterns found in this work, we ran a sensitivity test on a subset of our dataset (Baltimore class I). Importantly, our test demonstrates that the shapes of relationships between carrier status and key life-history traits do not change substantially when using the more stringent inclusion criteria (taxonomic block-validation threshold: TSS > 0.3) (electronic supplementary material, figure S5). Model performance metrics extracted from both cross-validation and block-validation processes are available in electronic supplementary material, tables S3–S6.
3.2. Characteristics of general carriers
After accounting for research effort, body mass was the most important predictor of general carrier status across most viral groups in the three classification schemes. The next most important predictors were indicators of species' phylogenetic position (i.e. phylogenetic eigenvectors), followed by species’ longevity and interbirth interval (figure 2).
Figure 2.
Mean variable importance of the mammalian traits used to assess (a) general and (b) zoonotic carrier status within ecological, evolutionary, and structural viral classification schemes. The importance of each variable is quantified as the mean decrease accuracy of models' predictions caused by variable random permutation. Mammalian traits are further classified visually by ecology, life history, phylogeny and taxonomy. Variable importance scores outputted by each model, for each viral group, were rescaled between 0 and 1. Here we only show the most important variables across the various schemes (see electronic supplementary material, figures S5–S6 for a full representation of all predictors). For representation purposes, we excluded research effort (quantified as the number of virus-related citations) which we used to account for sampling bias in our dataset.
The relationships between mammals' life-history traits and carrier status were largely consistent across viral groups, with few exceptions. The relationship between body mass and the probability of being a carrier was typically positive (figure 3), with a rapid increase in probability for species larger than 0.3–3 kg depending on the viral group considered (electronic supplementary material, figure S8). The relationship appeared stronger with viruses in the 4th ecological group (non-enveloped viruses with a linear segmented genome), the 3rd Baltimore class (dsRNA viruses) and the phyla Duplornaviricota, Peploviricota and Nucleocytoviricota. Species’ longevity showed an idiosyncratic effect on carrier status, with species on both extremes of mammals' lifespan range more likely to be predicted as carriers of several viral groups (figure 3). This average trend was mostly driven by the negative response of Cressdnaviricota carriers as well as the bimodal relationship between longevity and carrier status for the 3rd ecological group (enveloped viruses with a linear non-segmented genome), the 2nd Baltimore class (ssDNA viruses), and the phylum Cossaviricota (electronic supplementary material, figure S9). In the remaining groups, long-lived species were associated with a higher probability of being carriers. As for interbirth interval, its effect on carrier status probability was generally positive (figure 3), although the 6th (enveloped viruses with a linear segmented genome) and 1st (non-enveloped viruses with a linear non-segmented genome) ecological groups, and the phylum Cressdnaviricota showed an opposite (negative) trend (electronic supplementary material, figure S10).
Figure 3.
Mean variable importance of the mammalian traits used to assess (a) general and (b) zoonotic carrier status within ecological, evolutionary and structural viral classification schemes. The importance of each variable is quantified as the mean decrease accuracy of models’ predictions caused by variable random permutation. Mammalian traits are further classified visually by ecology, life history, phylogeny and taxonomy. Variable importance scores outputted by each model, for each viral group, were rescaled between 0 and 1. Here we only show the most important variables across the various schemes (see electronic supplementary material, figures S5–S6 for a full representation of all predictors). For representation purposes, we excluded research effort (quantified as the number of virus-related citations) which we used to account for sampling bias in our dataset.
3.3. Characteristics of zoonotic viral carriers
Once research effort was accounted for, zoonotic carrier status for most viral groups was best predicted by two of the life history traits already identified as drivers of general carrier status—longevity and interbirth interval—together with species’ phylogenetic position (figure 2). In most viral groups, the relationship between life-history traits and zoonotic carrier status showed the same qualitative trends as in general carriers, but with a comparatively stronger quantitative effect. This pattern was evident in the average relationship between body mass and zoonotic carrier status, which showed more accentuated upward trends compared to that of general carriers (figure 3). Similar to general carrier models, longevity showed an idiosyncratic relationship with zoonotic carrier status probability. The relationship between interbirth interval and zoonotic carrier status was also similar to that of general carriers, with more likely reservoirs falling on both extreme ends of the variable's range. This pattern was consistent across most viral groups, with the exceptions of the 1st (non-enveloped viruses with a linear non-segmented genome) and the 6th (enveloped viruses with a linear segmented genome) ecological groups where the effect was negative, and the 3rd ecological group (non-enveloped viruses with a circular non-segmented genome) where the effect was strongly positive (electronic supplementary material, figure S10).
4. Discussion
Despite high scientific interest in the ecological and evolutionary drivers of mammal–virus associations, contrasting evidence exists on the characteristics that make wild mammals more subject to acquiring and transmitting viruses. Once research effort and phylogeny are accounted for, our results show that the probability of mammals' carrier status for different viral groups is best predicted by species’ body mass, longevity, and interbirth interval, although variable importance in some groups showed a substantial difference from the others. Such differences might be explained by group-specific patterns of associations with mammalian carriers as well as unaccounted for sampling biases in carrier-viral group associations. Our models predicted large-sized mammals on both extreme ends of the fast-slow continuum of life-history as being more likely to carry viruses. These results suggest that although mammals on opposite ends of the fast-slow continuum have likely evolved diverging immunological strategies, viruses are able to exploit both strategies for replication and spread. This means that the peculiarities of each viral group may drive patterns of ‘carrier preference' reflecting viruses' replication speed, adaptability and possibly evolutionary history.
Mammalian species with larger body sizes had higher carrier probability for most general and zoonotic groups. Evidence from comparative studies in the field of eco-immunology suggests that elements of immunity [44], and pathogenesis [45] scale with body size. It has been hypothesized that larger species evolved greater tolerance to infection to minimize the costs of infection in favour of survival, which may, in turn, make them more capable of being viral carriers. Still, the mechanisms driving the relationship between body mass and immunity phenotypes remain uncertain and their effect on virus suitability and tolerance are open to question [46].
Additionally, large-bodied species have higher dispersal abilities and need large home ranges in order to meet their high metabolic requirements [47]. The ability to move farther may play a role in increasing exposure to viral infections, as these species are more likely to come into contact with disease hosts (e.g. primary, secondary or reservoirs) and vectors. Past works have found mammals' body mass to be related to susceptibility to tick-borne encephalitis virus, yellow fever virus, and Zika virus [48], suggesting that large-bodied species might significantly contribute to vector-borne viruses' transmission. Moreover, home range was a key predictor of wildlife hosts of Rift Valley fever virus [49], in agreement with the hypothesis that larger home ranges may enhance exposure to viruses by increasing inter- and intraspecific contacts. Large-sized mammals also consume large amounts of food to meet high energetic requirements [50], which makes them more likely to ingest viruses that rely on indirect routes of transmission via fomites (such as contaminated food sources).
In addition to the aforementioned mechanisms of susceptibility and exposure to viral infection, the larger body mass of zoonotic carriers may be explained by dynamics that underlie zoonotic viral spillover from wildlife. Humans interact more frequently with large wild mammals through several forms of direct exploitation, such as hunting and wildlife trade [51], and hunters typically have a preference towards large-bodied target species [52–54]. Practices such as wildlife hunting and trade may pose a public health risk because of the often-lacking sanitary controls and inadequate animal manipulation [55], which may facilitate viral spillover to people at different stages of the supply chain. Large mammals may also contribute to the emergence of zoonotic viral diseases as amplification hosts (i.e. organisms in which the infectious agent can replicate rapidly and reach high concentrations) [56]. It is possible that large mammals can develop higher viral loads and release greater quantities of virus, increasing the probability of transmission in case of encounter with humans or other species [17]. Traditionally, livestock animals are considered to be amplification hosts for zoonotic viruses [57,58], but evidence of wild amplifiers has been found for Old World monkeys as chikungunya virus hosts in Senegal [59], great apes and forest antelopes as Zaire Ebola virus hosts in West Africa [60], and white-tailed deer as Cache Valley virus hosts in the United States [61].
In both general and zoonotic models, mammals placed on the slower end of the life-history continuum (longer longevity and interbirth intervals) were more likely to have carrier status for relatively slow-evolving viral groups associated with chronic or persistent infection, such as RNA retroviruses (phylum Artverviricota) and dsDNA viruses (phyla Peploviricota, Baltimore classes VII and I). To ensure transmission, most dsDNA viruses and retroviruses are required to remain in the host for extended periods of time, resulting in persistent or chronic infections with low or delayed disease severity and virulence [62]. Because of the extended infectious period, viruses that cause chronic or persistent infection (e.g. Herpesvirus, Lentivirus) are expected to have a greater viral fitness (i.e. the capacity of a virus to produce infectious progeny) in long-lived hosts [15]. Our results sustain this hypothesis, suggesting that viruses which stay in the host for longer got a selective advantage by co-evolving with—and adapting to—slow-lived species which may carry the infection for prolonged periods of time and increase the probability of transmission.
For general and zoonotic carriers of viruses with linear genomic segments and an envelope (i.e. those in our 6th ecological group, such as Arenaviridae, Hantaviridae and Peribunyaviridae), carrier status probability was highest for species with faster life-history as represented by short longevity and interbirth interval. These RNA viruses, which include Lassa fever virus and Sin Nombre virus, are considered relatively fast-evolving due to rapid nucleotide substitution rate and ability to reassort homologous genome segments [62,63]. Our findings suggest that such evolutionary mechanisms may be further favoured in fast reproducing carriers, where population density and generation overlap may increase viral fitness due to the large availability of naïve individuals. Still, we cannot fully exclude that these results were driven by taxa representation in the training set, as bats, moles, shrews and especially rodents—that are considered the main natural hosts of these viruses—display shorter lifespans and faster life-histories. To further explore the adaptive links between viral evolvability and host traits it will be necessary to address critical data gaps on host–virus interactions which currently limit our ability to make inference.
It is important to consider that our study has limitations, associated with the available data on mammal-virus associations. We defined mammal-virus associations as the detection of a virus in a given mammalian species via PCR, isolation or serology. These detection methods do not necessarily imply that the species is an actual viral reservoir, rather, they inform us that the species is susceptible to infection. Thus, it is likely that we included different levels of susceptibility within the term ‘carrier’, such as: reservoir, natural host, and incidental host. Additionally, we recognize that sampling bias is present in the data, as the observed patterns of viral distribution across host taxa are not an accurate representation of the—largely unknown—mammalian virome [64]. Such limitations affect any study investigating general host-pathogen associations from a limited number of observations, as even well-known taxa of zoonotic carriers (such as rodents) remain understudied. For example, due to the underrepresentation of rodents as carrier species, our models might have not detected patterns that involve combinations of life-history traits typical of rodents, such as small body mass, short lifespan, and high offspring production. By working above the viral species level using dichotomous outcomes (carrier status), and accounting for species' virus-related citations (i.e. reporting effort), we mitigated the impact of such bias.
In this work, we identified common patterns along the fast-slow continuum of mammalian viral carriers, highlighting that mammals' associations with viruses are generalizable only to a certain extent. The eco-evolutionary peculiarities of each viral group drive patterns of compatibility between mammals and viruses, as we partially captured by separately assessing different eco-evolutionary groups of viruses. By including multiple sides of viral diversity, we were able to pick up on trends that went undetected in previous studies where pathogen richness was analysed as a whole [4,65]. We found that larger mammals are carriers for several viral groups, but both fast- and slow-living species are exploited by multiple viral groups with different ecological, evolutionary and structural features. General and zoonotic carriers' profiles did not differ substantially in terms of life-histories, providing new evidence in support of the hypothesis that there are no intrinsic traits that disproportionately affect mammals' susceptibility to zoonotic viruses [66]. We argue that underestimating viruses' functional diversity may lead to neglecting potentially important sources of zoonoses, with the risk of impairing hazard assessments and outbreak preparedness. Using the insights provided by life-history theory may enhance surveillance strategies in areas where species with a higher proneness for carrying viruses are more abundant.
Contributor Information
Andrea Tonelli, Email: andrea.tonelli@uniroma1.it.
Moreno Di Marco, Email: moreno.dimarco@uniroma1.it.
Ethics
This work did not require ethical approval from a human subject or animal welfare committee.
Data accessibility
No new data were created in this study.
Code to reproduce the study: https://doi.org/10.13133/UNIROMA1/VXMHCX.
The data are provided in electronic supplementary material [67].
Declaration of AI use
We have not used AI-assisted technologies in creating this article.
Authors' contributions
A.T.: conceptualization, data curation, formal analysis, methodology, writing—original draft; H.C.: conceptualization, methodology, supervision, writing—review and editing; M.S.C.B.: resources, writing—review and editing; M.W.: resources, writing—review and editing; M.DM.: conceptualization, data curation, methodology, project administration, supervision, writing—original draft.
All authors gave final approval for publication and agreed to be held accountable for the work performed therein.
Conflict of interest declaration
The authors declare no competing interests.
Funding
This research was supported by EU funding within the NextGeneration EU-MUR PNRR Extended Partnership initiative on Emerging Infectious Diseases (Project no. PE00000007, INF-ACT Spoke4).
References
- 1.Taylor LH, Latham SM, Woolhouse MEJ. 2001. Risk factors for human disease emergence. Phil. Trans. R. Soc. Lond. B 356, 983-989. ( 10.1098/rstb.2001.0888) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Carroll D, Daszak P, Wolfe ND, Gao GF, Morel CM, Morzaria S, Pablos-Méndez A, Tomori O, Mazet JAK. 2018. The global virome project. Science 359, 872-874. ( 10.1126/science.aap7463) [DOI] [PubMed] [Google Scholar]
- 3.Carlson CJ, Zipfel CM, Garnier R, Bansal S. 2019. Global estimates of mammalian viral diversity accounting for host sharing. Nat Ecol Evol 3, 1070-1075. ( 10.1038/s41559-019-0910-6) [DOI] [PubMed] [Google Scholar]
- 4.Olival KJ, Hosseini PR, Zambrana-Torrelio C, Ross N, Bogich TL, Daszak P. 2017. Host and viral traits predict zoonotic spillover from mammals. Nature 546, 646-650. ( 10.1038/nature22975) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Han BA, Kramer AM, Drake JM. 2016. Global patterns of zoonotic disease in mammals. Trends Parasitol. 32, 565-577. ( 10.1016/j.pt.2016.04.007) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Johnson CK, Hitchens PL, Pandit PS, Rushmore J, Evans TS, Young CCW, Doyle MM. 2020. Global shifts in mammalian population trends reveal key predictors of virus spillover risk. Proc. R. Soc. B 287, 20192736. ( 10.1098/rspb.2019.2736) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Albery GF, Becker DJ. 2021. Fast-lived Hosts and Zoonotic Risk. Trends Parasitol. 37, 117-129. ( 10.1016/j.pt.2020.10.012) [DOI] [PubMed] [Google Scholar]
- 8.Lee KA. 2006. Linking immune defenses and life history at the levels of the individual and the species. Integr. Comp. Biol. 46, 1000-1015. ( 10.1093/icb/icl049) [DOI] [PubMed] [Google Scholar]
- 9.Previtali MA, Ostfeld RS, Keesing F, Jolles AE, Hanselmann R, Martin LB. 2012. Relationship between pace of life and immune responses in wild rodents. Oikos 121, 1483-1492. ( 10.1111/j.1600-0706.2012.020215.x) [DOI] [Google Scholar]
- 10.Johnson PTJ, Rohr JR, Hoverman JT, Kellermanns E, Bowerman J, Lunde KB. 2012. Living fast and dying of infection: Host life history drives interspecific variation in infection and disease risk. Ecol. Lett. 15, 235-242. ( 10.1111/j.1461-0248.2011.01730.x) [DOI] [PubMed] [Google Scholar]
- 11.Plourde BT, Burgess TL, Eskew EA, Roth TM, Stephenson N, Foley JE. 2017. Are disease reservoirs special? Taxonomic and life history characteristics. PLoS ONE 12, e0180716. ( 10.1371/journal.pone.0180716) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Han BA, Schmidt JP, Bowden SE, Drake JM. 2015. Rodent reservoirs of future zoonotic diseases. Proc. Natl Acad. Sci. USA 112, 7039-7044. ( 10.1073/pnas.1501598112) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Worsley-Tonks KEL, Escobar LE, Biek R, Castaneda-Guzman M, Craft ME, Streicker DG, White LA, Fountain-Jones NM. 2020. Using host traits to predict reservoir host species of rabies virus. PLoS Negl. Trop. Dis. 14, e0008940. ( 10.1371/journal.pntd.0008940) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Guy C, Ratcliffe JM, Mideo N. 2020. The influence of bat ecology on viral diversity and reservoir status. Ecol. Evol. 10, 5748-5758. ( 10.1002/ece3.6315) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Luis AD, et al. 2013. A comparison of bats and rodents as reservoirs of zoonotic viruses: Are bats special? Proc. R. Soc. B 280, 20122753. ( 10.1098/rspb.2012.2753) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Plowright RK, Becker DJ, Crowley DE, Washburne AD, Huang T, Nameer PO, Gurley ES, Han BA. 2019. Prioritizing surveillance of nipah virus in India. PLoS Negl. Trop. Dis. 13, e0007393. ( 10.1371/journal.pntd.0007393) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Schmidt JP, Maher S, Drake JM, Huang T, Farrell MJ, Han BA. 2019. Ecological indicators of mammal exposure to Ebolavirus. Phil. Trans. R. Soc. B 374, 20180337. ( 10.1098/rstb.2018.0337) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Walsh MG, Hossain S. 2020. Population structure and diet generalism define a preliminary ecological profile of zoonotic virus hosts in the Western Ghats, India. Epidemics 33, 100416. ( 10.1016/j.epidem.2020.100416) [DOI] [PubMed] [Google Scholar]
- 19.Bielby J, Mace GM, Bininda-Emonds ORP, Cardillo M, Gittleman JL, Jones KE, Orme CDL, Purvis A. 2007. The fast-slow continuum in mammalian life history: An empirical reevaluation. Am. Nat. 169, 748-757. ( 10.1086/516847) [DOI] [PubMed] [Google Scholar]
- 20.Carlson CJ, et al. 2022. The Global Virome in One Network (VIRION): an Atlas of Vertebrate-Virus Associations. mBio 13, e0298521. ( 10.1128/mbio.02985-21) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Scherf BD. 2000. World watch list for domestic animal diversity, 3rd edn. Rome, Italy: Food and Agriculture Organization of the United Nations. [Google Scholar]
- 22.European Parliament Council of the European Union. 2010. Directive 2010/63/EU of the European Parliament and of the Council of 22 September 2010 on the protection of animals used for scientific purposes Text with EEA relevance.
- 23.Garber J, et al. 2011. Guide for the care and Use of laboratory animals, 8th edn. Washington, DC: The National Academy Press. [Google Scholar]
- 24.Becker DJ, Seifert SN, Carlson CJ. 2020. Beyond Infection: Integrating Competence into Reservoir Host Prediction. Trends Ecol. Evol. 35, 1062-1065. ( 10.1016/j.tree.2020.08.014) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Soria CD, Pacifici M, Di Marco M, Stephen SM, Rondinini C. 2021. COMBINE: a coalesced mammal database of intrinsic and extrinsic traits. Ecology 102, e03344. ( 10.1002/ecy.3344) [DOI] [PubMed] [Google Scholar]
- 26.Visconti P, et al. 2016. Projecting Global Biodiversity Indicators under Future Development Scenarios. Conserv Lett. 9, 5-13. ( 10.1111/conl.12159) [DOI] [Google Scholar]
- 27.Fick SE, Hijmans RJ. 2017. WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. Int. J. Climatol. 37, 4302-4315. ( 10.1002/joc.5086) [DOI] [Google Scholar]
- 28.2021 IUCN. 2021. The IUCN Red List of Threatened Species. Version 2021-3. See https://www.iucnredlist.org. Accessed on [21/10/2021].
- 29.Faurby S, Davis M, Pedersen R, Schowanek SD, Antonelli A, Svenning JC. 2018. PHYLACINE 1.2: The Phylogenetic Atlas of Mammal Macroecology. Ecology. 99, 2626. ( 10.1002/ecy.2443) [DOI] [PubMed] [Google Scholar]
- 30.Naimi B. 2015. Package ‘usdm’. R Topics Document
- 31.Albery GF, Carlson CJ, Cohen LE, Eskew EA, Gibb R, Ryan SJ, Sweeny AR, Becker DJ. 2022. Urban-adapted mammal species have more known pathogens. Nat. Ecol. Evol. 6, 794-801. ( 10.1038/s41559-022-01723-0) [DOI] [PubMed] [Google Scholar]
- 32.Wickham H, Wickham MH. 2022. Package ‘httr’. R Topics Document
- 33.Ooms J. 2014. The jsonlite package: A practical and consistent mapping between json data and r objects. arXiv preprint arXiv:1403.2805
- 34.Baltimore D. 1971. Expression of animal virus genomes. Bacteriol. Rev 35, 235-241. ( 10.1128/br.35.3.235-241.1971) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Koonin EV, Krupovic M, Agol VI. 2021. The Baltimore Classification of Viruses 50 Years Later: How Does It Stand in the Light of Virus Evolution? Microbiol. Mol. Biol. Rev. 85, 10-1128. ( 10.1128/mmbr.00053-21) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.2022 ViralZone. 2022. SIB Swiss Institute of Bioinformatics. See https://viralzone.expasy.org. (accessed on [28/11/2021]).
- 37.Wardeh M, Blagrove MSC, Sharkey KJ, Baylis M. 2021. Divide-and-conquer: machine-learning integrates mammalian and viral traits with network features to predict virus-mammal associations. Nat. Commun. 12, 3954. ( 10.1038/s41467-021-24085-w) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Blagrove MSC, Baylis M, Wardeh M. 2022. Reply to: Machine-learning prediction of hosts of novel coronaviruses requires caution as it may affect wildlife conservation. Nat. Commun. 13, 1-3. ( 10.1038/s41467-022-32747-6) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Cutler DR, Edwards TC Jr, Beard KH, Cutler A, Hess KT, Gibson J, Lawler JJ. 2007. Random forests for classification in ecology. Ecology 88, 2783-2792. ( 10.1890/07-0539.1) [DOI] [PubMed] [Google Scholar]
- 40.Allouche O, Tsoar A, Kadmon R. 2006. Assessing the accuracy of species distribution models: Prevalence, kappa and the true skill statistic (TSS). Journal of Applied Ecology 43, 1223-1232. ( 10.1111/j.1365-2664.2006.01214.x) [DOI] [Google Scholar]
- 41.Molnar C. 2022. Interpretable machine learning: A guide for making black Box models explainable, 2nd edn. Victoria, Canada: Leanpub. [Google Scholar]
- 42.Wright MN, Wager S, Probst P. 2020. A Fast Implementation of Random Forests' Package ‘ranger’. CRAN Repository
- 43.Kuhn M. 2020. Package ‘tidymodels’: Easily Install and Load the ‘Tidymodels’ Packages. CRAN Repository
- 44.Downs CJ, Dochtermann NA, Ball R, Klasing KC, Martin LB. 2020. The effects of body mass on immune cell concentrations of mammals. Am. Nat. 195, 107-114. ( 10.1086/706235) [DOI] [PubMed] [Google Scholar]
- 45.Cable JM, Enquist BJ, Moses ME. 2007. The allometry of host-pathogen interactions. PLoS One 2, e1130. ( 10.1371/journal.pone.0001130) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Downs CJ, Schoenle LA, Han BA, Harrison JF, Martin LB. 2019. Scaling of Host Competence. Trends Parasitol. 35, 182-192. ( 10.1016/j.pt.2018.12.002) [DOI] [PubMed] [Google Scholar]
- 47.Jetz W, Carbone C, Fulford J, Brown JH. 2004. The scaling of animal space use. Science 306, 266-268. ( 10.1126/science.1102138) [DOI] [PubMed] [Google Scholar]
- 48.Pandit PS, Doyle MM, Smart KM, Young CCW, Drape GW, Johnson CK. 2018. Predicting wildlife reservoirs and global vulnerability to zoonotic Flaviviruses. Nat. Commun. 9, 5425. ( 10.1038/s41467-018-07896-2) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Walsh MG, Mor SM. 2018. Interspecific network centrality, host range and early-life development are associated with wildlife hosts of Rift Valley fever virus. Transbound Emerg Dis 65, 1568-1575. ( 10.1111/tbed.12903) [DOI] [PubMed] [Google Scholar]
- 50.Nagy K. 2021. Food requirements of wild animals: Predictive equations for free-living mammals, reptiles, and birds.
- 51.Scheffers BR, Oliveira BF, Lamb I, Edwards DP. 2019. Global wildlife trade across the tree of life. Science 366, 71-76. ( 10.1126/science.aav5327) [DOI] [PubMed] [Google Scholar]
- 52.Jerozolimski A, Peres CA. 2003. Bringing home the biggest bacon: A cross-site analysis of the structure of hunter-kill profiles in Neotropical forests. Biol Conserv 111, 415-425. ( 10.1016/S0006-3207(02)00310-5) [DOI] [Google Scholar]
- 53.Rowcliffe JM, de Merode E, Cowlishaw G. 2004. Do wildlife laws work? Species protection and the application of a prey choice model to poaching decisions. Proc. R. Soc. B 271, 2631-2636. ( 10.1098/rspb.2004.2915) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Johnson PJ, Kansky R, Loveridge AJ, Macdonald DW. 2010. Size, rarity and charisma: Valuing African wildlife trophies. PLoS One 5, e12866. ( 10.1371/journal.pone.0012866) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Bezerra-Santos MA, Mendoza-Roldan JA, Thompson RCA, Dantas-Torres F, Otranto D. 2021. Illegal Wildlife Trade: A Gateway to Zoonotic Infectious Diseases. Trends Parasitol. 37, 181-184. ( 10.1016/j.pt.2020.12.005) [DOI] [PubMed] [Google Scholar]
- 56.Weaver SC. 2005. Host range, amplification and arboviral disease emergence. Arch. Virol. Suppl. 19, 33-44. ( 10.1007/3-211-29981-5_4) [DOI] [PubMed] [Google Scholar]
- 57.Rohr JR, et al. 2019. Emerging human infectious diseases and the links to global food production. Nat Sustain 2, 445-456. ( 10.1038/s41893-019-0293-3) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Karesh WB, et al. 2012. Ecology of zoonoses: natural and unnatural histories. The Lancet 380, 1936-1945. ( 10.1016/S0140-6736(12)61678-X) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Althouse BM, et al. 2018. Role of monkeys in the sylvatic cycle of chikungunya virus in Senegal. Nat. Commun. 9, 1046. ( 10.1038/s41467-018-03332-7) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Heeney JL. 2015. Ebola: Hidden reservoirs. Nature. 527, 453-455. ( 10.1038/527453a) [DOI] [PubMed] [Google Scholar]
- 61.Blackmore CGM, Grimstad PR. 1998. Cache Valley and Potosi viruses (Bunyaviridae) in white-tailed deer (Odocoileus virginianus): Experimental infections and antibody prevalence in natural populations. American Journal of Tropical Medicine and Hygiene 59, 704-709. ( 10.4269/ajtmh.1998.59.704) [DOI] [PubMed] [Google Scholar]
- 62.Holmes EC. 2009. The evolution and emergence of RNA viruses. Oxford, UK: Oxford University Press. [Google Scholar]
- 63.Ramsden C, Melo FL, Figueiredo LM, Holmes EC, Zanotto PMA, Consortium V. 2008. High rates of molecular evolution in hantaviruses. Mol. Biol. Evol. 25, 1488-1492. ( 10.1093/molbev/msn093) [DOI] [PubMed] [Google Scholar]
- 64.Gibb R, Albery GF, Mollentze N, Eskew EA, Brierley L, Ryan SJ, Seifert SN, Carlson CJ. 2022. Mammal virus diversity estimates are unstable due to accelerating discovery effort. Biol. Lett. 18, 20210427. ( 10.1098/rsbl.2021.0427) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Choo J, Nghiem LTP, Benítez-López A, Carrasco LR. 2023. Range area and the fast–slow continuum of life history traits predict pathogen richness in wild mammals. Sci. Rep. 13, 20191. ( 10.1038/s41598-023-47448-3) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Mollentze N, Streicker DG. 2020. Viral zoonotic risk is homogenous among taxonomic orders of mammalian and avian reservoir hosts. Proc. Natl Acad. Sci. USA 117, 9423-9430. ( 10.1073/pnas.1919176117) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Tonelli A, Caceres-Escobar H, Blagrove MSC, Wardeh M, Di Marco M. 2024. Identifying life-history patterns along the fast-slow continuum of mammalian viral carriers. Figshare. ( 10.6084/m9.figshare.c.7355106) [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
No new data were created in this study.
Code to reproduce the study: https://doi.org/10.13133/UNIROMA1/VXMHCX.
The data are provided in electronic supplementary material [67].



