Skip to main content
Proceedings of the Royal Society B: Biological Sciences logoLink to Proceedings of the Royal Society B: Biological Sciences
. 2022 Jan 12;289(1966):20212358. doi: 10.1098/rspb.2021.2358

The pitfalls of inferring virus–virus interactions from co-detection prevalence data: application to influenza and SARS-CoV-2

Matthieu Domenech de Cellès 1,, Elizabeth Goult 1, Jean-Sebastien Casalegno 2,3, Sarah C Kramer 1
PMCID: PMC8753173  PMID: 35016540

Abstract

There is growing experimental evidence that many respiratory viruses—including influenza and SARS-CoV-2—can interact, such that their epidemiological dynamics may not be independent. To assess these interactions, standard statistical tests of independence suggest that the prevalence ratio—defined as the ratio of co-infection prevalence to the product of single-infection prevalences—should equal unity for non-interacting pathogens. As a result, earlier epidemiological studies aimed to estimate the prevalence ratio from co-detection prevalence data, under the assumption that deviations from unity implied interaction. To examine the validity of this assumption, we designed a simulation study that built on a broadly applicable epidemiological model of co-circulation of two emerging or seasonal respiratory viruses. By focusing on the pair influenza–SARS-CoV-2, we first demonstrate that the prevalence ratio systematically underestimates the strength of interaction, and can even misclassify antagonistic or synergistic interactions that persist after clearance of infection. In a global sensitivity analysis, we further identify properties of viral infection—such as a high reproduction number or a short infectious period—that blur the interaction inferred from the prevalence ratio. Altogether, our results suggest that ecological or epidemiological studies based on co-detection prevalence data provide a poor guide to assess interactions among respiratory viruses.

Keywords: SARS-CoV-2, COVID-19, influenza, virus–virus interaction, mathematical modelling

1. Introduction

The pandemic of coronavirus disease 2019 (COVID-19), caused by the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has emphasized the persistent threat posed by respiratory viruses. In addition to SARS-CoV-2, other major respiratory viruses like influenza and the respiratory syncytial virus (RSV) cause a substantial burden every year, estimated at 78 million cases of lower respiratory infections and 130 000 associated deaths worldwide in 2016 [1]. As evidenced by the current and past pandemics, the large host range of respiratory viruses—and the correspondingly high risk of spillover from animals into humans—also makes them prime candidates for emergence of currently unknown ‘diseases X’ [2]. Interaction—here broadly defined as the ability of one pathogen to affect infection or disease caused by another pathogen—is an intriguing yet understudied aspect of respiratory viruses' biology [3]. Although different nomenclatures have been proposed [4], such interactions can be classified according to their sign, either positive (synonymously, synergistic or facilitatory) or negative (synonymously, antagonistic or competitive). According to experimental evidence, various biological mechanisms exist which make either sign a priori plausible [4]. Examples include, in the case of positive interactions, upregulation of viral target receptors [5] or cell fusion [6]; and, in the case of negative interactions, blocking of viral replication caused by the interferon response [7,8]. Intriguingly, different respiratory viruses may have opposing effects on COVID-19, e.g., rhinoviruses can inhibit SARS-CoV-2 infection via the interferon response [8], while influenza A viruses can facilitate it via upregulation of ACE2, the cognate receptor of SARS-CoV-2 in human cells [5,9]. SARS-CoV-2 interactions may have far-reaching implications for predicting not only the future course of the COVID-19 pandemic, but also the indirect effects of non-COVID-19 vaccines on COVID-19 [10]. Indeed, vaccines against a target pathogen may also indirectly affect non-target pathogens that interact with this target pathogen—an effect expected (by the law of signs) to reduce the non-target pathogen burden in case of positive interactions, and to increase it in case of negative interactions [1113].

Because of their relevance to epidemiology and public health, a natural question is how best to identify and estimate interactions between respiratory viruses. Arguably, challenge studies in animals or humans provide the strongest form of evidence, because they can pinpoint the within-host mechanisms of interaction in a controlled experimental setting. However, such studies remain scarce and, more generally, it is not easy to predict their consequences at the scale of human populations [14]. Hence, epidemiological studies—ideally informed by experimental evidence to narrow the search range of interacting pathogens—remain indispensable to assess interactions, but it is unclear whether methods commonly used in such studies are well-suited to this task.

In particular, recent studies of SARS-CoV-2 interactions used a test-negative design [15] to compare the risk of SARS-CoV-2 infection among those infected with another respiratory virus (e.g. influenza) to that among those uninfected [1618]. The underlying idea is conceptually simple: if two (or more) viruses do not interact and circulate independently, then the frequency of co-detection estimated from cross-sectional data should be approximately equal to the product of each virus's detection frequency—conversely, any significant deviation from equality should indicate interaction. However, earlier epidemiological and ecological modelling studies have cautioned against seemingly intuitive statistics of interaction [14,19,20]. In fact, to our knowledge the validity of this study design has not yet been systematically tested for emerging or seasonal respiratory viruses.

In this study, we aimed to determine whether epidemiological studies based on co-detection prevalence data enabled reliable estimation of interactions between respiratory viruses. To do so, we designed a simulation study that built on a general epidemiological model of co-circulation of two emerging or seasonal respiratory viruses. We show that cross-sectional estimates of co-infection prevalence—interpreted either alone or in combination with estimates of single-infection prevalences—provide a poor guide to assess interaction. Hence, we argue that earlier epidemiological studies based on this design should be interpreted with caution and that further longitudinal studies will be needed to elucidate the epidemiological interactions of SARS-CoV-2 with other respiratory viruses.

2. Methods

(a) . Transmission model of viral co-circulation

We developed a deterministic model of circulation of two respiratory viruses, assumed to interact during the infectious period (i.e. the period of transmissible viral infection, denoted by I) or during a transient period following clearance of infection (denoted by T). According to experimental evidence, such interactions can result from an antiviral state caused by non-specific innate immune responses (such as the interferon response), which develop early during infection and can persist for a short period after clearance of infection [7]. By contrast, we did not model long-term interactions (effected, for example, by adaptive cross-immunity), which are less likely for different species of respiratory viruses [7]. The model was similar to that originally proposed by Shrestha et al. [14], with the addition of a latent period (denoted by E) and of a realistic distribution for the infectious period, modelled as a Gamma distribution with shape parameter 2 [21]. The transmission dynamic of each virus was, therefore, represented by a susceptible exposed infected temporary recovered (SEITR) model [22], where S represents susceptible individuals and R recovered individuals. Following Shrestha et al., we used a double index notation to indicate the infection status with respect to each virus, e.g. XSE represents the proportion of individuals susceptible to virus 1 and exposed to virus 2. As we primarily focused on respiratory viruses that cause epidemics lasting a few months, we made the reasonable assumption of a constant, closed population.

The model was defined by a set of 6 × 6 = 36 ordinary differential equations, represented schematically in figure 1. The force of infection of each virus i ∈ {1, 2} was given by:

λ1(t)=R11r0,1γ1p1(t)p1(t)=xΞ[XIax(t)+XIbx(t)]λ2(t)=R21r0,2γ2p2(t)p2(t)=xΞ[XxIa(t)+XxIb(t)],

where Ξ={S,E,Ia,Ib,T,R} is the set of state variables, Ri is the reproduction number of virus i, r0,i the initial fraction immune to virus i, 1/γi the average infectious period of virus i and pi(t) the prevalence of infection with virus i. Importantly, the parameter Ri is best interpreted here as the initial reproduction number in a partially immune population, as opposed to the basic reproduction number (given by Ri/(1 − r0,i) for virus i) in a fully susceptible population [23]. We also defined the prevalence of individuals co-infected (purple compartments in figure 1):

p12(t)=XIaIa(t)+XIaIb(t)+XIbIa(t)+XIbIb(t).

Figure 1.

Figure 1.

Schematic of epidemiological model of viral co-circulation. Individuals infectious with virus 1 are highlighted in blue, with virus 2 in red, and with both viruses in purple. Dashed lines indicate epidemiological transitions affected by interactions. (Online version in colour.)

(b) . Statistic to infer interaction from co-detection prevalence data

Standard statistical tests of independence suggest that the following prevalence ratio (PR):

PR(t)=p12(t)p1(t)×p2(t),

could be used to infer interaction [1618]. Intuitively, a prevalence ratio above unity indicates that the frequency of co-detection is higher than that expected by chance, suggesting that co-infection is facilitated—that is, that the interaction is positive, i.e. synergistic [19,20]. Correspondingly, a prevalence ratio below unity would suggest a negative, or antagonistic interaction. In numerical applications, we calculated the prevalence ratio at the time of peak co-infection prevalence, tmax = argmaxtp12(t) (cf. electronic supplementary material, figure  S1), as we reasoned that empirical studies would have maximal statistical power to detect co-infection at that time point. Nevertheless, this choice is arbitrary, and we considered an alternative calculation in a sensitivity analysis, described below. In the following, we drop the time argument (PR = PR(tmax)) and we simply refer to the prevalence ratio calculated at that time point.

Of note, other definitions of the prevalence ratio are possible and have been used in previous studies. For example, earlier studies of the association between SARS-CoV-2 and influenza compared the fraction of individuals infected with virus 2 among those infected with virus 1 to the fraction infected with virus 2 among those uninfected with virus 1—that is, a test-negative design [1618]. Using the above notations and after some algebra, the corresponding prevalence ratio PR′ equals:

PR=p12/p1(p2p12)/(1p1)=PR1p11PR×p1.

However, this alternative prevalence ratio is no longer symmetric in viruses 1 and 2, which implies an arbitrary choice of virus 1. We, therefore, prefer our formulation, but we point out that the two prevalence ratios are approximately equal for low prevalence of infection with virus 1. Furthermore, it can be shown that PR′ ≥ 1 ⇔ PR ≥ 1, such that the sign of the interaction inferred from either ratio is identical.

(c) . Model parametrization

In numerical applications, we considered the pair influenza (virus 1)–SARS-CoV-2 (virus 2) and we fixed the parameters accordingly (table 1). Specifically, for influenza we assumed an average latent period of 1 day and an average infectious period of 4 days, resulting in an average generation time of 3 days [24,26]. For SARS-CoV-2, we assumed an average latent period of 4 days and an average infectious period of 5 days (average generation time of 6.5 days) [25,27]. The reproduction number of influenza was fixed to 1.3 [23] and that of SARS-CoV-2 to 2.5 [25]. To initialize the model, we assumed that a small fraction XES(0) = e0,1 = 10−3 had been exposed to influenza and XSE(0) = e0,2 = 10−5 to SARS-CoV-2. These initial conditions were chosen to reflect the epidemiological situation in early 2020 in Europe, where influenza was already circulating before the emergence of SARS-CoV-2 [28]. We further assumed that XRS(0) = r0,1 = 40% of individuals were initially immune to influenza [29], and XSR(0) = r0,2 = 0% to SARS-CoV-2. Other individuals were assumed fully susceptible (XSS(0) = 1 − e0,1e0,2r0,1r0,2), and all other compartments were initialized to 0. For simplicity, in the base models we considered only symmetric interactions, that is, the effect of virus 1 on virus 2 was assumed equal to that of virus 2 on virus 1. Furthermore, we assumed that interaction could not change sign over the course of infection, and we therefore tested negative (0.2 ≤ θ(T), θ(I) ≤ 1) and positive (1 ≤ θ(T), θ(I) ≤ 5) interactions separately.

Table 1.

List of model parameters.

parameter meaning fixed value or interval (influenza–SARS-CoV-2 analysis) fixed value or interval (global sensitivity analysis) source/comment
σ11 average latent period of influenza 1 day σ11=σ21=σ1σ1[114]days [24]
σ21 average latent period of SARS-CoV-2 4 days [25]
γ11 average infectious period of influenza 4 days γ11=γ21=γ1γ1[414]days [26]
γ21 average infectious period of SARS-CoV-2 5 days [25,27]
R1 reproduction number of influenza 1.3 R1=R2=RR{1.5,2.0,2.5} [23]
R2 reproduction number of SARS-CoV-2 2.5 [25]
e0,1 initial fraction exposed to influenza 10−3 e0,1 = e0,2 = 10−5 assumption: influenza circulated before SARS-CoV-2 [28]
e0,2 initial fraction exposed to SARS-CoV-2 10−5
r0,1 initial fraction immune to influenza 0.4 r0,1=r0,2=r0r0[0.00.4] [29]
r0,2 initial fraction immune to SARS-CoV-2 0 assumption
δ1=δ11=δ21 average post-infectious period 1–14 days 1–14 days [30]
θ(I)=θ1(I)=θ2(I) strength of interaction during infectious period 0.2–5 1–5 assumption
θ(T)=θ1(T)=θ2(T) strength of interaction during post-infectious period 0.2–5 1–5 assumption

(d) . Simulation protocol

In all scenarios, the model was integrated numerically for a period of 400 days, with state variable values recorded every 0.05 days.

(e) . Sensitivity analyses for influenza and SARS-CoV-2

To verify the robustness of our results, we conducted three sensitivity analyses. First, we considered an alternative prevalence ratio, similarly defined but averaged ±14 days around the time of peak co-infection prevalence. Second, although earlier experimental studies found that influenza can affect SARS-CoV-2 infection [5,9], the effect of SARS-CoV-2 on influenza infection, if any, is currently unknown. Previous experimental studies—e.g. of influenza and RSV [7]—demonstrated the possibility of non-symmetric interactions, where one virus affects the other, but not the other way around. We therefore tested an alternative hypothesis of non-symmetric interactions, for which influenza affected SARS-CoV-2 infection, while SARS-CoV-2 did not affect influenza infection (θ2(I)=θ2(T)=1). Third, we tested a scenario with seasonal transmission (representing, for example, weather-induced changes in virus survival [31,32]), modelled as a multiplicative effect on the transmission rate of each virus. For simplicity and interpretability, the seasonal forcing function was chosen as a sine wave:

s(t)=1+Acosω(tϕ),

where ω = 2π/365 day−1 is the annual angular frequency, A ∈ {0, 0.1, 0.2} the peak relative semi-amplitude and ϕ = 100 days the peak time. This peak time was chosen to approximately coincide with the peak time of co-infection (electronic supplementary material, figure  S1), under the assumption of co-circulation during periods of higher seasonal transmission (e.g. during winter in temperate climates).

(f) . Global sensitivity analyses

To examine more generally the properties of viral infection and interaction that affected the prevalence ratio, we conducted a global sensitivity analysis for a broad range of respiratory viruses [33]. For simplicity, we assumed a fully symmetric model with identical characteristics of the two viruses, and we then proceeded in three steps. First, we used a Latin hypercube design to sample 103 values (over the ranges indicated in table 1, [30]) of the following six parameters: average latent period (1/σ), average infectious period (1/γ), average post-infectious period (1/δ), degree of interaction during the infectious period (θ(I)), degree of interaction during the post-infectious period (θ(T)) and initial fraction immune (r0). Second, we simulated the model and calculated the prevalence ratio for every parameter set. Finally, we used a normal generalized additive regression model (GAM) with log-link to simultaneously estimate the association between the prevalence ratio and every input parameter [34]. For every parameter, the association was modelled using a basis of cubic splines, with a maximum basis dimension of 10. Preliminary analyses indicated that the prevalence ratio was sensitive to the reproduction number, in isolation and in interaction with other parameters. To simplify the regression model, we, therefore, ran the global sensitivity analysis for three different values of the reproduction number (1.5, 2.0 and 2.5). To dissect the association of the prevalence ratio with every input parameter, we fitted the same GAM to the prevalence of co-infection (p12, numerator of prevalence ratio) and to the product of single-infection prevalences (p1p2, denominator of prevalence ratio).

(g) . Numerical implementation

We implemented and simulated all the models using the pomp package [35] in R v. 3.6.3 [36]. For the global sensitivity analysis, we used the mgcv package [34] to fit the GAMs and the ggeffects package [37] to plot the marginal effect of each input parameter. Finally, we used the renv package to keep track of all packages' version and to increase reproducibility [38].

3. Results

(a) . The prevalence ratio correctly identifies the sign, but not the degree, of uniform interactions

We first considered interactions of equal strength during the infectious and post-infectious periods (θ = θ(I) = θ(T))—henceforth referred to as uniform interactions. Example simulations of negative, neutral and positive interactions between influenza and SARS-CoV-2 are plotted in the electronic supplementary material, figure S1. Compared with the no-interaction scenario (peak co-infection prevalence: 0.2%), the peak amplitude of co-infection was lower for negative interaction (0.1%) and higher for positive interaction (1.3%). In all scenarios, however, the time of peak co-infection was approximately identical (range: 77.7–79.7 days). Next, we examined the general relationship between the strength of interaction and the prevalence ratio for different values of the post-infectious period in the range 1–14 days (figure 2). We found that the prevalence ratio equalled 1 for non-interacting viruses and thus permitted correct identification of neutral interactions (θ = 1). For interacting viruses (θ ≠ 1), the prevalence ratio also correctly estimated the sign of the interaction, but systematically underestimated its strength. In addition, the degree of underestimation increased with the strength of interaction and the duration of the post-infectious period. Hence, we found evidence that the prevalence ratio enabled qualitative, but not quantitative, estimation of uniform interactions.

Figure 2.

Figure 2.

Relationship between strength of interaction and prevalence ratio for uniform interactions between influenza and SARS-CoV-2. The scenarios tested correspond to θ=θ1(I)=θ1(T)=θ2(T)=θ2(I) (x-axis), for three different values of the average post-infectious period (1/δ = 1/δ1 = 1/δ2); other parameters were fixed to model the coupled dynamics of influenza and SARS-CoV-2 (cf. table 1). The dashed grey identity line depicts equality between the prevalence ratio and the true strength of interaction (PR = θ). The x- and y-axes are log-transformed. (Online version in colour.)

(b) . Higher interaction post-infection can cause the prevalence ratio to mis-identify non-uniform interactions

Next, we considered the more general case of interactions that differed during the infectious and the post-infectious periods, or non-uniform interactions (θ(I)θ(T)). For these experiments, we assumed an average post-infectious period of 7 days and we tested negative (0.2 ≤ θ(I), θ(T) ≤ 1) and positive (1 ≤ θ(I), θ(T) ≤ 5) interactions separately. Because higher values of θ actually resulted in lower interaction when the true interaction was assumed negative, in the following we define the strength of interaction as 1 − θ for negative interactions and as θ for positive interactions during either the infectious or the post-infectious period. As shown in figure 3, we found that the prevalence ratio was a monotonic function of the strength of interaction during the infectious period, either decreasing for negative interactions or increasing for positive interactions. Hence, in either case stronger interaction during the infectious period helped the prevalence ratio identify the true interaction. By contrast, higher interaction during the post-infectious period blurred the interaction inferred from the prevalence ratio. For weak interaction during infection (0.90 ≤ θ(I) ≤ 1.75, 11% of tested scenarios for negative interactions and 14% for positive interactions), these two opposing effects combined caused the prevalence ratio to mis-identify the sign of interaction in scenarios with strong interaction post-infection. In the other scenarios, the prevalence ratio correctly identified the sign of the interaction, but substantially underestimated its strength (e.g. prevalence ratio of 0.56 for θ(I) = 0.2 and θ(T) = 1, of 1.91 for θ(I) = 5 and θ(T) = 1). These experiments demonstrate that the prevalence ratio is an unreliable measure of interaction between influenza and SARS-CoV-2.

Figure 3.

Figure 3.

Relationship between strength of interaction and prevalence ratio for non-uniform interactions between influenza and SARS-CoV-2. The scenarios tested correspond to θ1(I)=θ2(I)=θ(I) and θ1(T)=θ2(T)=θ(T); other parameters were fixed to model the coupled dynamics of influenza and SARS-CoV-2 (cf. table 1). For negative interactions (top panel), the x-axis represents 1 − θ(I) and the y-axis 1 − θ(T); for positive interactions (bottom panel) θ(I) and θ(T). Hence, in either panel the true strength of interaction increases from left to right and from bottom to top. (Online version in colour.)

(c) . Sensitivity analyses demonstrate the results' robustness for influenza and SARS-CoV-2

In sensitivity analyses, we first verified that our results were robust to an alternative calculation of the prevalence ratio (electronic supplementary material, figure S2). Second, we repeated our analyses for non-symmetric interactions with no effect of SARS-CoV-2 on influenza infection (electronic supplementary material, figure S3). The results were broadly comparable to those for symmetric interactions (figure 3), except that fewer parameter combinations caused the prevalence ratio to mis-identify the sign of interaction (7% of all combinations tested, compared with 13% for symmetric interactions). However, the strength of positive interaction was also more severely underestimated in this scenario (prevalence ratio range: 0.83–1.51, compared with 0.56–1.91 for symmetric interactions). Third, we tested a scenario with seasonal forcing in transmission (electronic supplementary material, figure S4). We found that the prevalence ratio was moderately sensitive to the amplitude of seasonality, with interaction more severely underestimated for higher amplitude. These results remained robust to alternative seasonal peak time values that approximately coincided with the time of peak co-infection.

(d) . Global sensitivity analysis highlights properties of viral infection that obscure or facilitate estimation of interaction

In a global sensitivity analysis of positive interactions (θ(I), θ(T) ≥ 1), we assessed how different properties of viral infection and interaction affected the prevalence ratio. As shown in figure 4, the prevalence ratio decreased with the average latent period, the average post-infectious period, the strength of interaction post-infection, and the initial fraction immune. Hence, these four parameters blurred the interaction inferred from the prevalence ratio. Conversely, the average length of, and the strength of interaction during the infectious period increased with the prevalence ratio and therefore facilitated estimation of the interaction. Of note, higher values of the reproduction number dampened all these variations and decreased the prevalence ratio. To better understand these associations, we estimated the effect of each parameter on the numerator (prevalence of co-infection p12) and the denominator (product of single-infection prevalences p1p2) of the prevalence ratio. As shown in the electronic supplementary material, figure S5, the prevalences of single- and co-infections decreased as the average latent period and the initial fraction immune increased. However, for both parameters co-infection prevalence decreased more rapidly than the product of single-infection prevalences, thereby explaining the overall variation of the prevalence ratio (figure 4). By contrast, the prevalences of single- and co-infections increased with all the other parameters—as expected, since these parameters either intensified (θ(I) and θ(T)) or extended (γ−1 and δ−1) (positive) interaction. The overall effect of these parameters on the prevalence ratio was again explained by their different effects on the prevalence of co-infection and on the product of single-infection prevalences. In summary, these results confirm our earlier experiments on influenza and SARS-CoV-2 and highlight additional factors that make it difficult to interpret the prevalence ratio as a measure of interactions between respiratory viruses.

Figure 4.

Figure 4.

Global sensitivity analysis of prevalence ratio for positive virus–virus interactions. The association between the prevalence ratio and each input parameter was estimated using a GAM with cubic splines (sample size n = 103), for three different values of the initial reproduction number (1.5, 2.0 and 2.5). The corresponding adjusted R-squared equalled 97.4%, 97.5% and 97.9%. Each panel represents an input parameter, with the line (envelope) indicating the estimated (99% confidence interval) prevalence ratio while holding the other parameters fixed at σ−1 = γ−1 = δ−1 = 7 days, θ(I) = θ(T) = 3 and r0 = 0.2. For visual clarity, the x-axis values differ between panels (all periods are in days). (Online version in colour.)

4. Discussion

In this study, we aimed to determine whether the prevalence ratio—defined as the ratio of the prevalence of co-infection to the product of individual infection prevalences—enabled reliable estimation of interactions between respiratory viruses. To do so we designed a simulation study that built on a broadly applicable epidemiological model of co-circulation of two emerging or seasonal respiratory viruses. By focusing on the pair influenza–SARS-CoV-2, we first demonstrated that the prevalence ratio systematically underestimated the strength of interaction, and could even mistake the sign of interactions that persisted after clearance of infection. In a global sensitivity analysis, we further identified properties of viral infection—such as a high reproduction number, a long latent period, or a short infectious period—that blurred the interaction inferred from the prevalence ratio. Our results show that, in the absence of precise information about the timing of interaction, ecological or epidemiological studies designed to estimate the prevalence ratio, or variations thereof, may be unreliable.

With the likely prospect of COVID-19 becoming endemic, there is a pressing need to elucidate the potential interactions of SARS-CoV-2 with other pathogens, in particular respiratory viruses. Thus far, most epidemiological studies of SARS-CoV-2 interaction used simple statistics of co-circulation, such as the prevalence of co-infection, the prevalence ratio, or some variation thereof [1618,39,40]. As we showed here, however, such studies—even those carefully designed to control for various sources of bias like age or co-morbidities—are probably uninformative. Besides the prevalence ratio, we found that the prevalence of co-infection was also an unreliable measure of interaction, as low prevalences (less than or equal to 2.8% in the electronic supplementary material, figure S1, bottom panel) could be consistent with strong, positive interactions. As suggested by our global sensitivity analysis, the deficiencies of statistics based on co-infection prevalence may be even more severe for SARS-CoV-2 infection, characterized by a relatively high reproduction number [25]. In summary, we submit that further epidemiological studies will be needed to elucidate the interactions of SARS-CoV-2 with other respiratory viruses.

More generally, our study adds to the growing body of evidence demonstrating the shortcomings of seemingly intuitive measures of interaction. Using the same model, Shrestha et al. demonstrated the unreliability of phase as an indicator of interaction [14]. Using a susceptible infected susceptible (SIS)-like model of multiple pathogens causing chronic infection, Hamelin et al. showed that the prevalence ratio (as defined in this study) exceeded unity for non-interacting pathogens [20]. By contrast, we found that the prevalence ratio equalled unity for non-interacting pathogens (figure 2). This discrepancy, explained by the different pathogens and modelling assumptions considered in [20], highlights the sensitivity of the prevalence ratio to the characteristics of infection. More generally, it suggests that our results cannot be extrapolated to pathogens not well described by the susceptible infected recovered (SIR)-like model used here. Using a series of SIS and susceptible infected recovered susceptible (SIRS) models, Man et al. examined the properties of the odds ratio, defined as the ratio of the odds of one type in the presence of the other type, relative to the odds of this type in the absence of the other type—a quantity closely related to the alternative prevalence ratio PR′ defined above [19]. They proved that odds ratios exceeding unity could mask negative interactions. Despite differences in the scope of, and the models used in, this study, our results replicate this finding (figure 3). Furthermore, the association between the prevalence ratio and the interaction parameter in our study (figure 2) is comparable to that in [19] (fig. 2a; SIRS direct model). Finally, in a field study to assess interactions between an intestinal pathogen and nematodes in mice (where the true sign of interaction was known from previous experimental evidence), Fenton et al. reported that statistical methods based on cross-sectional data performed poorly and typically estimated the wrong sign of interaction [41]. Our results align with these findings, and we second Fenton et al.'s caution against the use of such methods to study pathogen-pathogen interactions. In summary, our study broadly agrees with previous evidence, and provides new evidence specific to the epidemiology of emerging or seasonal respiratory viruses.

The shortcomings of the prevalence ratio demonstrated here might suggest the need for new statistical methods to estimate interaction from co-detection prevalence data. However, seconding Fenton et al. [41], we propose that methods based on longitudinal data—collected at an appropriately fine time scale—offer a more promising avenue of research. Among those methods, mathematical models of transmission provide a powerful tool to formulate and test biologically explicit mechanisms of interaction, while capturing the underlying, nonlinear dynamics of infection of each pathogen [42]. Robust statistical inference techniques now facilitate fitting these models to epidemiological time series [43,44], as demonstrated by earlier successful applications in the field of pathogen interactions [4548]. Alternatively, advanced regression models have been developed to assess interactions between respiratory viruses [49], but such models may be limited because they lack a mechanistic formulation of interaction. Altogether we propose that empirical or mechanistic models of longitudinal data will be required to study the interactions of SARS-CoV-2 with other respiratory viruses, and more generally the interactions between respiratory pathogens [50].

Our study has four important limitations. First, because we used a deterministic model expressed in proportions, we sidestepped the important issue of statistical uncertainty, caused for example by finite sample size or imperfect measurement of infection prevalences. As the prevalence ratio was found to systematically underestimate the strength of interaction, such uncertainty—inevitable in practice—may further limit the ability of the prevalence ratio to correctly identify interactions. Second, for simplicity we did not include confounding variables (e.g. age) that may also affect estimation of the prevalence ratio. Third, we considered only short-term interactions that rapidly waned after clearance of infection. Although such interactions appear to be the most biologically plausible for different species of respiratory viruses [7,8], long-term interactions resulting from adaptive cross-immunity have been documented and could be relevant to other systems, such as the multiple types or subtypes of influenza [51,52]. Fourth, for simplicity we only modelled interactions that affected susceptibility to infection, because experimental evidence suggests this mechanism predominates among respiratory viruses [7,8]. However, other mechanisms—like changes in the transmissibility or the duration of infection—are biologically likely and could be tested for other classes of pathogens. Acknowledging all these limitations, our simple model could serve as a building block for further research on epidemiological interactions.

In conclusion, our results show that the inherently complex, nonlinear dynamic of emerging and seasonal respiratory viruses makes the interpretation of seemingly intuitive measures of interaction difficult, if not impossible. Despite these pitfalls, other statistical or mathematical methods based on longitudinal data should enable epidemiological research on pathogen interactions. Indeed, with increasing evidence that SARS-CoV-2 and other pathogens do not circulate in isolation but within polymicrobial systems, such research should remain a priority.

Supplementary Material

Acknowledgements

We thank Laura Barrero and Michael Briga for helpful comments on the manuscript.

Data accessibility

All R programming codes are freely available from Edmond, the Open Data Repository of the Max Planck Society: https://doi.org/10.17617/3.8k.

Authors' contributions

M.D.d.C.: conceptualization, formal analysis, methodology, software, visualization, writing—original draft, writing—review and editing; E.G.: conceptualization, methodology, writing—review and editing; J.-S.C.: conceptualization, methodology, writing—review and editing; S.C.K.: conceptualization, methodology, writing—review and editing. All authors gave final approval for publication and agreed to be held accountable for the work performed therein.

Competing interests

M.D.d.C. received postdoctoral funding (2017–2019) from Pfizer and consulting fees from GSK. All other authors declare no competing interests.

Funding

Open access funding provided by the Max Planck Society.

References

  • 1.GBD. 2016 Lower respiratory infections collaborators. 2018. Estimates of the global, regional, and national morbidity, mortality, and aetiologies of lower respiratory infections in 195 countries, 1990–2016: a systematic analysis for the global burden of disease study 2016. Lancet Infect. Dis. 18, 1191-1210. ( 10.1016/S1473-3099(18)30310-4) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.WHO Research and Development Blueprint. 2018. Annual review of diseases prioritized under the research and development blueprint. Geneva, Switzerland: World Health Organization. See https://s.gwdg.de/fUFmoO. [Google Scholar]
  • 3.Griffiths EC, Pedersen AB, Fenton A, Petchey OL. 2011. The nature and consequences of coinfection in humans. J. Infect. 63, 200-206. ( 10.1016/j.jinf.2011.06.005) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.DaPalma T, Doonan BP, Trager NM, Kasman LM. 2010. A systematic approach to virus-virus interactions. Virus Res. 149, 1-9. ( 10.1016/j.virusres.2010.01.002) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bai L, et al. 2021. Coinfection with influenza a virus enhances SARS-CoV-2 infectivity. Cell Res. 31, 395-403. ( 10.1038/s41422-021-00473-1) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Goto H, Ihira H, Morishita K, Tsuchiya M, Ohta K, Yumine N, Tsurudome M, Nishio M. 2016. Enhanced growth of influenza a virus by coinfection with human parainfluenza virus type 2. Med. Microbiol. Immunol. 205, 209-218. ( 10.1007/s00430-015-0441-y) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Chan KF, Carolan LA, Korenkov D, Druce J, McCaw J, Reading PC, Barr IG, Laurie KL. 2018. Investigating viral interference between influenza A virus and human respiratory syncytial virus in a ferret model of infection. J. Infect. Dis. 218, 406-417. ( 10.1093/infdis/jiy184) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Dee K, Goldfarb DM, Haney J, Amat JAR, Herder V, Stewart M, Szemiel AM, Baguelin M, Murcia PR. 2021. Human rhinovirus infection blocks severe acute respiratory syndrome coronavirus 2 replication within the respiratory epithelium: implications for COVID-19 Epidemiology. J. Infect. Dis. 224, 31-38. ( 10.1093/infdis/jiab147) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ziegler CGK, et al. 2020. SARS-CoV-2 receptor ACE2 is an interferon-stimulated gene in human airway epithelial cells and is detected in specific cell subsets across tissues. Cell 181, 1016-1035.e19. ( 10.1016/j.cell.2020.04.035) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Thindwa D, Garcia Quesada M, Liu Y, Bennett J, Cohen C, Knoll MD, Von Gottberg A, Hayford K, Flasche S. 2020. Use of seasonal influenza and pneumococcal polysaccharide vaccines in older adults to reduce COVID-19 mortality. Vaccine 38, 5398-5401. ( 10.1016/j.vaccine.2020.06.047) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Brogden KA, Guthmiller JM, Taylor CE. 2005. Human polymicrobial infections. Lancet 365, 253-255. ( 10.1016/S0140-6736(05)70155-0) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Shann F. 2010. The non-specific effects of vaccines. Arch. Dis. Child. 95, 662-667. ( 10.1136/adc.2009.157537) [DOI] [PubMed] [Google Scholar]
  • 13.Cowling BJ, Fang VJ, Nishiura H, Chan K-H, Ng S, Ip DKM, Chiu SS, Leung GM, Peiris JSM. 2012. Increased risk of noninfluenza respiratory virus infections associated with receipt of inactivated influenza vaccine. Clin. Infect. Dis. 54, 1778-1783. ( 10.1093/cid/cis307) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Shrestha S, King AA, Rohani P. 2011. Statistical inference for multi-pathogen systems. PLoS Comput. Biol. 7, e1002135. ( 10.1371/journal.pcbi.1002135) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Vandenbroucke JP, Pearce N. 2019. Test-negative designs: differences and commonalities with other case-control studies with “other patient” controls. Epidemiology 30, 838-844. ( 10.1097/EDE.0000000000001088) [DOI] [PubMed] [Google Scholar]
  • 16.Stowe J, Tessier E, Zhao H, Guy R, Muller-Pebody B, Zambon M, Andrews N, Ramsay M, Lopez Bernal J. 2021. Interactions between SARS-CoV-2 and influenza, and the impact of coinfection on disease severity: a test-negative design. Int. J. Epidemiol. 50, 1124-1133. ( 10.1093/ije/dyab081) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kim D, Quinn J, Pinsky B, Shah NH, Brown I. 2020. Rates of co-infection between SARS-CoV-2 and other respiratory pathogens. J. Am. Med. Assoc. 323, 2085-2086. ( 10.1001/jama.2020.6266) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Nowak MD, Sordillo EM, Gitman MR, Paniz Mondolfi AE. 2020. Coinfection in SARS-CoV-2 infected patients: where are influenza virus and rhinovirus/enterovirus? J. Med. Virol. 92, 1699-1700. ( 10.1002/jmv.25953) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Man I, Wallinga J, Bogaards JA. 2018. Inferring pathogen type interactions using cross-sectional prevalence data: opportunities and pitfalls for predicting type replacement. Epidemiology 29, 666-674. ( 10.1097/EDE.0000000000000870) [DOI] [PubMed] [Google Scholar]
  • 20.Hamelin FM, et al. 2019. Coinfections by noninteracting pathogens are not independent and require new tests of interaction. PLoS Biol. 17, e3000551. ( 10.1371/journal.pbio.3000551) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Lloyd AL. 2001. Destabilization of epidemic models with the inclusion of realistic distributions of infectious periods. Proc. R. Soc. B 268, 985-993. ( 10.1098/rspb.2001.1599) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Keeling MJ, Rohani P. 2008. Modeling infectious diseases in humans and animals. Princeton, NJ: Princeton University Press. [Google Scholar]
  • 23.Chowell G, Miller MA, Viboud C. 2008. Seasonal influenza in the United States, France, and Australia: transmission and prospects for control. Epidemiol. Infect. 136, 852-864. ( 10.1017/S0950268807009144) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Carrat F, Vergu E, Ferguson NM, Lemaitre M, Cauchemez S, Leach S, Valleron A-J. 2008. Time lines of infection and disease in human influenza: a review of volunteer challenge studies. Am. J. Epidemiol. 167, 775-785. ( 10.1093/aje/kwm375) [DOI] [PubMed] [Google Scholar]
  • 25.Li R, Pei S, Chen B, Song Y, Zhang T, Yang W, Shaman J. 2020. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2). Science 368, 489-493. ( 10.1126/science.abb3221) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Vink MA, Bootsma MCJ, Wallinga J. 2014. Serial intervals of respiratory infectious diseases: a systematic review and analysis. Am. J. Epidemiol. 180, 865-875. ( 10.1093/aje/kwu209) [DOI] [PubMed] [Google Scholar]
  • 27.Bi Q, et al. 2020. Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen, China: a retrospective cohort study. Lancet Infect. Dis. 20, 911-919. ( 10.1016/S1473-3099(20)30287-5) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Melidou A, et al. 2020. Virological surveillance of influenza viruses in the WHO European Region in 2019/20: impact of the COVID-19 pandemic. Euro Surveill. 25, 2001822. ( 10.2807/1560-7917.ES.2020.25.46.2001822) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Kramer SC, Shaman J. 2019. Development and validation of influenza forecasting for 64 temperate and tropical countries. PLoS Comput. Biol. 15, e1006742. ( 10.1371/journal.pcbi.1006742) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Spencer JA, Shutt DP, Moser SK, Clegg H, Wearing HJ, Mukundan H, Manore CA. 2021. Distinguishing viruses responsible for influenza-like illness. medRxiv 2020–02. ( 10.1101/2020.02.04.20020404) [DOI] [PubMed]
  • 31.Lowen AC, Mubareka S, Steel J, Palese P. 2007. Influenza virus transmission is dependent on relative humidity and temperature. PLoS Pathog. 3, 1470-1476. ( 10.1371/journal.ppat.0030151) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Morris DH, et al. 2021. Mechanistic theory predicts the effects of temperature and humidity on inactivation of SARS-CoV-2 and other enveloped viruses. Elife 10, e65902. ( 10.7554/eLife.65902) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Wu J, Dhingra R, Gambhir M, Remais JV. 2013. Sensitivity analysis of infectious disease models: methods, advances and their application. J. R. Soc. Interface 10, 20121018. ( 10.1098/rsif.2012.1018) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Wood SN. 2017. Generalized additive models: an introduction with R, 2nd edn. Boca Raton, FL: CRC Press. [Google Scholar]
  • 35.King AA, et al. 2020 pomp: statistical inference for partially observed Markov processes. R package, version 2.7. See https://kingaa.github.io/pomp/.
  • 36.R Core Team. 2020. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. See https://www.R-project.org/. [Google Scholar]
  • 37.Lüdecke D. 2018. ggeffects: tidy data frames of marginal effects from regression models. J. Open Source Softw. 3, 772. ( 10.21105/joss.00772) [DOI] [Google Scholar]
  • 38.Ushey K. 2021. renv: project environments. See https://CRAN.R-project.org/package=renv.
  • 39.Lansbury L, Lim B, Baskaran V, Lim WS. 2020. Co-infections in people with COVID-19: a systematic review and meta-analysis. J. Infect. 81, 266-275. ( 10.1016/j.jinf.2020.05.046) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Musuuza JS, Watson L, Parmasad V, Putman-Buehler N, Christensen L, Safdar N. 2021. Prevalence and outcomes of co-infection and superinfection with SARS-CoV-2 and other pathogens: a systematic review and meta-analysis. PLoS ONE 16, e0251170. ( 10.1371/journal.pone.0251170) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Fenton A, Knowles SCL, Petchey OL, Pedersen AB. 2014. The reliability of observational approaches for detecting interspecific parasite interactions: comparison with experimental results. Int. J. Parasitol. 44, 437-445. ( 10.1016/j.ijpara.2014.03.001) [DOI] [PubMed] [Google Scholar]
  • 42.Metcalf CJE, Walter KS, Wesolowski A, Buckee CO, Shevliakova E, Tatem AJ, Boos WR, Weinberger DM, Pitzer VE. 2017. Identifying climate drivers of infectious disease dynamics: recent advances and challenges ahead. Proc. R. Soc. B 284, 20170901. ( 10.1098/rspb.2017.0901) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Ionides EL, Nguyen D, Atchadé Y, Stoev S, King AA. 2015. Inference for dynamic and latent variable models via iterated, perturbed Bayes maps. Proc. Natl Acad. Sci. USA 112, 719-724. ( 10.1073/pnas.1410597112) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.King AA, Nguyen D, Ionides EL. 2016. Statistical inference for partially observed markov processes via the R package pomp. J. Stat. Softw. 69, 1-43. ( 10.18637/jss.v069.i12) [DOI] [Google Scholar]
  • 45.Shrestha S, Foxman B, Weinberger DM, Steiner C, Viboud C, Rohani P. 2013. Identifying the interaction between influenza and pneumococcal pneumonia using incidence data. Sci. Transl. Med. 5, 191ra84. ( 10.1126/scitranslmed.3005982) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Opatowski L, Varon E, Dupont C, Temime L, van der Werf S, Gutmann L, Boëlle PY, Watier L, Guillemot D. 2013. Assessing pneumococcal meningitis association with viral respiratory infections and antibiotics: insights from statistical and mathematical models. Proc. R. Soc. B 280, 20130519. ( 10.1098/rspb.2013.0519) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Domenech de Cellès M, Arduin H, Lévy-Bruhl D, Georges S, Souty C, Guillemot D, Watier L, Opatowski L. 2019. Unraveling the seasonal epidemiology of pneumococcus. Proc. Natl Acad. Sci. USA 116, 1802-1807. ( 10.1073/pnas.1812388116) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Noori N, Rohani P. 2019. Quantifying the consequences of measles-induced immune modulation for whooping cough epidemiology. Phil. Trans. R. Soc. B 374, 20180270. ( 10.1098/rstb.2018.0270) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Nickbakhsh S, et al. 2019. Virus-virus interactions impact the population dynamics of influenza and the common cold. Proc. Natl Acad. Sci. USA 116, 27 142-27 150. ( 10.1073/pnas.1911083116) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Opatowski L, Baguelin M, Eggo RM. 2018. Influenza interaction with cocirculating pathogens and its impact on surveillance, pathogenesis, and epidemic profile: a key role for mathematical modelling. PLOS Pathog. 14, e1006770. ( 10.1371/journal.ppat.1006770) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Terajima M, Babon JAB, Ennis FA. 2013. Cross-reactive human B cell and T cell epitopes between influenza A and B viruses. Virol. J. 10, 1-10. ( 10.1186/1743-422X-10-244) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Laurie KL, et al. 2015. Interval between infections and viral hierarchy are determinants of viral interference following influenza virus infection in a ferret model. J. Infect. Dis. 212, 1701-1710. ( 10.1093/infdis/jiv260) [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

All R programming codes are freely available from Edmond, the Open Data Repository of the Max Planck Society: https://doi.org/10.17617/3.8k.


Articles from Proceedings of the Royal Society B: Biological Sciences are provided here courtesy of The Royal Society

RESOURCES