Applying mixture model methods to SARS-CoV-2 serosurvey data from Geneva

Judith A Bouman; Sarah Kadelka; Silvia Stringhini; Francesco Pennacchio; Benjamin Meyer; Sabine Yerly; Laurent Kaiser; Idris Guessous; Andrew S Azman; Sebastian Bonhoeffer; Roland R Regoes

doi:10.1016/j.epidem.2022.100572

. 2022 May 7;39:100572. doi: 10.1016/j.epidem.2022.100572

Applying mixture model methods to SARS-CoV-2 serosurvey data from Geneva

Judith A Bouman ^a,^⁎, Sarah Kadelka ^a, Silvia Stringhini ^b,^f,^k, Francesco Pennacchio ^b, Benjamin Meyer ⁱ, Sabine Yerly ^c,^d, Laurent Kaiser ^d,^e,^g, Idris Guessous ^b,^f, Andrew S Azman ^f,^h,^j, Sebastian Bonhoeffer ^a, Roland R Regoes ^a,^⁎

PMCID: PMC9076579 PMID: 35580458

Abstract

Serosurveys are an important tool to estimate the true extent of the current SARS-CoV-2 pandemic. So far, most serosurvey data have been analyzed with cutoff-based methods, which dichotomize individual measurements into sero-positives or negatives based on a predefined cutoff. However, mixture model methods can gain additional information from the same serosurvey data. Such methods refrain from dichotomizing individual values and instead use the full distribution of the serological measurements from pre-pandemic and COVID-19 controls to estimate the cumulative incidence. This study presents an application of mixture model methods to SARS-CoV-2 serosurvey data from the SEROCoV-POP study from April and May 2020 in Geneva (2766 individuals). Besides estimating the total cumulative incidence in these data (8.1% (95% CI: 6.8%–9.9%)), we applied extended mixture model methods to estimate an indirect indicator of disease severity, which is the fraction of cases with a distribution of antibody levels similar to hospitalized COVID-19 patients. This fraction is 51.2% (95% CI: 15.2%–79.5%) across the full serosurvey, but differs between three age classes: 21.4% (95% CI: 0%–59.6%) for individuals between 5 and 40 years old, 60.2% (95% CI: 21.5%–100%) for individuals between 41 and 65 years old and 100% (95% CI: 20.1%–100%) for individuals between 66 and 90 years old. Additionally, we find a mismatch between the inferred negative distribution of the serosurvey and the validation data of pre-pandemic controls. Overall, this study illustrates that mixture model methods can provide additional insights from serosurvey data.

Keywords: Serosurvey, Mixture model methods, SARS-CoV-2, Serological assays, COVID-19

1. Introduction

Serological surveys (serosurveys) are an important tool to estimate the cumulative incidence of SARS-CoV-2 infections in various geographic locations or risk groups during the current pandemic (Koopmans and Haagmans, 2020). Based on the estimated cumulative incidence, one can even calculate several related parameters such as: the ascertainment rate, i.e. the fraction of cases detected, the relative risk of infection for sub-groups (Stringhini et al., 2020), and the infection fatality rate (Levin et al., 2020). In 2020, many serosurveys have been conducted in a wide variety of geographic locations (Chen et al., 2021). The vast majority of these serosurvey studies have been analyzed with cutoff-based methods, meaning that each individual serological measurement has been dichotomized into sero-negative or positive based on a predefined cutoff value. This cutoff value has been defined based on a receiver operating characteristic (ROC) curve constructed from samples from pre-pandemic controls and known SARS-CoV-2 infections.

The cutoff-based method for analyzing serosurveys has two main challenges. Firstly, the cutoff depends on validation data from known SARS-CoV-2 infections, which are often not representative of the full spectrum of possible infections. Instead, cases used for the validation data are, especially at the beginning of an epidemic, biased towards severe infections and early convalescent periods (Føns and Krogfelt, 2021). However, it is known that disease severity influences the antibody level after infection (Serre-Miranda et al., 2021) and that antibody levels wane over time (den Hartog et al., 2021). This can lead to overly confident estimates of the sensitivity and specificity of the serological test and therefore bias the estimated cumulative incidence. Secondly, the amount of information obtained from the serosurvey is reduced by dichotomizing the continuous measurements. As a result, the cutoff-based method does not allow to differentiate between several types of SARS-CoV-2 infections (for instance mild and severe infections), nor to detect or correct for a possible mismatch between the cases included in the validation data and those in the serosurvey.

Both of the posed challenges can be circumvented by using mixture model methods. Instead of dichotomizing the individual serological observations, mixture model methods estimate the cumulative incidence directly based on the full distribution of serological measurements for the pre-pandemic controls and known SARS-CoV-2 infections (Bouman et al., 2021). As a result, this inference framework can also be used to determine whether the cases included as positive COVID-19 controls are a good representation of the cases in the serosurvey data or whether cases with a distinct distribution of serological measurements (such as measurements from individuals with an asymptomatic or mild infection) are missing in the validation data. Moreover, mixture model methods allow to use multiple distinct distributions of cases separately in the analysis. Even though mixture models have been successfully applied to serosurvey data for several pathogens (Vyse et al., 2006, Rota et al., 2008, Vink et al., 2015, van Boven et al., 2017), they are rarely used to analyze serosurvey data from SARS-CoV-2 studies (Bottomley et al., 2021).

In this study, we apply mixture model methods to serosurvey data from the SEROCoV-POP study that was performed in Geneva in April and May of 2020 (Stringhini et al., 2020). In addition to corroborating previous estimates of the cumulative incidence for these data (4.6% in first week (95% CI: 2.4%–8.0%) to 10.9% in the fifth week (95% CI: 8.2%–13.9%)) – we estimated a cumulative incidence of 8.1% (95% CI: 6.8%–9.9%) over the whole period of sampling –, our aim is to show how mixture model methods can be used to extract more information from serosurveys. We use an extended mixture model that takes into consideration the distribution of antibody levels of both hospitalized COVID-19 patients and outpatients. This results in an estimate of what we call the indirect indicator of severity, which is defined as the fraction of individuals in the serosurvey that display a distribution of antibody levels similar to that of the hospitalized patients in the control data. This fraction is not a direct estimate of the fraction of cases in the serosurvey that were treated in a hospital, as the validation data does not contain positive control data from asymptomatic and mild cases. Therefore, we rather refer to this quantity as the indirect indicator of disease severity.

2. Methods

2.1. Data

We used the pre-pandemic and COVID-19 control data from Meyer et al. as the validation data in this study (Meyer et al., 2020). The pre-pandemic control data consists of 326 samples, 276 of these originated from adults and 50 from children (Meyer et al., 2020). They were collected in 2013, 2014 and 2018 at the University Hospitals of Geneva (Meyer et al., 2020). 84 of the samples came from healthy individuals and 242 from patients consulting the hospital (Meyer et al., 2020). The COVID-19 control data was collected from 181 individuals at the University Hospitals of Geneva. The severity of their infection is indicated by either ‘hospitalized’ (n=91) or ‘outpatient’ (n=90) (Meyer et al., 2020). Both hospitalized and outpatient individuals displayed at least mild symptoms.

We also used data from the SEROCoV-POP study from April and May 2020 from Stringhini et al. (2020). Each week of the study, 1300 participants of the Bus Santé study were invited to participate via email and were asked to invite any household members (Stringhini et al., 2020). The Bus Santé study is an annual cross-sectional study of adults residing in Geneva state (Switzerland) that, at the time of the study, had 17 225 participants on record (Morabia et al., 1997, Guessous et al., 2012, Guessous et al., 2014, Mestral et al., 2020, Stringhini et al., 2020). The invitation process resulted in the participation of 2766 individuals of which 52.6% are female (Stringhini et al., 2020). Individuals aged between 50 and 64 were over-represented compared to the general population of Geneva and the age groups 5–9, 20–49 and 80–104 were under-represented (Stringhini et al., 2020). Recruited participants have a higher educational level than the general population of Geneva (Stringhini et al., 2020). The data from the 2766 recruited participants contains age, measured IgG OD ratio of the Euroimmun SARS-CoV-2 serological assay and sex. Additionally, the household structure between the individuals is indicated.

The serological assay measurements for all sera in both datasets were obtained with the Euroimmun SARS-CoV-2 serological assay which quantifies the IgG antibodies against the S1-domain of the spike protein of SARS-CoV-2 (Meyer et al., 2020). The IgG OD ratio is the result of the immunoreactivity of the sample measured at an optical density of 450 nm (OD450) divided by the OD450 of the calibrator (Meyer et al., 2020, Okba et al., 2020).

2.2. Mixture model methods

We have assembled all observations of the SEROCoV-POP study from April and May and apply the mixture model described by Bouman et al. (2021). All analyses are performed in R (Team R. Core et al., 2013). The basic mixture model maximizes the likelihood Eq. (1). Here, $U$ is the vector of observed IgG OD ratios in the serosurvey data, $σ$ is a binary vector of length $n$ with their underlying true serological status ( $1$ for past infection and $0$ for no past infection). The probabilities $p (U_{i} | σ_{i} = 0)$ and $p (U_{i} | σ_{i} = 1)$ capture the empirical distributions of IgG OD ratios for the pre-pandemic and COVID-19 control measurements, and $π$ is the cumulative incidence. The empirical distributions are obtained by smoothing the observed distributions. This is done with the ‘density’-function in R using the default kernel setting, ‘gaussian’. Team R. Core et al. (2013). The use of this smoothing function has been validated with simulated data in Bouman et al. (2021). The specific smoothing algorithm does influence the results, even though the differences are small. For example, the kernel ‘cosine’ results in a point estimate of the cumulative incidence of 8.7% ( $95 % C I : 6.9 % - 10.3 %$ ). More extensive data would allow us to determine the antibody distribution more reliable.

l l (U) = \sum_{i = 1}^{i = n} log (p (σ_{i} = 1 | π) p (U_{i} | σ_{i} = 1) + p (σ_{i} = 0 | π) p (U_{i} | σ_{i} = 0))

(1)

The likelihood is extended for the model where the outpatient and hospitalized cases are estimated separately, see Eq. (2). Here, $π_{o u t}$ is the cumulative incidence of outpatient cases and $π_{h o s p}$ the cumulative incidence of hospitalized cases, $σ_{i}$ can be $0$ (no past infection), $1$ (past outpatient infection) or $2$ (past hospitalized infection).

l l (U) = \sum_{i = 1}^{i = n} log (p (σ_{i} = 1 | π_{o u t}) p (U_{i} | σ_{i} = 1) + p (σ_{i} = 2 | π_{h o s p}) p (U_{i} | σ_{i} = 2) + p (σ_{i} = 0 | π_{o u t}, π_{h o s p}) p (U_{i} | σ_{i} = 0))

(2)

The 95% confidence intervals are estimated by bootstrapping the control distributions as well as the observations from the serosurvey. The various mixture models are compared with a likelihood ratio test.

We applied the extended model described above to the serosurvey data segregated into three age categories: 5–40 years, 41–65 years and 66–90 years. Even though the ages of the outpatient and hospitalized case populations are significantly different, we used the whole distribution of both populations for these analyses.

Testing for a mismatch between serosurvey and validation data.

To test if there is a mismatch between the observed serosurvey data and the validation data, we extend Eq. (2) with an additional class (see Eq. (3)). Thus, $σ$ can now take one of four categorical values where the new one represents an additional, yet unknown, category of cases. The distribution of this additional category ( $p (U_{i} | σ_{i} = 3)$ ) is modelled to be a normal distribution, where the mean and standard deviation are under optimization.

l l (U) = \sum_{i = 1}^{i = n} log (p (σ_{i} = 1 | π_{o u t}) p (U_{i} | σ_{i} = 1) + p (σ_{i} = 2 | π_{h o s p}) p (U_{i} | σ_{i} = 2) + p (σ_{i} = 3 | π_{a d d}) p (U_{i} | σ_{i} = 3) + p (σ_{i} = 0 | π_{o u t}, π_{h o s p}, π_{a d d}) p (U_{i} | σ_{i} = 0))

(3)

This model is then compared to the model of Eq. (2) to test if the additional distribution has significantly improved the likelihood of observing the serosurvey data.

We have also used an adjusted version of the method described above, where we summarized all observations below 0.34 into a point mass for the empirical distribution. The value of 0.34 is two standard deviations larger than the mean of the inferred mismatch in the distribution of pre-pandemic controls, to make sure that this mismatch is not included in the new distributions. The model is then performed with these distributions instead of the original empirical distributions of the negative and positive controls.

3. Results

3.1. Distributions of IgG OD ratios significantly differ for hospitalized and outpatient SARS-CoV-2 positive controls

Meyer et al. (2020) validated the diagnostic accuracy of the Euroimmun SARS-CoV-2 IgG and IgA immunoassay for SARS-CoV-2 infection (Meyer et al., 2020). For this validation, they used a pre-pandemic negative control group (negative controls, $326$ individuals) and two clinically distinguishable positive control groups: individuals who were hospitalized in the University Hospitals of Geneva (COVID-19 hospitalized, $91$ individuals), and individuals who were treated in outpatient clinics (COVID-19 outpatients, $90$ individuals). All positive controls tested positive for SARS-CoV-2 by PCR and showed at least mild symptoms. The observed IgG OD ratios of the Euroimmun SARS-CoV-2 immunoassay are shown in Fig. 1 for the negative controls and both groups of positive controls. The distribution of the IgG OD ratios for the hospitalized positive controls is significantly different from the outpatient positive controls (two-sample Wilcoxon test, $p$ -value $=$ 1.122e−05).

Fig. 1 — Histograms of IgG OD ratios of the Euroimmun SARS-CoV-2 IgG from the SEROCoV-POP study from April to May (Stringhini et al., 2020) and the validation data from Meyer et al. (2020). Solid lines indicate the empirical distributions. The purple solid line shows the inferred additional distribution that is an indication of the mismatch between the pre-pandemic controls and the serosurvey data. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

3.2. Model that separately estimates the cumulative incidence for hospitalized and outpatient control data is significantly better than model based on one type of controls only

The significant difference between the distributions of the IgG OD ratios for the hospitalized and the outpatient controls allows the mixture model method to simultaneously estimate the cumulative incidence of both types of cases in the data from the SEROCoV-POP study from April and May 2020 (see Eq. (2)). We find a cumulative incidence of 4.0% (95% CI: 0.8%–7.4%)) for cases with a distribution of antibody levels similar to hospitalized controls and 4.2% (95% CI: 1.4%–7.4%)) for cases with a distribution of antibody levels similar to outpatient controls. As a result, the fraction of cases in the serosurvey that can be explained with the distribution of the IgG OD ratios from the hospitalized controls, which we refer to as the indirect indicator of disease severity, is 51.2% (95% CI: $9.9 %– 83.7 %$ ). The large 95% CI of this indicator of disease severity is caused by the overlap in the two positive control distributions.

To investigate if the model improves by including a separate estimate for both types of positive controls, we compared the likelihood of the estimates above to the likelihood from a model that is based on either the hospitalized or outpatient control data only (see Eq. (1) and Table 1). The p-values in Table 1 indicate that the model is indeed significantly improved by estimating two cumulative incidences separately. Table 1 also shows that the point estimate of the total cumulative incidence estimate is higher if the mixture model is based on the outpatient controls only and lower if it is based on the hospitalized controls only, compared to the model that uses both distributions. This is expected, as the distribution obtained from the COVID-19 hospitalized controls is more distinguishable from the pre-pandemic controls than the COVID-19 outpatient controls.

Table 1.

Overview of cumulative incidence estimates based on various positive control data. The p-values are the result of a likelihood ratio test.

Type of positive control data	Cumulative incidence estimate	p-value compared with separate outpatient and hospitalized data
Outpatient data only	8.4% (6.9%–10.1%)	4.1e−08
Hospitalized data only	7.7% (6.3%–9.6%)	5.0e−07
Outpatient and hospitalized data treated as separate distributions	8.1% (6.8%–9.9%)	–

Open in a new tab

3.3. Indirect indicator of disease severity differs between age groups

It is known that there is a correlation between the age of an infected individual and the severity of a SARS-CoV-2 infection (Liu et al., 2020). To validate our methodology, we estimated the indirect indicator of disease severity for three age-classes: 5 to 40 years, 41 to 65 years and 66 to 90 years. These estimates, together with the total cumulative incidence estimates for the age-classes, are shown in Table 2. Indeed, the indirect indicator of disease severity is highest for the oldest age class: we estimated that 100% of the cases in the serosurvey can be explained by the distribution of the hospitalized COVID-19 controls, for the middle and young class this is 60.2% and 21.4%, respectively (see Fig. 2). Fig. 3 shows that the maximal observed IgG OD ratio as well as the median of all values above the cutoff provided by the manufacturer (red dots) increase with age. However, the overall median of the distribution does not increase with age (black dots). This illustrates that the observed increase in the indirect indicator of disease severity is driven by the upper part of the IgG OD ratio distributions. The model that separately considers the age classes is significantly better than the model without these age classes after correcting for the increased amount of parameters (likelihood-ratio test, $p$ -value $=$ 0.009).

Fig. 2 — The indirect indicator of disease severity per age class, including the 95% confidence intervals.

Fig. 3 — Violin plots of the distributions from the validation data (pre-pandemic controls, COVID-19 outpatient cases and COVID-19 hospitalized cases) and age-stratified serosurvey data. The black dots indicate the median of the full distribution and the red dots the median of all values larger than the cutoff of seropositivity provided by the manufacturer (1.1).

Men, compared to women, are more likely to suffer from a severe SARS-CoV-2 infection (Peckham et al., 2020). Again, this can also be found by applying the mixture model method to the serosurvey data (see Table 2). The point estimate of the indirect indicator of disease severity is higher for males compared to females, although this difference is not significant. The $p$ -value of a likelihood-ratio test for the model that separates female and male participants with the original model is 0.046. The age distribution of the males and females are comparable in the serosurvey (two-sample Wilcoxon test, $p$ -value $=$ 0.18).

Table 2.

Cumulative incidence and indicator of disease severity for three age-classes.

Sub-population	Number of observations	Total cumulative incidence	Indirect indicator of disease severity
age-range [5-40]	1077	10.1% (7.8%–12.7%)	21.4% (0%-59.6%)
age-range [41-65]	1355	7.6% (5.8%–9.6%)	60.2% (21.5%–100%)
age-range [66-90]	334	4.1% (0%–6.9%)	100% (20.1%–100%)
Females	1454	7.1% (5.3%–9.0%)	45.3% (4.1%–91.9%)
Males	1312	9.3% (7.3%–11.6%)	50.8% (16.7%–90.7%)

Open in a new tab

3.4. Mismatch between pre-pandemic controls and individuals without previous SARS-CoV-2 infection in the serosurvey

Mixture model methods give unbiased results when the pre- pandemic control data represent individuals without previous COVID-19 infection and the COVID-19 control data span the whole range of COVID-19 severity and their relative occurrence. The method presented in this manuscript can be used to test whether there is a mismatch between the validation and serosurvey data. This is done by testing if there is more statistical support for a extended mixture model that assumes an additional, hidden distribution of antibody levels (see Methods) (Bouman et al., 2021).

In the SEROCoV-POP serosurvey, we indeed infer such a mismatch between the validation and the serosurvey data ( $p$ -value likelihood ratio test $= 8 e - 105$ ). Fig. 1 shows the distribution of serological measurements that are inferred to be present in the serosurvey data but not in the validation data (purple line — ’additional distribution’). This distribution is on the lower end of the range of antibody levels, covering part of the distribution of the pre-pandemic control samples. This indicates that the inferred mismatch is due to a discrepancy between the measurements from the pre-pandemic control samples and the individuals from the serosurvey study who likely did not have a past SARS-CoV-2 infection.

The total cumulative incidence of SARS-CoV-2 infections for the model that includes the additional case distribution is 9.3% (95% CI: 6.7%–10.6%) and thus higher than without this distribution. The reason for this is that the additional distribution is on the lower range of the observed IgG OD ratios, hence some of the lower range values from the serosurvey data are now inferred to be similar to the additional distribution and thus also COVID-19 cases.

3.5. No evidence for an additional missing positive control distribution with lower mean

The previous subsection describes the mismatch we identified between the negative controls and individuals with low serological measurements in the serosurvey. We hypothesized that there is a further mismatch between the distributions of serological measurements of COVID-19 cases and the serosurvey because the COVID-19 cases in the validation data were all symptomatic and relatively severe and the cases in the serosurvey span the whole spectrum of severity. Specifically, we expected to find evidence for an additional distribution representing COVID-19 cases with a lower mean than the distributions for the outpatient and hospitalized cases.

To test for such an additional case distribution, we lumped all IgG OD ratios below 0.34 into a single point mass to direct the focus of the analysis away from low serological measurements and investigated a potential additional mismatch on the higher end of the observed IgG OD ratios. However, we did not find any statistical support for such an additional mismatch. This suggests that the individuals with high IgG OD ratios in the serosurvey are well represented by the positive control data.

4. Discussion

In this study, we present an application of mixture model methods to SARS-CoV-2 serosurvey data. Serosurvey data are currently used to determine the proportion of seropositivity and to estimate the cumulative incidence and the relative risk of seropositivity in various sub-groups. This is usually done by introducing a cutoff for seropositivity.

We show that mixture models that use the entire distribution of the antibody levels rather than a cutoff for seropositivity, provide additional insights into aspects of an epidemic that are usually not addressed in serosurveys. Specifically, we have used mixture models to infer the cumulative incidence from distinct serological distributions, in this case those from hospitalized and outpatient COVID-19 positive controls. We found that the indirect indicator of disease severity (the fraction of individuals with antibody distributions similar to hospitalized cases) increases with age mirroring evidence from clinical studies. Additionally, mixture model methods can be used to test for a mismatch between the pre-pandemic and COVID-19 control data and the serosurvey data, which could indicate that the cases observed in the population are not well represented by those included in the control data. While we provide evidence for such discrepancies, they are not indicative of a large fraction of cases with intermediate antibody levels that would be expected for asymptomatically infected individuals.

Other studies using mixture model methods to analyze SARS-CoV-2 serosurvey data have been conducted. Vos et al. (2021) used mixture models to validate their cutoff value (Vos et al., 2021). They assumed and inferred control and case distributions from the serosurvey data (Vos et al., 2021). Our approach in contrast, is based on the observed distributions of serological measurements in prepandemic sera and sera of individuals with PCR-confirmed SARS-CoV-2 infection. These observed distributions are not adequately captured by the normal distributions Vos et al. assumed. Hence, our study represents a more stringent and empirically-supported use of mixture models.

Although the mixture model approach naturally allows to implement declining antibody levels and sero-reversion (Kadelka et al., 2021), we have not corrected our estimate of the cumulative incidence for the possible effect of sero-reversion. The reason for this is that the serosurvey was conducted within 4 months of the start of the pandemic. Current estimates of antibody half-lives IgG RBD are around 50–106 days (Dan et al., 2021). Therefore we expect the effect of sero-reversion to be negligible. Furthermore, we did not correct the estimate of cumulative incidence for age nor household structure because our study was aiming to provide a proof of concept rather than additional estimates for the sero-prevalence in Geneva. As a result, the estimates presented here are only representative for the study population and not for the general population of Geneva. Estimates for the cumulative incidence of the general population of Geneva from these data can be found in Stringhini et al. (2020) (Stringhini et al., 2020).

The presented estimates of the indirect indicator of disease severity have wide confidence intervals. This is caused by the fact that while the distributions of the antibody levels for COVID-19 hospitalized and outpatient cases are significantly different, there is quite a lot of overlap (see figure S1). This could potentially be improved if more detailed positive control data would be available to guide the construction of more distinguishable distributions of IgG OD ratios based on characteristics of the infections or infected individuals. Despite the large confidence intervals, we found that the indirect indicator of disease severity increases with age, corroborating previous reports (Liu et al., 2020). Similarly, the point estimate of the indirect indicator of disease severity is higher for males compared to females, consistent with reported sex differences in ICU admission and death (Peckham et al., 2020).

Mixture model methods give unbiased results when both the negative and positive controls in the validation data represent the general population well (Bouman et al., 2021). However, we know that there are some issues with our validation data. First, the pre-pandemic control data over-represent individuals with pathological conditions and, for both the pre-pandemic and the COVID-19 controls, the age-distribution is different from the general population (Meyer et al., 2020). Second, all cases in the COVID-19 control group show at least mild symptoms and half of them have been hospitalized. Therefore, the severity of the selected cases is higher than expected in a random group of COVID-19 patients and is not consistent with the estimated fraction of 20% asymptomatic cases (Buitrago-Garcia et al., 2020). This is likely to result in a different distribution of serological measurements as severe cases have been shown to give rise to higher antibody levels than mild cases (GeurtsvanKessel et al., 2020, Okba et al., 2020). Third, the ratio of outpatient to hospitalized cases in the control group is lower than expected, which could lead to an underestimation of the cumulative incidence (Bouman et al., 2021). More extensive and representative validation data could improve the cumulative incidence estimates, however, such data are difficult to collect, especially at the beginning of a pandemic, when asymptomatic and mild cases often go undetected and their proportion is unknown.

Part of the aim of our approach was to identify potential biases caused by any of the mentioned limitations in the validation data. Interestingly, however, the mismatch we identify is not characterized by an intermediate level of antibodies in between the level of the pre-pandemic sera and the outpatients as we would expect for a missing distribution of mild or asymptotic cases. Opposite to our expectation we found that the serosurvey data display a narrower distribution at the lower end of the antibody levels than the pre-pandemic, negative controls — as if there were asymptomatic or mild SARS-CoV-2 infections among the pre-pandemic controls. A more detailed characterization of the individuals from whom the pre-pandemic control sera were sampled, as well as the determination of antibody levels in asymptomatic and mild cases could shed further light on this mismatch and thus further improve the estimation of the cumulative incidence. An additional improvement could be obtained when a quantitative immuno-assay would be used, instead of the semi-quantitative Euroimmun that was available at the beginning of the pandemic.

CRediT authorship contribution statement

Judith A. Bouman: Conceptualization, Methodology, Validation, Formal analysis, Writing – original draft, Writing – review & editing, Visualization. Sarah Kadelka: Methodology, Writing – original draft, Writing – review & editing. Silvia Stringhini: Resources, Writing – original draft, Writing – review & editing. Francesco Pennacchio: Resources, Writing – original draft, Writing – review & editing. Benjamin Meyer: Resources, Writing – original draft, Writing – review & editing. Sabine Yerly: Resources, Writing – original draft, Writing – review & editing. Laurent Kaiser: Resources, Writing – original draft, Writing – review & editing. Idris Guessous: Resources, Writing – original draft, Writing – review & editing. Andrew S. Azman: Methodology, Writing – original draft, Writing – review & editing. Sebastian Bonhoeffer: Conceptualization, Writing – original draft, Writing – review & editing. Roland R. Regoes: Conceptualization, Methodology, Validation, Formal analysis, Writing – original draft, Writing – review & editing, Visualization, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

We would like to thank Peter Ashcroft, Sonja Lehtinen and Jana Huisman for valuable comments on the manuscript. Roland Regoes gratefully acknowledges funding from the Botnar Research Centre for Child Health, Switzerland (grant number 2020-FS-354).

Ethics statement

The SEROCoV-POP study was approved by the Cantonal Research Ethics Commission of Geneva, Switzerland (CER16-363). The full study protocol is available online (in French).

Footnotes

^{Appendix A}

Supplementary material related to this article can be found online at https://doi.org/10.1016/j.epidem.2022.100572.

Appendix A. Supplementary data

The following is the Supplementary material related to this article.

MMC S1

ROC curves of hospitalized and outpatient validation data.

mmc1.pdf^{(78.1KB, pdf)}

Data availability

Data will be made available on request.

References

Bottomley C., Otiende M., Uyoga S., Gallagher K., Kagucia E., Etyang A., Mugo D., Gitonga J., Karanja H., Nyagwange J., Adetifa I., Agweyu A., Nokes D., Warimwe G., Scott J. 2021. Improving SARS-CoV-2 cumulative incidence estimation through mixture modelling of antibody levels. MedRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bouman J.A., Riou J., Bonhoeffer S., Regoes R.R. Estimating the cumulative incidence of SARSCoV-2 with imperfect serological tests: Exploiting cutoff-free approaches. PLoS Comput. Biol. 2021;17(2) doi: 10.1371/JOURNAL.PCBI.1008728. [DOI] [PMC free article] [PubMed] [Google Scholar]
van Boven M., van de Kassteele J., Korndewal M.J., van Dorp C.H., Kretzschmar M., van der Klis F., de Melker H.E., Vossen A.C., van Baarle D. Infectious reactivation of cytomegalovirus explaining age- and sex-specific patterns of seroprevalence. PLoS Comput. Biol. 2017;13(9) doi: 10.1371/journal.pcbi.1005719. [DOI] [PMC free article] [PubMed] [Google Scholar]
Buitrago-Garcia D., Egli-Gany D., Counotte M.J., Hossmann S., Imeri H., Ipekci A.M., Salanti G., Low N. Occurrence and transmission potential of asymptomatic and presymptomatic SARS-CoV-2 infections: A living systematic review and meta-analysis. PLOS Med. 2020;17(9) doi: 10.1371/journal.pmed.1003346. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chen X., Chen Z., Azman A.S., Deng X., Sun R., Zhao Z., Zheng N., Chen X., Lu W., Zhuang T., Yang J., Viboud C., Ajelli M., Leung D.T., Yu H. Serological evidence of human infection with SARS-CoV-2: a systematic review and meta-analysis. Lancet Glob. Health. 2021 doi: 10.1016/S2214-109X(21)00026-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
Dan J.M., Mateus J., Kato Y., Hastie K.M., Yu E.D., Faliti C.E., Grifoni A., Ramirez S.I., Haupt S., Frazier A., Nakao C., Rayaprolu V., Rawlings S.A., Peters B., Krammer F., Simon V., Saphire E.O., Smith D.M., Weiskopf D., Sette A., Crotty S. Immunological memory to SARS-CoV-2 assessed for up to 8 months after infection. Science. 2021;371(6529) doi: 10.1126/science.abf4063. [DOI] [PMC free article] [PubMed] [Google Scholar]
Føns S., Krogfelt K.A. How can we interpret SARS-CoV-2 antibody test results? Pathogens Dis. 2021;79(1) doi: 10.1093/femspd/ftaa069. [DOI] [PMC free article] [PubMed] [Google Scholar]
GeurtsvanKessel C.H., Okba N.M.A., Igloi Z., Bogers S., Embregts C.W.E., Laksono B.M., Leijten L., Rokx C., Rijnders B., Rahamat-Langendoen J., van den Akker J.P.C., van Kampen J.J.A., van der Eijk A.A., van Binnendijk R.S., Haagmans B., Koopmans M. An evaluation of COVID-19 serological assays informs future diagnostics and exposure assessment. Nature Commun. 2020;11(1) doi: 10.1038/s41467-020-17317-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
Guessous I., Bochud M., Theler J.-M., Gaspoz J.-M., Pechère-Bertschi A. 1999–2009 Trends in prevalence, unawareness, treatment and control of hypertension in Geneva, Switzerland. PLoS One. 2012;7(6) doi: 10.1371/journal.pone.0039877. [DOI] [PMC free article] [PubMed] [Google Scholar]
Guessous I., Gaspoz J.-M., Theler J.-M., Kayser B. Eleven-year physical activity trends in a Swiss urban area. Prev. Med. 2014;59:25–30. doi: 10.1016/j.ypmed.2013.11.005. [DOI] [PubMed] [Google Scholar]
den Hartog G., Vos E.R.A., van den Hoogen L.L., van Boven M., Schepp R.M., Smits G., van Vliet J., Woudstra L., Wijmenga-Monsuur A.J., van Hagen C.C.E., Sanders E.A.M., de Melker H.E., van der Klis F.R.M., van Binnendijk R.S. Persistence of antibodies to SARS-CoV-2 in relation to symptoms in a nationwide prospective study. Clin. Infect. Dis. 2021 doi: 10.1093/cid/ciab172. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kadelka S., Bouman J.A., Ashcroft P., Regoes R.R. 2021. Comment on buss others, Science 2021: An alternative, empirically-supported adjustment for sero-reversion yields a 10 percentage point lower estimate of the cumulative incidence of SARS-CoV-2 in Manaus by October 2020. Arxiv. [Google Scholar]
Koopmans M., Haagmans B. Assessing the extent of SARS-CoV-2 circulation through serological studies. Nat. Med. 2020;26(8) doi: 10.1038/s41591-020-1018-x. [DOI] [PubMed] [Google Scholar]
Levin A.T., Hanage W.P., Owusu-Boaitey N., Cochran K.B., Walsh S.P., Meyerowitz-Katz G. Assessing the age specificity of infection fatality rates for COVID-19: systematic review, meta-analysis, and public policy implications. Eur. J. Epidemiol. 2020;35(12) doi: 10.1007/s10654-020-00698-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu Y., Mao B., Liang S., Yang J.-W., Lu H.-W., Chai Y.-H., Wang L., Zhang L., Li Q.-H., Zhao L., He Y., Gu X.-L., Ji X.-B., Li L., Jie Z.-J., Li Q., Li X.-Y., Lu H.-Z., Zhang W.-H., Song Y.-L., Qu J.-M., Xu J.-F. Association between age and clinical characteristics and outcomes of COVID-19. Eur. Respir. J. 2020;55(5) doi: 10.1183/13993003.01112-2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mestral C., Stringhini S., Guessous I., Jornayvaz F.R. Thirteen-year trends in the prevalence of diabetes in an urban region of Switzerland: a population-based study. Diabetic Med. 2020;37(8):1374–1378. doi: 10.1111/dme.14206. [DOI] [PubMed] [Google Scholar]
Meyer B., Torriani G., Yerly S., Mazza L., Calame A., Arm-Vernez I., Zimmer G., Agoritsas T., Stirnemann J., Spechbach H., Guessous I., Stringhini S., Pugin J., Roux-Lombard P., Fontao L., Siegrist C.A., Eckerle I., Vuilleumier N., Kaiser L. Validation of a commercially available SARS-CoV-2 serological immunoassay. Clin. Microbiol. Infect. 2020;26(10):1386–1394. doi: 10.1016/j.cmi.2020.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
Morabia A., Bernstein M., Héritier S., Ylli A. Community-based surveillance of cardiovascular risk factors in Geneva: Methods, resulting distributions, and comparisons with other populations. Prev. Med. 1997;26(3):311–319. doi: 10.1006/pmed.1997.0146. [DOI] [PubMed] [Google Scholar]
Okba N.M., Müller M.A., Li W., Wang C., GeurtsvanKessel C.H., Corman V.M., Lamers M.M., Sikkema R.S., de Bruin E., Chandler F.D., Yazdanpanah Y., Le Hingrat Q., Descamps D., Houhou-Fidouh N., Reusken C.B., Bosch B.-J., Drosten C., Koopmans M.P., Haagmans B.L. Severe acute respiratory syndrome coronavirus 2 specific antibody responses in Coronavirus disease patients. Emerg. Infect. Diseases. 2020;26(7) doi: 10.3201/eid2607.200841. [DOI] [PMC free article] [PubMed] [Google Scholar]
Peckham H., de Gruijter N.M., Raine C., Radziszewska A., Ciurtin C., Wedderburn L.R., Rosser E.C., Webb K., Deakin C.T. Male sex identified by global COVID-19 meta-analysis as a risk factor for death and ITU admission. Nature Commun. 2020;11(1) doi: 10.1038/s41467-020-19741-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rota M.C., Massari M., Gabutti G., Guido M., De Donno A., Atti M.L.C.d. Measles serological survey in the Italian population: Interpretation of results using mixture model. Vaccine. 2008;26(34) doi: 10.1016/j.vaccine.2008.05.094. [DOI] [PubMed] [Google Scholar]
Serre-Miranda C., Nobrega C., Roque S., Canto-Gomes J., Silva C., Vieira N., Barreira-Silva P., Alves-Peixoto P., Cotter J., Reis A., Formigo M., Sarmento H., Pires O., Carvalho A., Petrovykh D., Diéguez L., Sousa J., Sousa N., Capela C., Palha J., Cunha P., Correia-Neves M. Performance assessment of 11 commercial serological tests for SARS-CoV-2 on hospitalised COVID-19 patients. Int. J. Infect. Dis. 2021;104 doi: 10.1016/j.ijid.2021.01.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
Stringhini S., Wisniak A., Piumatti G., Azman A.S., Lauer S.A., Baysson H., De Ridder D., Petrovic D., Schrempft S., Marcus K., Yerly S., Arm Vernez I., Keiser O., Hurst S., Posfay-Barbe K.M., Trono D., Pittet D., Gétaz L., Chappuis F., Eckerle I., Vuilleumier N., Meyer B., Flahault A., Kaiser L., Guessous I. Seroprevalence of anti-SARS-CoV-2 IgG antibodies in Geneva, Switzerland (seroCoV-POP): a population-based study. Lancet. 2020;396(10247):313–319. doi: 10.1016/S0140-6736(20)31304-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Team R. Core S., et al. 2013. R: A language and environment for statistical computing. Vienna, Austria. [Google Scholar]
Vink M.A., van de Kassteele J., Wallinga J., Teunis P.F.M., Bogaards J.A. Estimating seroprevalence of human papillomavirus type 16 using a mixture model with smoothed age-dependent mixing proportions. Epidemiology. 2015;26(1) doi: 10.1097/EDE.0000000000000196. [DOI] [PubMed] [Google Scholar]
Vos E.R.A., van Boven M., den Hartog G., Backer J.A., Klinkenberg D., van Hagen C.C.E., Boshuizen H., van Binnendijk R.S., Mollema L., van der Klis F.R.M., de Melker H.E. Associations between measures of social distancing and severe acute respiratory syndrome coronavirus 2 seropositivity: A nationwide population-based study in the Netherlands. Clin. Infect. Dis. 2021 doi: 10.1093/cid/ciab264. [DOI] [PMC free article] [PubMed] [Google Scholar]
Vyse A.J., Gay N.J., Hesketh L.M., Pebody R., Morgan-Capner P., Miller E. Interpreting serological surveys using mixture models: the seroepidemiology of measles, mumps and rubella in England and Wales at the beginning of the 21st century. Epidemiol. Infect. 2006;134(6) doi: 10.1017/S0950268806006340. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

MMC S1

ROC curves of hospitalized and outpatient validation data.

mmc1.pdf^{(78.1KB, pdf)}

Data Availability Statement

Data will be made available on request.

[b1] Bottomley C., Otiende M., Uyoga S., Gallagher K., Kagucia E., Etyang A., Mugo D., Gitonga J., Karanja H., Nyagwange J., Adetifa I., Agweyu A., Nokes D., Warimwe G., Scott J. 2021. Improving SARS-CoV-2 cumulative incidence estimation through mixture modelling of antibody levels. MedRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b2] Bouman J.A., Riou J., Bonhoeffer S., Regoes R.R. Estimating the cumulative incidence of SARSCoV-2 with imperfect serological tests: Exploiting cutoff-free approaches. PLoS Comput. Biol. 2021;17(2) doi: 10.1371/JOURNAL.PCBI.1008728. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b3] van Boven M., van de Kassteele J., Korndewal M.J., van Dorp C.H., Kretzschmar M., van der Klis F., de Melker H.E., Vossen A.C., van Baarle D. Infectious reactivation of cytomegalovirus explaining age- and sex-specific patterns of seroprevalence. PLoS Comput. Biol. 2017;13(9) doi: 10.1371/journal.pcbi.1005719. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b4] Buitrago-Garcia D., Egli-Gany D., Counotte M.J., Hossmann S., Imeri H., Ipekci A.M., Salanti G., Low N. Occurrence and transmission potential of asymptomatic and presymptomatic SARS-CoV-2 infections: A living systematic review and meta-analysis. PLOS Med. 2020;17(9) doi: 10.1371/journal.pmed.1003346. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b5] Chen X., Chen Z., Azman A.S., Deng X., Sun R., Zhao Z., Zheng N., Chen X., Lu W., Zhuang T., Yang J., Viboud C., Ajelli M., Leung D.T., Yu H. Serological evidence of human infection with SARS-CoV-2: a systematic review and meta-analysis. Lancet Glob. Health. 2021 doi: 10.1016/S2214-109X(21)00026-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b6] Dan J.M., Mateus J., Kato Y., Hastie K.M., Yu E.D., Faliti C.E., Grifoni A., Ramirez S.I., Haupt S., Frazier A., Nakao C., Rayaprolu V., Rawlings S.A., Peters B., Krammer F., Simon V., Saphire E.O., Smith D.M., Weiskopf D., Sette A., Crotty S. Immunological memory to SARS-CoV-2 assessed for up to 8 months after infection. Science. 2021;371(6529) doi: 10.1126/science.abf4063. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b7] Føns S., Krogfelt K.A. How can we interpret SARS-CoV-2 antibody test results? Pathogens Dis. 2021;79(1) doi: 10.1093/femspd/ftaa069. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b8] GeurtsvanKessel C.H., Okba N.M.A., Igloi Z., Bogers S., Embregts C.W.E., Laksono B.M., Leijten L., Rokx C., Rijnders B., Rahamat-Langendoen J., van den Akker J.P.C., van Kampen J.J.A., van der Eijk A.A., van Binnendijk R.S., Haagmans B., Koopmans M. An evaluation of COVID-19 serological assays informs future diagnostics and exposure assessment. Nature Commun. 2020;11(1) doi: 10.1038/s41467-020-17317-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b9] Guessous I., Bochud M., Theler J.-M., Gaspoz J.-M., Pechère-Bertschi A. 1999–2009 Trends in prevalence, unawareness, treatment and control of hypertension in Geneva, Switzerland. PLoS One. 2012;7(6) doi: 10.1371/journal.pone.0039877. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b10] Guessous I., Gaspoz J.-M., Theler J.-M., Kayser B. Eleven-year physical activity trends in a Swiss urban area. Prev. Med. 2014;59:25–30. doi: 10.1016/j.ypmed.2013.11.005. [DOI] [PubMed] [Google Scholar]

[b11] den Hartog G., Vos E.R.A., van den Hoogen L.L., van Boven M., Schepp R.M., Smits G., van Vliet J., Woudstra L., Wijmenga-Monsuur A.J., van Hagen C.C.E., Sanders E.A.M., de Melker H.E., van der Klis F.R.M., van Binnendijk R.S. Persistence of antibodies to SARS-CoV-2 in relation to symptoms in a nationwide prospective study. Clin. Infect. Dis. 2021 doi: 10.1093/cid/ciab172. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b12] Kadelka S., Bouman J.A., Ashcroft P., Regoes R.R. 2021. Comment on buss others, Science 2021: An alternative, empirically-supported adjustment for sero-reversion yields a 10 percentage point lower estimate of the cumulative incidence of SARS-CoV-2 in Manaus by October 2020. Arxiv. [Google Scholar]

[b13] Koopmans M., Haagmans B. Assessing the extent of SARS-CoV-2 circulation through serological studies. Nat. Med. 2020;26(8) doi: 10.1038/s41591-020-1018-x. [DOI] [PubMed] [Google Scholar]

[b14] Levin A.T., Hanage W.P., Owusu-Boaitey N., Cochran K.B., Walsh S.P., Meyerowitz-Katz G. Assessing the age specificity of infection fatality rates for COVID-19: systematic review, meta-analysis, and public policy implications. Eur. J. Epidemiol. 2020;35(12) doi: 10.1007/s10654-020-00698-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b15] Liu Y., Mao B., Liang S., Yang J.-W., Lu H.-W., Chai Y.-H., Wang L., Zhang L., Li Q.-H., Zhao L., He Y., Gu X.-L., Ji X.-B., Li L., Jie Z.-J., Li Q., Li X.-Y., Lu H.-Z., Zhang W.-H., Song Y.-L., Qu J.-M., Xu J.-F. Association between age and clinical characteristics and outcomes of COVID-19. Eur. Respir. J. 2020;55(5) doi: 10.1183/13993003.01112-2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b16] Mestral C., Stringhini S., Guessous I., Jornayvaz F.R. Thirteen-year trends in the prevalence of diabetes in an urban region of Switzerland: a population-based study. Diabetic Med. 2020;37(8):1374–1378. doi: 10.1111/dme.14206. [DOI] [PubMed] [Google Scholar]

[b17] Meyer B., Torriani G., Yerly S., Mazza L., Calame A., Arm-Vernez I., Zimmer G., Agoritsas T., Stirnemann J., Spechbach H., Guessous I., Stringhini S., Pugin J., Roux-Lombard P., Fontao L., Siegrist C.A., Eckerle I., Vuilleumier N., Kaiser L. Validation of a commercially available SARS-CoV-2 serological immunoassay. Clin. Microbiol. Infect. 2020;26(10):1386–1394. doi: 10.1016/j.cmi.2020.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b18] Morabia A., Bernstein M., Héritier S., Ylli A. Community-based surveillance of cardiovascular risk factors in Geneva: Methods, resulting distributions, and comparisons with other populations. Prev. Med. 1997;26(3):311–319. doi: 10.1006/pmed.1997.0146. [DOI] [PubMed] [Google Scholar]

[b19] Okba N.M., Müller M.A., Li W., Wang C., GeurtsvanKessel C.H., Corman V.M., Lamers M.M., Sikkema R.S., de Bruin E., Chandler F.D., Yazdanpanah Y., Le Hingrat Q., Descamps D., Houhou-Fidouh N., Reusken C.B., Bosch B.-J., Drosten C., Koopmans M.P., Haagmans B.L. Severe acute respiratory syndrome coronavirus 2 specific antibody responses in Coronavirus disease patients. Emerg. Infect. Diseases. 2020;26(7) doi: 10.3201/eid2607.200841. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b20] Peckham H., de Gruijter N.M., Raine C., Radziszewska A., Ciurtin C., Wedderburn L.R., Rosser E.C., Webb K., Deakin C.T. Male sex identified by global COVID-19 meta-analysis as a risk factor for death and ITU admission. Nature Commun. 2020;11(1) doi: 10.1038/s41467-020-19741-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b21] Rota M.C., Massari M., Gabutti G., Guido M., De Donno A., Atti M.L.C.d. Measles serological survey in the Italian population: Interpretation of results using mixture model. Vaccine. 2008;26(34) doi: 10.1016/j.vaccine.2008.05.094. [DOI] [PubMed] [Google Scholar]

[b22] Serre-Miranda C., Nobrega C., Roque S., Canto-Gomes J., Silva C., Vieira N., Barreira-Silva P., Alves-Peixoto P., Cotter J., Reis A., Formigo M., Sarmento H., Pires O., Carvalho A., Petrovykh D., Diéguez L., Sousa J., Sousa N., Capela C., Palha J., Cunha P., Correia-Neves M. Performance assessment of 11 commercial serological tests for SARS-CoV-2 on hospitalised COVID-19 patients. Int. J. Infect. Dis. 2021;104 doi: 10.1016/j.ijid.2021.01.038. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b23] Stringhini S., Wisniak A., Piumatti G., Azman A.S., Lauer S.A., Baysson H., De Ridder D., Petrovic D., Schrempft S., Marcus K., Yerly S., Arm Vernez I., Keiser O., Hurst S., Posfay-Barbe K.M., Trono D., Pittet D., Gétaz L., Chappuis F., Eckerle I., Vuilleumier N., Meyer B., Flahault A., Kaiser L., Guessous I. Seroprevalence of anti-SARS-CoV-2 IgG antibodies in Geneva, Switzerland (seroCoV-POP): a population-based study. Lancet. 2020;396(10247):313–319. doi: 10.1016/S0140-6736(20)31304-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b24] Team R. Core S., et al. 2013. R: A language and environment for statistical computing. Vienna, Austria. [Google Scholar]

[b25] Vink M.A., van de Kassteele J., Wallinga J., Teunis P.F.M., Bogaards J.A. Estimating seroprevalence of human papillomavirus type 16 using a mixture model with smoothed age-dependent mixing proportions. Epidemiology. 2015;26(1) doi: 10.1097/EDE.0000000000000196. [DOI] [PubMed] [Google Scholar]

[b26] Vos E.R.A., van Boven M., den Hartog G., Backer J.A., Klinkenberg D., van Hagen C.C.E., Boshuizen H., van Binnendijk R.S., Mollema L., van der Klis F.R.M., de Melker H.E. Associations between measures of social distancing and severe acute respiratory syndrome coronavirus 2 seropositivity: A nationwide population-based study in the Netherlands. Clin. Infect. Dis. 2021 doi: 10.1093/cid/ciab264. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b27] Vyse A.J., Gay N.J., Hesketh L.M., Pebody R., Morgan-Capner P., Miller E. Interpreting serological surveys using mixture models: the seroepidemiology of measles, mumps and rubella in England and Wales at the beginning of the 21st century. Epidemiol. Infect. 2006;134(6) doi: 10.1017/S0950268806006340. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Applying mixture model methods to SARS-CoV-2 serosurvey data from Geneva

Judith A Bouman

Sarah Kadelka

Silvia Stringhini

Francesco Pennacchio

Benjamin Meyer

Sabine Yerly

Laurent Kaiser

Idris Guessous

Andrew S Azman

Sebastian Bonhoeffer

Roland R Regoes

Abstract

1. Introduction

2. Methods

2.1. Data

2.2. Mixture model methods

Testing for a mismatch between serosurvey and validation data.

3. Results

3.1. Distributions of IgG OD ratios significantly differ for hospitalized and outpatient SARS-CoV-2 positive controls

Fig. 1.

3.2. Model that separately estimates the cumulative incidence for hospitalized and outpatient control data is significantly better than model based on one type of controls only

Table 1.

3.3. Indirect indicator of disease severity differs between age groups

Fig. 2.

Fig. 3.

Table 2.

3.4. Mismatch between pre-pandemic controls and individuals without previous SARS-CoV-2 infection in the serosurvey

3.5. No evidence for an additional missing positive control distribution with lower mean

4. Discussion

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgements

Ethics statement

Footnotes

Appendix A. Supplementary data

Data availability

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases