Skip to main content
Wolters Kluwer - PMC COVID-19 Collection logoLink to Wolters Kluwer - PMC COVID-19 Collection
. 2022 Mar 29;33(4):457–464. doi: 10.1097/EDE.0000000000001488

Beyond Vaccination Rates: A Synthetic Random Proxy Metric of Total SARS-CoV-2 Immunity Seroprevalence in the Community

Yajuan Si a,, Leonard Covello b, Siquan Wang c, Theodore Covello d, Andrew Gelman e,f
PMCID: PMC9148633  PMID: 35394966

Background:

Explicit knowledge of total community-level immune seroprevalence is critical to developing policies to mitigate the social and clinical impact of SARS-CoV-2. Publicly available vaccination data are frequently cited as a proxy for population immunity, but this metric ignores the effects of naturally acquired immunity, which varies broadly throughout the country and world. Without broad or random sampling of the population, accurate measurement of persistent immunity post-natural infection is generally unavailable.

Methods:

To enable tracking of both naturally acquired and vaccine-induced immunity, we set up a synthetic random proxy based on routine hospital testing for estimating total immunoglobulin G (IgG) prevalence in the sampled community. Our approach analyzed viral IgG testing data of asymptomatic patients who presented for elective procedures within a hospital system. We applied multilevel regression and poststratification to adjust for demographic and geographic discrepancies between the sample and the community population. We then applied state-based vaccination data to categorize immune status as driven by natural infection or by vaccine.

Results:

We validated the model using verified clinical metrics of viral and symptomatic disease incidence to show the expected biologic correlation of these entities with the timing, rate, and magnitude of seroprevalence. In mid-July 2021, the estimated immunity level was 74% with the administered vaccination rate of 45% in the two counties.

Conclusions:

Our metric improves real-time understanding of immunity to COVID-19 as it evolves and the coordination of policy responses to the disease, toward an inexpensive and easily operational surveillance system that transcends the limits of vaccination datasets alone.

Keywords: Multilevel regression and poststratification, Naturally and vaccine-acquired, Sociodemographic discrepancy, Total immunity seroprevalence


The COVID-19 pandemic has drastically affected everyday life worldwide for nearly 2 years, claiming millions of lives and shuttering businesses. Even as we have made progress with the ongoing implementation of vaccines, new challenges continue to arise. Vaccination numbers remain stubbornly limited and, among some demographics, there is a hardened resistance to participate. Furthermore, reinfection with SARS-CoV-2 and vaccine breakthrough have both been observed, and new potentially antibody-resistant virus variants are likely to continue to evolve.

To effectively develop policies to address an epidemic such as COVID-19, we require explicit knowledge of the level of immunity in any given community or state, and across the nation. Understanding how many people have functional immunity, whether by vaccine or through natural acquisition, allows for a better prediction of how resistant a community may be to severe infection. This information would inform mitigation strategies and help to manage social and business interactions. State vaccination records are followed intensely in the media and scientific community,1 but only tell part of the story. There is an explicit need to comprehend the place of naturally acquired immunity as well.

Though incompletely informative, the presence of antibody immunoglobulin G (IgG) to COVID-19 is an important indicator of immunity within a given population. Assays for IgG both to the spike protein (IgG S) and to the nucleocapsid protein (IgG N) are broadly available commercially. Both antibodies will be present in individuals who have acquired natural immunity, while vaccinated individuals will only have the IgG S antibody, because the genetic material encoding the S protein, and not the virus itself, is what is administered through the various vaccines. As a result, not only can such antibody assays detect immunity, but they can also potentially distinguish between these two forms of immunity. Systematic trending of both forms of seropositivity provides multiple benefits for understanding the presence and evolution of immune status within the community, both naturally and vaccine-acquired.

Ideally, any given population could be randomly sampled serially, and the shifting antibody levels trended. Although such sampling has been done to assess IgG prevalence nationwide at a given moment, this method is cumbersome, expensive, and impractical for tracking levels over time and in various jurisdictions. In our hospital system, we have been tracking the level of IgG N in our community since May 2020 and both IgG N and IgG S since February 2021 using a method that is created to function as a proxy for synthetic random sampling. By serially assaying patients for IgG N as part of their routine presurgical workup, we generate a demographically stable serial sample in high numbers, at low cost, and repeatable over time, and then normalize the demographics of the sample to those of the community at large using multilevel regression and poststratification.2 In doing so, we generate a satisfactory approximation for random sampling, as we are able to validate in the analysis that follows. Our method can be easily adapted to incorporate data from multiple hospital systems to provide robust population inferences. We advocate for the broad adoption of this metric to inform policymakers about immunity levels in their respective jurisdictions and are engaged in building a server-based utility to facilitate the data and result management.

METHODS

All preoperative patients in our three-hospital system (Community Hospital, St. Catherine Hospital, and St. Mary Medical Center) underwent surgical risk evaluation according to accepted American Society of Anesthesiology standards. A subset of these patients was deemed to benefit from preoperative blood tests of whatever variety, as indicated by their age or health characteristics or by the intensity of the planned surgery. Since May 1, 2020, all such patients already needing preoperative blood testing were also tested for the presence of IgG N using a commercially available qualitative test (Roche). The actual threshold titer for this assay was set relatively high to improve specificity; SARS-CoV-2 IgG assays in general suffered from cross-reactivity to the human endemic coronaviruses, whose incidence was quite high throughout the US population.3 At the chosen threshold, such cross-reactivity was virtually eliminated. Therefore, the results were expected to reflect extremely high specificity of identifying actual humoral immunity to COVID-19 (at or near 100% per vendor internal analysis), while sacrificing some sensitivity to lower titers of antibody in any given individual. The data were thus best viewed as a lower bound of IgG N positivity in the sample population.

With the advent of SARS-CoV-2 spike protein (S) mRNA-based vaccines in our community, we recognized the apparent importance of accounting for IgG S and it evolved in our vaccinated population. To that end, as of February 16, 2021, our protocol was modified to obtain both IgG N and IgG S results for the target population. The IgG S test was a semiquantitative ECLIA based assay that quantifies IgG S titers over a 300-fold range above a threshold minimum (Elecsys Anti-SARS-CoV-2 S). Again, the analytical specificities were at or near 100% for this assay.

The University of Michigan Institutional Review Board has determined that the project is exempt from the requirement for Institutional Review Board review and approval.

As in our previous study of SARS-CoV-2 RNA acute incidence,4 we recognize that the sample population differs from the community demographically—it is older, whiter, and sicker—and, therefore, any inference about the community incidence of SARS-CoV-2 IgG will require demographic normalization of the outputs. Multilevel regression and poststratification has an established history of applications in the social science literature for accounting for such demographic decomposition.57 With this goal in mind, our data were subjected to this method according to our previously described statistical construct.4 We rigorously validated that modification statistically in our earlier study, and deemed the output a quality proxy for true random sampling of the incidence of IgG to SARS-CoV-2.

To test the validity of our hypothesis, we need to first understand the clinical behavior of acute SARS-CoV-2 RNA incidence with respect to the timing of asymptomatic and symptomatic COVID-19 and, further, with respect to the timing and magnitude of the rise, plateau, and decay of SARS-CoV-2 IgG prevalence. Fortunately, these characteristics are already well described in the literature (e.g., 8). At the time of our study, peak positivity in SARS-CoV-2 from pharyngeal samples was estimated to occur at a mean of about 5 days post-viral transmission from infected host to new recipient for the variants extant. Moderate to severe symptoms, should they occur, were then expected to arise approximately a week later. An infected patient is then expected to develop IgG to the virus about a week after the onset of symptoms, reaching peak levels a few weeks later. Given these established observations, we employ valid clinical metrics for both positive RNA status and symptomatic disease incidence measured by emergency department (ED) visits and compare these proxies to our multilevel regression and poststratification-modified IgG prevalence data to confirm our expectation of the above biologic behavior.

To that end, we first employed the same model for acute viral RNA incidence from pharyngeal samples and estimated the polymerase chain reaction (PCR) prevalence as that in Covello et al. (2021). The earlier study found that state-based positive case numbers and positivity rates substantially lagged any reasonable estimate of viral incidence.4 Instead, we demonstrated that a synthetic random proxy sampling of asymptomatic individuals gave the earliest and most reliable evidence of RNA positivity.

We compared IgG N (and later, both N and S) positivity data statistically and graphically. IgG N positivity in particular—as a marker of naturally acquired immunity—with trends in ED visits in our community (Lake and Porter Counties, Indiana) and in asymptomatic RNA positivity normalized by multilevel regression and poststratification. We deemed these latter metrics reasonable proxy measures for moderate-to-severe symptomatic COVID-19 and acute viral transmission, respectively, based on the known biology of the virus and our earlier analysis.4,8 We rejected state-based positivity rates, positive case numbers, and hospitalizations as proxies, the former due to its demonstrated poor reliability in our previous study and elsewhere,9 and the latter two due to their demonstrated delay relative to the ED data in our study and on our state website.10

We used a Bayesian approach accounting for unknown sensitivity and specificity11 and applied multilevel regression and poststratification to testing records for population representation, here using the following adjustment variables: biologic sex, age (0–17, 18–34, 35–64, 65–74, and 75+), race (Black, White, and other), and county (Lake and Porter). Multilevel regression and poststratification has two key steps: first, fit a multilevel model for the prevalence with the adjustment variables based on the testing data; next, poststratify using the population distribution of the adjustment variables, yielding prevalence estimates in the target population. The statistical details are provided in the eAppendix; http://links.lww.com/EDE/B918.

RESULTS

From May 1, 2020 to July 12, 2021, our hospital system enrolled 39,952 asymptomatic patients who presented for elective procedures in this study, and a subset of 5,065 qualitative IgG N assays (Roche Elecsys Cobas) were performed. Selection into the study was based merely on the need for preprocedural blood testing. From February 16, 2021 to date, both the above IgG N assay and the semiquantitative IgG S assay (Roche Elecsys Cobas) were performed, adding an additional 2,621 patients.

Table 1 presents the descriptive summary of sociodemographic distributions for asymptomatic patients and the population in the hospital system and the community. The overall prevalence was 0.7% for all PCR tests, 14% for IgG N-only test from May 1, 2020 to July 12, 2021, and 64% for either IgG N or IgG S seropositivity from February 16, 2021 to date. The asymptomatic patients with IgG tests have similar sex and race distributions to those with PCR tests but have a lower proportion of young or early adult patients (younger than 35 years old), the proportions of which are 6.6% for IgG S/N tests, 8.5% for IgG N tests, and 13% for PCR tests. The group with IgG S/N tests included more older adults (75 years and above, 21%), in whom vaccination began earlier than other age groups. As compared to the hospital system patients, asymptomatic patients with PCR/IgG tests tend to be female, middle-aged (35–64) or old (65–74), and White.

TABLE 1.

Summary of Demographic Characteristics as Distributed Among Hospital, Community, and Asymptomatic Cases by Results of Different Tests

Asymptomatic Cases Hospital Cases Community Cases
PCR IgG N IgG S/N
Size 39,952 5,065 2,621 35,838 654,890
Female (%) 59 60 61 57 51
Male (%) 41 40 39 43 49
Age, 0–17 (%) 3 0.5 0.3 8.7 24
Age, 18–34 (%) 10 8 6.3 12 21
Age, 35–64 (%) 46 49 46 30 40
Age, 65–74 (%) 24 25 26 20 9
Age, 75+ (%) 17 17 21 29 6.6
White (%) 72 73 73 65 69
Black (%) 14 13 13 19 19
Other (%) 14 14 14 16 12
Lake (%) 84 91 92 88 74
Porter (%) 16 9 7.7 12 26

Given that IgG testing was only performed on those patients who would benefit from preoperative blood tests, the above skews are not surprising. For this reason, neither the hospital patients nor the asymptomatic patients serve as a precise representation of the community population, in particular with an under-coverage for young, male, and non-White residents. As it happens, these differences are not large; nonetheless, they are potential sources of error if not accounted for in our statistical model, and can also interfere with estimates of trends if the demographic breakdown of hospital patients varies over time. Furthermore, the county representation is unbalanced. Some patients are from south Cook County, Illinois, and are grouped into the Lake County as a proxy. Fortunately for our analysis, these contiguous communities have similar socioeconomic and ethnic demographics.

We then modified the resulting outputs according to the previously described multilevel regression and poststratification-based demographic normalization. We subjected the results to the chosen clinical validation metrics. In Figure 1, we present three separate measures of viral behavior in our community. Figure 1A shows the multilevel regression and poststratification-modified asymptomatic rates of viral RNA incidence in our sample. As a measure of clinical burden, we chose to follow trends in SARS-CoV-2 related ED visits for our catchment area, Lake and Porter Counties, Indiana. These data are presented graphically in Figure 1B. We then applied these established clinical metrics to assess the validity of our multilevel regression and poststratification-modified IgG prevalence data, presented in parallel below (Figure 1C).

FIGURE 1.

FIGURE 1.

Combined estimated (A) polymerase chain reaction (PCR) prevalence in the hospital system and community-based on asymptomatic patients, (B) COVID-related emergency department (ED) visits in Lake and Porter counties, and (C) estimated immunoglobulin G (IgG) to the nucleocapsid protein (IgG N) prevalence till February 15, 2021. The error bars represent 1 SD of uncertainty. The positions of the months on the x-axis correspond to the week of data containing the first of that month.

In northwest Indiana, there was a substantial outbreak of COVID-19 in March and April 2020, filling hospital intensive care unit beds and taxing the ventilator supply. As this study was initiated upon reopening of elective surgical services, May 1, 2020, the data presented here necessarily missed this trend clinically, but should be reflected in the development of a nonzero SARS-CoV-2 IgG prevalence. Indeed, Figure 1C shows that our multilevel regression and poststratification-modified IgG prevalence hovered above 10% in May. We did not have an effective way to test the validity of this number, given the unreliable RNA testing availability at that time, but the result did not seem prima facie unreasonable to our group.

Following the progress of COVID-19 through the summer, it is evident from COVID-19 specific ED visits and multilevel regression and poststratification-modified asymptomatic RNA incidence data (Figure 1A and B) that there was a very low prevalence of SARS-CoV-2 infection and, therefore, very little anticipated accretion of IgG positivity during that interval. In fact, one observed an exceptionally slow decrease in IgG prevalence from May to August, eventually finding its nadir below 10% at the end of that period (Figure 1C). We believe that the very slow fade of IgG positivity during that interval reflects a slow decay of seropositivity beyond the minimal accretion of newly seropositive patients. As the acute infection data shows that new incidence is quite low, the decay in seropositivity is also quite slow, albeit modestly quicker than the accretion during this interval. The findings of those cohort studies served as a substantial reassurance of its validity. We find more supporting evidence for validity during the fall surge. In the study area, both multilevel regression and poststratification-modified asymptomatic RNA positivity and COVID-19 specific ED visits showed some evidence of an increase in trough numbers in mid-September. By early October, it had become apparent that a surge was evolving. A comparison of the peak in the RNA positivity or the number of ED visits to the slope and peak of the multilevel regression and poststratification IgG prevalence is revealing: We note a peak in both ED visits and RNA positivity in early November (Figure 1A and B). Peak incidence of acute infection should correlate with peak accretion of new IgG seroconversions; that is, the maximum incidence of viral infection seen in Figure 1A and B should correlate with the maximum rate of increase of IgG seropositivity in Figure 1C. From the figures, we find that the maximum incidence of the virus occurred in early November and the maximum slope of the IgG curve occurred in mid-November. This finding is consistent with the biologic prediction that this largest cohort of viral positive patients will show maximum seroconversion a few weeks later,12 adding themselves at the highest rate to the IgG prevalence predicted in Figure 1C.

From early February to the conclusion of the study interval, ED visits remained near their summer 2020 lows. These findings are consistent with a host of studies that identify persistence of 6 months or longer of IgG positivity in individuals, albeit at lower titers.13,14 In the current population-based study, using a qualitative assay with a lower bound threshold, one would expect a prolonged loss of positivity in the assay as patients whose titers were at or near threshold fell into the negative range.

The multilevel regression and poststratification IgG prevalence data serve to validate those cohort study findings of prolonged positivity at gradually lower titers, but using a population sample method instead. The coherence of the current study’s findings with those cohort studies is a significant reassurance of its validity. Accordingly, our multilevel regression and poststratification-modified IgG N prevalence curve approximates a plateau from mid-February through June, as can be seen in Figure 1C.

After we consider both IgG S and IgG N test results, as shown in Figure 2, the multilevel regression and poststratification-modified IgG curve increases rapidly and remains relatively high through time. Our estimates are consistent with the findings in Ref. 15 based on 1,443,519 blood donation specimens from a catchment area representing 74% of the US population. The ongoing vaccination process certainly partially explains such an increase and we should be aware that it also suggests the spread of the virus is still not slowing down much, which is also reflected in Table 2.

FIGURE 2.

FIGURE 2.

Estimated combined immunoglobulin G (IgG) to the nucleocapsid protein (IgG N) and the spike protein (IgG S) prevalence in the hospital system and the community based on asymptomatic patients after February 15, 2021. The error bars represent ±1 SD of uncertainty. The positions of the months on the x-axis correspond to the week of data containing the first of that month.

TABLE 2.

Comparison Between Immunoglobulin G (IgG) to Either the Nucleocapsid or Spike Protein (IgG N/S) Positivity Rates (Adjusted by the Multilevel Regression and Poststratification) and Administered Vaccination Rates in the Community (Lake and Porter Counties, Indiana) During the Week of July 6–12, 2021

IgG N/S Positivity Rate (Standard Error) Vaccination Rates (First/Single-dose Administered)
Overall 0.74 (0.066) 0.45
Sex
 Female 0.74 (0.065) 0.47
 Male 0.74 (0.068) 0.41
Race
 Black 0.71 (0.076) 0.31
 White 0.74 (0.064) 0.60
 Other 0.74 (0.071) 0.57
Age
 0–17 0.76 (0.108) 0.14
 18–39 0.59 (0.095) 0.30
 40–64 0.75 (0.068) 0.54
 65–75 0.85 (0.052) 0.90
 75+ 0.88 (0.048) 0.81

Table 2 compares the estimated immunity levels, either IgG S or N positive (74%), with the administered vaccination rates in the two counties (45%). Levels of immunity are up in the 59%–88% range across all demographic groups, whereas the vaccination rates are highly variable. Black and younger individuals have low vaccination rates, especially in the lowest age category. This observation indicates that Black and youth have acquired substantial immunity through natural exposure to offset their lower vaccination rates. The comparison assumes that the officially released records approximate the vaccination rates of the demographically adjusted groups. Currently, we are in the process of attempting to cross-reference with respect to the identified vaccination status. Our analysis of the IgG N test results before February 16, 2021 showed that younger age groups had higher tendency to acquire positivity than the elder, but the tendency became lower to have either IgG S or N positive after February 16, 2021.

Throughout the study, we have used IgG N seroprevalence as an estimate for naturally acquired immunity. Once we incorporated IgG S alongside IgG N testing, evidence began to mount that IgG N seropositivity was likely an underestimate of true naturally acquired immunity. A comparison of Table 2 total seropositivity (IgG N or IgG S) with vaccination rates suggests a true natural immunity rate of around 30% in early July (total IgG N/S prevalence minus vaccinated prevalence = 29% overall) whereas IgG N positive patients for that interval hover around 5%. As with summer 2020, it was likely that slowly fading IgG N seropositivity during spring 2021 was matched by low-level accretion of seropositivity from new infection, hence a roughly flat IgG N curve in both intervals (June to October 2020 in Figure 1C and February to May 2021 in Figure 3). Figure 3 shows the estimated IgG N seropositive and IgG S seronegative phenotype prevalence in the hospital system and the community based on asymptomatic patients after February 15, 2021. This finding suggests that around 25% of the population is IgG N negative, but IgG S positive from natural infection. Such patients may have IgG N antibodies below the detection rate of the assay. In any event, for the purposes of this analysis, IgG N positivity should be regarded as merely the lower bound of true naturally acquired immunity, though the true threshold values for functional immunity are not yet defined.

FIGURE 3.

FIGURE 3.

Estimated combined immunoglobulin N (IgG N) seropositive and immunoglobulin S (IgG S) seronegative phenotype prevalence in the hospital system and the community based on asymptomatic patients after February 15, 2021. The error bars represent ±1 SD of uncertainty. The positions of the months on the x-axis correspond to the week of data containing the first of that month.

DISCUSSION

We analyzed viral IgG testing data of asymptomatic patients who presented for elective procedures within a hospital system and applied multilevel regression and poststratification to adjust for demographic and geographic discrepancies between the sample and the community population. The innovation of our approach lies in the analysis of a demographically stable sample, the development of a pragmatic proxy for random studies measuring population immunity, and the multilevel regression and poststratification adjustment of demographic discrepancies. In our study, we set up a synthetic random proxy for estimating IgG prevalence in the sampled community. Our model is based on a demographically stable population sample. By applying this adjustment to our results, we used an accepted statistical modification to correct the otherwise stable demographic skew. We have validated the model using reasonable clinical metrics of viral and symptomatic disease incidence to show the expected biologic correlation of these entities with the timing, rate, and magnitude of seroprevalence. In short, the model is able to empower routine hospital testing to measure IgG seroprevalence in the community. We acknowledge that our sampling method will skew slightly in the direction of less healthy individuals than the population as a whole in that it selects for those undergoing outpatient procedures and qualifying for preoperative blood testing.

Of course, IgG positivity is only a single element of overall immunity. Cellular immunity is also clearly crucial, and there is indeed some evidence that humoral and cellular immunity may not fully correlate.16 Nonetheless, although strong humoral immunity may not be required for an effective immune response, it is likely that measurable IgG levels are reasonable evidence of some functional immunity; indeed, high circulating levels of IgG may be the most potent suppressor of disease during a surge in viral transmission. In the event, it is clear that reinfection and vaccine breakthrough do occur, but that preexisting naturally or vaccine-acquired immunity substantially decrease the incidence and severity of the disease.17 Thus far, this observation remains valid for current variants of concern as well. In the absence of an easily acquired global view of the immune response within the population, monitoring IgG positivity seems a good way to achieve a sense of that underlying immunity in a policy-actionable sense. As new variants arise and we acquire ongoing experience with their ability to evade naturally and vaccine-acquired humoral immunity, these data will nonetheless continue to be of crucial importance.

Understandably, media and popular scientific conversation on immunity acquisition has been dominated by commentary on the uneven implementation of the vaccine rollout, but this emphasis on vaccine-mediated immunity ignores the important humoral immunity demonstrable from the previous infection, as has followed the spring and fall surges in our community and elsewhere. In our data, it is probable that, as of mid-February 2021, 30% or so of our community at large has retained naturally acquired antibodies to the SARS-CoV-2 nucleocapsid and spike proteins above detection threshold levels, and while the mere presence of such antibodies offered no guarantee of true immunity to second infection within this population, it seems likely that this group largely avoided severe COVID-19 during the delta outbreak, as long as they remained IgG positive. Superimposed vaccination-induced positivity from February onward complicates our ability to discern what percentage of the population may be relying solely on this naturally-conferred immunity persisting. As of the close of the study in July, the presented comparison of publicly available vaccination rates with our normalized measures of IgG seropositivity show that approximately 30% of the population still has assay detectable antibodies obtained by natural infection. Moreover, multiple cohort studies imply that naturally acquired humoral immunity persists for 6 months or more, so the contribution of this nonvaccinated seropositive group to the total immunity of the community is both significant in magnitude and prolonged in duration. Indeed, delta variant outbreaks in the southern states probably reflected not merely low vaccination rates, but also relatively low prior experience of SARS-CoV-2 infection, in Missouri and Arkansas most notably.

The more recent omicron outbreak presents a distinct challenge to the model. As of January 2022, it is becoming increasingly clear that naturally acquired immunity, as well as limited vaccinated immunity, provides less of a barrier to infection and even serious illness than in prior outbreaks. We, therefore, should have less confidence that the immunity identified in our study is reliable as an indicator of community resistance to infection. Indeed, our metric will encounter challenges imposed by the limitations of current testing.

As we develop increasing knowledge of the relative durability and effectiveness of naturally acquired, vaccine-induced, and combined exposure seropositive status, the power of tracking antibodies arising from natural infection becomes even more fundamental to understanding the total immunity of an identified population to serious infection or death. Combining our metric with state-based data on vaccine administration will allow us a true picture of that overall immunity, informing decisions about opening or closing economic and social commerce within a jurisdiction with more confidence and durability. As we continue to incorporate IgG testing for the spike protein receptor-binding domain into our ongoing protocol, the model will be able to follow vaccine-acquired immunity as well, avoiding cumbersome additional analysis of state vaccine data, itself often not easily available for analysis due to confidentiality concerns.

Our protocol for serial proxy random sampling of presurgical patients is inexpensive and can be seamlessly integrated into routine preoperative testing in any US hospital. It yields demonstrably valid measures of humoral immunity. Making use of robust demographic data within both hospital electronic health records and publicly available census data, this method of sampling allows for a deeper investigation of demographic distributions of immunity and can easily aid in identifying vulnerable populations for targeted vaccine campaigns.

Supplementary Material

ede-33-457-s001.docx (17.7KB, docx)

Footnotes

This study was supported by the National Science Foundation and National Institutes of Health.

Reproducibility: Codes are publicly available at https://github.com/yajuansi-sophie/covid19-mrp. The data are confidential and cannot be released to the public.

The authors report no conflicts of interest.

Supplemental digital content is available through direct URL citations in the HTML and PDF versions of this article (www.epidem.com).

REFERENCES

  • 1.Regenstrief Institute. State of Indiana COVID-19 Data. COVID Dashboard. Available at: https://www.regenstrief.org/data-downloads/. Accessed October 26, 2021.
  • 2.Gelman A, Little T. Poststratification into many categories using hierarchical logistic regression. Surv Methodol. 1997;23:127–135. [Google Scholar]
  • 3.Becker M, Strengert M, Junker D, et al. Exploring beyond clinical routine SARS-CoV-2 serology using MultiCoV-Ab to evaluate endemic coronavirus cross-reactivity. Nat Commun. 2021;12:1152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Covello L, Gelman A, Si Y, Wang S. Routine hospital-based SARS-CoV-2 testing outperforms state-based data in predicting clinical burden. Epidemiology. 2021;32:792–799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Si Y, Trangucci R, Gabry J, Gelman A. Bayesian hierarchical weighting adjustment and survey inference. Surv Methodol. 2020;46:181–214. [Google Scholar]
  • 6.Downes M, Gurrin LC, English DR, et al. Multilevel regression and poststratification: a modeling approach to estimating population quantities from highly selected survey samples. Am J Epidemiol. 2018;187:1780–1790. [DOI] [PubMed] [Google Scholar]
  • 7.Zhang X, Holt JB, Lu H, et al. Multilevel regression and poststratification for small-area estimation of population health outcomes: a case study of chronic obstructive pulmonary disease prevalence using the behavioral risk factor surveillance system. Am J Epidemiol. 2014;179:1025–1033. [DOI] [PubMed] [Google Scholar]
  • 8.Lauer SA, Grantz KH, Bi Q, et al. The incubation period of Coronavirus Disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application. Ann Intern Med. 2020;172:577–582. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Kissane E, Malaty-Rivera J. Test positivity in the US is a mess. The COVID Tracking Project at The Atlantic. Available at: https://covidtracking.com/analysis-updates/test-positivity-in-the-us-is-a-mess. Accessed October 13, 2020.
  • 10.Indiana State Department of Health. COVID-19 region-wide test, case, and death trends. https://hub.mph.in.gov/dataset/covid-19-region-wide-test-case-and-death-trends. Accessed December 21, 2020.
  • 11.Gelman A, Carpenter B. Bayesian analysis of tests with unknown specificity and sensitivity. J R Stat Soc Ser C Appl Stat. 2020;69:1269–1283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Iyer AS, Jones FK, Nodoushani A, et al. Persistence and decay of human antibody responses to the receptor binding domain of SARS-CoV-2 spike protein in COVID-19 patients. Sci Immunol. 2020;5:eabe0367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Duysburgh E, Mortgat L, Barbezange C, et al. Persistence of IgG response to SARS-CoV-2. Lancet Infect Dis. 2021;21:163–164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Glück V, Grobecker S, Tydykov L, et al. SARS-CoV-2-directed antibodies persist for more than six months in a cohort with mild to moderate COVID-19. Infection. 2021;49:739–746. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Jones JM, Stone M, Sulaeman H, et al. Estimated US infection- and vaccine-induced SARS-CoV-2 seroprevalence based on blood donations, July 2020-May 2021. JAMA. 2021;326:1400–1409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Amanna IJ, Slifka MK. Contributions of humoral and cellular immunity to vaccine-induced protection in humans. Virology. 2011;411:206–215. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Fontanet A, Cauchemez S. COVID-19 herd immunity: where are we? Nat Rev Immunol. 2020;20:583–584. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ede-33-457-s001.docx (17.7KB, docx)

Articles from Epidemiology (Cambridge, Mass.) are provided here courtesy of Wolters Kluwer Health

RESOURCES