Skip to main content
Wellcome Open Research logoLink to Wellcome Open Research
. 2021 Sep 3;6:220. [Version 1] doi: 10.12688/wellcomeopenres.17143.1

Risk factors for SARS-CoV-2 seroprevalence following the first pandemic wave in UK healthcare workers in a large NHS Foundation Trust

Hayley Colton 1,2,a,#, David Hodgson 3,b,#, Hailey Hornsby 2,#, Rebecca Brown 2, Joanne Mckenzie 2, Kirsty L Bradley 2, Cameron James 2, Benjamin B Lindsey 1,2, Sarah Birch 4, Louise Marsh 4, Steven Wood 5, Martin Bayley 5, Gary Dickson 5, David C James 6, Martin J Nicklin 2, Jon R Sayers 2,7,8, Domen Zafred 2, Sarah L Rowland-Jones 1,2,7, Goura Kudesia 9, Adam Kucharski 3; CMMID COVID-19 Working Group, Thomas C Darton 1,2,7, Thushan I de Silva 1,2,7, Paul J Collini 1,2,7
PMCID: PMC9091808  PMID: 35600250

Abstract

Background: We aimed to measure SARS-CoV-2 seroprevalence in a cohort of healthcare workers (HCWs) during the first UK wave of the COVID-19 pandemic, explore risk factors associated with infection, and investigate the impact of antibody titres on assay sensitivity.

Methods: HCWs at Sheffield Teaching Hospitals NHS Foundation Trust were prospectively enrolled and sampled at two time points. SARS-CoV-2 antibodies were tested using an in-house assay for IgG and IgA reactivity against Spike and Nucleoprotein (sensitivity 99·47%, specificity 99·56%). Data were analysed using three statistical models: a seroprevalence model, an antibody kinetics model, and a heterogeneous sensitivity model.

Results: As of 12th June 2020, 24·4% (n=311/1275) of HCWs were seropositive. Of these, 39·2% (n=122/311) were asymptomatic. The highest adjusted seroprevalence was measured in HCWs on the Acute Medical Unit (41·1%, 95% CrI 30·0–52·9) and in Physiotherapists and Occupational Therapists (39·2%, 95% CrI 24·4–56·5). Older age groups showed overall higher median antibody titres. Further modelling suggests that, for a serological assay with an overall sensitivity of 80%, antibody titres may be markedly affected by differences in age, with sensitivity estimates of 89% in those over 60 years but 61% in those ≤30 years.

Conclusions:  HCWs in acute medical units working closely with COVID-19 patients were at highest risk of infection, though whether these are infections acquired from patients or other staff is unknown. Current serological assays may underestimate seroprevalence in younger age groups if validated using sera from older and/or more symptomatic individuals.

Keywords: Seroprevalence; antibody; Healthcare Worker; SARS-CoV-2; COVID; modelling; age; risk

Introduction

Healthcare workers (HCWs) are at increased risk of COVID-19 1, 2 . The true number of HCWs exposed to SARS-CoV-2 to-date is unknown, particularly during the early stages of the pandemic. Initial methods to estimate HCW exposure included extrapolation from work absenteeism rates, and are unlikely to be reliable 3 . Confirmation by molecular testing increased the accuracy of case detection, although access to nucleic acid amplification testing (NAAT) was limited during the early stages of the pandemic in the UK 4 . Detection of exposure by antibody seroconversion may provide a more accurate estimate of risk in HCW populations, can be performed at large scale, and is less affected by symptom-activated testing pathways 58 .

To enable the accurate interpretation of seroprevalence readouts, detailed characterisation of antibody evolution relative to the sampling time-frame, immunoglobulin isotype, antigenic target and assay performance is required 914 . Many antibody assays have been evaluated using samples from hospitalised patients; it is unclear how these assays perform with the lower antibody levels found in those with more mild or asymptomatic SARS-CoV-2 infection 10, 12 . Furthermore, while antibody levels to some coronaviruses are higher in older individuals, it is unclear whether this results from a higher risk or exposure to the virus, greater antigenic load or boosting of antibodies from previous seasonal coronavirus infections 1519 . This may also lead to age-specific differences in antibody assay sensitivity, which could be a significant confounder in population seroprevalence studies.

In this study we measure SARS-CoV-2 seroprevalence in HCWs at Sheffield Teaching Hospitals NHS Foundation Trust (STH), following the first wave of the pandemic in the UK. We model the kinetics of the evolving antibody response, the impact of age on assay sensitivity and explore risk factors associated with infection.

Methods

Background and setting

STH offers secondary- and tertiary-level care across four sites in South Yorkshire, UK, and has 1,669 inpatient beds and 18,500 employees 20 . The first patient with confirmed COVID-19 was admitted to STH on 23 February 2020; the first wave of the UK pandemic occurred between March 2020 and June 2020.

Testing of symptomatic staff for SARS-CoV-2 by NAAT was introduced on 17 March 2020. On the same day, Public Health England (PHE) de-escalated the recommendations for the personal protective equipment (PPE) required by HCWs caring for inpatients with suspected or confirmed COVID-19 from ‘Level 3 Airborne’ to ‘Level 2 Droplet’ for routine care 21 . Subsequently, the requirement for universal ‘Level 2 Droplet’ PPE for all inpatient and outpatient care began on 08 April 2020. Local STH policy was changed on 15 June 2020 to mandate staff use surgical face masks while on hospital premises.

Recruitment and consent

From 13–18 May 2020, all contactable STH staff (n=17,757) were invited to take part in the COVID-19 Humoral ImmunE RespOnses in front-line HCWs (HERO) study by email and intranet alert. To engage staff in areas with limited communications access, additional recruitment posters and face-to-face enrolment sessions were used.

Following an electronic informed consent process, participants provided self-reported data on-line on age, gender, ethnicity, job role, and pandemic working environment (‘COVID-19 zones’) 21 . Details of any possible or confirmed prior COVID-19 illnesses occurring since 01 February 2020 was also collected. These were categorised as: i), diagnosed with COVID-19 and confirmed by NAAT, ii), clinically diagnosed with COVID-19 but NAAT not performed, and iii), self-reported symptoms only 21 . Together, we defined these three groups as “symptomatic”, as asymptomatic testing was only introduced after the study recruitment period. Those reporting no illness between 01 February 2020 to the date of recruitment were defined as “asymptomatic”. All that had enrolled were emailed times of phlebotomy appointments, and were invited to attend on a first come first served basis for the first visit, and then invited by email to book a specific appointment slot to attend for their second visit after four weeks +/- 7 days. An 8.5ml serum sample was taken at each visit to outpatient phlebotomy services for serological testing.

SARS-CoV-2 serology

Serum samples were tested for IgG and IgA reactivity to two SARS-CoV-2 proteins using an in-house ELISA: the full-length extracellular domain (amino acids 14-1213) of Spike glycoprotein, including a replacement of the furin cleavage site R684-R689 by a single alanine residue and replacement of K986-V987 by PP, produced in mammalian cells; and full-length untagged Nucleocapsid protein (NCP) produced in E. coli ( Uniprot ID P0DTC9 (NCAP_SARS2)) . 2224 . High binding microtitre plates (Immulon 4HBX; Thermo Scientific, 6405) were coated overnight with proteins diluted in phosphate buffered saline, washed with 0·05% PBS-Tween, and blocked for one hour with 200 µL/well casein buffer. Following optimisation, sample dilutions used were 1:200 for the IgG assay or 1:100 for the IgA assay 21 . Plates were emptied and 100 µL/well of sample or control loaded. After two hours incubation, plates were washed and loaded with goat anti-human IgG-HRP conjugate (Invitrogen, 62-8420) at 1:500, or goat anti-human IgA-HRP conjugate (Invitrogen, 11594230) at 1:1000, for one hour. Plates were washed and developed for 10 minutes with 100 µL/well TMB substrate (KPL, 5120-0074). Development was stopped with 100 µL/well HCl Stop solution (KPL, 5150-0021), and absorbance read at 450nm. All steps were performed at room temperature.

A calibration curve of sera pooled from convalescent SARS-CoV-2 NAAT-confirmed patients with high antibody titres for both spike and NCP was included on plates to allow quantification of antibody concentrations. The calibration curve was generated by serially diluting in 1·75× steps from a starting concentration of 1:200 for the IgG assay or 1:100 for the serum IgA assay. When the WHO International Standard for anti-SARS-CoV-2 immunoglobulin (NIBSC, 20/136) later became available, the calibration curve was run in parallel for the IgG assay 21 . Data for the IgG assay are therefore given in WHO antibody units, whereas IgA assay data are given in arbitrary antibody units.

Sample size

To meet the primary objective of measuring the SARS-CoV-2 IgG seroprevalence, we calculated a sample size of 1,000 HCWs would provide +/-1·4% precision based on a seroprevalence estimate that ~4% of the UK population may have been infected by April 2020, with a two-sided 95% confidence interval (with n=753, Binomial exact 95%CI has been estimated to be 2·7-5·6%) 25 .

Statistical modelling

We considered three statistical models, i) a seroprevalence model, ii) an antibody kinetics model, and iii) a heterogeneous sensitivity model. For the seroprevalence model, we used the serostatus of all participants at first blood draw in a sensitivity- and specificity-adjusted Bayesian multilevel logistic regression model. Using seropositivity as the binary response variable, we considered three different model subtypes with varying primary exposures; job location, contact with COVID-19 patients, and job type 21 . In addition, we fitted a symptomatic prevalence model, where the data used were seropositive persons only, and the binary response variable was asymptomatic or symptomatic infection.

For the antibody kinetics model, we included samples from seropositive individuals in a Bayesian multilevel linear regression model in two parts: i) using log2 antibody units (logAU) at the first blood draw as the response variable and ii) using the change in antibody titre at the follow up bleed (median 28 days) as the response variable. Age, ethnicity, gender and symptom severity (asymptomatic or symptomatic) were used as covariates and each model was run separately for four different antibody-antigen combinations; Spike-IgG, NCP-IgG, Spike-IgA, NCP-IgA. The time until seroreversion was calculated for each covariate group and antibody-antigen interactions by i) sampling a starting titre value and a rate of decline from the two models, and then ii) calculating the time until the minimum observed antibody value was reached for that antibody-antigen interaction, assuming a continuous rate of decrease.

In our heterogeneous sensitivity and specificity model, we explored how estimates derived from our assay validation dataset generalise to covariate groups, e.g. participant age. To model the generalisability of these performance measures, we compared the seropositivity classification of our study dataset using our in-house antibody assay, with the predicted seropositivity classification from hypothetical assays with a quoted sensitivity and specificity. Our model considers the different distribution of the A 450 values in the assay validation and HERO study datasets to model how reliably quoted performance measures generalise. Using the assay validation dataset, we estimated the A 450 cut-off value for a range of chosen sensitivity values, and then used this A 450 cut-off to classify seropositivity in the study dataset. We then estimated the implied sensitivity on the HERO dataset by comparing seropositivity classification based on the estimated A 450 cut-off value, with the seropositivity classification from our in-house assay (which for ease of comparison, we assume represents the maximum possible sensitivity and specificity (i.e. 100%) in this model). This framework allowed us to estimate the hypothetical performance of serological assays reported in the literature on our HERO dataset, along with co-variate specific sensitivity.

All analysis was performed in R version 4.0.2 and cmdstanr version 0.2.0. An R package containing all the analysis in this study is available at https://doi.org/10.5281/zenodo.5215671.

Regulatory review

Following internal scientific review, local R&D (5 May 2020 ref: STH21394) and HRA and Health and Care Research Wales (HCRW) approval were given (29 April 2020 ref: 20/HRA/2180, IRAS ID: 283461). Anonymised serum samples from hospitalised COVID-19 patients and serum collected prior to 2017 during routine clinical care were used for assay validation with approval from STH R&D office.

Results

Serology assay development

The assay validation dataset 26 . consisted of serum from 190 SARS-CoV-2 NAAT-confirmed cases (52 hospitalised patients and 138 healthcare workers with mild infections sampled between 14 and 120 days from NAAT positivity), and 675 patients sampled prior to 2017 ( Extended data: Table S1). Thresholds for defining reactivity to spike ( A 450 0·1750) or NCP ( A 450 0·1905) were set to optimise the sensitivity of each assay. Given the IDSA guidance for ensuring a specificity of ≥99·5% in assays used for SARS-CoV-2 seroprevalence studies, specificity was enhanced by defining a SARS-CoV-2 seropositive sample as one where both spike and NCP were reactive 27 . This resulted in a sensitivity of 99·47% (95% confidence interval (CI) 97·10% - 99·99%) and specificity of 99·56% (95% CI 98·71% - 99·91%) for our IgG assay ( Extended data: Figure S1). Rapid waning of IgA responses following SARS-CoV-2 infection complicated defining positive and negative samples based on the convalescent sera we used for assay validation. We also observed more background reactivity for IgA compared to IgG in pre-pandemic samples. We therefore opted to use our spike and NCP IgA ELISA purely for comparing IgA levels in individuals classified as seropositive in our IgG assay ( Extended data: Figure S2). Antibody units at each given dilution of the calibration curve are shown in Table S2 ( Extended data).

Registration and study visits

1478 STH staff consented to take part between 13 May and 5 June 2020 ( Extended data: Figure S3). Of these, 1277 attended for a first visit (V1) between 15 May 2020 and 12 June 2020. As two samples were contaminated in transit, we obtained a valid serostatus for 1275 samples. 1174 attended for a second visit (V2) between 15 June and 10 July 2020 ( Extended data: Figures S3 and S4).

Demographics, job role, work locations, and environment

The majority of participants were female (n=1008/1275, 79·1%) and most described their ethnicity as white (n=1130/1275, 88·6%, Table 1). Nurses (433/1275, 33·9%), doctors (232/1275, 18·2%), health care assistants (163/1275, 12·8%), and domestic services staff (136/1275, 10·7%) constituted the largest proportion of job roles. Almost half (593/1275, 46·5%) of HCWs worked in parts of the hospital managing acute COVID-19 admissions including the Emergency Department (ED), Acute Medical Unit (AMU), Critical Care, and inpatient medical wards (respiratory, geriatric care, infectious diseases). Participants reported working in areas with the highest level of COVID-19 patient contact (red zones), either most days (n=423/1275, 33·2%) or occasionally (n=305/1275, 23·9%).

Table 1. Characteristics and serostatus of recruited participants who had a valid baseline result.

Recruited
with an
initial valid
serostatus
Seropositive
at V1 (%)
Asymptomatic
(% of
seropositive at
V1)
Completed
both V1 and
V2 (% of
recruited)
Seroincident
cases (% of
seronegative
at V1)
Total 1275 311 (24·4) 122 (39·2) 1166 (87·5) 16 (1·7%)
Job location
Emergency
Department (ED)
103 26 (25·2) 13 (50·0) 90 (87·4) 1 (1·2)
Acute Medical Unit
(AMU)
83 38 (45·8) 17 (44·4) 66 (79·5) 0 (0·0)
Critical Care 100 18 (18·0) 7 (38·9) 95 (95·0) 0 (0·0)
Geriatric Care 23 3 (13·0) 1 (33·3) 22 (95·7) 1 (5·0)
Infectious Disease
Ward
139 26 (18·7) 11 (42·3) 121 (87·1) 7 (6·2)
Other 664 157 (23·6) 56 (35·7) 621 (93·5) 2 (0·3)
Respiratory Geriatric
Ward
92 27 (29·3) 10 (37·0) 85 (92·4) 2 (3·1)
Respiratory Ward 58 13 (22·4) 5 (38·5) 54 (93·1) 2 (4·4)
Job role
Admin 127 26 (20·5) 12 (46·2) 118 (92·9) 1 (0·9)
Allied medical 1 38 0 (0·0) 37 (97·4) 0 (0·0)
Domestic services 136 39 (28·7) 24 (61·5) 127 (93·4) 4 (4·1)
Healthcare
assistants
163 39 (23·9) 21 (53·8) 140 (85·9) 3 (2·4)
Doctors 232 52 (22·4) 18 (34·6) 211 (90·9) 0 (0·0)
Nurses 433 116 (26·7) 34 (29·3) 391 (90·3) 7 (2·2)
Other 31 5 (16·1) 2 (40·0) 29 (93·5) 0 (0·0)
Pharmacists 35 8 (22·8) 5 (62·5) 33 (94·3) 1 (3·7)
Occupational and
physiotherapists
33 15 (45·5) 3 (20·0) 33 (100·0) 0 (0·0)
Radiographers 42 9 (21·4) 2 (22·2) 42 (100·0) 0 (0·0)
COVID-19 zone 2
1 (lowest COVID-19
contact)
104 22 (21·2) 10 (54·5) 96 (92·3) 1 (1·2)
2 248 50 (20·2) 27 (46·0) 232 (93·5) 0 (0·0)
3 41 7 (17·1) 6 (14·3) 39 (95·1) 1 (2·9)
4 153 35 (22·9) 24 (31·4) 142 (92·8) 1 (0·8)
5 305 69 (22·6) 46 (33·3) 280 (91·8) 3 (1·3)
6 (highest COVID-19
contact)
423 128 (30·3) 76 (40·6) 376 (88·9) 10 (3·4)
Age (years)
<30 267 67 (25·1) 32 (47·8) 236 (88·4) 6 (3·0)
30–39 306 69 (22·5) 22 (31·8) 279 (91·2) 5 (2·1)
40–49 314 72 (22·9) 29 (40·2) 293 (93·3) 2 (0·8)
50–59 314 76 (24·2) 28 (36·8) 288 (91·7) 1 (0·4)
60+ 74 27 (36·5) 11 (40·7) 70 (94·6) 2 (4·3)
Ethnicity
White 1130 281 (24·9) 108 (38·4) 1035 (91·6) 15 (1·8)
Black/Black British 33 6 (18·2) 3 (50·0) 30 (90·9) 0 (0·0)
Asian/Asian British 76 17 (22·4) 7 (41·2) 70 (92·1) 1 (1·7)
Other 33 7 (21·2) 4 (57·1) 30 (90·9) 0 (0·0)
Gender 3
Female 1008 253 (24·1) 105 (41·5) 922 (91·5) 14 (1·9)
Male 265 58 (21·9) 17 (29·3) 242 (91·3) 2 (0·9)

1Allied Medical includes Speech and Language Therapists, Cardiac Physiologists, Dental Hygienists, Dietitians, ECG technicians, Orthotists, Podiatrists, Rehabilitation assistants

2COVID-19 Zones are defined in extended data 21 .

3Participants were able to define their gender as non-binary, transgender or could choose not to disclose

Unadjusted seroprevalence

Analysis of V1 samples revealed that 24·4% (n=311/1275) of HCWs were seropositive by 12 June 2020 ( Table 1). Of these, 39·2% (n=122/311) did not report a prior illness consistent with COVID-19. The second blood draw occurred a median of 28 days following the first visit (IQR 27-31) and 1166 had a valid V2 serology result ( Extended data: Figure S3). Comparison of serology data from both visits demonstrated that 16 out of 964 participants had seroconverted and 9 out of 311 participants seroreverted (i.e. loss of reactivity against either spike ( A 450 <0·175) or NCP ( A 450 <0·1905)).

Adjusted seroprevalence estimates

The overall adjusted seroprevalence in the cohort was 23·1% (95% CrI 14·1–33·3), but varied across job type, job location and COVID-19 zone ( Figure 1; Extended data: Tables S3 to S7). A relatively high seroprevalence was seen in occupational and physiotherapists (39·2%, 95% CrI 24·4–56·5), and low seroprevalence in allied medical staff (9·2%, 95% CrI 1·4–21·3). Between wards, there was higher seroprevalence in the AMU (41·1%, 95% CrI 30·0–52·9) compared to other wards. Across COVID-19 zones, working in the areas with the highest degree of COVID-19 patient contact (zone 6) was associated with a slightly higher seroprevalence of 28·6% (95% CrI 24·0–33·5) compared to the other five groups ( Figure 1). The adjusted proportion of asymptomatic cases was 38·9% (95% CrI 23·6–57·3) ( Extended data: Figure S5). The proportion of asymptomatic cases remained relatively consistent across all covariates except for job type, where it ranged from 21·4% for occupational and physiotherapists up to 61·5% for domestic services staff ( Extended data: Figure S5).

Figure 1. Model-predicted seroprevalence estimates for three different models (A-C), adjusted and unadjusted with covariates.

Figure 1.

Black stars represent point values from the data. The point and whiskers represent the mean value and 95% CrI of the posterior distribution. The three models differed by their primary exposure, Model A used COVID-19 zones (1 refers to lowest COVID-19 contact and 6 refers to highest COVID-19 contact 21 .), Model B the job role, and Model C the job location. Each model was evaluated either unadjusted (primary exposure only) or adjusted (primary exposure with age, gender, and ethnicity).

Antibody kinetics model

Differences in antibody concentration between samples were calculated for four different antibody-antigen interactions (spike-IgG, NCP-IgG, spike-IgA, NCP-IgA). Though there was a positive correlation between Spike-IgG and NCP-IgG across all samples (R 2.= 0·53), the correlations between serum IgG and IgA were much weaker (R 2. between 0·17 and 0·3) ( Extended data: Figure S6).

For both serum IgG and IgA, older age groups showed higher antibody titres ( Figure 2); E.g., the median log2 titre of spike-IgG in those ≤30 years was 6·6 (95% CrI 6·2–7·0), and at 60+ years was 7·1 (95% CrI 6·6–7·8), while spike-IgA titre in those ≤30 years was 6·8 (95% CrI 6·3–7·3), and at 60+ years was 7·6 (95% CrI 7·0–8·3). Symptomatic cases showed similar titres compared to asymptomatic cases across IgG and IgA-serum measures. The reduction in antibody titre at the second blood draw was less in the spike-IgG (mean -0·15 (95% CrI -0·25– -0·06)) compared to NCP-IgG (mean -0·49 (95% CrI -0·58– -0·540)) and Spike-IgA/NCP-IgA. These estimated rates of decline remained consistent across all covariate groups studied. Consequently, the estimated time until seroreversion for seropositive samples from symptomatic participants was around 100 weeks for the spike-IgG, and 52 weeks for NCP-IgG and IgA serum measures ( Figure 2). When considering seropositive samples from symptomatic participants, there was little difference in the decrease in antibody levels as time post-symptom onset increased ( Extended data: Figure S7).

Figure 2. Outputs from the antibody kinetics model for four antibody-antigen interactions (spike-IgG, NCP-IgG, spike-IgA, and NCP-IgA).

Figure 2.

The IgG measures are in the WHO standard universal log2 antibody units, whereas the IgA measures are in log2(AU) units scaled relative to the values in the study. The dots show the median and the line segments show the 95% credible interval of the posterior distributions. Top panel shows the log2(AU) at the first bleed across four different covariates (Age group, ethnicity, gender, and disease severity). Middle panels show the change in log2(AU) after 30 days. The bottom panels show the time until seroreversion in weeks. Asymp (asymptomatic participants), Symp (symptomatic participants) PSO (post symptom onset).

Heterogeneous sensitivity model

The heterogeneous sensitivity model demonstrates that using varying A 450 cut-offs (corresponding to varying sensitivity values) to categorise seropositivity in the HERO dataset will result in a lower sensitivity than that defined using our assay validation dataset ( Figure 3a). The model also shows that there is no difference in implied sensitivity between using spike or NCP as the antigenic target in the ELISA assay.

Figure 3.

Figure 3.

( a) Sensitivity of the assay validation dataset against the implied sensitivity of the HERO dataset for spike and nucleoprotein. ( b) Sensitivity of the assay validation dataset against the implied age-specific sensitivity in the HERO dataset for spike and nucleoprotein. Black line and ribbon shows median and 95% CrI for the posterior distributions respectively.

The relationship between the A 450 cut-off value and the sensitivity and specificity for the assay validation datasets for each antigen were plotted with the associated ROC curves ( Extended data: Figures S8 and S9). We hypothesised that the higher A 450 values seen in older adults suggest that some commercially available serological assays may have a higher sensitivity in detecting COVID-19 antibodies in older age groups compared with younger age groups. We therefore used our model to estimate age-specific implied sensitivity for assays of different sensitivity profiles in estimating seroprevalence in our HERO dataset. We found that the sensitivity of a serological assay decreases with age due to the higher antibody titres seen in older people, with a clearer trend in an NCP-based assay compared to a spike-based assay ( Figure 3b). Assuming a theoretical assay validation set sample sensitivity of 80% for the NCP protein, the resulting median implied sensitivity for age groups <30, 30–39, 40–49, 50–59, and 60+ years was 61%, 77%, 70%, 85%, and 89% respectively.

Discussion

We found a high SARS-CoV-2 seroprevalence in HCWs at a large UK hospital trust compared to national seroprevalence estimates, following the first pandemic wave in the UK 28 . In addition, we identified important risk factors associated with occupational exposure to COVID-19, and described a significant association between age and the likelihood of a positive serological result which has important implications for the validation of SARS-CoV-2 antibody assays and the hitherto interpretation of population-level COVID-19 serology data.

Over 20% of HCWs at STH had evidence of SARS-CoV-2 infection within just over 100 days of the first confirmed COVID-19 patient being admitted to our NHS trust. This high proportion over a short space of time is likely representative of the much higher exposure to SARS-CoV-2 infection among certain subpopulations of the workforce that we tested. Although data from other settings and countries suggested infection risk in HCWs is similar to community exposure, this seroprevalence is much higher than estimated seropositivity in the UK population at a similar time (6·0%, 95 CrI 5·8–6·1 in July 2020) 28, 29 .

Our data show that HCWs working in AMUs are at significantly higher risk of infection with SARS-CoV-2, with seropositivity rates above that of other wards, consistent with other UK studies 5, 6, 30 . EDs face a similar patient turnover yet have lower HCW seroprevalence rates in both ours and previous reports 5, 6, 30, 31 . Although some patient factors may increase HCW risk of infection on AMU compared to ED (cohort bays, longer stays, more fomites e.g. bedside tables, chairs), more frequent interactions and therefore transmission between HCWs may also play a significant additional role. In the event of a further wave or outbreak, infection prevention and control (IPC) interventions to reduce risk in these areas could include targeted IPC training and auditing (particularly of PPE use and break areas), serial staff testing, pop-up isolation units in bay areas and optimising staff-to-patient ratios. At the other end of the spectrum, we and others have found that HCWs in critical care units have some of the lowest seropositivity rates, which likely reflects ‘Level 3 Airborne’ PPE use in Critical Care units from an early point in the pandemic and the increased availability of negative pressure rooms 5, 6, 8, 30, 31 .

Occupational and physiotherapists (OT/PT) had the highest rates of seroprevalence across all of the job roles included in our cohort (45.5%), which is consistent with some other UK studies 6, 32 . OT/PT work involves prolonged close contact with patients in addition to PT performing chest physiotherapy and open suctioning of the respiratory tract. In addition, OT/PT work across multiple inpatient and outpatient areas in our Trust, which could increase risk of transmission from both patients and other HCWs.

Increasing age was associated with seropositivity, with over a third of our HCWs aged >60 testing seropositive, and with higher antibody titres. We demonstrate that the sensitivity of a serological assay increases with increasing age due to the higher antibody titres seen in older people, and with a clearer trend in NCP- compared to spike-based assays. Consistent with other studies, we also found that NCP-IgG is likely to wane more quickly than Spike-IgG. Depending on the sampling time frame relative to pandemic wave, therefore, serological testing based on NCP-IgG alone may therefore underestimate seroprevalence i.e. potential exposure to infection. As two of the major commercial SARS-CoV-2 antibody assays (Roche Elecsys and Abbott SARS-CoV-2 IgG) were validated with patient sera collected from those with more severe disease early on in the pandemic (i.e. those who presented to health services). Previous studies clearly demonstrate that patients with more severe COVID-19 have higher antibody titres and it would be reasonable to assume these cases were likely to also be older in age 3336 . Our findings suggest that using such samples collected from severely symptomatic older patients for the purposes of assay calibration may result in an assay with lower or insufficient sensitivity when applied to less symptomatic or younger (often community) populations. With increasing vaccine coverage, use of spike IgG to determine seroprevalence also becomes more problematic when distinguishing whether an individual is seropositive from vaccination or previous infection. Assays which combine antibody responses to membrane protein with NCP antibodies may overcome these challenges 37, 38 .

We note the limitations of our study, which include a potential for selection bias due to participants self-enrolling for convenience, rather than using systematic sampling. Reassuringly, our seroprevalence rates are similar to those seen in other UK based seroprevalence studies 5, 32 . In addition, we recognise that our cohort has relatively low numbers of HCWs from minority ethnic backgrounds (~10%), compared to the Sheffield general population (19%) 39 .

With the ongoing global devastation caused by the COVID-19 pandemic and its lasting effect on healthcare services, understanding the risk factors leading to HCW exposure is paramount to ensuring the continuity of effective and safe patient care. Our real-world data suggest that NHS HCWs face high levels of exposure to SARS-CoV-2, plus highlights locations and job roles at greatest risk during the first wave of the pandemic. Population seroprevalence data can help guide decision makers on risk management. Using assays that have been validated using serum samples from a broad population, combined with antibody kinetic modelling and/or with age-adjusted cut-offs could overcome the potential limitations we have highlighted.

Data availability

Underlying data

Zenodo: dchodge/hero-study: Submitted version for WOR, https://doi.org/10.5281/zenodo.5215671 26 .

The project contains the following underlying data:

  • datafit.RData

  • fit_mcmc_asymp.RData

  • fit_mcmc_change.RData

  • fit_mcmc_prev.RData

  • fit_mcmc_sens.RData

  • fit_mcmc_spec.RData

  • fit_mcmc_start.RData

  • fit_mcmc_time.RData

  • sensspec.RData

  • sero_clean.RData

Extended data

Zenodo: dchodge/hero-study: Submitted version for WOR, https://doi.org/10.5281/zenodo.5215671 26 .

The project contains the following extended data:

  •    –   Table S1. Details of the samples used to set thresholds during assay validation

  •    –   Table S2. Comparison of antibody units in assay calibration curve sera assigned to the assay with WHO international standard antibody units

  •    –   Table S3. Summary of the response variables and the covariates used in the regression model.

  •    –   Table S4. Summary of the model parameters used in the regression model.

  •    –   Table S5. Summary of the response variables and the covariates used in the regression model.

  •    –   Table S6. Summary of the model parameters used in the regression model.

  •    –   Table S7. Summary of the model parameters used in the Heterogenous sensitivity model.

  •    –   Figure S1. ROC curves of the spike and NCP assays

  •    –   Figure S2. Comparison of IgA assay A450 based on IgG Serostatus

  •    –   Figure S3. Study flow diagram.

  •    –   Figure S4. Histogram (overlayed) showing the symptom onset, date of first bleed (all cases and symptomatic cases only), and time at second bleed (all cases and symptomatic cases only).

  •    –   Figure S5. Model-predicted proportion of asymptomatic estimates for three different models (A-C), adjusted and unadjusted with covariates gender, age group and ethnicity.

  •    –   Figure S6. Correlation between the four different antibody measures for 264 serological samples.

  •    –   Figure S7. Rate of decline for the antibody concentrations post-symptom onset for the four antibody measures. The fitted line is from a linear regression, with the 95% CI shown in red.

  •    –   Figure S8. Relationship between sensitivity/specificity and the cutoff value for the control dataset.

  •    –   Figure S9. ROC curves with the A 450 cut-off value indicated in red for the control dataset. x-axis shows the False Positive Rate, y-axis is the sensitivity.

  •    –   Figure S10. ROC curves for different age groups and antigen proteins, with the A 450 cut-off value indicated in various colours for the control dataset.

  •    –   Figure S11. (a) Specificity of the control data set against the implied specificity of the HERO dataset for spike and nucleoprotein. (b) Specificity of the control data set against the implied age-specific specificity of the HERO dataset for spike and nucleoprotein.

  •    –   HERO Completed STROBE Checklist[60].doc

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

Acknowledgements

We would like to thank all the participants who generously donated their time for this study. We would like to thank the technician team from the Department of Infection, Immunity and Cardiovascular Disease (Mark Ariaans, Yvonne Stephenson, Linda Kay, Ben Durham, Amy Lewis, Barbora Ndreca) and Department of Oncology and Metabolism (Julie Porter, Maggie Glover and Ana Lopez) for help with processing samples, as well as Amanda Blythe, Julie Schofield, Phil Ravencroft, Elizabeth Kirkssen, Katie Jenkins, Sarah Berry, Debbie Allen and Aimee Card for help with participant recruitment, sampling and regulatory support. We thank Professor William Egner and Ravi Sargur for useful discussions on serological assay validation. We are grateful to Professor Florian Krammer, Fatima Amanat and Parnavi Desai for the provision of the spike constructs.

An earlier version of this article can be found on medRxiv (doi: 10.1101/2021.07.07.21260151)

Funding Statement

Funding support was generously given by Sheffield Teaching Hospitals NHS Foundation Trust and an unrestricted charitable donation from The Danson Foundation ref DAN581929. The funding sources had no involvement within the study design, data collection/analaysis/interpretetation, or in the writing of (and decision to submit) the report. Thushan de Silva is supported by a Wellcome Trust Intermediate Clinical Fellowship (110058/Z/15/Z). Kirsty L. Bradley is funded by the Rosetrees Trust (X/154753-12). Joanne Mckenzie is funded by the Faculty of Medicine, Dentistry and Health scholarship from the University of Sheffield. Adam Kucharski is supported by a Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society (206250/Z/17/Z). David Hodgson and Adam Kucharski are supported by National Institutes of Health (1R01AI141534-01A1). Domen Zafred has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement number 843245. Jon Sayers, Thomas Darton and Sarah Rowland-Jones have received funding from Florey Institute AMR Research Capital Funding (grant number 200636) from National Institute of Health Research (UK).

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 1; peer review: 1 approved with reservations]

References

  • 1. Nguyen LH, Drew DA, Graham MS, et al. : Risk of COVID-19 among front-line health-care workers and the general community: a prospective cohort study. Lancet Public Heal. 2020;5(9):e475–e483. 10.1016/S2468-2667(20)30164-X [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Miriam M, Claire N, Ewan Beaton M, et al. : Occupation and risk of severe COVID-19: prospective cohort study of 120 075 UK Biobank participants. Occup Environ Med. 2020. 10.1136/oemed-2020-106731 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Behrens GMN, Cossmann A, Stankov MV, et al. : Perceived versus proven SARS-CoV-2-specific immune responses in health-care professionals. Infection. 2020;48(4):631–4. 10.1007/s15010-020-01461-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Keeley AJ, Evans C, Colton H, et al. : Roll-out of SARS-CoV-2 testing for healthcare workers at a large NHS Foundation Trust in the United Kingdom, March 2020. Euro Surveill. 2020;25(14):2000433. 10.2807/1560-7917.ES.2020.25.14.2000433 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Shields A, Faustini SE, Perez-Toledo M, et al. : SARS-CoV-2 seroprevalence and asymptomatic viral carriage in healthcare workers: A cross-sectional study. Thorax. 2020;75(12):1089–1094. 10.1136/thoraxjnl-2020-215414 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Eyre DW, Lumley SF, O’donnell D, et al. : Differential occupational risks to healthcare workers from SARS-CoV-2 observed during a prospective observational study. eLife. 2020;9:e60675. 10.7554/eLife.60675 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Cooper DJ, Lear S, Watson L, et al. : A prospective study of risk factors associated with seroprevalence of SARS-CoV-2 antibodies in healthcare workers at a large UK teaching hospital. MedRxiv. 2020. 10.1101/2020.11.03.20220699 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Valdes AM, Moon JC, Vijay A, et al. : Longitudinal assessment of symptoms and risk of SARS-CoV-2 infection in healthcare workers across 5 hospitals to understand ethnic differences in infection risk. EClinicalMedicine. 2021;34: 100835. 10.1016/j.eclinm.2021.100835 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Hou H, Wang T, Zhang B, et al. : Detection of IgM and IgG antibodies in patients with coronavirus disease 2019. Clin Transl Immunol. 2020;9(5):e01136. 10.1002/cti2.1136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Deeks JJ, Dinnes J, Takwoingi Y, et al. : Antibody tests for identification of current and past infection with SARS-CoV-2. Cochrane Database Syst Rev. 2020;6(6):CD013652. 10.1002/14651858.CD013652 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Burbelo PD, Riedo FX, Morishima C, et al. : Detection of nucleocapsid antibody to SARS-CoV-2 is more sensitive than antibody to spike protein in COVID-19 patients. MedRxiv. 2020; 2020.04.20.20071423. 10.1101/2020.04.20.20071423 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Ainsworth M, Andersson M, Auckland K, et al. : Performance characteristics of five immunoassays for SARS-CoV-2: a head-to-head benchmark comparison. Lancet Infect Dis. 2020;20(12):1390–1400. 10.1016/S1473-3099(20)30634-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Reynolds CJ, Swadling L, Gibbons JM, et al. : Discordant neutralizing antibody and T cell responses in asymptomatic and mild SARS-CoV-2 infection. Sci Immunol. 2020;5(54): eabf3698. 10.1126/sciimmunol.abf3698 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Grandjean L, Saso A, Torres A, et al. : Humoral response dynamics following infection with SARS-CoV-2. MedRxiv. 2020. 10.1101/2020.07.16.20155663 [DOI] [Google Scholar]
  • 15. Wang Y, Li J, Li H, et al. : Persistence of SARS-CoV-2-specific antibodies in COVID-19 patients. Int Immunopharmacol. 2021;90: 107271. 10.1016/j.intimp.2020.107271 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Wu F, Wang A, Liu M, et al. : Neutralizing antibody responses to SARS-CoV-2 in a COVID-19 recovered patient cohort and their implications. MedRxiv. 2020. 10.1101/2020.03.30.20047365 [DOI] [Google Scholar]
  • 17. Orner EP, Rodgers MA, Hock K, et al. : Comparison of SARS-CoV-2 IgM and IgG seroconversion profiles among hospitalized patients in two US cities. Diagn Microbiol Infect Dis. 2021;99(4): 115300. 10.1016/j.diagmicrobio.2020.115300 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Gorse GJ, Donovan MM, Patel GB: Antibodies to coronaviruses are higher in older compared with younger adults and binding antibodies are more sensitive than neutralizing antibodies in identifying coronavirus-associated illnesses. J Med Virol. 2020;92(5):512–517. 10.1002/jmv.25715 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Ho MS, Chen WJ, Chen HY, et al. : Neutralizing antibody response and SARS severity. Emerg Infect Dis. 2005;11(11):1730–7. 10.3201/eid1111.040659 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. CQC: Sheffield Teaching Hospitals CQC Report 2018.2018. Reference Source [Google Scholar]
  • 21. HERO Extended Data. Accessed August 10, 2021. Reference Source [Google Scholar]
  • 22. Amanat F, Stadlbauer D, Strohmeier S, et al. : A serological assay to detect SARS-CoV-2 seroconversion in humans. Nat Med. 2020;26(7):1033–1036. 10.1038/s41591-020-0913-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Johari YB, Jaffé SRP, Scarrott JM, et al. : Production of trimeric SARS-CoV-2 spike protein by CHO cells for serological COVID-19 testing. Biotechnol Bioeng. 2021;118(2):1013–1021. 10.1002/bit.27615 [DOI] [PubMed] [Google Scholar]
  • 24. Tugaeva Kristina V, Hawkins DEDP, Smith JLR, et al. : The Mechanism of SARS-CoV-2 Nucleocapsid Protein Recognition by the Human 14-3-3 Proteins. J Mol Biol. 2021;433(8): 166875. 10.1016/j.jmb.2021.166875 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Flaxman S, Mishra S, Gandy A, et al. : Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature. 2020;584(7820):257–261. 10.1038/s41586-020-2405-7 [DOI] [PubMed] [Google Scholar]
  • 26. Hodgson D, Abbott S: dchodge/hero-study: Submitted version for WOR (v1.0.0). Zenodo. 2021. 10.5281/zenodo.5215672 [DOI] [Google Scholar]
  • 27. Hanson KE, Caliendo AM, Arias CA, et al. : Infectious Diseases Society of America Guidelines on the Diagnosis of COVID-19: Serologic Testing. Clin Infect Dis. 2020; ciaa1343. 10.1093/cid/ciaa1343 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Ward H, Cooke G, Atchison C, et al. : Declining prevalence of antibody positivity to SARS-CoV-2: A community study of 365,000 adults. MedRxiv. 2020. 10.1101/2020.10.26.20219725 [DOI] [Google Scholar]
  • 29. Steensels D, Oris E, Coninx L, et al. : Hospital-Wide SARS-CoV-2 Antibody Screening in 3056 Staff in a Tertiary Center in Belgium. JAMA - J Am Med Assoc. 2020;324(2):195–197. 10.1001/jama.2020.11160 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Houlihan CF, Vora N, Byrne T, et al. : Pandemic peak SARS-CoV-2 infection and seroconversion rates in London frontline health-care workers. Lancet. 2020;396(10246):e6–e7. 10.1016/S0140-6736(20)31484-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Galán MAI, Velasco M, Casas MAL, et al. : SARS-CoV-2 Seroprevalence among all workers in a teaching hospital in Spain: Unmasking the risk. MedRxiv. 2020. 10.1101/2020.05.29.20116731 [DOI] [Google Scholar]
  • 32. El Bouzidi K, Pirani T, Rosadas C, et al. : Severe Acute Respiratory Syndrome Coronavirus-2 Infections in Critical Care Staff: Beware the Risks beyond the Bedside. Crit Care Med. 2021;49(3):428–436. 10.1097/CCM.0000000000004878 [DOI] [PubMed] [Google Scholar]
  • 33. Public Health England: Evaluation of Roche Elecsys Anti-SARS-CoV-2 serology assay for the detection of anti-SARS-CoV-2 antibodies About Public Health England 2020. Reference Source [Google Scholar]
  • 34. Public Health England: Evaluation of the Abbott SARS-CoV-2 IgG for the detection of anti-SARS-CoV-2 antibodies About Public Health England.2020;1–19. Reference Source [Google Scholar]
  • 35. Garcia-Beltran WF, Lam EC, Astudillo MG, et al. : COVID-19-neutralizing antibodies predict disease severity and survival. Cell. 2021;184(2):476–488.e11. 10.1016/j.cell.2020.12.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Lau EHY, Tsang OTY, Hui DSC, et al. : Neutralizing antibody titres in SARS-CoV-2 infections. Nat Commun. 2021;12(1):63. 10.1038/s41467-020-20247-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Bruckner TA, Parker DM, Bartell SM, et al. : Estimated seroprevalence of SARS-CoV-2 antibodies among adults in Orange County, California. Sci Rep. 2021;11(1): 3081. 10.1038/s41598-021-82662-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Martin S, Heslan C, Jégou G, et al. : SARS-CoV2 Envelop Proteins Reshape the Serological Responses of COVID-19 Patients. SSRN Electron J. 2021. 10.2139/ssrn.3821960 [DOI] [Google Scholar]
  • 39. Sheffield City Council: Briefing Note 2: Ethnicity.2012. [Google Scholar]
Wellcome Open Res. 2021 Dec 6. doi: 10.21956/wellcomeopenres.18936.r47144

Reviewer response for version 1

Mo Yin 1,2

I congratulate the authors for conducting this large and informative study. It is an important finding that SARS-CoV-2 were highly prevalent amongst healthcare workers so rapidly into the pandemic.

My overall comments are:

  1. Figure 1 shows that the association between high seroprevalence and age attenuated after adjustment with zones, job and wards. Could there be confounding relationships? If zones, job and wards were added to the antibody kinetics model, would the association between age and high titre attenuate as well?

  2. It appears that a major aim here is to valid an in-house assay. The manuscript could be more clear if this was clearly stated in the beginning. An important caveat for this work is that the dataset was based on this in-house assay.

My complete list of comments are listed below: 

Abstract:

  1. “SARSCoV- 2 antibodies were tested using an in-house assay for IgG and IgA reactivity against Spike and Nucleoprotein (sensitivity 99·47%, specificity 99·56%).” This was in the results section of the manuscript. Was validation of the in-house assay an objective of this work? If yes this should be clearly specified and the quoted text should go into results.

  2. “HCWs in acute medical units working closely with COVID-19 patients were at highest risk of infection, though whether these are infections acquired from patients or other staff is unknown.” I would assume HCWs from other ward types also work closely with patients?

  3. “Current serological assays may underestimate seroprevalence in younger age groups if validated using sera from older and/or more symptomatic individuals.” It was not shown in the results that symptomatic individuals had higher titres than asymptomatic ones?

Introduction:

  1. “The true number of HCWs exposed to SARS-CoV-2 to-date is unknown, particularly during the early stages of the pandemic. Initial methods to estimate HCW exposure included extrapolation from work absenteeism rates, and are unlikely to be reliable3. Confirmation by molecular testing increased the accuracy of case detection, although access to nucleic acid amplification testing (NAAT) was limited during the early stages of the pandemic in the UK4. Detection of exposure by antibody seroconversion may provide a more accurate estimate of risk in HCW populations, can be performed at large scale, and is less affected by symptom activated testing pathways5–8.”  Do the authors mean infected/infection instead of exposed/exposure since confirmation testing by NAAT refers to infection rather than exposure?

  2. “Many antibody assays have been evaluated using samples from hospitalised patients; it is unclear how these assays perform with the lower antibody levels found in those with more mild or asymptomatic SARS-CoV-2 infection10,12” There have been a number of studies looking at serology response in mild COVID cases, e.g. https://www.nature.com/articles/s41598-020-77125-8. 1 Suggest to include in the discussion how the findings from this study compare to the existing ones.

  3. Please specify the aims in the introduction.

Methods:

  1. Please include more details on the ward set ups in the ‘Background and setting’ for readers not familiar with the NHS. There are some descriptions in the results and discussion sections. It is probably clearer if these can be explained upfront.

  2. It appears that willing healthcare workers enrolled themselves into the study. Were there any measures to prevent volunteer bias? Could volunteer bias influence the results?

  3. “These were categorised as: i), diagnosed with COVID-19 and confirmed by NAAT, ii), clinically diagnosed with COVID-19 but NAAT not performed, and iii), self-reported symptoms only21. Together, we defined these three groups as “symptomatic”, as asymptomatic testing was only introduced after the study recruitment period.” If we expect a proportion of those in groups ii) and iii) are not true COVID-19 cases, how would that affect the results? Were any sensitivity analyses performed to address this uncertainty?

  4. “Using seropositivity as the binary response variable, we considered three different model subtypes with varying primary exposures; job location, contact with COVID-19 patients, and job type21.” It is not clear from this sentence what were the actual independent variables used in the model subtypes? Which variable was given a random effect in the multilevel model? In the GitHub account, where the README says ‘The stan code and the raw mcmc outputs is contained in the outputs/ folder.’, I could not find the model codes?

  5. “For the antibody kinetics model, we included samples from seropositive individuals in a Bayesian multilevel linear regression model in two parts:…” Does this refer to the HCWs who were positive in the first blood draw only?

  6. “In our heterogeneous sensitivity and specificity model, we explored how estimates derived from our assay validation dataset generalise to covariate groups, e.g. participant age.” What is the assay validation dataset? It is probably clearer to move the explanation from results to the methods section.

  7. “In our heterogeneous sensitivity and specificity model, we explored how estimates derived from our assay validation dataset generalise to covariate groups, e.g. participant age. To model the generalisability of these performance measures, we compared the seropositivity classification of our study dataset using our in-house antibody assay, with the predicted seropositivity classification from hypothetical assays with a quoted sensitivity and specificity. Our” These sentences are a little confusing. Firstly, what does ‘estimates’ in ‘estimates derived from our assay validation dataset generalise to covariate groups, e.g. participant age’ refer to? Is it referring to sensitivity and specificity? Secondly, “we compared the seropositivity classification of our study dataset using our in-house antibody assay” sounds like you compared the classification using the antibody assay. Thirdly, in “with the predicted seropositivity classification from hypothetical assays with a quoted sensitivity and specificity”, what does ‘quoted’ mean? Does it mean ‘assumed sensitivity and specificity since it is a hypothetical assay?

  8. Please introduce A450 when it was first mentioned in the manuscript.

  9. In “to model how reliably quoted performance measures generalise.”, what does ‘quoted’ refer to?

  10. “Using the assay validation dataset, we estimated the A450 cut-off value for a range of chosen sensitivity values, and then used this A450 cut-off to classify seropositivity in the study dataset.” How did you choose a single A450 value from a range of sensitivity values?

  11. “We then estimated the implied sensitivity on the HERO dataset by comparing seropositivity classification based on the estimated A450 cut-off value, with the seropositivity classification from our in-house assay (which for ease of comparison, we assume represents the maximum possible sensitivity and specificity (i.e. 100%) in this model).” What does ‘implied’ mean here?

  12. Please cite R and cmdstanr.

Results:

  1. “Rapid waning of IgA responses following SARS-CoV-2 infection complicated defining positive and negative samples based on the convalescent sera we used for assay validation.” Why is this so?

  1. “1174 attended for a second visit (V2) between 15 June and 10 July 2020” ‘for’ is unnecessary in this sentence.

  2. Figure 1: Why do the black stars not overlap with the unadjusted mean estimates?

  3. Why were zones, job and wards not used in the same model? Was their model selection done?

  4. Figure 1: The raw data showed that a higher proportion of HCWs who were >60 years old were seropositive compared to the other age groups. However, there’s no longer this association after adjustment. What was/were the confounder(s)?

  5. In the antibody kinetics model, neither zones, job and wards were used as an independent variable. As mentioned in the earlier point, would this finding of age and higher titres modulate/disappear if zones, job and/or wards are included in this model as well?

  6. Figure 3: Please specify the meaning of ‘control’ in the x-axis.

Discussion:

  1. Were the infection prevention and control measures similar in the ED and AMU? Could the higher prevalence in AMU be due to transmissions from asymptomatic/ mildly symptomatic patients who were not triaged to be isolated by the ED?

  2. “Previous studies clearly demonstrate that patients with more severe COVID-19 have higher antibody titres” However this was not found in this study?

  3. “Reassuringly, our seroprevalence rates are similar to those seen in other UK based seroprevalence studies” Citation 5 also enrolled volunteer participants (publicised study using social media), and citation 21 is the HERO extended data?

  4. It will be informative to have a more detailed discussion on if the finding of age affecting serology titres were reported in other studies.

Is the work clearly and accurately presented and does it cite the current literature?

Partly

If applicable, is the statistical analysis and its interpretation appropriate?

Partly

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Partly

Are the conclusions drawn adequately supported by the results?

Partly

Are sufficient details of methods and analysis provided to allow replication by others?

Partly

Reviewer Expertise:

Mathematical modelling, data analysis, trial designs, randomised controlled trials

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

References

  • 1. : Serological follow-up of SARS-CoV-2 asymptomatic subjects. Scientific Reports .2020;10(1) : 10.1038/s41598-020-77125-8 10.1038/s41598-020-77125-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
Wellcome Open Res. 2022 Mar 1.
Hayley Colton 1

Thank you to the reviewer for their considerate and thorough comments. Please see below our responses to each point, and changes made.

1. Figure 1 shows that the association between high seroprevalence and age attenuated after adjustment with zones, job and wards. Could there be confounding relationships? If zones, job and wards were added to the antibody kinetics model, would the association between age and high titre attenuate as well?

David Hodgson: The reviewer’s interpretation of Figure 1 seems correct, but to be crystal clear; Figure 1 shows the (unadjusted) association between high seroprevalence, and the covariates age, gender, and race, and the (adjusted) association between high seroprevalence, and the covariates age, gender, race, and one primary exposure variable (either zones, jobs or wards). In Figure 1, the similarity between the unadjusted and adjusted seroprevalence estimates suggest that neither zone, jobs or wards were strong confounders on the age-dependent prevalence, therefore they are dropped in the subsequent modes for parsimony.

2. It appears that a major aim here is to valid an in-house assay. The manuscript could be more clear if this was clearly stated in the beginning. An important caveat for this work is that the dataset was based on this in-house assay.

Hailey Hornsby: As seen later in this response, the manuscript intro has been expanded to include the ELISA validation as an objective of the work:

To achieve this, we sought to measure SARS-CoV-2 antibody titres by developing an in-house assay (...)”

Abstract:

1. “SARSCoV- 2 antibodies were tested using an in-house assay for IgG and IgA reactivity against Spike and Nucleoprotein (sensitivity 99·47%, specificity 99·56%).” This was in the results section of the manuscript. Was validation of the in-house assay an objective of this work? If yes this should be clearly specified and the quoted text should go into results.

Hailey Hornsby / Hayley Colton: Thank you for pointing out, this has now been corrected:

“Methods: HCWs at Sheffield Teaching Hospitals NHS Foundation Trust were prospectively enrolled and sampled at two time points. We developed an in-house ELISA for testing participant serum for SARS-CoV-2 for IgG and IgA reactivity against Spike and Nucleoprotein. Data were analysed using three statistical models: a seroprevalence model, an antibody kinetics model, and a heterogeneous sensitivity model. Results: Our in-house assay had sensitivity 99·47% and specificity 99·56%. We found that 24·4% (n=311/1275) of HCWs were seropositive as of 12th June 2020.”

2. “HCWs in acute medical units working closely with COVID-19 patients were at highest risk of infection, though whether these are infections acquired from patients or other staff is unknown.” I would assume HCWs from other ward types also work closely with patients?

Hayley Colton: HCWs were asked in the survey what zone they worked in, and if they were patient facing (see Extended Data, reference 21). The analysis showed that the highest seroprevalence was seen in AMU and those who worked in red zones regularly in close contact with COVID patients (group 6). Thank you for pointing out that the wording is not clear, it has been changed in the manuscript:

“HCWs in acute medical units and those working closely with COVID-19 patients were at highest risk of infection, though whether these are infections acquired from patients or other staff is unknown”

3.  “Current serological assays may underestimate seroprevalence in younger age groups if validated using sera from older and/or more symptomatic individuals.” It was not shown in the results that symptomatic individuals had higher titres than asymptomatic ones?

Hayley Colton: Commercial assays referred to in this sentence were validated against samples from patients who were hospitalised with severe disease; whereas our assay was validated using samples from both inpatients with severe disease and outpatients with mild (or no) symptoms). Although we found that symptomatic cases showed similar titres compared to asymptomatic cases within our HCWs, we did not analyse the titres against severity, whereas other studies have. For our assay development, we did compare inpatient and outpatient antibody titres, this has now been added to the Extended Data as Figure S1b. Thank you for pointing this out, the manuscript has been amended to reflect that we are referring to severity rather than symptoms exclusively:

“Current serological assays may underestimate seroprevalence in younger age groups if validated using sera from older and/or more severe COVID-19 cases.”

Introduction:

1. " The true number of HCWs exposed to SARS-CoV-2 to-date is unknown, particularly during the early stages of the pandemic. Initial methods to estimate HCW exposure included extrapolation from work absenteeism rates, and are unlikely to be reliable3. Confirmation by molecular testing increased the accuracy of case detection, although access to nucleic acid amplification testing (NAAT) was limited during the early stages of the pandemic in the UK4. Detection of exposure by antibody seroconversion may provide a more accurate estimate of risk in HCW populations, can be performed at large scale, and is less affected by symptom activated testing pathways5–8.”  Do the authors mean infected/infection instead of exposed/exposure since confirmation testing by NAAT refers to infection rather than exposure?

Hayley Colton: Thank you for pointing out, rephrased as below:

“The true number of HCWs infected with SARS-CoV-2 to-date is unknown, particularly during the early stages of the pandemic. Initial methods to estimate HCW COVID-19 cases included extrapolation from work absenteeism rates, and are unlikely to be reliable3. Confirmation by molecular testing increased the accuracy of case detection, although access to nucleic acid amplification testing (NAAT) was limited during the early stages of the pandemic in the UK4. Serological testing can be performed at large scale, and is less affected by symptom activated testing pathways, so may provide a more accurate estimate of previously infected HCWs and could be used in conjunction with other data to determine their risk factors for exposure 5–8.”

2.  "Many antibody assays have been evaluated using samples from hospitalised patients; it is unclear how these assays perform with the lower antibody levels found in those with more mild or asymptomatic SARS-CoV-2 infection10,12” There have been a number of studies looking at serology response in mild COVID cases, e.g. https://www.nature.com/articles/s41598-020-77125-8. 1 Suggest to include in the discussion how the findings from this study compare to the existing ones.

Hayley Colton: Thank you, this didn’t come up in our original literature search. It appears Wantai ELISA which is used in the UNICORN consortium reference was also validated using sera from samples from PCR positive patients who all had clinical symptoms, laboratory abnormalities or pulmonary imaging manifestations. (WANTAI SARS-CoV-2 Ab ELISA Instructions for use https://www.fda.gov/media/140929/download accessed 30 Dec 2021). I have added this to the introduction and discussion.

Several commercial SARS-CoV-2 antibody assays have been validated using samples from patients with more severe COVID-19, and some studies have suggested those with milder or asymptomatic COVID-19 are less likely to develop detectable antibodies. 40, 41, 42.”

3. Please specify the aims in the introduction.

Hayley Colton: We have edited the manuscript introduction to make our aims clearer:

“In this study we aimed to investigateSARS-CoV-2 seroprevalence in HCWs at Sheffield Teaching Hospitals NHS Foundation Trust (STH) following the first way of the pandemic in the UK. T o achieve this, we sought to measure SARS-CoV-2 antibody titres by developing an in-house assay, and use statistical modelling to explore risk factors associated with seropositivity, the evolving antibody response, and the impact of age on assay sensitivity”

Methods:

1. Please include more details on the ward set ups in the ‘Background and setting’ for readers not familiar with the NHS. There are some descriptions in the results and discussion sections. It is probably clearer if these can be explained upfront.

Hayley Colton:  Thanks for pointing this out, the background section has been edited:

“STH is an NHS trust offer ing secondary- and tertiary-level care across four sites in South Yorkshire, UK, with 1,669 inpatient beds and 18,500 employees 20. Patients with a medical reason for admission are typically admitted to STH either through attending the Emergency Department (ED) or through referral to the Acute Medical Unit (AMU) by their General Practitioner (GP). On AMU, patients are given an initial management plan then triaged to the most appropriate medical speciality ward e.g. respiratory medicine. The first patient with confirmed COVID-19 was admitted to STH on 23 February 2020; the first wave of the UK pandemic occurred between March 2020 and June 2020. Patients with suspected or confirmed COVID-19 were referred directly to the infectious diseases wards by GPs or from STH admission units. When infectious diseases capacity was reached, suspected COVID-19 patients would either be placed in side rooms or cohorted in bays, whilst confirmed COVID-19 patients could be moved to COVID cohort wards

2. It appears that willing healthcare workers enrolled themselves into the study. Were there any measures to prevent volunteer bias? Could volunteer bias influence the results?

Paul Collini: Volunteer bias could conceivably increase the observed prevalence if those who thought they had been exposed to COVID were more likely to volunteer and really had been more likely to have been exposed and infected. The only measure we had to prevent bias was that we approached domestic services staff through their managers as well as email as we anticipated they wouldn’t be accessing emails. However, we had no measures to mitigate any volunteer enrolment bias. While we cannot measure the extent of this effect on the measured prevalence of antibody detection, we suspect that it is small; at the time of the study there was a huge interest among staff as to whether they had yet been exposed to SARS CoV-2, which appeared to be irrespective of perceived exposure and was reflected in the rapid enrolment; also at that time it was still not appreciated how common asymptomatic COVID was, yet a large number of people without any prior symptoms enrolled. Even if volunteer bias were to artificially increase the observed prevalence, we think any volunteer bias would have been equal across all categories we compared, and so not altering the validity of these comparisons. We have altered the discussion to better reflect this - see response to ‘Discussion point 3’.’

3. “These were categorised as: i), diagnosed with COVID-19 and confirmed by NAAT, ii), clinically diagnosed with COVID-19 but NAAT not performed, and iii), self-reported symptoms only21. Together, we defined these three groups as “symptomatic”, as asymptomatic testing was only introduced after the study recruitment period.” If we expect a proportion of those in groups ii) and iii) are not true COVID-19 cases, how would that affect the results? Were any sensitivity analyses performed to address this uncertainty?

David Hodgson: Just to be clear, case positivity was defined through seropositivity and this sentence refers to the classification of these seropositive individuals as either a symptomatic or asymptomatic COVID-19 case. For patients in group ii) the COVID case was diagnosed by a clinical professional but a NAAT test was not possible at the time. Therefore it is unlikely, especially given the short time-scale between reporting COVID-like symptoms (Feb 2020) and the first bleed of the experiment (Apr 2020) that this is a misdiagnosed case. For people in iii) it is possible that some people were asymptomatic with COVID-19 and were reporting symptoms of another respiratory disease, however the sample size of this group is small (22 / 340 (6.4%) of non-asymptomatic cases) and even if such misdiagnoses were common, it is unlikely to impact the results.

4. "Using seropositivity as the binary response variable, we considered three different model subtypes with varying primary exposures; job location, contact with COVID-19 patients, and job type21.” It is not clear from this sentence what were the actual independent variables used in the model subtypes? Which variable was given a random effect in the multilevel model? In the GitHub account, where the README says ‘The stan code and the raw mcmc outputs is contained in the outputs/ folder.’, I could not find the model codes?"

David Hodgson: Further details of the model are given in the Extended Data (reference 21). To make the model independent variables as clear as possible in manuscript, we have edited the manuscript as below:

“Using seropositivity as the binary response variable, we considered three different Bayesian Hierarchical Regression model subtypes all with explanatory demographic variables age, race and gender, and each model with a different primary exposure; job location, contact with COVID-19 patients, and job type21. ”

Regarding the stan code and raw mcmc outputs; there is a typo in the README file, and these files are actually contained in the ‘include’ folder. This has now been updated in the Github Repository. Thankyou for pointing out this error.

5. "For the antibody kinetics model, we included samples from seropositive individuals in a Bayesian multilevel linear regression model in two parts:…” Does this refer to the HCWs who were positive in the first blood draw only?

David Hodgson: We included HCWs who were positive at both their first and second bleed as stated in the Extended Data (reference 21 in the manuscript) Manuscript edited as following:

“For the antibody kinetics model, we included samples from individuals who were seropositive at both bleeds, in a Bayesian multilevel linear regression model in two parts:”

6. “In our heterogeneous sensitivity and specificity model, we explored how estimates derived from our assay validation dataset generalise to covariate groups, e.g. participant age.” What is the assay validation dataset? It is probably clearer to move the explanation from results to the methods section.

Hayley Colton: The validation of the in-house assay has now been added as an objective:

“T o achieve this, we sought to measure SARS-CoV-2 antibody titres by developing and using an in-house assay, prior to using statistical modelling to explore risk factors associated with seropositivity, the evolving antibody response, and the impact of age on assay sensitivity”

The development technique of the in-house assay has been moved to methods, whilst the validation results have been moved to results.

7. "In our heterogeneous sensitivity and specificity model, we explored how estimates derived from our assay validation dataset generalise to covariate groups, e.g. participant age. To model the generalisability of these performance measures, we compared the seropositivity classification of our study dataset using our in-house antibody assay, with the predicted seropositivity classification from hypothetical assays with a quoted sensitivity and specificity. Our” These sentences are a little confusing. Firstly, what does ‘estimates’ in ‘estimates derived from our assay validation dataset generalise to covariate groups, e.g. participant age’ refer to? Is it referring to sensitivity and specificity? Secondly, “we compared the seropositivity classification of our study dataset using our in-house antibody assay” sounds like you compared the classification using the antibody assay. Thirdly, in “with the predicted seropositivity classification from hypothetical assays with a quoted sensitivity and specificity”, what does ‘quoted’ mean? Does it mean ‘assumed sensitivity and specificity since it is a hypothetical assay?

David Hodgson: In answer to the reviewer’s first question; yes, “estimates” in this sentence refers to sensitivity and specificity. For the reviewer’s second point, we have clarified the sentence to ensure that this confusion does not arise.  Finally, for the reviewer's third point, yes, “quoted” in this sentence can be read as “assumed”. With these comments in mind, we have changed the text accordingly:

“In our heterogeneous sensitivity and specificity model, we explored how estimates for the sensitivity and specificity derived from our assay validation dataset generalise to covariate groups, e.g. participant age. To model the generalisability of these performance measures, we compared the seropositivity classification of our study dataset according to our in-house antibody assay, with the predicted seropositivity classification from hypothetical assays with an assumed sensitivity and specificity.”

8. Please introduce A450 when it was first mentioned in the manuscript.

Hailey Hornsby: Added an introduction to A450 to Serology Assay Development: “Thresholds based on the absorbance value at 450nm (A 450) for defining reactivity to spike ( A 450 0·1750) or NCP ( A 450 0·1905) were set to optimise the sensitivity of each assay.

9.In “to model how reliably quoted performance measures generalise.”, what does ‘quoted’ refer to? David Hodgson: “Quoted” refers to the sensitivity and specificity given by the manufacturers of the assays. To make this clearer we have changed the text:

“to model how reliably the sensitivity and specificity given by the manufacturers of the assays  generalise to specific subpopulations.”

10. “Using the assay validation dataset, we estimated the A450 cut-off value for a range of chosen sensitivity values, and then used this A450 cut-off to classify seropositivity in the study dataset.” How did you choose a single A450 value from a range of sensitivity values?

David Hodgson: There is an A450 value for each sensitivity value considered. To make this as clear as possible in the sentence we have changed the sentence accordingly:

“Using the assay validation dataset, we estimated an A450 cut-off value for every chosen   sensitivity value, and then used this A450 cut-off to classify seropositivity in the study dataset.”

11. “We then estimated the implied sensitivity on the HERO dataset by comparing seropositivity classification based on the estimated A450 cut-off value, with the seropositivity classification from our in-house assay (which for ease of comparison, we assume represents the maximum possible sensitivity and specificity (i.e. 100%) in this model).” What does ‘implied’ mean here?

David Hodgson: “Implied sensitivity” is defined by the description following the sentence. To make this clearer we have rearranged the sentence accordingly:

“By comparing seropositivity classification based on the estimated A450 cut-off value, with the seropositivity classification from our in-house assay (which for ease of comparison, we assume represents the maximum possible sensitivity and specificity (i.e. 100%) in this model), we estimate an “implied” sensitivity on the HERO dataset which would arise if the commercial assay alone had been used to detect seropositivity.”

12. Please cite R and cmdstanr.

David Hodgson:  Apologies for this oversight, these are now referenced in the updated manuscript.

Results:

1.  “Rapid waning of IgA responses following SARS-CoV-2 infection complicated defining positive and negative samples based on the convalescent sera we used for assay validation.” Why is this so?

Hailey Hornsby: This sentence has been expanded and rewritten in manuscript to clarify:

Compared with IgG, we saw more rapid waning of the IgA responses following SARS-CoV-2 infection, a s well as higher levels of cross-reactivity in pre-pandemic samples. These factors complicated defining seropositivity based on an A450 threshold, as there was no clear separation between titres in these two groups. We therefore opted to use our spike and NCP IgA ELISA solely to compare IgA titres of individuals classified as seropositive by our IgG assay.”

2. “1174 attended for a second visit (V2) between 15 June and 10 July 2020” ‘for’ is unnecessary in this sentence.

Hayley Colton: Amended in manuscript:

“1174 attended a second visit (V2) between 15 June and 10 July 2020”

3.Figure 1: Why do the black stars not overlap with the unadjusted mean estimates?

David Hodgson: As explained in the caption, the black stars are point estimates from the raw data (no regression was performed), calculated by as the mean value for that specific subgroup. The remaining estimates on the Figure are from a hierarchical regression analysis. We remind the reviewer that here “unadjusted” means relative to the primary exposure (job, zone, wards), and that regularisation from the hierarchical model means that estimates from this model are likely to be different from the raw point estimates. E.g. The oldest (60+ years) age group had a much smaller sample size compared to the other age groups, therefore it is more susceptible to regularisation in a regression analysis.

4. Why were zones, job and wards not used in the same model? Was their model selection done?

David Hodgson: There were two issues with using more than one of the primary response variables (zones, jobs, and wards) in the model. First we found multicollinearity between these exposures, making parameter inference difficult to interpret when more than one primary exposure was included in the regression analysis model. Second, the number of categories of these age groups gave rise to data sparsity issues (many groups have 0 entries) meaning again, results were unreliable. These reasons are explained in the Extended Data (reference 21 in the manuscript).

5. Figure 1: The raw data showed that a higher proportion of HCWs who were >60 years old were seropositive compared to the other age groups. However, there’s no longer this association after adjustment. What was/were the confounder(s)?

David Hodgson:  This is due to regularisation of the hierarchical regression model. The oldest (60+ years) age group had a much smaller sample size compared to the other age groups, therefore it is more susceptible to regularisation in a regression analysis. In other words, the magnitude of the high seroprevalence observed in the older age group (with such a small sample size) is unlikely to be a true reflection of the seroprevalence in that age groups, given the consistency in seroprevalence across the other age groups with a large sample size.

6. In the antibody kinetics model, neither zones, job and wards were used as an independent variable. As mentioned in the earlier point, would this finding of age and higher titres modulate/disappear if zones, job and/or wards are included in this model as well?

David Hodgson: As explained in the response to the reviewers main comment above, the similarity between the unadjusted and adjusted seroprevalence estimates suggest that neither zone, jobs or wards were strong confounders on the age-dependent prevalence, therefore they are dropped in the subsequent modes for parsimony.

7. Figure 3: Please specify the meaning of ‘control’ in the x-axis.

David Hodgson:  Here the control means the “Sensitivity of assay validation dataset.” This change has been made to Figure 3.

Discussion:

1. Were the infection prevention and control measures similar in the ED and AMU? Could the higher prevalence in AMU be due to transmissions from asymptomatic/ mildly symptomatic patients who were not triaged to be isolated by the ED?

Hayley Colton: Local infection prevention and control measures were trust wide rather than department led, so it would be reasonable to presume they would be similar in both areas. Trust guidance was changed so HCWs should wear surgical masks, aprons and gloves for all patient contact from 08 April 2021, so it could be possible that transmissions in this manner could have been a factor prior to 08 April 2021. If the patient had been admitted to AMU via ED, they would have exposed HCWs in both the ED and AMU, however in the AMU the exposure time to HCWs is likely to be longer, plus patients could have come via GP and therefore not exposed ED HCWs. This has been added to the manuscript:

Despite universal PPE and IPC guidelines across STH, our data show that HCWs working in AMUs are at significantly higher risk of infection with SARS-CoV-2, with seropositivity rates above that of other wards, consistent with other UK studies. [...] Although some patient factors may increase HCW risk of infection on AMU compared to ED (cohort bays, longer stays of asymptomatic/paucisymptomatic cases prior to universal PPE use, more fomites e.g. bedside tables, chairs), more frequent interactions and therefore transmission between HCWs may also play a significant additional role”

2. “Previous studies clearly demonstrate that patients with more severe COVID-19 have higher antibody titres” However this was not found in this study?

Hayley Colton: Our data looking at HCW antibody responses didn’t include information about severity of symptomatic COVID-19 infection in our HCWs, so we cannot say that we found, or did not find this in our study. However, we have added Figure S1b concerning our assay validation to our Extended Data, and referenced it in the manuscript.

3. “Reassuringly, our seroprevalence rates are similar to those seen in other UK based seroprevalence studies” Citation 5 also enrolled volunteer participants (publicised study using social media), and citation 21 is the HERO extended data?

Hayley Colton/Paul Collini: The citations on this sentence are 5 and 32 which are referring to other UK seroprevalence studies on healthcare workers. We were trying to convey that our seroprevalence rates are similar to other UK HCWs, however I appreciate that the recruitment methods are similar to our study, which probably negates the point. I have rephrased as following:

“We note the limitations of our study, which include a potential for selection bias due to participants self-enrolling for convenience, rather than using systematic sampling. While we cannot measure the extent of this effect on the measured seroprevalence, we think any volunteer bias would have been equal across all categories we compared, and so not altering the validity of these comparisons.”

4. It will be informative to have a more detailed discussion on if the finding of age affecting serology titres were reported in other studies

Hayley Colton: Thank you - we have added further detail to the age section in the discussion:

Our data complements the existing literature, which shows antibody titres against SARS-CoV-2 and other coronaviruses are higher in older individuals, which could be due to a higher risk of exposure to the virus, greater antigenic load, or boosting of antibodies from cumulative seasonal coronavirus infections throughout their lifetime15-19.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Availability Statement

    Underlying data

    Zenodo: dchodge/hero-study: Submitted version for WOR, https://doi.org/10.5281/zenodo.5215671 26 .

    The project contains the following underlying data:

    • datafit.RData

    • fit_mcmc_asymp.RData

    • fit_mcmc_change.RData

    • fit_mcmc_prev.RData

    • fit_mcmc_sens.RData

    • fit_mcmc_spec.RData

    • fit_mcmc_start.RData

    • fit_mcmc_time.RData

    • sensspec.RData

    • sero_clean.RData

    Extended data

    Zenodo: dchodge/hero-study: Submitted version for WOR, https://doi.org/10.5281/zenodo.5215671 26 .

    The project contains the following extended data:

    •    –   Table S1. Details of the samples used to set thresholds during assay validation

    •    –   Table S2. Comparison of antibody units in assay calibration curve sera assigned to the assay with WHO international standard antibody units

    •    –   Table S3. Summary of the response variables and the covariates used in the regression model.

    •    –   Table S4. Summary of the model parameters used in the regression model.

    •    –   Table S5. Summary of the response variables and the covariates used in the regression model.

    •    –   Table S6. Summary of the model parameters used in the regression model.

    •    –   Table S7. Summary of the model parameters used in the Heterogenous sensitivity model.

    •    –   Figure S1. ROC curves of the spike and NCP assays

    •    –   Figure S2. Comparison of IgA assay A450 based on IgG Serostatus

    •    –   Figure S3. Study flow diagram.

    •    –   Figure S4. Histogram (overlayed) showing the symptom onset, date of first bleed (all cases and symptomatic cases only), and time at second bleed (all cases and symptomatic cases only).

    •    –   Figure S5. Model-predicted proportion of asymptomatic estimates for three different models (A-C), adjusted and unadjusted with covariates gender, age group and ethnicity.

    •    –   Figure S6. Correlation between the four different antibody measures for 264 serological samples.

    •    –   Figure S7. Rate of decline for the antibody concentrations post-symptom onset for the four antibody measures. The fitted line is from a linear regression, with the 95% CI shown in red.

    •    –   Figure S8. Relationship between sensitivity/specificity and the cutoff value for the control dataset.

    •    –   Figure S9. ROC curves with the A 450 cut-off value indicated in red for the control dataset. x-axis shows the False Positive Rate, y-axis is the sensitivity.

    •    –   Figure S10. ROC curves for different age groups and antigen proteins, with the A 450 cut-off value indicated in various colours for the control dataset.

    •    –   Figure S11. (a) Specificity of the control data set against the implied specificity of the HERO dataset for spike and nucleoprotein. (b) Specificity of the control data set against the implied age-specific specificity of the HERO dataset for spike and nucleoprotein.

    •    –   HERO Completed STROBE Checklist[60].doc

    Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).


    Articles from Wellcome Open Research are provided here courtesy of The Wellcome Trust

    RESOURCES