Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2024 Jun 6;19(6):e0282451. doi: 10.1371/journal.pone.0282451

Excess burden of respiratory and abdominal conditions following COVID-19 infections during the ancestral and Delta variant periods in the United States: An EHR-based cohort study from the RECOVER program

Jay K Varma 1,*, Chengxi Zang 1, Thomas W Carton 3, Jason P Block 4, Dhruv J Khullar 1,2, Yongkang Zhang 1, Mark G Weiner 1, Russell L Rothman 5, Edward J Schenck 2, Zhenxing Xu 1, Kristin Lyman 1, Jiang Bian 6, Jie Xu 6, Elizabeth A Shenkman 6, Christine Maughan 7, Leah Castro-Baucom 8, Lisa O’Brien 7, Fei Wang 1, Rainu Kaushal 1; on behalf of the RECOVER Consortium
Editor: Roberto Scendoni9
PMCID: PMC11156291  PMID: 38843159

Abstract

Importance

The frequency and characteristics of post-acute sequelae of SARS-CoV-2 infection (PASC) may vary by SARS-CoV-2 variant.

Objective

To characterize PASC-related conditions among individuals likely infected by the ancestral strain in 2020 and individuals likely infected by the Delta variant in 2021.

Design

Retrospective cohort study of electronic medical record data for approximately 27 million patients from March 1, 2020-November 30, 2021.

Setting

Healthcare facilities in New York and Florida.

Participants

Patients who were at least 20 years old and had diagnosis codes that included at least one SARS-CoV-2 viral test during the study period.

Exposure

Laboratory-confirmed COVID-19 infection, classified by the most common variant prevalent in those regions at the time.

Main outcome(s) and measure(s)

Relative risk (estimated by adjusted hazard ratio [aHR]) and absolute risk difference (estimated by adjusted excess burden) of new conditions, defined as new documentation of symptoms or diagnoses, in persons between 31–180 days after a positive COVID-19 test compared to persons without a COVID-19 test or diagnosis during the 31–180 days after the last negative test.

Results

We analyzed data from 560,752 patients. The median age was 57 years; 60.3% were female, 20.0% non-Hispanic Black, and 19.6% Hispanic. During the study period, 57,616 patients had a positive SARS-CoV-2 test; 503,136 did not. For infections during the ancestral strain period, pulmonary fibrosis, edema (excess fluid), and inflammation had the largest aHR, comparing those with a positive test to those without a COVID-19 test or diagnosis (aHR 2.32 [95% CI 2.09 2.57]), and dyspnea (shortness of breath) carried the largest excess burden (47.6 more cases per 1,000 persons). For infections during the Delta period, pulmonary embolism had the largest aHR comparing those with a positive test to a negative test (aHR 2.18 [95% CI 1.57, 3.01]), and abdominal pain carried the largest excess burden (85.3 more cases per 1,000 persons).

Conclusions and relevance

We documented a substantial relative risk of pulmonary embolism and a large absolute risk difference of abdomen-related symptoms after SARS-CoV-2 infection during the Delta variant period. As new SARS-CoV-2 variants emerge, researchers and clinicians should monitor patients for changing symptoms and conditions that develop after infection.

Background

SARS-CoV-2 virus may cause persistent symptoms, exacerbations of existing conditions, or onset of new diseases in the weeks to months after initial infection [1]. These symptoms and conditions are generally referred to as “post-acute sequelae of SARS-CoV-2 infection” (PASC) in the medical literature and “long COVID” in the lay press, and defined as ongoing, relapsing, or new symptoms, or other health effects occurring after the acute phase of SARS-CoV-2 infection (i.e., present four or more weeks after the acute infection). Studies have produced markedly varying estimates of PASC, which may be due to differences in methods or due to true differences in the populations studied, severity of illness, viral genotype, or dose of virus that caused infection. One meta-analysis estimated that, globally, 49% of people report persistent symptoms 120 days after infection with an increased frequency in persons who are female or required hospitalization [2].

Since the emergence of SARS-CoV-2 in 2019, mutations in the viral nucleic acid sequence have changed the transmissibility, virulence, and immunogenicity of the virus [3]. The World Health Organization (WHO) determines whether a new genotype has phenotypic characteristics that impact public health sufficiently to be classified as a variant of concern (VOC). In 2020, the US experienced a COVID-19 “wave” (generally considered a marked increase in infections, hospitalizations, and deaths) due to the ancestral strain and subsequent waves at the end of 2020 due to a mix of the ancestral strain and the alpha variant, then again in 2021 due to the Delta variant. Analyses suggest the Delta variant was more transmissible than the ancestral strain, but not necessarily more likely to cause severe illness and death [4].

PASC may result from some combination of persistent viral infection, an exaggerated immune response to initial infection, and tissue damage from the combination of initial infection and immune response [5]. It is, therefore, possible that the frequency or characteristics of PASC may vary depending on infection with different VOCs.

We analyzed data from a large database of electronic health records (EHR) in the United States to evaluate differences in PASC among those infected during the COVID-19 wave caused by the ancestral lineage in 2020 and those infected during the COVID-19 wave caused by the Delta variant in 2021.

Methods

Cohort enrollment and follow-up

We conducted a retrospective cohort study of approximately 27 million people receiving medical care in New York and Florida in the United States from March 1, 2020, to November 30, 2021. Patient records were obtained from two large clinical research networks within PCORnet, the National Patient Centered Clinical Research Network [6]: INSIGHT, which contains records from approximately 12 million persons who received services across five health systems (Albert Einstein School of Medicine/Montefiore Medical Center, Columbia University and Weill Cornell Medicine/New York-Presbyterian Hospital, lcahn School of Medicine/Mount Sinai Health System, and New York University School of Medicine/Langone Medical Center) in the New York City (NYC) metropolitan area [7], and OneFlorida+, which contains records from approximately 15 million persons receiving services across 13 health systems (University of Florida and UF Health, Florida State University, University of Miami and UHealth, Orlando Health System, AdventHealth, Tallahassee Memorial HealthCare, Tampa General Hospital, Bond Community Health Center Inc., Nicklaus Children’s Hospital, CommunityHealth IT, University of South Florida and USF Health, University of Alabama at Birmingham, Emory University) in Florida [8]. Data elements from these databases are maintained on the PCORnet Common Data Model and mapped to the Observational Medical Outcomes Partnership Common Data Model for interoperability. Both networks receive data monthly from each in-network facility and create an integrated dataset of all encounters, diagnoses, procedures, medications, vitals, and social history.

In this analysis, patients were included if they were at least 20 years old and had at least one diagnosis code during a baseline period, and the follow-up period included at least one SARS-CoV-2 viral diagnostic test (antigen or molecular). The baseline period was defined as three years to seven days before the date of the first documented positive or negative SARS-CoV-2 test (referred to as the index date) for the infected group or the negative group, respectively. We required that patients in the negative group had all negative SARS-CoV-2 viral diagnostic tests and no COVID-19-related diagnoses. Requiring that patients had a least one diagnosis during this baseline period helped ensure that they were connected to the healthcare system and could have been diagnosed during baseline for relevant symptoms and conditions. The baseline period was defined as 3 years to 1 week before lab-confirmed SARS-CoV-2 infection and the follow-up period was defined as 31 to 180 days after the index date.

COVID-19 Variants

Our dataset did not include data about the genotype of the virus that infected each person with COVID-19. Instead, we defined the beginning and end of each variant wave using COVID-19 test data from the networks [S1 Fig] and assumed that all infections occurring during those waves were attributable to the most common genotype prevalent in those regions at the time according to data from CDC [9]. We defined the ancestral strain period as March 1 -September 30, 2020, and the Delta variant period as June 1-November 30, 2021. For this analysis, we did not analyze the winter 2020–2021 period, because multiple genotypes were circulating [10].

Data analysis

We investigated the following likely PASC categories based on prior analyses, including anemia, thromboembolism, pulmonary embolism, dementia, pulmonary fibrosis, edema, and inflammation, pressure ulcer, diabetes mellitus, malnutrition, fluid disorders, encephalopathy, abnormal heartbeat, chest pain, abdominal pain, constipation, joint pain, cognitive problems, headache, sleep disorders, dyspnea, acute pharyngitis, hair loss, edema, fever, malaise and fatigue [11]. In addition, we used ICD-10 diagnostic codes B948 (sequelae of other specified infectious and parasitic diseases) and U099 (post-COVID-19 condition, unspecified) to capture general or unspecified PASC conditions. Each condition was defined based on the Clinical Classifications Software Refined (CCSR) v2022.1. Codes that could not be attributed to COVID-19 were removed by clinicians.

We analyzed the risks of newly incident conditions, defined as new documentation of the above-mentioned PASC categories in the follow-up period that were not present in the baseline period. Specifically, we compared adjusted hazard ratios (aHR) and adjusted excess burdens of these events occurring 31–180 days after the index date, namely the follow-up period, between the SARS-CoV-2 positive group and the negative group. For each potential PASC condition, aHR was estimated by a Cox proportional hazard model, and the excess burden was defined as the difference in cumulative incidence per 1,000 patients in the positive group and negative group over the follow-up period. For example, an excess burden of 40 for symptom X indicates there were 40 more people per 1,000 with symptom X after COVID-19 infection compared with people not infected with COVID-19. We estimated cumulative incidence by the Aalen-Johansen model [12] considering death to be a competing risk for target outcomes. We adjusted for a wide range of baseline covariates by stabilized inverse propensity score re-weighting [13]. The standardized mean difference (SMD) was used to quantify the goodness-of-balance of covariates after reweighting. We considered SMD < 0.1 as being balanced in terms of each covariate and required all covariates to be balanced after re-weighting.

Both the aHR and excess burden calculations used the same covariates for adjustment. We summarized the baseline covariates in Table 1 and have included detailed descriptions in our previous studies [14, 17]. Briefly speaking, the baseline covariates included age, gender, race, ethnicity, and additional factors described here. The national-level area deprivation index (ADI) was used to assess the socioeconomic disadvantage of patients [15]. We imputed a missing ADI value with median ADI per site. Healthcare utilization was measured as the number of inpatient, outpatient, and emergency encounters (0, 1–2, 3–4, 5 or more visits for each encounter type). The Body Mass Index (BMI) was categorized according to WHO guidelines. We adopted a tailored list of the Elixhauser comorbidities and related drug categories (e.g., corticosteroid and immunosuppressant prescriptions) to capture comorbidities [16]. Patients were defined as having comorbidity if they had at least two corresponding diagnoses documented during the baseline period. The clinical features were collected during the baseline period, namely 3 years to 1 week before lab-confirmed SARS-CoV-2 infection.

Table 1. Population characteristics of the lab-confirmed SARS-CoV-2 positive patients (cases) and SARS-CoV-2 negative patients (control) for the entire study period (March 2020 to November 2021), ancestral strain period (March 2020 to September 2020), and Delta variant period (June 2021 to November 2021).

Entire Study Period Ancestral Strain Period Delta Period
Characteristics Cases Control Cases Control Cases Control
N 57,616 503,136 19,943 215,842 8,097 52,717
Median age (IQR)—yrs 53 (36–66) 57 (40–69) 53 (36–66) 57 (41–69) 48 (33–63) 57 (40–70)
Age group—no. (%)
20-<40 years 17,035 (29.6) 119,689 (23.8) 5,928 (29.7) 50,855 (23.6) 2,972 (36.7) 13,119 (24.9)
40-<55 years 13,448 (23.3) 107,868 (21.4) 4,680 (23.5) 46,058 (21.3) 2,003 (24.7) 10,579 (20.1)
55-<65 years 11,001 (19.1) 103,503 (20.6) 3,832 (19.2) 45,262 (21.0) 1,380 (17.0) 10,219 (19.4)
65-<75 years 8,641 (15.0) 97,461 (19.4) 2,957 (14.8) 41,961 (19.4) 997 (12.3) 9,993 (19.0)
75+ years 7,491 (13.0) 74,615 (14.8) 2,546 (12.8) 31,706 (14.7) 745 (9.2) 8,807 (16.7)
Sex—no. (%)
Female 34,690 (60.2) 303,693 (60.4) 11,907 (59.7) 129,090 (59.8) 5,126 (63.3) 32,304 (61.3)
Male 22,921 (39.8) 199,394 (39.6) 8,034 (40.3) 86,732 (40.2) 2,970 (36.7) 20,408 (38.7)
Race—no. (%)
Asian 2,011 (3.5) 20,351 (4.0) 771 (3.9) 9,470 (4.4) 123 (1.5) 1,795 (3.4)
Black 14,295 (24.8) 97,662 (19.4) 5,589 (28.0) 40,850 (18.9) 2,253 (27.8) 11,049 (21.0)
White 23,631 (41.0) 245,033 (48.7) 7,036 (35.3) 107,251 (49.7) 4,143 (51.2) 26,341 (50.0)
Other 13,574 (23.6) 99,544 (19.8) 5,124 (25.7) 42,773 (19.8) 1,237 (15.3) 9,307 (17.7)
Missing 4,105 (7.1) 40,546 (8.1) 1,423 (7.1) 15,498 (7.2) 341 (4.2) 4,225 (8.0)
Ethnic group—no. (%)
Hispanic: Yes 15,158 (26.3) 95,006 (18.9) 5,482 (27.5) 38,199 (17.7) 1,701 (21.0) 10,221 (19.4)
Hispanic: No 35,636 (61.9) 336,494 (66.9) 12,152 (60.9) 149,272 (69.2) 5,506 (68.0) 34,280 (65.0)
Hispanic: Other/Missing 6,822 (11.8) 71,636 (14.2) 2,309 (11.6) 28,371 (13.1) 890 (11.0) 8,216 (15.6)
Median area deprivation index (IQR)—rank 25 (12–53) 24 (10–48) 25 (12–52) 24 (10–48) 50 (27–70) 30 (15–58)
Follow-up days (IQR) 246 (133–386) 263 (136–394) 471 (317–539) 403 (255–479) 92 (59–128) 86 (57–121)
Death in follow-up (%) 724 (1.3) 4,676 (0.9) 275 (1.4) 2,665 (1.2) 102 (1.3) 380 (0.7)
No. of hospital visits in the past 3 yrs—no. (%)
Inpatient 0 38,555 (66.9) 391,264 (77.8) 13,543 (67.9) 156,773 (72.6) 4,927 (60.8) 43,269 (82.1)
Inpatient 1–2 11,419 (19.8) 70,955 (14.1) 3,704 (18.6) 35,211 (16.3) 1,741 (21.5) 6,672 (12.7)
Inpatient >= 3 7,642 (13.3) 40,917 (8.1) 2,696 (13.5) 23,858 (11.1) 1,429 (17.6) 2,776 (5.3)
Outpatient 0 5,327 (9.2) 28,119 (5.6) 1,829 (9.2) 11,857 (5.5) 1,249 (15.4) 3,846 (7.3)
Outpatient 1–2 5,944 (10.3) 50,669 (10.1) 2,263 (11.3) 22,269 (10.3) 849 (10.5) 4,909 (9.3)
Outpatient >= 3 46,345 (80.4) 424,348 (84.3) 15,851 (79.5) 181,716 (84.2) 5,999 (74.1) 43,962 (83.4)
Emergency 0 29,234 (50.7) 335,023 (66.6) 10,174 (51.0) 139,775 (64.8) 3,279 (40.5) 34,423 (65.3)
Emergency 1–2 14,669 (25.5) 97,067 (19.3) 4,959 (24.9) 41,575 (19.3) 2,050 (25.3) 11,031 (20.9)
Emergency >= 3 13,713 (23.8) 71,046 (14.1) 4,810 (24.1) 34,492 (16.0) 2,768 (34.2) 7,263 (13.8)
BMI (IQR) 28 (23–33) 26 (21–31) 27 (23–33) 26 (21–31) 29 (24–34) 26 (20–31)
BMI: <18.5 under weight 6,848 (11.9) 93,344 (18.6) 2,425 (12.2) 35,439 (16.4) 537 (6.6) 11,013 (20.9)
BMI: 18.5-<25 normal weight 10,304 (17.9) 103,463 (20.6) 3,586 (18.0) 46,361 (21.5) 1,405 (17.4) 10,271 (19.5)
BMI: 25-<30 overweight 12,936 (22.5) 116,673 (23.2) 4,376 (21.9) 50,686 (23.5) 1,794 (22.2) 11,924 (22.6)
BMI: >= 30 obese 18,248 (31.7) 137,366 (27.3) 6,224 (31.2) 58,810 (27.2) 2,826 (34.9) 14,672 (27.8)
BMI: missing 9,280 (16.1) 52,290 (10.4) 3,332 (16.7) 24,546 (11.4) 1,535 (19.0) 4,837 (9.2)
Pre-existing conditions—no. (%)
Alcohol Abuse 1,741 (3.0) 15,236 (3.0) 602 (3.0) 7,547 (3.5) 284 (3.5) 1,480 (2.8)
Anemia 8,933 (15.5) 53,673 (10.7) 3,188 (16.0) 26,112 (12.1) 1,369 (16.9) 5,314 (10.1)
Arrythmia 8,425 (14.6) 57,916 (11.5) 3,075 (15.4) 28,058 (13.0) 1,042 (12.9) 5,782 (11.0)
Asthma 6,546 (11.4) 39,531 (7.9) 2,115 (10.6) 17,885 (8.3) 1,000 (12.4) 3,888 (7.4)
Autism 136 (0.2) 866 (0.2) 58 (0.3) 359 (0.2) 23 (0.3) 104 (0.2)
Cancer 5,584 (9.7) 58,310 (11.6) 1,981 (9.9) 29,302 (13.6) 691 (8.5) 5,646 (10.7)
Chronic Kidney Disease 8,383 (14.5) 47,376 (9.4) 3,016 (15.1) 22,614 (10.5) 1,086 (13.4) 5,209 (9.9)
Chronic Pulmonary Disorders 10,644 (18.5) 70,105 (13.9) 3,535 (17.7) 33,185 (15.4) 1,596 (19.7) 6,760 (12.8)
Cirrhosis 950 (1.6) 7,953 (1.6) 295 (1.5) 3,958 (1.8) 147 (1.8) 804 (1.5)
Coagulopathy 3,652 (6.3) 17,039 (3.4) 1,319 (6.6) 8,880 (4.1) 466 (5.8) 1,595 (3.0)
Congestive Heart Failure 6,602 (11.5) 42,007 (8.3) 2,310 (11.6) 20,811 (9.6) 944 (11.7) 4,322 (8.2)
COPD 3,326 (5.8) 23,118 (4.6) 1,180 (5.9) 12,019 (5.6) 478 (5.9) 2,058 (3.9)
Coronary Artery Disease 7,400 (12.8) 52,920 (10.5) 2,522 (12.6) 25,232 (11.7) 919 (11.3) 5,296 (10.0)
Cystic Fibrosis 29 (0.1) 307 (0.1) 4 (0.0) 174 (0.1) 9 (0.1) 24 (0.0)
Dementia 2,313 (4.0) 9,430 (1.9) 997 (5.0) 4,526 (2.1) 205 (2.5) 970 (1.8)
Diabetes Type 1 829 (1.4) 4,676 (0.9) 291 (1.5) 2,241 (1.0) 132 (1.6) 440 (0.8)
Diabetes Type 2 13,004 (22.6) 77,745 (15.5) 4,694 (23.5) 34,920 (16.2) 1,635 (20.2) 8,376 (15.9)
Down’s Syndrome 65 (0.1) 251 (0.0) 30 (0.2) 94 (0.0) 14 (0.2) 28 (0.1)
End Stage Renal Disease on Dialysis 2,468 (4.3) 10,795 (2.1) 1,012 (5.1) 5,611 (2.6) 320 (4.0) 1,089 (2.1)
Hemiplegia 785 (1.4) 4,292 (0.9) 313 (1.6) 2,216 (1.0) 86 (1.1) 385 (0.7)
HIV 774 (1.3) 7,074 (1.4) 278 (1.4) 3,060 (1.4) 95 (1.2) 695 (1.3)
Hypertension 23,650 (41.0) 169,604 (33.7) 8,041 (40.3) 75,509 (35.0) 3,226 (39.8) 18,288 (34.7)
Inflammatory Bowel Disorder 618 (1.1) 7,693 (1.5) 199 (1.0) 3,602 (1.7) 100 (1.2) 655 (1.2)
Lupus or Systemic Lupus Erythematosus 547 (0.9) 3,430 (0.7) 157 (0.8) 1,617 (0.7) 96 (1.2) 346 (0.7)
Mental Health Disorders 7,665 (13.3) 54,170 (10.8) 2,615 (13.1) 25,421 (11.8) 1,252 (15.5) 5,724 (10.9)
Multiple Sclerosis 296 (0.5) 2,003 (0.4) 71 (0.4) 960 (0.4) 45 (0.6) 198 (0.4)
Other Substance Abuse 4,919 (8.5) 45,367 (9.0) 1,509 (7.6) 21,651 (10.0) 1,109 (13.7) 4,850 (9.2)
Parkinson’s Disease 365 (0.6) 3,347 (0.7) 141 (0.7) 1,474 (0.7) 33 (0.4) 411 (0.8)
Peripheral vascular disorders 4,257 (7.4) 30,330 (6.0) 1,521 (7.6) 14,727 (6.8) 527 (6.5) 3,076 (5.8)
Pregnant 3,112 (5.4) 26,072 (5.2) 1,085 (5.4) 10,982 (5.1) 678 (8.4) 3,432 (6.5)
Pulmonary Circulation Disorder 2,272 (3.9) 11,967 (2.4) 796 (4.0) 6,341 (2.9) 329 (4.1) 1,110 (2.1)
Rheumatoid Arthritis 997 (1.7) 6,870 (1.4) 347 (1.7) 3,176 (1.5) 131 (1.6) 666 (1.3)
Seizure/Epilepsy 1,585 (2.8) 9,720 (1.9) 628 (3.1) 4,814 (2.2) 244 (3.0) 974 (1.8)
Severe Obesity (BMI>= 40 kg/m2) 5,298 (9.2) 31,619 (6.3) 1,803 (9.0) 13,859 (6.4) 882 (10.9) 3,477 (6.6)
Sickle Cell 482 (0.8) 2,644 (0.5) 164 (0.8) 1,248 (0.6) 90 (1.1) 299 (0.6)
Weight Loss 2,728 (4.7) 14,932 (3.0) 985 (4.9) 8,111 (3.8) 385 (4.8) 1,283 (2.4)
Corticosteroids Drugs 9,252 (16.1) 56,698 (11.3) 3,076 (15.4) 27,730 (12.8) 1,467 (18.1) 5,271 (10.0)
Immunosuppressant Drugs 3,123 (5.4) 18,042 (3.6) 1,054 (5.3) 8,888 (4.1) 432 (5.3) 1,713 (3.2)

a. The lab-confirmed SARS-CoV-2 positive and negative patients were identified by polymerase chain reaction (PCR) test or antigen test. Negative patients were further required no documented COVID-19 related diagnoses at any time. IQR denotes inter-quartile range. Percentage may not sum up to 100 because of rounding.

c. Coexisting conditions existed if two records in the 3-years prior to index event. See detailed phenotyping codes in the appendix. SLE: Systemic Lupus Erythematosus; COPD: Chronic obstructive pulmonary disease.

We reported PASC conditions if they had: adjusted hazard ratio > 1; P-value <3.6 x 10−4 (corrected by the Bonferroni method to control for false discovery) in multiple test settings; and at least 100 patients with the condition. We reported adjusted hazard ratios with a 95% confidence interval. We used Python 3.9, python package lifelines-0.2666 for survival analysis, and scikit-learn-0.2318 for machine learning models. The code is available at https://github.com/calvin-zcx/pasc_phenotype.

Ethical review

The use of the INSIGHT data was approved by the Institutional Review Board (IRB) of Weill Cornell Medicine following NIH protocol 21-10-95-380 with protocol title: Adult PCORnet-PASC Response to the Proposed Revised Milestones for the PASC EHR/ORWD Teams (RECOVER). The use of the OneFlorida+ data for this study was approved under the University of Florida IRB number IRB202001831. The IRBs waived informed consent for this observational cohort study that involves a limited, but not de-identified, data set.

Results

Demographics

Our final dataset included data from 560,752 patients [Fig 1]. The median age of patients was 57 years (interquartile range [IQR] 40–69), and 338,383 (60.3%) were female [Table 1]. According to EHR data, 268,664 (47.9%) patients identified as non-Hispanic white, 111,957 (20.0%) identified as non-Hispanic Black, 110,164 (19.6%) identified as Hispanic, 22,362 (4.0%) identified as Asian/Pacific Islander, and 44,651 (8.0%) had missing race and ethnicity. The median Body Mass Index (BMI) was 26 (IQR 21, 32).

Fig 1. Patients included from both the INSIGHT and OneFlorida+, March 2020 to November 2021.

Fig 1

The post-acute sequelae of SARS-CoV-2 infection (PASC) outcomes were ascertained from day 30 after the SARS-CoV-2 infection (index date) and the adjusted risk was computed 180 days after the index date.

For the overall study period, 57,616 patients had a positive molecular or antigen test (99.8% were molecular) for SARS-CoV-2 (19,943 during the ancestral strain period, 8,097 during Delta variant period, 29,576 outside of these two periods), and 503,136 did not (215,842 during ancestral strain period, 52,717 during Delta variant period, 234,577 outside of these two periods). We observed that cases in the Delta variant period had a median age of 48 (IQR 33, 63), and cases in the ancestral strain period had a median age of 53 ([IQR 36, 66]). The proportion of female cases in the Delta variant period was 63.3% compared with 59.7% in the ancestral strain period.

Further information about co-morbid conditions of both cases and controls over the different periods is in Table 1.

PASC associated with ancestral strain period

We found an increased risk of multiple symptoms and conditions from ancestral strain infections. As shown in Fig 2, the largest adjusted hazard ratios, comparing patients with positive vs. negative were for: pulmonary fibrosis, edema, and inflammation (2.32 [95% CI 2.09 2.57]), hair loss (2.10[95% CI 1.74 2.52]), pressure ulcers (1.98 [95% CI 1.69 2.32]), pulmonary embolism (1.65 [95% CI 1.36 2.99]), dyspnea (1.58 [95% CI 1.45 1.69]), dementia (1.50 [95% CI 1.28 1.74]). As shown in Fig 3, the largest excess burdens were for: dyspnea (47.6 more cases per 1000 persons), pulmonary fibrosis, edema, and inflammation (21.5), malaise and fatigue (18.2), edema (17.6), chest pain (16.9), abnormal heartbeat (15.4), cognitive problems (12.8), and joint pain (11.5).

Fig 2. Adjusted hazard ratios of likely incident PASC conditions in all time (March 2020 to November 2021) versus ancestral strain period (March 2020 to September 2020) versus Delta variant period (June 2021 to November 2021).

Fig 2

Sequelae outcomes were ascertained from day 30 after the SARS-CoV-2 infection and the adjusted hazard ratio was computed at 180 days after the SARS-CoV-2 infection. The color panels from top to bottom represent different organ systems, including the nervous system or mental disorders, skin, respiratory system, circulatory system, blood, endocrine and metabolic, digestive system, and other signs.

Fig 3. Adjusted excess burden per 1,000 patients of potential incident PASC conditions for the entire study period (March 2020-November 2021), ancestral strain period (March 2020-September 2020), and Delta variant period (June 2021-November 2021).

Fig 3

The excess burdens were represented by a heatmap with deeper colors representing more significant burdens. The color panels from top to bottom represent different organ systems, including the nervous system or mental disorders, skin, respiratory system, circulatory system, blood, endocrine and metabolic, digestive system, and other signs.

PASC associated with Delta variant period

The spectrum of PASC-related symptoms and conditions from Delta infections varied from ancestral strain infections. For the Delta variant period, the largest adjusted hazard ratios, comparing patients with positive vs. negative tests [Fig 2] were for: pulmonary embolism (2.18 [95% CI 1.57 3.01]), hair loss (2.07 [95% CI 1.31 3.27]), pulmonary fibrosis, edema, and inflammation (1.99 [95% CI 1.54 2.57]), pressure ulcers (1.64 [95% CI 1.15 2.35]), acute pharyngitis (1.64 [95% CI 1.31 2.05]), chest pain (1.43 [95% CI 1.21 1.69]). As shown in Fig 3, the largest excess burdens were for: abdominal pain (85.3 more cases per 1000 persons), dyspnea (84.8), chest pain (70.7), joint pain (68.4), anemia (48.9), headache (37.2), acute pharyngitis (36.0), malaise and fatigue (34.3), dementia (25.8), pulmonary fibrosis, edema, and inflammation (21.0), and edema (18.8).

Similar patterns were observed in the region-specified sensitivity analyses as shown in S2 Fig and S1 Table.

Discussion

In this study of more than half a million patients, we documented an increased burden of abdomen-related conditions during the Delta variant period and a substantial burden of post-acute pulmonary-related conditions in people infected with COVID-19 during both the ancestral strain and Delta variant periods.

Our statistical approach considered both the relative risk of a new condition, expressed as a hazard ratio, and the risk difference, expressed as an excess burden. The risk difference provides a measure of public health impact because it estimates how much each symptom or condition could potentially be reduced if patients had not been infected with COVID-19 ancestral or Delta variants. We estimate that, for every 1,000 patients, there were an additional 85 persons with abdominal pain after Delta variant infection than in those without documentation of Delta variant infection. The excess burden was, notably, not found during the ancestral strain period. Since the beginning of the pandemic, clinicians have noted that patients may present with pronounced gastrointestinal symptoms, possibly due to direct infection, alteration of the gut microbiome, or enhanced immune response [17, 18]. Whether these same mechanisms explain post-acute sequelae is unclear. Further research is needed to understand the long-term prognosis of PASC-related abdominal pain and to quantify its excess burden, given that international experts did not include gastrointestinal conditions in its core outcome set for evaluating patients [19].

The excess burden of pulmonary-related conditions was large and markedly increased for the Delta variant period. We found a 78% increase in post-acute dyspnea from the ancestral strain to Delta variant periods (47.6 additional persons vs. 84.8 additional persons), even though the excess burden of diagnosed conditions that could explain dyspnea (pulmonary fibrosis, edema, and inflammation) was similar (21.5 vs. 21.0) during both periods. We do not know whether the increase in burden during the Delta variant period represents a specific change in the virus or some other factor, such as the types of persons infected, the dose, duration, or route of exposure, or increased awareness and documentation by providers. In the absence of hypoxemia, no current treatment exists for persistent dyspnea, although novel strategies to help patients, such as breathlessness training, are being evaluated [20]. Notably, the largest relative risk was associated with pulmonary embolism, a well-documented COVID-19 complication that could also cause persistent dyspnea [21].

Our study is subject to important limitations. First, we may have misclassified patients as not infected with COVID-19, because a test was never performed or not recorded in the in-network facilities. This may have been more likely during the follow-up of the post-Delta wave, as the follow-up period overlapped with the increasing availability of home testing. Such misclassification would likely lead us to underestimate the prevalence and relative risk of PASC, particularly during the first wave when diagnostic testing was less widely available. It is also possible that persons who did not test positive had other infections, which could produce illnesses similar to PASC [22]. Further misclassification could have occurred, because we assumed all infections during a wave were attributable to the most prevalent variant circulating at that time or because patients suffered viral or bacterial co-infections at the same time as COVID-19 [2326]. Second, we were not able to obtain vaccination information, which is important for further research given our finding that the burden of PASC was high during the Delta wave, despite the widespread availability of vaccines at that time [27]. Third, the number of cases during the Delta variant period was substantially higher in Florida (S1 Fig and S1 Table), and, for several conditions, the magnitude of excess burden varied between NY and Florida (S2 Fig), but our major conclusions still hold. Fourth, while we adjusted for many characteristics, other unmeasured factors or missing values (e.g., missing ADI value, smoking status, etc.) could explain differences. We examined PASC in patients who were still alive beyond their acute infection period, namely alive beyond 30 days, and we modeled death in the follow-up period (Table 1) as a competing risk in our analyses. However, different mortality rates in different periods might also contribute to the observed PASC differences. Finally, some conditions, such as pressure ulcers, may be attributable to prolonged hospitalization, rather than infection itself.

In conclusion, we found that conditions associated with PASC vary by viral variant. As the virus continues to evolve new variants rapidly, researchers and clinicians need to monitor for changing symptoms and conditions associated with COVID-19 infection. From the perspective of patients, physicians need to be aware that PASC may present differently in the future as new variants emerge and that treatment efficacy may vary by the variant that initially caused PASC.

Supporting information

S1 Fig. Lab-confirmed new SARS-CoV-2 cases per 10,000 patients with different variants of concerns in the INSIGHT and OneFlorida+ cohorts.

(TIFF)

pone.0282451.s001.tiff (207KB, tiff)
S2 Fig. Adjusted excess burdens per 1,000 patients of potential incident PASC conditions in all time (March 2020 to November 2021) versus ancestral strain period (March 2020 to September 2020) versus Delta variant period (June 2021 to November 2021), stratified by different regions (NYC vs. Florida).

The sequelae outcomes were ascertained from day 30 after the SARS-CoV-2 infection and the adjusted hazard ratio were computed 180 days after the SARS-CoV-2 infection.

(TIFF)

pone.0282451.s002.tiff (13MB, tiff)
S1 Table. Number of patients in all time (March 2020 to November 2021) versus ancestral strain period (March 2020 to September 2020) versus Delta variant period (June 2021 to November 2021), stratified by different regions, NYC Insight vs. Florida OneFlorida.

(DOCX)

pone.0282451.s003.docx (10.7KB, docx)

Acknowledgments

Members of the RECOVER Consortium include: Parsa Mirhaji, Albert Einstein College of Medicine; Ravi Jhaveri, Ann & Robert H. Lurie Children’s Hospital of Chicago; Chris Forrest, Children’s Hospital of Philadelphia; Hiroki Morizono, Children’s National Medical Center; Nathan Pajor, Cincinnati Children’s; Soumitra Sengupta, Columbia University; Schuyler Jones, Duke University/Health System; Benjamin D. Horne, Intermountain Healthcare; Tom Carton, LPHI; Bradley Taylor, Medical College of Wisconsin; Leslie Lenert, Medical University of South Carolina; Abu Mosa, University of Missouri; Carol Horowitz, Icahn School of Medicine at Mount Sinai; Kelly Kelleher, Nationwide Children’s Hospital; H Timothy Bunnell, Nemours; David Liebovitz, Northwestern University; Saul Blecker, NYU Langone Health; Marion Sills, OCHIN, Inc.; Dan Fort, Ochsner Health; William Hogan, University of Florida; Asa Oxner, University of South Florida; Rishi Kamaleswaran, Emory; Nick Tsinoremas, University of Miami; Daria Salyakina, Nicklaus Children’s Hospital; Cynthia Chuang, Penn State U College of Medicine; Dimitri Christakis, Seattle Children’s; Anuradha Paranjape, Temple University; Soledad Fernandez, The Ohio State University; Susan Kim, University of California San Fransisco; Elizabeth Chrischilles, University of Iowa; David Williams, University of Michigan; Carol Geary, University of Nebraska Medical Center; Suresh Srinivasan, Jonathan Arnold; Michael Bechich, University of Pittsburgh; Mollie Cummins, University of Utah; Lindsay Cowell, UT Southwestern Medical Center; Yacob Tedla, Vanderbilt University Medical Center; Stephen Downs, Wake Forest University Health Sciences; Rainu Kaushal, Weill Cornell Medicine (corresponding author); Alka Khaitan, Indiana University; Alan Schroeder, Stanford; Suchitra Rao, Univ. Colorado- Denver; Jyotsna Fuloria, University Medical Center New Orleans; Jason Block, Harvard Pilgram; Daria Salyakina, Nicklaus Children’s Hospital.

Statements and acknowledgements

  1. Authorship has been determined by ICJME recommendation

  2. Disclosures to be obtained at time of submission

  3. The content is solely the responsibility of the authors and does not necessarily represent the official views of the RECOVER Program or funders

  4. We would like to thank the National Community Engagement Group (NCEG), all patient, caregiver and community Representatives, and all the participants enrolled in the RECOVER Initiative.

Data Availability

There are legal and ethical restrictions on data sharing because the Institutional Review Board of Weill Cornell Medicine did not approve public data deposition. The data set used for this study constitutes sensitive patient information extracted from the electronic health records. Accordingly, it is subject to federal legislation that limits our ability to disclose it to the public, even after it has been subjected to deidentification techniques. To request the access of the de-identified minimal dataset underlying these findings, interested and qualified researchers should contact INSIGHT Clinical Research Network (https://insightcrn.org/) or Alexandra LaMar at all4008@med.cornell.edu.

Funding Statement

This study is part of the NIH Researching COVID to Enhance Recovery (RECOVER) Initiative, which seeks to understand, treat, and prevent the post-acute sequelae of SARS-CoV-2 infection (PASC). This research was funded by the National Institutes of Health (NIH) Agreement OTA OT2HL161847 as part of the Researching COVID to Enhance Recovery (RECOVER) research program. NIH played a role in evaluating and developing the overall structure of the RECOVER research program, but not in the design and analysis of this specific study.

References

  • 1.Michelen M, Manoharan L, Elkheir N, et al. Characterising long COVID: a living systematic review. BMJ Global Health 2021;6:e005427. doi: 10.1136/bmjgh-2021-005427 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Chen C, Haupert SR, Zimmermann L, Shi X, Fritsche LG, Mukherjee B. Global Prevalence of Post COVID-19 Condition or Long COVID: A Meta-Analysis and Systematic Review. J Infect Dis. 2022. Apr 16:jiac136. doi: 10.1093/infdis/jiac136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Otto SP, Day T, Arino J, et al. The origins and potential future of SARS-CoV-2 variants of concern in the evolving COVID-19 pandemic. Curr Biol. 2021;31(14):R918–R929. doi: 10.1016/j.cub.2021.06.049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Kläser K., Molteni E., Graham M. et al. COVID-19 due to the B.1.617.2 (Delta) variant compared to B.1.1.7 (Alpha) variant of SARS-CoV-2: a prospective observational cohort study. Sci Rep 2022;12:10904. doi: 10.1038/s41598-022-14016-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Castanares-Zapatero D, Chalon P, Kohn L, et al. Pathophysiology and mechanism of long COVID: a comprehensive review. Ann Med. 2022;54(1):1473–1487. doi: 10.1080/07853890.2022.2076901 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Forrest CB, McTigue KM, Hernandez AF, et al. PCORnet® 2020: current state, accomplishments, and future directions. J Clin Epidemiol. 2021. Jan;129:60–67. Epub 2020 Sep 28. doi: 10.1016/j.jclinepi.2020.09.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kaushal R. et al. Changing the research landscape: the New York City Clinical Data Research Network. J. Am. Med. Inform. Assoc. 2014;21:587–590. doi: 10.1136/amiajnl-2014-002764 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Shenkman E. et al. OneFlorida Clinical Research Consortium: Linking a Clinical and Translational Science Institute With a Community-Based Distributive Medical Education Model. Acad. Med. 2018;93:451–455. doi: 10.1097/ACM.0000000000002029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Centers for Disease Control and Prevention. https://covid.cdc.gov/covid-data-tracker/#variant-proportions-state-jurisdiction
  • 10.Annavajhala MK, Mohri H, Wang P, Nair M, Zucker JE, Sheng Z, et al. Emergence and expansion of SARS-CoV-2 B.1.526 after identification in New York. Nature. 2021. Sep;597(7878):703–708. Epub 2021 Aug 24. doi: 10.1038/s41586-021-03908-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Zang Chengxi, Zhang Yongkang, Xu Jie, Bian Jiang, Morozyuk Dmitry, et al. "Data-driven analysis to understand long COVID using electronic health records from the RECOVER initiative." Nature Communications 14, no. 1 (2023): 1948. doi: 10.1038/s41467-023-37653-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Aalen O. O. & Johansen S. An Empirical Transition Matrix for Non-Homogeneous Markov Chains Based on Censored Observations. Scand. J. Stat. 1978;5:141–150. [Google Scholar]
  • 13.Xu S. et al. Use of Stabilized Inverse Propensity Scores as Weights to Directly Estimate Relative Risk and Its Confidence Intervals. Value Health. 2010;13:273–277. doi: 10.1111/j.1524-4733.2009.00671.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Zhang Hao, Zang Chengxi, Xu Zhenxing, Zhang Yongkang, Xu Jie, et al. "Data-driven identification of post-acute SARS-CoV-2 infection subphenotypes." Nature Medicine 29, no. 1 (2023): 226–235. doi: 10.1038/s41591-022-02116-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Kind A. J. H. & Buckingham W. R. Making Neighborhood-Disadvantage Metrics Accessible—The Neighborhood Atlas. N. Engl. J. Med. 2018;378:2456–2458. doi: 10.1056/NEJMp1802313 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Elixhauser Comorbidity Software Refined for ICD-10-CM. https://www.hcup-us.ahrq.gov/toolssoftware/comorbidityicd10/comorbidity_icd10.jsp
  • 17.Cheung KS, Hung IFN, Chan PPY, et al. Gastrointestinal Manifestations of SARS-CoV-2 Infection and Virus Load in Fecal Samples From a Hong Kong Cohort: Systematic Review and Meta-analysis. Gastroenterology. 2020;159(1):81–95. doi: 10.1053/j.gastro.2020.03.065 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Chen TH, Hsu MT, Lee MY, Chou CK. Gastrointestinal Involvement in SARS-CoV-2 Infection. Viruses. 2022;14(6):1188. Published 2022 May 30. doi: 10.3390/v14061188 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Munblit D, Nicholson T, Akrami A, et al. A core outcome set for post-COVID-19 condition in adults for use in clinical practice and research: an international Delphi consensus study. Lancet Respir Med. 2022;10(7):715–724. doi: 10.1016/S2213-2600(22)00169-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Philip KEJ, Owles H, McVey S, et al. An online breathing and wellbeing programme (ENO Breathe) for people with persistent symptoms following COVID-19: a parallel-group, single-blind, randomised controlled trial. Lancet Respir Med. 2022. doi: 10.1016/S2213-2600(22)00125-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Suh YJ, Hong H, Ohana M, et al. Pulmonary Embolism and Deep Vein Thrombosis in COVID-19: A Systematic Review and Meta-Analysis. Radiology. 2021;298(2):E70–E80. doi: 10.1148/radiol.2020203557 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Taquet M, Dercon Q, Luciano S, Geddes JR, Husain M, Harrison PJ. Incidence, co-occurrence, and evolution of long-COVID features: A 6-month retrospective cohort study of 273,618 survivors of COVID-19. PLoS Med 2021;18(9): e1003773. doi: 10.1371/journal.pmed.1003773 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Dadashi M, Khaleghnejad S, Abedi Elkhichi P, et al. COVID-19 and Influenza Co-infection: A Systematic Review and Meta-Analysis. Front Med (Lausanne). 2021;8:681469. Published 2021 Jun 25. doi: 10.3389/fmed.2021.681469 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Scendoni R, Bury E, Lima Arrais Ribeiro I, et al. Leading Pathogens Involved in Co-Infection and Super-Infection with COVID-19: Forensic Medicine Considerations after a Systematic Review and Meta-Analysis. Pathogens. 2023;12(5):646. Published 2023 Apr 27. doi: 10.3390/pathogens12050646 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Patton MJ, Orihuela CJ, Harrod KS, et al. COVID-19 bacteremic co-infection is a major risk factor for mortality, ICU admission, and mechanical ventilation. Crit Care. 2023;27(1):34. Published 2023 Jan 23. doi: 10.1186/s13054-023-04312-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Burrel S, Hausfater P, Dres M, et al. Co-infection of SARS-CoV-2 with other respiratory viruses and performance of lower respiratory tract samples for the diagnosis of COVID-19. Int J Infect Dis. 2021;102:10–13. doi: 10.1016/j.ijid.2020.10.040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Notarte KI, Catahay JA, Velasco JV, et al. Impact of COVID-19 vaccination on the risk of developing long-COVID and on existing long-COVID symptoms: A systematic review. EClinicalMedicine. 2022;53:101624. Published 2022 Aug 27. doi: 10.1016/j.eclinm.2022.101624 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Roberto Scendoni

17 Aug 2023

PONE-D-23-04388Excess burden of respiratory and abdominal conditions following COVID-19 infections during the ancestral and Delta variant periods in the United States: An EHR-based cohort study from the RECOVER ProgramPLOS ONE

Dear Dr. Jay K Varma, 

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Oct 01 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Roberto Scendoni

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified (1) whether consent was informed and (2) what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

If you are reporting a retrospective study of medical records or archived samples, please ensure that you have discussed whether all data were fully anonymized before you accessed them and/or whether the IRB or ethics committee waived the requirement for informed consent. If patients provided informed written consent to have data from their medical records used in research, please include this information.

3. Thank you for stating the following in the Acknowledgments Section of your manuscript:

"This research was funded by the National Institutes of Health (NIH) Agreement OTA OT2HL161847 as part of the Researching COVID to Enhance Recovery (RECOVER) research program."

We note that you have provided additional information within the Acknowledgements Section that is not currently declared in your Funding Statement. Please note that funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

"This study is part of the NIH Researching COVID to Enhance Recovery (RECOVER) Initiative, which seeks to understand, treat, and prevent the post-acute sequelae of SARS-CoV-2 infection (PASC). This research was funded by the National Institutes of Health (NIH) Agreement OTA OT2HL161847 as part of the Researching COVID to Enhance Recovery (RECOVER) research program. NIH played a role in evaluating and developing the overall structure of the RECOVER research program, but not in the design and analysis of this specific study."

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

4. One of the noted authors is a group or consortium "the RECOVER Consortium". In addition to naming the author group, please list the individual authors and affiliations within this group in the acknowledgments section of your manuscript. Please also indicate clearly a lead author for this group along with a contact email address

5. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.   

In your revised cover letter, please address the following prompts:

    a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

   b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Thank you for the opportunity to review this manuscript. It is generally well written and clear, so my comments focus primarily on a few methodological questions and presentation.

In the methods, the authors state, “Patients were defined as having a comorbidity… 2 corresponding diagnoses documented during the baseline period.” I believe that I read earlier that the baseline period was as short as 7 days. Is it possible to have had 2 corresponding diagnoses during that short a time period? This is minor, but it does not make sense to me. Also, earlier in that paragraph “inpatients” should be “inpatient.”

Which covariates from Table 1 were included in the hazards ratio calculations? These should be stated in the methods.

The excess burden is quite different between the two time periods. Is it possible that more people died during the ancestral period, thus did not develop PASC? Were they included if they subsequently died? Was death considered a PASC? In any event, it seems that these differences in the magnitude of excess burden bears more discussion.

Figure 1 Legend: “Patient” should be “Patients,” suggest adding “(index date)” after “SARS-CoV-2.” The text about measurement dates following the title should be included in the methods.

Figure 1: There seem to be connector lines missing below the SARS cases and controls boxes and individual connector lines missing below the 6 boxes on the next row.

Figure 2: What is the significance of the dotted lines?

Figure 3: This figure is pretty, but a bit confusing. I do not believe that the figure of the person is necessary and should come out. You could more clearly present the data by adding a column to the left of the “all” column that indicates the system for each group of symptoms/conditions and also remove the color from those boxes. It might be helpful to state that the figure is a heat map and indicate that the “thermometer” is the legend.

I did not see reference to Supplemental Fig 2 in the text. Did I miss it?

Reviewer #2: I thank you the authors for the work they have done. I think the manuscript touches upon an important topic in medicine. The large sample makes the analysis robust, although this study is observational in nature. The main limitation of the study is that even though the data may reveal some associations, it has to be interpreted with caution due to the inherent limitations of the study design. I have some comments that would help to improve the overall quality of the manuscript:

-Can the authors comment how many facilities send data to the networks used to collect the data of the study?

-One important limitation for the study is that negative cases are only classified based on the data from the in-network facilities, but if a patient classified as “negative” had indeed a positive test reported in a different facility (or an in-house / over-the-counter test) it would be missed by the current study search strategy. The authors should state this important limitation in the discussion section of the paper.

-As the authors state, an important limitation is the fact that other infections can account for the PASC conditions. However, the authors should also mention that non-infectious conditions could account for some of these conditions, and that’s another limitation as the authors did not control for comorbid conditions.

-What does the U099/B948 code mean?

-The authors should also state in their limitations that the imputation of a missing ADI value with median ADI per site may be not an accurate estimation.

-Can the authors comment on the fact that the number of patients with positive tests during the time the Delta variant circulated was significantly lower than the number of positive tests during the time of the ancestral variant? As the Delta variant is believed to have spread faster, it is really surprising the authors found these differences. If this is related to the fact that most positive patients for the ancestral variant were diagnosed in NY state, the authors should elaborate on this. The same is true for the negative cases. There seems to be a significantly lower number of cases tested during the Delta period.

-Could the authors provide the data of how many positive cases and how many negative cases in each variant wave (ancestral vs. delta variant) were from New York, and how many were from Florida?

-I am not sure if the following sentence is appropriate for the discussion “In the absence of hypoxemia, no current treatment exists for persistent dyspnea, although novel strategies to help patients, such as breathlessness training, are being evaluated.18”it feels a little bit out of context to me.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 Jun 6;19(6):e0282451. doi: 10.1371/journal.pone.0282451.r002

Author response to Decision Letter 0


28 Nov 2023

RESPONSE TO REVIEWERS

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

Response: We have revised accordingly.

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified (1) whether consent was informed and (2) what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

Response: We have inserted the following sentence into the section on Ethical Review: “The IRBs waived informed consent for this observational cohort study that involves a limited, but not de-identified, data set.”

3. Thank you for stating the following in the Acknowledgments Section of your manuscript:

"This research was funded by the National Institutes of Health (NIH) Agreement OTA OT2HL161847 as part of the Researching COVID to Enhance Recovery (RECOVER) research program."

We note that you have provided additional information within the Acknowledgements Section that is not currently declared in your Funding Statement. Please note that funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

"This study is part of the NIH Researching COVID to Enhance Recovery (RECOVER) Initiative, which seeks to understand, treat, and prevent the post-acute sequelae of SARS-CoV-2 infection (PASC). This research was funded by the National Institutes of Health (NIH) Agreement OTA OT2HL161847 as part of the Researching COVID to Enhance Recovery (RECOVER) research program. NIH played a role in evaluating and developing the overall structure of the RECOVER research program, but not in the design and analysis of this specific study."

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

Response: We have revised accordingly.

4. One of the noted authors is a group or consortium "the RECOVER Consortium". In addition to naming the author group, please list the individual authors and affiliations within this group in the acknowledgments section of your manuscript. Please also indicate clearly a lead author for this group along with a contact email address

Response: Consortium members are now listed in the Acknowledgements. Corresponding author listed on title page.

5. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

Response: Data availability now addressed in Cover Letter.

Reviewers' comments:

RESPONSE TO REVIEWERS

Reviewer #1: Thank you for the opportunity to review this manuscript. It is generally well written and clear, so my comments focus primarily on a few methodological questions and presentation. In the methods, the authors state, “Patients were defined as having a comorbidity… 2 corresponding diagnoses documented during the baseline period.” I believe that I read earlier that the baseline period was as short as 7 days. Is it possible to have had 2 corresponding diagnoses during that short a time period?

Response: The clinical features were collected during the baseline period, which is defined as 3 years to 1 week before lab-confirmed SARS-CoV-2 infection. We have revised the texts in Methods-Cohort Enrollment and Follow-up, and Methods-Data Analysis subsections accordingly to reflect this.

Reviewer #1: This is minor, but it does not make sense to me. Also, earlier in that paragraph “inpatients” should be “inpatient.”

Response: We have changed inpatients to inpatient in the Methods-Data Analysis subsection.

Reviewer #1: Which covariates from Table 1 were included in the hazards ratio calculations? These should be stated in the methods.

Response: We used all the covariates in Table 1 for our adjusted analyses following our previous work 17,22. We have revised the paragraph as follows:

Both the aHR and excess burden calculations used the same covariates for adjustment. We summarized the baseline covariates in Table 1 and have included detailed descriptions in our previous studies. 17,22 Baseline covariates included age, gender, race, ethnicity, and additional factors described here. The national-level area deprivation index (ADI) was used to assess the socioeconomic disadvantage of patients. We imputed a missing ADI value with median ADI per site. Healthcare utilization was measured as the number of inpatient, outpatient, and emergency encounters (0, 1-2, 3-4, 5 or more visits for each encounter type). The Body Mass Index (BMI) was categorized according to WHO guidelines. We adopted a tailored list of the Elixhauser comorbidities and related drug categories (e.g., corticosteroid and immunosuppressant prescriptions) to capture comorbidities. Patients were defined as having comorbidity if they had at least two corresponding diagnoses documented during the baseline period. The clinical features were collected during the baseline period, namely 3 years to 1 week before lab-confirmed SARS-CoV-2 infection.

17Zang, Chengxi, Yongkang Zhang, Jie Xu, Jiang Bian, Dmitry Morozyuk, Edward J. Schenck, Dhruv Khullar et al. "Data-driven analysis to understand long COVID using electronic health records from the RECOVER initiative." Nature Communications 14, no. 1 (2023): 1948.

22 Zhang, Hao, Chengxi Zang, Zhenxing Xu, Yongkang Zhang, Jie Xu, Jiang Bian, Dmitry Morozyuk et al. "Data-driven identification of post-acute SARS-CoV-2 infection subphenotypes." Nature Medicine 29, no. 1 (2023): 226-235.”

Reviewer #1: The excess burden is quite different between the two time periods. Is it possible that more people died during the ancestral period, thus did not develop PASC? Were they included if they subsequently died? Was death considered a PASC? In any event, it seems that these differences in the magnitude of excess burden bears more discussion.

Response: We did not consider death as a component of PASC. We analyzed PASC in patients who were still alive beyond their acute infection period, namely alive beyond + 30 days, and we modeled death as a competing risk in our analyses. We further reported death data in the follow-up period (+30 – 180 days) in Table 1. We added text about this to the Discussion-Limitations paragraph.

Entire Study Period Ancestral Strain Period Delta Period

Characteristics Cases Control Cases Control Cases Control

Death in follow-up (%) 724 (1.3) 4,676 (0.9) 275 (1.4) 2,665 (1.2) 102 (1.3) 380 (0.7)

Reviewer #1: Figure 1 Legend: “Patient” should be “Patients,” suggest adding “(index date)” after “SARS-CoV-2.”

Response: We have revised accordingly.

Reviewer #1: The text about measurement dates following the title should be included in the methods.

Response: We have added the following description to the Methods section, “The baseline period was defined as 3 years to 1 week before lab-confirmed SARS-CoV-2 infection and the follow-up period was defined as 31 to 180 days after the index date.17,22”

Reviewer #1: Figure 1: There seem to be connector lines missing below the SARS cases and controls boxes and individual connector lines missing below the 6 boxes on the next row.

Response: We have re-designed the Fig1.

Reviewer #1: Figure 2: What is the significance of the dotted lines?

Response: We have re-designed Fig2 and used different color panels (separated by the dotted lines) to represent different organ systems, including (from top to bottom): the nervous system or mental disorders, skin, respiratory system, circulatory system, blood, endocrine and metabolic, digestive system, and other signs.

Reviewer #1: Figure 3: This figure is pretty, but a bit confusing. I do not believe that the figure of the person is necessary and should come out. You could more clearly present the data by adding a column to the left of the “all” column that indicates the system for each group of symptoms/conditions and also remove the color from those boxes. It might be helpful to state that the figure is a heat map and indicate that the “thermometer” is the legend.

Response: We have re-designed Fig3 and revised the associated caption texts.

Reviewer #1: I did not see reference to Supplemental Fig 2 in the text. Did I miss it?

Response: Thanks for your question. We have added the reference to Supp Fig 2 in the result-section and discussion section.

Reviewer #2: Can the authors comment how many facilities send data to the networks used to collect the data of the study?

Response: We have revised our Methods text to incorporate these information: “…INSIGHT, which contains records from approximately 12 million persons who received services across five health systems (Albert Einstein School of Medicine/Montefiore Medical Center, Columbia University and Weill Cornell Medicine/New York-Presbyterian Hospital, lcahn School of Medicine/Mount Sinai Health System, and New York University School of Medicine/Langone Medical Center) in the New York City (NYC) metropolitan area, and OneFlorida+, which contains records from approximately 15 million persons receiving services across 13 health systems (University of Florida and UF Health, Florida State University, University of Miami and UHealth, Orlando Health System, AdventHealth, Tallahassee Memorial HealthCare, Tampa General Hospital, Bond Community Health Center Inc., Nicklaus Children’s Hospital, CommunityHealth IT, University of South Florida and USF Health, University of Alabama at Birmingham, Emory University) in Florida.”

Reviewer #2: One important limitation for the study is that negative cases are only classified based on the data from the in-network facilities, but if a patient classified as “negative” had indeed a positive test reported in a different facility (or an in-house / over-the-counter test) it would be missed by the current study search strategy. The authors should state this important limitation in the discussion section of the paper.

Response: We have added this limitation in the discussion section: “…Our study is subject to important limitations. First, we may have misclassified patients as not infected with COVID-19 because a test was never performed or not recorded in the in-network facilities. This may have been more likely during the follow-up of the post-Delta wave, as the follow-up period overlapped with the increasing availability of home testing. Such misclassification would likely lead us to underestimate the prevalence of PASC or underestimate the relative risk of PASC, particularly during the first wave when diagnostic testing was less widely available. …”

Reviewer #2: As the authors state, an important limitation is the fact that other infections can account for the PASC conditions. However, the authors should also mention that non-infectious conditions could account for some of these conditions, and that’s another limitation as the authors did not control for comorbid conditions.

Response: We are not sure which conditions the reviewer is referring to, because we compared the new onset of non-infectious conditions (and symptoms) between those with a COVID-19 diagnosis and those without during the study period. If other non-infectious conditions also mimic PASC, we would expect this to bias our results toward no or less association between COVID-19 and PASC.

Reviewer #2: What does the U099/B948 code mean?

Response: We used ICD-10 diagnostic codes B948 (sequelae of other specified infectious and parasitic diseases) and U099 (post-COVID-19 condition, unspecified) to capture general or unspecified PASC conditions. We added this to the Methods-Data Analysis section.

Reviewer #2: The authors should also state in their limitations that the imputation of a missing ADI value with median ADI per site may be not an accurate estimation.

Response: Thanks for your suggestions. We have added this to the Discussion-Limitations paragraph: “…While we adjusted for many characteristics, other unmeasured factors or missing values (e.g., missing ADI value, smoking status, etc.) could explain differences…”

Reviewer #2: Can the authors comment on the fact that the number of patients with positive tests during the time the Delta variant circulated was significantly lower than the number of positive tests during the time of the ancestral variant? As the Delta variant is believed to have spread faster, it is really surprising the authors found these differences. If this is related to the fact that most positive patients for the ancestral variant were diagnosed in NY state, the authors should elaborate on this. The same is true for the negative cases. There seems to be a significantly lower number of cases tested during the Delta period.

Response: We covered site-specific analyses in Supplementary materials (Fig S1 for temporal dynamics, Fig S2 for the excess burdens, and Table S1 for the number of patients in different period). We reference this issue in the Discussion section, “Third, the number of cases during the Delta variant period was substantially higher in Florida (Supplementary Figure S1 and Supplementary Table S1), and, for several conditions, the magnitude of excess burden varied between NY and Florida (Supplementary Figure S2).” In our sensitivity analyses, we observed magnitude differences but the major conclusions still held.

Reviewer #2: Could the authors provide the data of how many positive cases and how many negative cases in each variant wave (ancestral vs. delta variant) were from New York, and how many were from Florida?

Response: We have incorporated this information in the supplemental data, see below.

Table. S1. Number of patients in All time (March 2020 to November 2021) versus Ancestral Strain Period (March 2020 to September 2020) versus Delta Variant Period (June 2021 to November 2021), stratified by different regions, NYC Insight vs. Florida OneFlorida.

Insight OneFlorida

All SARS-CoV-2 Positive SARS-CoV-2 Negative All SARS-CoV-2 Positive SARS-CoV-2 Negative

All time 361,401 35,275 326,126 199,351 22,341 177,010

Ancestral Strain Period 149,734 12,611 137,123 86,051 7,332 78,719

Delta Period 30,491 2,035 28,456 30,323 6,062 24,261

Reviewer #2: I am not sure if the following sentence is appr

Attachment

Submitted filename: Response to Reviewers.docx

pone.0282451.s004.docx (28.9KB, docx)

Decision Letter 1

Roberto Scendoni

11 Jan 2024

PONE-D-23-04388R1Excess burden of respiratory and abdominal conditions following COVID-19 infections during the ancestral and Delta variant periods in the United States: An EHR-based cohort study from the RECOVER ProgramPLOS ONE

Dear Dr. Varma,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

The requested changes have been made. However, some minor fixes are missing. Please follow what the reviewers suggest.

==============================

Please submit your revised manuscript by Feb 25 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Roberto Scendoni

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #3: (No Response)

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #3: (No Response)

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #3: (No Response)

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #3: (No Response)

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors have addressed my concerns, but on Figure 1, the last two boxes indicate that they were patients without targeted diagnosis. Just confirming that this should not be with targeted diagnosis.

Reviewer #3: Indeed, it is acceptable to request only a minor revision, specifying the need to expand the bibliography, as it is currently too concise. Specifically, include reviews on coinfections in hospitalized individuals, along with a brief comment (e.g., doi: 10.3389/fmed.2021.681469; doi: 10.3390/pathogens12050646; doi: 10.1186/s13054-023-04312-0; doi: 10.1016/j.ijid.2020.10.040).

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #3: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 Jun 6;19(6):e0282451. doi: 10.1371/journal.pone.0282451.r004

Author response to Decision Letter 1


14 Jan 2024

Reviewer #1: The authors have addressed my concerns, but on Figure 1, the last two boxes indicate that they were patients without targeted diagnosis. Just confirming that this should not be with targeted diagnosis.

Response: The Figure is labeled correctly. The population being studied is patients without one of the PASC diagnoses at baseline, and, within that population, we are comparing those with at COVID-19 diagnosis to those without a COVID-19 diagnosis.

Reviewer #3: Indeed, it is acceptable to request only a minor revision, specifying the need to expand the bibliography, as it is currently too concise. Specifically, include reviews on coinfections in hospitalized individuals, along with a brief comment (e.g., doi: 10.3389/fmed.2021.681469; doi: 10.3390/pathogens12050646; doi: 10.1186/s13054-023-04312-0; doi: 10.1016/j.ijid.2020.10.040).

Response: We have added the requested references.

Attachment

Submitted filename: Response to Reviewers.docx

pone.0282451.s005.docx (28.9KB, docx)

Decision Letter 2

Roberto Scendoni

16 Jan 2024

Excess burden of respiratory and abdominal conditions following COVID-19 infections during the ancestral and Delta variant periods in the United States: An EHR-based cohort study from the RECOVER Program

PONE-D-23-04388R2

Dear Dr. Varma,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Roberto Scendoni

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Roberto Scendoni

9 May 2024

PONE-D-23-04388R2

PLOS ONE

Dear Dr. Varma,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Roberto Scendoni

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Lab-confirmed new SARS-CoV-2 cases per 10,000 patients with different variants of concerns in the INSIGHT and OneFlorida+ cohorts.

    (TIFF)

    pone.0282451.s001.tiff (207KB, tiff)
    S2 Fig. Adjusted excess burdens per 1,000 patients of potential incident PASC conditions in all time (March 2020 to November 2021) versus ancestral strain period (March 2020 to September 2020) versus Delta variant period (June 2021 to November 2021), stratified by different regions (NYC vs. Florida).

    The sequelae outcomes were ascertained from day 30 after the SARS-CoV-2 infection and the adjusted hazard ratio were computed 180 days after the SARS-CoV-2 infection.

    (TIFF)

    pone.0282451.s002.tiff (13MB, tiff)
    S1 Table. Number of patients in all time (March 2020 to November 2021) versus ancestral strain period (March 2020 to September 2020) versus Delta variant period (June 2021 to November 2021), stratified by different regions, NYC Insight vs. Florida OneFlorida.

    (DOCX)

    pone.0282451.s003.docx (10.7KB, docx)
    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0282451.s004.docx (28.9KB, docx)
    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0282451.s005.docx (28.9KB, docx)

    Data Availability Statement

    There are legal and ethical restrictions on data sharing because the Institutional Review Board of Weill Cornell Medicine did not approve public data deposition. The data set used for this study constitutes sensitive patient information extracted from the electronic health records. Accordingly, it is subject to federal legislation that limits our ability to disclose it to the public, even after it has been subjected to deidentification techniques. To request the access of the de-identified minimal dataset underlying these findings, interested and qualified researchers should contact INSIGHT Clinical Research Network (https://insightcrn.org/) or Alexandra LaMar at all4008@med.cornell.edu.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES