Skip to main content
PLOS One logoLink to PLOS One
. 2021 Oct 21;16(10):e0258914. doi: 10.1371/journal.pone.0258914

SARS-CoV-2 infection and cardiovascular or pulmonary complications in ambulatory care: A risk assessment based on routine data

Siranush Karapetyan 1,*, Antonius Schneider 1, Klaus Linde 1, Ewan Donnachie 2, Alexander Hapfelmeier 1,3
Editor: Huei-Kai Huang4
PMCID: PMC8530335  PMID: 34673818

Abstract

Background

Risk factors of severe COVID-19 have mainly been investigated in the hospital setting. We investigated pre-defined risk factors for testing positive for SARS-CoV-2 infection and cardiovascular or pulmonary complications in the outpatient setting.

Methods

The present cohort study makes use of ambulatory claims data of statutory health insurance physicians in Bavaria, Germany, with polymerase chain reaction (PCR) test confirmed or excluded SARS-CoV-2 infection in first three quarters of 2020. Statistical modelling and machine learning were used for effect estimation and for hypothesis testing of risk factors, and for prognostic modelling of cardiovascular or pulmonary complications.

Results

A cohort of 99 811 participants with PCR test was identified. In a fully adjusted multivariable regression model, dementia (odds ratio (OR) = 1.36), type 2 diabetes (OR = 1.14) and obesity (OR = 1.08) were identified as significantly associated with a positive PCR test result. Significant risk factors for cardiovascular or pulmonary complications were coronary heart disease (CHD) (OR = 2.58), hypertension (OR = 1.65), tobacco consumption (OR = 1.56), chronic obstructive pulmonary disease (COPD) (OR = 1.53), previous pneumonia (OR = 1.53), chronic kidney disease (CKD) (OR = 1.25) and type 2 diabetes (OR = 1.23). Three simple decision rules derived from prognostic modelling based on age, hypertension, CKD, COPD and CHD were able to identify high risk patients with a sensitivity of 74.8% and a specificity of 80.0%.

Conclusions

The decision rules achieved a high prognostic accuracy non-inferior to complex machine learning methods. They might help to identify patients at risk, who should receive special attention and intensified protection in ambulatory care.

Introduction

The danger posed by COVID-19 in a population results from the interplay of the high infectivity of the SARS-CoV-2 virus and the mortality risk to infected persons. Several internal and external factors, such as age, a person’s pre-existing health condition, social behaviour or containment measures taken by governments, have been discussed to affect these risks.

With regard to the risk of infection, governments around the world have imposed containment measures, such as the wearing of masks, social distancing and special hygiene measures. These measures have been subject of controversial discussion concerning effectiveness and the potential to aggravate other health conditions due to a delay or failure of treatment [1]. Some evidence has also related the risk of infection to living and social conditions, such as residential care for the elderly, assisted living and mobility [2]. Further exploration identified health conditions such as diabetes, kidney disease, dementia and obesity as relevant risk factors [3]. By contrast, a recent investigation did not reveal any health conditions to be associated with the risk of infection [4].

Concerning the risk of severe COVID-19, studies investigated risk factors in dependence of health conditions, mainly in the hospital setting. Accordingly, diabetes, chronic obstructive pulmonary disease (COPD), coronary heart disease (CHD), hypertension, chronic kidney disease (CKD), cancer, dementia and asthma were found to be related to an increased risk of hospitalization or mortality [511]. Further evidence is also provided for obesity, smoking, liver disease and depression [58, 1216]. In particular, age was found to have the most prominent impact on the lethality of COVID-19 [17]. However, this list is not exhaustive and the role of other factors, such as vitamin D [18, 19], is still under discussion.

The research objectives of the present cohort study and nested case-control study were to explore the risk of testing positive for SARS-CoV-2 infection and the risk of cardiovascular or pulmonary complications in dependence of pre-existing health conditions. In a hypothesis-driven approach, the International Classification of Diseases 10th Revision (ICD-10) diagnoses of pre-defined diseases and health conditions were used to examine the validity of known risk factors of the hospital setting in the outpatient setting. Another important research objective was to support ambulatory care decision-making by developing a respective prognostic model with the use of regression modelling and machine learning techniques.

Methods

The analysis is based on large routine data of two cohorts with polymerase chain reaction (PCR) test confirmed or excluded SARS-CoV-2 infection, respectively. The anonymous ambulatory claims data was provided by the Bavarian Association of Statutory Health Insurance Physicians (BASHIP) and covers all 11.2 million statutorily insured persons in Bavaria, covering approximately 85% of the population [20]. During the evaluation period from February to the end of September 2020 (i.e., first to third quarter 2020), patients suspected to suffer from COVID-19 infection received naso-pharyngeal swabs for PCR testing in general practice. According to the national testing strategy, participants without symptoms could also be tested in general practice, for example travelers from risk areas, staff in health care or other vulnerable sectors, and contacts of infected persons. However, these cases were to be billed separately by the Ministry and were thus not documented as claims data. Individual ambulatory claims data was provided for quarterly billing periods from the first quarter of 2015 to the last quarter of 2020. Consent from participants was not required as the analyses are based on secondary billing data and conducted according to the German guideline “Good Practice of Secondary Data Analysis” [21]. We used the German modification of the International Classification of Diseases 10th Revision (ICD-10-GM) [22] to define diseases and health conditions. Codes that have changed between 2015 and 2020 were updated to the coding valid in 2020 according to the official documentation of the ICD-10-GM, which is released by the German Institute of Medical Documentation and Information [22]. For the definition of pre-existing health conditions we considered only those ICD-10 codes marked as secure diagnoses.

According to the “test-negative design” approach [23], cohorts of individuals with a secured U07.1 diagnosis or a U07.1 diagnosis of exclusion, which codes a positive or negative PCR test result for SARS-CoV-2 infection, were defined for analysis. These cases and controls are henceforth called „test-positives”and „test-negatives“. We further excluded individuals from the test-negatives who had an additional secured U07.2 diagnosis, coding a clinically-epidemiologically diagnosed COVID-19 according to the case definition of the World Health Organization [24]. Briefly, the U07.2 diagnosis is used for individuals with COVID-19 symptoms who have been in contact with a confirmed case or live in a facility with a suspected outbreak but have not received a PCR test.

The observation period was defined to include the five years preceding and the quarter following the index quarter of PCR test. Accordingly, determination of an index quarter was restricted to the first three quarters in 2020. Individuals who were not residing in Bavaria within the five years preceding PCR test and had no data record before this observational period were excluded from analysis in an attempt to ensure complete observation periods. We additionally removed individuals with implausible ICD-10 coding, i.e. patients with a death diagnosis (R96, R99 and I46.9) within the observation period (Fig 1).

Fig 1. Flow chart for participant selection process.

Fig 1

Further information about age, sex, urbanization and nursing home living was used to adjust for potential confounding. To adjust for different settlement and health care supply densities we included a measure of urbanization, categorized by four levels as defined by the German Federal Institute for Research on Building, Urban Affairs and Spatial Development: ‚large cities‘ with at least 100 000 residents, and ‚urban areas‘, ‚rural areas‘ or ‚sparsely populated rural areas‘ with population densities of >150, ≤150 or ≤100 inhabitants per km2, respectively [25]. Seven recorded fee codes were used to identify individuals living in nursing homes up to three quarters before or during the quarter of PCR test.

The underlying data for this study are pseudonymized and the study was approved by the Ethics Commission of the Technical University of Munich (Ethikkommission der Technischen Universität München) (approval No 673/20 S-EB).

Statistical analysis

Two multivariable binary regression models were used to investigate the risk of positive PCR test result and the risk of cardiovascular or pulmonary complications separately. Known risk factors for severe COVID-19 in the hospital setting were pre-defined for inclusion into the models as independent predictor variables. These were hypertension, diabetes, CKD, dementia, obesity, CHD, COPD, pneumonia, asthma, tobacco consumption, cancer, liver disease and depression. Additional potential risk factors of interest included in the analysis were anxiety disorder, vitamin D deficiency, immunodeficiency and flu. Respective ICD-10 codes are listed in S1 Table. Potential confounding by age, sex, urbanization and nursing home living was addressed by including respective factor variables and an interaction between age and sex in the models. Thereby, age was categorized to the intervals 0–20, 21–30, 31–40,. . ., 80+ and urbanization to the levels ‚large cities‘, ‚urban areas‘, ‚rural areas‘ and ‚sparsely populated rural areas‘.

In the absence of hospital data we decided a priori to use cardiovascular or pulmonary complications occurring in the first quarter after PCR-confirmed SARS-CoV-2 infection as a proxy for a severe course of COVID-19 in outpatient setting. Diagnoses included in this outcome were acute respiratory distress syndrome (ARDS), hypoxia, stroke, angina pectoris, heart attack, cardiac arrest, pulmonary embolism and apnea (S1 Table). In addition, to elaborate whether a COVID-19 specific risk model is meaningful beyond a general risk model, we investigated COVID-19 as an independent predictor of defined complications. We therefore fitted the multivariable regression model to both groups simultaneously, i.e. to test-positives and test-negatives, in two steps. First, we included the result of PCR test as an additional risk factor, and second, all interaction effects between the PCR test result and the investigated risk factors were sequentially added and screened for possible inclusion to the multivariable regression model with forward stepwise variable selection based on the Akaike’s information criterion (AIC). Goodness-of-fit of these nested models was compared by a descriptive likelihood-ratio test without formal adjsutment for AIC-based model selection.

Any hypothesis testing was performed at local and global 5% levels of significance, i.e. with and without adjustment for the multiple testing problem. Therefore, P values have additionally been adjusted using the joint distribution of the regression coefficients of the multivariable models [26].

With the intention of assisting decision-making in ambulatory care, we additionally developed a prognostic model for the risk of cardiovascular or pulmonary complications in the outpatient setting with the use of regression modelling and machine learning. Selected algorithms were random forest, conditional inference tree, least absolute shrinkage and selection operator (LASSO), ridge regression, elastic net and binary logistic regression with and without stepwise variable selection based on AIC. In this regard, we randomly selected 75% of the participants (derivation set) to develop the models and internally validated the performance of the models on the remaining participants (validation set) using the area under the receiver operating characteristic curve (AUC) as a measure of discriminatory ability. To improve performance of the prognostic models we tuned the parameters of machine learning algorithms using three-fold cross-validation within the derivation set. With the aim to visualize the results of the best performing model in a comprehensive manner, recursive segmentation and recursive partitioning were applied to the best performing model’s predictions [27] to identify subgroups of different risks [28]. This resulted in decision rules enabling a specific characterization of patients with an increased risk of defined complications.

All statistical analyses were performed in R, version 4.0.3 (The R Foundation for Statistical Computing, Vienna, Austria).

Results

Risk of positive PCR test result

A total of 99 811 participants were included in the analysis of the risk of positive PCR test result for SARS-CoV-2 infection. Of these participants, 58 336 (58.4%) were female, 79 236 (79.4%) were younger than 60 years (mean±SD = 44.3±20.8). Among the participants we identified 53 904 (54.0%) test-positives. Overall characteristics of test-positives and test-negatives were similar (Table 1). A flow chart of the participant selection process is given in Fig 1.

Table 1. Characteristics of participants; n (%).

Infection risk analysis Complications risk analysis
Total Test-negatives Test-positives Total No complications Complications
(n = 99 811) (n = 45 907) (n = 53 904) (n = 46 071) (n = 44 167) (n = 1904)
Gender
    Male 41 475 (41.6) 19 215 (41.9) 22 260 (41.3) 17 794 (38.6) 16 883 (38.2) 911 (47.8)
    Female 58 336 (58.4) 26 692 (58.1) 31 644 (58.7) 28 277 (61.4) 27 284 (61.8) 993 (52.2)
Age
    < 21 13 992 (14.0) 7911 (17.2) 6081 (11.3) 4818 (10.5) 4810 (10.9) 8 (0.4)
    21–30 14 564 (14.6) 6384 (13.9) 8180 (15.2) 6610 (14.3) 6576 (14.9) 34 (1.8)
    31–40 15 894 (15.9) 7509 (16.4) 8385 (15.6) 6924 (15.0) 6871 (15.6) 53 (2.8)
    41–50 16 410 (16.4) 7319 (15.9) 9091 (16.9) 7851 (17.0) 7688 (17.4) 163 (8.6)
    51–60 18 376 (18.4) 8005 (17.4) 10 371 (19.2) 9253 (20.1) 8926 (20.2) 327 (17.2)
    61–70 8838 (8.9) 4230 (9.2) 4608 (8.5) 4265 (9.3) 3921 (8.9) 344 (18.1)
    71–80 5503 (5.5) 2387 (5.2) 3116 (5.8) 2862 (6.2) 2459 (5.6) 403 (21.2)
    80+ 6234 (6.2) 2162 (4.7) 4072 (7.6) 3488 (7.6) 2916 (6.6) 572 (30.0)
Residence
    Sparsely populated rural area 22 860 (22.9) 10 589 (23.1) 12 271 (22.8) 10 545 (22.9) 10 058 (22.8) 487 (25.6)
    Rural area 27 170 (27.2) 12 923 (28.2) 14 247 (26.4) 12 203 (26.5) 11 687 (26.5) 516 (27.1)
    Urban area 27 732 (27.8) 12 644 (27.5) 15 088 (28.0) 12 839 (27.9) 12 349 (28.0) 490 (25.7)
    Large city 22 049 (22.1) 9751 (21.2) 12 298 (22.8) 10 484 (22.8) 10 073 (22.8) 411 (21.6)
Nursing home living 7448 (7.5) 1987 (4.3) 5461 (10.1) 4727 (10.3) 4112 (9.3) 615 (32.3)
Tobacco consumption 10 006 (10.0) 5124 (11.2) 4882 (9.1) 4375 (9.5) 4065 (9.2) 310 (16.3)
Obesity 18 208 (18.2) 7906 (17.2) 10 302 (19.1) 9404 (20.4) 8751 (19.8) 653 (34.3)
Diagnosis
    CHD 8104 (8.1) 3343 (7.3) 4761 (8.8) 4335 (9.4) 3478 (7.9) 857 (45.0)
    Hypertension 30 750 (30.8) 13 052 (28.4) 17 698 (32.8) 16 223 (35.2) 14 702 (33.3) 1521 (79.9)
    COPD 10 103 (10.1) 4790 (10.4) 5313 (9.9) 4866 (10.6) 4291 (9.7) 575 (30.2)
    Asthma 14 617 (14.6) 7013 (15.3) 7604 (14.1) 6906 (15.0) 6533 (14.8) 373 (19.6)
    Pneumonia 5145 (5.2) 2369 (5.2) 2776 (5.1) 2464 (5.3) 2182 (4.9) 282 (14.8)
    Flu 7717 (7.7) 3945 (8.6) 3772 (7.0) 3273 (7.1) 3161 (7.2) 112 (5.9)
    Immunodeficiency 2990 (3.0) 1444 (3.1) 1546 (2.9) 1410 (3.1) 1339 (3.0) 71 (3.7)
    CKD 7369 (7.4) 2978 (6.5) 4391 (8.1) 3932 (8.5) 3301 (7.5) 631 (33.1)
    Liver disease 12 621 (12.6) 5390 (11.7) 7231 (13.4) 6643 (14.4) 6110 (13.8) 533 (28.0)
    Type 1 diabetes 1738 (1.7) 734 (1.6) 1004 (1.9) 936 (2.0) 823 (1.9) 113 (5.9)
    Type 2 diabetes 9634 (9.7) 3803 (8.3) 5831 (10.8) 5373 (11.7) 4665 (10.6) 708 (37.2)
    Vitamin D deficiency 9213 (9.2) 4126 (9.0) 5087 (9.4) 4674 (10.1) 4393 (9.9) 281 (14.8)
    Cancer 11 113 (11.1) 5047 (11.0) 6066 (11.3) 5540 (12.0) 5021 (11.4) 519 (27.3)
    Dementia 4466 (4.5) 1236 (2.7) 3230 (6.0) 2762 (6.0) 2317 (5.2) 445 (23.4)
    Depression 28 562 (28.6) 12 547 (27.3) 16 015 (29.7) 14 678 (31.9) 13 762 (31.2) 916 (48.1)
    Anxiety disorder 14 498 (14.5) 6665 (14.5) 7833 (14.5) 7268 (15.8) 6899 (15.6) 369 (19.4)

CHD, coronary heart disease; COPD, chronic obstructive pulmonary disese; CKD, chronic kidney disease.

Dementia (odds ratio (OR) = 1.36 [1.25–1.49]), type 2 diabetes (OR = 1.14 [1.08–1.20]), obesity (OR = 1.08 [1.05–1.12]) and liver disease (OR = 1.07 [1.03–1.11]) were identified to be significantly associated with an increased risk of a positive PCR test result. After correcting for multiple testing dementia, type 2 diabetes and obesity still remained statistically significant. Conversely, participants with tobacco consumption (OR = 0.75 [0.72–0.78]), previous flu (OR = 0.83 [0.79–0.87]), cancer (OR = 0.90 [0.86–0.94]), COPD (OR = 0.94 [0.89–0.98]), anxiety disorder (OR = 0.95 [0.92–0.99]) and asthma (OR = 0.96 [0.92–0.99]) were less likely to test postive for SARS-CoV-2 infection (Table 2).

Table 2. Odds ratio (OR) and 95% confidence intervals (CI) for the risk of positive PCR test result for SARS-CoV-2 infection and the risk of cardiovascular or pulmonary complications.

Multivariable binary regression models adjusted for age, sex, urbanisation, nursing home living and diseases shown.

Infection risk analysis Complications risk analysis
OR (95% CI) P value P value* OR (95% CI) P value P value*
Tobacco consumption 0.75 (0.72–0.78) p < 0.001 p < 0.001 1.56 (1.35–1.81) p < 0.001 p < 0.001
Obesity 1.08 (1.05–1.12) p < 0.001 p < 0.001 1.13 (1.01–1.27) 0.035 0.578
Diagnosis
    CHD 1.04 (0.99–1.10) 0.151 0.992 2.58 (2.31–2.89) p < 0.001 p < 0.001
    Hypertension 1.04 (1.00–1.08) 0.046 0.761 1.65 (1.43–1.90) p < 0.001 p < 0.001
    COPD 0.94 (0.89–0.98) 0.004 0.126 1.53 (1.36–1.73) p < 0.001 p < 0.001
    Asthma 0.96 (0.92–0.99) 0.017 0.413 1.18 (1.03–1.35) 0.016 0.335
    Pneumonia 0.95 (0.90–1.01) 0.088 0.937 1.53 (1.32–1.78) p < 0.001 p < 0.001
    Flu 0.83 (0.79–0.87) p < 0.001 p < 0.001 1.06 (0.86–1.30) 0.606 1.000
    Immunodeficiency 0.97 (0.90–1.05) 0.444 1.000 1.27 (0.97–1.66) 0.077 0.856
    CKD 0.96 (0.90–1.01) 0.126 0.982 1.25 (1.10–1.42) p < 0.001 0.012
    Liver disease 1.07 (1.03–1.11) 0.002 0.057 1.00 (0.89–1.12) 0.982 1.000
    Type 1 diabetes 0.93 (0.83–1.03) 0.154 0.993 0.89 (0.71–1.12) 0.330 1.000
    Type 2 diabetes 1.14 (1.08–1.20) p < 0.001 p < 0.001 1.23 (1.09–1.38) 0.001 0.028
    Vitamin D deficiency 1.02 (0.97–1.06) 0.480 1.000 1.15 (0.99–1.32) 0.060 0.779
    Cancer 0.90 (0.86–0.94) p < 0.001 p < 0.001 1.02 (0.90–1.14) 0.802 1.000
    Dementia 1.36 (1.25–1.49) p < 0.001 p < 0.001 1.00 (0.85–1.17) 0.984 1.000
    Depression 1.03 (1.00–1.06) 0.065 0.870 1.18 (1.06–1.31) 0.003 0.067
    Anxiety disorder 0.95 (0.92–0.99) 0.018 0.443 1.01 (0.89–1.16) 0.831 1.000

*P values corrected for multiple testing.

CHD, coronary heart disease; COPD, chronic obstructive pulmonary disese; CKD, chronic kidney disease.

Risk of cardiovascular or pulmonary complications

For the analysis of defined complications the cohort of test-positives was reduced to 46 071 participants with available data records in the first quarter after the index quarter of PCR test. Complications could be identified in 1904 (4.1%) individuals, including ARDS (55 (2.9%)), hypoxia (441 (23.2%)), stroke (617 (32.4%)), angina pectoris (250 (13.1%)), heart attack (192 (10.1%)), cardiac arrest (1 (0.1%)), pulmonary embolism (211 (11.1%)), apnea (5 (0.3%)) and two or more complications (132 (6.9%)). Participants with these complications were older (mean±SD = 69.0±16.3) and tend to have more chronic conditions including CHD (857 (45.0%)), hypertension (1521 (79.9%)), COPD (575 (30.2%)), CKD (631 (33.1%)), type 2 diabetes (708 (37.2%)) (Table 1).

In the analysis of the risk of complications ten out of 18 candidate predictors were statistically significant. The predictors include CHD (OR = 2.58 [2.31–2.89]), hypertension (OR = 1.65 [1.43–1.90]), tobacco consumption (OR = 1.56 [1.35–1.81]), COPD (OR = 1.53 [1.36–1.73]), previous pneumonia (OR = 1.53 [1.32–1.78]), CKD (OR = 1.25 [1.10–1.42]), type 2 diabetes (OR = 1.23 [1.09–1.38]), depression (OR = 1.18 [1.06–1.31]), asthma (OR = 1.18 [1.03–1.35]) and obesity (OR = 1.13 [1.01–1.27]). Among these predictors, CHD, hypertension, tobacco consumption, COPD, previous pneumonia, CKD and type 2 diabetes remained statistically significant after the correction for multiple testing (Table 2). In an additional sensitivity analysis we considered hematooncological and carcinoma in situ diagnoses separately from cancer and categorized diagnoses by time (<1 year, 1–5 years). Opposed to the findings in primary analysis (cancer, OR = 1.02 [0.90–1.14]), cancer diagnosed within the last year showed increased risk of complications (OR = 1.27 [1.05-1-54]), however, this was not significant after correcting for multiple testing (S2 Table).

We additionally analysed the specific relevance of our results of the cohort of test-positives in comparison with test-negatives. When including the result of PCR test as an additional risk factor to a multivariable regression model that is fit to both groups, we found increased risk (OR = 1.13 [1.04–1.22]) for test-positives. Further, a multivariable regression model with forward stepwise variable selection by AIC included interaction effects between the PCR test result and the investigated risk factors, in particular, the interaction to CHD, type 2 and type 1 diabetes, previous pneumonia and dementia. Overall, the PCR test result and the selected interactions showed a statistically significant effect (likelihood-ratio test P<0.001). Because of this evidence of specific risks in test-positives and our research goal to evaluate known risk factors in outpatient setting, it seemed reasonable to investigate and model specific risks only in test-positives.

Prognostic model

For the development of a prognostic model for the risk of cardiovascular or pulmonary complications, 46 071 test-positive participants were randomly allocated to a derivation set (34 553 participants) and to a validation set (11 518 participants). In the derivation and validation sets 1448 (4.2%) and 456 (4.0%) participants had complications, respectively (S3 Table).

Internal validation of the prognostic models showed AUC values ranging from 0.83 to 0.85, indicating excellent [29] and similar discriminatory ability of all models (Table 3). A random forest achieved the best performance, with the disadvantage of lacking interpretability. Therefore, based on its predictions we performed recursive segmentation and partitioning to characterize subgroups with an increased risk of defined complications, enabling better assistance of decision-making in ambulatory care. From the resulting tree‘s structure (Fig 2) we derived the following decision rules defining patients of increased risk: 1) age >70 years, 2) diagnosis of CHD, 3) diagnosis of CKD or COPD with an additional diagnosis of hypertension or age >60 years. A validation of these decision rules resulted in a sensitivity and specificity of 74.8% and 80.0%, respectively. The positive and negative predictive values (PPV, NPV) were respectively 13.3% and 98.7%. The performance of these simple decision rules was not inferior to the performance of the more complex prognostic models (cf. Fig 3).

Table 3. Area under the receiver operating characteristic curve (AUC) and 95% Confidence Intervals (CI) of machine learning methods for cardiovascular or pulmonary complications.

Internal Validation.

Model AUC (95% CI)
Random Forest 0.853 (0.837–0.870)
Logistic Regression 0.850 (0.834–0.866)
Stepwise Logistic Regression 0.849 (0.833–0.866)
LASSO 0.849 (0.833–0.866)
Ridge Regression 0.847 (0.829–0.864)
Conditional Inference Tree 0.843 (0.826–0.860)
Elastic Net 0.835 (0.815–0.854)

LASSO, least absolute shrinkage and selection operator.

Fig 2. Recursive segmentation and recursive partitioning based tree predictions for the risk of cardiovascular or pulmonary complications.

Fig 2

Fig 3. Receiver Operating Characteristic (ROC) curves for machine learning methods for the risk of cardiovascular or pulmonary complications.

Fig 3

Internal validation.

Discussion

The analysis of ambulatory claims data with regression modelling and machine learning techniques revealed, that patients with tobacco consumption, previous flu and cancer were less likely to test positive for SARS-CoV-2 infection; and patients with dementia, type 2 diabetes and obesity showed an increased risk of a positive PCR test result. CHD, CKD, COPD, hypertension and increased age were identified as predictors for unfavourable complications after PCR-confirmed infection.

Consistent with previous findings [3], the present study provides evidence for a strong association of the diagnosis of dementia with a positive PCR test result for SARS-CoV-2 infection. Additionally, our results indicate type 2 diabetes and obesity being positively associated with testing positive, while some health conditions previously found to increase the risk of severe COVID-19 including tobacco consumption and cancer were surprisingly associated with a lower risk of testing positive. The latter might be explained with possible behavioural adjustment in the patients belonging to respective vulnerable subgroups, however, this assumption might be impaired as the cohort of tested participants may also include asymptomatic participants. Opposed to previous assumption [30], our analysis did not suggest significant association with hypertension or type 1 diabetes and the risk of SARS-CoV-2 infection. This might be explained by increased efforts of the potentially vulnerable population to protect themselves from infection. In line with this, dementia showed a comparatively strong association with a positive PCR test result, as it is known to substantially affect daily functioning [31].

With regard to the risk of cardiovascular or pulmonary complications, we found various health conditions including CHD, hypertension, tobacco consumption, COPD, CKD and type 2 diabetes to be associated with an increased risk, which is consistent with findings of previous studies [59, 11, 13]. Additionally, we observed associations with previous pneumonia, depression, asthma and obesity. In contrast to the findings of [8, 32], our results do not suggest a significant association with cancer, while it is important to note that we did not restrict our analyses to a recent cancer diagnosis but investigated cancer history in the whole observation period of five years. An additional sensitivity analysis showed however that consideration of recent cancer diagnosis may yield increased risk for complications. We also did not find significant associations for some risk factors found in previous studies [68] including dementia, liver disease and type 1 diabetes. This might be explained by the process of data collection in outpatient setting. There are probably less complications in these patients than in hospitalized patients, which could lead to weaker or absent associations with unfavourable complications.

Our prognostic model can be applied in ambulatory care for the identification of patients with an increased risk of cardiovascular or pulmonary complications, based on information which is readily accessible for treating physicians. Our best performing model, a random forest, achieved excellent prognostic accuracy. Based on its predictions we identified subgroups of interest. In this regard, we derived decision rules for patients with an increased risk of complications, which achieved high prognostic accuracy. The resulting PPV = 13.3% indicates that patients with a risk constellation according to our prognostic model should receive special attention and intensified protection in ambulatory care. At first sight, this PPV seems to be low although the sensitivity and specificity values are high at 74.8% and 80.0%, respectively. However, this PPV is acceptable given the low prevalence of 4.1%; higher PPV values would result from a higher prevalence. On the other hand, the resulting NPV = 98.7% helps to rule-out cardiovascular or pulmonary complications in patients without these pre-existing conditions.

Strengths and limitations

A strength of our study is the high representativeness due to the large sample size covering the majority of the Bavarian population. Our study also encountered for main potential confounders, i.e. age, sex, urbanization and nursing home living. Investigations were hypothesis-driven, focusing on pre-defined diseases and health conditions. Finally, in the derivation of the prognostic model, we used a separate part of the data for internal validation. However, the generalizability of the prognostic model still requires assessment by external validation.

Our study has some further limitations. Without information on hospital data, our outcome was a proxy of severe COVID-19 and was defined a priori including cardiovascular or pulmonary complications occurring in the quarter following SARS-CoV-2 infection. This definition may have led to obvious relations with pre-existing diseases and health conditions known to be associated with cardiovascular or pulmonary complications. This limitation was addressed by an additional investigation of risks that are specific to test-positives compared to test-negatives. Despite adjustment for potential confounders there are still some unobserved confounding risks, e.g. vulnerable subgroups might be more alert to symptoms and therefore more likely to test positive for COVID-19 than the other participants, leading to bias in our study. Another possible limitation was that the data and inherent diagnoses are not audited and reflect the coding and clinical practices of treating physicians. The determination of the cohort of test-negatives was based on the coding of a negative PCR test result, which was optional for physicians. The latter might introduce bias to the study, as the willingness of physicians to code the negative PCR test result may vary between different participant groups. Diagnoses could also only be made through physician contact, which may result in incomplete data. Potential incorrect coding was addressed by careful quality checks as described in the methods section. Beyond that, the possibility of asymptomatic participants in a cohort can presumably introduce bias. However, tests of asymptomatic patients were to be billed separately by the government and consequently not documated as claims data for the BASHIP. Therefore, the association between risk factors and the outcome, the estimated odds ratios, should be unbiased in this respect.

Conclusion

The prediction rule based on presence or absence of CHD, CKD, COPD, hypertension and increased age might help to rule-in and rule-out, respectively, unfavourable complications in ambulatory care. The risk of infection in itself might be reduced in patients with tobacco consumption, previous flu and cancer due to behavioural adjustment in terms of increased self-protection and contact reduction.

Supporting information

S1 Table. International classification of diseases 10th revision (ICD-10) diagnoses included in the multivariable analyses.

(PDF)

S2 Table. Odds ratio (OR) and 95% confidence intervals (CI) for the risk of cardiovascular or pulmonary complications.

Multivariable binary regression model adjusted for age, sex, urbanisation, nursing home living and diseases shown.

(PDF)

S3 Table. Characteristics of participants in derivation and validation cohorts; n (%).

(PDF)

Data Availability

The data are held by the Bavarian Association of Statutory Health Insurance Physicians (BASHIP) but restrictions apply to the availability of these data, which were used within the framework of the contractual agreement. The data are not publicly available due to data protection regulations, but may be obtained from the authors upon reasonable request and with the consent of the BASHIP (versorgungsforschung@kvb.de).

Funding Statement

This work was supported by the Bavarian State Ministry of Science and the Arts (Bayerische Staatsministerium für Wissenschaft und Kunst) (grant No H.40001.1.7(DMS)-TUM-1)

References

  • 1.Ioannidis JPA. Infection fatality rate of COVID-19 inferred from seroprevalence data. Bull World Health Organ. 2021;99(1), 19. doi: 10.2471/BLT.20.265892 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Chang S, Pierson E, Koh PW, Gerardin J, Redbird B, Grusky D, et al. Mobility network models of COVID-19 explain inequities and inform reopening. Nature. 2021;589(7840):82–7. doi: 10.1038/s41586-020-2923-3 [DOI] [PubMed] [Google Scholar]
  • 3.Rozenfeld Y, Beam J, Maier H, Haggerson W, Boudreau K, Carlson J, et al. A model of disparities: risk factors associated with COVID-19 infection. Int J Equity Health. 2020;19(1):126. doi: 10.1186/s12939-020-01242-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Chadeau-Hyam M, Bodinier B, Elliott J, Whitaker MD, Tzoulaki I, Vermeulen R, et al. Risk factors for positive and negative COVID-19 tests: a cautious and in-depth analysis of UK biobank data. Int J Epidemiol. 2020;49(5):1454–67. doi: 10.1093/ije/dyaa134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Atkins JL, Masoli JAH, Delgado J, Pilling LC, Kuo CL, Kuchel GA, et al. Preexisting Comorbidities Predicting COVID-19 and Mortality in the UK Biobank Community Cohort. J Gerontol A Biol Sci Med Sci. 2020;75(11):2224–30. doi: 10.1093/gerona/glaa183 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Barron E, Bakhai C, Kar P, Weaver A, Bradley D, Ismail H, et al. Associations of type 1 and type 2 diabetes with COVID-19-related mortality in England: a whole-population study. Lancet Diabetes Endocrinol. 2020;8(10):813–22. doi: 10.1016/S2213-8587(20)30272-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Clift AK, Coupland CAC, Keogh RH, Diaz-Ordaz K, Williamson E, Harrison EM, et al. Living risk prediction algorithm (QCOVID) for risk of hospital admission and mortality from coronavirus 19 in adults: national derivation and validation cohort study. BMJ. 2020;371:m3731. doi: 10.1136/bmj.m3731 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Williamson EJ, Walker AJ, Bhaskaran K, Bacon S, Bates C, Morton CE, et al. Factors associated with COVID-19-related death using OpenSAFELY. Nature. 2020;584(7821):430–6. doi: 10.1038/s41586-020-2521-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Fang X, Li S, Yu H, Wang P, Zhang Y, Chen Z, et al. Epidemiological, comorbidity factors with severity and prognosis of COVID-19: a systematic review and meta-analysis. Aging (Albany NY). 2020;12(13):12493–503. doi: 10.18632/aging.103579 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Williams RD, Markus AF, Yang C, Salles TD, Falconer T, Jonnagaddala J, et al. Seek COVER: Development and validation of a personalized risk calculator for COVID-19 outcomes in an international network. medRxiv. 2020;2020.05.26.20112649 [Google Scholar]
  • 11.Zheng Z, Peng F, Xu B, Zhao J, Liu H, Peng J, et al. Risk factors of critical & mortal COVID-19 cases: A systematic literature review and meta-analysis. J Infect. 2020;81(3):e16–e25. doi: 10.1016/j.jinf.2020.07.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Altschul DJ, Unda SR, Benton J, de la Garza Ramos R, Cezayirli P, Mehler M, et al. A novel severity score to predict inpatient mortality in COVID-19 patients. Sci Rep. 2020;10(1):16726. doi: 10.1038/s41598-020-73962-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Chen R, Liang W, Jiang M, Guan W, Zhan C, Wang T, et al. Risk Factors of Fatal Outcome in Hospitalized Subjects With Coronavirus Disease 2019 From a Nationwide Analysis in China. Chest. 2020;158(1):97–105. doi: 10.1016/j.chest.2020.04.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Popkin BM, Du S, Green WD, Beck MA, Algaith T, Herbst CH, et al. Individuals with obesity and COVID‐19: A global perspective on the epidemiology and biological relationships. Obes Rev. 2020;21(11):e13128. doi: 10.1111/obr.13128 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Yanover C, Mizrahi B, Kalkstein N, Marcus K, Akiva P, Barer Y, et al. What Factors Increase the Risk of Complications in SARS-CoV-2–Infected Patients? A Cohort Study in a Nationwide Israeli Health Organization. JMIR Public Health Surveill. 2020;6(3):e20872. doi: 10.2196/20872 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Yang J, Ma Z, Lei Y. A meta-analysis of the association between obesity and COVID-19. Epidemiol Infect. 2020;149:e11. doi: 10.1017/S0950268820003027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Levin AT, Hanage WP, Owusu-Boaitey N, Cochran KB, Walsh SP, Meyerowitz-Katz G. Assessing the age specificity of infection fatality rates for COVID-19: systematic review, meta-analysis, and public policy implications. Eur J Epidemiol. 2020;35(12):1123–38. doi: 10.1007/s10654-020-00698-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Hastie CE, Pell JP, Sattar N. Vitamin D and COVID-19 infection and mortality in UK Biobank. Eur J Nutr. 2021;60(1):545–8. doi: 10.1007/s00394-020-02372-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ilie PC, Stefanescu S, Smith L. The role of vitamin D in the prevention of coronavirus disease 2019 infection and mortality. Aging Clin Exp Res. 2020;32(7):1195–8. doi: 10.1007/s40520-020-01570-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Olm M, Donnachie E, Tauscher M, Gerlach R, Linde K, Maier W, et al. Impact of the abolition of copayments on the GP-centred coordination of care in Bavaria, Germany: analysis of routinely collected claims data. BMJ Open. 2020;10(9):e035575. doi: 10.1136/bmjopen-2019-035575 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Swart E, Gothe H, Geyer S, Jaunzeme J, Maier B, Grobe T, et al. Good Practice of Secondary Data Analysis (GPS): guidelines and recommendations. Gesundheitswesen. 2015;77(02):120–6. doi: 10.1055/s-0034-1396815 [DOI] [PubMed] [Google Scholar]
  • 22.Deutsches Institut für Medizinische Dokumentation und Information. The International Statistical Classification Of Diseases And Related Health Problems, 10th revision, German Modification. 2020. https://www.dimdi.de/static/de/klassifikationen/icd/icd-10-gm/kode-suche/htmlgm2020/ (22 April 2021, date last accessed).
  • 23.Vandenbroucke JP, Brickley EB, Vandenbroucke-Grauls C, Pearce N. A Test-Negative Design with Additional Population Controls Can Be Used to Rapidly Study Causes of the SARS-CoV-2 Epidemic. Epidemiology. 2020;31(6):836–43. doi: 10.1097/EDE.0000000000001251 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.World Health Organization. International Guidelines for Certification and Classification (Coding) of COVID-19 as Cause of Death. 2020. https://www.who.int/classifications/icd/COVID-19-coding-icd10.pdf?%20ua=1 (14 April 2021, date last accessed).
  • 25.BBSR. Laufende Raumbeobachtung—Raumabgrenzungen. Siedlungsstrukturelle Kreistypen. 2010. https://www.bbsr.bund.de/BBSR/DE/forschung/raumbeobachtung/Raumabgrenzungen/deutschland/kreise/siedlungsstrukturelle-kreistypen/kreistypen.html (2 March 2021, date last accessed).
  • 26.Hothorn T, Bretz F, Westfall P. Simultaneous inference in general parametric models. Biom J. 2008;50(3):346–63. doi: 10.1002/bimj.200810425 [DOI] [PubMed] [Google Scholar]
  • 27.Molnar C. Interpretable machine learning: Lulu. com; 2020. [Google Scholar]
  • 28.Hapfelmeier A, Ulm K, Haller B. Subgroup identification by recursive segmentation. J Appl Stat. 2018;45(15):2864–87. [Google Scholar]
  • 29.Hosmer DW Jr, Lemeshow S, Sturdivant RX. Applied logistic regression: John Wiley & Sons; 2013. [Google Scholar]
  • 30.Fang L, Karakiulakis G, Roth M. Are patients with hypertension and diabetes mellitus at increased risk for COVID-19 infection? Lancet Respir Med. 2020;8(4):e21. doi: 10.1016/S2213-2600(20)30116-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Gale SA, Acar D, Daffner KR. Dementia. Am J Med. 2018;131(10):1161–9. doi: 10.1016/j.amjmed.2018.01.022 [DOI] [PubMed] [Google Scholar]
  • 32.Rößler M, Jacob J, Risch L, Tesch F, Enders D, Wende D, et al. Hierarchisierung von Risikofaktoren für schwere COVID-19-Erkrankungs-verläufe im Kontext der COVID-19-Schutzimpfungen–Eine gepoolte GKV-Routinedatenanalyse basierend auf 30 Mio. Versicherten. Epid Bull. 2021;19:3–12. [Google Scholar]

Decision Letter 0

Huei-Kai Huang

20 Jul 2021

PONE-D-21-20330

SARS-CoV-2 infection and cardiovascular or pulmonary complications in ambulatory care: a risk assessment based on routine data

PLOS ONE

Dear Dr. Karapetyan,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Sep 03 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Huei-Kai Huang, M.D.

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified (1) whether consent was informed and (2) what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

If you are reporting a retrospective study of medical records or archived samples, please ensure that you have discussed whether all data were fully anonymized before you accessed them and/or whether the IRB or ethics committee waived the requirement for informed consent. If patients provided informed written consent to have data from their medical records used in research, please include this information.

3. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. Please see http://www.bmj.com/content/340/bmj.c181.long for guidelines on how to de-identify and prepare clinical data for publication. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Reviewer #1: Karapetyan and colleagues initial an interesting study to investigate the factors and building model for COVID-19 and following complications using Germany PCR database. The result showed good accuracy, they provided technical information detailly. However, I have some questions about the study design and the source material.

(1) Was authors investigate the source of study populations? For example, some patient felt uncomfortable so they decided to take PCR test by themselves, others could be notice or required by government. In my point of view, there were quite different in the risk of SARS-CoV-2 infection, if this factor is available and meaningful for model building, I would happy to see revised result, and please describe some related regulation briefly to help reader understand Germany’s policy.

(2) According to your study setting, residence area was categorized to urbanization level and including in the model. Urbanization was an interesting factor, also, provide some information of people’s probability to working, communication, interaction with other people. But my question is, an area, its local COVID-19 prevalence, maybe more important than the urbanization. Could you consider this factor in your analysis?

(3) Finally, in Results line 186, there was written “For the analysis of defined complications the cohort of test-positives was reduced to 46 071 participants with available data for the first quarter after the index quarter of PCR test”. What is it mean about “available”? Please address in detail.

Reviewer #2: In this study, the authors used ambulatory claims data to determine possible risk factors for (1) COVID infection, using a test-negative design (2) cardiovascular or pulmonary complications in patients with positive COVID tests, using a cohort design. The authors also developed a rule to predict the risk of cardiovascular or pulmonary complications among those with positive COVID tests.

Overall, this is a well-designed study that answers important clinical questions using available data. I only have some minor points for the authors to consider: ​

1. In the model mentioned in page 11 (line 208-217), do the P values account for multiple comparison and the usage of stepwise regression? If not, it can be serious inflated and should not be used to argue that the interaction terms are statistically significant (so a separate risk factor model should be constructed for test-positives). Since the authors are essentially testing the relative fit between models with and without interaction terms, maybe it would more sound to calculate the likelihood ratio between the two models and use bootstrap to test for its significance.

2. In a test-negative design, it is optimal that the symptoms prompting the patients to undergo testing are similar between test-positives and test-negative, so as to prevent certain risk groups showing different patterns of probabilities of receiving tests, which then introduces bias. For example, if obese patients are more alert to anosmia than others because they know they're at higher risk of complications once infected by COVID, then the testing rate of test-positive obese patients would be higher than others, leading to the conclusion of "higher risk of infection" when the test-negative design is applied. The authors may discuss more on possible scenarios that violate the assumptions of test-negative designs.

3. In the second paragraph of the discussion section, the author attributed the attenuation or inversion of risk to the behavioral adjustment. In a test-negative design, this argument is only valid if the behavioral adjustment decreases the COVID (those who would be test-positive) infection rate more than common cold (those who would be test-negative). Do behavior adjustments demonstrate differential protective effects against COVID vs common cold?

4. Are all patients tested symptomatic, or are there screening tests for asymptomatic patients? If the latter is the case, differential screening rate among different risk groups may also introduce bias.

5. The authors mentioned that the recording of negative test results was optional for physicians (line 304). This could introduce bias if the behavior of selective recording differs between patient groups. For example, the physician is more willing to record the negative tests of a smoking patient than a non-smoking patient, smoking would be falsely considered as a protective factor.

6. The predictive model had adequate performance in the internal validation. However, the generalizability of the model should be assessed by external validation. The authors may list this as one of the limitations of the study.

7. In the discussion section line 284, PPV = 13.3% may seem low to readers. Maybe the authors can reiterate the prevelance of complications (~4%) to demonstrate why this PPV is acceptable.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Ming-Chieh Shih

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Oct 21;16(10):e0258914. doi: 10.1371/journal.pone.0258914.r002

Author response to Decision Letter 0


10 Aug 2021

Rebuttal Letter

Dear Editor, dear Reviewers,

Thank you for reviewing our manuscript and for your helpful comments and suggestions for improvement. In the following, we address these point by point (line references refer to the version of the tracked changes).

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

R: Thank you for the templates. We hope to meet the Journal style requirements by updating the following parts:

• In the author names we have moved the commas to the end.

• We have now updated figure citations (e.g. „Figure 1“ is now „Fig 1“, titles are bold) and figure files naming (e.g. „Figure 1.tiff“ is now „Fig1.tif“).

• We have formatted table citations (titles are now bold).

• We have now updated Supporting Information citations in the text and listed Supporting Information captions at the end of the manuscript in a section titled “Supporting information”. In addition, we uploaded Supporting Information files separately.

• We have updated the font size for the headings.

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified (1) whether consent was informed and (2) what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

If you are reporting a retrospective study of medical records or archived samples, please ensure that you have discussed whether all data were fully anonymized before you accessed them and/or whether the IRB or ethics committee waived the requirement for informed consent. If patients provided informed written consent to have data from their medical records used in research, please include this information.

R: The data are pseudonymized (the data are not technically anonymized because the source data with identifying information are held by the Bavarian Association of Statutory Health Insurance Physicians). We hope to meet your requirements by reformulating ethics statement: „The underlying data for this study are pseudonymized and the study was approved by the Ethics Commission of the Technical University of Munich (Ethikkommission der Technischen Universität München) (approval No 673/20 S-EB).“ In addition, we addressed details regarding participant consent in the Methods section, specifically that no further participant consent was required because the analyses were based on secondary billing data and were conducted in accordance with the German guideline "Good Practice in Secondary Data Analysis." (see lines 79-81)

3. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. Please see http://www.bmj.com/content/340/bmj.c181.long for guidelines on how to de-identify and prepare clinical data for publication. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

R: We updated Data Availability statement to reflect Journal requirements: „The data are held by the Bavarian Association of Statutory Health Insurance Physicians (BASHIP) and availability is restricted by a contractual agreement. The data are therefore not publically available due to data protection regulations, but may be obtained from the authors upon reasonable request and with the consent of the BASHIP (versorgungsforschung@kvb.de).“

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

R: Questions without reviewer comments (1.-4. and 6.) and notes have been removed to improve readability.

5. Review Comments to the Author

Reviewer #1: Karapetyan and colleagues initial an interesting study to investigate the factors and building model for COVID-19 and following complications using Germany PCR database. The result showed good accuracy, they provided technical information detailly. However, I have some questions about the study design and the source material.

R: Thank you for your review and the valuable comments.

(1) Was authors investigate the source of study populations? For example, some patient felt uncomfortable so they decided to take PCR test by themselves, others could be notice or required by government. In my point of view, there were quite different in the risk of SARS-CoV-2 infection, if this factor is available and meaningful for model building, I would happy to see revised result, and please describe some related regulation briefly to help reader understand Germany’s policy.

R: Thank you for your note. We also think it would be interesting to include this factor in the models. However, the distinction between codes describing the reason for taking the PCR test did not exist at the beginning of the pandemic; it was introduced during the third quarter in 2020 and, unfortunately, was not used consistently by physicians after introduction, as we have seen in our data, so we could not include this factor in the model. We do now describe Germany’s testing strategy in the methods section (see lines 72-75).

(2) According to your study setting, residence area was categorized to urbanization level and including in the model. Urbanization was an interesting factor, also, provide some information of people’s probability to working, communication, interaction with other people. But my question is, an area, its local COVID-19 prevalence, maybe more important than the urbanization. Could you consider this factor in your analysis?

R: Thank you for this valuable point. We indeed did not consider this factor in our analysis. However, our objective was not to follow infection chains or to describe hotspots, but to adjust our analyses to the more general effects of different settlement and health care supply densities. We added this to the methods section and now we write: „To adjust for different settlment and health care supply densities we included a measure of urbanization …“ (see lines 106-107).

(3) Finally, in Results line 186, there was written “For the analysis of defined complications the cohort of test-positives was reduced to 46 071 participants with available data for the first quarter after the index quarter of PCR test”. What is it mean about “available”? Please address in detail.

R: Thank you for addressing this. We hope to make this clear by reformulating the sentence (see line 194).

Reviewer #2: In this study, the authors used ambulatory claims data to determine possible risk factors for (1) COVID infection, using a test-negative design (2) cardiovascular or pulmonary complications in patients with positive COVID tests, using a cohort design. The authors also developed a rule to predict the risk of cardiovascular or pulmonary complications among those with positive COVID tests.

Overall, this is a well-designed study that answers important clinical questions using available data. I only have some minor points for the authors to consider:

R: Thank you for your review and the supportive comments.

1. In the model mentioned in page 11 (line 208-217), do the P values account for multiple comparison and the usage of stepwise regression? If not, it can be serious inflated and should not be used to argue that the interaction terms are statistically significant (so a separate risk factor model should be constructed for test-positives). Since the authors are essentially testing the relative fit between models with and without interaction terms, maybe it would more sound to calculate the likelihood ratio between the two models and use bootstrap to test for its significance.

R: Thank you very much for raising this important point. The mentioned P values indeed do not account for multiple comparison, however we choose the model based on Akaike’s Information Criterion (AIC). We now removed the P values and pointed out that we compared the models based on a descriptive likelohood-ratio test. In the methods section we now write: “Goodness-of-fit of these nested models was compared by a descriptive likelihood-ratio test without formal adjustment for AIC-based model selection.” (see lines 140-141) Further, in the results section we write now: “… a multivariable regression model with forward stepwise variable selection by AIC included interaction effects between the PCR test result and the investigated risk factors …” (see lines 218-223). We hope this will clear up any confusion.

2. In a test-negative design, it is optimal that the symptoms prompting the patients to undergo testing are similar between test-positives and test-negative, so as to prevent certain risk groups showing different patterns of probabilities of receiving tests, which then introduces bias. For example, if obese patients are more alert to anosmia than others because they know they're at higher risk of complications once infected by COVID, then the testing rate of test-positive obese patients would be higher than others, leading to the conclusion of "higher risk of infection" when the test-negative design is applied. The authors may discuss more on possible scenarios that violate the assumptions of test-negative designs.

R: Thank you for raising this important point. We discussed it in the limitations detailed in the discussion section (see lines 314-317).

3. In the second paragraph of the discussion section, the author attributed the attenuation or inversion of risk to the behavioral adjustment. In a test-negative design, this argument is only valid if the behavioral adjustment decreases the COVID (those who would be test-positive) infection rate more than common cold (those who would be test-negative). Do behavior adjustments demonstrate differential protective effects against COVID vs common cold?

R: We agree with the point that the argument would be valid if behavioral adjustment is more successful in reducing risk for COVID-19 than risk for other infections. However, this is only true in a setting where only symptomatic participants are tested. Because we also have asymptomatic participants in our cohort, i.e. within the test-positives and within the test-negatives, we cannot directly state that. We therefore now write: „The latter might be explained with possible behavioural adjustment in the patients belonging to respective vulnerable subgroups, however, this assumption might be impaired as the cohort of tested participants also includes asymptomatic participants.” (see lines 268-269)

4. Are all patients tested symptomatic, or are there screening tests for asymptomatic patients? If the latter is the case, differential screening rate among different risk groups may also introduce bias.

R: There have also been PCR tests for asymptomatic patients, e.g. for travelers, asymptomatic individuals in health care or other vulnerable sectors, contacts (with criteria of exposure or disposition) etc. This can indeed introduce bias to the assessment of total risks. However, we assume that the assessment of relative risks is unaffected. Therefore, the association between risk factors and the outcome, the estimated odds ratios, should be unbiased.

5. The authors mentioned that the recording of negative test results was optional for physicians (line 304). This could introduce bias if the behavior of selective recording differs between patient groups. For example, the physician is more willing to record the negative tests of a smoking patient than a non-smoking patient, smoking would be falsely considered as a protective factor.

R: Thank you very much for this valuable point. We added it to the limitations detailed in the discussion section (see lines 320-322).

6. The predictive model had adequate performance in the internal validation. However, the generalizability of the model should be assessed by external validation. The authors may list this as one of the limitations of the study.

R: We agree with this point and add this now as one of the limitations of our study (see lines 306-307).

7. In the discussion section line 284, PPV = 13.3% may seem low to readers. Maybe the authors can reiterate the prevelance of complications (~4%) to demonstrate why this PPV is acceptable.

R: Thank you for your suggestion. We now explain this in the discussion section (see lines 295-297).

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

R: We uploaded our figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool to ensure that our figures meet PLOS requirements. We have updated our figures accordingly.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 1

Huei-Kai Huang

23 Aug 2021

PONE-D-21-20330R1

SARS-CoV-2 infection and cardiovascular or pulmonary complications in ambulatory care: a risk assessment based on routine data

PLOS ONE

Dear Dr. Karapetyan,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

ACADEMIC EDITOR: Please mention and discuss your assumption in the Discussion section (please refer to the reviewer comment).

Please submit your revised manuscript by Oct 07 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Huei-Kai Huang, M.D.

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #2: 4. Are all patients tested symptomatic, or are there screening tests for asymptomatic patients? If the latter is the case, differential screening rate among different risk groups may also introduce bias.

R: There have also been PCR tests for asymptomatic patients, e.g. for travelers, asymptomatic individuals in health care or other vulnerable sectors, contacts (with criteria of exposure or disposition) etc. This can indeed introduce bias to the assessment of total risks. However, we assume that the assessment of relative risks is unaffected. Therefore, the association between risk factors and the outcome, the estimated odds ratios, should be unbiased.

The authors may consider mentioning the assumptions they made in the manuscript.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Ming-Chieh Shih

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Oct 21;16(10):e0258914. doi: 10.1371/journal.pone.0258914.r004

Author response to Decision Letter 1


24 Sep 2021

Rebuttal Letter

Dear Editor, dear Reviewers,

Thank you very much for considering our manuscript as potentially acceptable for publication in PLoS One. We incorporated all of the comments into the revised version and attached a point by point response to all comments (line references refer to the version of the tracked changes). We would be pleased to see the actual version published in PLoS One.

ACADEMIC EDITOR: Please mention and discuss your assumption in the Discussion section (please refer to the reviewer comment).

R: Thank you for your suggestion. We have discussed our assumptions in the Discussion section as proposed from the Reviewer (see lines 335-339).

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

R: Thank you for your note. We have checked our reference list. All articles are still available and none have been retracted. We have noted that the article by Ioannisids J. (2020) has now been published in Bulletin of the World Health Organization and have changed the reference accordingly:

• Ioannidis JPA. The infection fatality rate of COVID-19 inferred from seroprevalence data. medRxiv. 2020;2020.05.13.20101253 � Ioannidis, J. P. Infection fatality rate of COVID-19 inferred from seroprevalence data. Bull World Health Organ. 2021;99(1), 19.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

6. Review Comments to the Author

Reviewer #2: 4. Are all patients tested symptomatic, or are there screening tests for asymptomatic patients? If the latter is the case, differential screening rate among different risk groups may also introduce bias.

R: There have also been PCR tests for asymptomatic patients, e.g. for travelers, asymptomatic individuals in health care or other vulnerable sectors, contacts (with criteria of exposure or disposition) etc. This can indeed introduce bias to the assessment of total risks. However, we assume that the assessment of relative risks is unaffected. Therefore, the association between risk factors and the outcome, the estimated odds ratios, should be unbiased.

The authors may consider mentioning the assumptions they made in the manuscript.

R: Thank you for this valuable suggestion. We have now reformulated our explanation and now write in the methods section: “During the evaluation period from February to the end of September 2020 (i.e., first to third quarter 2020), patients suspected to suffer from COVID-19 infection received naso-pharyngeal swabs for PCR testing in general practice. According to the national testing strategy, participants without symptoms could also be tested in general practice, for example travelers from risk areas, staff in health care or other vulnerable sectors, and contacts of infected persons. However, these cases were to be billed separately by the Ministry and were thus not documented as claims data.” (see lines 75-81). We have also discussed this in the limitations detailed in the discussion section, where we write: “Beyond that, the possibility of asymptomatic participants in a cohort can presumably introduce bias. However, tests of asymptomatic patients were to be billed separately by the government and consequently not documated as claims data for the BASHIP. Therefore, the association between risk factors and the outcome, the estimated odds ratios, should be unbiased in this respect.” (see lines 335-339).

In addition, we have corrected one formulation: “… this assumption might be impaired as the cohort of tested participants may also include asymptomatic participants.” (see lines 279-280). We are sorry for the confusing wording before and hope this clears up any confusion.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 2

Huei-Kai Huang

8 Oct 2021

SARS-CoV-2 infection and cardiovascular or pulmonary complications in ambulatory care: a risk assessment based on routine data

PONE-D-21-20330R2

Dear Dr. Karapetyan,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Huei-Kai Huang, M.D.

Academic Editor

PLOS ONE

Acceptance letter

Huei-Kai Huang

13 Oct 2021

PONE-D-21-20330R2

SARS-CoV-2 infection and cardiovascular or pulmonary complications in ambulatory care: a risk assessment based on routine data

Dear Dr. Karapetyan:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Huei-Kai Huang

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. International classification of diseases 10th revision (ICD-10) diagnoses included in the multivariable analyses.

    (PDF)

    S2 Table. Odds ratio (OR) and 95% confidence intervals (CI) for the risk of cardiovascular or pulmonary complications.

    Multivariable binary regression model adjusted for age, sex, urbanisation, nursing home living and diseases shown.

    (PDF)

    S3 Table. Characteristics of participants in derivation and validation cohorts; n (%).

    (PDF)

    Attachment

    Submitted filename: Response to Reviewers.docx

    Attachment

    Submitted filename: Response to Reviewers.docx

    Data Availability Statement

    The data are held by the Bavarian Association of Statutory Health Insurance Physicians (BASHIP) but restrictions apply to the availability of these data, which were used within the framework of the contractual agreement. The data are not publicly available due to data protection regulations, but may be obtained from the authors upon reasonable request and with the consent of the BASHIP (versorgungsforschung@kvb.de).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES