Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2024 Jun 3;14:12713. doi: 10.1038/s41598-024-63184-8

Predictive biomarkers of COVID-19 prognosis identified in Bangladesh patients and validated in Japanese cohorts

Kazuko Uno 1,, Abu Hasan 2, Emi E Nakayama 3, Rummana Rahim 2, Hiromasa Harada 4, Mitsunori Kaneko 5, Shoji Hashimoto 6, Toshio Tanaka 7, Hisatake Matsumoto 8, Hitoshi Fujimiya 9, Tatsuo Shioda 3, Mizanur Rahman 2, Kazuyuki Yoshizaki 10
PMCID: PMC11148188  PMID: 38830928

Abstract

Despite high vaccination rates globally, countries are still grappling with new COVID infections, and patients diagnosed as mild dying at home during outpatient treatment. Hence, this study aim to identify, then validate, biomarkers that could predict if newly infected COVID-19 patients would subsequently require hospitalization or could recover safely with medication as outpatients. Serum cytokine/chemokine data from 129 COVID-19 patients within 7 days after the onset of symptoms in Bangladesh were used as training data. The majority of patients were infected with the Omicron variant and over 88% were vaccinated. Patients were divided into those with mild symptoms who recovered, and those who deteriorated to moderate or severe illness. Using the Lasso method, 15 predictive markers were identified and used to classify patients into these two groups. The biomarkers were then validated in a cohort of 194 Covid patients in Japan with a predictive accuracy that exceeded 80% for patients infected with Delta and Omicron variants, and 70% for Wuhan and Alpha variants. In an environment of widespread vaccination, these biomarkers could help medical practitioners determine if newly infected COVID-19 patients will improve and can be managed on an out-patient basis, or if they will deteriorate and require hospitalization.

Keywords: COVID-19 prognosis, COVID-19 biomarker, Lasso method, Bangladesh, Japan

Subject terms: Immunology, Microbiology, Medical research

Introduction

The COVID-19 pandemic has infected over 700 million people and resulted in more than 7 million deaths worldwide as of September 2023. Prior to the availability of vaccines, approximately 80% of infected patients were asymptomatic or mildly ill, while 15%, mainly the elderly, developed severe pneumonia, and 5% progressed to the fatal acute respiratory distress syndrome (ARDS)1. Severe COVID-19 has also been associated with various complications, including vasculitis, thrombosis, cerebral infarction, myocardial damage, and multiple organ failure2,3. Risk factors for severe COVID-19 symptoms include underlying conditions such as cardiovascular disease, hypertension, diabetes, chronic lung disease, and chronic kidney disease, as well as demographic factors like age, obesity, and smoking47.

In light of the above, hospitalization rates across the globe was extremely high, leading to shortages in not only hospital beds, but also medical personnel, ventillators and other essential medical equipment. In Japan for example, at the beginning of the pandemic, mortality rate was high and all COVID patients were hospitalized which caused a massive over-run on medical facilities. As a solution, the government mandated that some private hotels were to be used to house infected patients; this lasted until May 2023 well after vaccination had started.

Mortality rates gradually decreased when global vaccination began in early 2021, by end of that year vaccination rates stood at about 80% in Japan and 90% in individuals aged over 50 (https://www.mhlw.go.jp/stf/seisakunitsuite/bunya/kenkou_iryou/kenkou/kekkaku-kansenshou/yobou-sesshu/syukeihou_00002.html). However, the emergence of the Omicron and other variants presented new challenges and in August 2021 Japan had a doubling of infections with the span of a week even in rural areas. The initial Omicron variant, BA.1, showed reduced affinity for lung epithelial cells but higher transmissibility, resulting in attenuated disease severity810. Subsequent Omicron variants such as BA.5 and XBB regained affinity for the lungs, and the effectiveness of treatments with anti-S protein monoclonal antibodies declined due to escape mutations11,12. Antiviral drugs such as Remdesivir, Ritonavir-boosted Nirmatrelvir and Molnupiravir13 were available but the number of deaths in Japan in 2022 exceeded the number of cases seen in 2020–2021, thereby overwhelming hospital capacity. As a result, due to new infections skyrocketing, even the makeshift hospitals housed inside hotels, exceeded maximum capacity. Hence since early 2022 in Japan, only cases that were judged as being severe were hospitalized, leading to a well-needed steady decrease in the number patients who were admitted. By September 2022, out-pattient treatment had become commonplace.

Outpatient care however presented other major challenges including high death rates among the elderly and among patients who were considered as mild on their first hospital visit. Data from the Japanese Ministry of Health Labour and Welfare in 2022, showed that among patients who died during home care, 58% were 80 years and older, and only 20% were confirmed as being unvaccinated. Additionally, 42% of these patients who died while being treated on an out-patient basis were initially diagnosed as mild cases (https://www.mhlw.go.jp/content/10900000/001021500.pdf).

In this context where COVID patients can still experience severe disease progression despite the widespread use of vaccines, having a predictive model that can easily can distinguish patients at high risk of deterioration with just a blood sample would be highly beneficial.

Patients with severe COVID-19 have exhibited elevated levels of inflammatory cytokines and chemokines4,5,7,14,15, especially Interleukin 1 (IL-1), Tumor Necrosis Factor alpha (TNFα), and IL-61619. This has led to the hypothesis that ARDS in COVID-19 is driven by a cytokine storm20,21. With this in mind, in this study, we quantified cytokines/chemokines/soluble receptors in serum samples from COVID-19 patients in Bangladesh who visited the hospital during the omicron stage within 7 days on the onset of symptoms. We used machine learning techniques to successfully identify a combination of 15 biomarkers that could predict if patients would subsequently deteriorate to severe COVID-19 or if they would have only mild symptoms that could be treated with out-patient care.

Results

Predicting patients’ outcome using pretreatment serum cytokine/chemokine levels within 7 days of onset of COVID-19 from Bangladesh data

The serum from patients in Bangladesh collected December 2021–December 2022 was used to train this decision model for several reasons (Table 1). First, the Japanese serum of COVID patients covers a period of approximately 2 years, during which several COVID-19 variants (Wuhan, Alpha, Delta, and Omicron) emerged, raising concern about the model’s reliability due to the diversity of the data. In contrast, the Bangladesh data was limited to a single strain, and serum sample were collected immediately after the onset of the symptoms as described in the Materials and Methods (Table 2). For these reasons, the Bangladeshi samples were more likely to satisfy the uniformity required for the learning model, and therefore the biomarkers idenitified in the Bangladeshi sample were used to predict recovery or deterioration in Japanese patients.

Table 1.

Demographic of COVID-19 patients.

Demographic data of COVID-19 patients
Characteristics Mild Moderate Severe Total
Bangladesh (n) 64 46 19 129
 Age--median year (IQR) 47 (27–59) 68 (55–76) 75 (60–80) 59 (36–73)
 Sex (Male: Female) 43:21 21:25 12:7 76:53
Japan (n) 51 25 121 197
 Age-median year (IQR) 64 (17–99) 73 (20–93) 75(19–97) 73(17–99)
 Sex (Male: Female) 22:29 16:9 85:36 123:74

Table 2.

Timing of sampling of patient specimens.

Period Bangladesh Japan
2021/12/25–2022/9/21 2020/6/14–2020/10/9 2020/10/10–2021/2/28 2021/3/1–2021/6/20 2021/6/21–2021/10/17 2021/12/26–2022/6/19
Wave in Japan 2nd wave 3rd wave 4th wave 5th wave 6th wave
Virus main variant Omicron Wuhan Wuhan Alpha Delta Omicron
Final outcome
 Mild: n (%) 64 (49.6) 3 (15.8) 19 (25.7) 11 (25.6) 10 (31.3) 8 (27.6)
 Moderate: n (%) 46 (35.7) 1 (5.3) 10 (13.5) 5 (11.6) 6 (18.8) 3 (10.3)
 Severe: n (%) 19 (14.7) 15 (78.9) 45 (60.8) 27 (62.8) 16 (50) 18 (62.1)
Total: n 129 19 74 43 32 29

The prediction model was constructed by categorizing patients in two groups. The first group was labeled “no hospitalization required” because they were only mildly ill (N = 64 patients) throughout the course of the disease and did not develop pneumonia (although some wealthy patients were hospitalized due to anxiety). The second group (N = 65) was labeled “hospitalization required”, because although they were only mildly ill initially, they subsequently developed pneumonia. Additionally, data from 23 healthy individuals were added to the “no hospitalization required” group to ensure that the model could reliably predict that hospitalization is not necessary when the data from healthy subjects were entered. When the number of patients in each group was known, variables were selected using Logistic LASSO regression22 with the weight of the inverse of the ratio of the number of each on the objective variable side, and we identified a stable 15-biomarker model with an AUC of 0.9321 (Fig. 1A).

Figure 1.

Figure 1

Comparison of ROC curves generated using Bangladeshi and Japanese data. (A) ROC curves obtained by evaluating the performance of the logistic regression model using the Bangladeshi training data. (B) ROC curves resulting from applying the same logistic regression model to the validation data of the Japanese population.

As the number of patients in each group is known, the variables were selected using Logistic LASSO regression22 with weights on the objective variable side that are the inverse of the respective headcount ratios, and 15 predictive markers were identified.

Table 3 presents an overview of the performance metrics for this logistic regression model. For the training data (Bangladesh), the model showed a specificity of approximately 91.95%, a sensitivity of 84.85%, and an overall accuracy of 88.89%. Both the negative predictive value (NPV) and the positive predictive value (PPV) were found to be 88.89%. When validated with the Japanese dataset, the results were similar, although the model displayed a lower specificity of about 87.32% and a lower sensitivity of 86.30%. With an accuracy of 86.81%, it was also slightly lower than the training dataset. The NPV was 86.11%, while the PPV were both at 87.50%. These metrics suggest that the logistic regression model performance was consistent across the two diverse datasets, demonstrating robust specificity, sensitivity, and accuracy in both training and validation phases.

Table 3.

Performance Metrics of the Logistic Regression Model for Both Training and Validation Datasets.

Dataset Specificity Sensitivity Accuracy NPV PPV
Bangladesh 0.9195 0.8485 0.8889 0.8889 0.8889
Japan 0.8732 0.863 0.8681 0.8611 0.875

The 15 predictive biomarkers extracted from the Bangladesh data are: BAFF, CTACK, Eotaxin, HGF, IL-6, IL-13, M-CSF, MCP-3, MMP-3, Osteocalcin, SDF-1a, TNF-R1, TNF-R2, TRAIL and TWEAK (Table 4). The data for Bangladesh and Japanese patients, divided into healthy, mild, and severe cases are shown in Fig. 2. BAFF, CTACK, HGF, IL-6, M-CSF, MMP-3, TNF-R1, and TNF-R2 were higher in the severe group than in the mild group, while TRAIL, osteocalcin, and TWEAK showed a decreasing trend. The patients’ final outcomes and the percentage of those outcomes that were correctly predicted by the model are shown in Table 5.

Table 4.

The best combination of predictive markers.

Coefficients
(Intercept) − 11.615
BAFF 0.663
CTACK 0.215
Eotaxin 0.379
HGF 1.854
IL-6 0.497
IL-13 0.491
M-CSF 0.499
MCP-3 0.029
MMP-3 0.011
Osteocalcin − 0.595
SDF-1a − 0.569
TNF-R1 1.048
TNF-R2 1.983
TRAIL − 1.668
TWEAK − 0.734

Figure. 2.

Figure. 2

Overview of value of fifteen cytokine/chemokine/soluble receptors in comparison with mild and moderate/severe patients Bangladesh and Japan. Fifteen predictive markers selected from the Bangladesh data are shown. The data are divided into healthy (He1, He2), Bangladesh (BD) and Japanese (JP) mild, and severe cases. The boxplots show medians (middle line) with first and third quartiles (boxes), while the whiskers show 1.5× the interquartile range (IQR) above and below the box.

Table 5.

Percentage of correct predictions for each wave of COVID-19 in Japan by markers extracted from the Bangladesh model.

Bangladesh Japan
Period 2021/12/25–2022/9/21 2020/6/14–2020/10/9 2020/10/10–2021/2/28 2021/3/1–2021/6/20 2021/6/21–2021/10/17 2021/12/26–2022/6/19
2nd wave 3rd wave 4th wave 5th wave 6th wave Japan
Main variant Omicron Wuhan Wuhan Alpha Delta Omicron Total
Final outcome Prediction n (%)
 Mild Mild 53 (41.1) 3(15.8) 12 (16.2) 9 (20.9) 8 (25.0) 6 (20.7) 38
 Mild Severe 14 (10.9) 0(0) 7 (9.5) 2 (4.7) 2 (6.3) 2 (6.9) 13
 Severe Mild 13 (10.1) 5 (26.3) 10 (13.5) 14 (32.6) 3 (9.4) 3 (10.3) 41
 Severe Severe 50 (38.7) 11 (57.9) 45 (60.8) 18 (41.9) 19 (59.4) 18 (62.1) 105
129 19 74 43 32 29 197
Correct (%) 79.8 73.7 77.0 62.8 84.4 82.8

Attempting to predict prognosis in Japanese patients by extracted predictive markers

The Bangladesh training data set (Table 4) were fitted to the data of Japanese patients to predict their prognosis and compare the predictions to patients’ actual clinical outcomes. As shown in Fig. 1B, the results yielded an AUC value of 0.9219 suggesting that the extracted biomarkers made accurate predictions of the prognosis of patients in Japan (Table 5). Comparing the biomarker predictions with the actual clinical outcomes of Japanese subjects, particularly in the second and fourth waves, showed that a high percentage of subjects (26.3% and 39.5%, respectively) were judged as “mild” initially, even though their actual outcome was “severe/moderate”. However, the percentage of correct predictions increased in the 5th and 6th waves. In addition, 60% of the respondents whose predictions were inaccurate were aged 70 years or older, with 40% being over 80 years of age. This suggests that elderly patients are at higher risk of deterioration and should be carefully monitored, even if they initially present with only mild symptoms23,24.

In waves 5 and 6, after the vaccine was widely available in Japan, the accuracy rate of the predictive model exceeded 80%. In Bangladesh, approximately 88% of patients were vaccinated, and n Japan, the vaccine became available to the elderly on May 24, 2021, and to ages 12 and older in August. By August, more than 80% of the elderly had been vaccinated. Our results indicate that the predictive biomarker panel is particularly useful in determining the risk of severe disease in COVID-19 patients in environments where public vaccination rates are high.

Discussion

Since late 2021, the development and widespread use of COVID-19 vaccines and therapeutics has reduced the severity of the disease. As a result, many patients are cured with medication and home care, however, some later become seriously ill and require hospitalization. It is therefore important to determine early in the course of the disease whether patients will recover from the mild symptoms or will worsen and need to be hospitalized. We therefore measured serum cytokine/chemokine/soluble receptor levels in untreated COVID-19 patients presenting at the MILD stage within 7 days of the onset of the disease. The patients were then identified whether they improved as they were or deteriorated to moderate/severe, from which deterioration markers were calculated using the least absolute shrinkage selection operator (LASSO) regression analysis method. We believe that cytokines/chemokines/soluble receptors in the serum reflect a variety of conditions, including the patient's underlying disease, and therefore can be used to determine the worsening status even in the absence of clinical background information.

Data from Evercare Hospital Dhakab in Bangladesh, where patients had nearly identical viral strains, was used as training data for the analysis. From the data 15 biomarkers were extracted: BAFF, CTACK, Eotaxin, HGF, IL-6, IL-13, M-CSF, MCP-3, MMP-3, Osteocalcin, SDF-1a, TNF-R1, TNF-R2, TRAIL and TWEAK (Table 4)25. Based on this group of biomarkers, we predicted patient prognosis by applying it to the cytokines/chemokines/soluble receptors data of Japanese patients. The Bangladesh data was collected over 9 months, whereas the Japanese data was collected over a 2-year span. The Japanese data included patients from wave 2 to wave 6 during which time the viral strains were evolving. Therefore, when calculated, the correct response rate from wave 2 to wave 4 was slightly lower, especially in wave 4, where there were more alpha strains (Table 5). This may be because the alpha strain is highly infectious and carries a high risk of mortality.

Interestingly, during waves 2–4, the model tended to underestimate the severity of disease progression in elderly patients, with 60% of the incorrectly predicted patients being over 70 years of age, and 40% being over 80 years old. These cases were initially predicted by the bio-markers to have a mild outcome but patients actually became severe; disease severity was also incorrectly predicted in older patients whose cytokine levels did not significantly increase. Based on these results, it can be said that the biomarkers selected in this study are very effective for predicting the prognosis of COVID-19 patients after widespread vaccination among the public and that caution should be exercised with elderly patients, even if their symptoms are initially classified as mild.

Data from Evercare Hospital Dhakab, Bangladesh, showed that CTACK, G-CSF, HGF, IL-2Ra, IL-6, IL-8, IL-10, IL-12p40, M-CSF, PDGF, SCF, BAFF, CD-30, CD-163, sgp130, IL-11, MMP-3, Pentraxin-3, TNF-R1, TNF-R2 and TSLP were significantly higher in the severe group than in the mild group. The inclusion of many of the markers that differ between severe and mild cases, such as CTACK, HGF, IL-6, M-CSF, BAFF, MMP-3, TNF-R1 and TNF-R2. In this study, differential markers identified for mild and severe COVID-19 cases support the validity of these markers.

As shown in Fig. 2, BAFF, CTACK, HGF, IL-6, M-CSF, MCP-3, MMP-3, TNF-R1 and TNF-R2 tended to increase when patients worsened. We theorize that there may be an underlying mechanism that connects these biomarkers with COVID severity. Among these markers some are associated with inflammation, mortality or lung damage. For example, IL-6, an important mediator of cytokine release syndrome (CRS) toxicity23,2634, signals inflammatory response leading to coagulation, similar to HGF35. Additonally, expression levels of HGF and MCP-3 were reported to correlate positively with the Murray score used to assess the severity of lung injury in acute respiratory distress syndrome (ARDS)14,3638. Other cytokines such as MMP3 also play an important role in lung pathological processes such as ARDS, ALI, and lung fibrosis3941. Predictors of motality include soluble tumor necrosis factor receptor 1 and 2 which have been reported as predictive markers of death in patients with severe COVID-1942,43. As well, serum TWEAK levels rose during the first week of patients being in intensive care unit (ICU), whereas a decline to baseline values were observed in the second week post-ICU admission (p = 0.032) but not for patients who died while in hospital25. An analysis of receiver-operator characteristics demonstrated that serum TWEAK at the time patients were admitted to ICU is a significant predictor of in-hospital mortality (AUC = 0.689, p = 0.019) in that TWEAK showed a decreasing trend when patients worsened (Fig. 2). Finally BAFF levels hinted at significantly higher risk of both in-hospital and 30-day mortality44. To summarize, many of the predictive bio-markers we identified were associated with symptoms typical of COVID-19 in particular lung damage which exacerbates symptoms in COVID patients compared to other viral diseases such as influenza.

While several cytokines and chemokines have been identified as markers for COVD-19 exacerbation, there are still few reports on markers for COVID-19 in patients that have been vaccinated. This study, successfully identified a panel of 15 biomarkers that could predict, with particularly high accuracy, whether vaccinated COVID-19-infected individuals would subsequently have mild symptoms or worsen to medium or severe states that required inpatient treatment. This was done by quantifying serum cytokines within seven days of COVID-19 onset.

A major strength of this study includes the use of a standardized, well-characterized dataset from Bangladesh to develop the predictive model, and the subsequent validation of the model using a diverse dataset from Japan, which spanned multiple COVID-19 variants. The consistent performance of the model across these two distinct populations suggests its robustness. As well, another strength of this predictive model is that blood cytokines and chemokines can be used to predict patient outcome without other clinical markers or details relating to patients’ history of other health issues that may affect the underlying disease status of the patient. Our previous studies have shown that serum cytokine/chemokine levels also reflect the patient's overall health, including the state of any underlying diseases45,46 and this allows the model to have potential for broader application. As well, even though high percentages of the population is vaccinated, the future of COVID-19 is not certain, and there are still moderate to severe cases presenting at hospitals in addition to mortality among patients diagnosed as mild cases. Our predictive model could help to alleviate hospital congestion by quickly identifying patients that can recover safely as out-patients, while identifying mild cases that will subsequently worsen so that these patients can be hospitalized and receive preventative treatment. This is important particurly in Japan, since as an aging nation, it has a high percentage of patients over 70 who are vulnerable to deterioration. As well, similar to other least developed nations, public resources have been drained in Bangladesh due to the pandemic, hence being able to ascertain with high accuracy which COVID-19 cases are appropriate for out-patient treatment versus those that require hospitalization can go a long way in using the country’s limited and strained resources efficiently, while still protecting the health of citizens.

Materials and methods

Subjects

Patients with COVID-19 in Bangladesh

Patients with clinically suspected SARS-CoV-2 infection who visited Evercare Hospital Dhaka between December 25, 2021 and September 21, 2022 were considered for this study. The study was approved by the Ethical Practice Committee of Evercare Hospital Dhaka (approval number ERC 33/2022-01) and the Research Ethics Committee of the Research Institute for Microbial Diseases, Osaka University, Japan (No. 2021-3).

From the patients considered, 129 confirmed COVID-19 patients who were within 7 days of onset were included. Most of these cases in Bangladesh coincide with the sixth and part of the seventh wave in Japan. Patients were recruited at the first visit to the hospital, blood serum was collected, and the attending physician classified the outcomes as mild, moderate, or severe according to the criteria established by the World Health Organization (WHO)47 (Table 1), at the time of discharge. Disease severity was not determined on the patients’ first visit, rather it was on their case sheets at the time they were discharged from the hospital. Laboratory confirmed Mild COVID-19 cases were those with one or more symptoms (e.g., fever, cough, runny nose, fatigue, headache, nausea, vomiting, diarrhea, chest pain, abdominal pain, and loss of taste or smell), but lacked shortness of breath, dyspnea on exertion, and abnormal radiological findings. Laboratory confirmed Moderate COVID-19 cases were those with pneumonia, with oxygen saturation > 93% and may have required low oxygen support. Severe cases developed COVID-19 pneumonia and required hospitalization; patients had dyspnea, respiratory frequency ≥ 30 breaths /min, blood oxygen saturation ≤ 93% on room air, lung infiltrates > 50%, and may have required mechanical ventilation and/or ICU support.

All patients visited the hospital with mild symptoms; 64 subsequently remained in mild condition, 46 declined to moderate, and 19 deteriorated to severe stage . The mean time of hospital visit from onset of symptoms was 2.3 ± 0.12 days for mild patients, 2.3 ± 0.14 days for moderate, and 3.0 ± 0.37(mean ± SE) days for severe patients. COVID-19 were diagnosed based on the PCR tests and the date of onset and vaccination status was recorded by the doctor with the patients’ interview. Most of the patients’ laboratory findings were reported preciously (ref.48).

Patients with COVID-19 in Japan

This sample consisted of 197 patients with clinical suspicion of SARS-CoV-2 infection who were admitted to Habikino hospital, and Tokushukai Hospital from the end of June 2020 to the middle of June 202249,50. All patients provided blood samples on their first visit and written informed consent and the study was approved by the Ethics Committee of Osaka Habikino Medical Center (Approved ID: 150-7), Tokushukai Hospital (TGE01547) and Louis Pasteur Center for Medical Research (LPC.29). This study followed the principles of the Declaration of Helsinki, and was approved by the institutional review board of Osaka University Hospital (No-885). Data for healthy subjects were obtained from Louis Pasteur Center for Medical Research (LPC.8 and LPC.25).

In Japan, the disease severity of patients was determined at hospital admission according to The Guideline for Medical Treatment of COVID-19 (https://www-mhlw-go-jp/content/000785119-pdf). COVID-19 is classified as Mild, Moderate I, Moderate II, and Severe. However, for comparison with Bangladesh, Moderate II and Severe cases were combined and considered as Severe. The severity of illness here refers to the final outcome of the patient and not their condition at the time they were admitted to hospital. The breakdown of patients, the severity classification and age distribution are shown in Table 1. As indicated in Table 1, 197 untreated COVID-19 patients who visited Habikino and Tokushukai Hospitals within 7 days of onset were included. The date of onset was determined by the doctor based on the results of PCR tests and interviews with the patient. The mean number of days to onset were Mild: 3.08 ± 0.32, Moderate: 3.24 ± 0.45, and Severe: 3.71 ± 0.21 days. Ninety one healthy Japanese subjects with an average age of 63.6 ± 1.9 years were also included.

Cytokines/chemokines/soluble receptors assay

Cytokines, chemokines, and soluble receptors were quantified using Bio-Plex 200, a multiplex cytokine array system (Bio-Rad Laboratories, CA, USA) according to the manufacturer's instructions. Blood sera from healthy subjects and COVID-19 patients were collected and centrifuged at 1600 g for 10 min. Serum samples were frozen at − 80 °C until they were analyzed. We simultaneously quantified cytokines, chemokines, and soluble receptors. The Bio-Plex Human Cytokine 48-Plex Panel and Inflammation Panel (Bio-Rad Laboratories, CA, USA) was used to simultaneously quantify 78 items: CTACK, Eotaxin, FGF basic, G-CSF, GM-CSF, GRO-α, HGF, IFN-α2, IFN-γ, IL-1α, IL-1β, IL-1ra, IL-2, IL-2Rα, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-12(p40), IL-12(p70), IL-13, IL-15, IL-16, IL-17, IL-18, IP-10, LIF, MCP-1(MCAF), MCP-3, M-CSF, MIF, MIG, MIP-1α, MIP-1β, β-NGF, PDGF-BB, RANTES, SCF, SCGF-β, SDF-1α, TNF-α, TNF-β, TRAIL, VEGF) and inflammation panel (37 plex: APRIL, BAFF, CD30, CD163, Chitinase, sgp130, IFN-α 2, IFN-β, IFN-γ, sIL-6Ra, IL-10, IL-11, IL-12(p40), IL-12 (p70), IL-19, IL-20, IL-22, IL-26, IL-27, IL-28A, IL-29, IL-32, IL-34, IL-35, LIGHT, MMP-1, MMP-2, MMP-3, Osteocalcin, Osteopontin, Pentraxin-3, sTNF-R1, sTNF-R2, TSLP, TWEAK. The following were excluded from the inflammation panel for data analysis due to overlap with 48-plex: IFN-α2, IFN-γ, IL-2, IL-8, IL-10, IL-12(p40), and IL-12(p70). Patients’ samples were quantified several times over a 2-year period. Since there were lot-to-lot and measurement-to-measurement errors, these data were corrected based on the values of healthy subjects. The Bangladeshi samples were simultaneously measured with healthy Japanese subjects as a control, and the values of the healthy subjects were used as a reference to correct for lot-to-lot and inter-measurement errors; this was then used to make comparisons with the serum samples of patients in Japan. Biomarkers with large inter-kit errors and low measurement sensitivity were excluded from the analysis: IL-19, IL-20, IL-26, IL-28A, IL-29, IL-35, LIGHT.

Statistical analysis

The distribution of cytokine/chemokine/soluble receptor values in healthy controls was analyzed to determine whether the raw values or log-transformed values were more normally distributed. All parameters had log-transformed values that were more normally distributed (data not shown), and so these were used in our analysis45. The t-test results for the data used in Bangladesh and Japan, categorized by mild, moderate, and severe disease are shown in Fig. 2. To correct for inter-measurement error, Japanese healthy controls were used as reference and adjusted accordingly. To allow for multiple comparisons, the p-values were corrected by the Holm using the p.adjust function in the stats package of the R language.

ANOVA was performed and quantitative data was presented as means ± SEM. The significance of the difference between the groups was evaluated using Dunnett’s test with a value of p < 0.05 considered significant. All statistical analyses were carried out with JMP 20.0 Statistical Software (JMP Statistical Discovery LLC, NC, USA).

The aim of the study was to use cytokines/chemokines/soluble receptors data, collected within 7 days of COVID onset, to predict whether a patient would subsequently deteriorate to a moderate disease state or worse and require hospitalization. To achieve this, we employed a binary logistic regression model. To refine the predictors, we utilized the Least Absolute Shrinkage and Selection Operator (LASSO) regression to select relevant cytokines as candidate markers22. The selection of the optimal number of variables was guided by Leave-One-Out Cross Validation (LOO CV). LOO-CV is known to be over-trained, however, in this case, the correct generalization performance could be evaluated using validation data, without the need to consider variations due to partitioning (such as k-fold).

Due to the disproportionate distribution of disease severities in the Bangladeshi COVID-19 patients, we applied weights to the Logistic LASSO regression model to minimize potential bias in the data. In line with the WHO severity classification, the groups that did not require hospitalization were defined as those with mild illness and recovered, while the groups that were hospitalization and required medical treatment were defined as moderate and severe. In addition, healthy individuals were added to the “no hospitalization group” so that the model would correctly predict that hospitalization was not necessary. Here, the number of persons in each group is clinically known. Characteristic markers for the smallest group, especially for the most severely ill patients, may be buried. Therefore, we determined the ratio of the number of people in each breakdown and weighted the objective variable by its reciprocal (the glmnet function of the glmnet package has a weight argument, which can be used to assign weights to the objective variable).

The performance of the logistic regression model was assessed using both training and validation datasets. This included determining the Area Under the Curve (AUC) from the Receiver Operating Characteristic (ROC) curve, and calculating performance metrics such as specificity and sensitivity. All analyses were conducted using the R language v4.2 (https://www.r-project.org/), with the glmnet 4.1-7 package supporting variable selection and logistic regression analysis. The pROC package version 1.18.4 was utilized to evaluate the model's performance through the ROC curve and AUC.

Acknowledgements

This work was supported by JSPS KAKENHI Grant (Grant-in-Aid for Scientific Research (C)) Number JP.20K10460. We thank Center for Infectious Disease Education and Research, CiDER for their support (JM00000160). A.H. is a recipient of a scholarship from the RONPAKU Program of Japan Society for the Promotion of Science (ID no. R12209). We are grateful to Ms. L. Russell for the substantive editing of this manuscript.

Author contributions

Conceptualization: K.U., E.E.N and K.Y. Investigation: K.U., E.E.N., A.H. Data analysis; K.U. and H.F. Resources: R.U., R.R., H.H., M.K., S.H., T.T, H.M., Writing-Original Draft: K.U. and E.E.N.; Writing-Review and Editing: K.Y, Supervision: K.Y, Project Administration: K.Y. and Tat S; Funding Acquisition: Tat.S, K.U. and K.Y.

Data availability

The data that support the findings of this study are available from Osaka Habikino Medical Center, Tokushukai Hospital and Evercare Hospital Dhaka. However, the data is not publicly available since it was obtained under license for the current study. Upon reasonable request howoever, the data may be obtained from the corresponding author Kazuko Uno with permission from the respective institutions.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Sugiyama M, Kinoshita N, et al. Serum CCL17 levels becomes a predictive marker to distinguish mild/moderate and severe/critical disease in patients with COVID-19. Gene. 2021;766:145145. doi: 10.1016/j.gene.2020.145145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Cao X. COVID-19: Immunopathology and its implications for therapy. Nat. Rev. Immunol. 2020;20(5):269–270. doi: 10.1038/s41577-020-0308-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Huang C, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395(10223):497–506. doi: 10.1016/S0140-6736(20)30183-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Herold T, et al. Elevated levels of IL-6 and CRP predict the need for mechanical ventilation in COVID-19. J. Allergy Clin. Immunol. 2020;146(1):128–136-e124. doi: 10.1016/j.jaci.2020.05.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Mehta P, et al. COVID-19: Consider cytokine storm syndromes and immunosuppression. Lancet. 2020;395(10229):1033–1034. doi: 10.1016/S0140-6736(20)30628-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Ruan Q, et al. Clinical predictors of mortality due to COVID-19 based on an analysis of data of 150 patients from Wuhan, China. Intensive Care Med. 2020;46(5):846–848. doi: 10.1007/s00134-020-05991-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wu C, et al. Risk factors associated with acute respiratory distress syndrome and death in patients with coronavirus disease 2019 pneumonia in Wuhan, China. JAMA Intern. Med. 2020;180(7):934–943. doi: 10.1001/jamainternmed.2020.0994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Pulliam JRC, et al. Increased risk of SARS-CoV-2 reinfection associated with emergence of Omicron in South Africa. Science. 2022;376(6593):eabnb4947. doi: 10.1126/science.abn4947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Meng B, et al. Altered TMPRSS2 usage by SARS-CoV-2 Omicron impacts infectivity and fusogenicity. Nature. 2022;603(7902):706–714. doi: 10.1038/s41586-022-04474-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Suzuki R, et al. Attenuated fusogenicity and pathogenicity of SARS-CoV-2 Omicron variant. Nature. 2022;603(7902):700–705. doi: 10.1038/s41586-022-04462-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hoffmann M, et al. Omicron subvariant BA.5 efficiently infects lung cells. Nat. Commun. 2023;14(1):3500. doi: 10.1038/s41467-023-39147-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Aggarwal A, et al. SARS-CoV-2 Omicron BA.5: Evolving tropism and evasion of potent humoral responses and resistance to clinical immunotherapeutics relative to viral variants of concern. EBioMedicine. 2022;84:104270. doi: 10.1016/j.ebiom.2022.104270. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Schilling WHK, et al. Antiviral efficacy of molnupiravir versus ritonavir-boosted nirmatrelvir in patients with early symptomatic COVID-19 (PLATCOV): An open-label, phase 2, randomised, controlled, adaptive trial. Lancet Infect. Dis. 2023 doi: 10.1016/S1473-3099(23)00493-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Yang Y, et al. Plasma IP-10 and MCP-3 levels are highly associated with disease severity and predict the progression of COVID-19. J. Allergy Clin. Immunol. 2020;146(1):119–127-e114. doi: 10.1016/j.jaci.2020.04.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Zhang X, et al. Viral and host factors related to the clinical outcome of COVID-19. Nature. 2020;583(7816):437–440. doi: 10.1038/s41586-020-2355-0. [DOI] [PubMed] [Google Scholar]
  • 16.Chen N, et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: A descriptive study. The Lancet. 2020;395(10223):507–513. doi: 10.1016/S0140-6736(20)30211-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Chen X, et al. Detectable serum severe acute respiratory syndrome coronavirus 2 viral load (RNAemia) is closely correlated with drastically elevated interleukin 6 level in critically ill patients with coronavirus disease 2019. Clin. Infect. Dis. 2020;71(8):1937–1942. doi: 10.1093/cid/ciaa449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Giamarellos-Bourboulis EJ, et al. Complex immune dysregulation in COVID-19 patients with severe respiratory failure. Cell Host Microbe. 2020;27(6):992–1000. doi: 10.1016/j.chom.2020.04.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Huang C, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395:497–506. doi: 10.1016/s0140-6736(20)30183-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Hirano T, Murakami M. COVID-19: A new virus, but a familiar receptor and cytokine release syndrome. Immunity. 2020;52(5):731–733. doi: 10.1016/j.immuni.2020.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.McGonagle D, et al. The role of cytokines including interleukin-6 in COVID-19 induced pneumonia and macrophage activation syndrome-like disease macrophage activation syndrome-like disease. Autoimmun. Rev. 2020;19(6):102537. doi: 10.1016/j.autrev.2020.102537. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Tibshirani R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodol.) 1996;58(1):267–288. doi: 10.1111/j.2517-6161.1996.tb02080.x. [DOI] [Google Scholar]
  • 23.Lee DW, Gardner R, Porter DL, Louis CU, Ahmed N, Jensen M, et al. Current concepts in the diagnosis and management of cytokine release syndrome. Blood. 2014;124(2):188–195. doi: 10.1182/blood-2014-05-552729. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Hirahata K, et al. Characteristics of long COVID: Cases from the first to the fifth wave in greater Tokyo, Japan. J. Clin. Med. 2022 doi: 10.3390/jcm11216457. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Mikacic M, et al. Dynamic of serum TWEAK levels in critically Ill COVID-19 male patients. J. Clin. Med. 2022;11(13):3699. doi: 10.3390/jcm11133699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Coomes EA, Haghbayan H. Interleukin-6 in Covid-19: A systematic review and meta-analysis. Rev. Med. Virol. 2020;30(6):1–9. doi: 10.1002/rmv.2141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Nakayama EE, et al. Anti-nucleocapsid antibodies enhance the production of IL-6 induced by SARS-CoV-2 N protein. Sci. Rep. 2022;12:8108. doi: 10.1038/s4598-022-12252-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Galvan-Roman JM. IL-6 serum levels predict severity and response to tocilizumab in COVID-19: An observational study. J. Allergy Clin. Immunol. 2021;147(1):72–80e78. doi: 10.1016/j.jaci.2020.09.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Chi Y, et al. Serum cytokine and chemokine profile in relation to the severity of coronavirus disease 2019 in China. J. Infect. Dis. 2020;222(5):746–754. doi: 10.1093/infdis/jiaa363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ling L, et al. Longitudinal cytokine profile in patients with mild to critical COVID-19. Front. Immunol. 2021 doi: 10.3389/fimmu.2021.763292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Liu Y, et al. An inter-correlated cytokine network identified at the center of cytokine storm predicted COVID-19 prognosis. Cytokine. 2021 doi: 10.1016/j.cyto.2020.155365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kwon J-S, et al. Factors of severity in patients with COVID-19: Cytokine/chemokine concentrations, viral load, and antibody responses. Am. J. Trop. Med. Hyg. 2020;103(6):2412–2418. doi: 10.4269/ajtmh.20-1110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Queiroz MAF, et al. Cytokine profiles associated with acute COVID-19 and long COVID-19 syndrome. Fron. Cell. Infect. Microbiol. 2022 doi: 10.3389/fcimb.2022.922422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Lucas C, et al. Longitudinal analyses reveal immunological misfiring in severe COVID-19. Nature. 2020;584:463–469. doi: 10.1038/s41586-020-2588-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zaira B, et al. Correlation between hepatocyte growth factor (HGF) with D-dimer and interleukin-6 as prognostic markers of coagulation and inflammation in long COVID-19 survivors. Curr. Issues Mol. Biol. 2023;45(7):5725–5740. doi: 10.3390/cimb45070361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Gelzo M, et al. Matrix metalloproteinases (MMP) 3 and 9 as biomarkers of severity in COVID-19 patients. Sci. Rep. 2022;12(1):1212. doi: 10.1038/s41598-021-04677-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Chen L, et al. Scoring cytokine storm by the levels of MCP-3 and IL-8 accurately distinguished COVID-19 patients with high mortality. Sign. Transduct. Target Ther. 2020;5(1):292. doi: 10.1038/s41392-020-00433-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Perreau M, et al. The cytokines HGF and CXCL13 predict the severity and the mortality in COVID-19 patients. Nat. Commun. 2021;12(1):4888. doi: 10.1038/s41467-021-25191-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Lee HS, Kim WJ. The role of matrix metalloproteinase in inflammation with a focus on infectious diseases. Int. J. Mol. Sci. 2022;23(18):10546. doi: 10.3390/ijms231810546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Kadry R, et al. Pharmacological inhibition of MMP3 as a potential therapeutic option for COVID-19 associated acute respiratory distress syndrome. Infect. Disord. Drug Targets. 2021;21(6):e170721187996. doi: 10.2174/1871526520666201116100310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Shi S, et al. Matrix metalloproteinase 3 as a valuable marker for patients with COVID-19. J. Med. Virol. 2021;93(1):528–532. doi: 10.1002/jmv.26235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Tegethoff SA, et al. TNF-related apoptosis-inducing ligand, interferon gamma-induced protein 10, and C-reactive protein in predicting the progression of SARS-CoV-2 infection: A prospective cohort study. Int. J. Infect. Dis. 2022;122:178–187. doi: 10.1016/j.ijid.2022.05.051. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Choreno-Parra JA, et al. Clinical and immunological factors that distinguish COVID-19 from pandemic influenza A(H1N1) Front. Immunol. 2021;12:593595. doi: 10.3389/fimmu.2021.593595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Mansouri L, et al. Role of kidney function and concentrations of BAFF, sPD-L1 and sCD25 on mortality in hospitalized patients with COVID-19. BMC Nephrol. 2022;23(1):299. doi: 10.1186/s12882-022-02924-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Uno K, et al. Pretreatment prediction of individual rheumatoid arthritis patients' response to anti-cytokine therapy using serum cytokine/chemokine/soluble receptor biomarkers. PLoS One. 2015;10(7):e0132055. doi: 10.1371/journal.pone.0132055.eCollection. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Uno K, et al. Cytokine/chemokine changes in plasma of patients with MPO-ANCA RPGN: Before and after IVIg therapy. ADC Lett. Infect. Dis. Control. 2017;4(2):41–43. [Google Scholar]
  • 47.World Health Organization. Clinical Management of COVID-19. Geneva, Switzerland: WHO. WHO Reference Number:WHO/2019-nCoV/clinical/2020.5 (2020).
  • 48.Hasan A, et al. Enhancement of IL-6 production induced by SARS-CoV-2 nucleocapsid protein and Bangladeshi COVID-19 patients’ sera. Viruses. 2023;15:2018. doi: 10.3390/v15102018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Kayano T, et al. Number of averted COVID-19 cases and deaths attributable to reduced risk in vaccinated individuals in Japan. Lancet Reg. Health West. Pac. 2022;28:100571. doi: 10.1016/j.lanwpc.2022.100571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Baba H, et al. Statistical analysis of mortality rates of coronavirus disease 2019 (COVID-19) patients in Japan across the 4C mortality score risk groups, age groups, and epidemiological waves: A report from the nationwide COVID-19 cohort. Open Forum Infect. Dis. 2023;10:ofac638. doi: 10.1093/ofid/ofac638. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data that support the findings of this study are available from Osaka Habikino Medical Center, Tokushukai Hospital and Evercare Hospital Dhaka. However, the data is not publicly available since it was obtained under license for the current study. Upon reasonable request howoever, the data may be obtained from the corresponding author Kazuko Uno with permission from the respective institutions.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES