Medical Image Analysis. 2020 Oct 13;67:101844. doi: 10.1016/j.media.2020.101844

Integrative analysis for COVID-19 patient outcome prediction

Hanqing Chao a,1, Xi Fang a,1, Jiajin Zhang a,1, Fatemeh Homayounieh b, Chiara D Arru b, Subba R Digumarthy b, Rosa Babaei c, Hadi K Mobin c, Iman Mohseni c, Luca Saba d, Alessandro Carriero e, Zeno Falaschi e, Alessio Pasche e, Ge Wang a, Mannudeep K Kalra b, Pingkun Yan a
PMCID: PMC7553063  PMID: 33091743


Keywords: COVID-19, Chest CT, Outcome prediction, Artificial intelligence

Abstract

While image analysis of chest computed tomography (CT) for COVID-19 diagnosis has been intensively studied, little work has been performed on image-based patient outcome prediction. Management of high-risk patients with early intervention is key to lowering the fatality rate of COVID-19 pneumonia, as a majority of patients recover naturally. Therefore, an accurate prediction of disease progression with baseline imaging at the time of initial presentation can help in patient management. Rather than using only the size and volume information of pulmonary abnormalities derived through deep learning based image segmentation, here we combine radiomics of lung opacities with non-imaging features from demographic data, vital signs, and laboratory findings to predict the need for intensive care unit (ICU) admission. To our knowledge, this is the first study that uses holistic information of a patient, including both imaging and non-imaging data, for outcome prediction. The proposed methods were thoroughly evaluated on datasets separately collected from three hospitals, one in the United States, one in Iran, and another in Italy, with a total of 295 patients with reverse transcription polymerase chain reaction (RT-PCR) positive COVID-19 pneumonia. Our experimental results demonstrate that adding non-imaging features can significantly improve the prediction performance, achieving an AUC up to 0.884 and sensitivity as high as 96.1%, which can be valuable in providing clinical decision support for managing COVID-19 patients. Our methods may also be applied to other lung diseases, including but not limited to community acquired pneumonia. The source code of our work is available at https://github.com/DIAL-RPI/COVID19-ICUPrediction.

1. Introduction

Coronavirus disease 2019 (COVID-19), which results from contracting an extremely contagious beta-coronavirus, is responsible for the latest pandemic in human history. The resultant lung injury from COVID-19 pneumonia can progress rapidly to diffuse alveolar damage, acute lung failure, and even death (Vaduganathan et al., 2020; Danser et al., 2020). Given the highly contagious nature of the infection, the burden of COVID-19 pneumonia has imposed substantial constraints on the global healthcare systems. In this paper, we present a novel framework of integrative analysis of heterogeneous data including not only medical images, but also patient demographic information, vital signs and laboratory blood test results for assessing disease severity and predicting intensive care unit (ICU) admission of COVID-19 patients. Screening out the high-risk patients, who may need intensive care later, and monitoring them more closely to provide early intervention may help save their lives.

Reverse transcription polymerase chain reaction (RT-PCR) assay with detection of specific nucleic acid of SARS-CoV-2 in oral or nasopharyngeal swabs is the preferred test for diagnosis of COVID-19 infection. Although chest computed tomography (CT) can be negative in early disease, it can achieve higher than 90% sensitivity in detecting COVID-19 pneumonia, but with low specificity (Kim et al., 2020). For diagnosis of COVID-19 pneumonia, CT is commonly used in regions with high prevalence and limited RT-PCR availability, as well as in patients with suspected false negative RT-PCR. CT provides invaluable information in patients with moderate to severe disease for assessing the severity and complications of COVID-19 pneumonia (Yang et al., 2020). Prior clinical studies with chest CT have reported that qualitative scoring of lung lobar involvement by pulmonary opacities (high lobar involvement scores) can help assess severe and critical COVID-19 pneumonia. Li et al. (2020a) showed that high CT severity scores (suggestive of extensive lobar involvement) and consolidation are associated with severe COVID-19 pneumonia. Zhao et al. (2020) reported that the extent and type of pulmonary opacities can help establish the severity of COVID-19 pneumonia. The lung attenuation values change with the extent and type of pulmonary opacities, which differ in patients with more extensive, severe disease from those with milder disease. Most clinical studies focus on qualitative assessment and grading of pulmonary involvement in each lung lobe to establish disease severity, which is both time-consuming and associated with interobserver variations (Zhao et al., 2020; Ai et al., 2020). To address the urgent clinical needs, artificial intelligence (AI), especially deep learning, has been applied to COVID-19 CT image analysis (Shi et al., 2020). AI has been used to differentiate COVID-19 from community acquired pneumonia (CAP) on chest CT images (Li et al., 2020b; Sun et al., 2020). To unveil what deep learning uses to diagnose COVID-19 from CT, Wu et al. (2020) proposed an explainable diagnosis system by classifying and segmenting infections. Gozes et al. (2020b) developed a deep learning based pipeline to segment the lung, classify 2D slices, and localize COVID-19 manifestations from chest CT scans. Shan et al. (2020) went on to quantify lung infection of COVID-19 pneumonia from CT images using deep learning based image segmentation.

Among the emerging works, a few AI based methods target severity assessment from chest CT. Huang et al. (2020) developed a deep learning method to quantify severity from serial chest CT scans to monitor the disease progression of COVID-19. Tang et al. (2020) used random forest to classify pulmonary opacity volume based features into four severity groups. By automatically segmenting the lung lobes and infection areas, Gozes et al. (2020a) suggested a “Corona Score” to measure the progression of disease over time. Zhu et al. (2020) further proposed to use AI to predict whether a patient may develop severe symptoms of COVID-19 and, if so, how long that may take. Although promising results have been presented, the existing methods primarily focus on the volume of pulmonary opacities and their relative ratio to the lung volume for severity assessment. The type of pulmonary opacities (e.g. ground glass, consolidation, crazy-paving pattern, organizing pneumonia) is also an important indicator of the stage of the disease, yet it is often not quantified by AI algorithms (Chung et al., 2020).

Furthermore, in addition to measuring and monitoring the progression of severity, it could be life-saving to predict the mortality risk of patients by learning from the clinical outcomes. Since the majority of infected patients recover, managing the high-risk patients is the key to lowering the fatality rate (Ruan, 2020; Phua et al., 2020; Li et al., 2020c). A longitudinal study analyzing serial CT findings over time in patients with COVID-19 pneumonia shows that the temporal changes of the diverse CT manifestations follow a specific pattern correlating with the progression and recovery of the illness (Wang et al., 2020). Thus, it is promising for AI to perform this challenging task.

In this paper, our objective is to predict outcome of COVID-19 pneumonia patients in terms of the need for ICU admission with both imaging and non-imaging information. The work has two major contributions.

1. While image features have been commonly exploited by the medical image analysis community for COVID-19 diagnosis and severity assessment, non-imaging features are much less studied. However, non-imaging health data may also be strongly associated with patient severity. For example, Yan et al. (2020) showed that machine learning tools using three biomarkers, including lactic dehydrogenase (LDH), lymphocyte and high-sensitivity C-reactive protein (hs-CRP), can predict the mortality of individual patients. Thus, we propose to integrate heterogeneous data from different sources, including imaging data, age, sex, vital signs, and blood test results to predict patient outcome. To the best of our knowledge, this is the first study that uses holistic information of a patient including both imaging and non-imaging data for outcome prediction.

2. In addition to the simple volume measurement based image features, radiomics features are computed to describe the texture and shape of pulmonary opacities. A deep learning based pyramid-input pyramid-output image segmentation algorithm is used to quantify the extent and volume of lung manifestations. A feature dimension reduction algorithm is further proposed to select the most important features, which is then followed by a classifier for prediction.

It is worth noting that although the presented application focuses on COVID-19 pneumonia, the proposed method is a general approach and can be applied to other diseases.

The proposed method was evaluated on datasets collected from teaching hospitals across three countries. These datasets included 113 CT images from Firoozgar Hospital (Tehran, Iran) (Site A), 125 CT images from Massachusetts General Hospital (Boston, MA, USA) (Site B), and 57 CT images from University Hospital Maggiore della Carita (Novara, Piedmont, Italy) (Site C). Promising experimental results for outcome prediction were obtained on all the datasets with our proposed method, with reasonable generalization across the datasets. Details of our work are presented in the following sections.

2. Datasets

The data used in our work were acquired from three sites. All the CT imaging data were from patients who underwent clinically indicated, standard-of-care chest CT without intravenous contrast injection. Age and gender of all patients were recorded. For the datasets from Sites A and B, lymphocyte count and white blood cell count were also available. For the datasets from Sites A and C, peripheral capillary oxygen saturation (SpO2) and temperature on hospital admission were recorded. Information pertaining to patient status (discharged, deceased, or under treatment at the time of data analysis) was also recorded, as well as the number of days of hospitalization until the outcome.

Site A dataset We reviewed medical records of adult patients admitted with known or suspected COVID-19 pneumonia to Firoozgar Hospital (Tehran, Iran) between February 23, 2020 and March 30, 2020. Among the 117 patients with positive RT-PCR assay for COVID-19, three were excluded due to extensive motion artifacts on their chest CT. After further excluding one patient who was neither admitted to the ICU nor discharged, 113 patients were included in this study.

Site B dataset We reviewed medical records of adult patients admitted with COVID-19 symptoms to Massachusetts General Hospital (MGH) between March 11 and May 3, 2020. 125 admitted RT-PCR positive patients who underwent unenhanced chest CT were selected to form this dataset.

Site C dataset We reviewed medical records of adult patients admitted with COVID-19 pneumonia in the Novara Hospital (Piedmont, Italy) between March 4, 2020 and April 6, 2020. We collected clinical and outcome information of 57 patients with positive RT-PCR assay for COVID-19.

Two experienced thoracic subspecialty radiologists evaluated all chest CT examinations and recorded opacity type, distribution and extent of lobar involvement. Information on symptom duration prior to hospital admission, duration of hospital admission, presence of comorbid conditions, laboratory data, and outcomes (recovery or death) was obtained from the medical records. Entire lung volume was segmented on thin-section DICOM images (1.5–2 mm) to obtain whole-lung analysis. Statistics of the datasets are shown in Table 3, Table 4, Table 5 in Section 3.4.

Table 3.

Statistics (mean  ±  std, except for gender) of DVB features for Site A dataset.

Feature Not admitted ICU admitted Data #
Gender (M:F) 43: 28 29: 13 113
Age (year) 56.7 ± 16.0 66.9 ± 16.2 113
Lym_r (%) 22.7 ± 8.3 15.6 ± 12.8 113
WBC 5831.0 ± 1848.9 7966.7 ± 4556.2 113
Lym 1244.7 ± 482.8 1010.4 ± 943.7 113
Temperature (°C) 37.3 ± 0.6 37.6 ± 0.6 98
SpO2 (%) 91.9 ± 7.41 86.5 ± 8.53 100

Table 4.

Statistics (mean  ±  std, except for gender) of DVB features for Site B dataset.

Feature Not admitted ICU admitted Data #
Gender (M:F) 23: 24 39: 39 125
Age (year) 74.8 ± 15.0 72.7 ± 11.1 125
Lym_r (%) 18.6 ± 12.7 13.0 ± 12.8 125
WBC 7175.7 ± 4288.9 11722.3 ± 7249.3 125
Lym 1058.1 ± 596.7 1613.8 ± 3872.7 125

Table 5.

Statistics (mean  ±  std, except for gender) of DVB features for Site C dataset.

Feature Not admitted ICU admitted Data #
Gender (M:F) 13: 8 24: 12 57
Age (year) 70.0 ± 13.7 66.9 ± 12.3 57
Temperature (°C) 39.0 ± 1.0 37.8 ± 0.9 50
SpO2 (%) 92.3 ± 5.25 84.5 ± 7.74 31

3. ICU admission prediction

In order to predict the need for ICU admission of patients with COVID-19 pneumonia, we use three types of imaging and non-imaging features: hierarchical lobe-wise quantification features (HLQ), whole lung radiomics features (WLR), and features from demographic data, vital signs, and blood examination (DVB). Fig. 1 shows an overview of the framework of the presented work. In the rest of this section, we first introduce the details of these features. Since it is challenging to fuse the large number of inhomogeneous features together, a feature selection strategy is proposed, followed by random forest based classification (Breiman, 2001).

Fig. 1. Framework of the proposed methods, including the utilized inputs and expected output.

3.1. Deep learning based image segmentation

In our work, we employed deep neural networks to segment both lungs, the five lung lobes, and pulmonary opacities (as regions of infection) from non-contrast chest CT examinations. For training purposes, we semi-automatically labeled 71 CT volumes using 3D Slicer (Kikinis et al., 2014). For lung lobe segmentation, we adopted the automated lung segmentation method by Hofmanninger et al. (2020). The pre-trained model was fine-tuned with a learning rate of 1×10−5 using our annotated data. The tuned model was then applied to segment all the chest CT volumes.
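The fine-tuning step can be illustrated with a generic PyTorch loop. This is a sketch under assumptions: `model` stands for the pre-trained lobe segmentation network of Hofmanninger et al. (2020) loaded separately, `loader` yields (CT, lobe label) batches from the 71 annotated volumes, and the number of epochs is arbitrary here; it is not the authors' exact training script.

```python
import torch
import torch.nn as nn

def fine_tune(model, loader, epochs=10, device="cuda"):
    """Fine-tune a pre-trained segmentation network (sketch).
    `loader` yields (image, label) batches; labels are per-voxel
    lobe indices."""
    model = model.to(device).train()
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-5)  # lr from the paper
    criterion = nn.CrossEntropyLoss()  # per-voxel multi-class loss
    for _ in range(epochs):
        for image, label in loader:
            image, label = image.to(device), label.to(device)
            optimizer.zero_grad()
            loss = criterion(model(image), label)
            loss.backward()
            optimizer.step()
    return model
```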

Segmentation of pulmonary opacities was performed by our previously proposed method, the Pyramid Input Pyramid Output Feature Abstraction Network (PIPO-FAN) (Fang and Yan, 2020), with its publicly released source code.

Fig. 2 shows the segmentation results of lung lobes and pulmonary opacities. From the axial and 3D views, we can see that the segmentation models can smoothly and accurately delineate isolated regions with pulmonary opacities.

Fig. 2. Lung lobes and pulmonary opacities segmentation results. Areas colored in magenta indicate the segmented lesions. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

3.2. Hierarchical lobe-wise quantification features

Based on the segmentation results in Section 3.1, we then compute the ratio of opacity volume over different lung regions, which is a widely used measurement of severity (Tang et al., 2020; Zhu et al., 2020). The lung regions include the whole lung, the left lung, the right lung, and the 5 lung lobes (lobe# 1-5) as shown in Fig. 2. The right lung includes the upper lobe (lobe# 1), middle lobe (lobe# 2), and lower lobe (lobe# 3), while the left lung includes the upper lobe (lobe# 4) and lower lobe (lobe# 5). Thus, each CT image has 8 regions of interest (ROIs). In addition to the ROI segmentation, we partitioned each region into 4 parts based on HU ranges, i.e., −∞ to −750 (HU[−∞, −750]), −750 to −300 (HU[−750, −300]), −300 to 50 (HU[−300, 50]), and 50 to +∞ (HU[50, +∞]). These four HU ranges correspond to normal lung, ground glass opacity (GGO), consolidation, and regions with pulmonary calcification, respectively. As a result, each CT image was partitioned into 32 components (8 ROIs × 4 ranges/ROI).

We extracted two quantitative features from each part, i.e., the volume of pulmonary opacities (VPO) and the ratio of pulmonary opacities to the corresponding component (RPO), as defined below:

VPO(x) = V(Segment(x)),  (1)

RPO(x) = VPO(x) / V(x) = V(Segment(x)) / V(x),  (2)

where x is a selected component (among the 32 components). Segment(x) denotes the pulmonary opacities in the selected component x based on the segmentation of pulmonary opacities in Fig. 2. V( · ) denotes the volume of the selected part.
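Concretely, each of the 32 components is the conjunction of an ROI mask and an HU interval, and Eqs. (1)–(2) reduce to voxel counting. The following is a minimal sketch with NumPy, assuming boolean masks produced by the segmentation step (array names are illustrative); applying it to all 8 ROIs yields the 32 components described above.

```python
import numpy as np

# HU intervals for normal lung, GGO, consolidation, and calcification.
HU_RANGES = [(-np.inf, -750), (-750, -300), (-300, 50), (50, np.inf)]

def hlq_features(ct_hu, roi_mask, opacity_mask):
    """Compute VPO and RPO (Eqs. 1-2) for one ROI, split by HU range.

    ct_hu:        3D array of HU values
    roi_mask:     boolean mask of the ROI (e.g., one lobe)
    opacity_mask: boolean mask of segmented pulmonary opacities
    Returns a list of (VPO, RPO) pairs in voxel units; multiply VPO by
    the voxel volume to obtain physical volumes.
    """
    features = []
    for lo, hi in HU_RANGES:
        component = roi_mask & (ct_hu >= lo) & (ct_hu < hi)   # component x
        vpo = np.count_nonzero(component & opacity_mask)      # V(Segment(x))
        rpo = vpo / max(np.count_nonzero(component), 1)       # VPO(x) / V(x)
        features.append((vpo, rpo))
    return features
```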

3.3. Whole lung radiomics features

To more comprehensively describe the information in the CT images, we also extracted multi-dimensional radiomics features (Gillies et al., 2016) of all pulmonary opacities. Although both HLQ and WLR are image-based features, they describe the pulmonary opacities from different aspects: HLQ features focus on the volume and location of opacities within each region of interest, while WLR features focus on their shape and texture.

For each chest CT volume, we first masked out non-infection regions based on the infection segmentation results, and then calculated four kinds of radiomics features on the volume: shape, first-order, second-order, and higher-order statistics features (Rizzo et al., 2018). Shape features describe the geometric information. First-order, second-order, and higher-order statistics features all describe texture information. First-order statistics features describe the distribution of individual voxel values without concerning spatial correlation. Second-order features describe local structures, providing correlated information between adjacent voxels and statistical measurements of intra-lesion heterogeneity. Second-order features include those extracted using the gray level dependence matrix (GLDM), gray level co-occurrence matrix (GLCM), gray level run length matrix (GLRLM), gray level size zone matrix (GLSZM), and neighboring gray tone difference matrix (NGTDM). Higher-order statistics features are computed with the same methods as second-order features, but after applying wavelet and Laplacian of Gaussian (LoG) filters. The higher-order features help identify repetitive patterns in the local spatial-frequency domain, in addition to suppressing noise and highlighting image details.

We used the Pyradiomics package (Griethuysen et al., 2017) to extract the above described radiomics features from the COVID-19 chest CT images. For each chest CT volume, a total of 1691 features are extracted. The number of radiomics features of each feature type is summarized in Table 1. Based on the description above, these features can be categorized into two main groups, i.e., 17 shape features and 93 texture features.

Table 1.

Radiomics feature types and number of each kind of features.

Group Feature type # features Sum
Texture First order 18 93
GLCM 24
GLRLM 16
GLSZM 16
NGTDM 5
GLDM 14
Shape Shape (3D) 17 17

To extract various features, different image filters are applied before feature extraction (a configuration sketch is given after Table 2). Table 2 shows the details of all 18 image filter types used in our work, including no filter, square filter, square-root filter, logarithm filter, exponential filter, wavelet filters and LoG filters. An image filtered by a 3D wavelet filter has eight channels: HHH, HHL, HLH, LHH, HLL, LHL, LLH and LLL. The Laplacian of Gaussian (LoG) filters have a hyper-parameter σ, the standard deviation of the Gaussian distribution; we used five different σ values in our study, i.e., {0.5, 1.5, 2.5, 3.5, 4.5}. Note that shape features are only extracted from the original images (no filter applied).

Table 2.

Image filter types and extracted radiomics features from each type of filtered images.

Image filter type Extracted features # features
No filter (Original image) Texture + Shape 93+17=110
Square filter Texture 93
Square-root(Sqrt) filter Texture 93
Logarithm filter Texture 93
Exponential filter Texture 93
Wavelet filters (HHH, HHL, HLH, LHH, HLL, LHL, LLH, LLL) Texture 93×8=744
Laplacian of Gaussian (LoG) filters σ  ∈  {0.5, 1.5, 2.5, 3.5, 4.5} Texture 93×5=465
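For reference, a minimal Pyradiomics configuration matching the filters in Table 2 might look like the sketch below; the file names are placeholders, and all settings beyond the enabled image types and LoG sigmas are left at the package defaults.

```python
from radiomics import featureextractor

# Enable the image filter types listed in Table 2.
extractor = featureextractor.RadiomicsFeatureExtractor()
extractor.disableAllImageTypes()
extractor.enableImageTypes(
    Original={},      # texture + shape (shape only on the original image)
    Square={},
    SquareRoot={},
    Logarithm={},
    Exponential={},
    Wavelet={},       # 8 decomposition channels (HHH ... LLL)
    LoG={"sigma": [0.5, 1.5, 2.5, 3.5, 4.5]},
)

# Extract features from a CT volume restricted to the opacity mask.
features = extractor.execute("ct_volume.nii.gz", "opacity_mask.nii.gz")
```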

3.4. Non-imaging features

In addition to the features extracted from images, we incorporated features from demographic data (available in all three datasets), vital signs (from Sites A and C), and laboratory data (from Sites A and B), collectively denoted as DVB. Specifically, such features include patients' age, gender, white blood cell count (WBC), lymphocyte count (Lym), Lym to WBC ratio (L/W ratio), temperature and blood oxygen saturation (SpO2). These data are highly correlated with the ICU admission of patients at the time they were admitted to a hospital. Table 3, Table 4, Table 5 show the statistics of the above features in the Site A, Site B, and Site C datasets, respectively. Non-imaging features are not all available for some patients; the number of collected data points for each feature is listed in the last column of the tables. To make use of all the data, missing values are imputed with the mean values of the available entries. For instance, in the Site A dataset, if a patient's SpO2 was not recorded, the mean SpO2 value of 91.9 from the dataset is used to fill the blank.
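Such mean imputation is a one-step operation with pandas; the sketch below assumes the DVB features are collected in a table with one column per feature and NaN for missing entries (the file name and column layout are illustrative).

```python
import pandas as pd

# Non-imaging (DVB) features with NaN for missing entries;
# the file name is a placeholder.
dvb = pd.read_csv("site_a_dvb.csv")

# Replace each missing value with the mean of the available entries
# for that feature, e.g., SpO2 -> 91.9 on the Site A dataset.
dvb = dvb.fillna(dvb.mean(numeric_only=True))
```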

3.5. ICU admission prediction

In our work, the random forest (RF) classifier (Breiman, 2001), a widely-used ensemble learning method consisting of multiple decision trees, is chosen for predicting ICU admission due to several desirable properties. First, RF is robust to small data sizes. Second, it can generate a feature importance ranking and is thus highly interpretable. Aggregating all the features introduced above, we have 1762 features in total. Due to the limited data size, the model would easily overfit with all features as input. Thus, we first used RF to rank all the features, and then selected the top K features for our final prediction.

We ranked the features based on their Gini importance (Leo et al., 1984), which is calculated during the training of RF by averaging the decrease of Gini impurity over all trees. Due to the randomness of RF, the Gini importance of features may vary when the RF model is initialized with different random seeds. Therefore, in our study, feature ranks are computed 100 times with different random seeds. In each run, every feature receives a score equal to its rank. The final feature ranking is obtained by sorting the summed scores of all features.

Based on the ranking of all the features, we select the top K ∈ [1, 100] features to train the RF model and calculate the prediction performance in terms of AUC.
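A reimplementation sketch of this ranking and selection procedure with scikit-learn is given below; `X` and `y` denote the feature matrix and ICU admission labels, and the RF hyper-parameters are assumptions rather than the exact experimental configuration.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

def rank_features(X, y, n_runs=100):
    """Rank features by their summed Gini-importance rank over
    repeated RF trainings with different random seeds."""
    total_rank = np.zeros(X.shape[1])
    for seed in range(n_runs):
        rf = RandomForestClassifier(random_state=seed).fit(X, y)
        order = np.argsort(-rf.feature_importances_)  # best feature first
        ranks = np.empty_like(order)
        ranks[order] = np.arange(X.shape[1])          # rank of each feature
        total_rank += ranks
    return np.argsort(total_rank)                     # lowest total rank first

def auc_top_k(X, y, ranking, k):
    """Five-fold cross-validated AUC of an RF trained on the top-k features."""
    rf = RandomForestClassifier(random_state=0)
    return cross_val_score(rf, X[:, ranking[:k]], y,
                           cv=5, scoring="roc_auc").mean()

# Sweep K in [1, 100] and keep the best-performing feature count:
# ranking = rank_features(X, y)
# best_k = max(range(1, 101), key=lambda k: auc_top_k(X, y, ranking, k))
```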

4. Experimental results

This section presents the experimental results of the developed methods. We show the effectiveness of our proposed method on the three datasets separately through both ablation studies and comparison with other state-of-the-art approaches. We did not merge the datasets for two reasons. First, not all the non-imaging features were available from all participating sites. Second, the treatment and admission criteria at the participating sites were likely different from each other. Given such limitations, the datasets were used separately to evaluate the proposed methods.

The experiments are organized into two parts. In the first part, the proposed method with different combinations of features is compared with other state-of-the-art approaches on each dataset. In this part of the experiments, we also included results of support vector machine (SVM), logistic regression and three other deep learning networks. The first deep neural network (DNN) takes all the WLR, HLQ and DVB features as its input and consists of three fully connected layers with output dimensions of 64, 16 and 2, respectively (denoted as DNN w/ all features). Dropout was applied to the first two layers with a 50% dropping rate. The second deep network takes the features selected by random forest as its input. As the selected features are only a small subset of all the features, this network is smaller than the first one. It contains three fully connected layers with output dimensions of 8, 8 and 2, respectively (denoted as DNN (small)). The third network is designed based on the Wide & Deep Net (WD Net) (Cheng et al., 2016) to investigate whether a more complex network could perform better directly using all features as input (denoted as WD Net w/ all features). The detailed structure is shown in Fig. 3. All three networks are trained with the cross entropy loss. For a sample with label y ∈ {0, 1}, the cross entropy loss is formulated as ℓ = −log P_y, where P_y is the predicted probability of class y. In the five-fold cross validation scheme, each time 3 folds are used to train the network, one fold is used as the validation set, and the last one is reserved as the test set. The networks were implemented in PyTorch (Paszke et al., 2019) and trained using the Adam optimizer with a learning rate of 1×10−4. Influenced by the size of the dataset and the network, the WD Net on the Site B dataset took the longest time to train, 3.0 min with 4 NVIDIA Tesla V100 GPUs. In the second part of the experiments, the generalization ability of the feature combination learned by our model is studied across the three datasets. The code is open sourced and available at https://github.com/DIAL-RPI/COVID19-ICUPrediction.

Fig. 3. Architecture of the Wide & Deep Net (Cheng et al., 2016) based deep neural network. Three different kinds of features are first processed separately by one or two fully connected layers. Then the learned features are concatenated for the final prediction.
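As a concrete illustration of the baseline networks described above, the following is a minimal PyTorch sketch of the DNN w/ all features variant. The layer sizes (64, 16, 2), 50% dropout, cross entropy loss and Adam learning rate of 1×10−4 follow the text, while the ReLU activations and other details are assumptions of this sketch.

```python
import torch
import torch.nn as nn

class BaselineDNN(nn.Module):
    """Three fully connected layers (64, 16, 2) with 50% dropout on the
    first two, matching the 'DNN w/ all features' baseline description."""
    def __init__(self, in_dim=1762):  # 1762 = all WLR + HLQ + DVB features
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 64), nn.ReLU(), nn.Dropout(0.5),
            nn.Linear(64, 16), nn.ReLU(), nn.Dropout(0.5),
            nn.Linear(16, 2),  # two-class output: ICU admission or not
        )

    def forward(self, x):
        return self.net(x)

model = BaselineDNN()
criterion = nn.CrossEntropyLoss()  # implements the loss l = -log P_y
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
```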

Several recent works have shown the importance of using machine learning models to predict patients' outcomes based on lobe-wise quantification features. The infection volume and infection ratio of the whole lung, right/left lung, and each lobe/segment are calculated as quantitative features in Tang et al. (2020), where a random forest classifier is used to select the top-ranking features and make the severity assessment based on them. In another work, Zhu et al. (2020) present a joint regression and classification method to identify severe cases and predict the conversion time from a non-severe case to a severe one. Their lobe-wise quantification features include the infection volume, density feature and mass feature. As mentioned earlier, all existing image analysis-based outcome prediction works use only image features. We take the features in these two papers as baselines to compare with our work.

4.1. Results on Site A dataset

Receiver Operating Characteristic (ROC) curves of the feature combinations are shown in Fig. 4. For each feature combination, the features are selected only from the feature categories available in the combination, using the approach introduced in Section 3.5. For example, HLQ + DVB indicates that only features from these two groups, HLQ and DVB, are selected and used. The numbers of features K used to obtain the best results are listed in Table 6. To alleviate the stochasticity of the results, for each feature combination, five RF models with different random seeds are trained and tested with five-fold cross validation. The curves shown here are thus the mean results of the five models. The figure legend gives the mean Area Under the Curve (AUC) of each feature combination as well as the standard deviation (mean ± std). It can be seen that the combination of all three kinds of features, WLR + HLQ + DVB, obtained the best result with an AUC of 0.88 ± 0.01. The variation of AUC with the number of selected features on the Site A dataset is presented in Fig. 5. As marked by the light blue dashed line in Fig. 5, the AUC reaches its maximum value when the top 52 features are selected. Details of the 52 selected features are presented in Table 15 at the end of this paper due to its large size.

Fig. 4. ROC curves of various feature combinations on Site A dataset. DVB: non-imaging features including Demographic data, Vital signs and Blood test results; HLQ: Hierarchical Lobe-wise Quantification features; WLR: Whole Lung Radiomics features.

Table 6.

Comparison among the features used in existing state-of-the-art works and different combinations of the proposed features for ICU admission prediction on Site A dataset. One-tailed t-test is used to evaluate the statistical significance between a feature combination and the best performer.

Features AUC (Mean, 95% CI, p value) Sensitivity at PPV = 70% (Mean, 95% CI, p value) K
Img feature (Tang 2020) 0.818 (0.796, 0.839) p<0.001 51.0% (39.3%, 62.6%) p<0.001 8
Img feature (Zhu 2020) 0.776 (0.762, 0.790) p<0.001 48.6% (35.4%, 61.7%) p=0.001 46
DVB 0.855 (0.844, 0.866) p=0.002 76.7% (73.2%, 80.1%) p=0.017 1
HLQ 0.789 (0.781, 0.797) p<0.001 51.4% (45.3%, 57.5%) p<0.001 21
WLR 0.859 (0.843, 0.873) p<0.001 71.4% (60.5%, 82.3%) p=0.022 70
WLR + HLQ 0.866 (0.857, 0.875) p<0.001 68.6% (57.6%, 79.5%) p=0.008 61
WLR + DVB 0.876 (0.867, 0.886) p = 0.109 81.4% (76.0%, 86.8%) p = 0.152 4
HLQ + DVB 0.865 (0.844, 0.885) p = 0.080 70.0% (60.9%, 79.1%) p=0.012 4
WLR + HLQ + DVB 0.884 (0.875, 0.893) 84.3% (79.9%, 88.7%) 52

Fig. 5. Variation of AUC when choosing the top K features.

Table 15.

The top 52 features ranked by the feature ranking strategy introduced in Section 3.5 on Site A dataset. The third and sixth columns show the Gini importance of the corresponding feature averaged over the 5-fold cross validation.

[Table 15 is provided as an image (fx4_lrg.gif) in the original article.]

Red text indicates non-imaging features. Green text indicates lobe-wise quantification features, HU1-HU4 are the four HU intervals. Blue text indicates whole lung radiomics features encoded as Filter-FeatureType-Parameter.

One-tailed t-test is used to evaluate the statistical significance between a method and the best performer. Table 6 summarizes the AUC values and sensitivity, with significance test p values and 95% confidence intervals (95% CI). The classification threshold is selected by controlling the positive predictive value (PPV) at 70%. The combination of WLR + HLQ + DVB significantly exceeds the other reported methods (Tang et al., 2020; Zhu et al., 2020) with p ≤ 0.001. Further, we achieved a sensitivity of 84.3% while retaining a PPV of 70%. This suggests that our model can rapidly prioritize over 80% of the patients who would develop into critical conditions, if we allow 3 false positive cases in every 10 positive predictions. With such prediction, hospitals may allocate limited medical resources more efficiently to potentially prevent such conversion and save more lives. Under the same setting, the specificity of the model is 79.4% and the accuracy is 81.2%.
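Operationally, fixing the PPV means scanning candidate thresholds on the predicted probabilities and keeping those whose precision reaches the target. A minimal sketch, assuming NumPy arrays of binary labels and prediction scores:

```python
import numpy as np

def sensitivity_at_ppv(y_true, scores, target_ppv=0.70):
    """Among thresholds whose positive predictive value (precision)
    reaches `target_ppv`, return the highest achievable sensitivity."""
    y_true = np.asarray(y_true)
    scores = np.asarray(scores)
    best_sensitivity = 0.0
    for threshold in np.unique(scores):
        predicted = scores >= threshold
        tp = np.sum(predicted & (y_true == 1))  # true positives
        fp = np.sum(predicted & (y_true == 0))  # false positives
        if tp + fp == 0:
            continue
        if tp / (tp + fp) >= target_ppv:        # PPV constraint satisfied
            best_sensitivity = max(best_sensitivity,
                                   tp / np.sum(y_true == 1))
    return best_sensitivity
```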

In the comparison among different combinations of the features, we can see that the results generally improve as more feature sources are added. Comparison between WLR + HLQ (line 6) and WLR + HLQ + DVB (the last line) shows that, on this dataset, introducing non-imaging features can significantly improve the performance (p < 0.001 for AUC and p=0.008<0.05 for sensitivity), which further indicates that non-imaging features and image based features are complementary. On the other hand, the comparison with WLR + DVB (line 7) and HLQ + DVB (line 8) shows that the improvement of WLR + HLQ + DVB was not significant (p > 0.05). It suggests that different kinds of image based features may contain redundant information, and adding more features from the same source only results in marginal improvement. Table 7 shows the results of different methods on the Site A dataset. Unless noted as w/ all features, the methods in Table 7 use the 52 selected WLR + HLQ + DVB features listed in Table 15. We can see that random forest performed significantly better (p < 0.05) than all the other methods on both AUC and sensitivity. The sensitivity of DNN (small) is not included because it could not obtain a PPV equal to or larger than 70% in some of the cross validation folds.

Table 7.

Comparison of different machine learning methods with selected features on Site A dataset. One-tailed t-test is used to evaluate the statistical significance between the results of random forest and other methods.

Methods AUC (Mean, 95% CI, p value) Sensitivity at PPV = 70% (Mean, 95% CI, p value)
Random Forests 0.884 (0.875, 0.893) 84.3% (79.9%, 88.7%)
SVM 0.867 (0.855, 0.880) p=0.002 71.0% (64.9%, 77.0%) p<0.001
Logistic Regression 0.785 (0.758, 0.812) p<0.001 31.0% (14.8%, 47.1%) p<0.001
DNN (small) 0.816 (0.804, 0.828) p<0.001 p=0.023
DNN w/ all features 0.751 (0.723, 0.779) p<0.001 25.7% (0.8%, 50.6%) p=0.003
WD Net w/ all features 0.823 (0.807, 0.838) p<0.001 58.1% (40.7%, 75.5%) p=0.009

4.2. Results on Site B dataset

The same set of experiments was repeated on the Site B dataset. Table 8 and Fig. 6 show the results. The number of features K used to obtain the best results for each combination is listed in Table 8. It can be seen that, on the Site B dataset, non-imaging features are not very predictive. There could be several reasons for the poor performance of the DVB features on the Site B dataset. First, as shown in Table 4, the non-imaging features on the Site B dataset have large standard deviations. Second, the use of CT at Site B differs from that at Site A: Site B relied on chest radiography for most patients, reserving CT for sicker patients or those with suspected complications, whereas Site A used CT in all patients regardless of clinical severity. Third, the criteria for ICU admission differ between the two sites. Fourth, the management strategies and disease outcomes at the two sites are different.

Table 8.

Comparison among the features used in existing state-of-the-art works and different combinations of the proposed features for ICU admission prediction on Site B dataset.

Features AUC (Mean, 95% CI, p value) Sensitivity at PPV = 70% (Mean, 95% CI, p value) K
Img feature (Tang 2020) 0.770 (0.745, 0.796) p<0.001 83.1% (75.8%, 90.4%) p=0.009 10
Img feature (Zhu 2020) 0.767 (0.752, 0.781) p<0.001 83.8% (82.2%, 85.5%) p<0.001 39
DVB 0.671 (0.643, 0.700) p<0.001 78.7% (69.7%, 87.7%) p=0.007 4
HLQ 0.791 (0.774, 0.809) p<0.001 84.6% (81.3%, 88.0%) p<0.001 3
WLR 0.841 (0.827, 0.855) p=0.014 94.9% (93.4%, 96.3%) 55
WLR + HLQ 0.847 (0.833, 0.861) 92.6% (89.5%, 95.7%) p = 0.083 12
WLR+DVB 0.841 (0.828, 0.854) p = 0.257 91.8% (90.5%, 93.1%) p=0.012 33
HLQ + DVB 0.796 (0.777, 0.815) p=0.001 84.4% (80.7%, 88.0%) p<0.001 4
WLR + HLQ + DVB 0.844 (0.833, 0.855) p = 0.310 92.6% (90.0%, 95.1%) p=0.027 12

Fig. 6. ROC curves on Site B dataset.

In this experiment, the best AUC value, 0.847, is achieved by merging the two image-based features, i.e., WLR + HLQ (line 6). With 70% PPV, WLR + HLQ obtained a sensitivity of 92.6%, a specificity of 37.0% and an accuracy of 71.7%. The best sensitivity (with PPV = 70%), 94.9%, is obtained by the WLR features. Although the sensitivity of WLR is higher than that of WLR + HLQ, the difference is not significant (p=0.083>0.05). Table 9 lists the 12 WLR + HLQ features used to obtain the best results. Table 10 shows the results of different methods on the Site B dataset. The three traditional machine learning methods and the DNN (small) model used the 12 selected WLR + HLQ + DVB features (listed in Table 11) for prediction. SVM achieved the best AUC and sensitivity. Random forest obtained a competitive AUC value with no significant difference (p > 0.05) but inferior sensitivity (p < 0.05). The DNN (small) model here also achieved an AUC value comparable to the best result but a much lower sensitivity.

Table 9.

The top 12 WLR + HLQ features ranked by the feature ranking strategy introduced in Section 3.5 on Site B dataset. The third and sixth columns show the Gini importance of the corresponding feature averaged over the 5-fold cross validation.

[Table 9 is provided as an image (fx2_lrg.gif) in the original article.]

Green text indicates lobe-wise quantification features, HU1-HU4 are the four HU intervals. Blue text indicates whole lung radiomics features encoded as Filter-FeatureType-Parameter.

Table 10.

Comparison of different machine learning methods with selected features on Site B dataset.

Methods AUC (Mean, 95% CI, p value) Sensitivity at PPV = 70% (Mean, 95% CI, p value)
Random Forests 0.844 (0.833, 0.855) 92.6% (90.0%, 95.1%)
SVM 0.852 (0.838, 0.866) p = 0.148 95.4% (93.2%, 97.5%) p=0.020
Logistic Regression 0.798 (0.783, 0.812) p=0.003 90.2% (88.6%, 91.9%) p = 0.110
DNN (small) 0.831 (0.805, 0.858) p = 0.187 86.9% (83.5%, 90.3%) p=0.003
DNN w/ all features 0.704 (0.670, 0.738) p<0.001 75.38% (71.6%, 79.2%) p=0.001
WD Net w/ all features 0.769 (0.753, 0.786) p<0.001 85.6% (82.6%, 88.7%) p=0.004

Table 11.

The 12 best WLR + HLQ + DVB features used for the experiments in Table 10, ranked by the feature ranking strategy introduced in Section 3.5. The third and sixth columns show the Gini importance of the corresponding feature averaged over the 5-fold cross validation.

[Table 11 is provided as an image (fx3_lrg.gif) in the original article.]

Red text indicates non-imaging features. Green text indicates lobe-wise quantification features, HU1-HU4 are the four HU intervals. Blue text indicates whole lung radiomics features encoded as Filter-FeatureType-Parameter.

4.3. Results on Site C dataset

Results on the Site C dataset are shown in Table 12 and Fig. 7. The non-imaging (DVB) features alone also did not perform well. This may be because many patients' DVB features are missing or incomplete, as shown in Table 5. Yet, introducing DVB features significantly improves the AUC performance of HLQ features (p=0.014<0.05). In this experiment, the best AUC value, 0.840, is achieved by merging all three kinds of features. While maintaining a PPV of 70%, this combination achieved a sensitivity of 94.4%, a specificity of 33.3% and an accuracy of 71.9%. Table 16, placed at the end of this paper due to its large size, shows the 35 features used to obtain the best results. A comparison of different methods on the Site C dataset is shown in Table 13. Random forest obtained the best AUC and sensitivity. The performance of all three deep learning networks is less than adequate. One of the most important reasons may be that the Site C dataset is considerably smaller than the other two (only 57 cases). Compared with traditional machine learning methods, the number of parameters in deep learning models is several orders of magnitude larger, which makes them much more vulnerable to overfitting on a limited training set.

Table 12.

Comparison among the features used in existing state-of-the-art works and different combinations of the proposed features for ICU admission prediction on Site C dataset.

Features AUC (Mean, 95% CI, p value) Sensitivity at PPV = 70% (Mean, 95% CI, p value) K
Img feature (Tang 2020) 0.763 (0.670, 0.856) p=0.044 85.0% (76.4%, 93.6%) p=0.020 10
Img feature (Zhu 2020) 0.675 (0.645, 0.706) p<0.001 73.9% (59.0%, 88.8%) p<0.011 39
DVB 0.595 (0.524, 0.665) p<0.001 63.3% (36.1%, 90.5%) p=0.019 4
HLQ 0.691 (0.660, 0.722) p<0.001 86.7% (80.3%, 93.0%) p<0.014 7
WLR 0.815 (0.782, 0.848) p=0.020 95.6% (92.8%, 98.3%) p = 0.186 12
WLR + HLQ 0.826 (0.813, 0.839) p = 0.191 96.1% (94.4%, 97.8%) 20
WLR+DVB 0.835 (0.809, 0.861) p = 0.365 95.0% (91.6%, 98.4%) p = 0.088 15
HLQ + DVB 0.760 (0.705, 0.815) p=0.016 85.6% (77.6%, 93.5%) p<0.010 2
WLR + HLQ + DVB 0.840 (0.804, 0.876) 94.4% (92.3%, 96.6%) p=0.035 35

Fig. 7. ROC curves on Site C dataset.

Table 16.

The top 35 features ranked by the feature ranking strategy introduced in Section 3.5 on Site C dataset. The third and sixth columns show the Gini importance of the corresponding feature averaged over the 5-fold cross validation.

[Table 16 is provided as an image (fx5_lrg.gif) in the original article.]

Red text indicates non-imaging features. Green text indicates lobe-wise quantification features, HU1-HU4 are the four HU intervals. Blue text indicates whole lung radiomics features encoded as Filter-FeatureType-Parameter.

Table 13.

Comparison of different machine learning methods with selected features on Site C dataset.

Methods AUC (Mean, 95% CI, p value) Sensitivity at PPV = 70% (Mean, 95% CI, p value)
Random Forests 0.840 (0.804, 0.876) 94.4% (92.3%, 96.6%)
SVM 0.811 (0.782, 0.839) p=0.031 93.3% (89.2%, 97.5%) p = 0.324
Logistic Regression 0.717 (0.666, 0.768) p=0.009 86.7% (79.3%, 94.0%) p=0.019
DNN (small) 0.695 (0.619, 0.771) p=0.011 77.8% (53.6%, 100.0%) p = 0.082
DNN w/ all features 0.568 (0.550, 0.587) p<0.001 52.8% (29.2%, 76.4%) p=0.006
WD Net w/ all features 0.528 (0.479, 0.578) p<0.001 34.4% (24.1%, 44.8%) p<0.001

4.4. Generalization ability

In this section, we further evaluate whether feature combinations learned from one site can be generalized to other sites. Experiments on all 6 permutations are conducted (train on Site A, test on Site B; train on Site A, test on Site C; train on Site B, test on Site A; train on Site B, test on Site C; train on Site C, test on Site A; train on Site C, test on Site B). Considering that the 3 datasets contain different DVB features, the WLR + HLQ features are used in this section. The results are shown in Table 14; a sketch of the evaluation protocol follows the table.

Table 14.

Transferring WLR + HLQ features across the three datasets. Each cell shows the AUC, with the number of selected features K in parentheses.

Methods A  →  B A  →  C B  →  A B  →  C C  →  A C  →  B Mean
Random Forests 0.740 (36) 0.685 (29) 0.754 (12) 0.633 (11) 0.591 (17) 0.717 (20) 0.687
SVM 0.774 (1) 0.686 (23) 0.777 (3) 0.710 (1) 0.649 (17) 0.694 (3) 0.715
Logistic Regression 0.744 (11) 0.706 (23) 0.756 (1) 0.698 (1) 0.642 (19) 0.752 (3) 0.716
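Each cell of Table 14 corresponds to fitting a model on the source site and scoring it on the target site with the shared feature subset. A minimal sketch of this protocol with scikit-learn, where the feature matrices and the selected-feature indices are assumed to be precomputed:

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score

def cross_site_auc(X_train, y_train, X_test, y_test, selected):
    """Train on one site and evaluate on another using the shared
    WLR + HLQ feature subset (column indices in `selected`)."""
    rf = RandomForestClassifier(random_state=0)
    rf.fit(X_train[:, selected], y_train)
    scores = rf.predict_proba(X_test[:, selected])[:, 1]
    return roc_auc_score(y_test, scores)

# e.g., train on Site A, test on Site B (arrays are placeholders):
# auc_ab = cross_site_auc(X_a, y_a, X_b, y_b, selected_features)
```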

There were tremendous differences in the geographic distribution and scanner technologies used for imaging patients at the three participating sites. Despite this, we achieved AUC values as high as 0.777 for the WLR + HLQ features. Some variation in the AUCs and performance of our model across different sites is expected due to challenges associated with acquiring consistent data variables and practices. The results in Table 14 show that SVM and logistic regression achieved very similar performance on average, with no significant difference (p=0.462). Although random forest did not perform well in this transfer experiment, the difference between random forest and SVM was not significant (p=0.058>0.05) either. Considering that random forest outperformed logistic regression on all three datasets, as presented in Sections 4.1–4.3, it is still the overall best performing method in our study.

Our study stresses the need to combine imaging findings with clinical, laboratory and management variables, which can improve model performance and yield better performance statistics on each dataset. On the other hand, the complexities of the disease and its outcomes are tied to local factors, stressing the importance of tuning the best models based on rich local or institutional level factors rather than relying on a single “one-size-fits-all” model.

5. Discussion and conclusions

In this paper, we propose to combine the size and volume information of the lungs and manifestations, radiomics features of pulmonary opacities, and non-imaging DVB features to predict the need for ICU admission in patients with COVID-19 pneumonia. Metrics related to ICU admission rates, need and availability are key markers in the management of individual patients as well as in resource planning for managing high prevalence diseases. To the best of our knowledge, this is the first study that uses holistic information of a patient, including both imaging and non-imaging data, to predict patient outcome.

Although promising results were achieved, the study has a few limitations. First of all, due to the limited size of our datasets, we could not conduct more fine-grained outcome predictions. The size of the available datasets could also be the reason that more complex models, such as deep neural networks, did not perform well in our experiments. Once larger datasets are available, our model can be rapidly adapted to assess generalization ability and to establish implications on datasets from other sites. Efforts are underway (such as within the Radiological Society of North America) to establish such imaging datasets of COVID-19 pneumonia. Second, the variations in performance of different imaging and clinical features on the datasets from the three sites underscore the need for careful local vetting of deep learning predictive models. Future models should take into account regional bias introduced by different criteria on imaging use, underlying patient comorbidities, and management strategies, so that more robust models can be built. This also goes beyond the generalization ability of machine learning in medical applications. The best and most relevant results likely require regional, local or even site-specific tuning of predictive models. This is especially true for the three sites in our study, which operate under very different healthcare systems. We believe that this limitation is not unique to our model. Last but not least, another limitation of our study pertains to the lack of access to the specific treatment regimens at the three sites; their inclusion could have further enhanced the accuracy of our algorithm. On the other hand, this also suggests that our generic approach can be trained on data from a given hospital to create a customized predictive model for clinical decision support.

In summary, our integrative analysis machine learning based predictive model can help assess disease burden and forecast meaningful patient outcomes with high predictive accuracy in patients with COVID-19 pneumonia. Many patients with adverse outcomes from COVID-19 pneumonia and cardiorespiratory failure develop diffuse alveolar damage and adult respiratory distress syndrome (ARDS), which are also well-known end stage manifestations of other pulmonary diseases, such as other infections and lung injuries. Although we did not test our model in patients with ARDS from non-COVID causes, given the overlap in imaging and clinical features of respiratory failure, we expect that the methods of quantifying pulmonary opacities used in our approach will extend beyond COVID-19 pneumonia. In addition, introducing data of diseases with properties similar to COVID-19 may further improve the robustness and performance of our approach, which will be explored in our future work. Further studies will help assess such applications beyond the current pandemic of COVID-19 pneumonia.

CRediT authorship contribution statement

Hanqing Chao: Methodology, Data curation, Software, Visualization, Writing - original draft. Xi Fang: Methodology, Data curation, Software, Visualization, Writing - original draft. Jiajin Zhang: Methodology, Data curation, Software, Visualization, Writing - original draft. Fatemeh Homayounieh: Data curation, Formal analysis, Writing - review & editing. Chiara D. Arru: Data curation, Formal analysis, Writing - review & editing. Subba R. Digumarthy: Data curation, Formal analysis, Writing - review & editing. Rosa Babaei: Data curation. Hadi K. Mobin: Data curation. Iman Mohseni: Data curation. Luca Saba: Data curation. Alessandro Carriero: Data curation. Zeno Falaschi: Data curation. Alessio Pasche: Data curation. Ge Wang: Resources, Writing - review & editing. Mannudeep K. Kalra: Conceptualization, Investigation, Resources, Data curation, Writing - review & editing. Pingkun Yan: Conceptualization, Investigation, Resources, Writing - review & editing, Supervision, Project administration.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was partially supported by National Institute of Biomedical Imaging and Bioengineering (NIBIB) under award R21EB028001 and National Heart, Lung, and Blood Institute (NHLBI) under award R56HL145172.

References

  1. Ai T., Yang Z., Hou H., Zhan C., Chen C., Lv W., Tao Q., Sun Z., Xia L. Correlation of chest CT and RT-PCR testing in coronavirus disease 2019 (COVID-19) in China: a report of 1014 cases. Radiology. 2020:200642.
  2. Breiman L. Random forests. Mach. Learn. 2001;45(1):5–32.
  3. Cheng H.-T., Koc L., Harmsen J., Shaked T., Chandra T., Aradhye H., Anderson G., Corrado G., Chai W., Ispir M., et al. Wide & deep learning for recommender systems. Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. 2016:7–10.
  4. Chung M., Bernheim A., Mei X., Zhang N., Huang M., Zeng X., Cui J., Xu W., Yang Y., Fayad Z.A. CT imaging features of 2019 novel coronavirus (2019-nCoV). Radiology. 2020;295:202–207.
  5. Danser A.H.J., Epstein M., Batlle D. Renin-angiotensin system blockers and the COVID-19 pandemic: at present there is no evidence to abandon renin-angiotensin system blockers. Hypertension. 2020.
  6. Fang X., Yan P. Multi-organ segmentation over partially labeled datasets with multi-scale feature abstraction. 2020. arXiv preprint arXiv:2001.00208.
  7. Gillies R.J., Kinahan P.E., Hricak H. Radiomics: images are more than pictures, they are data. Radiology. 2016;278:563–577. doi: 10.1148/radiol.2015151169.
  8. Gozes O., Frid-Adar M., Greenspan H., Browning P.D., Zhang H., Ji W., Bernheim A., Siegel E. Rapid AI development cycle for the coronavirus (COVID-19) pandemic: initial results for automated detection & patient monitoring using deep learning CT image analysis. 2020a. arXiv preprint arXiv:2003.05037.
  9. Gozes O., Frid-Adar M., Sagie N., Zhang H., Ji W., Greenspan H. Coronavirus detection and analysis on chest CT with deep learning. 2020b. arXiv preprint arXiv:2004.02640.
  10. Griethuysen J.J.M., Fedorov A., Parmar C., Hosny A., Aucoin N., Narayan V., Beets-Tan R.G.H., Fillion-Robin J.-C., Pieper S., Aerts H.J.W.L. Computational radiomics system to decode the radiographic phenotype. Cancer Research. 2017;77(21):104–107. doi: 10.1158/0008-5472.CAN-17-0339.
  11. Hofmanninger J., Prayer F., Pan J., Rohrich S., Prosch H., Langs G. Automatic lung segmentation in routine imaging is a data diversity problem, not a methodology problem. 2020. arXiv preprint arXiv:2001.11767.
  12. Huang L., Han R., Ai T., Yu P., Kang H., Tao Q., Xia L. Serial quantitative chest CT assessment of COVID-19: a deep-learning approach. Radiology: Cardiothoracic Imaging. 2020;2(2):e200075. doi: 10.1148/ryct.2020200075.
  13. Kikinis R., Pieper S.D., Vosburgh K.G. 3D Slicer: a platform for subject-specific image analysis, visualization, and clinical support. Intraoperative Imaging and Image-Guided Therapy. Springer; New York, NY: 2014:277–289.
  14. Kim H., Hong H., Yoon S.H. Diagnostic performance of CT and reverse transcriptase-polymerase chain reaction for coronavirus disease 2019: a meta-analysis. Radiology. 2020:201343.
  15. Leo B., Friedman J.H., Olshen R.A., Stone C.J. Classification and regression trees. Wadsworth Int. Group. 1984;8:452–456.
  16. Li K., Wu J., Wu F., Guo D., Chen L., Fang Z., Li C. The clinical and chest CT features associated with severe and critical COVID-19 pneumonia. Investigative Radiology. 2020.
  17. Li L., Qin L., Xu Z., Yin Y., Wang X., Kong B., Bai J., Lu Y., Fang Z., Song Q., Cao K., Liu D., Wang G., Xu Q., Fang X., Zhang S., Xia J., Xia J. Artificial intelligence distinguishes COVID-19 from community acquired pneumonia on chest CT. Radiology. 2020:200905.
  18. Li T., Lu H., Zhang W. Clinical observation and management of COVID-19 patients. Emerging Microbes and Infections. 2020;9(1):687–690. doi: 10.1080/22221751.2020.1741327.
  19. Paszke A., Gross S., Massa F., Lerer A., Bradbury J., Chanan G., Killeen T., Lin Z., Gimelshein N., Antiga L., Desmaison A., Kopf A., Yang E., DeVito Z., Raison M., Tejani A., Chilamkurthy S., Steiner B., Fang L., Bai J., Chintala S. PyTorch: an imperative style, high-performance deep learning library. Advances in Neural Information Processing Systems 32. Curran Associates, Inc.; 2019:8024–8035.
  20. Phua J., Weng L., Ling L., Egi M., Lim C.-M., Divatia J.V., Shrestha B.R., Arabi Y.M., Ng J., Gomersall C.D., Nishimura M., Koh Y., Du B. Intensive care management of coronavirus disease 2019 (COVID-19): challenges and recommendations. Lancet Respir. Med. 2020;8(5):506–517. doi: 10.1016/S2213-2600(20)30161-2.
  21. Rizzo S., Botta F., Raimondi S., Origgi D., Fanciullo C., Morganti A.G., Bellomi M. Radiomics: the facts and the challenges of image analysis. European Radiology Experimental. 2018;2:36. doi: 10.1186/s41747-018-0068-z.
  22. Ruan S. Likelihood of survival of coronavirus disease 2019. The Lancet Infectious Diseases. 2020;20(6):630–631. doi: 10.1016/S1473-3099(20)30257-7.
  23. Shan F., Gao Y., Wang J., Shi W., Shi N., Han M., Xue Z., Shen D., Shi Y. Lung infection quantification of COVID-19 in CT images with deep learning. 2020. arXiv preprint arXiv:2003.04655.
  24. Shi F., Wang J., Shi J., Wu Z., Wang Q., Tang Z., He K., Shi Y., Shen D. Review of artificial intelligence techniques in imaging data acquisition, segmentation and diagnosis for COVID-19. IEEE Reviews in Biomedical Engineering. 2020.
  25. Sun L., Mo Z., Yan F., Xia L., Shan F., Ding Z., Shao W., Shi F., Yuan H., Jiang H., Wu D., Wei Y., Gao Y., Gao W., Sui H., Zhang D., Shen D. Adaptive feature selection guided deep forest for COVID-19 classification with chest CT. 2020. arXiv preprint arXiv:2005.03264.
  26. Tang Z., Zhao W., Xie X., Zhong Z., Shi F., Liu J., Shen D. Severity assessment of coronavirus disease 2019 (COVID-19) using quantitative features from chest CT images. 2020. arXiv preprint arXiv:2003.11988.
  27. Vaduganathan M., Vardeny O., Michel T., McMurray J.J.V., Pfeffer M.A., Solomon S.D. Renin-angiotensin-aldosterone system inhibitors in patients with COVID-19. New England Journal of Medicine. 2020;382(17):1653–1659. doi: 10.1056/NEJMsr2005760.
  28. Wang Y., Dong C., Hu Y., Li C., Ren Q., Zhang X., Shi H., Zhou M. Temporal changes of CT findings in 90 patients with COVID-19 pneumonia: a longitudinal study. Radiology. 2020:200843.
  29. Wu Y.-H., Gao S.-H., Mei J., Xu J., Fan D.-P., Zhao C.-W., Cheng M.-M. JCS: an explainable COVID-19 diagnosis system by joint classification and segmentation. 2020. arXiv preprint arXiv:2004.07054.
  30. Yan L., Zhang H.-T., Goncalves J., Xiao Y., Wang M., Guo Y., Sun C., Tang X., Jing L., Zhang M., Huang X., Xiao Y., Cao H., Chen Y., Ren T., Wang F., Xiao Y., Huang S., Tan X., Huang N., Jiao B., Cheng C., Zhang Y., Luo A., Mombaerts L., Jin J., Cao Z., Li S., Xu H., Yuan Y. An interpretable mortality prediction model for COVID-19 patients. Nat. Mach. Intell. 2020;2(5):283–288. doi: 10.1038/s42256-020-0180-7.
  31. Yang R., Li X., Liu H., Zhen Y., Zhang X., Xiong Q., Luo Y., Gao C., Zeng W. Chest CT severity score: an imaging tool for assessing severe COVID-19. Radiology: Cardiothoracic Imaging. 2020;2(2):e200047. doi: 10.1148/ryct.2020200047.
  32. Zhao W., Zhong Z., Xie X., Yu Q., Liu J. Relation between chest CT findings and clinical conditions of coronavirus disease (COVID-19) pneumonia: a multicenter study. Am. J. Roentgenol. 2020;214(5):1072–1077. doi: 10.2214/AJR.20.22976.
  33. Zhu X., Song B., Shi F., Chen Y., Hu R., Gan J., Zhang W., Li M., Wang L., Gao Y., Shan F., Shen D. Joint prediction and time estimation of COVID-19 developing severe symptoms using chest CT scan. 2020. arXiv preprint arXiv:2005.03405.
