Abstract
Objectives
This study aimed to evaluate quantitative and qualitative screening measures for anomalous computed tomography (CT) scans in cancer patients with potential coronavirus disease 2019 (COVID-19) as an automated detection tool in a radiation oncology treatment setting.
Methods
We identified a non-COVID-19 cohort and patients with suspected COVID-19 with chest CT scans from February 1, 2020 to June 30, 2020. Lungs were segmented, and a mean normal Hounsfield Unit (HU) histogram was generated for the non-COVID-19 CT scans; these were used to define thresholds for designating the COVID-19-suspected histograms as normal or abnormal. Statistical measures were computed and compared to the threshold levels, and density maps were generated to examine the difference between lungs with and without COVID-19 qualitatively.
Results
The non-COVID-19 cohort consisted of 70 patients with 70 CT scans, and the cohort of suspected COVID-19 patients consisted of 59 patients with 80 CT scans. Sixty-two patients were positive for COVID-19. The mean HUs and skewness of the intensity histogram discriminated between COVID-19 positive and negative cases, with an area under the curve of 0.948 for positive and 0.944 for negative cases. Skewness correctly identified 57 of 62 positive cases, whereas mean HUs correctly identified 17 of 18 negative cases. Density maps allowed for visualization of the temporal evolution of COVID-19 disease.
Conclusions
The statistical measures and density maps evaluated here could be employed in an automated screening algorithm for COVID-19 infection. The accuracy is high enough for a simple and rapid screening tool for early identification of suspected infection in patients treated with chemotherapy and radiation therapy already receiving CT scans as part of clinical care. This screening tool could also identify other infections that present critical risks for patients undergoing chemotherapy and radiation therapy, such as pneumonitis.
Keywords: covid-19, chest ct, screening tools, density maps, decision-support
Introduction
As of January 5, 2021, there have been over 86 million cases of coronavirus disease 2019 (COVID-19), with more than 1.8 million deaths, worldwide [1]. COVID-19 is a disease secondary to infection from severe acute respiratory syndrome coronavirus 2 that can cause pneumonia and lung damage in symptomatic cases. The effect of COVID-19 on the lungs is visible in chest computed tomography (CT) scans. Since radiation oncology patients receive numerous CT scans for their diagnostic work-up, treatment planning, and tumor positioning during treatment, these scans could be leveraged to implement an automated screening tool. Patients receiving cancer treatment are more vulnerable to COVID-19 and have a higher incidence of severe cases and mortality [2,3]. Respiratory illnesses, hypertension, and diabetes have also been identified as risk factors for poor COVID-19 outcomes [4].
Initial signs of COVID-19 in the lungs might not be detected by chest radiography as they can develop in areas difficult to visualize on radiographs [5]. CT scans are, therefore, superior to chest radiography in detecting early signs of COVID-19. Increased lung density due to COVID-19 is typically found bilaterally and in the peripheral regions of the lungs [6]. CT scans were part of primary diagnoses for COVID-19 before polymerase chain reaction (PCR) swab tests became readily available. Studies on the diagnostic potential of CT scans were not typically able to distinguish between COVID-19 pneumonia and influenza-related pneumonia since both cause an increase in lung density [7]. Therefore, CT scans are no longer the first-line test for patients with suspected COVID-19.
However, the high sensitivity of CT scans can be leveraged as a screening tool, especially for patients with lung cancer, esophageal cancer, or other pulmonary indications who undergo routine CT scans as a part of their diagnostic work-up and treatment. With PCR tests readily available in most hospital systems, the high sensitivity of a CT-based screening tool can be used to identify suspected cases early and determine diagnosis using PCR testing. Furthermore, the ability to automate the rapid detection of any infection leading to respiratory disease in cancer patients would allow early intervention in this highly vulnerable patient population.
Here, we propose using statistical measures derived from Hounsfield Unit (HU) intensity volume histograms and lung density maps as a potential rapid screening tool for COVID-19 in radiation oncology patients already receiving chest CT scans as part of their clinical workflow.
Materials and methods
Selection of patients and chest CT scans
Albert Einstein College of Medicine Institutional Review Board (IRB) issued approval for this study (approval number 2016-6665; approval date 03/01/2021) by expedited review under 45 CFR 46.110 and 21 CFR 56.110. Based on the IRB-approved study protocol, we identified patients with chest CT examinations. One patient cohort consisted of patients receiving CT scans between January 1, 2019 and August 31, 2019, before the COVID-19 pandemic in the United States. Here, we selected an equal number of patients with and without contrast-enhanced CT scans as the control non-COVID-19 cohort. The other cohort consisted of patients with suspected COVID-19 with CT scans acquired between February 1, 2020 and June 30, 2020, and any CT scans for those patients acquired in the first eight months of 2019 as internal controls. Following patient selection, the CT scans were exported to MIM software (MIM Software Inc., Cleveland, OH) for lung segmentation.
Lung segmentation and data processing
The chest CT scans consisted of 5-mm slice thickness images with 512 x 512 pixels, with each scan consisting of approximately 50 to 70 slices covering the entire chest. The lungs (right and left lung together) were then segmented in MIM using a semi-automatic region-growing algorithm and manually adjusted as necessary to include all lung tissue. If there existed known non-COVID-19 pathologies such as a lung tumor, this was not included in the lung contours. Once completed, the CT scan data and lung segmentation were anonymized and exported to MATLAB (MathWorks, Natick, MA) for data processing and analysis, masking the CT data to only include the segmented lung contours as a three-dimensional (3D) image structure.
Data analysis
The CT data for the cohort of control non-COVID-19 scans were used to generate a normal mean histogram of CT HUs. Before this, any HUs above 200 were excluded to avoid the inclusion of higher density anatomy at the fringes of the lung contours. To make the histograms of lung HUs comparable between different CT scans and patients, we normalized the histograms to 100,000 data points binned into 1,200 bins ranging from -1000 to 200 HUs. The normal mean histogram was then generated by adding all of the normalized histograms of non-COVID-19 CT scans and dividing by the number of scans included.
To quantitatively differentiate between control scans and scans showing COVID-19, we included simple statistical measures such as the mean, median, and skewness of the lung HU histogram and the proportion of data points falling above the mean. We also included statistical distance measures, such as the Kolmogorov-Smirnov (KS) distance, the Earth Mover’s Distance (EMD), and the Cramér-von-Mises (CvM) distance as further quantitative measures to differentiate the CT scans. These statistical distance measures account for all information in a lung HU histogram and compare it to the mean normal histogram.
The data from the normal mean histogram of the non-COVID-19 cohort were used to define thresholds for all the study metrics to designate scans as either normal or abnormal. A scan was classified as abnormal for a given study metric if it exceeded the 99% confidence limit of the mean normal histogram. Subsequently, all suspected COVID-19 CT scans were normalized and processed in the same way as described above, and the statistical measures for each scan were computed and compared to the established threshold levels.
We found that the mean normal cumulative lung histograms generated from CT scans of patients that received contrast compared to noncontrast scans were not meaningfully different (Figure 1). Therefore, the two groups were combined in this analysis. As a sensitivity analysis was performed, separate analyses were conducted using thresholds derived separately for contrast and noncontrast scans. A further sensitivity analysis was performed by stratifying patients by age.
Visual density maps were constructed from the lung contour data using Tool for OPerations on Catalogues And Tables (TOPCAT) version 4.8 (TOPCAT, Bristol, England) freeware astrophysics graphing tool. For noncontrast CT scans, the threshold below which all HU values were excluded in the maps was -150 HU. For contrast scans, this threshold was -100 HU. The lungs were compressed in two dimensions along the cranial-caudal axis, plotting the count of voxels above the threshold along this axis in the transverse plane. These density maps were then visualized on a logarithmic scale.
Statistical analysis
The baseline for COVID-19 status and disease severity at the time of CT scan was established by retrospectively examining the patient’s medical records and PCR test results. This was performed by investigators blinded to any CT scan data. Cases were classified as either COVID-19-negative or positive at the time of CT scan, and positive cases classified as asymptomatic, mild, moderate, or severe disease. To classify the performance of the statistical measures, we computed the sensitivity, specificity, negative predictive value, positive predictive value, and accuracy at the given threshold level. The performance of each metric was determined by plotting the receiver operating characteristics (ROC) curve and calculating the accompanying area under the curve (AUC).
Results
Seventy patients with 70 chest CT scans were identified for the non-COVID-19 cohort, and 59 patients with 80 chest CT scans were identified for the cohort of suspected COVID-19 cases; patient characteristics are presented in Table 1. Both patient cohorts spanned a broad age range with similar age distribution. The majority of suspected COVID-19 CT scans were acquired with contrast enhancement. Mild and moderate COVID-19 cases comprised most of the suspected COVID-19 cohort, with approximately one-third of the cases having moderate or severe disease. Within the suspected COVID-19 cohort, 18 of the suspected cases (22.5%) were negative at the time of CT scan.
Table 1. Patient and CT scan characteristics.
Non-COVID-19 Cohort | Suspected COVID-19 Cohort | |
(N = 70 scans in 70 patients) | (N = 80 scans in 59 patients) | |
Sex, n (%) | ||
Male | 30 (43%) | 44 (45%) |
Female | 40 (57%) | 36 (55%) |
Age, years | ||
Mean (± SD) | 58 ± 17 | 62 ± 16 |
Range | 11 – 90 | 21 – 88 |
Use of contrast, n (%) | ||
Contrast | 35 (50%) | 55 (69%) |
Noncontrast | 35 (50%) | 25 (31%) |
COVID-19 case severity at time of CT scan | ||
Negative | 70 (100%) | 18 (22.5%) |
Asymptomatic | NA | 6 (7.5%) |
Mild | NA | 30 (37.5%) |
Moderate | NA | 15 (19%) |
Severe | NA | 11 (14%) |
Figure 2 shows only minor variations in the cumulative distribution of HUs between scans from patients in different age groups, and as such, we did not stratify the analysis based on age.
The statistical measures and classification of all 80 evaluated CT scans are shown in Figures 3-5. Since the CvM distance characterized cases similar to the EMD, it was not evaluated separately in a more limited range. Most of the negative scans are correctly classified as normal according to the various threshold levels depicted, except for the KS distance, which did not work well for classifying cases.
The mean HU and skewness metrics showed good separation between positive and negative COVID-19 cases at the given threshold levels. Positive COVID-19 CT scans incorrectly classified as normal corresponded with asymptomatic or mild disease, whereas CTs corresponding to moderate or severe disease were classified as abnormal. The statistical measures could not distinguish well between moderate and severe cases because there was typically widespread pneumonia with a considerable increase in lung HU numbers in most of these instances. The threshold levels shown represent the 99% confidence limits from the average normal scan described in the methods and appear to be well suited for distinguishing positive from negative cases.
Table 2 shows the corresponding measures of classification performance. Skewness and mean HUs had the best performance for classifying negative or positive cases, with an accuracy of 0.900 and 0.888, respectively. Skewness has the best sensitivity at 0.919 with 57 of 62 true positives, while mean HUs had the best specificity at 0.944 with only one false-negative classification. The EMD showed decent classification performance, but it was not on par with the other statistical measures, and the KS distance was, again, not a good classification metric for this setting.
Table 2. Performance of considered statistical measures in classifying COVID-19 cases.
Mean HUs | Median HUs | Proportion above Mean HUs | Skewness | EMD | KS distance | |
True positive | 54/62 | 49/62 | 46/62 | 57/62 | 45/62 | 33/62 |
True negatives | 17/18 | 16/18 | 14/18 | 15/18 | 14/18 | 13/18 |
Sensitivity | 0.871 | 0.790 | 0.742 | 0.919 | 0.726 | 0.532 |
Specificity | 0.944 | 0.889 | 0.778 | 0.833 | 0.778 | 0.722 |
Positive predictive value | 0.982 | 0.961 | 0.920 | 0.950 | 0.918 | 0.868 |
Negative predictive value | 0.680 | 0.552 | 0.467 | 0.750 | 0.452 | 0.310 |
Accuracy | 0.888 | 0.813 | 0.750 | 0.900 | 0.738 | 0.575 |
The ROC curves for each of the studied statistical measures are presented in Figure 6, which compares their performance for classifying positive or negative cases independent of the specific threshold level chosen. The ROC curves show that mean HUs and skewness have the best classification performance with AUC values of 0.948 and 0.944, respectively. While median HUs also showed good classification performance with an AUC of 0.900, the other metrics did not perform as well, corroborating the results presented in Table 2 above.
Figure 7 shows the classification performance ROC curves if stratifying the analysis between contrast (Figure 7A) and noncontrast (Figure 7B) CT scans. The AUC results are similar to Figure 6, where all scans are assessed together, with some potential improvement for noncontrast scans albeit with a small sample size of scans (n=25) in this group.
Figures 8, 9 present density maps for two different patients, showing the increase in lung tissue density associated with COVID-19 and how the density maps can illustrate signs of more severe disease. Using these maps, the information contained in the 3D CT scan is compressed into a two-dimensional map that allows for visualization of the COVID-19 disease burden across the entire lung in one snapshot. The density maps showed that the posterior of the lungs contained the greatest disease burden, especially for moderate and severe diseases. Comparing the density maps with the statistical measures shows the agreement between the visual qualitative interpretation of these maps with the quantitative assessment of whether to classify a scan as abnormal. The density maps can be particularly useful in tracking the progression or resolution of disease burden over time, as shown by the examples in Figures 8, 9.
Discussion
This study focused on developing a quick and straightforward potential screening tool for COVID-19 in vulnerable patients treated in a radiation oncology setting. We found that simple statistical measures of mean HUs and skewness of the intensity histogram of chest CT scans performed well in discriminating between positive and negative cases of COVID-19. Skewness had the highest sensitivity of 0.919 in the testing cohort, while mean HUs had the highest specificity of 0.944, as they offer some complimentary classification performance. This screening tool could be implemented as a quick semi-automated test applicable to any patient for whom a chest CT scan is acquired. As a correlate, the density maps generated from the segmented lungs in the CT scan act as visual aids to further characterize COVID-19 disease progression and resolution. CT scans are sometimes difficult to use for characterizing lesion growth due to the extent of disease consolidation, whereas density maps may illustrate density differences between seemingly similar regions on a CT scan more easily. More severe cases typically show consolidation at the posterior of the lungs and are suggestive of inflammation of the vasculature in milder cases [7,8]. Further corroborating the agreement between such findings and density maps as presented here could provide an interesting avenue towards improved disease classification and perhaps monitoring efficacy of various COVID-19 treatments using the suggested screening tool.
A decline in pulmonary function and changes in lung morphology are associated with natural aging [9]. There were no appreciable differences in the mean normal cumulative distribution of HUs from different age groups for our patient cohort when stratified in our non-COVID-19 cohort (Figure 2). However, in a more extensive study, we expect small differences in the mean normal cumulative distribution functions of different age groups could require stratification by age to improve classification performance further. Similarly, a larger cohort study could stratify CT scans using contrast compared with noncontrast, which had some indication of potentially improving the classification performance further but would require a larger sample size to answer definitively.
The high accuracy of the presented point measures would allow CT scans to be used as rapid screening tools, especially at the height of an outbreak when testing resources and labs are overburdened. A recent study supports this, with rapid diagnosis of COVID-19 from CT scans using convolutional neural networks showing promising results that align with our findings, especially when combined with clinical symptoms and laboratory serology [10]. The characteristics of COVID-19 on CT scans present a possibility to create more precise screening algorithms rather than just the simple metrics presented here. To this end, there have been virtual imaging trials conducted to exploit differences between COVID-19 and other respiratory illnesses to train artificial intelligence algorithms to distinguish COVID-19 specifically, based on CT scans [11]. One patient in the test cohort had secondary pneumonia attributable to influenza in December 2019 and then developed COVID-19 pneumonia in April 2020. The density maps from the CT scans for this patient showed some differences between COVID-19 and influenza-related pneumonia, but both were picked up by the classification algorithm (Figure 10).
Conclusions
The statistical point measures evaluated in this study showed good classification performance and could be implemented as part of an automated screening tool algorithm for COVID-19, especially during the height of a pandemic. With this type of methodology, an automated algorithm could be trained on several virtually generated density maps that mimic the patterns of COVID-19 of varying case severity to develop algorithms with higher specificity. The results of this study also suggest the potential for this type of screening tool to discover infections that present critical risks for patients undergoing chemotherapy and radiation therapy, such as pneumonitis. Early identification of such infections would allow rapid intervention that is key for improving the outcome of these patients as their immune system is often compromised following their cancer treatment.
Acknowledgments
Marie-Hélène Tomé received recognition as a 2021 Regeneron STS Scholar for a project submission based on this work.
The content published in Cureus is the result of clinical experience and/or research by independent individuals or organizations. Cureus is not responsible for the scientific accuracy or reliability of data or conclusions published herein. All content published within Cureus is intended only for educational, research and reference purposes. Additionally, articles published within Cureus should not be deemed a suitable substitute for the advice of a qualified health care professional. Do not disregard or avoid professional medical advice due to content published within Cureus.
Funding Statement
This work was supported in part by Varian Medical Systems, Inc. research grant 151214 A1 on behalf of Dr. Wolfgang Tomé and Dr. Madhur Garg as principal investigators
The authors have declared that no competing interests exist.
Human Ethics
Consent was obtained or waived by all participants in this study. Albert Einstein College of Medicine IRB issued approval 2016-6665. Approval Date: 03/01/2021 Expiration Date: 02/28/2022 This is to inform you that the Einstein IRB has reviewed and reapproved the above referenced human research project for the period noted above by expedited review under 45 CFR 46.110 and 21 CFR 56.110.
Animal Ethics
Animal subjects: All authors have confirmed that this study did not involve animal subjects or tissue.
References
- 1.An interactive web-based dashboard to track COVID-19 in real time. Dong E, Du H, Gardner L. Lancet Infect Dis. 2020;20:533–534. doi: 10.1016/S1473-3099(20)30120-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Extent of prior lung irradiation and mortality in COVID-19 patients with a cancer history. Kabarriti R, Brodin NP, Maron MI, et al. Adv Radiat Oncol. 2020;5:707–710. doi: 10.1016/j.adro.2020.04.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Case fatality rate of cancer patients with COVID-19 in a New York hospital system. Mehta V, Goel S, Kabarriti R, et al. Cancer Discov. 2020;10:935–941. doi: 10.1158/2159-8290.CD-20-0516. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.COVID-19 and comorbidities: a systematic review and meta-analysis. Gold MS, Sehayek D, Gabrielli S, Zhang X, McCusker C, Ben-Shoshan M. Postgrad Med. 2020;132:749–755. doi: 10.1080/00325481.2020.1786964. [DOI] [PubMed] [Google Scholar]
- 5.Diagnostic value and key features of computed tomography in coronavirus disease 2019. Li B, Li X, Wang Y, et al. Emerg Microbes Infect. 2020;9:787–793. doi: 10.1080/22221751.2020.1750307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Chest CT findings in 2019 novel coronavirus (2019-nCoV) infections from Wuhan, China: key points for the radiologist. Kanne JP. Radiology. 2020;295:16–17. doi: 10.1148/radiol.2020200241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.CT manifestations of coronavirus disease (COVID-19) pneumonia and influenza virus pneumonia: a comparative study. Lin L, Fu G, Chen S, Tao J, Qian A, Yang Y, Wang M. AJR Am J Roentgenol. 2021;216:71–79. doi: 10.2214/AJR.20.23304. [DOI] [PubMed] [Google Scholar]
- 8.Pulmonary vascular endothelialitis, thrombosis, and angiogenesis in COVID-19. Ackermann M, Verleden SE, Kuehnel M, et al. N Engl J Med. 2020;383:120–128. doi: 10.1056/NEJMoa2015432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sex and age differences in pulmonary mechanics in normal nonsmoking subjects. Gibson GJ, Pride NB, O'cain C, Quagliato R. J Appl Physiol. 1976;41:20–25. doi: 10.1152/jappl.1976.41.1.20. [DOI] [PubMed] [Google Scholar]
- 10.Artificial intelligence-enabled rapid diagnosis of patients with COVID-19. Mei X, Lee HC, Diao KY, et al. Nat Med. 2020;26:1224–1228. doi: 10.1038/s41591-020-0931-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Virtual imaging trials for coronavirus disease (COVID-19) Abadi E, Paul Segars W, Chalian H, Samei E. AJR Am J Roentgenol. 2021;216:362–368. doi: 10.2214/AJR.20.23429. [DOI] [PMC free article] [PubMed] [Google Scholar]