Skip to main content
. 2021 Jun 24;11(7):1155. doi: 10.3390/diagnostics11071155

Table 8.

Comparison between the COVID-19 medical images datasets.

Ref. Type Size URL Open-Source Metadata
medseg.ai CT scan 100 CT scans from 40 COVID-19 patients http://medicalsegmentation.com/covid19/ (access date 20 February 2021) Yes Yes
[265] CT scan 68,623 CT scan images for COVID-19 and non-COVID-19 images - No No
[266] CT scan 370 CT scan images for COVID-19 and non-COVID-19 images - Yes No
[240] X-ray 13,800 X-ray images for COVID-19 and phenomena - No No
[236] X-ray 100 X-ray images for COVID-19 and healthy class images - No Yes
[241] X-ray 230 X-ray images for COVID-19 and non-COVID-19 images - NO No
[53] X-ray 127 X-ray images for COVID-19 and non-COVID-19 images - No No
[241] X-ray 17,000 X-ray images for three class (COVID-19, healthy and phenomena - No No
[242] X-ray 2500 X-ray images for COVID-19 and non-COVID-19 images - Yes NO
[243] X-ray 4707 X-ray images for COVID-19 and non-COVID-19 images - Yes Yes
Kaggle X-ray 359 X-ray images for COVID-19 and non-COVID-19 patients https://www.kaggle.com/bachrr/covid-chest-xray (access date 20 February 2021) Yes Yes
GitHub X-ray 239 images for COVID-19-positive cases, in addition to some vital sings https://github.com/agchung/Actualmed-COVID-chestxraydataset/tree/master/images, (access date 20 February 2021) Yes Yes
[25] CT scan 34 CT scan images for COVID-19 and non-COVID-19 patients https://github.com/UCSD-AI4H/COVID-CT, (access date 20 February 2021) Yes Yes
[70] Ultrasound images (654 COVID-19-positive subjects, 277 bacterial pneumonia, and 172 healthy subjects https://github.com/jannisborn/covid19 pocus ultrasound/tree/master/data, (access date 20 February 2021) Yes Yes
[235] CT scan and X-ray images 265 COVID-19 (165 X-ray, 100 CT scans) https://github.com/ieee8023/covid-chestxray-dataset, (access date 20 February 2021) Yes Yes
EOR CT scan and X-ray images Various CT scan and X-ray images for COVID-19 patients https://www.eurorad.org/advanced-search?search=COVID, (access date 20 February 2021) No Yes
BSTI CT scan and X-ray images Various CT scan and X-ray images for COVID-19 patients https://bit.ly/BSTICovid19 Teaching Library
(access date 20 February 2021)
No Yes
[82] Cough-sound 328 sound from 150 patient - No No
[80] Cough-sound Cough and speech from 1079 normal and 92 COVID-19 https://coswara.iisc.ac.in
(access date 20 February 2021)
Yes Yes
[247] Cough sound Cough sound: 13 normal and 8 COVID-positive cases https://coughtest.online
(access date 20 February 2021)
Yes Yes
GitHub Cough sound 121 segmented coughs collected from 16 patient https://github.com/virufy/covid
(access date 20 February 2021)
Yes Yes
[81] Cough Sound 144 segmented coughs, aggregated from 28 patient - No NO
[249] Breathing sound 260 sound record aggregated from 52 COVID (32 male, 20 females) positive cases - No Yes
[76] Breathing sound 7000 unique samples, including 200 samples from COVID-19-confirmed cases - NO Yes
[266] Text data Symptoms and health reports for 62 patients in South Korea https://www.kaggle.com/kimjihoo/coronavirusdataset
(access date 20 February 2021)
Yes Yes
datahub Text data Time series symptoms from COVID-19 patients https://datahub.io/core/covid-19
(access date 20 February 2021)
Yes Yes
[69] COVID-19 (Japan) 29 columns https://www.kaggle.com/lisphilar/covid19-dataset-in-japan
(access date 20 February 2021)
Yes Yes
Word clouds Covid-19 Text Dataset Text data extracted from 13,202 scientific papers https://github.com/Sarmentor/POS-Tagging-Wordcloud-with-R
(access date 20 February 2021)
Yes Yes
Kaggle COVID-19 Predictors 28 demographic features about 96 countries (infection rate, number of ICU beds, death rate, etc) https://www.kaggle.com/nightranger77/covid19-demographic-predictors
(access date 20 February 2021)
Yes Yes
Kaggle COVID-19 country info Include information about different countries, such as death rate, infection rate, and number of rapid tests https://www.kaggle.com/koryto/countryinfo
(access date 20 February 2021)
Yes No
Kaggle Coronavirus (COVID-19) Tweets 500,000 Tweets of users write the following hashtags: #coronavirus, #covid_19 #coronavirusoutbreak, #coronavirusPandemic, #covid19 https://www.kaggle.com/smid80/coronavirus-covid19-tweets
(access date 20 February 2021)
Yes Yes
[75] COVID-19 Multilanguage Tweets Dataset 1200 M tweets collected using keywords related to COVID-19 https://sites.lafayette.edu/lopezbec/projects/covid-19-multilanguage-tweets-dataset/
(access date 20 February 2021)
Yes Yes
[76] COVID-19 Twitter Dataset 237 million tweets extracted from Twitter posts that mentioned “COVID” as a word or hashtag (e.g., COVID-19, COVID19) https://dataverse.scholarsportal.info/dataset.xhtml?persistentId=doi:10.5683/SP2/PXF2CU
(access date 20 February 2021)
yes Yes
CDCP Text data Patient symptoms and report health status in https://www.cdc.gov/coronavirus/2019-ncov/index.html
https://www.coronavirus.gov/
(access date 20 February 2021)
Yes Yes
NCBI Genome data Viral protein sequence https://www.ncbi.nlm.nih.gov/genbank/sars-cov-2-seqs/
(access date 20 February 2021)
Yes Yes
GISAID Genome data Viral protein sequence https://www.gisaid.org/
(access date 20 February 2021)
Yes Yes
GC Genome data Viral protein sequence https://db.cngb.org/datamart/disease/DATAdis19/
(access date 20 February 2021)
Yes Yes
EBI Genome data Viral structure, RNA, and protein sequence https://www.covid19dataportal.org/
(access date 20 February 2021)
Yes Yes
(NCBI). Genome data Viral protein sequence https://registry.opendata.aws/ncbi-covid-19/
(access date 20 February 2021)
Yes Yes
Zeng’s Case reports Reports on 20 projects, 16 report http://open-source-covid-19.weileizeng.com/
(access date 20 February 2021)
Yes Yes

BSTI: British Society of Thoracic Imaging; CDCP: Centers for Disease Control and Prevention in the US; GISAID: The GISAID organization; NCBI: NCBI GenBank; GC: GeneBank in China; EOR: European Organization for Radiology.