2019 Mar;8(Suppl 1):S64–S77. doi: 10.21037/tau.2019.03.01

Table 1. A combined overview of the clinical, genomics and imaging datasets, ordered by number of patients included.

| Data source | Dataset name | Clinical | Genomics | Imaging | No. of patients |
|---|---|---|---|---|---|
| NPCR/SEER | 2001–2015 Database (PCa) | 31 clinical parameters, such as age, race, grade, diagnostic confirmation and laterality | | | 3,086,534 |
| NPCR/SEER | 2005–2015 Database (PCa) | 25 clinical parameters, such as age, race, grade, diagnostic confirmation and laterality | | | 2,294,444 |
| SEER | YR1973_2015.SEER9 (PCa) | 133 clinical parameters, such as age, race, Gleason scores, TNM stages, PSA values, survival data and therapy data | | | 637,005 |
| SEER | YR2000_2015.CA_KY_LO_NJ_GA (PCa) | 133 clinical parameters, such as age, race, Gleason scores, TNM stages, PSA values, survival data and therapy data | | | 461,552 |
| SEER | YR1992_2015.SJ_LA_RG_AK (PCa) | 133 clinical parameters, such as age, race, Gleason scores, TNM stages, PSA values, survival data and therapy data | | | 164,576 |
| PLCO | Prostate | Data for PCa screening, incidence, and mortality analyses | | | 76,682 |
| PLCO | Prostate Screening | Additional information from PSA and DRE cancer screens | | | 35,875 |
| PLCO | Prostate Diagnostic Procedures | Information about the diagnostic procedures prompted by positive PCa screens | | | 15,307 |
| PLCO | Prostate Treatments | Specifics of the initial treatment following the diagnosis of PCa | | | 7,614 |
| PLCO | Prostate Screening Abnormalities | Information for each induration found during the DRE screen | | | 5,743 |
| PLCO | Prostate Medical Complications | Information about the medical complications caused by diagnostic workup for PCa | | | 2,164 |
| cBioPortal/Synapse | GENIE | 13 clinical parameters, such as age, race and ethnicity | Mutation data | | 2,008 |
| SEER | YR2005.LO_2ND_HALF (PCa) | 133 clinical parameters, such as age, race, Gleason scores, TNM stages, PSA values, survival data and therapy data | | | 1,352 |
| cBioPortal | Prostate Adenocarcinoma (MSKCC/DFCI) | 19 clinical parameters, such as cancer type, diagnosis age and Gleason scores | Mutation data and copy number alteration data | | 1,013 |
| cBioPortal/ICGC/GDC/TCIA | Prostate Adenocarcinoma (TCGA, Provisional), aka PRAD-US | 100 clinical parameters, such as Gleason scores, TNM values, survival data, age, weight, ethnicity, PSA values and MRI results | Mutation data and copy number alteration data | 16,790 CT, PT, MR images in 207 series from 14 patients. 3.74 GB of data. Tissue slide images included | 498 |
| cBioPortal | Prostate adenocarcinoma (TCGA, PanCancer Atlas) | 83 clinical parameters, such as diagnosis age, cancer type, ethnicity category, patient weight and race category | Mutation data and copy number alteration data | | 494 |
| cBioPortal | Genomic Hallmarks of Prostate Adenocarcinoma (CPC-GENE) | 89 clinical parameters, such as Gleason scores, PSA values, weight, survival data, TNM stages and MRI results | Comprehensive genomic profiling of 477 Prostate Adenocarcinoma samples from CPC-GENE and public data sets, including TCGA-PRAD | | 477 |
| cBioPortal | MSK-IMPACT Clinical Sequencing Cohort (MSKCC): Prostate Cancer | 17 clinical parameters, such as clinical Gleason, age and mutation data | Targeted sequencing of clinical cases via MSK-IMPACT for PCa | | 451 |
| TCIA | PROSTATEx Challenge | | | 309,251 MR (T2W, PD-W, DCE and DW) images, 15.1 GB of data | 346 |
| cBioPortal | Prostate Adenocarcinoma (TCGA) | 89 clinical parameters, such as clinical and reviewed Gleason scores, age and gene mutation data | Integrated profiling of 333 primary prostate adenocarcinoma samples | | 333 |
| cBioPortal | Prostate Adenocarcinoma (MSKCC) | 25 clinical parameters, such as radical prostatectomy Gleason scores, survival data, tumor stages and ERG Fusion data | 181 primary, 37 metastatic PCa samples, 12 PCa cell lines and xenografts | | 216 |
| ICGC | PRAD-UK: Prostate Adenocarcinoma - United Kingdom | 6 files with clinical data: donor, donor exposure, donor family, donor therapy, sample and specimen | Simple Somatic Mutations (SSM) for 215 patients. Copy Number Somatic Mutations (CNSM) for 13 patients. Structural Somatic Mutations (StSM) for 13 patients | | 216 |
| ICGC | EOPC-DE: Early Onset Prostate Cancer - Germany | 6 files with clinical data: donor, donor exposure, donor family, donor therapy, sample and specimen | Simple Somatic Mutations (SSM) for 202 patients. Copy Number Somatic Mutations (CNSM) for 11 patients. Structural Somatic Mutations (StSM) for 11 patients | | 211 |
| cBioPortal | Metastatic Prostate Cancer, SU2C/PCF Dream Team | 20 clinical parameters, such as age and prior medications | Comprehensive analysis of 150 metastatic PCa samples | | 150 |
| ICGC | PRAD-CA: Prostate Adenocarcinoma - Canada | 6 files with clinical data: donor, donor exposure, donor family, donor therapy, sample and specimen | SSM data for 124 patients. CNSM data for 125 patients. StSM data for 123 patients. SGV data for 123 patients. METH-A data for 102 patients | | 125 |
| cBioPortal | Prostate Adenocarcinoma (Broad/Cornell 2012) | 15 clinical parameters, such as Gleason score 4–5%, age, PSA values, radical prostatectomy Gleason scores and modified Capra S Scores | Comprehensive profiling of 112 PCa samples | | 112 |
| cBioPortal | Prostate Adenocarcinoma CNA study (MSKCC) | 37 clinical parameters, such as biopsy and pathology Gleason scores, survival data, PSA values, age, extracapsular extension and treatment data | Copy-number profiling of 103 primary PCa samples from MSKCC | | 104 |
| R ElemStatLearn package | Prostate (R) | 9 clinical parameters: cancer volume, prostate weight, age, amount of benign prostatic hyperplasia, seminal vesicle invasion, capsular penetration, Gleason scores, percent of Gleason score 4 or 5 and PSA values | | | 97 |
| TCIA | Prostate-Diagnosis | 4 clinical text fields: path report biopsy, path prostate specimen, MRI report, treatment | | 32,537 MR images (T1, T2, and DCE sequences) in 368 series, 5.6 GB of data. 3D segmentation files included | 92 |
| cBioPortal | Neuroendocrine Prostate Cancer (Trento/Cornell/Broad) | 16 clinical parameters, such as genomic burden, pathology classification and ploidy | Whole exome and RNA Seq data of castration resistant adenocarcinoma and castration resistant neuroendocrine PCa (somatic mutations and copy number aberrations) | | 81 |
| cBioPortal/ICGC | Prostate Adenocarcinoma (Sun Lab), aka PRAD-CN | 20 clinical parameters, such as cancer type, diagnosis age, PSA values, Gleason scores and TNM stage | Mutation data and copy number alteration data | | 65 |
| TCIA | Prostate-3T | | | 1,258 MR (T2W) images in 64 series, 284 MB of data. Files with segmentation data included | 64 |
| cBioPortal | Prostate Adenocarcinoma (Fred Hutchinson CRC) | 26 clinical parameters, such as chemotherapy data, EXOME data, number of tumors and PSA values | Comprehensive profiling of 176 PCa samples | | 63 |
| cBioPortal | Metastatic Prostate Adenocarcinoma (MCTP) | 26 clinical parameters, such as therapy info, PSA values, Gleason scores and survival data | Comprehensive profiling of 50 metastatic CRPCs and 11 high-grade localized PCa | | 59 |
| cBioPortal | Prostate Adenocarcinoma (Broad/Cornell 2013) | 20 clinical parameters, such as Gleason score 4–5%, age, PSA values, radical prostatectomy Gleason scores and tumor stages | Comprehensive profiling of 57 PCa samples | | 57 |
| TCIA | Prostate Fused-MRI-Pathology | | | 32,508 MR images in 325 series, 4.4 GB of data. Annotated whole slide pathology images and fused Rad-Path Matlab files included | 28 |
| TCIA | Prostate-MRI | | | 22,036 MR (with some PET/CT) images in 182 series, 3.2 GB of data. Pathology images included | 26 |
| ICGC | PRAD-FR: Prostate Adenocarcinoma-France | 6 files with clinical data: donor, donor family, donor surgery, sample and specimen | SSM data, CNSM data, StSM data, SGV data | | 25 |
| TCIA | QIN PROSTATE | | | 25,981 MR images in 319 series, 4.4 GB of data | 22 |
| TCIA | QIN-PROSTATE-Repeatability | | | 2,504 MR images in 270 series, 1.1 GB of data. Manual segmentations and volume measurements included | 15 |
| TCIA | NaF Prostate | | | 64,535 PET/CT images, 12.9 GB of data. DICOM metadata digest included | 9 |
| cBioPortal | Prostate Adenocarcinoma Organoids (MSKCC) | 18 clinical parameters, such as PSA values, HGB values, ALP values, LDH values and therapy info | Exome profiling of PCa samples and matched organoids | | 7 |

GEO: 51 datasets, see Table S1.
ArrayExpress: 126 datasets, see Table S2.