Skip to main content
. 2020 Mar 19;18:668–675. doi: 10.1016/j.csbj.2020.03.007

Table 1.

A summary of datasets.

Platform Cancer type Source Number of samples Number of MSI-H samples Number of MSI-L/MSS samples
RNA-seqa Colon cancer TCGA 281 52 229
Endometrial cancer 367 123 244
Esophageal cancer 89 2 87
Gastric cancer 415 80 335
Rectum cancer 94 3 91
Uterine cancer 56 2 54
Pan-cancer 1383 328 1055
Microarray (GPL570)b Gastric cancer GSE13911 39 19 20
GSE62254 300 68 232
Colorectal cancer GSE13067 74 11 63
GSE13294 155 78 77
GSE18088 53 19 34
GSE26682 160 18 142
GSE35896 61 5 56
GSE39084 70 16 54
GSE39582 536 77 459
GSE75316 59 11 48
GSE92921 58 5 53
Microarray (GPL5175)c GSE24550 65 14 51
Microarray (GPL2986)d GSE25071 46 5 41
Microarray (GPL13158)b GSE27544 22 8 14
Microarray (GPL96)b GSE26682 140 17 123
GSE41258 168 35 133

Note:

a

Poly-A.

b

Affymetrix Oligonucleotide Array.

c

Agilent Oligonucleotide Array.

d

Affymetrix Exon Array.