2020 Jan 29;10:1414. doi: 10.1038/s41598-020-58088-2

Table 2.

Summary of imported genomic data from various data sources in cGDM databases.

| Table name | Cancer Panel (internal) | Leukemia (internal) | Depression (internal) | TCGA COAD (internal) | TCGA LUAD (internal) | 1000 Genome Phase 3 CEU (demo) | TCGA PAAD (demo) |
|---|---|---|---|---|---|---|---|
| Row counts (per table) | | | | | | | |
| CLINICAL_IDENTIFIER | 10 | 503 | 1000 | 459 | 522 | 99 | 155 |
| EXPERIMENT_RELATED_INFORMATION | 10 | 517 | 1000 | 459 | 522 | 99 | 155 |
| BIOINFORMATICS_PROTOCOL_RELATED_INFORMATION | 10 | 517 | 1000 | 459 | 522 | 99 | 155 |
| GENOMIC_ALTERATION | 2733 | 29,279,631 | 842,199,347 | 361,933 | 318,947 | 229,525,363 | 56,159 |
| MICROSATELLITE_INSTABILITY | 0 | 0 | 0 | 0 | 0 | 0 | 775 |
| CLINICAL_ANNOTATION | 40 | 267 | 108 | 123 | 97 | 1 | 12 |
| QUALITY_CHECK | 10 | 517 | 1000 | 0 | 0 | 0 | 0 |
| Data volume (per database) | 2MB | 8.2GB | 144.7GB | 48.37MB | 42.63MB | 47.67GB | 9.41MB |

The databases are categorised into internal and demo databases. The specifications of the database tables are given in Table 1. This table presents the row count of each database table and the data volume of each database. The internal databases include 3 private datasets (cancer panel, leukemia and depression) and 2 public datasets (TCGA COAD and TCGA LUAD). The demo databases include 2 public datasets (1000 Genome Phase 3 CEU and TCGA PAAD). * COAD is the TCGA study abbreviation for colon adenocarcinoma; LUAD is the abbreviation for lung adenocarcinoma.
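
To compare datasets of very different sizes, the row counts above can be normalised. The following is a minimal Python sketch, not part of the original study, that divides the GENOMIC_ALTERATION row count by the CLINICAL_IDENTIFIER row count for each dataset. The function name `alterations_per_identifier` is hypothetical, and reading the ratio as "alterations per individual" assumes that each CLINICAL_IDENTIFIER row corresponds to one individual, which the table itself does not state.

```python
# Row counts copied from Table 2; keys are the dataset column headings.
clinical_identifier_rows = {
    "Cancer Panel": 10,
    "Leukemia": 503,
    "Depression": 1000,
    "TCGA COAD": 459,
    "TCGA LUAD": 522,
    "1000 Genome Phase 3 CEU": 99,
    "TCGA PAAD": 155,
}
genomic_alteration_rows = {
    "Cancer Panel": 2733,
    "Leukemia": 29_279_631,
    "Depression": 842_199_347,
    "TCGA COAD": 361_933,
    "TCGA LUAD": 318_947,
    "1000 Genome Phase 3 CEU": 229_525_363,
    "TCGA PAAD": 56_159,
}


def alterations_per_identifier(alterations, identifiers):
    """Ratio of GENOMIC_ALTERATION rows to CLINICAL_IDENTIFIER rows per dataset.

    Hypothetical helper for illustration only; it assumes both dictionaries
    share the same dataset keys and that no identifier count is zero.
    """
    return {name: alterations[name] / identifiers[name] for name in identifiers}


ratios = alterations_per_identifier(genomic_alteration_rows, clinical_identifier_rows)

# Print datasets from the highest ratio to the lowest.
for name, ratio in sorted(ratios.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<26} {ratio:>14,.1f}")
```

The ratios span several orders of magnitude, which is consistent with the table mixing targeted panels and tumour call sets with much larger variant sets such as 1000 Genome Phase 3 CEU.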