Table 2.
Summary of imported genomic data from various data sources in cGDM databases.
| Metric | Table name | Cancer Panel | Leukemia | Depression | TCGA COAD | TCGA LUAD | 1000 Genome Phase 3 CEU | TCGA PAAD |
|---|---|---|---|---|---|---|---|---|
| Database group | | Internal | Internal | Internal | Internal | Internal | Demo | Demo |
| Row counts (per table) | CLINICAL_IDENTIFIER | 10 | 503 | 1,000 | 459 | 522 | 99 | 155 |
| | EXPERIMENT_RELATED_INFORMATION | 10 | 517 | 1,000 | 459 | 522 | 99 | 155 |
| | BIOINFORMATICS_PROTOCOL_RELATED_INFORMATION | 10 | 517 | 1,000 | 459 | 522 | 99 | 155 |
| | GENOMIC_ALTERATION | 2,733 | 29,279,631 | 842,199,347 | 361,933 | 318,947 | 229,525,363 | 56,159 |
| | MICROSATELLITE_INSTABILITY | 0 | 0 | 0 | 0 | 0 | 0 | 775 |
| | CLINICAL_ANNOTATION | 40 | 267 | 108 | 123 | 97 | 1 | 12 |
| | QUALITY_CHECK | 10 | 517 | 1,000 | 0 | 0 | 0 | 0 |
| Data volume (per database) | | 2 MB | 8.2 GB | 144.7 GB | 48.37 MB | 42.63 MB | 47.67 GB | 9.41 MB |
The databases are categorised into internal and demo databases. The specifications of the database tables are given in Table 1. This table presents the row count of each database table and the data volume of each database. The internal databases include three private datasets (Cancer Panel, Leukemia and Depression) and two public datasets (TCGA COAD and TCGA LUAD). The demo databases include two public datasets (1000 Genome Phase 3 CEU and TCGA PAAD).
* COAD is the TCGA study abbreviation for colon adenocarcinoma; LUAD stands for lung adenocarcinoma.