Skip to main content
BMC Medical Genomics logoLink to BMC Medical Genomics
. 2017 Nov 29;10:69. doi: 10.1186/s12920-017-0309-7

Correction to: FLAGS, frequently mutated genes in public exomes

Casper Shyr 1,3,4,#, Maja Tarailo-Graovac 1,2,3,#, Michael Gottlieb 1, Jessica JY Lee 1,5, Clara van Karnebeek 3,6,7, Wyeth W Wasserman 1,2,3,
PMCID: PMC5706417  PMID: 29187224

Correction

Unfortunately, the original article [1] contained an error. The additional files were included incorrectly. The correct additional files 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 and 14 are published in this correction.

Additional files

Additional file 1: Table S1. (220.1KB, txt)

This table lists the five datasets used in this study, and the genes that made up each dataset. The first row in the table shows the names of the datasets referred throughout the manuscript, and each column contains the list of genes, referred to by their official gene symbol (TXT 220 kb)

Additional file 2: Table S6. (207.1KB, vcf)

A list of variants, in variant call format (VCF), showing the mutations that were observed more than 10 times in our in-house database consisting of 150 exomes and 13 whole genomes, after they were filtered by allelic frequencies according to the annotations from dbSNP and Exome variant server (refer to methodology section for more details). (VCF 207 kb)

Additional file 3: Table S4. (185.4KB, txt)

The entire ranked list of FLAGS, with the most frequently mutated genes at the top. (TXT 185 kb)

Additional file 4: Table S2A. (677B, zip)

The list of genes that overlap between OMIM and FLAGS. Table S2B. The list of genes that overlap between HGMD and FLAGS. (ZIP 677 bytes)

Additional file 5: Table S5. (692B, txt)

A table showing a comparison of dN/dS ratio between the values we reported with our calculation (see manuscript for methodology), versus a previously published result of a gene set (refer to reference [31] in the manuscript). The results between the two methodologies were highly consistent. (TXT 692 bytes)

Additional file 6: Table S9A. (900.1KB, zip)

The list of input attributes for each gene that were fed into R for statistical analyses (dN/dS ratio, gene length, # of MeSH terms, # of HPO terms, and # of paralogs). Table S9B. The list of number of pathogenic variants from HGMD for each gene. (ZIP 900 kb)

Additional file 7: Text S3. (48.6KB, pdf)

This section contains a concise description of our in-house bioinformatics pipeline for processing exome and whole genome datasets. (PDF 48 kb)

Additional file 8: Text S4. (49.6KB, pdf)

This section provides a description of the TIDE-BC project. (PDF 49 kb)

Additional file 9: Table S7. (2.4KB, txt)

A summary of the families studied in TIDEX project, and the number of candidate variants remaining after filtering against genetic and allelic frequency thresholds. The results are broken down by family structure and the types of genetic model applied. Refer to www.tidebc.org and additional text S4 for more information on the TIDEX project, and additional text S3 for how the variants were called and filtered. (TXT 2 kb)

Additional file 10: Table S8. (81.8KB, txt)

A table of number of exomes from TIDEX project that lists out the number of rare functional variants for each protein-coding gene captured in the exome capture kits. Please refer to methodology section for how ‘rare’ and ‘functional’ descriptors were defined. (TXT 81 kb)

Additional file 11: Text S2. (48.7KB, pdf)

A comparison of our FLAGS gene ranking system against another method, residual variation intolerance score (RVIS) that also built upon public genomic datasets and ranked importance of each gene’s association to human diseases. Agreements and disagreements between the two methods are discussed. (PDF 48 kb)

Additional file 12: Text S5. (46.6KB, pdf)

This section describes an analysis looking at the distribution of number of MeSH and HPO terms per gene, after normalizing by the number of biological functionally-related literature published for that gene, as reported in GeneRIF. (PDF 46 kb)

Additional file 13: Text S1. (47.2KB, pdf)

This section describes an analysis looking at the uniformity of distribution for rare functional variants across genes. The hypothesis was that genes of less significance to monogenic human diseases would display more uniformity in the occurrences of benign coding mutations across the protein sequence, whereas genes that are more linked to causing penetrating diseases would harbor regions that are more devoid of mutations due to conservation of important protein domains. (PDF 47 kb)

Additional file 14: (108.1KB, pdf)

The PDF outlining the supplementary information for this manuscript. (PDF 108 kb)

Footnotes

Electronic supplementary material

The online version of this article (10.1186/s12920-017-0309-7) contains supplementary material, which is available to authorized users.

The original article can be found online at https://doi.org/10.1186/s12920-014-0064-y.

Reference

  • 1.Shyr et al., FLAGS, frequently mutated genes in public exomes. BMC Med Genomics 2014;7:64. DOI:10.1186/s12920-014-0064-y. [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Additional file 1: Table S1. (220.1KB, txt)

This table lists the five datasets used in this study, and the genes that made up each dataset. The first row in the table shows the names of the datasets referred throughout the manuscript, and each column contains the list of genes, referred to by their official gene symbol (TXT 220 kb)

Additional file 2: Table S6. (207.1KB, vcf)

A list of variants, in variant call format (VCF), showing the mutations that were observed more than 10 times in our in-house database consisting of 150 exomes and 13 whole genomes, after they were filtered by allelic frequencies according to the annotations from dbSNP and Exome variant server (refer to methodology section for more details). (VCF 207 kb)

Additional file 3: Table S4. (185.4KB, txt)

The entire ranked list of FLAGS, with the most frequently mutated genes at the top. (TXT 185 kb)

Additional file 4: Table S2A. (677B, zip)

The list of genes that overlap between OMIM and FLAGS. Table S2B. The list of genes that overlap between HGMD and FLAGS. (ZIP 677 bytes)

Additional file 5: Table S5. (692B, txt)

A table showing a comparison of dN/dS ratio between the values we reported with our calculation (see manuscript for methodology), versus a previously published result of a gene set (refer to reference [31] in the manuscript). The results between the two methodologies were highly consistent. (TXT 692 bytes)

Additional file 6: Table S9A. (900.1KB, zip)

The list of input attributes for each gene that were fed into R for statistical analyses (dN/dS ratio, gene length, # of MeSH terms, # of HPO terms, and # of paralogs). Table S9B. The list of number of pathogenic variants from HGMD for each gene. (ZIP 900 kb)

Additional file 7: Text S3. (48.6KB, pdf)

This section contains a concise description of our in-house bioinformatics pipeline for processing exome and whole genome datasets. (PDF 48 kb)

Additional file 8: Text S4. (49.6KB, pdf)

This section provides a description of the TIDE-BC project. (PDF 49 kb)

Additional file 9: Table S7. (2.4KB, txt)

A summary of the families studied in TIDEX project, and the number of candidate variants remaining after filtering against genetic and allelic frequency thresholds. The results are broken down by family structure and the types of genetic model applied. Refer to www.tidebc.org and additional text S4 for more information on the TIDEX project, and additional text S3 for how the variants were called and filtered. (TXT 2 kb)

Additional file 10: Table S8. (81.8KB, txt)

A table of number of exomes from TIDEX project that lists out the number of rare functional variants for each protein-coding gene captured in the exome capture kits. Please refer to methodology section for how ‘rare’ and ‘functional’ descriptors were defined. (TXT 81 kb)

Additional file 11: Text S2. (48.7KB, pdf)

A comparison of our FLAGS gene ranking system against another method, residual variation intolerance score (RVIS) that also built upon public genomic datasets and ranked importance of each gene’s association to human diseases. Agreements and disagreements between the two methods are discussed. (PDF 48 kb)

Additional file 12: Text S5. (46.6KB, pdf)

This section describes an analysis looking at the distribution of number of MeSH and HPO terms per gene, after normalizing by the number of biological functionally-related literature published for that gene, as reported in GeneRIF. (PDF 46 kb)

Additional file 13: Text S1. (47.2KB, pdf)

This section describes an analysis looking at the uniformity of distribution for rare functional variants across genes. The hypothesis was that genes of less significance to monogenic human diseases would display more uniformity in the occurrences of benign coding mutations across the protein sequence, whereas genes that are more linked to causing penetrating diseases would harbor regions that are more devoid of mutations due to conservation of important protein domains. (PDF 47 kb)

Additional file 14: (108.1KB, pdf)

The PDF outlining the supplementary information for this manuscript. (PDF 108 kb)


Articles from BMC Medical Genomics are provided here courtesy of BMC

RESOURCES