Skip to main content
BMC Medical Research Methodology logoLink to BMC Medical Research Methodology
. 2007 Jul 4;7:31. doi: 10.1186/1471-2288-7-31

Online genetic databases informing human genome epidemiology

Angela J Frodsham 1,, Julian PT Higgins 1,2
PMCID: PMC1929117  PMID: 17610726

Abstract

Background

With the advent of high throughput genotyping technology and the information available via projects such as the human genome sequencing and the HapMap project, more and more data relevant to the study of genetics and disease risk will be produced. Systematic reviews and meta-analyses of human genome epidemiology studies rely on the ability to identify relevant studies and to obtain suitable data from these studies. A first port of call for most such reviews is a search of MEDLINE. We examined whether this could be usefully supplemented by identifying databases on the World Wide Web that contain genetic epidemiological information.

Methods

We conducted a systematic search for online databases containing genetic epidemiological information on gene prevalence or gene-disease association. In those containing information on genetic association studies, we examined what additional information could be obtained to supplement a MEDLINE literature search.

Results

We identified 111 databases containing prevalence data, 67 databases specific to a single gene and only 13 that contained information on gene-disease associations. Most of the latter 13 databases were linked to MEDLINE, although five contained information that may not be available from other sources.

Conclusion

There is no single resource of structured data from genetic association studies covering multiple diseases, and in relation to the number of studies being conducted there is very little information specific to gene-disease association studies currently available on the World Wide Web. Until comprehensive data repositories are created and utilized regularly, new data will remain largely inaccessible to many systematic review authors and meta-analysts.

Background

Following the human genome project [1] and with the increasing efficiency and throughput of genotyping techniques, very high numbers of genetic variants can be examined for predisposition to disease [2]. Vast untapped resources of genotyping data sit in laboratories across the world, unlikely to ever be published due to natural tendency to better disseminate the more striking of these findings [3]. As the world of genetics moves into the era of whole genome association studies, the amount of data generated will increase still further [2].

Interpretation of the findings of genetic association studies is problematic, not only due to the selective reporting of findings, but also due to limitations of design, conduct, sample size, suboptimal analysis, and inconsistent findings across studies [4,5]. Systematic reviews and meta-analyses offer valuable means of assembling and synthesising the totality of evidence. They offer maximal power to detect true effects, the highest precision to estimate gene prevalences and gene-disease associations, and enable investigation of differences and inconsistencies across studies. However, when based solely on the available published literature they are dependent on what results have been reported, and publication-related biases may be substantial. To partly overcome this, data can be requested from primary investigators, although lack of response, changes in personnel, lack of access to archived data and unwillingness to share data can hamper such attempts. A preferable approach is for collaborative combined analyses by consortia of multiple studies [6]. The Human Genome Epidemiology Network (HuGENet) is promoting meta-analyses of genetic association studies [7], which all, to some extent, depend on information being available about which groups have examined which genetic variants.

One means of making genetic association information available is through online databases. A discussion paper published in 2000 recommended that, although resources for the provision of genomic information on the web were adequate, the availability of genetic epidemiology data was limited. This was in part blamed on the relative youth of the field of genetic epidemiology at the time [8].

Here we present findings from a systematic search for genetic epidemiology data available on the World Wide Web. Our primary motivation was to seek resources that would facilitate thorough systematic reviews or meta-analyses of gene prevalence or genetic association. We were interested both in identification of relevant studies and in availability of data that might not be published in journal articles. For genetic association information we further sought to evaluate the role of online databases as a supplement to information contained in MEDLINE, from the point of view of either a literature-based meta-analysis or in the preliminary stages of a collaborative combined analysis.

Methods

We sought databases containing epidemiological information on gene prevalence or genetic association. Prevalence databases were determined as those with information on population prevalence of genetic variants without information on the evidence that such variants are involved in disease susceptibility or progression. Association databases were determined as those containing epidemiological information relating specific genetic variants to specific health or disease outcomes. To identify these we investigated the databases listed in the 2005 issue of the Nucleic Acids Research Database issue [9] and used those listed on the Center for Disease Control and Disease Prevention (CDC) Office of Genomics and Disease Prevention website [10]. We supplemented this with a search of the world wide web using the Google [11] search engine, using the search term "database (genetics OR genomics)(phenotype OR disease OR epidemiology OR association)" on the 14th October 2005. Links from all databases identified were followed to identify further databases. We excluded general purpose reference databases (such as MEDLINE and EMBASE), databases primarily presenting information on genomics or proteomics without information on epidemiological studies, databases providing a resource for families and health care practitioners, and reported databases whose websites were found to be non-functional.

We produced a list of prevalence databases, and a list of databases addressing variants of a single gene. Databases including association information on more than one gene were the subject of detailed investigation. We extracted information from these on content, source of data, regularity of update, size of the database, accessibility, search functions, connections to other databases, administration and funding, using a pre-piloted pro forma. We developed a system of grading the database according to its potential utility within systematic reviews and meta-analyses, as a supplement to a standard search of MEDLINE. This 'Beyond-MEDLINE utility grade' runs from grade 1 for a database that includes only material available in MEDLINE (and therefore would be identified by searching MEDLINE alone) to grade 5 for a database making unpublished data available to the user.

The grade definitions are as follows:

1 Nothing novel

Database entries are equivalent to/links to MEDLINE records;

2 Novel information

Database entries are based on MEDLINE records, but with additional qualitative information, or otherwise available data (e.g. a specifically written summary, or results extracted from the cited paper);

3 Novel data

Database entries are based on MEDLINE records, but with additional quantitative information otherwise unavailable (e.g. updated results or unpublished association data);

4 Novel studies

Database enables identification of association studies not mentioned in MEDLINE records (e.g. non-MEDLINE-indexed report of an association study);

5 Novel studies and data

Database enables identification of association studies not mentioned in MEDLINE records AND includes association data from such studies (e.g. grouped data or individual patient data).

Results

A total of 448 websites were investigated, excluding duplicates. Of these, 257 were excluded, 111 were classed as containing prevalence data, 67 were classed a specific to a single gene and the remaining 13 databases were classed as containing information from genetic association studies and contained information on more than one gene. These were examined in more detail. Lists of all databases, by category, are available on our website [12].

The prevalence databases contained information on the frequency of genetic variation in multiple genes, often in more than one population. If a database only contained information relevant to a single gene, then this was placed in the gene-specific subcategory. The majority of databases in the gene-specific subcategory contained only prevalence data but some contained information about gene- disease associations, though these were often limited to the rather older field of single gene disorders. Databases containing information on only a single gene were excluded from the utility grade analysis.

Thirteen databases contained information on genetic association studies in more than a single gene (Table 1 and Additional file 1). The majority of the extracted databases are freely available to the scientific community, although three (Asthma Gene Database, MedGene and PharmGKB) require users to register in order to use the website. Most databases had entries that were specifically linked to MEDLINE citations, and added little to the information available in the relevant MEDLINE record beyond a summary of key findings. Five databases contained summary results for unpublished data, indications that a particular gene had been analysed, or (in the case of PharmGKB), access to the genotype and phenotype data enabling further analysis. These five databases of greatest utility in systematic reviews and meta-analyses are, however, restricted to the disease areas of Alzheimers disease, cardiovascular disease, hereditary inflammation and fever, pharmacogenetics and type 1 diabetes.

Table 1.

a table summarising the key information from the databases identified as containing information on genetic association studies. Further information is available in the Supplementary information section. No of entries refers to the approximate number of different study reports contained within the specified database.

Name of Website Website URL Brief Description Host/Funding No. of Entries Date of last update Beyond-MEDLINE Utility Grade Accessibility Reference
Alzgene http://www.alzforum.org/res/com/gen/alzgene/ Regularly updated collection of published genetic association studies performed on Alzheimer Disease phenotypes, from database searches and journals' contents lists. Case and control data presented. Performs crude meta-analysis of odds ratios on request. Various >1000 Mar 2006 5 Freely available [14, 20]
Asthma and Allergy Gene Database* http://cooke.gsf.de/ Database of published studies for phenotypes related to asthmatic disease. Databases of linkage and mutation information include results extracted from peer-reviewed publications Institut für Epidemiologie, Munchen >20,000 Dec 2003 3 Registered access only [21, 22]
Cytokine Gene Polymorphism in Human Disease http://www.bristol.ac.uk/cellmolmed/services/GAI/cytokine4.htm Regularly updated database with Medline-based records from a systematic review of cytokine gene polymorphisms associated with human disease. Data extracted from two publications about the study University of Bristol/Genes and Immunity >100 Mar 2002 2 Freely available [23-25]
GDPinfo† http://apps.nccd.cdc.gov/genomics/GDPQueryTool Extensive information system including large published literature database (currently only Medline records), HuGE reviews, books and various types of reports. CDC >15,000 Mar 2006 4 Freely available [13, 26]
GenAtlas http://www.genatlas.org Regularly updated database of genes, phenotypes and references. Among numerous databases are brief sections on disorders associated with genes, with lists of citations. May be biased towards statistically significant results Université René Descartes, Paris >60,000 Feb 2006 2 Freely available [27]
GeneCanvas http://genecanvas.idf.inserm.fr Database of cardiovascular candidate genes and their polymorphisms investigated at INSERM (Paris, France). Data include gene frequencies and linkage disequilibrium statistics Inserm >750 Oct 2005 4 Freely available
Genetic Association Database http://geneticassociationdb.nih.gov Database of human genetic association studies of complex diseases and disorders, based on Medline records. Data Extracted from publications. NIH/National Institute on Aging and Center for Information Technology >8000 - 2 Freely available [28]
Human Obesity Gene Map Database http://obesitygene.pbrc.edu Database of obesity-related genes, including P values for association and references. Biased in favour of statistically significant results. Pennington Biomedical Research Centre, Louisiana State University >100 Mar 2005 2 Freely available [29-34]
Infevers http://fmf.igh.cnrs.fr/infevers Database of genetic associations in hereditary inflammatory disorders, with voluntarily submitted entries. Submissions are validated by an editorial board member, Institut de Génétique Humaine, CNRS/European Union 5th framework >400 Aug 2005 4 Freely available [35]
MedGene http://hipseq.med.harvard.edu/MEDGENE Automated database of gene-disease association studies in Medline. Havard Medical School Unknown Apr 2005 1 Registered access only [36]
OMIM http://www.ncbi.nlm.nih.gov/omim/ Database of human genes and genetic disorders, containing textual information with links to MEDLINE and sequence records in the Entrez system, and links to additional related resources at NCBI and elsewhere. Johns Hopkins University >16000 Mar2006 2 Freely available [37-39]
PharmGKB http://www.pharmgkb.org Database of genomic data and clinical information from participants in pharmacogenetics research studies. Welcomes submission of primary data. Stanford University/NIH >700 - 5 Registered access only [40]
T1DBase http://t1dbase.org/ Database of type 1 diabetes data, including information from collaborating laboratories. Some indication given of unpublished data Institute for Systems Biology/JDRF International/JDRF/WT Diabetes and Inflammation laboratory >200 Mar2006 4 Freely available [41]

*- No longer being updated due to lack of financial support

† Reports included in the database may contain structured or unstructured data that are not from MEDLINE- indexed paper

Discussion

Our study aimed to identify, via a systematic search, the readily identifiable databases that have been set up to disseminate genetic epidemiology information over and above that available via MEDLINE to the scientific community. While many databases have been set up to house information on prevalence of genetic variation, with some notable exceptions little progress has been made in the field of gene-disease association data. In the 13 databases we identified on gene-disease association, all but one provided at least some extra information unavailable via a MEDLINE search alone. However, the seven databases among these that gave access to previously unavailable data (i.e. a utility grade of ∁3) clearly include only a small minority of the genetic association studies that exist (for example, Lin et al [13] found over 15,000 articles) The most useful of the databases, i.e. those providing the most, previously unavailable, information were considered excellent examples of resources potentially useful in systematic reviews and meta-analyses, but were targeted to particular fields, such as Type 1 diabetes, Hereditary Fever, Alzheimer's disease or pharmacogenetics. The utility of one such database for meta-analyses is demonstrated by a recent paper on Alzheimer's [14].

Many of the genetic epidemiology databases cited in the 2000 paper [8] are no longer updated or no longer exist, due a lack of financial support. Efforts and funding are needed to facilitate the further development of online repositories that enable the dissemination of all findings into the public domain. Any new repositories will need to provide some assurance of suitable quality control. The Human Genome Epidemiology Network (HuGENet) maintains the Published Literature Database [13], which is currently based on MEDLINE records alone. We would be keen to see this developed into a more comprehensive resource in the way that the Cochrane Central Register of Controlled Trials attempts to includes all clinical trials [15]. Neither database is currently structured to link together reports from the same study.

In the wake of the Human Genome Project, with the advent of high throughput genotyping technology, the HapMap project, and now in the era of whole genome association studies, many thousands of genotypes and other data will be generated from epidemiological studies. Only a small minority of these will be reported in traditional journals, and the published literature will continue to provide a potentially biased resource of only the most exciting findings [16]. The Human Genome Epidemiology Network (HuGENet) is committed to encouraging the dissemination of negative findings into the public domain via collaborating with existing journals and setting up on-line journals that will make this process easier. The 'Journal of Negative Results in Biomedicine' published online by BioMed Central [17] has already published several sets of null results of genetic associations and other journals have dedicated subsections for the reporting of null results [18].

We would strongly encourage individual study investigators, and especially consortia of investigators such as those in the HuGENet network of networks [6], to assemble and maintain lists of studies and data repositories. To enable the latter, an approach similar to that of the microarray research community could be adopted for gene-disease association studies: the MIAME (Minimum Information About a Microarray Experiment) guidelines encourage provision of sufficient detail about a microarray experiment for it to be replicated, and offer a format for data to be held in public repositories. Until such developments, it will continue to be difficult to interpret findings from genetic epidemiological studies easily and to fully include them in rigorous and regularly updated meta-analyses.

Since the completion of this study, the National Center for Biotechnology Information (NCBI) have announced a new database called dbGaP specifically to host genotype-phenotype studies [19]. This database appears to be an ideal example of the sort of database for which we were searching and will hopefully, in time if adequately utilised, form an essential resource for those preparing systematic reviews and meta-analyses of gene-disease association studies.

Conclusion

As a result of our systematic search for online repositories of genetic epidemiology data, we found 13 databases containing information on genetic association on more than one gene. On grading each of these with respect to the amount and type of extra data contained compared with a search of MEDLINE, we found seven that contained completely novel data that was previously unavailable (i.e. utility grade ≥ 3). This suggests that systematic reviews and meta-analyses based on published reports could be usefully supplemented with searches of some of these resources. However, the yield of information on the world wide web was still disappointingly low, and neither published literature nor online databases appear adequate to find all relevant evidence for inclusion in a comprehensive meta-analysis. We encourage study investigators to make their published and unpublished data available in suitable online repositories. A single resource providing structured data from genetic association studies covering multiple diseases would be an invaluable resource.

Abbreviations

CDC Center for Disease Control

HuGENet Human Genome Epidemiology Network

Competing interests

The author(s) declare that they have no competing interests.

Authors' contributions

AJF participated in the design of the study, carried it out, and drafted the manuscript. JPTH conceived of the study, participated in its design and coordination and helped draft the manuscript. Both authors approved the final manuscript.

Pre-publication history

The pre-publication history for this paper can be accessed here:

http://www.biomedcentral.com/1471-2288/7/31/prepub

Supplementary Material

Additional file 1

This file list all of the 13 extracted databases and gives a more detailed description of each

Click here for file (47.5KB, doc)

Contributor Information

Angela J Frodsham, Email: angelaf@mrc-bsu.cam.ac.uk.

Julian PT Higgins, Email: julian.higgins@mrc-bsu.cam.ac.uk.

References

  1. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, Levine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann N, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la BM, Dedhia N, Blocker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowski J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. doi: 10.1038/35057062. [DOI] [PubMed] [Google Scholar]
  2. Gibbs JR, Singleton A. Application of Genome-Wide Single Nucleotide Polymorphism Typing: Simple Association and Beyond. PLoS Genet. 2006;2 doi: 10.1371/journal.pgen.0020150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Little J, Khoury MJ, Bradley L, Clyne M, Gwinn M, Lin B, Lindegren ML, Yoon P. The human genome project is complete. How do we develop a handle for the pump? Am J Epidemiol. 2003;157:667–673. doi: 10.1093/aje/kwg048. [DOI] [PubMed] [Google Scholar]
  4. Ioannidis JP, Ntzani EE, Trikalinos TA, Contopoulos-Ioannidis DG. Replication validity of genetic association studies. Nat Genet. 2001;29:306–309. doi: 10.1038/ng749. [DOI] [PubMed] [Google Scholar]
  5. Lohmueller KE, Pearce CL, Pike M, Lander ES, Hirschhorn JN. Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease . Nat Genet. 2003;33:177–182. doi: 10.1038/ng1071. [DOI] [PubMed] [Google Scholar]
  6. Ioannidis JP, Bernstein J, Boffetta P, Danesh J, Dolan S, Hartge P, Hunter D, Inskip P, Jarvelin MR, Little J, Maraganore DM, Bishop JA, O'Brien TR, Petersen G, Riboli E, Seminara D, Taioli E, Uitterlinden AG, Vineis P, Winn DM, Salanti G, Higgins JP, Khoury MJ. A network of investigator networks in human genome epidemiology . Am J Epidemiol. 2005;162:302–304. doi: 10.1093/aje/kwi201. [DOI] [PubMed] [Google Scholar]
  7. Ioannidis JP, Gwinn M, Little J, Higgins JP, Bernstein JL, Boffetta P, Bondy M, Bray MS, Brenchley PE, Buffler PA, Casas JP, Chokkalingam A, Danesh J, Smith GD, Dolan S, Duncan R, Gruis NA, Hartge P, Hashibe M, Hunter DJ, Jarvelin MR, Malmer B, Maraganore DM, Newton-Bishop JA, O'Brien TR, Petersen G, Riboli E, Salanti G, Seminara D, Smeeth L, Taioli E, Timpson N, Uitterlinden AG, Vineis P, Wareham N, Winn DM, Zimmern R, Khoury MJ. A road map for efficient and reliable human genome epidemiology . Nat Genet. 2006;38:3–5. doi: 10.1038/ng0106-3. [DOI] [PubMed] [Google Scholar]
  8. Marks AD, Yoon PW. Human Genome Epidemiology Information (HuGE) on the Internet. 2000.
  9. Galperin MY. The Molecular Biology Database Collection: 2005 update . Nucleic Acids Res. 2005;33:D5–24. doi: 10.1093/nar/gki139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Center for Disease Control and Disease Prevention (CDC)- Genomics and Disease Prevention. http://www.cdc.gov/genomics/
  11. Google http://www.google.com
  12. HuGENet UK Co-ordinating Centre. http://www.hugenet.org.uk/
  13. Lin BK, Clyne M, Walsh M, Gomez O, Yu W, Gwinn M, Khoury MJ. Tracking the epidemiology of human genes in the literature: the HuGE Published Literature database. Am J Epidemiol. 2006;164:1–4. doi: 10.1093/aje/kwj175. [DOI] [PubMed] [Google Scholar]
  14. Bertram L, McQueen MB, Mullin K, Blacker D, Tanzi RE. Systematic meta-analyses of Alzheimer disease genetic association studies: the AlzGene database. Nat Genet. 2007;39:17–23. doi: 10.1038/ng1934. [DOI] [PubMed] [Google Scholar]
  15. Dickersin K, Manheimer E, Wieland S, Robinson KA, Lefebvre C, McDonald S. Development of the Cochrane Collaboration's CENTRAL Register of controlled clinical trials. Eval Health Prof. 2002;25:38–64. doi: 10.1177/016327870202500104. [DOI] [PubMed] [Google Scholar]
  16. Little J, Bradley L, Bray MS, Clyne M, Dorman J, Ellsworth DL, Hanson J, Khoury M, Lau J, O'Brien TR, Rothman N, Stroup D, Taioli E, Thomas D, Vainio H, Wacholder S, Weinberg C. Reporting, appraising, and integrating data on genotype prevalence and gene-disease associations. Am J Epidemiol. 2002;156:300–310. doi: 10.1093/oxfordjournals.aje.a000179. [DOI] [PubMed] [Google Scholar]
  17. Pfeffer C, Olsen BR. Editorial: Journal of negative results in biomedicine. J Negat Results Biomed. 2002;1:2. doi: 10.1186/1477-5751-1-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Shields PG. Publication bias is a scientific problem with adverse ethical outcomes: the case for a section for null results. Cancer Epidemiol Biomarkers Prev. 2000;9:771–772. [PubMed] [Google Scholar]
  19. NCBI dbGaP. 2006.
  20. Bertram L, McQueen M, Mullin K, Blacker D, Tanzi R. The Alz Gene Database. Alzhiemer Research Forum. Web. 2006.
  21. Immervoll T, Wjst M. Current status of the Asthma and Allergy Database . Nucleic Acids Res. 1999;27:213–214. doi: 10.1093/nar/27.1.213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Wjst M, Immervoll T. An Internet linkage and mutation database for the complex phenotype asthma . Bioinformatics. 1998;14:827–828. doi: 10.1093/bioinformatics/14.9.827. [DOI] [PubMed] [Google Scholar]
  23. Bidwell J, Keen L, Gallagher G, Kimberly R, Huizinga T, McDermott MF, Oksenberg J, McNicholl J, Pociot F, Hardt C, D'Alfonso S. Cytokine gene polymorphism in human disease: on-line databases. Genes Immun. 1999;1:3–19. doi: 10.1038/sj.gene.6363645. [DOI] [PubMed] [Google Scholar]
  24. Bidwell J, Keen L, Gallagher G, Kimberly R, Huizinga T, McDermott MF, Oksenberg J, McNicholl J, Pociot F, Hardt C, D'Alfonso S. Cytokine gene polymorphism in human disease: on-line databases, supplement 1. Genes Immun. 2001;2:61–70. doi: 10.1038/sj.gene.6363733. [DOI] [PubMed] [Google Scholar]
  25. Haukim N, Bidwell JL, Smith AJ, Keen LJ, Gallagher G, Kimberly R, Huizinga T, McDermott MF, Oksenberg J, McNicholl J, Pociot F, Hardt C, D'Alfonso S. Cytokine gene polymorphism in human disease: on-line databases, supplement 2 . Genes Immun. 2002;3:313–330. doi: 10.1038/sj.gene.6363881. [DOI] [PubMed] [Google Scholar]
  26. Khoury MJ, Mensah GA. Genomics and the Prevention and Control of Common Chronic Diseases: Emerging Priorities for Public Health Action. Prev Chronic Dis (serial online) 2005;2:A05. [PMC free article] [PubMed] [Google Scholar]
  27. Frezal J. Genatlas database, genes and development defects . CR Acad Sci III. 1998;321:805–817. doi: 10.1016/s0764-4469(99)80021-3. [DOI] [PubMed] [Google Scholar]
  28. Becker KG, Barnes KC, Bright TJ, Wang SA. The genetic association database. Nat Genet. 2004;36:431–432. doi: 10.1038/ng0504-431. [DOI] [PubMed] [Google Scholar]
  29. Chagnon YC, Perusse L, Weisnagel SJ, Rankinen T, Bouchard C. The human obesity gene map: the 1999 update. Obes Res. 2000;8:89–117. doi: 10.1038/oby.2000.12. [DOI] [PubMed] [Google Scholar]
  30. Perusse L, Chagnon YC, Weisnagel SJ, Rankinen T, Snyder E, Sands J, Bouchard C. The human obesity gene map: the 2000 update . Obes Res. 2001;9:135–169. doi: 10.1038/oby.2001.17. [DOI] [PubMed] [Google Scholar]
  31. Rankinen T, Perusse L, Weisnagel SJ, Snyder EE, Chagnon YC, Bouchard C. The human obesity gene map: the 2001 update . Obes Res. 2002;10:196–243. doi: 10.1038/oby.2002.30. [DOI] [PubMed] [Google Scholar]
  32. Chagnon YC, Rankinen T, Snyder EE, Weisnagel SJ, Perusse L, Bouchard C. The human obesity gene map: the 2002 update . Obes Res. 2003;11:313–367. doi: 10.1038/oby.2003.47. [DOI] [PubMed] [Google Scholar]
  33. Snyder EE, Walts B, Perusse L, Chagnon YC, Weisnagel SJ, Rankinen T, Bouchard C. The human obesity gene map: the 2003 update . Obes Res. 2004;12:369–439. doi: 10.1038/oby.2004.47. [DOI] [PubMed] [Google Scholar]
  34. Perusse L, Rankinen T, Zuberi A, Chagnon YC, Weisnagel SJ, Argyropoulos G, Walts B, Snyder EE, Bouchard C. The human obesity gene map: the 2004 update . Obes Res. 2005;13:381–490. doi: 10.1038/oby.2005.50. [DOI] [PubMed] [Google Scholar]
  35. Sarrauste M, Terriere S, Pugnere D, Ruiz M, Demaille J, Touitou I. INFEVERS: the Registry for FMF and hereditary inflammatory disorders mutations . Nucleic Acids Res. 2003;31:282–285. doi: 10.1093/nar/gkg031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Hu Y, Hines LM, Weng H, Zuo D, Rivera M, Richardson A, LaBaer J. Analysis of genomic and proteomic data using advanced literature mining . J ProteomeRes. 2003;2:405–412. doi: 10.1021/pr0340227. [DOI] [PubMed] [Google Scholar]
  37. Hamosh A, Scott AF, Amberger J, Bocchini C, Valle D, McKusick VA. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders . Nucleic Acids Res. 2002;30:52–55. doi: 10.1093/nar/30.1.52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Hamosh A, Scott AF, Amberger J, Valle D, McKusick VA. Online Mendelian Inheritance in Man (OMIM) . Hum Mutat. 2000;15:57–61. doi: 10.1002/(SICI)1098-1004(200001)15:1<57::AID-HUMU12>3.0.CO;2-G. [DOI] [PubMed] [Google Scholar]
  39. Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders . Nucleic Acids Res. 2005;33:D514–D517. doi: 10.1093/nar/gki033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Hewett M, Oliver DE, Rubin DL, Easton KL, Stuart JM, Altman RB, Klein TE. PharmGKB: the Pharmacogenetics Knowledge Base. Nucleic Acids Res. 2002;30:163–165. doi: 10.1093/nar/30.1.163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Smink LJ, Helton EM, Healy BC, Cavnor CC, Lam AC, Flamez D, Burren OS, Wang Y, Dolman GE, Burdick DB, Everett VH, Glusman G, Laneri D, Rowen L, Schuilenburg H, Walker NM, Mychaleckyj J, Wicker LS, Eizirik DL, Todd JA, Goodman N. T1DBase, a community web-based resource for type 1 diabetes research. Nucleic Acids Res. 2005;33:D544–D549. doi: 10.1093/nar/gki095. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Additional file 1

This file list all of the 13 extracted databases and gives a more detailed description of each

Click here for file (47.5KB, doc)

Articles from BMC Medical Research Methodology are provided here courtesy of BMC

RESOURCES