Abstract
This paper describes an approach that provides Internet-based support for a genome center to map human chromosome 12, as a collaboration between laboratories at the Albert Einstein College of Medicine in Bronx, New York, and the Yale University School of Medicine in New Haven, Connecticut. Informatics is well established as an important enabling technology within the genome mapping community. The goal of this paper is to use the chromosome 12 project as a case study to introduce a medical informatics audience to certain issues involved in genome informatics and in the Internet-based support of collaborative bioscience research. Central to the approach described is a shared database (DB/12) with Macintosh clients in the participating laboratories running the 4th Dimension database program as a user-friendly front end, and a Sun SPARCstation-2 server running Sybase. The central component of the database stores information about yeast artificial chromosomes (YACs), each containing a segment of human DNA from chromosome 12 to which genome markers have been mapped, such that an overlapping set of YACs (called a "contig") can be identified, along with an ordering of the markers. The approach also includes 1) a map assembly tool developed to help biologists interpret their data, proposing a ranked set of candidate maps, 2) the integration of DB/12 with external databases and tools, and 3) the dissemination of the results. This paper discusses several of the lessons learned that apply to many other areas of bioscience, and the potential role for the field of medical informatics in helping to provide such support.
Full Text
The Full Text of this article is available as a PDF (1.4 MB).
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Benson D. A., Boguski M., Lipman D. J., Ostell J. GenBank. Nucleic Acids Res. 1994 Sep;22(17):3441–3444. doi: 10.1093/nar/22.17.3441. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Billings P. R., Smith C. L., Cantor C. R. New techniques for physical mapping of the human genome. FASEB J. 1991 Jan;5(1):28–34. doi: 10.1096/fasebj.5.1.1846833. [DOI] [PubMed] [Google Scholar]
- Cuticchia A. J., Arnold J., Timberlake W. E. ODS: ordering DNA sequences--a physical mapping algorithm based on simulated annealing. Comput Appl Biosci. 1993 Apr;9(2):215–219. doi: 10.1093/bioinformatics/9.2.215. [DOI] [PubMed] [Google Scholar]
- Emmert D. B., Stoehr P. J., Stoesser G., Cameron G. N. The European Bioinformatics Institute (EBI) databases. Nucleic Acids Res. 1994 Sep;22(17):3445–3449. doi: 10.1093/nar/22.17.3445. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fasman K. H., Cuticchia A. J., Kingsbury D. T. The GDB Human Genome Data Base anno 1994. Nucleic Acids Res. 1994 Sep;22(17):3462–3469. doi: 10.1093/nar/22.17.3462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fuchs R., Cameron G. N. Molecular biological databases: the challenge of the genome era. Prog Biophys Mol Biol. 1991;56(3):215–245. doi: 10.1016/0079-6107(91)90014-j. [DOI] [PubMed] [Google Scholar]
- Goodman N. Genome informatics. New Biol. 1991 Nov;3(11):1021–1023. [PubMed] [Google Scholar]
- Pearson M. L., Söll D. The Human Genome Project: a paradigm for information management in the life sciences. FASEB J. 1991 Jan;5(1):35–39. doi: 10.1096/fasebj.5.1.1991581. [DOI] [PubMed] [Google Scholar]
- Pearson P. L. Genome mapping databases: data acquisition, storage and access. Curr Opin Genet Dev. 1991 Jun;1(1):119–123. doi: 10.1016/0959-437x(91)80052-n. [DOI] [PubMed] [Google Scholar]