Skip to main content
Comparative and Functional Genomics logoLink to Comparative and Functional Genomics
. 2001 Jun;2(3):143–154. doi: 10.1002/cfg.86

A Re-Annotation of the Saccharomyces Cerevisiae Genome

V Wood 1,, K M Rutherford 1, A Ivens 1, M-A Rajandream 1, B Barrell 1
PMCID: PMC2447204  PMID: 18628908

Abstract

Discrepancies in gene and orphan number indicated by previous analyses suggest that S. cerevisiae would benefit from a consistent re-annotation. In this analysis three new genes are identified and 46 alterations to gene coordinates are described. 370 ORFs are defined as totally spurious ORFs which should be disregarded. At least a further 193 genes could be described as very hypothetical, based on a number of criteria. It was found that disparate genes with sequence overlaps over ten amino acids (especially at the N-terminus) are rare in both S. cerevisiae and Sz. pombe. A new S. cerevisiae gene number estimate with an upper limit of 5804 is proposed, but after the removal of very hypothetical genes and pseudogenes this is reduced to 5570. Although this is likely to be closer to the true upper limit, it is still predicted to be an overestimate of gene number. A complete list of revised gene coordinates is available from the Sanger Centre (S. cerevisiae reannotation: ftp://ftp/pub/yeast/SCreannotation).

Full Text

The Full Text of this article is available as a PDF (116.0 KB).

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Altschul S. F., Gish W., Miller W., Myers E. W., Lipman D. J. Basic local alignment search tool. J Mol Biol. 1990 Oct 5;215(3):403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  2. Bairoch A., Apweiler R. The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999. Nucleic Acids Res. 1999 Jan 1;27(1):49–54. doi: 10.1093/nar/27.1.49. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bateman A., Birney E., Durbin R., Eddy S. R., Finn R. D., Sonnhammer E. L. Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins. Nucleic Acids Res. 1999 Jan 1;27(1):260–262. doi: 10.1093/nar/27.1.260. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Birney E., Thompson J. D., Gibson T. J. PairWise and SearchWise: finding the optimal alignment in a simultaneous comparison of a protein profile against all DNA translation frames. Nucleic Acids Res. 1996 Jul 15;24(14):2730–2739. doi: 10.1093/nar/24.14.2730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bucher P., Bairoch A. A generalized profile syntax for biomolecular sequence motifs and its function in automatic sequence interpretation. Proc Int Conf Intell Syst Mol Biol. 1994;2:53–61. [PubMed] [Google Scholar]
  6. DeRisi J. L., Iyer V. R., Brown P. O. Exploring the metabolic and genetic control of gene expression on a genomic scale. Science. 1997 Oct 24;278(5338):680–686. doi: 10.1126/science.278.5338.680. [DOI] [PubMed] [Google Scholar]
  7. Dujon B. The yeast genome project: what did we learn? Trends Genet. 1996 Jul;12(7):263–270. doi: 10.1016/0168-9525(96)10027-5. [DOI] [PubMed] [Google Scholar]
  8. Gaillardin C., Duchateau-Nguyen G., Tekaia F., Llorente B., Casaregola S., Toffano-Nioche C., Aigle M., Artiguenave F., Blandin G., Bolotin-Fukuhara M. Genomic exploration of the hemiascomycetous yeasts: 21. Comparative functional classification of genes. FEBS Lett. 2000 Dec 22;487(1):134–149. doi: 10.1016/s0014-5793(00)02292-4. [DOI] [PubMed] [Google Scholar]
  9. Goffeau A., Barrell B. G., Bussey H., Davis R. W., Dujon B., Feldmann H., Galibert F., Hoheisel J. D., Jacq C., Johnston M. Life with 6000 genes. Science. 1996 Oct 25;274(5287):546, 563-7. doi: 10.1126/science.274.5287.546. [DOI] [PubMed] [Google Scholar]
  10. Hieter P., Boguski M. Functional genomics: it's all how you read it. Science. 1997 Oct 24;278(5338):601–602. doi: 10.1126/science.278.5338.601. [DOI] [PubMed] [Google Scholar]
  11. Lowe T. M., Eddy S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997 Mar 1;25(5):955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Mackiewicz P., Kowalczuk M., Gierlik A., Dudek M. R., Cebrat S. Origin and properties of non-coding ORFs in the yeast genome. Nucleic Acids Res. 1999 Sep 1;27(17):3503–3509. doi: 10.1093/nar/27.17.3503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Malpertuy A., Tekaia F., Casarégola S., Aigle M., Artiguenave F., Blandin G., Bolotin-Fukuhara M., Bon E., Brottier P., de Montigny J. Genomic exploration of the hemiascomycetous yeasts: 19. Ascomycetes-specific genes. FEBS Lett. 2000 Dec 22;487(1):113–121. doi: 10.1016/s0014-5793(00)02290-0. [DOI] [PubMed] [Google Scholar]
  14. Oliver S. G. From DNA sequence to biological function. Nature. 1996 Feb 15;379(6566):597–600. doi: 10.1038/379597a0. [DOI] [PubMed] [Google Scholar]
  15. Oliver S. G., van der Aart Q. J., Agostoni-Carbone M. L., Aigle M., Alberghina L., Alexandraki D., Antoine G., Anwar R., Ballesta J. P., Benit P. The complete DNA sequence of yeast chromosome III. Nature. 1992 May 7;357(6373):38–46. doi: 10.1038/357038a0. [DOI] [PubMed] [Google Scholar]
  16. Pearson W. R., Lipman D. J. Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A. 1988 Apr;85(8):2444–2448. doi: 10.1073/pnas.85.8.2444. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Rutherford K., Parkhill J., Crook J., Horsnell T., Rice P., Rajandream M. A., Barrell B. Artemis: sequence visualization and annotation. Bioinformatics. 2000 Oct;16(10):944–945. doi: 10.1093/bioinformatics/16.10.944. [DOI] [PubMed] [Google Scholar]
  18. Sharp P. M., Cowe E. Synonymous codon usage in Saccharomyces cerevisiae. Yeast. 1991 Oct;7(7):657–678. doi: 10.1002/yea.320070702. [DOI] [PubMed] [Google Scholar]
  19. Sonnhammer E. L., Durbin R. A workbench for large-scale sequence homology analysis. Comput Appl Biosci. 1994 Jun;10(3):301–307. doi: 10.1093/bioinformatics/10.3.301. [DOI] [PubMed] [Google Scholar]
  20. Stoesser G., Tuli M. A., Lopez R., Sterk P. The EMBL Nucleotide Sequence Database. Nucleic Acids Res. 1999 Jan 1;27(1):18–24. doi: 10.1093/nar/27.1.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Xiang Z., Moore K., Wood V., Rajandream M. A., Barrell B. G., Skelton J., Churcher C. M., Lyne M. H., Devlin K., Gwilliam R. Analysis of 114 kb of DNA sequence from fission yeast chromosome 2 immediately centromere-distal to his5. Yeast. 2000 Nov;16(15):1405–1411. doi: 10.1002/1097-0061(200011)16:15<1405::AID-YEA625>3.0.CO;2-H. [DOI] [PubMed] [Google Scholar]
  22. Zhang C. T., Wang J. Recognition of protein coding genes in the yeast genome at better than 95% accuracy based on the Z curve. Nucleic Acids Res. 2000 Jul 15;28(14):2804–2814. doi: 10.1093/nar/28.14.2804. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Comparative and Functional Genomics are provided here courtesy of Wiley

RESOURCES