Nucleic Acids Research. 1998 May 1;26(9):2230–2236. doi: 10.1093/nar/26.9.2230

Using neural networks for prediction of the subcellular location of proteins.

A Reinhardt, T Hubbard
PMCID: PMC147531  PMID: 9547285

Abstract

Neural networks have been trained to predict the subcellular location of proteins in prokaryotic or eukaryotic cells from their amino acid composition. For three possible subcellular locations in prokaryotic organisms, a prediction accuracy of 81% can be achieved. When a reliability index is assigned to each prediction, 33% of the predictions can be made with an accuracy of 91%. For eukaryotic proteins (excluding plant sequences), an overall prediction accuracy of 66% for four locations was achieved, with 33% of the sequences being predicted with an accuracy of 82% or better. Because the subcellular location restricts a protein's possible function, this method should be a useful tool for the systematic analysis of genome data. It is available via a server on the World Wide Web.
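
As an illustration of the kind of input and evaluation the abstract describes, the following is a minimal Python sketch, not the authors' implementation. It assumes the network input is a 20-element vector of amino acid fractions, and it assumes that a per-prediction confidence score can stand in for the reliability index, whose actual definition is not given in the abstract. The function names amino_acid_composition and accuracy_at_coverage are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (fixed order for the vector).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence):
    """Return the fraction of each standard amino acid in a protein sequence.

    Non-standard symbols (for example X or gaps) are ignored.
    """
    counts = Counter(sequence.upper())
    total = sum(counts[aa] for aa in AMINO_ACIDS)
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def accuracy_at_coverage(scores, correct, coverage):
    """Accuracy over the most confident fraction of predictions.

    scores   -- one confidence value per prediction (assumed stand-in
                for a reliability index; higher means more reliable)
    correct  -- one boolean per prediction (True if it was right)
    coverage -- fraction of predictions to keep, e.g. 0.33
    """
    if not scores or len(scores) != len(correct):
        raise ValueError("scores and correct must be non-empty and equal length")
    ranked = sorted(zip(scores, correct), key=lambda pair: pair[0], reverse=True)
    keep = max(1, round(len(ranked) * coverage))
    kept = ranked[:keep]
    return sum(1 for _, ok in kept if ok) / len(kept)
```

A composition vector built this way would be fed to a classifier with one output per location; calling accuracy_at_coverage with a coverage of 0.33 mirrors the style of the figures quoted above, in which accuracy is reported for the most reliable third of predictions.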

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press