Skip to main content
The EMBO Journal logoLink to The EMBO Journal
. 1984 Oct;3(10):2315–2318. doi: 10.1002/j.1460-2075.1984.tb02132.x

Analysis of the distribution of charged residues in the N-terminal region of signal sequences: implications for protein export in prokaryotic and eukaryotic cells.

G von Heijne
PMCID: PMC557686  PMID: 6499832

Abstract

A statistical analysis of the distribution of charged residues in the N-terminal region of 39 prokaryotic and 134 eukaryotic signal sequences reveals a remarkable similarity between the two samples, both in terms of net charge and in terms of the position of charged residues within the N-terminal region, and suggests that the formyl group on Metf is not removed in prokaryotic signal sequences.

Full text

PDF
2315

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Amanuma H., Katori A., Obata M., Sagata N., Ikawa Y. Complete nucleotide sequence of the gene for the specific glycoprotein (gp55) of Friend spleen focus-forming virus. Proc Natl Acad Sci U S A. 1983 Jul;80(13):3913–3917. doi: 10.1073/pnas.80.13.3913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Argos P., Taylor W. L., Minth C. D., Dixon J. E. Nucleotide and amino acid sequence comparisons of preprosomatostatins. J Biol Chem. 1983 Jul 25;258(14):8788–8793. [PubMed] [Google Scholar]
  3. Bernstein K. E., Reddy E. P., Alexander C. B., Mage R. G. A cDNA sequence encoding a rabbit heavy chain variable region of the VHa2 allotype showing homologies with human heavy chain sequences. Nature. 1982 Nov 4;300(5887):74–76. doi: 10.1038/300074a0. [DOI] [PubMed] [Google Scholar]
  4. Burstein Y., Schechter I. Primary structures of N-terminal extra peptide segments linked to the variable and constant regions of immunoglobulin light chain precursors: implications on the organization and controlled expression of immunoglobulin genes. Biochemistry. 1978 Jun 13;17(12):2392–2400. doi: 10.1021/bi00605a022. [DOI] [PubMed] [Google Scholar]
  5. Chang H. C., Moriuchi T., Silver J. The heavy chain of human B-cell alloantigen HLA-DS has a variable N-terminal region and a constant immunoglobulin-like region. 1983 Oct 27-Nov 2Nature. 305(5937):813–815. doi: 10.1038/305813a0. [DOI] [PubMed] [Google Scholar]
  6. Chin W. W., Godine J. E., Klein D. R., Chang A. S., Tan L. K., Habener J. F. Nucleotide sequence of the cDNA encoding the precursor of the beta subunit of rat lutropin. Proc Natl Acad Sci U S A. 1983 Aug;80(15):4649–4653. doi: 10.1073/pnas.80.15.4649. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Clément J. M., Hofnung M. Gene sequence of the lambda receptor, an outer membrane protein of E. coli K12. Cell. 1981 Dec;27(3 Pt 2):507–514. doi: 10.1016/0092-8674(81)90392-5. [DOI] [PubMed] [Google Scholar]
  8. Davies P. L., Roach A. H., Hew C. L. DNA sequence coding for an antifreeze protein precursor from winter flounder. Proc Natl Acad Sci U S A. 1982 Jan;79(2):335–339. doi: 10.1073/pnas.79.2.335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Deschenes R. J., Lorenz L. J., Haun R. S., Roos B. A., Collier K. J., Dixon J. E. Cloning and sequence analysis of a cDNA encoding rat preprocholecystokinin. Proc Natl Acad Sci U S A. 1984 Feb;81(3):726–730. doi: 10.1073/pnas.81.3.726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Doolittle R. F. Angiotensinogen is related to the antitrypsin-antithrombin-ovalbumin family. Science. 1983 Oct 28;222(4622):417–419. doi: 10.1126/science.6604942. [DOI] [PubMed] [Google Scholar]
  11. Early P., Huang H., Davis M., Calame K., Hood L. An immunoglobulin heavy chain variable region gene is generated from three segments of DNA: VH, D and JH. Cell. 1980 Apr;19(4):981–992. doi: 10.1016/0092-8674(80)90089-6. [DOI] [PubMed] [Google Scholar]
  12. Emr S. D., Silhavy T. J. Molecular components of the signal sequence that function in the initiation of protein export. J Cell Biol. 1982 Dec;95(3):689–696. doi: 10.1083/jcb.95.3.689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Evans G. A., Margulies D. H., Camerini-Otero R. D., Ozato K., Seidman J. G. Structure and expression of a mouse major histocompatibility antigen gene, H-2Ld. Proc Natl Acad Sci U S A. 1982 Mar;79(6):1994–1998. doi: 10.1073/pnas.79.6.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Frink R. J., Eisenberg R., Cohen G., Wagner E. K. Detailed analysis of the portion of the herpes simplex virus type 1 genome encoding glycoprotein C. J Virol. 1983 Feb;45(2):634–647. doi: 10.1128/jvi.45.2.634-647.1983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Fung M. C., Hapel A. J., Ymer S., Cohen D. R., Johnson R. M., Campbell H. D., Young I. G. Molecular cloning of cDNA for murine interleukin-3. Nature. 1984 Jan 19;307(5948):233–237. doi: 10.1038/307233a0. [DOI] [PubMed] [Google Scholar]
  16. Furutani Y., Morimoto Y., Shibahara S., Noda M., Takahashi H., Hirose T., Asai M., Inayama S., Hayashida H., Miyata T. Cloning and sequence analysis of cDNA for ovine corticotropin-releasing factor precursor. Nature. 1983 Feb 10;301(5900):537–540. doi: 10.1038/301537a0. [DOI] [PubMed] [Google Scholar]
  17. Gilmore R., Blobel G. Transient involvement of signal recognition particle and its receptor in the microsomal membrane prior to protein translocation. Cell. 1983 Dec;35(3 Pt 2):677–685. doi: 10.1016/0092-8674(83)90100-9. [DOI] [PubMed] [Google Scholar]
  18. Goodman R. H., Jacobs J. W., Chin W. W., Lund P. K., Dee P. C., Habener J. F. Nucleotide sequence of a cloned structural gene coding for a precursor of pancreatic somatostatin. Proc Natl Acad Sci U S A. 1980 Oct;77(10):5869–5873. doi: 10.1073/pnas.77.10.5869. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gray A., Dull T. J., Ullrich A. Nucleotide sequence of epidermal growth factor cDNA predicts a 128,000-molecular weight protein precursor. Nature. 1983 Jun 23;303(5919):722–725. doi: 10.1038/303722a0. [DOI] [PubMed] [Google Scholar]
  20. Gray P. W., Goeddel D. V. Structure of the human immune interferon gene. Nature. 1982 Aug 26;298(5877):859–863. doi: 10.1038/298859a0. [DOI] [PubMed] [Google Scholar]
  21. Gubler U., Monahan J. J., Lomedico P. T., Bhatt R. S., Collier K. J., Hoffman B. J., Böhlen P., Esch F., Ling N., Zeytin F. Cloning and sequence analysis of cDNA for the precursor of human growth hormone-releasing factor, somatocrinin. Proc Natl Acad Sci U S A. 1983 Jul;80(14):4311–4314. doi: 10.1073/pnas.80.14.4311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hall M. N., Gabay J., Schwartz M. Evidence for a coupling of synthesis and export of an outer membrane protein in Escherichia coli. EMBO J. 1983;2(1):15–19. doi: 10.1002/j.1460-2075.1983.tb01373.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Harris S. E., Mansson P. E., Tully D. B., Burkhart B. Seminal vesicle secretion IV gene: allelic difference due to a series of 20-base-pair direct tandem repeats within an intron. Proc Natl Acad Sci U S A. 1983 Nov;80(21):6460–6464. doi: 10.1073/pnas.80.21.6460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hedrick S. M., Nielsen E. A., Kavaler J., Cohen D. I., Davis M. M. Sequence relationships between putative T-cell receptor polypeptides and immunoglobulins. Nature. 1984 Mar 8;308(5955):153–158. doi: 10.1038/308153a0. [DOI] [PubMed] [Google Scholar]
  25. Higgins T. J., Chandler P. M., Zurawski G., Button S. C., Spencer D. The biosynthesis and primary structure of pea seed lectin. J Biol Chem. 1983 Aug 10;258(15):9544–9549. [PubMed] [Google Scholar]
  26. Hirschberg J., McIntosh L. Molecular Basis of Herbicide Resistance in Amaranthus hybridus. Science. 1983 Dec 23;222(4630):1346–1349. doi: 10.1126/science.222.4630.1346. [DOI] [PubMed] [Google Scholar]
  27. Hobart P., Crawford R., Shen L., Pictet R., Rutter W. J. Cloning and sequence analysis of cDNAs encoding two distinct somatostatin precursors found in the endocrine pancreas of anglerfish. Nature. 1980 Nov 13;288(5787):137–141. doi: 10.1038/288137a0. [DOI] [PubMed] [Google Scholar]
  28. Housman D., Gillespie D., Lodish H. F. Removal of formyl-methionine residue from nascent bacteriophage f2 protein. J Mol Biol. 1972 Mar 14;65(1):163–166. doi: 10.1016/0022-2836(72)90498-6. [DOI] [PubMed] [Google Scholar]
  29. Hyldig-Nielsen J. J., Schenning L., Hammerling U., Widmark E., Heldin E., Lind P., Servenius B., Lund T., Flavell R., Lee J. S. The complete nucleotide sequence of the I-E alpha d immune response gene. Nucleic Acids Res. 1983 Aug 11;11(15):5055–5071. doi: 10.1093/nar/11.15.5055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Inana G., Piatigorsky J., Norman B., Slingsby C., Blundell T. Gene and protein structure of a beta-crystallin polypeptide in murine lens: relationship of exons and structural motifs. Nature. 1983 Mar 24;302(5906):310–315. doi: 10.1038/302310a0. [DOI] [PubMed] [Google Scholar]
  31. Itoh N., Obata K., Yanaihara N., Okamoto H. Human preprovasoactive intestinal polypeptide contains a novel PHI-27-like peptide, PHM-27. Nature. 1983 Aug 11;304(5926):547–549. doi: 10.1038/304547a0. [DOI] [PubMed] [Google Scholar]
  32. Jansen M., van Schaik F. M., Ricker A. T., Bullock B., Woods D. E., Gabbay K. H., Nussbaum A. L., Sussenbach J. S., Van den Brande J. L. Sequence of cDNA encoding human insulin-like growth factor I precursor. Nature. 1983 Dec 8;306(5943):609–611. doi: 10.1038/306609a0. [DOI] [PubMed] [Google Scholar]
  33. Kaczorek M., Delpeyroux F., Chenciner N., Streeck R. E., Murphy J. R., Boquet P., Tiollais P. Nucleotide sequence and expression of the diphtheria tox228 gene in Escherichia coli. Science. 1983 Aug 26;221(4613):855–858. doi: 10.1126/science.6348945. [DOI] [PubMed] [Google Scholar]
  34. Karathanasis S. K., Zannis V. I., Breslow J. L. Isolation and characterization of the human apolipoprotein A-I gene. Proc Natl Acad Sci U S A. 1983 Oct;80(20):6147–6151. doi: 10.1073/pnas.80.20.6147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kumamoto C. A., Oliver D. B., Beckwith J. Signal sequence mutations disrupt feedback between secretion of an exported protein and its synthesis in E. coli. 1984 Apr 26-May 2Nature. 308(5962):863–864. doi: 10.1038/308863a0. [DOI] [PubMed] [Google Scholar]
  36. Larhammar D., Hyldig-Nielsen J. J., Servenius B., Andersson G., Rask L., Peterson P. A. Exon-intron organization and complete nucleotide sequence of a human major histocompatibility antigen DC beta gene. Proc Natl Acad Sci U S A. 1983 Dec;80(23):7313–7317. doi: 10.1073/pnas.80.23.7313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Law S. W., Dugaiczyk A. Homology between the primary structure of alpha-fetoprotein, deduced from a complete cDNA sequence, and serum albumin. Nature. 1981 May 21;291(5812):201–205. doi: 10.1038/291201a0. [DOI] [PubMed] [Google Scholar]
  38. Magazin M., Minth C. D., Funckes C. L., Deschenes R., Tavianini M. A., Dixon J. E. Sequence of a cDNA encoding pancreatic preprosomatostatin-22. Proc Natl Acad Sci U S A. 1982 Sep;79(17):5152–5156. doi: 10.1073/pnas.79.17.5152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Malissen M., Hunkapiller T., Hood L. Nucleotide sequence of a light chain gene of the mouse I-A subregion: A beta d. Science. 1983 Aug 19;221(4612):750–754. doi: 10.1126/science.6410508. [DOI] [PubMed] [Google Scholar]
  40. McLean J. W., Fukazawa C., Taylor J. M. Rat apolipoprotein E mRNA. Cloning and sequencing of double-stranded cDNA. J Biol Chem. 1983 Jul 25;258(14):8993–9000. [PubMed] [Google Scholar]
  41. Mekalanos J. J., Swartz D. J., Pearson G. D., Harford N., Groyne F., de Wilde M. Cholera toxin genes: nucleotide sequence, deletion analysis and vaccine development. Nature. 1983 Dec 8;306(5943):551–557. doi: 10.1038/306551a0. [DOI] [PubMed] [Google Scholar]
  42. Michaelis S., Beckwith J. Mechanism of incorporation of cell envelope proteins in Escherichia coli. Annu Rev Microbiol. 1982;36:435–465. doi: 10.1146/annurev.mi.36.100182.002251. [DOI] [PubMed] [Google Scholar]
  43. Mostov K. E., Friedlander M., Blobel G. The receptor for transepithelial transport of IgA and IgM contains multiple immunoglobulin-like domains. Nature. 1984 Mar 1;308(5954):37–43. doi: 10.1038/308037a0. [DOI] [PubMed] [Google Scholar]
  44. Nawa H., Hirose T., Takashima H., Inayama S., Nakanishi S. Nucleotide sequences of cloned cDNAs for two types of bovine brain substance P precursor. Nature. 1983 Nov 3;306(5938):32–36. doi: 10.1038/306032a0. [DOI] [PubMed] [Google Scholar]
  45. Nawa H., Kitamura N., Hirose T., Asai M., Inayama S., Nakanishi S. Primary structures of bovine liver low molecular weight kininogen precursors and their two mRNAs. Proc Natl Acad Sci U S A. 1983 Jan;80(1):90–94. doi: 10.1073/pnas.80.1.90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Nielsen J. B., Lampen J. O. Membrane-bound penicillinases in Gram-positive bacteria. J Biol Chem. 1982 Apr 25;257(8):4490–4495. [PubMed] [Google Scholar]
  47. Pennica D., Holmes W. E., Kohr W. J., Harkins R. N., Vehar G. A., Ward C. A., Bennett W. F., Yelverton E., Seeburg P. H., Heyneker H. L. Cloning and expression of human tissue-type plasminogen activator cDNA in E. coli. Nature. 1983 Jan 20;301(5897):214–221. doi: 10.1038/301214a0. [DOI] [PubMed] [Google Scholar]
  48. Perlman D., Halvorson H. O. A putative signal peptidase recognition site and sequence in eukaryotic and prokaryotic signal peptides. J Mol Biol. 1983 Jun 25;167(2):391–409. doi: 10.1016/s0022-2836(83)80341-6. [DOI] [PubMed] [Google Scholar]
  49. Pine M. J. Kinetics of maturation of the amino termini of the cell proteins of Escherichia coli. Biochim Biophys Acta. 1969 Jan 21;174(1):359–372. doi: 10.1016/0005-2787(69)90261-5. [DOI] [PubMed] [Google Scholar]
  50. Rogers J. C., Milliman C. Isolation and sequence analysis of a barley alpha-amylase cDNA clone. J Biol Chem. 1983 Jul 10;258(13):8169–8174. [PubMed] [Google Scholar]
  51. Saito H., Maki R. A., Clayton L. K., Tonegawa S. Complete primary structures of the E beta chain and gene of the mouse major histocompatibility complex. Proc Natl Acad Sci U S A. 1983 Sep;80(18):5520–5524. doi: 10.1073/pnas.80.18.5520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Scheller R. H., Jackson J. F., McAllister L. B., Rothman B. S., Mayeri E., Axel R. A single gene encodes multiple neuropeptides mediating a stereotyped behavior. Cell. 1983 Jan;32(1):7–22. doi: 10.1016/0092-8674(83)90492-0. [DOI] [PubMed] [Google Scholar]
  53. Schenning L., Larhammar D., Bill P., Wiman K., Jonsson A. K., Rask L., Peterson P. A. Both alpha and beta chains of HLA-DC class II histocompatibility antigens display extensive polymorphism in their amino-terminal domains. EMBO J. 1984 Feb;3(2):447–452. doi: 10.1002/j.1460-2075.1984.tb01826.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Scripture J. B., Hogg R. W. The nucleotide sequences defining the signal peptides of the galactose-binding protein and the arabinose-binding protein. J Biol Chem. 1983 Sep 25;258(18):10853–10855. [PubMed] [Google Scholar]
  55. Silhavy T. J., Benson S. A., Emr S. D. Mechanisms of protein localization. Microbiol Rev. 1983 Sep;47(3):313–344. doi: 10.1128/mr.47.3.313-344.1983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Sims J., Rabbitts T. H., Estess P., Slaughter C., Tucker P. W., Capra J. D. Somatic mutation in genes for the variable portion of the immunoglobulin heavy chain. Science. 1982 Apr 16;216(4543):309–311. doi: 10.1126/science.6801765. [DOI] [PubMed] [Google Scholar]
  57. Skipper N., Thomas D. Y., Lau P. C. Cloning and sequencing of the preprotoxin-coding region of the yeast M1 double-stranded RNA. EMBO J. 1984 Jan;3(1):107–111. doi: 10.1002/j.1460-2075.1984.tb01769.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Slightom J. L., Sun S. M., Hall T. C. Complete nucleotide sequence of a French bean storage protein gene: Phaseolin. Proc Natl Acad Sci U S A. 1983 Apr;80(7):1897–1901. doi: 10.1073/pnas.80.7.1897. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Taniguchi T., Mantei N., Schwarzstein M., Nagata S., Muramatsu M., Weissmann C. Human leukocyte and fibroblast interferons are structurally related. Nature. 1980 Jun 19;285(5766):547–549. doi: 10.1038/285547a0. [DOI] [PubMed] [Google Scholar]
  60. Taniguchi T., Matsui H., Fujita T., Takaoka C., Kashima N., Yoshimoto R., Hamuro J. Structure and expression of a cloned cDNA for human interleukin-2. Nature. 1983 Mar 24;302(5906):305–310. doi: 10.1038/302305a0. [DOI] [PubMed] [Google Scholar]
  61. Uhler M., Herbert E. Complete amino acid sequence of mouse pro-opiomelanocortin derived from the nucleotide sequence of pro-opiomelanocortin cDNA. J Biol Chem. 1983 Jan 10;258(1):257–261. [PubMed] [Google Scholar]
  62. Viskochil D. H., Perry S. T., Lea O. A., Stafford D. W., Wilson E. M., French F. S. Isolation of two genomic sequences encoding the Mr = 14,000 subunit of rat prostatein. J Biol Chem. 1983 Jul 25;258(14):8861–8866. [PubMed] [Google Scholar]
  63. Vlasuk G. P., Inouye S., Ito H., Itakura K., Inouye M. Effects of the complete removal of basic amino acid residues from the signal peptide on secretion of lipoprotein in Escherichia coli. J Biol Chem. 1983 Jun 10;258(11):7141–7148. [PubMed] [Google Scholar]
  64. Walter P., Ibrahimi I., Blobel G. Translocation of proteins across the endoplasmic reticulum. I. Signal recognition protein (SRP) binds to in-vitro-assembled polysomes synthesizing secretory protein. J Cell Biol. 1981 Nov;91(2 Pt 1):545–550. doi: 10.1083/jcb.91.2.545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Watson R. J., Weis J. H., Salstrom J. S., Enquist L. W. Herpes simplex virus type-1 glycoprotein D gene: nucleotide sequence and expression in Escherichia coli. Science. 1982 Oct 22;218(4570):381–384. doi: 10.1126/science.6289440. [DOI] [PubMed] [Google Scholar]
  66. Wiebauer K., Domdey H., Diggelmann H., Fey G. Isolation and analysis of genomic DNA clones encoding the third component of mouse complement. Proc Natl Acad Sci U S A. 1982 Dec;79(23):7077–7081. doi: 10.1073/pnas.79.23.7077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Yamagata H., Nakamura K., Inouye M. Comparison of the lipoprotein gene among the Enterobacteriaceae. DNA sequence of Erwinia amylovora lipoprotein gene. J Biol Chem. 1981 Mar 10;256(5):2194–2198. [PubMed] [Google Scholar]
  68. Yanagi Y., Yoshikai Y., Leggett K., Clark S. P., Aleksander I., Mak T. W. A human T cell-specific cDNA clone encodes a protein having extensive homology to immunoglobulin chains. Nature. 1984 Mar 8;308(5955):145–149. doi: 10.1038/308145a0. [DOI] [PubMed] [Google Scholar]
  69. von Heijne G. How signal sequences maintain cleavage specificity. J Mol Biol. 1984 Feb 25;173(2):243–251. doi: 10.1016/0022-2836(84)90192-x. [DOI] [PubMed] [Google Scholar]
  70. von Heijne G. Patterns of amino acids near signal-sequence cleavage sites. Eur J Biochem. 1983 Jun 1;133(1):17–21. doi: 10.1111/j.1432-1033.1983.tb07424.x. [DOI] [PubMed] [Google Scholar]

Articles from The EMBO Journal are provided here courtesy of Nature Publishing Group

RESOURCES