Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 1982 Oct;79(20):6196–6200. doi: 10.1073/pnas.79.20.6196

Relationship between the total size of exons and introns in protein-coding genes of higher eukaryotes.

H Naora, N J Deacon
PMCID: PMC347086  PMID: 6959108

Abstract

We have attempted to ascertain the correlation between the genetic information content in the exons and the surrounding intron sequences with regard to their spatial arrangement within a gene. A comparison is made of the sizes, taken from recent publications, of exons and introns of approximately equal to 80 different protein-coding chromosomal genes, mostly from higher eukaryotes. The exons of these genes do not show very marked variation in size and can be classified into three major discrete and two minor additional size groups, whereas individual introns vary considerably in size within and between genes. Notwithstanding, the overall length of all introns present within a given gene is a function of the total size, mostly corresponding to the total genetic information content, of the exons. Three cases that violate this exon-size dependency of introns are genes coding for (i) histone H1, feather keratin, and interferons, (ii) tubulin and actin, and (iii) silk fibroin. The exons of these genes are larger than 0.7 kilobase pair in total size and the genes show a strong sequence homogeneity among the repetitious family members or internal repeats of coding sequences within the gene. We propose that conservation of sequences, which is required by the family members, internal repeats, or the entire gene, would actually motivate the removal of introns.

Full text

PDF
6196

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Artymiuk P. J., Blake C. C., Sippel A. E. Genes pieced together--exons delineate homologous structures of diverged lysozymes. Nature. 1981 Mar 26;290(5804):287–288. doi: 10.1038/290287a0. [DOI] [PubMed] [Google Scholar]
  2. Baralle F. E., Shoulders C. C., Proudfoot N. J. The primary structure of the human epsilon-globin gene. Cell. 1980 Oct;21(3):621–626. doi: 10.1016/0092-8674(80)90425-0. [DOI] [PubMed] [Google Scholar]
  3. Barta A., Richards R. I., Baxter J. D., Shine J. Primary structure and evolution of rat growth hormone gene. Proc Natl Acad Sci U S A. 1981 Aug;78(8):4867–4871. doi: 10.1073/pnas.78.8.4867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bernard O., Gough N. M. Nucleotide sequence of immunoglobulin heavy chain joining segments between translocated VH and mu constant regions genes. Proc Natl Acad Sci U S A. 1980 Jun;77(6):3630–3634. doi: 10.1073/pnas.77.6.3630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Blomberg B., Traunecker A., Eisen H., Tonegawa S. Organization of four mouse lambda light chain immunoglobulin genes. Proc Natl Acad Sci U S A. 1981 Jun;78(6):3765–3769. doi: 10.1073/pnas.78.6.3765. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bonitz S. G., Coruzzi G., Thalenfeld B. E., Tzagoloff A., Macino G. Assembly of the mitochondrial membrane system. Structure and nucleotide sequence of the gene coding for subunit 1 of yeast cytochrme oxidase. J Biol Chem. 1980 Dec 25;255(24):11927–11941. [PubMed] [Google Scholar]
  7. Breathnach R., Chambon P. Organization and expression of eucaryotic split genes coding for proteins. Annu Rev Biochem. 1981;50:349–383. doi: 10.1146/annurev.bi.50.070181.002025. [DOI] [PubMed] [Google Scholar]
  8. Catterall J. F., Stein J. P., Kristo P., Means A. R., O'Malley B. W. Primary sequence of ovomucoid messenger RNA as determined from cloned complementary DNA. J Cell Biol. 1980 Nov;87(2 Pt 1):480–487. doi: 10.1083/jcb.87.2.480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chien Y. H., Thompson E. B. Genomic organization of rat prolactin and growth hormone genes. Proc Natl Acad Sci U S A. 1980 Aug;77(8):4583–4587. doi: 10.1073/pnas.77.8.4583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Clark-Walker G. D., Sriprakash K. S. Sequence rearrangements between mitochondrial DNAs of Torulopsis glabrata and Kloeckera africana identified by hybridization with six polypeptide encoding regions from Saccharomyces cerevisiae mitochondrial DNA. J Mol Biol. 1981 Sep 25;151(3):367–387. doi: 10.1016/0022-2836(81)90002-4. [DOI] [PubMed] [Google Scholar]
  11. Cochet M., Gannon F., Hen R., Maroteaux L., Perrin F., Chambon P. Organization and sequence studies of the 17-piece chicken conalbumin gene. Nature. 1979 Dec 6;282(5739):567–574. doi: 10.1038/282567a0. [DOI] [PubMed] [Google Scholar]
  12. Cowan N. J., Wilde C. D., Chow L. T., Wefald F. C. Structural variation among human beta-tubulin genes. Proc Natl Acad Sci U S A. 1981 Aug;78(8):4877–4881. doi: 10.1073/pnas.78.8.4877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. DeNoto F. M., Moore D. D., Goodman H. M. Human growth hormone DNA sequence and mRNA structure: possible alternative splicing. Nucleic Acids Res. 1981 Aug 11;9(15):3719–3730. doi: 10.1093/nar/9.15.3719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Deacon N. J., Shine J., Naora H. Complete nucleotide sequence of a cloned chicken alpha-globin cDNA. Nucleic Acids Res. 1980 Mar 25;8(6):1187–1199. doi: 10.1093/nar/8.6.1187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Dickson L. A., Ninomiya Y., Bernard M. P., Pesciotta D. M., Parsons J., Green G., Eikenberry E. F., de Crombrugghe B., Vogeli G., Pastan I. The exon/intron structure of the 3'-region of the pro alpha 2(I) collagen gene. J Biol Chem. 1981 Aug 25;256(16):8407–8415. [PubMed] [Google Scholar]
  16. Dodgson J. B., McCune K. C., Rusling D. J., Krust A., Engel J. D. Adult chicken alpha-globin genes alpha A and alpha D: no anemic shock alpha-globin exists in domestic chickens. Proc Natl Acad Sci U S A. 1981 Oct;78(10):5998–6002. doi: 10.1073/pnas.78.10.5998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Durnam D. M., Perrin F., Gannon F., Palmiter R. D. Isolation and characterization of the mouse metallothionein-I gene. Proc Natl Acad Sci U S A. 1980 Nov;77(11):6511–6515. doi: 10.1073/pnas.77.11.6511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Early P., Huang H., Davis M., Calame K., Hood L. An immunoglobulin heavy chain variable region gene is generated from three segments of DNA: VH, D and JH. Cell. 1980 Apr;19(4):981–992. doi: 10.1016/0092-8674(80)90089-6. [DOI] [PubMed] [Google Scholar]
  19. Engel J. D., Dodgson J. B. Analysis of the closely linked adult chicken alpha-globin genes in recombinant DNAs. Proc Natl Acad Sci U S A. 1980 May;77(5):2596–2600. doi: 10.1073/pnas.77.5.2596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Firtel R. A. Multigene families encoding actin and tubulin. Cell. 1981 Apr;24(1):6–7. doi: 10.1016/0092-8674(81)90494-3. [DOI] [PubMed] [Google Scholar]
  21. Fyrberg E. A., Kindle K. L., Davidson N., Kindle K. L. The actin genes of Drosophila: a dispersed multigene family. Cell. 1980 Feb;19(2):365–378. doi: 10.1016/0092-8674(80)90511-5. [DOI] [PubMed] [Google Scholar]
  22. Gage L. P., Manning R. F. Internal structure of the silk fibroin gene of Bombyx mori. I The fibroin gene consists of a homogeneous alternating array of repetitious crystalline and amorphous coding sequences. J Biol Chem. 1980 Oct 10;255(19):9444–9450. [PubMed] [Google Scholar]
  23. Gallwitz D., Sures I. Structure of a split yeast gene: complete nucleotide sequence of the actin gene in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A. 1980 May;77(5):2546–2550. doi: 10.1073/pnas.77.5.2546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Gilbert W. Why genes in pieces? Nature. 1978 Feb 9;271(5645):501–501. doi: 10.1038/271501a0. [DOI] [PubMed] [Google Scholar]
  25. Go M. Correlation of DNA exonic regions with protein structural units in haemoglobin. Nature. 1981 May 7;291(5810):90–92. doi: 10.1038/291090a0. [DOI] [PubMed] [Google Scholar]
  26. Goeddel D. V., Leung D. W., Dull T. J., Gross M., Lawn R. M., McCandliss R., Seeburg P. H., Ullrich A., Yelverton E., Gray P. W. The structure of eight distinct cloned human leukocyte interferon cDNAs. Nature. 1981 Mar 5;290(5801):20–26. doi: 10.1038/290020a0. [DOI] [PubMed] [Google Scholar]
  27. Gruss P., Khoury G. Rescue of a splicing defective mutant by insertion of an heterologous intron. Nature. 1980 Aug 7;286(5773):634–637. doi: 10.1038/286634a0. [DOI] [PubMed] [Google Scholar]
  28. Hardison R. C., Butler E. T., 3rd, Lacy E., Maniatis T., Rosenthal N., Efstratiadis A. The structure and transcription of four linked rabbit beta-like globin genes. Cell. 1979 Dec;18(4):1285–1297. doi: 10.1016/0092-8674(79)90239-3. [DOI] [PubMed] [Google Scholar]
  29. Heilig R., Perrin F., Gannon F., Mandel J. L., Chambon P. The ovalbumin gene family: structure of the X gene and evolution of duplicated split genes. Cell. 1980 Jul;20(3):625–637. doi: 10.1016/0092-8674(80)90309-8. [DOI] [PubMed] [Google Scholar]
  30. Hieter P. A., Max E. E., Seidman J. G., Maizel J. V., Jr, Leder P. Cloned human and mouse kappa immunoglobulin constant and J region genes conserve homology in functional segments. Cell. 1980 Nov;22(1 Pt 1):197–207. doi: 10.1016/0092-8674(80)90168-3. [DOI] [PubMed] [Google Scholar]
  31. Jones C. W., Kafatos F. C. Structure, organization and evolution of developmentally regulated chorion genes in a silkmoth. Cell. 1980 Dec;22(3):855–867. doi: 10.1016/0092-8674(80)90562-0. [DOI] [PubMed] [Google Scholar]
  32. Jones R. E., Bhat S. P., Sullivan M. A., Piatigorsky J. Comparison of two delta-crystallin genes in the chicken. Proc Natl Acad Sci U S A. 1980 Oct;77(10):5879–5883. doi: 10.1073/pnas.77.10.5879. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Jung A., Sippel A. E., Grez M., Schütz G. Exons encode functional and structural units of chicken lysozyme. Proc Natl Acad Sci U S A. 1980 Oct;77(10):5759–5763. doi: 10.1073/pnas.77.10.5759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Kedes L. H. Histone genes and histone messengers. Annu Rev Biochem. 1979;48:837–870. doi: 10.1146/annurev.bi.48.070179.004201. [DOI] [PubMed] [Google Scholar]
  35. Kioussis D., Eiferman F., van de Rijn P., Gorin M. B., Ingram R. S., Tilghman S. M. The evolution of alpha-fetoprotein and albumin. II. The structures of the alpha-fetoprotein and albumin genes in the mouse. J Biol Chem. 1981 Feb 25;256(4):1960–1967. [PubMed] [Google Scholar]
  36. Konkel D. A., Maizel J. V., Jr, Leder P. The evolution and sequence comparison of two recently diverged mouse chromosomal beta--globin genes. Cell. 1979 Nov;18(3):865–873. doi: 10.1016/0092-8674(79)90138-7. [DOI] [PubMed] [Google Scholar]
  37. Lacy E., Maniatis T. The nucleotide sequence of a rabbit beta-globin pseudogene. Cell. 1980 Sep;21(2):545–553. doi: 10.1016/0092-8674(80)90492-4. [DOI] [PubMed] [Google Scholar]
  38. Lawn R. M., Adelman J., Franke A. E., Houck C. M., Gross M., Najarian R., Goeddel D. V. Human fibroblast interferon gene lacks introns. Nucleic Acids Res. 1981 Mar 11;9(5):1045–1052. doi: 10.1093/nar/9.5.1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Lawn R. M., Efstratiadis A., O'Connell C., Maniatis T. The nucleotide sequence of the human beta-globin gene. Cell. 1980 Oct;21(3):647–651. doi: 10.1016/0092-8674(80)90428-6. [DOI] [PubMed] [Google Scholar]
  40. Lingappa V. R., Lingappa J. R., Blobel G. Chicken ovalbumin contains an internal signal sequence. Nature. 1979 Sep 13;281(5727):117–121. doi: 10.1038/281117a0. [DOI] [PubMed] [Google Scholar]
  41. Lomedico P., Rosenthal N., Efstratidadis A., Gilbert W., Kolodner R., Tizard R. The structure and evolution of the two nonallelic rat preproinsulin genes. Cell. 1979 Oct;18(2):545–558. doi: 10.1016/0092-8674(79)90071-0. [DOI] [PubMed] [Google Scholar]
  42. Manning R. F., Gage L. P. Internal structure of the silk fibroin gene of Bombyx mori. II. Remarkable polymorphism of the organization of crystalline and amorphous coding sequences. J Biol Chem. 1980 Oct 10;255(19):9451–9457. [PubMed] [Google Scholar]
  43. McGhee J. D., Felsenfeld G. Nucleosome structure. Annu Rev Biochem. 1980;49:1115–1156. doi: 10.1146/annurev.bi.49.070180.005343. [DOI] [PubMed] [Google Scholar]
  44. Michelson A. M., Orkin S. H. The 3' untranslated regions of the duplicated human alpha-globin genes are unexpectedly divergent. Cell. 1980 Nov;22(2 Pt 2):371–377. doi: 10.1016/0092-8674(80)90347-5. [DOI] [PubMed] [Google Scholar]
  45. Nagata S., Mantei N., Weissmann C. The structure of one of the eight or more distinct chromosomal genes for human interferon-alpha. Nature. 1980 Oct 2;287(5781):401–408. doi: 10.1038/287401a0. [DOI] [PubMed] [Google Scholar]
  46. Nakanishi S., Teranishi Y., Watanabe Y., Notake M., Noda M., Kakidani H., Jingami H., Numa S. Isolation and characterization of the bovine corticotropin/beta-lipotropin precursor gene. Eur J Biochem. 1981 Apr;115(3):429–438. doi: 10.1111/j.1432-1033.1981.tb06220.x. [DOI] [PubMed] [Google Scholar]
  47. Naora H., Deacon N. J. Clustered genes require extragenic territorial DNA sequences. Differentiation. 1982;21(1):1–6. doi: 10.1111/j.1432-0436.1982.tb01186.x. [DOI] [PubMed] [Google Scholar]
  48. Nishioka Y., Leder P. The complete sequence of a chromosomal mouse alpha--globin gene reveals elements conserved throughout vertebrate evolution. Cell. 1979 Nov;18(3):875–882. doi: 10.1016/0092-8674(79)90139-9. [DOI] [PubMed] [Google Scholar]
  49. Nobrega F. G., Tzagoloff A. Assembly of the mitochondrial membrane system. DNA sequence and organization of the cytochrome b gene in Saccharomyces cerevisiae D273-10B. J Biol Chem. 1980 Oct 25;255(20):9828–9837. [PubMed] [Google Scholar]
  50. Ohno S., Kato K., Hozumi T., Matsunaga T. Mouse immunoglobulin coding sequences for the heavy-chain variable region arose as repeats of the two short building blocks. Proc Natl Acad Sci U S A. 1982 Jan;79(1):132–136. doi: 10.1073/pnas.79.1.132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Ohno S. Original domain for the serum albumin family arose from repeated sequences. Proc Natl Acad Sci U S A. 1981 Dec;78(12):7657–7661. doi: 10.1073/pnas.78.12.7657. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Overbeek P. A., Merlino G. T., Peters N. K., Cohn V. H., Moore G. P., Kleinsmith L. J. Characterization of five members of the actin gene family in the sea urchin. Biochim Biophys Acta. 1981 Dec 28;656(2):195–205. doi: 10.1016/0005-2787(81)90087-3. [DOI] [PubMed] [Google Scholar]
  53. Patient R. K., Elkington J. A., Kay R. M., Williams J. G. Internal organization of the major adult alpha- and beta-globin genes of X. laevis. Cell. 1980 Sep;21(2):565–573. doi: 10.1016/0092-8674(80)90494-8. [DOI] [PubMed] [Google Scholar]
  54. Perler F., Efstratiadis A., Lomedico P., Gilbert W., Kolodner R., Dodgson J. The evolution of genes: the chicken preproinsulin gene. Cell. 1980 Jun;20(2):555–566. doi: 10.1016/0092-8674(80)90641-8. [DOI] [PubMed] [Google Scholar]
  55. Proudfoot N. J., Maniatis T. The structure of a human alpha-globin pseudogene and its relationship to alpha-globin gene duplication. Cell. 1980 Sep;21(2):537–544. doi: 10.1016/0092-8674(80)90491-2. [DOI] [PubMed] [Google Scholar]
  56. Sakano H., Hüppi K., Heinrich G., Tonegawa S. Sequences at the somatic recombination sites of immunoglobulin light-chain genes. Nature. 1979 Jul 26;280(5720):288–294. doi: 10.1038/280288a0. [DOI] [PubMed] [Google Scholar]
  57. Sakano H., Rogers J. H., Hüppi K., Brack C., Traunecker A., Maki R., Wall R., Tonegawa S. Domains and the hinge region of an immunoglobulin heavy chain are encoded in separate DNA segments. Nature. 1979 Feb 22;277(5698):627–633. doi: 10.1038/277627a0. [DOI] [PubMed] [Google Scholar]
  58. Schuler M. A., Keller E. B. The chromosomal arrangement of two linked actin genes in the sea urchin S. purpuratus. Nucleic Acids Res. 1981 Feb 11;9(3):591–604. doi: 10.1093/nar/9.3.591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Seidman J. G., Max E. E., Leder P. A kappa-immunoglobulin gene is formed by site-specific recombination without further somatic mutation. Nature. 1979 Aug 2;280(5721):370–375. doi: 10.1038/280370a0. [DOI] [PubMed] [Google Scholar]
  60. Slightom J. L., Blechl A. E., Smithies O. Human fetal G gamma- and A gamma-globin genes: complete nucleotide sequences suggest that DNA can be exchanged between these duplicated genes. Cell. 1980 Oct;21(3):627–638. doi: 10.1016/0092-8674(80)90426-2. [DOI] [PubMed] [Google Scholar]
  61. Smith G. P. Evolution of repeated DNA sequences by unequal crossover. Science. 1976 Feb 13;191(4227):528–535. doi: 10.1126/science.1251186. [DOI] [PubMed] [Google Scholar]
  62. Snyder M., Hirsh J., Davidson N. The cuticle genes of drosophila: a developmentally regulated gene cluster. Cell. 1981 Jul;25(1):165–177. doi: 10.1016/0092-8674(81)90241-5. [DOI] [PubMed] [Google Scholar]
  63. Spritz R. A., DeRiel J. K., Forget B. G., Weissman S. M. Complete nucleotide sequence of the human delta-globin gene. Cell. 1980 Oct;21(3):639–646. doi: 10.1016/0092-8674(80)90427-4. [DOI] [PubMed] [Google Scholar]
  64. Stein J. P., Catterall J. F., Kristo P., Means A. R., O'Malley B. W. Ovomucoid intervening sequences specify functional domains and generate protein polymorphism. Cell. 1980 Oct;21(3):681–687. doi: 10.1016/0092-8674(80)90431-6. [DOI] [PubMed] [Google Scholar]
  65. Steinmetz M., Moore K. W., Frelinger J. G., Sher B. T., Shen F. W., Boyse E. A., Hood L. A pseudogene homologous to mouse transplantation antigens: transplantation antigens are encoded by eight exons that correlate with protein domains. Cell. 1981 Sep;25(3):683–692. doi: 10.1016/0092-8674(81)90175-6. [DOI] [PubMed] [Google Scholar]
  66. Stephenson E. C., Erba H. P., Gall J. G. Characterization of a cloned histone gene cluster of the newt Notophthalamus viridescens. Nucleic Acids Res. 1981 May 25;9(10):2281–2295. doi: 10.1093/nar/9.10.2281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Tsujimoto Y., Suzuki Y. The DNA sequence of Bombyx mori fibroin gene including the 5' flanking, mRNA coding, entire intervening and fibroin protein coding regions. Cell. 1979 Oct;18(2):591–600. doi: 10.1016/0092-8674(79)90075-8. [DOI] [PubMed] [Google Scholar]
  68. Ullrich A., Dull T. J., Gray A., Brosius J., Sures I. Genetic variation in the human insulin gene. Science. 1980 Aug 1;209(4456):612–615. doi: 10.1126/science.6248962. [DOI] [PubMed] [Google Scholar]
  69. Villeponteau B., Martinson H. Isolation and characterization of the complete chicken beta-globin gene region: frequent deletion of the adult beta-globin genes in lambda. Nucleic Acids Res. 1981 Aug 11;9(15):3731–3746. doi: 10.1093/nar/9.15.3731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Wahli W., Dawid I. B., Wyler T., Weber R., Ryffel G. U. Comparative analysis of the structural organization of two closely related vitellogenin genes in X. laevis. Cell. 1980 May;20(1):107–117. doi: 10.1016/0092-8674(80)90239-1. [DOI] [PubMed] [Google Scholar]
  71. Wickens M. P., Woo S., O'Malley B. W., Gurdon J. B. Expression of a chicken chromosomal ovalbumin gene injected into frog oocyte nuclei. Nature. 1980 Jun 26;285(5767):628–634. doi: 10.1038/285628a0. [DOI] [PubMed] [Google Scholar]
  72. Wozney J., Hanahan D., Tate V., Boedtker H., Doty P. Structure of the pro alpha 2 (I) collagen gene. Nature. 1981 Nov 12;294(5837):129–135. doi: 10.1038/294129a0. [DOI] [PubMed] [Google Scholar]
  73. Young R. A., Hagenbüchle O., Schibler U. A single mouse alpha-amylase gene specifies two different tissue-specific mRNAs. Cell. 1981 Feb;23(2):451–458. doi: 10.1016/0092-8674(81)90140-9. [DOI] [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES