Skip to main content
The EMBO Journal logoLink to The EMBO Journal
. 1986 Dec 20;5(13):3583–3589. doi: 10.1002/j.1460-2075.1986.tb04686.x

Sequence conservation in the protein coding and intron regions of the engrailed transcription unit.

J A Kassis, S J Poole, D K Wright, P H O'Farrell
PMCID: PMC1167397  PMID: 2881781

Abstract

Engrailed (en) is a gene involved in proper segmentation of the Drosophila embryo. The predicted en protein contains a homeodomain and regions rich in polyalanine, polyglutamine, polyglutamate/aspartate and serine. We have taken an evolutionary approach to define which regions may be of fundamental importance by examining the D. virilis genomic sequence homologous to the D. melanogaster en primary transcription unit. Sequence homology begins at the first ATG of a long open reading frame yielding proteins of 584 and 552 amino acids for the D. virilis and D. melanogaster proteins, respectively. The predicted amino acid sequence can be divided into conserved and non-conserved domains. The C-terminal 30% of the protein (which includes the homeodomain) is completely conserved. In the N-terminal 70% of the protein, the overall conservation is 71%, but non-conservative amino acid changes occur in clusters and there are short stretches of highly conserved sequence. A region rich in glutamate and aspartate is conserved and has homology to an 18-amino acid sequence present in members of the myc family of proteins. Major differences in the size of the two proteins occur in regions of non-conserved repeated sequences. In the introns of the engrailed transcription units there are long stretches of conservation, suggesting this DNA may be of functional importance.

Full text

PDF
3583

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Banerji J., Olson L., Schaffner W. A lymphocyte-specific cellular enhancer is located downstream of the joining region in immunoglobulin heavy chain genes. Cell. 1983 Jul;33(3):729–740. doi: 10.1016/0092-8674(83)90015-6. [DOI] [PubMed] [Google Scholar]
  2. Beachy P. A., Helfand S. L., Hogness D. S. Segmental distribution of bithorax complex proteins during Drosophila development. Nature. 1985 Feb 14;313(6003):545–551. doi: 10.1038/313545a0. [DOI] [PubMed] [Google Scholar]
  3. Beverley S. M., Wilson A. C. Molecular evolution in Drosophila and the higher Diptera II. A time scale for fly evolution. J Mol Evol. 1984;21(1):1–13. doi: 10.1007/BF02100622. [DOI] [PubMed] [Google Scholar]
  4. Bourne H. R. GTP-binding proteins. One molecular machine can transduce diverse signals. 1986 Jun 26-Jul 2Nature. 321(6073):814–816. doi: 10.1038/321814a0. [DOI] [PubMed] [Google Scholar]
  5. Brent R., Ptashne M. A eukaryotic transcriptional activator bearing the DNA specificity of a prokaryotic repressor. Cell. 1985 Dec;43(3 Pt 2):729–736. doi: 10.1016/0092-8674(85)90246-6. [DOI] [PubMed] [Google Scholar]
  6. Carrasco A. E., McGinnis W., Gehring W. J., De Robertis E. M. Cloning of an X. laevis gene expressed during early embryogenesis coding for a peptide region homologous to Drosophila homeotic genes. Cell. 1984 Jun;37(2):409–414. doi: 10.1016/0092-8674(84)90371-4. [DOI] [PubMed] [Google Scholar]
  7. Carroll S. B., Scott M. P. Localization of the fushi tarazu protein during Drosophila embryogenesis. Cell. 1985 Nov;43(1):47–57. doi: 10.1016/0092-8674(85)90011-x. [DOI] [PubMed] [Google Scholar]
  8. Colberg-Poley A. M., Voss S. D., Chowdhury K., Stewart C. L., Wagner E. F., Gruss P. Clustered homeo boxes are differentially expressed during murine development. Cell. 1985 Nov;43(1):39–45. doi: 10.1016/0092-8674(85)90010-8. [DOI] [PubMed] [Google Scholar]
  9. Coussens L., Parker P. J., Rhee L., Yang-Feng T. L., Chen E., Waterfield M. D., Francke U., Ullrich A. Multiple, distinct forms of bovine and human protein kinase C suggest diversity in cellular signaling pathways. Science. 1986 Aug 22;233(4766):859–866. doi: 10.1126/science.3755548. [DOI] [PubMed] [Google Scholar]
  10. Dente L., Cesareni G., Cortese R. pEMBL: a new family of single stranded plasmids. Nucleic Acids Res. 1983 Mar 25;11(6):1645–1655. doi: 10.1093/nar/11.6.1645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Desplan C., Theis J., O'Farrell P. H. The Drosophila developmental gene, engrailed, encodes a sequence-specific DNA binding activity. Nature. 1985 Dec 19;318(6047):630–635. doi: 10.1038/318630a0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. DiNardo S., Kuner J. M., Theis J., O'Farrell P. H. Development of embryonic pattern in D. melanogaster as revealed by accumulation of the nuclear engrailed protein. Cell. 1985 Nov;43(1):59–69. doi: 10.1016/0092-8674(85)90012-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Efstratiadis A., Posakony J. W., Maniatis T., Lawn R. M., O'Connell C., Spritz R. A., DeRiel J. K., Forget B. G., Weissman S. M., Slightom J. L. The structure and evolution of the human beta-globin gene family. Cell. 1980 Oct;21(3):653–668. doi: 10.1016/0092-8674(80)90429-8. [DOI] [PubMed] [Google Scholar]
  14. Fjose A., McGinnis W. J., Gehring W. J. Isolation of a homoeo box-containing gene from the engrailed region of Drosophila and the spatial distribution of its transcripts. Nature. 1985 Jan 24;313(6000):284–289. doi: 10.1038/313284a0. [DOI] [PubMed] [Google Scholar]
  15. Gillies S. D., Morrison S. L., Oi V. T., Tonegawa S. A tissue-specific transcription enhancer element is located in the major intron of a rearranged immunoglobulin heavy chain gene. Cell. 1983 Jul;33(3):717–728. doi: 10.1016/0092-8674(83)90014-4. [DOI] [PubMed] [Google Scholar]
  16. Greenwald I. lin-12, a nematode homeotic gene, is homologous to a set of mammalian proteins that includes epidermal growth factor. Cell. 1985 Dec;43(3 Pt 2):583–590. doi: 10.1016/0092-8674(85)90230-2. [DOI] [PubMed] [Google Scholar]
  17. Hart C. P., Awgulewitsch A., Fainsod A., McGinnis W., Ruddle F. H. Homeo box gene complex on mouse chromosome 11: molecular cloning, expression in embryogenesis, and homology to a human homeo box locus. Cell. 1985 Nov;43(1):9–18. doi: 10.1016/0092-8674(85)90007-8. [DOI] [PubMed] [Google Scholar]
  18. Hauser C. A., Joyner A. L., Klein R. D., Learned T. K., Martin G. R., Tjian R. Expression of homologous homeo-box-containing genes in differentiated human teratocarcinoma cells and mouse embryos. Cell. 1985 Nov;43(1):19–28. doi: 10.1016/0092-8674(85)90008-x. [DOI] [PubMed] [Google Scholar]
  19. Hayashida H., Miyata T. Unusual evolutionary conservation and frequent DNA segment exchange in class I genes of the major histocompatibility complex. Proc Natl Acad Sci U S A. 1983 May;80(9):2671–2675. doi: 10.1073/pnas.80.9.2671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Henikoff S. Unidirectional digestion with exonuclease III creates targeted breakpoints for DNA sequencing. Gene. 1984 Jun;28(3):351–359. doi: 10.1016/0378-1119(84)90153-7. [DOI] [PubMed] [Google Scholar]
  21. Hieter P. A., Max E. E., Seidman J. G., Maizel J. V., Jr, Leder P. Cloned human and mouse kappa immunoglobulin constant and J region genes conserve homology in functional segments. Cell. 1980 Nov;22(1 Pt 1):197–207. doi: 10.1016/0092-8674(80)90168-3. [DOI] [PubMed] [Google Scholar]
  22. Hollenberg S. M., Weinberger C., Ong E. S., Cerelli G., Oro A., Lebo R., Thompson E. B., Rosenfeld M. G., Evans R. M. Primary structure and expression of a functional human glucocorticoid receptor cDNA. Nature. 1985 Dec 19;318(6047):635–641. doi: 10.1038/318635a0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hong G. F. A systemic DNA sequencing strategy. J Mol Biol. 1982 Jul 5;158(3):539–549. doi: 10.1016/0022-2836(82)90213-3. [DOI] [PubMed] [Google Scholar]
  24. Jones C. W., Kafatos F. C. Accepted mutations in a gene family: evolutionary diversification of duplicated DNA. J Mol Evol. 1982;19(1):87–103. doi: 10.1007/BF02100227. [DOI] [PubMed] [Google Scholar]
  25. Joyner A. L., Kornberg T., Coleman K. G., Cox D. R., Martin G. R. Expression during embryogenesis of a mouse gene with sequence homology to the Drosophila engrailed gene. Cell. 1985 Nov;43(1):29–37. doi: 10.1016/0092-8674(85)90009-1. [DOI] [PubMed] [Google Scholar]
  26. Kashima N., Nishi-Takaoka C., Fujita T., Taki S., Yamada G., Hamuro J., Taniguchi T. Unique structure of murine interleukin-2 as deduced from cloned cDNAs. 1985 Jan 31-Feb 6Nature. 313(6001):402–404. doi: 10.1038/313402a0. [DOI] [PubMed] [Google Scholar]
  27. Kassis J. A., Wong M. L., O'Farrell P. H. Electron microscopic heteroduplex mapping identifies regions of the engrailed locus that are conserved between Drosophila melanogaster and Drosophila virilis. Mol Cell Biol. 1985 Dec;5(12):3600–3609. doi: 10.1128/mcb.5.12.3600. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kidd S., Lockett T. J., Young M. W. The Notch locus of Drosophila melanogaster. Cell. 1983 Sep;34(2):421–433. doi: 10.1016/0092-8674(83)90376-8. [DOI] [PubMed] [Google Scholar]
  29. Kornberg T. Compartments in the abdomen of Drosophila and the role of the engrailed locus. Dev Biol. 1981 Sep;86(2):363–372. doi: 10.1016/0012-1606(81)90194-9. [DOI] [PubMed] [Google Scholar]
  30. Kornberg T. Engrailed: a gene controlling compartment and segment formation in Drosophila. Proc Natl Acad Sci U S A. 1981 Feb;78(2):1095–1099. doi: 10.1073/pnas.78.2.1095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kornberg T., Sidén I., O'Farrell P., Simon M. The engrailed locus of Drosophila: in situ localization of transcripts reveals compartment-specific expression. Cell. 1985 Jan;40(1):45–53. doi: 10.1016/0092-8674(85)90307-1. [DOI] [PubMed] [Google Scholar]
  32. Laughon A., Carroll S. B., Storfer F. A., Riley P. D., Scott M. P. Common properties of proteins encoded by the Antennapedia complex genes of Drosophila melanogaster. Cold Spring Harb Symp Quant Biol. 1985;50:253–262. doi: 10.1101/sqb.1985.050.01.032. [DOI] [PubMed] [Google Scholar]
  33. Laughon A., Scott M. P. Sequence of a Drosophila segmentation gene: protein structure homology with DNA-binding proteins. Nature. 1984 Jul 5;310(5972):25–31. doi: 10.1038/310025a0. [DOI] [PubMed] [Google Scholar]
  34. Lawrence P. A., Morata G. Compartments in the wing of Drosophila: a study of the engrailed gene. Dev Biol. 1976 Jun;50(2):321–337. doi: 10.1016/0012-1606(76)90155-x. [DOI] [PubMed] [Google Scholar]
  35. Lawrence P. A., Struhl G. Further studies of the engrailed phenotype in Drosophila. EMBO J. 1982;1(7):827–833. doi: 10.1002/j.1460-2075.1982.tb01255.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Lipman D. J., Pearson W. R. Rapid and sensitive protein similarity searches. Science. 1985 Mar 22;227(4693):1435–1441. doi: 10.1126/science.2983426. [DOI] [PubMed] [Google Scholar]
  37. McGinnis W., Garber R. L., Wirz J., Kuroiwa A., Gehring W. J. A homologous protein-coding sequence in Drosophila homeotic genes and its conservation in other metazoans. Cell. 1984 Jun;37(2):403–408. doi: 10.1016/0092-8674(84)90370-2. [DOI] [PubMed] [Google Scholar]
  38. McGinnis W., Levine M. S., Hafen E., Kuroiwa A., Gehring W. J. A conserved DNA sequence in homoeotic genes of the Drosophila Antennapedia and bithorax complexes. 1984 Mar 29-Apr 4Nature. 308(5958):428–433. doi: 10.1038/308428a0. [DOI] [PubMed] [Google Scholar]
  39. Miesfeld R., Rusconi S., Godowski P. J., Maler B. A., Okret S., Wikström A. C., Gustafsson J. A., Yamamoto K. R. Genetic complementation of a glucocorticoid receptor deficiency by expression of cloned receptor cDNA. Cell. 1986 Aug 1;46(3):389–399. doi: 10.1016/0092-8674(86)90659-8. [DOI] [PubMed] [Google Scholar]
  40. Mount S. M. A catalogue of splice junction sequences. Nucleic Acids Res. 1982 Jan 22;10(2):459–472. doi: 10.1093/nar/10.2.459. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Nordheim A., Rich A. Negatively supercoiled simian virus 40 DNA contains Z-DNA segments within transcriptional enhancer sequences. Nature. 1983 Jun 23;303(5919):674–679. doi: 10.1038/303674a0. [DOI] [PubMed] [Google Scholar]
  42. Norrander J., Kempe T., Messing J. Construction of improved M13 vectors using oligodeoxynucleotide-directed mutagenesis. Gene. 1983 Dec;26(1):101–106. doi: 10.1016/0378-1119(83)90040-9. [DOI] [PubMed] [Google Scholar]
  43. Parker P. J., Coussens L., Totty N., Rhee L., Young S., Chen E., Stabel S., Waterfield M. D., Ullrich A. The complete primary structure of protein kinase C--the major phorbol ester receptor. Science. 1986 Aug 22;233(4766):853–859. doi: 10.1126/science.3755547. [DOI] [PubMed] [Google Scholar]
  44. Perler F., Efstratiadis A., Lomedico P., Gilbert W., Kolodner R., Dodgson J. The evolution of genes: the chicken preproinsulin gene. Cell. 1980 Jun;20(2):555–566. doi: 10.1016/0092-8674(80)90641-8. [DOI] [PubMed] [Google Scholar]
  45. Poole S. J., Kauvar L. M., Drees B., Kornberg T. The engrailed locus of Drosophila: structural analysis of an embryonic transcript. Cell. 1985 Jan;40(1):37–43. doi: 10.1016/0092-8674(85)90306-x. [DOI] [PubMed] [Google Scholar]
  46. Reddy E. P., Reynolds R. K., Watson D. K., Schultz R. A., Lautenberger J., Papas T. S. Nucleotide sequence analysis of the proviral genome of avian myelocytomatosis virus (MC29). Proc Natl Acad Sci U S A. 1983 May;80(9):2500–2504. doi: 10.1073/pnas.80.9.2500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Sanger F., Nicklen S., Coulson A. R. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A. 1977 Dec;74(12):5463–5467. doi: 10.1073/pnas.74.12.5463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Scott M. P., Weiner A. J. Structural relationships among genes that control development: sequence homology between the Antennapedia, Ultrabithorax, and fushi tarazu loci of Drosophila. Proc Natl Acad Sci U S A. 1984 Jul;81(13):4115–4119. doi: 10.1073/pnas.81.13.4115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Shepherd J. C., McGinnis W., Carrasco A. E., De Robertis E. M., Gehring W. J. Fly and frog homoeo domains show homologies with yeast mating type regulatory proteins. Nature. 1984 Jul 5;310(5972):70–71. doi: 10.1038/310070a0. [DOI] [PubMed] [Google Scholar]
  50. Slater E. P., Rabenau O., Karin M., Baxter J. D., Beato M. Glucocorticoid receptor binding and activation of a heterologous promoter by dexamethasone by the first intron of the human growth hormone gene. Mol Cell Biol. 1985 Nov;5(11):2984–2992. doi: 10.1128/mcb.5.11.2984. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Stanton L. W., Schwab M., Bishop J. M. Nucleotide sequence of the human N-myc gene. Proc Natl Acad Sci U S A. 1986 Mar;83(6):1772–1776. doi: 10.1073/pnas.83.6.1772. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Tautz D., Trick M., Dover G. A. Cryptic simplicity in DNA is a major source of genetic variation. Nature. 1986 Aug 14;322(6080):652–656. doi: 10.1038/322652a0. [DOI] [PubMed] [Google Scholar]
  53. Taya Y., Mizusawa S., Nishimura S. Nucleotide sequence of the coding region of the mouse N-myc gene. EMBO J. 1986 Jun;5(6):1215–1219. doi: 10.1002/j.1460-2075.1986.tb04349.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Van Beneden R. J., Watson D. K., Chen T. T., Lautenberger J. A., Papas T. S. Cellular myc (c-myc) in fish (rainbow trout): its relationship to other vertebrate myc genes and to the transforming genes of the MC29 family of viruses. Proc Natl Acad Sci U S A. 1986 Jun;83(11):3698–3702. doi: 10.1073/pnas.83.11.3698. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Watson D. K., Psallidopoulos M. C., Samuel K. P., Dalla-Favera R., Papas T. S. Nucleotide sequence analysis of human c-myc locus, chicken homologue, and myelocytomatosis virus MC29 transforming gene reveals a highly conserved gene product. Proc Natl Acad Sci U S A. 1983 Jun;80(12):3642–3645. doi: 10.1073/pnas.80.12.3642. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Watson D. K., Reddy E. P., Duesberg P. H., Papas T. S. Nucleotide sequence analysis of the chicken c-myc gene reveals homologous and unique coding regions by comparison with the transforming gene of avian myelocytomatosis virus MC29, delta gag-myc. Proc Natl Acad Sci U S A. 1983 Apr;80(8):2146–2150. doi: 10.1073/pnas.80.8.2146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Weir M. P., Kornberg T. Patterns of engrailed and fushi tarazu transcripts reveal novel intermediate stages in Drosophila segmentation. Nature. 1985 Dec 5;318(6045):433–439. doi: 10.1038/318433a0. [DOI] [PubMed] [Google Scholar]
  58. Wharton K. A., Johansen K. M., Xu T., Artavanis-Tsakonas S. Nucleotide sequence from the neurogenic locus notch implies a gene product that shares homology with proteins containing EGF-like repeats. Cell. 1985 Dec;43(3 Pt 2):567–581. doi: 10.1016/0092-8674(85)90229-6. [DOI] [PubMed] [Google Scholar]
  59. Wharton K. A., Yedvobnick B., Finnerty V. G., Artavanis-Tsakonas S. opa: a novel family of transcribed repeats shared by the Notch locus and other developmentally regulated loci in D. melanogaster. Cell. 1985 Jan;40(1):55–62. doi: 10.1016/0092-8674(85)90308-3. [DOI] [PubMed] [Google Scholar]
  60. White R. A., Wilcox M. Protein products of the bithorax complex in Drosophila. Cell. 1984 Nov;39(1):163–171. doi: 10.1016/0092-8674(84)90202-2. [DOI] [PubMed] [Google Scholar]
  61. Wilson A. C., Carlson S. S., White T. J. Biochemical evolution. Annu Rev Biochem. 1977;46:573–639. doi: 10.1146/annurev.bi.46.070177.003041. [DOI] [PubMed] [Google Scholar]

Articles from The EMBO Journal are provided here courtesy of Nature Publishing Group

RESOURCES