Abstract
We have characterized the approximately 6.5-kilobase cytoplasmic poly(A)+ Line-1 (L1) RNA present in a human teratocarcinoma cell line, NTera2D1, by primer extension and by analysis of cloned cDNAs. The bulk of the RNA begins (5' end) at the residue previously identified as the 5' terminus of the longest known primate genomic L1 elements, presumed to represent "unit" length. Several of the cDNA clones are close to 6 kilobase pairs, that is, close to full length. The partial sequences of 18 cDNA clones and full sequence of one (5,975 base pairs) indicate that many different genomic L1 elements contribute transcripts to the 6.5-kilobase cytoplasmic poly(A)+ RNA in NTera2D1 cells because no 2 of the 19 cDNAs analyzed had identical sequences. The transcribed elements appear to represent a subset of the total genomic L1s, a subset that has a characteristic consensus sequence in the 3' noncoding region and a high degree of sequence conservation throughout. Two open reading frames (ORFs) of 1,122 (ORF1) and 3,852 (ORF2) bases, flanked by about 800 and 200 bases of sequence at the 5' and 3' ends, respectively, can be identified in the cDNAs. Both ORFs are in the same frame, and they are separated by 33 bases bracketed by two conserved in-frame stop codons. ORF 2 is interrupted by at least one randomly positioned stop codon in the majority of the cDNAs. The data support proposals suggesting that the human L1 family includes one or more functional genes as well as an extraordinarily large number of pseudogenes whose ORFs are broken by stop codons. The cDNA structures suggest that both genes and pseudogenes are transcribed. At least one of the cDNAs (cD11), which was sequenced in its entirety, could, in principle, represent an mRNA for production of the ORF1 polypeptide. The similarity of mammalian L1s to several recently described invertebrate movable elements defines a new widely distributed class of elements which we term class II retrotransposons.
Full text
PDFImages in this article
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Adams J. W., Kaufman R. E., Kretschmer P. J., Harrison M., Nienhuis A. W. A family of long reiterated DNA sequences, one copy of which is next to the human beta globin gene. Nucleic Acids Res. 1980 Dec 20;8(24):6113–6128. doi: 10.1093/nar/8.24.6113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Andrews P. W., Damjanov I., Simon D., Banting G. S., Carlin C., Dracopoli N. C., Føgh J. Pluripotent embryonal carcinoma clones derived from the human teratocarcinoma cell line Tera-2. Differentiation in vivo and in vitro. Lab Invest. 1984 Feb;50(2):147–162. [PubMed] [Google Scholar]
- Bennett K. L., Hastie N. D. Looking for relationships between the most repeated dispersed DNA sequences in the mouse: small R elements are found associated consistently with long MIF repeats. EMBO J. 1984 Feb;3(2):467–472. doi: 10.1002/j.1460-2075.1984.tb01829.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benton W. D., Davis R. W. Screening lambdagt recombinant clones by hybridization to single plaques in situ. Science. 1977 Apr 8;196(4286):180–182. doi: 10.1126/science.322279. [DOI] [PubMed] [Google Scholar]
- Bernstein L. B., Manser T., Weiner A. M. Human U1 small nuclear RNA genes: extensive conservation of flanking sequences suggests cycles of gene amplification and transposition. Mol Cell Biol. 1985 Sep;5(9):2159–2171. doi: 10.1128/mcb.5.9.2159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bird A. P. CpG-rich islands and the function of DNA methylation. Nature. 1986 May 15;321(6067):209–213. doi: 10.1038/321209a0. [DOI] [PubMed] [Google Scholar]
- Birnstiel M. L., Busslinger M., Strub K. Transcription termination and 3' processing: the end is in site! Cell. 1985 Jun;41(2):349–359. doi: 10.1016/s0092-8674(85)80007-6. [DOI] [PubMed] [Google Scholar]
- Burke W. D., Calalang C. C., Eickbush T. H. The site-specific ribosomal insertion element type II of Bombyx mori (R2Bm) contains the coding sequence for a reverse transcriptase-like enzyme. Mol Cell Biol. 1987 Jun;7(6):2221–2230. doi: 10.1128/mcb.7.6.2221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burton F. H., Loeb D. D., Chao S. F., Hutchison C. A., 3rd, Edgell M. H. Transposition of a long member of the L1 major interspersed DNA family into the mouse beta globin gene locus. Nucleic Acids Res. 1985 Jul 25;13(14):5071–5084. doi: 10.1093/nar/13.14.5071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burton F. H., Loeb D. D., Voliva C. F., Martin S. L., Edgell M. H., Hutchison C. A., 3rd Conservation throughout mammalia and extensive protein-encoding capacity of the highly repeated DNA long interspersed sequence one. J Mol Biol. 1986 Jan 20;187(2):291–304. doi: 10.1016/0022-2836(86)90235-4. [DOI] [PubMed] [Google Scholar]
- D'Ambrosio E., Waitzkin S. D., Witney F. R., Salemme A., Furano A. V. Structure of the highly repeated, long interspersed DNA family (LINE or L1Rn) of the rat. Mol Cell Biol. 1986 Feb;6(2):411–424. doi: 10.1128/mcb.6.2.411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Demers G. W., Brech K., Hardison R. C. Long interspersed L1 repeats in rabbit DNA are homologous to L1 repeats of rodents and primates in an open-reading-frame region. Mol Biol Evol. 1986 May;3(3):179–190. doi: 10.1093/oxfordjournals.molbev.a040390. [DOI] [PubMed] [Google Scholar]
- Di Nocera P. P., Digan M. E., Dawid I. B. A family of oligo-adenylate-terminated transposable sequences in Drosophila melanogaster. J Mol Biol. 1983 Aug 25;168(4):715–727. doi: 10.1016/s0022-2836(83)80071-0. [DOI] [PubMed] [Google Scholar]
- DiGiovanni L., Haynes S. R., Misra R., Jelinek W. R. Kpn I family of long-dispersed repeated DNA sequences of man: evidence for entry into genomic DNA of DNA copies of poly(A)-terminated Kpn I RNAs. Proc Natl Acad Sci U S A. 1983 Nov;80(21):6533–6537. doi: 10.1073/pnas.80.21.6533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doerfler W. DNA methylation and gene activity. Annu Rev Biochem. 1983;52:93–124. doi: 10.1146/annurev.bi.52.070183.000521. [DOI] [PubMed] [Google Scholar]
- Dudley J. P. Discrete high molecular weight RNA transcribed from the long interspersed repetitive element L1Md. Nucleic Acids Res. 1987 Mar 25;15(6):2581–2592. doi: 10.1093/nar/15.6.2581. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Enders G. H., Ganem D., Varmus H. Mapping the major transcripts of ground squirrel hepatitis virus: the presumptive template for reverse transcriptase is terminally redundant. Cell. 1985 Aug;42(1):297–308. doi: 10.1016/s0092-8674(85)80125-2. [DOI] [PubMed] [Google Scholar]
- Fanning T. G. Size and structure of the highly repetitive BAM HI element in mice. Nucleic Acids Res. 1983 Aug 11;11(15):5073–5091. doi: 10.1093/nar/11.15.5073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fanning T., Singer M. The LINE-1 DNA sequences in four mammalian orders predict proteins that conserve homologies to retrovirus proteins. Nucleic Acids Res. 1987 Mar 11;15(5):2251–2260. doi: 10.1093/nar/15.5.2251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fawcett D. H., Lister C. K., Kellett E., Finnegan D. J. Transposable elements controlling I-R hybrid dysgenesis in D. melanogaster are similar to mammalian LINEs. Cell. 1986 Dec 26;47(6):1007–1015. doi: 10.1016/0092-8674(86)90815-9. [DOI] [PubMed] [Google Scholar]
- Grimaldi G., Skowronski J., Singer M. F. Defining the beginning and end of KpnI family segments. EMBO J. 1984 Aug;3(8):1753–1759. doi: 10.1002/j.1460-2075.1984.tb02042.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hattori M., Hidaka S., Sakaki Y. Sequence analysis of a KpnI family member near the 3' end of human beta-globin gene. Nucleic Acids Res. 1985 Nov 11;13(21):7813–7827. doi: 10.1093/nar/13.21.7813. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hattori M., Kuhara S., Takenaka O., Sakaki Y. L1 family of repetitive DNA sequences in primates may be derived from a sequence encoding a reverse transcriptase-related protein. Nature. 1986 Jun 5;321(6070):625–628. doi: 10.1038/321625a0. [DOI] [PubMed] [Google Scholar]
- Hattori M., Sakaki Y. Dideoxy sequencing method using denatured plasmid templates. Anal Biochem. 1986 Feb 1;152(2):232–238. doi: 10.1016/0003-2697(86)90403-3. [DOI] [PubMed] [Google Scholar]
- Hollis G. F., Hieter P. A., McBride O. W., Swan D., Leder P. Processed genes: a dispersed human immunoglobulin gene bearing evidence of RNA-type processing. Nature. 1982 Mar 25;296(5855):321–325. doi: 10.1038/296321a0. [DOI] [PubMed] [Google Scholar]
- Jacks T., Varmus H. E. Expression of the Rous sarcoma virus pol gene by ribosomal frameshifting. Science. 1985 Dec 13;230(4731):1237–1242. doi: 10.1126/science.2416054. [DOI] [PubMed] [Google Scholar]
- Jackson M., Heller D., Leinwand L. Transcriptional measurements of mouse repeated DNA sequences. Nucleic Acids Res. 1985 May 10;13(9):3389–3403. doi: 10.1093/nar/13.9.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kimmel B. E., ole-MoiYoi O. K., Young J. R. Ingi, a 5.2-kb dispersed sequence element from Trypanosoma brucei that carries half of a smaller mobile element at either end and has homology with mammalian LINEs. Mol Cell Biol. 1987 Apr;7(4):1465–1475. doi: 10.1128/mcb.7.4.1465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kole L. B., Haynes S. R., Jelinek W. R. Discrete and heterogeneous high molecular weight RNAs complementary to a long dispersed repeat family (a possible transposon) of human DNA. J Mol Biol. 1983 Apr 5;165(2):257–286. doi: 10.1016/s0022-2836(83)80257-5. [DOI] [PubMed] [Google Scholar]
- Kozak M. Bifunctional messenger RNAs in eukaryotes. Cell. 1986 Nov 21;47(4):481–483. doi: 10.1016/0092-8674(86)90609-4. [DOI] [PubMed] [Google Scholar]
- Lakshmikumaran M. S., D'Ambrosio E., Laimins L. A., Lin D. T., Furano A. V. Long interspersed repeated DNA (LINE) causes polymorphism at the rat insulin 1 locus. Mol Cell Biol. 1985 Sep;5(9):2197–2203. doi: 10.1128/mcb.5.9.2197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee T. N., Singer M. F. Analysis of LINE-1 family sequences on a single monkey chromosome. Nucleic Acids Res. 1986 May 12;14(9):3859–3870. doi: 10.1093/nar/14.9.3859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lerman M. I., Thayer R. E., Singer M. F. Kpn I family of long interspersed repeated DNA sequences in primates: polymorphism of family members and evidence for transcription. Proc Natl Acad Sci U S A. 1983 Jul;80(13):3966–3970. doi: 10.1073/pnas.80.13.3966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loeb D. D., Padgett R. W., Hardies S. C., Shehee W. R., Comer M. B., Edgell M. H., Hutchison C. A., 3rd The sequence of a large L1Md element reveals a tandemly repeated 5' end and several features found in retrotransposons. Mol Cell Biol. 1986 Jan;6(1):168–182. doi: 10.1128/mcb.6.1.168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin S. L., Voliva C. F., Burton F. H., Edgell M. H., Hutchison C. A., 3rd A large interspersed repeat found in mouse DNA contains a long open reading frame that evolves as if it encodes a protein. Proc Natl Acad Sci U S A. 1984 Apr;81(8):2308–2312. doi: 10.1073/pnas.81.8.2308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maxam A. M., Gilbert W. Sequencing end-labeled DNA with base-specific chemical cleavages. Methods Enzymol. 1980;65(1):499–560. doi: 10.1016/s0076-6879(80)65059-9. [DOI] [PubMed] [Google Scholar]
- Miyake T., Migita K., Sakaki Y. Some KpnI family members are associated with the Alu family in the human genome. Nucleic Acids Res. 1983 Oct 11;11(19):6837–6846. doi: 10.1093/nar/11.19.6837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morzycka-Wroblewska E., Selker E. U., Stevens J. N., Metzenberg R. L. Concerted evolution of dispersed Neurospora crassa 5S RNA genes: pattern of sequence conservation between allelic and nonallelic genes. Mol Cell Biol. 1985 Jan;5(1):46–51. doi: 10.1128/mcb.5.1.46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mottez E., Rogan P. K., Manuelidis L. Conservation in the 5' region of the long interspersed mouse L1 repeat: implications of comparative sequence analysis. Nucleic Acids Res. 1986 Apr 11;14(7):3119–3136. doi: 10.1093/nar/14.7.3119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nomiyama H., Obaru K., Jinno Y., Matsuda I., Shimada K., Miyata T. Amplification of human argininosuccinate synthetase pseudogenes. J Mol Biol. 1986 Nov 20;192(2):221–233. doi: 10.1016/0022-2836(86)90361-x. [DOI] [PubMed] [Google Scholar]
- Norrander J., Kempe T., Messing J. Construction of improved M13 vectors using oligodeoxynucleotide-directed mutagenesis. Gene. 1983 Dec;26(1):101–106. doi: 10.1016/0378-1119(83)90040-9. [DOI] [PubMed] [Google Scholar]
- Paterson B. M., Eldridge J. D. alpha-Cardiac actin is the major sarcomeric isoform expressed in embryonic avian skeletal muscle. Science. 1984 Jun 29;224(4656):1436–1438. doi: 10.1126/science.6729461. [DOI] [PubMed] [Google Scholar]
- Peabody D. S., Berg P. Termination-reinitiation occurs in the translation of mammalian cell mRNAs. Mol Cell Biol. 1986 Jul;6(7):2695–2703. doi: 10.1128/mcb.6.7.2695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peabody D. S., Subramani S., Berg P. Effect of upstream reading frames on translation efficiency in simian virus 40 recombinants. Mol Cell Biol. 1986 Jul;6(7):2704–2711. doi: 10.1128/mcb.6.7.2704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Persico M. G., Viglietto G., Martini G., Toniolo D., Paonessa G., Moscatelli C., Dono R., Vulliamy T., Luzzatto L., D'Urso M. Isolation of human glucose-6-phosphate dehydrogenase (G6PD) cDNA clones: primary structure of the protein and unusual 5' non-coding region. Nucleic Acids Res. 1986 Mar 25;14(6):2511–2522. doi: 10.1093/nar/14.6.2511. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Potter S. S. Rearranged sequences of a human Kpn I element. Proc Natl Acad Sci U S A. 1984 Feb;81(4):1012–1016. doi: 10.1073/pnas.81.4.1012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rogers J. H. The origin and evolution of retroposons. Int Rev Cytol. 1985;93:187–279. doi: 10.1016/s0074-7696(08)61375-3. [DOI] [PubMed] [Google Scholar]
- Sakaki Y., Hattori M., Fujita A., Yoshioka K., Kuhara S., Takenaka O. The LINE-1 family of primates may encode a reverse transcriptase-like protein. Cold Spring Harb Symp Quant Biol. 1986;51(Pt 1):465–469. doi: 10.1101/sqb.1986.051.01.056. [DOI] [PubMed] [Google Scholar]
- Sanger F., Nicklen S., Coulson A. R. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A. 1977 Dec;74(12):5463–5467. doi: 10.1073/pnas.74.12.5463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmeckpeper B. J., Scott A. F., Smith K. D. Transcripts homologous to a long repeated DNA element in the human genome. J Biol Chem. 1984 Jan 25;259(2):1218–1225. [PubMed] [Google Scholar]
- SenGupta D. N., Zmudzka B. Z., Kumar P., Cobianchi F., Skowronski J., Wilson S. H. Sequence of human DNA polymerase beta mRNA obtained through cDNA cloning. Biochem Biophys Res Commun. 1986 Apr 14;136(1):341–347. doi: 10.1016/0006-291x(86)90916-2. [DOI] [PubMed] [Google Scholar]
- Shafit-Zagardo B., Brown F. L., Zavodny P. J., Maio J. J. Transcription of the KpnI families of long interspersed DNAs in human cells. Nature. 1983 Jul 21;304(5923):277–280. doi: 10.1038/304277a0. [DOI] [PubMed] [Google Scholar]
- Skowronski J., Singer M. F. Expression of a cytoplasmic LINE-1 transcript is regulated in a human teratocarcinoma cell line. Proc Natl Acad Sci U S A. 1985 Sep;82(18):6050–6054. doi: 10.1073/pnas.82.18.6050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Skowronski J., Singer M. F. The abundant LINE-1 family of repeated DNA sequences in mammals: genes and pseudogenes. Cold Spring Harb Symp Quant Biol. 1986;51(Pt 1):457–464. doi: 10.1101/sqb.1986.051.01.055. [DOI] [PubMed] [Google Scholar]
- Soares M. B., Schon E., Efstratiadis A. Rat LINE1: the origin and evolution of a family of long interspersed middle repetitive DNA elements. J Mol Evol. 1985;22(2):117–133. doi: 10.1007/BF02101690. [DOI] [PubMed] [Google Scholar]
- Stark G. R., Wahl G. M. Gene amplification. Annu Rev Biochem. 1984;53:447–491. doi: 10.1146/annurev.bi.53.070184.002311. [DOI] [PubMed] [Google Scholar]
- Sun L., Paulson K. E., Schmid C. W., Kadyk L., Leinwand L. Non-Alu family interspersed repeats in human DNA and their transcriptional activity. Nucleic Acids Res. 1984 Mar 26;12(6):2669–2690. doi: 10.1093/nar/12.6.2669. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Temin H. M. Reverse transcription in the eukaryotic genome: retroviruses, pararetroviruses, retrotransposons, and retrotranscripts. Mol Biol Evol. 1985 Nov;2(6):455–468. doi: 10.1093/oxfordjournals.molbev.a040365. [DOI] [PubMed] [Google Scholar]
- Tuttleman J. S., Pourcel C., Summers J. Formation of the pool of covalently closed circular viral DNA in hepadnavirus-infected cells. Cell. 1986 Nov 7;47(3):451–460. doi: 10.1016/0092-8674(86)90602-1. [DOI] [PubMed] [Google Scholar]
- Voliva C. F., Martin S. L., Hutchison C. A., 3rd, Edgell M. H. Dispersal process associated with the L1 family of interspersed repetitive DNA sequences. J Mol Biol. 1984 Oct 5;178(4):795–813. doi: 10.1016/0022-2836(84)90312-7. [DOI] [PubMed] [Google Scholar]
- Weiner A. M., Deininger P. L., Efstratiadis A. Nonviral retroposons: genes, pseudogenes, and transposable elements generated by the reverse flow of genetic information. Annu Rev Biochem. 1986;55:631–661. doi: 10.1146/annurev.bi.55.070186.003215. [DOI] [PubMed] [Google Scholar]
- Wiedemann L. M., Perry R. P. Characterization of the expressed gene and several processed pseudogenes for the mouse ribosomal protein L30 gene family. Mol Cell Biol. 1984 Nov;4(11):2518–2528. doi: 10.1128/mcb.4.11.2518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilson M. C., Sawicki S. G., White P. A., Darnell J. E., Jr A correlation between the rate of poly(A) shortening and half-life of messenger RNA in adenovirus transformed cells. J Mol Biol. 1978 Nov 25;126(1):23–36. doi: 10.1016/0022-2836(78)90277-2. [DOI] [PubMed] [Google Scholar]
- Witney F. R., Furano A. V. Highly repeated DNA families in the rat. J Biol Chem. 1984 Aug 25;259(16):10481–10492. [PubMed] [Google Scholar]
- Wolf S. F., Migeon B. R. Clusters of CpG dinucleotides implicated by nuclease hypersensitivity as control elements of housekeeping genes. Nature. 1985 Apr 4;314(6010):467–469. doi: 10.1038/314467a0. [DOI] [PubMed] [Google Scholar]
- Yanisch-Perron C., Vieira J., Messing J. Improved M13 phage cloning vectors and host strains: nucleotide sequences of the M13mp18 and pUC19 vectors. Gene. 1985;33(1):103–119. doi: 10.1016/0378-1119(85)90120-9. [DOI] [PubMed] [Google Scholar]
- Yoshinaka Y., Katoh I., Copeland T. D., Oroszlan S. Murine leukemia virus protease is encoded by the gag-pol gene and is synthesized through suppression of an amber termination codon. Proc Natl Acad Sci U S A. 1985 Mar;82(6):1618–1622. doi: 10.1073/pnas.82.6.1618. [DOI] [PMC free article] [PubMed] [Google Scholar]