Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2016 Nov 29;45(5):e31. doi: 10.1093/nar/gkw1067

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

PMC Copyright notice

Figure 6. — Flowchart for analysis of expression from full-length L1 loci. In all of our approaches we first isolate cytoplasmic RNAs to eliminate a great deal of the background from L1 elements found in the introns of transcripts in the nucleus. Our 5΄ RACE protocol uses the total cytoplasmic RNA for cDNA synthesis with a L1-specific primer that should generate a 1200 base cDNA only from full-length L1 transcripts that initiate at the beginning of the L1 sequence. This 1200 base cDNA is then amplified using the RACE protocol and subjected to PacBio sequencing. Consensus reads from the PacBio for each molecule are aligned to the human genome and those that align throughout their length with an accuracy lower than 99% are eliminated. This not only eliminates fragments with sequence errors, it also eliminates PCR chimeras between different L1 loci. The accurate L1 alignments are then rigorously aligned to the human genome and only reads that align to one full-length L1 locus better than all others are mapped and counted. In the RNA-Seq studies, the same cytoplasmic RNA is subjected to polyA selection to eliminate rRNA and then subjected to a strand-specific, 2 × 100 bp, paired-end RNA-Seq using the Illumina platform. These paired-end reads are aligned rigorously with BOWTIE, accepting only those alignments where both reads align concordantly at one locus better than anywhere else in the human genome. This alignment is used to either count the reads mapping to each individual full-length L1 locus (left branch) or with reads mapping specifically downstream from known polymorphic L1 loci (right branch) as indirect evidence of expression from those loci.