Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2018 Mar;28(3):396–411. doi: 10.1101/gr.222976.117

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© 2018 Tardaguila et al.; Published by Cold Spring Harbor Laboratory Press

This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.

PMC Copyright notice

Figure 1. — Overview of the experimental model and SQANTI analysis. (A) SQANTI workflow. Two main functions are part of SQANTI. sqanti_qc.py uses as input files a FASTA file with transcript sequences, the reference genome in FASTA format, a GTF annotation file, and optionally, full-length and short-read expression files. The function returns a reference-corrected transcriptome, transcript-level and junction-level files with structural and quality descriptors, and a QC graphical report. sqanti_filter.py takes the reference-corrected transcriptome and the transcript-level descriptors file to return a curated transcriptome from which artifacts have been removed. (B) SQANTI classification of transcripts according to their splice junctions and donor and acceptor sites. Splice donors and acceptors are indicated in red and blue, respectively. (SJ) splice junction, (FSM) Full Splice Match, (ISM) Incomplete Splice Match, (NIC) Novel in Catalog, (NNC) Novel Not in Catalog. (C) Experimental system and data processing pipeline. RNA isolated from neural progenitor cells (NPCs) and oligodendrocyte precursor cells (OPCs) was retrotranscribed separately into cDNA and sequenced both by long-read PacBio and short-read Illumina technologies. All PacBio RoIs were joined and processed by the ToFU pipeline to obtain consensus transcripts. Residual (indel) errors were eliminated by comparison to the reference genome to generate a corrected transcriptome, and false transcripts were removed using a SQANTI filter to result in a curated transcriptome. Illumina short reads were mapped against the RefSeq murine transcriptome annotation, the corrected, and the curated PacBio transcriptomes.