Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2018 Aug 13;46(19):9971–9989. doi: 10.1093/nar/gky725

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

PMC Copyright notice

Figure 1. — Data analysis pipeline used for genome assembly and annotation. Left. DNA level: the genome sequence of D39V was determined by SMRT sequencing, supported by previously published Illumina data (10,25). Automated annotation by the RAST (13) and PGAP (4) annotation pipelines was followed by curation based on information from literature and a variety of databases and bioinformatic tools. Right. RNA level: Cappable-seq (7) was utilized to identify transcription start sites. Simultaneously, putative transcript ends were identified by combining reverse reads from paired-end, stranded sequencing of the control sample (i.e. not 5′-enriched). Terminators were annotated when such putative transcript ends overlapped with stem loops predicted by TransTermHP (22). Finally, local fragment size enrichment in the paired-end sequencing data was used to identify putative small RNA features. ^αD39V derivative (bgaA::P_ssbB-luc; GEO accessions GSE54199 and GSE69729). ^βThe first 1 kb of the genome file was duplicated at the end, to allow mapping over FASTA boundaries. ^γAnalysis was performed with only sequencing pairs that map uniquely to the genome.