
2012 Aug;78(15):5288–5296. doi: 10.1128/AEM.00564-12


Copyright © 2012, American Society for Microbiology. All Rights Reserved.


Fig 2. Diagram of the constrained assembly approach. (A) Paired-end and singleton reads from a metagenomic data set. (B) Assembly of all reads with SOAPdenovo, generating contigs and a de Bruijn graph that connects them. (C) Identification of contigs that consist of integron recombination repeats (shown as orange bars), followed by a depth-first search for paths that start and end at a contig with repeats. At each intermediate node, the search ranks the contigs reached by its outgoing edges by coverage and explores the highest-coverage contig first. The starting and ending contigs may be the same contig. (D) Validation of the assembled sequences (the paths) by read mapping; paths that are not supported by reads are discarded (e.g., the middle sequence in the figure). (E) Identification of the integron repeats and their exact locations in the assembled sequences, and prediction of genes with FragGeneScan. Output sequences lie between two repeats (attC sites) and contain three or fewer genes. (F) Retrieval of the genes from sequences that pass all criteria.
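The path search in panel C can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `find_repeat_paths`, the dictionary-based graph, the `max_len` bound on path length, and the choice to stop a path at the first repeat contig it reaches are all assumptions made for illustration. Only the two ideas stated in the caption are taken from the source: paths start and end at a repeat contig, and outgoing edges are explored in order of decreasing coverage.

```python
from typing import Dict, List, Set


def find_repeat_paths(
    graph: Dict[str, List[str]],   # contig -> contigs reached by its outgoing edges
    coverage: Dict[str, float],    # contig -> read coverage
    repeat_contigs: Set[str],      # contigs flagged as integron repeats
    max_len: int = 6,              # hypothetical cap on contigs per path
) -> List[List[str]]:
    """Enumerate paths that start and end at a repeat contig (sketch only)."""
    paths: List[List[str]] = []

    def dfs(path: List[str]) -> None:
        node = path[-1]
        # Rank successors by coverage and explore the highest one first.
        successors = sorted(
            graph.get(node, []),
            key=lambda c: coverage.get(c, 0.0),
            reverse=True,
        )
        for nxt in successors:
            if nxt in repeat_contigs:
                # Assumption: a path is closed by the first repeat contig reached.
                # This also covers the case where start and end are the same contig.
                paths.append(path + [nxt])
                continue
            # Skip contigs already on the path, and stop extending when no room
            # would remain for a closing repeat contig.
            if nxt in path or len(path) + 1 >= max_len:
                continue
            dfs(path + [nxt])

    for start in sorted(repeat_contigs):
        dfs([start])
    return paths


# Toy graph: r1 and r2 are repeat contigs, a and b are ordinary contigs.
toy_graph = {"r1": ["a", "b"], "a": ["r2"], "b": ["r1"], "r2": []}
toy_coverage = {"r1": 10.0, "a": 5.0, "b": 8.0, "r2": 12.0}
toy_paths = find_repeat_paths(toy_graph, toy_coverage, {"r1", "r2"})
print(toy_paths)
```

In the toy graph, contig `b` has higher coverage than `a`, so the path through `b` (which returns to `r1`) is reported before the path through `a`. Each candidate path would then go through the read-mapping validation of panel D, which this sketch does not attempt.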