Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2013 Apr;20(4):359–371. doi: 10.1089/cmb.2012.0098

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

Copyright 2013, Mary Ann Liebert, Inc.

PMC Copyright notice

FIG. 1. — From de Bruijn graph to pathset graph. (a) A standard de Bruijn graph and the corresponding mapping of mate-pairs. The number on top of each node is the node ID. The smaller blue numbers below/beside each node are the IDs of the corresponding paired right nodes. The bold red, blue, and green paths show how the genome traverses the graph. (b) The condensed de Bruijn graph with edges corresponding to non-branching paths in the standard de Bruijn graph. The dotted red lines indicate edge-pairs. (c) Pathset graph. Initially there are eight pathsets: C₁ = {e₁e₃e₅, e₁e₄e₅}, C₂ = {e₁e₃}, C₃ = {e₃e₅}, C₄ = {e₂e₃e₅, e₂e₄e₅}, C₅ = {e₂e₃e₆, e₂e₄e₆}, C₆ = {e₄e₅}, C₇ = {e₄e₆}, and C₈ = {e₂e₄}. Using the edge-pair information, we find phantom paths (indicated in boldface) and remove them. After removal of all prefix pathsets (C₂ and C₈), the pathset graph has six nodes and consists of three edges: C₁ → C₃ (red path), C₄ → C₆ (green path), and C₅ → C₇ (blue path). Each edge in the pathset graph corresponds to a contig; e.g., C₁ → C₃ spells out the red path (AAACAATCGGCCGCTTTAG).