Skip to main content
. 2018 Nov;28(11):1720–1732. doi: 10.1101/gr.236273.118

Figure 2.

Figure 2.

Incomplete breakpoint graph and missing adjacencies inference. (A) Two reference genomes (R1 and R2) and a target genome T represented in the alphabet of four signed synteny blocks (a, b, c, d). All genomes are circular. Reference R1 could be transformed into R2 with two inversions: (a, b, c, d) → (a, −c, −b, d) → (a, −c, −d, +b). Target genome T is structurally similar to R1; however, T is fragmented into three contigs (missing adjacencies are shown with dashed lines). (B) Incomplete breakpoint graph of all genomes. Each synteny block corresponds to two nodes representing its head and tail (denoted as h and t). Colored edges connect synteny block ends that are adjacent within a corresponding genome. Two inversions between R1 and R2 correspond to two cycles of length four with alternating colors. Because T is fragmented, some red adjacencies are missing (dashed lines). (C) A phylogenetic tree is reconstructed based on pairwise breakpoint distance between the genomes. Because T shares more breakpoints with R1 than with R2, T is closer to R1 within the tree. (D) Missing adjacencies are recovered based on the graph structure and phylogenetic tree. Since T is closer to R1, the algorithm prefers the pair of breakpoints (bt, ch), (dt, ah) to an alternative, (bt, ah), (ch, dt). The reconstructed adjacencies define the synteny block order in the final chromosomes.