Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2016 Oct 13;11:81. doi: 10.1186/s40793-016-0201-7

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© The Author(s). 2016

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

PMC Copyright notice

Fig. 2 — Tanglegram of genome based trees. a Maximum likelihood tree based on genomic data of organisms affiliated with the genera Phaeobacter, Pseudophaeobacter, Ruegeria, Leisingera and additional strains of the Roseobacter clade inferred with 500 bootstraps (BS) with RAxML after Stamatakis (2014) [100]. The alignment was created from 684 orthologous single-copy genes present in all genomes (Multilocus Sequence Analysis; MLSA) after total protein sequences of the genomes were extracted from the corresponding GenBank files and used for the downstream analysis with an in house pipeline at the Goettingen Genomics Laboratory (J. Vollmers, unpubl.). In brief, clusters of orthologs were generated using proteinortho version 5 [101], inparalogs were removed, the remaining sequences were aligned with MUSCLE [102] and poorly aligned positions automatically filtered from the alignments using Gblocks [103]. b Gene content tree including singletons of the same organisms as in A based on an orthologs-content matrix representing presence or absence of a gene in a certain genome, inferred with Neighbour Joining (1000 BS). Both scripts for this pipeline, PO_2_MLSA.py and PO_2_GENECONTENT.py, are available at github. Numbers at the nodes specify BS values ≥50 %. Scale bars represent 10 % sequence divergence. For Genbank accession numbers see Additional file 1: Table S1. For a clear view only lines were given linking the same species at different positions