Abstract
The chloroplast genomes of green algae are highly variable in their architecture. In this article we summarize gene content across newly obtained and published chloroplast genomes in Chlorophyceae, including new data from nine of species in Sphaeropleales (Chlorophyceae, Chlorophyta). We present genome architecture information, including genome synteny analysis across two groups of species. Also, we provide a phylogenetic tree obtained from analysis of gene order data for species in Chlorophyceae with fully sequenced chloroplast genomes. Further analyses and interpretation of the data can be found in “Chloroplast phylogenomic data from the green algal order Sphaeropleales (Chlorophyceae, Chlorophyta) reveal complex patterns of sequence evolution” (Fučíková et al., In review) [1].
Specifications Table
Subject area | Biology |
More specific subject area | Phylogenomics |
Type of data | Table, Figures, text file, tree file |
How data were acquired | Phylogenetic analysis of gene order in Badger, Mauve analysis, Geneious genome map |
Data format | Analyzed |
Experimental factors | Genomic data were collected using Illumina HiSeq and annotated in Geneious |
Experimental features | Gene order data analyzed in Badger, Synteny maps in Mauve plugin in Geneious |
Data source location | Storrs, CT, U.S.A. |
Data accessibility | Data is within this article and at NCBI:http://www.ncbi.nlm.nih.govaccessions GenBank: KT199248, GenBank: KT199249, GenBank: KT199250, GenBank: KT199251, GenBank: KT199252, GenBank: KT199253, GenBank: KT199254, GenBank: KT199255, GenBank: KT199256 |
Value of the data
-
•
Chloroplast genomes of green algae in the order Sphaeropleales are currently sparsely studied, thus new data from nine additional species expands knowledge of the structural variation within this order of algae.
-
•
Table summarizes the features present in chloroplast genomes of green algae in Sphaeropleales, useful for comparison to other species in future analyses.
-
•
Figures of nine assembled chloroplast genomes of green algae illustrate the features and their arrangements in these species.
-
•
New gene order data can be used in future phylogenetic analyses that include information from additional species.
1. Data
In this article we present chloroplast genome structural data for nine species of green algae (GenBank: KT199248, GenBank: KT199249, GenBank: KT199250, GenBank: KT199251, GenBank: KT199252, GenBank: KT199253, GenBank: KT199254, GenBank: KT199255, GenBank: KT199256; NCBI: http://www.ncbi.nlm.nih.gov), including comparison of gene and intron content (Fig. 1). Chloroplast genome maps with all annotated features are presented in Fig. 2, Fig. 3, Fig. 4, Fig. 5, Fig. 6, Fig. 7, Fig. 8, Fig. 9, Fig. 10. Synteny comparisons were performed for two sets of species (Fig. 11, Fig. 12). Gene order data were scored for 15 taxa in the class Chlorophyceae (GeneOrder.txt). A phylogenetic analysis of gene order using Badger is presented in (Fig. 13, GeneOrder.tre).
2. Experimental design, materials and methods
Full chloroplast genome sequences were obtained for 9 species of green algae in Sphaeropleales [1]. From these, genome maps were prepared in Geneious (version 6) (Fig. 2, Fig. 3, Fig. 4, Fig. 5, Fig. 6, Fig. 7, Fig. 8, Fig. 9, Fig. 10). Synteny maps were produced using the Mauve plugin in Geneious, which was also used to estimate the numbers of genomic rearrangements among taxa [2] (Fig. 11, Fig. 12). Gene order data were analyzed phylogenetically using Badger [3] for a total of 15 species in Chlorophyceae with fully sequenced cp genomes (Fig. 13). Only single-copy genes that were present in all included genomes (83 genes) were used for the analysis. Genes present in inverted repeats were counted once and inverted repeats were oriented to have the rRNA genes on the positive strand. For the trans-spliced psaA gene, only the first exon was considered. Badger was run for 10,000,000 generations, sampling every 100, with other settings set to default. The first 1000 samples were discarded as burnin.
Acknowledgments
Analyses were supported with funding from NSF grants DEB-1036448 and DEB-1354146 awarded to LAL and POL at University of Connecticut. Analyses were carried out at the Bioinformatics Facility of the University of Connecticut Biotechnology and Bioservices Center (BBC).
Footnotes
Supplementary data associated with this article can be found in the online version at doi:10.1016/j.dib.2016.03.014.
Appendix A. Supplementary material
References
- 1.Fučíková K., Lewis P.O., Lewis L.A. Chloroplast phylogenomic data from the green algal order Sphaeropleales (Chlorophyceae, Chlorophyta) reveal complex patterns of sequence evolution. Mol. Phylogenet. Evol. 2016 doi: 10.1016/j.ympev.2016.01.022. (In press) [DOI] [PubMed] [Google Scholar]
- 2.Darling A.E., Mau B., Perna N.T. progressiveMauve: Multiple genome alignment with gene gain, loss and rearrangement. PloS One. 2010;5(6):e11147. doi: 10.1371/journal.pone.0011147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Simon D., Larget B. Department of Mathematics and Computer Science , Duquesne University; 2004. Bayesian analysis to describe genomic evolution by rearrangement (BADGER), version 1.02 beta. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.