Skip to main content
Genome Biology and Evolution logoLink to Genome Biology and Evolution
. 2013 Apr 5;5(5):848–859. doi: 10.1093/gbe/evt053

Evolution of Codon Usage in the Smallest Photosynthetic Eukaryotes and Their Giant Viruses

Stephanie Michely 1,2,6, Eve Toulza 1,2, Lucie Subirana 1,2, Uwe John 3, Valérie Cognat 4, Laurence Maréchal-Drouard 4, Nigel Grimsley 1,2, Hervé Moreau 1,2, Gwenaël Piganeau 5,*
PMCID: PMC3673656  PMID: 23563969

Abstract

Prasinoviruses are among the largest viruses (>200 kb) and encode several hundreds of protein coding genes, including most genes of the DNA replication machinery and several genes involved in transcription and translation, as well as transfer RNAs (tRNAs). They can infect and lyse small eukaryotic planktonic marine green algae, thereby affecting global algal population dynamics. Here, we investigate the causes of codon usage bias (CUB) in one prasinovirus, OtV5, and its host Ostreococcus tauri, during a viral infection using microarray expression data. We show that 1) CUB in the host and in the viral genes increases with expression levels and 2) optimal codons use those tRNAs encoded by the most abundant host tRNA genes, supporting the notion of translational optimization by natural selection. We find evidence that viral tRNA genes complement the host tRNA pool for those viral amino acids whose host tRNAs are in short supply. We further discuss the coevolution of CUB in hosts and prasinoviruses by comparing optimal codons in three evolutionary diverged host–virus-specific pairs whose complete genome sequences are known.

Keywords: selection, codon usage bias, NCLDV, picoeukaryote, microarray, tRNA, Ostreococcus

Introduction

Small phytoplanktonic eukaryotes (cell diameter < 2 µm) are important contributors to photosynthesis in many coastal areas, and seawater metagenomic screens have highlighted their high taxonomic diversity and ubiquity in the ocean’s surface (Viprey et al. 2008; Massana 2011). Three genera of planktonic chlorophytes within the class Mamiellophyceae can be particularly prevalent in coastal surface waters: Bathycoccus (Monier et al. 2011; Vaulot et al. 2012), Micromonas (Worden et al. 2009), and Ostreococcus (Demir-Hilton et al. 2011). Genome comparisons between these organisms revealed unexpected ancient divergence and differences in gene content, reflecting potential niche adaptations (Jancek et al. 2008; Worden et al. 2009; Piganeau et al. 2011; Moreau et al. 2012). Recently, large nucleocytoplasmic double-stranded DNA lytic viruses infecting these algae have been found in seawater and may be one to two orders of magnitude more abundant than their hosts (Bellec et al. 2010). The genomes of seven prasinoviruses, lysing four different species of Mamiellophyceae, each contain approximately 200 genes, including genes involved not only in replication and transcription but also in translation, with the intriguing presence of 4–6 transfer RNAs (tRNAs) (Moreau et al. 2010).

Codon usage bias (CUB), the preferential use of a subset of the synonymous codons for a given amino acid in a protein coding gene, is under a mutation–selection-drift balance (Bulmer 1991) that varies both between genes within a genome and between species (see Lynch [2007] Chapter 6 for a review). Selection may act to optimize the translation of individual genes or as a consequence of selection on nucleotide composition (Hershberg and Petrov 2010; Hildebrand et al. 2010). In many species, highly expressed genes have a higher CUB, and optimal codons correspond to the most abundant isoacceptor tRNAs (Ikemura 1985). This is as expected under selection for optimization of translation, as the use of optimal codons increases its accuracy (Stoletzki and Eyre-Walker 2007; Drummond and Wilke 2008; Zhou et al. 2009) and efficiency (Qian et al. 2012). Recently, Qian et al. (2012) provided evidence that expression is maximal for a balanced optimal codon frequency, proportional to the cellular cognate tRNA concentrations in yeast. Optimal codons in highly expressed genes are thus expected to coevolve with the most abundant cognate tRNAs (Qian et al. 2012). The null hypothesis of neutral evolution of CUB, as a consequence of mutational bias, does not predict a positive relationship between gene expression rates and CUB, unless the expression process itself is mutagenic. Analyses of CUB, and its relationship with gene expression rates and tRNA prevalence, enabled us to test the alternative hypothesis of selection for translational optimization. The understanding of the role of CUB in translational optimization of viral genes has important implications, as CUB may be a hallmark of host–virus specificity (Bahir et al. 2009) or virulence (Mueller et al. 2006). As these microalgae are of raising interest for their biotechnological potential, such as biopharmaceutical protein supplements or sustainable energy sources (Cadoret et al. 2012), CUB has also practical implications for transgene expression in the microalgae (van Ooijen et al. 2012).

Here, we first investigate whether we can find evidence of translational selection on CUB in Ostreococcus tauri and in its virus OtV5 (Derelle et al. 2008) by analyzing the relationship between CUB, expression rates, and tRNA gene prevalence both in the host and in the viral genome. Second, we test whether CUB of host and virus coevolve, using the available sequence data in Bathycoccus prasinos, Micromonas pusilla, O. lucimarinus, O. tauri, and the viruses infecting them, namely BpV1, MpV1, OlV1, and OtV5.

Materials and Methods

Ostreococcus tauri Growth and Lysis by OtV5 and RNA Extraction

Ostreococcus tauri (RCC745, first called OTTH0595; Courties et al. 1994) was cultured in continuous light (100 µmol·photon·m2·s1) at 20 °C on Keller’s medium agitated by air bubbles. Three replicates of 2 l of O. tauri culture in exponential growth phase (5 × 107 cells/ml) were each inoculated with 2 × 1010 viral particles. Lysis was monitored by flow cytometry (Becton Dikinson, San Jose) every hour, by autofluorescence for the host and by Syber Green I (Molecular probes) staining for viruses (Marie et al. 1999; Brussaard et al. 2000).

Two hundred milliliters of each replicate was harvested by centrifugation at T0 (control before inoculation), T2, T5, and T10 (2, 5, and 10 h postinfection, respectively). Total RNA was extracted from each sample using RNeasy Plant Kit (Qiagen, MD) following manufacturer's instructions and after DNase I treatment (Qiagen).

The amount of total RNA was assessed using nanodrop ND-1000 spectrophotometer (PeqLab, Erlangen, Germany). RNA integrity was checked using Nano Chip Assay by Bioanalyzer 2100 (Agilent technologies, Böblingen, Germany). We kept only samples with OD260/280 > 1.75 and OD260/230 ≥ 1.75.

Microarrays

The hybridization was performed on five 4X44K microarray slides where both the host O. tauri and the virus OtV5 genes were spotted. Four 60-mer oligonucleotide probes were designed for each gene on the microarray slide. A total of 33,093 probes were thus designed for 7,552 O. tauri genes and 1,056 probes for the 261 OtV5 genes. After 17 h at 65 °C of hybridization, microarrays were scanned on an Agilent G2505B scanner. Raw data were obtained by Agilent Feature Extraction Software version 9131 (FE), and the quality was monitored with Agilent QC Tool (v1.0) with the metric set GE2-v5_95. Five hundred nanograms of RNA was amplified and labeled by Low input Quick Amp Labeling kit (Agilent). For the first experiment, the control T0 cRNA was labeled with cyanine-3, and T2, T5, and T10 cRNA were labeled with dye cyanine-5. Total RNA from each postinfection sampling time was hybridized with the control (T0) for 17 h at 65 °C.

After microarray disassembly and a wash procedure according to the manufacturer’s instructions (Agilent), microarrays were scanned with the Agilent G2505B scanner. Raw data processing was carried out with the Agilent Feature Extraction Software version 9.1.3.1 (FE), and the quality was monitored with Agilent QC Tool (v1.0) with the metric set GE2-v5_95.

The complete data set has been submitted to the ArrayExpress public database at EBI (Parkinson et al. 2009) under the accession number: E-MEXP-3520.

Expression Rate Estimation

To estimate the expression rate of genes in the host O. tauri, we used the expression signal obtained for the noninfected host cells. We first checked the correlation of the expression measures across the 12 replicates: All expression measures were highly correlated across replicates (Spearman ρ > 0.91, supplementary table S2, Supplementary Material online). We then computed the average expression over all probes for each gene, transformed this into a rank (from 1 to 7,552), and summed the ranks across the 12 replicates for the 7,552 O. tauri genes.

To estimate the expression rate of genes in the virus OtV5, we used the expression signal obtained during infection at 2, 5, and 10 h postinoculation. All nine replicates were checked for consistency by computing correlations between expression measures between replicates (supplementary table S3, Supplementary Material online). We then computed the average expression rate for each of the 261 genes over probes and replicates.

Codon Usage Bias

CUB was estimated for each gene larger than 200 codons to reduce variance due to small samples. We used three measures of codon usage: 1) the relative synonymous codon usage (RSCU) value for each codon (Sharp and Li 1986), 2) the effective number of codons (Wright 1990), estimated by Ncp (Novembre 2002), and 3) we estimated the tRNA adaptation index (tAI) for each gene using the tAI.R code as described in dos Reis et al. (2004). tAI scores the optimality of a coding sequence with respect to a species’ tRNA pools.

We defined optimal codons as codons whose frequency significantly increases with gene expression (Whittle et al. 2011) in O. tauri and OtV5.

Both host and viral complete genome sequences for five prasinoviruses, OtV5, OtV1, OlV1, BpV1, and BpV2 (table 1), were used. Another available sequenced prasinovirus, MpV1, was isolated from an as yet unsequenced Micromonas strain (RCC1109) (Moreau et al. 2010); we have, however, included the genomes of MpV1 and Micromonas RCC299 to analyze the general patterns of CUB in hosts and viruses. Two host–prasinovirus pairs came from samples taken at the North West Mediterranean Sea (Lion Gulf): O. tauri and OtV5, as well as B. prasinos RCC1105 and BpV1. One prasinovirus, OlV1, found in the Lion Gulf, infected a host strain, O. lucimarinus, isolated from the Pacific Coast near San Diego.

Table 1.

Genomes, Number of CDS, and tRNA Genes in Mamiellophyceae and Prasinoviruses

Genomes—Abbreviation GenBank Accession No. CDS No. CDS > 200 No. tRNAs (Permuted tRNA)
Mamiellophyceae
    Ostreococcus tauri—Ota CAID01000001-20 7,890 5,463 47 (6)
    O. lucimarinus—Olu NC_009355-75 7,603 5,806 44 (4)
    Bathycoccus prasinos—Bpra FO082260-78 8,747 7,388 33 (7)
    Micromonas pusilla RCC299Mpu NC_013038-54 10,044 8,250 51 (3)
Prasinoviruses
    OtV5 NC_010191 264 107 5a
    OtV1 FN386611 232 107 4b
    OlV1 NC_014766 251 117 5a
    BpV1 NC_014765 203 100 5 (1)c
    BpV2 HM004430 210 98 5 (1)c
    MpV1 NC_014767 244 109 6d

Note.—CDS, coding sequence.

atRNA-Asn-GUU, Gln-UUG, Ile-UAU, Thr-AGU, and Tyr-GUA (Moreau et al. 2010).

bSame as (a) minus Tyr-GUA (Weynberg et al. 2009).

ctRNA-Asn-GUU, Ile-UAU, Leu-AAG, Tyr-GUA, and Thr-UGU (this study).

dSame as (a) + Leu-AAU.

tRNA Data

The available annotated tRNAs did not allow the translation of all codons in both Ostreococcus genomes. We therefore used tRNA-scanSE (Lowe and Eddy 1997) with default parameters to identify tRNAs and completed these predictions with recent findings on permuted tRNAs in Ostreococcus and Micromonas (Maruyama et al. 2010). We curated these predictions manually to provide a complete annotation of 47 nuclear tRNA genes in O. tauri (supplementary table S4, Supplementary Material online). We used the software SPLITS (Sugahara et al. 2006) to detect additional permuted tRNA genes that cannot be detected with tRNA-scanSE in the six viral genomes and the recently sequenced B. prasinos genome.

Statistical Analysis

All statistical analyses were performed with the R software version 2.10.1 http://www.r-project.org (Ihaka and Gentleman 1996). We used ADE4 (Thioulouse and Dray 2007) and seqinR packages for within-group correspondence analysis (CA) as described by Charif et al. (2005).

The expression rates and CUB values deviate from a normal distribution (Shapiro-Wilk test, P value < 1010). Therefore, we used nonparametric statistics to test the significance of the correlation between expression rates and CUB. We used both the Spearman correlation coefficient on all genes with more than 200 codons (table 1) and the Kruskal–Wallis analysis of variance test on gene expression classes, defined so that each class contains the same number of genes. We used the R pcor.test function from Kim and Yi (2006) to estimate partial correlation coefficients between expression rates, CUB, and gene length.

To assess correspondence between CUB of hosts and viruses, we counted the number of identical optimal codons in the host and the virus genome. We then estimated the distribution of the number of expected identical optimal codons by randomly sampling i codons among the n amino acids that had an optimal codon for both the host and the virus. The sampling was performed with random generation for the binomial distribution with probability given by the frequency of each codon in the lowly expressed genes (supplementary table S6, Supplementary Material online).

Results

General Features of CUB in Mamiellophyceae and Prasinoviruses

We performed a correspondence analysis on codon usage in each genome of all available Mamiellophyceae host–virus pairs (fig. 1). Because of the heterogeneity in base composition along the genomes of Bathycoccus, Micromonas, and Ostreococcus (Piganeau et al. 2011), the codon usage of the host genomes was computed independently for the normal chromosomes, the small outlier chromosome and the big outlier chromosome, that have a lower GC composition. Because the variability in amino acid composition between proteins is a confounding factor when analyzing synonymous codon usage variability, we used total codon frequencies (Perriere and Thioulouse 2002). The first two eigenvectors from each analysis were used and incorporated most information from the data sets (89.4% of variance explained). The first axis (78.2% of variance) mainly reflects the GC content at the third codon position (Spearman correlation coefficient between first axis object coordinates and the GC3 ρ = 0.86, P < 1015). On the second axis, all prasinoviruses cluster together, whereas hosts formed independent, separate clusters, with the two Ostreococcus hosts grouped together. The general picture is that there is considerable variation in CUB between hosts and, as a consequence of the outlier chromosomes, between chromosomes within a host. Codon usage is G or C biased for all codons in Micromonas, for all codons but Arg2 in O. tauri, and for all codons but Arg2 and Gly in O. lucimarinus. In B. prasinos, seven amino acids (Arg2, Gln, Glu, Ile, Lys, Phe, and Arg4) have an A or T biased codon usage (supplementary table S1, Supplementary Material online).

Fig. 1.—

Fig.
1.—

Principal component analysis of codon prevalence of the six prasinoviruses and the four Mamiellophyceae genomes, divided according to normal chromosomes (Ota, Olu, Bpra, and Mpu), low GC regions of the big outlier chromosome (Ota-BOC, Olu-BOC, Bpra-BOC, and Mpu-BOC), and small outlier chromosome (Ota-SOC, Olu-SOC, Bpra-SOC, and Mpu-SOC). Prasinoviruses are represented in red. We also projected codons on the two principal axes, 4-fold degenerate codons are blue, 3-fold degenerate codons are green, and 2-fold degenerate codons are pink.

The codon usage in prasinoviruses is A or T biased for most amino acids, the most extreme AT-biased genome, BpV1, using the doublet AT to end codons the most frequently. There is also considerable variation within viruses infecting different hosts, whereas CUB is nearly identical between OtV1 and OtV5, and between BpV1 and BpV2.

Evidence for Selection for Optimization of Translation in O. tauri

To estimate the expression level in O. tauri genes, we used the expression signal obtained for the noninfected host cells as a reference. We found a significant negative correlation between expression rate and the effective number of codons, Ncp, in O. tauri (fig. 2A, n = 5,463, Spearman ρ = −0.22, P < 1010), CUB increasing with gene expression. As CUB decreases with gene length in many eukaryotes (Duret and Mouchiroud 1999; Qiu et al. 2011; Whittle et al. 2011), gene length might be a confounding factor in the relationship between gene expression and CUB. Therefore, we reanalyzed the relationship between Ncp and expression rates taking gene length into account (partial correlation coefficient). The relationship between expression rates and gene lengths in our data set is tenuous (Spearman ρ = 0.05, P < 103) and has no confounding effect on the relationship between Ncp and expression rates (Spearman partial correlation taking gene length into account ρ = −0.22, P < 1015).

Fig. 2.—

Fig. 2.—

CUB, measured as Ncp, as a function of expression levels in Ostreococcus tauri, Kruskal–Wallis analysis of variance, P < 10−10 (A) and OtV5, 10 h postinfection, P < 0.01 (B). Host and viral data include genes longer than 200 codons, and gene expression classes have been defined to have an equal number of genes between classes.

tRNA Sharing Strategies in the Streamlined O. tauri Genome

If there is selection on CUB to optimize translation, we expect both the number of tRNA genes to correspond to the number of amino acids and the number of isoacceptor tRNAs to correspond to the optimal codons (Duret 2002; Qian et al. 2012). The number of tRNA copies in the genome correlates with cellular levels of tRNAs in Escherichia coli (Dong et al. 1996), Bacillus subtilis (Kanaya et al. 1999), Chlamydomonas reinhardtii (Cognat et al. 2008), and Saccharomyces cerevisiae (Percudani et al. 1997; Tuller et al. 2010). We therefore investigated whether optimal codons in highly expressed genes correspond to isoacceptor tRNA genes available in the genome.

We detected 47 tRNAs in O. tauri using tRNA-scan SE (Lowe and Eddy 1997) and SPLITS (Sugahara et al. 2006), which enabled the identification of six permuted tRNAs (supplementary table S4, Supplementary Material online). This is a modest number of tRNAs when compared with other eukaryotic green algae such as Chlamydomonas (259 tRNAs, [Cognat et al. 2008]) or in the yeast S. cerevisiae (275 tRNAs [Hani and Feldmann 1998]).

Because of this small number of tRNAs, tRNA sharing strategies must be considered in the coevolution of optimal codons and isoacceptor tRNAs to infer which tRNAs are dispensable (when only one tRNA is available for all codons corresponding to one amino acid, it cannot be used to characterize the tRNA-codon coevolution). Since the Wobble rule was first proposed (Crick 1966), allowing one tRNA to pair with several codons, many additional post-transcriptional tRNA modifications have been reported that allow one tRNA to pair with as many as four codons (see Agris et al. [2007] for a review). Four different tRNA sharing strategies have been described, one of them allowing as few as 25 tRNAs to decode all 61 codons (Grosjean et al. 2010). The most common tRNA sharing strategy (pairing rule 1 in table 3) consists in the depletion of ANN-tRNAs (A at the first position of anticodon, corresponding to the third position of the codon) or GNN-tRNAs. A second pairing rule consists in the additional depletion of ANN or GNN and CNN-containing tRNAs. A third pairing rule has been reported in bacteria and involves the total depletion of ANN, GNN, and CNN tRNAs. A fourth pairing rule consists in the depletion of GNN and UNN-containing tRNAs. This last strategy has been reported for the arginine tRNA in most bacteria and for leucine and arginine in hemiascomycetous yeasts (Grosjean et al. 2010). Confronting these pairing rules with available tRNAs in O. tauri, we found that the combination of pairing rules 2 and 4 allows all codons to be decoded with the available tRNAs (table 3). Currently, pairing rule 2 has been observed in many eubacteria (such as Bacillus halodurans), a few Archea (such as Methanopyrus kandleri), but only exceptionally in Eukarya (Grosjean et al. 2010). This combination of pairing rules 2 and 4 implies that 14 tRNA genes are dispensable in O. tauri. Among these dispensable tRNAs, two correspond to nonoptimal codons, two correspond to codons whose frequency does not change with expression rates, and 10 correspond to optimal codons (table 3). Dispensable tRNAs are thus significantly more abundant for optimal codons in O. tauri (Binomial test, P = 0.02). Consistent with this excess of dispensable tRNA for optimal codons, the tRNA adaptation index (tAI) that estimates the correspondence of codon usage with available tRNAs for each gene (dos Reis et al. 2004) correlates positively with CUB, measured as Ncp (Spearman ρ = −0.48, P < 1015).

Table 3.

Number of Observed: Expected Identical Optimal Codons between Host and Viral Genomes

OtV5 OlV1 BpV1 MpV1
Ostreococcus tauri 8:4** (14) 10:5** (14) 5:2** (10) 7:4* (12)
O. lucimarinus 12:4* (15) 12:7** (14) 8:3*** (10) 9:4*** (12)
Bathycoccus prasinos 11:4*** (13) 10:6* (12) 7:2*** (8) 7:4 (10)
Micromonas pusilla RCC299 14:6*** (15) 11:6* (14) 7:2*** (9) 12:4*** (12)

Note.—ns, nonsignificant. The expected number of matches is estimated from average base frequencies at 4-fold and 2-fold synonymous positions in the viral genes (supplementary table S6, Supplementary Material online). P values are computed by random sampling the distribution of expected identical codons. The number of amino acids for each comparison is indicated in brackets and corresponds to amino acids where both host and virus have one optimal codon.

*P value < 0.05.

**P value < 0.01.

***P value < 0.001.

CUB Increases with Expression Rate in the Prasinovirus OtV5

We found that the CUB of the virus OtV5 genes, that infects O. tauri, also increases with expression levels for all times after infection in the experiment: 2 h after infection (n = 107, ρ = −0.20, P < 0.05), 5 h (n = 107, ρ = −0.36, P < 103), and 10 h after infection (fig. 2B, n = 107, ρ = −0.37 P < 104). There is no confounding effect of gene length on either expression rates or CUB in our viral gene data set. The positive relationship between CUB and expression rates of viral genes thus provides evidence for selection on translational optimization in the viral genome. Because the virus uses the translation machinery of its host, CUB of the viral genes is expected to converge toward the CUB of the host genes, which are GC rich in O. tauri (supplementary table S2, Supplementary Material online). Consistent with this, we observe a significant increase of the GC content at 4-fold degenerate sites in highly expressed viral genes (30% most expressed genes), GC3 = 52%, when compared with the other viral genes, GC3 = 47% (Wilcoxon test, P < 0.02).

How Can We Explain the Presence of Viral tRNAs?

OtV5 uses the host translation machinery, the tRNA pool of the host, plus its five additional tRNAs to translate its gene pool. Under a purely neutral hypothesis, viral tRNAs are drawn at random from the host genome (Bailly-Bechet et al. 2007) and should thus correspond to the most abundant host tRNAs. This is not the case as the number of tRNAs present in OtV5 does not correspond to the most abundant tRNAs in the host (Wilcoxon test, P = 0.32). The alternative hypothesis is that these complementary tRNAs are maintained by selection. Under selection for translational optimization in the virus, we might hypothesize that 1) the additional tRNAs correspond to optimal codons in the viral genome and/or 2) the additional tRNAs correspond to the most abundant amino acids in the virus and/or 3) they complement the host tRNA pool to optimize viral protein translation. Only two of the five viral tRNA anticodons correspond to the optimal codon in the OtV5 viral genome (tRNA-TTC-Asn and tRNA-TAC-Tyr, table 1 and supplementary table S1, Supplementary Material online). One viral tRNA, tRNA-Gln-UUG, corresponds to an amino acid without optimal codon (i.e., neither UUG nor CUG frequencies vary with expression rates). This leaves us with two viral tRNAs, tRNA-ATA-Ile and tRNA-ACT-Thr, that correspond to the nonoptimal viral codon in both amino acids. There is thus no evidence for an excess of exact codon–anticodon matches in these viral tRNAs. The complementary viral tRNAs do not correspond to the most used amino acids in the viral genome (Wilcoxon test, P = 0.60) nor to the most used amino acids in the highly expressed viral genes (Wilcoxon test, P = 0.55).

To investigate whether these viral tRNAs complemented the host tRNA pool to render it better adapted to viral amino acid requirements, we compared the ratio of host tRNAs per viral amino acid 1) for the 15 amino acids that had no viral tRNA versus the 2) the five amino acids with one viral tRNA. We found that this ratio was significantly lower for the group of amino acids with one cognate viral tRNA in the O. tauri–OtV5 host prasinovirus pair (fig. 3, Wilcoxon P = 0.02).

Fig. 3.—

Fig. 3.—

Proportion of host tRNA gene per viral AA, depending on whether the viral AA have a cognate viral tRNA (with tRNA) or not (without tRNA).

We repeated this analysis in two other host–virus pairs: O. lucimarinus and OlV1 and B. prasinos and BpV1. When one tRNA was lacking in either O. lucimarinus and B. prasinos (table 1), we added tRNAs according to the minimal requirement of pairing rules 2 and 4 for these genomes. We observed that amino acids with cognate viral tRNA tend to have a shortage of corresponding host tRNAs for both OlV1–O. lucimarinus and BpV1–B. prasinos, but the difference is not significant (fig. 3). Overall, this trend suggests that the viral tRNAs optimize viral translation by increasing the available tRNAs for those viral amino acids where the host’s tRNA shortage is more pronounced.

Does Host–Viral CUB Coevolution Reflect Viral Specificity?

If one virus and its host evolve for a sufficient amount of time, one may expect codon usage of a virus to be fine tuned to the translational machinery of its host and thus the CUB of its host (Carbone 2008). Under this assumption, the codon usage of the virus is expected to be more alike to codon usage of a host than to codon usage of a nonhost, despite different substitution patterns on their genomes reflected by average differences in GC content.

We investigated the correlation between host and virus CUB by randomly sampling RSCU from independent codons from host and viral genomes. However, except for the OtV5–O. tauri pair, we could not detect any significant positive relationship between viral and host RSCU (supplementary table S7, Supplementary Material online). As expected from the CA, CUB of host and prasinoviruses are more alike on low GC outlier chromosomes, but the higher correlation coefficients do not correspond to the specific host–virus pairs (supplementary table S7, Supplementary Material online).

We investigated the number of identical optimal codons between hosts and viruses. For O. tauri and OtV5, optimal codons can be inferred from the codons whose frequency increases significantly with expression. In the other host and viral genomes, we took Ncp as a proxy for expression rates to identify optimal codons, because of the positive correlation we found between Ncp and expression rates in O. tauri and OtV5. The expected number of identical codons between hosts and viruses can be inferred from the base composition of the viral genome alone (table 3, supplementary table S6, Supplementary Material online). The observed number of identical codons is always significantly higher than the expected number of identical codons for all host–virus pairs (table 3).

Discussion

General Features of Codon Usage in Mamiellophyceae and Their Giant Viruses

The general picture is that codon usage is biased toward GC in hosts, whereas codon usage is biased toward AT in viruses (fig. 1). However, slight differences in codon usage exist between hosts and between viral genomes, provided that they infect different host species. Previous analysis of base composition in 37 double-stranded DNA phages and their bacterial host provided evidence that base composition is on average 4% AT richer in the viral genome than in the host genome (Rocha and Danchin 2002). This trend might be the consequence of the lower cost of synthesis of A and T nucleotides when compared with G or C nucleotides. Thus, viral genomes with higher AT content would more efficiently exploit the cell resources and become selected (Rocha and Danchin 2002). Nucleocytoplasmic large double-stranded DNA viruses (NCLDV) consist of at least six families of viruses infecting a broad variety of eukaryotic hosts (Iyer et al. 2001; Yutin and Koonin 2012). Base composition analysis in 41 vertebrate-infecting NCLDVs suggest that CUB does reflect mutational pressures rather than translational selection, consistent with the lack of evidence of translational selection in their hosts (Shackelton et al. 2006). However, the life cycle of vertebrate NCLDVs, which can often persist in host cells over long periods, is quite different to those of the lytic prasinoviruses. Here, although regional mutational pressures are obvious given 1) the biased GC content of intergenic regions in Ostreococcus (Piganeau et al. 2009) and 2) the significant correlation in GC frequency at first, second, and third codon positions of genes in Mamiellophycean hosts (Piganeau et al. 2011) and 3) GC content at first and third positions in OtV5 genes (Spearman ρ = 0.37, P < 104), we provide evidence of translational selection on codon usage in the host and viral genomes.

Translational Optimization in O. tauri and Its Virus OtV5

There are two hallmarks of selection for translational optimization in O. tauri and its virus OtV5. First, there is a positive relationship between CUB and expression rates. This phenomenon is well known in diverse organisms such as E. coli, S. cerevisiae, Neurospora tetrasperma, N. discreta, Caenorhabditis elegans, Arabidopsis thaliana, Drosophila Melanogaster, and Silene latifolia (Gouy and Gautier 1982; Ikemura 1985; Duret and Mouchiroud 1999; Coghlan and Wolfe 2000; dos Reis et al. 2003; Qiu et al. 2011; Whittle et al. 2011).

An alternative hypothesis to explain the positive relationship between expression rate and CUB in organisms with an excess of GC optimal codons could be a transcription-coupled AT→GC mutation rate (see Steele [2009] for a review on transcription coupled mutation processes). Because all but two optimal codons end with G or C in O. tauri (table 2), we performed additional analyses to test this hypothesis. Under this alternative scenario, the nucleotides that are subject to the transcription coupled mutation process should always increase with expression (i.e., GC content should increase with expression if the transcription-induced mutation bias goes from AT→GC). However, this is not the case; the Proline codon ending in C is nonoptimal, and its frequency decreases with expression (Spearman ρ = −0.10 and P < 103). For another amino acid, glycine, one of the optimal codons ends with T and the correlation between optimal codon frequency and expression is positive, though tenuous (Spearman ρ = 0.07 and P < 103). These two observations infirm the hypothesis of a transcription-induced GC-biased mutation process, and we are therefore confident that the observed correlation between optimal codon usage and expression rate is the consequence of selection for translational optimization.

Table 2.

Anticodon–Codon Correspondence in Ostreococcus tauri, According to Three Different tRNA Sparing Strategies

graphic file with name evt053t1a.jpg
graphic file with name evt053t1b.jpg

Note.—ns, nonsignificant. Pairing rule 4 for Leu4 is indicated by a gray arrow (Grosjean et al. 2010). For each dispensable tRNA, we indicate whether there is a corresponding optimal (o) codon, showing the best significant positive correlation to expression rate (codon in bold) or nonoptimal (u) codon, showing a significant negative correlation to expression rate. (n) indicates that there is no significant correlation between relative codon prevalence and expression for an amino acid.

aPermuted tRNAs.

Second, there is an excess of dispensable tRNAs for optimal codons in O. tauri (table 3). A similar correspondence with isoacceptor tRNA availability has been observed in E. coli, S. cerevisiae, and C. elegans (Duret 2002). Also, the number of tRNA genes per amino acid increases with amino acid frequency in O. tauri (Spearman ρ = 0.75, P < 104), whereas the viral tRNA genes complement the host tRNA pool for viral amino acids with less corresponding host–tRNAs (fig. 3).

The number of tRNA gene copies is positively correlated to growth rates in bacterial species (Rocha 2004). Consistent with the low number of tRNA genes in O. tauri (47, table 3), this phytoplanktonic cell has a slow doubling time of 8 h in optimal growth conditions (Farinas et al. 2006), which is over four times the doubling time of S. cerevisiae, which has a similar genome size.

Serendipitously, our tRNA annotation revealed a permuted tRNA in BpV1 (table 1 and supplementary table S5, Supplementary Material online). Permuted tRNAs are a recently discovered shared characteristic between archaeal and eukaryotic species (Maruyama et al. 2010), and here, we show that this characteristic can now be extended to double-stranded DNA prasinoviruses.

CUB Reflects Host–Viral Coevolution Not Host Specificity

Although whole CUB is not a good indicator of coevolution in prasinoviruses and their hosts (fig. 1), we found a significant excess of identical optimal codons between host and viruses (table 3). This is a consequence of translational selection acting on the host and on the viral codon usage, viral genes that mimic their host CUB will benefit from the same translation efficiency and accuracy as host genes. This suggests that host and viral genomes have coevolved to have a convergent CUB in highly expressed genes and that host switching should have a significant cost on viral gene expression. This being said, the number of identical optimal codons cannot be used as a proxy for host specificity, because, for example, BpV1 shares a high number of identical preferred codons with O. lucimarinus, a nonhost species.

In conclusion, we found evidence for optimization of translation in the host and prasinovirus genomes for highly expressed genes, which preferentially use codons with a higher number of genomic copies of host tRNAs. Our analyses provide evidence for coevolution of optimal codons between OtV5 and O. tauri and more generally between all prasinoviruses and their Mamiellophycean hosts. Careful annotation of tRNA genes, which may contain permuted tRNA genes in hosts and prasinoviruses, enabled us to infer tRNA sharing rules in O. tauri. The presence of tRNAs in prasinoviruses is biased toward viral amino acid with fewer tRNA genes in the host genome, suggesting selection on viral tRNA content for translational optimization of the viral genes. The identification of optimal codons might be useful to consider for fine tuning transgene expression both in the microalgae (van Ooijen et al. 2012) and in the prasinovirus for future research, including biotechnological applications.

Supplementary Material

Supplementary tables S1–S7 are available at Genome Biology and Evolution online (http://www.gbe.oxfordjournals.org/).

Supplementary Data
supp_5_5_848__index.html (1.4KB, html)

Acknowledgments

The authors thank two anonymous referees for constructive comments and the Genomics of phytoplankton team for stimulating discussions, especially Romain Blanc-Mathieu and Rozenn Thomas. They also thank Adam Eyre-Walker for insightful suggestions on a previous version of the manuscript. This work was funded by Agence Nationale de la Recherche grants PHYTADAPT no. NT09_567009, REVIREC no. 12-BSV7-0006-01, and European Community 7th Framework Program FP7 under grant agreement no. 254619.

Literature Cited

  1. Agris PF, Vendeix APF, Graham WD. tRNA's Wobble decoding of the genome: 40 years of modification. J Mol Biol. 2007;366:1–13. doi: 10.1016/j.jmb.2006.11.046. [DOI] [PubMed] [Google Scholar]
  2. Bahir I, Fromer M, Prat Y, Linial M. Viral adaptation to host: a proteome-based analysis of codon usage and amino acid preferences. Mol Syst Biol. 2009;5:311. doi: 10.1038/msb.2009.71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bailly-Bechet M, Vergassola M, Rocha E. Causes for the intriguing presence of tRNAs in phages. Genome Res. 2007;17:1486–1495. doi: 10.1101/gr.6649807. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bellec L, Grimsley N, Desdevises Y. Isolation of prasinoviruses of the green unicellular algae Ostreococcus spp. on a worldwide geographical scale. Appl Environ Microbiol. 2010;76:96–101. doi: 10.1128/AEM.01799-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Brussaard CP, Marie D, Bratbak G. Flow cytometric detection of viruses. J Virol Methods. 2000;85:175–182. doi: 10.1016/s0166-0934(99)00167-6. [DOI] [PubMed] [Google Scholar]
  6. Bulmer M. The selection-mutation-drift theory of synonymous codon usage. Genetics. 1991;129:897–907. doi: 10.1093/genetics/129.3.897. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Cadoret J, Garnier M, Saint-Jean B. Microalgae, functional genomics and biotechnology. Adv Bot Res. 2012;64:285–341. [Google Scholar]
  8. Carbone A. Codon bias is a major factor explaining phage evolution in translationally biased hosts. J Mol Biol. 2008;66:210–223. doi: 10.1007/s00239-008-9068-6. [DOI] [PubMed] [Google Scholar]
  9. Charif D, Thioulouse J, Lobry JR, Perriere G. Online synonymous codon usage analyses with the ade4 and seqinR packages. Bioinformatics. 2005;21:545–547. doi: 10.1093/bioinformatics/bti037. [DOI] [PubMed] [Google Scholar]
  10. Coghlan A, Wolfe KH. Relationship of codon bias to mRNA concentration and protein length in Saccharomyces cerevisiae. Yeast. 2000;16:1131–1145. doi: 10.1002/1097-0061(20000915)16:12<1131::AID-YEA609>3.0.CO;2-F. [DOI] [PubMed] [Google Scholar]
  11. Cognat V, et al. On the evolution and expression of Chlamydomonas reinhardtii nucleus-encoded transfer RNA genes. Genetics. 2008;179:113–123. doi: 10.1534/genetics.107.085688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Courties C, et al. Smallest eukaryotic organism. Nature. 1994;370:255–255. [Google Scholar]
  13. Crick FH. Codon–anticodon pairing: the wobble hypothesis. J Mol Biol. 1966;19:548–555. doi: 10.1016/s0022-2836(66)80022-0. [DOI] [PubMed] [Google Scholar]
  14. Demir-Hilton E, et al. Global distribution patterns of distinct clades of the photosynthetic picoeukaryote Ostreococcus. ISME J. 2011;5:1095–1107. doi: 10.1038/ismej.2010.209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Derelle E, et al. Life-cycle and genome of OtV5, a large DNA virus of the pelagic marine unicellular green alga Ostreococcus tauri. PLoS One. 2008;3:e2250. doi: 10.1371/journal.pone.0002250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Dong H, Nilsson L, Kurland CG. Co-variation of tRNA abundance and codon usage in Escherichia coli at different growth rates. J Mol Biol. 1996;260:649–663. doi: 10.1006/jmbi.1996.0428. [DOI] [PubMed] [Google Scholar]
  17. dos Reis M, Savva R, Wernisch L. Solving the riddle of codon usage preferences: a test for translational selection. Nucleic Acids Res. 2004;32:5036–5044. doi: 10.1093/nar/gkh834. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. dos Reis M, Wernisch L, Savva R. Unexpected correlations between gene expression and codon usage bias from microarray data for the whole Escherichia coli K-12 genome. Nucleic Acids Res. 2003;31:6976–6985. doi: 10.1093/nar/gkg897. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Drummond DA, Wilke CO. Mistranslation-induced protein misfolding as a dominant constraint on coding-sequence evolution. Cell. 2008;134:341–352. doi: 10.1016/j.cell.2008.05.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Duret L. Evolution of synonymous codon usage in metazoans. Curr Opin Genet Dev. 2002;12:640–649. doi: 10.1016/s0959-437x(02)00353-2. [DOI] [PubMed] [Google Scholar]
  21. Duret L, Mouchiroud D. Expression pattern and, surprisingly, gene length shape codon usage in Caenorhabditis, Drosophila, and Arabidopsis. Proc Natl Acad Sci U S A. 1999;96:4482–4487. doi: 10.1073/pnas.96.8.4482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Farinas B, et al. Natural synchronisation for the study of cell division in the green unicellular alga Ostreococcus tauri. Plant Mol Biol. 2006;60:277–292. doi: 10.1007/s11103-005-4066-1. [DOI] [PubMed] [Google Scholar]
  23. Gouy M, Gautier C. Codon usage in bacteria: correlation with gene expressivity. Nucleic Acids Res. 1982;10:7055–7074. doi: 10.1093/nar/10.22.7055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Grosjean H, de Crècy-Lagard V, Marck C. Deciphering synonymous codons in the three domains of life: co-evolution with specific tRNA modification enzymes. FEBS Lett. 2010;584:252–264. doi: 10.1016/j.febslet.2009.11.052. [DOI] [PubMed] [Google Scholar]
  25. Hani J, Feldmann H. tRNA genes and retroelements in the yeast genome. Nucleic Acids Res. 1998;26:689–696. doi: 10.1093/nar/26.3.689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hershberg R, Petrov DA. Evidence that mutation is universally biased towards AT in bacteria. PLoS Genet. 2010;6(9):e1001115. doi: 10.1371/journal.pgen.1001115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hildebrand F, Meyer A, Eyre-Walker A. Evidence of selection upon genomic GC-content in bacteria. PLoS Genet. 2010;6(9):e1001107. doi: 10.1371/journal.pgen.1001107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Ihaka R, Gentleman R. R: a language for data analysis and graphics. J Comput Graph Stat. 1996;5:299–314. [Google Scholar]
  29. Ikemura T. Codon usage and tRNA content in unicellular and multicellular organisms. Mol Biol Evol. 1985;2:13–34. doi: 10.1093/oxfordjournals.molbev.a040335. [DOI] [PubMed] [Google Scholar]
  30. Iyer LM, Aravind L, Koonin EV. Common origin of four diverse families of large eukaryotic DNA viruses. J Virol. 2001;75:11720–11734. doi: 10.1128/JVI.75.23.11720-11734.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Jancek S, Gourbiere S, Moreau H, Piganeau G. Clues about the genetic basis of adaptation emerge from comparing the proteomes of two Ostreococcus ecotypes (Chlorophyta, Prasinophyceae) Mol Biol Evol. 2008;25:2293–2300. doi: 10.1093/molbev/msn168. [DOI] [PubMed] [Google Scholar]
  32. Kanaya S, Yamada Y, Kudo Y, Ikemura T. Studies of codon usage and tRNA genes of 18 unicellular organisms and quantification of Bacillus subtilis tRNAs: gene expression level and species-specific diversity of codon usage based on multivariate analysis. Gene. 1999;238:143–155. doi: 10.1016/s0378-1119(99)00225-5. [DOI] [PubMed] [Google Scholar]
  33. Kim SH, Yi SV. Correlated asymmetry of sequence and functional divergence between duplicate proteins of Saccharomyces cerevisiae. Mol Biol Evol. 2006;23:1068–1075. doi: 10.1093/molbev/msj115. [DOI] [PubMed] [Google Scholar]
  34. Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Lynch M. The Origins of Genome Architecture. Sinauer Associates. 2007 [Google Scholar]
  36. Marie D, Brussaard CPD, Thyrhaug R, Bratbak G, Vaulot D. Enumeration of marine viruses in culture and natural samples by flow cytometry. Appl Environ Microbiol. 1999;65:45–52. doi: 10.1128/aem.65.1.45-52.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Maruyama S, Sugahara J, Kanai A, Nozaki H. Permuted tRNA genes in the nuclear and nucleomorph genomes of photosynthetic eukaryotes. Mol Biol Evol. 2010;27:1070–1076. doi: 10.1093/molbev/msp313. [DOI] [PubMed] [Google Scholar]
  38. Massana R. Eukaryotic picoplankton in surface oceans. Annu Rev Microbiol. 2011;65:91–110. doi: 10.1146/annurev-micro-090110-102903. [DOI] [PubMed] [Google Scholar]
  39. Monier A, et al. Phosphate transporters in marine phytoplankton and their viruses: cross-domain commonalities in viral-host gene exchanges. Environ Microbiol. 2011;14:162–176. doi: 10.1111/j.1462-2920.2011.02576.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Moreau H, et al. Marine prasinovirus genomes show low evolutionary divergence and acquisition of protein metabolism genes by horizontal gene transfer. J Virol. 2010;84:12555–12563. doi: 10.1128/JVI.01123-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Moreau H, et al. Gene functionalities and genome structure in Bathycoccus prasinos reflect cellular specializations at the base of the green lineage. Genome Biol. 2012;13:R74. doi: 10.1186/gb-2012-13-8-r74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Mueller S, Papamichail D, Coleman JR, Skiena S, Wimmer E. Reduction of the rate of poliovirus protein synthesis through large-scale codon deoptimization causes attenuation of viral virulence by lowering specific infectivity. J Virol. 2006;80:9687–9696. doi: 10.1128/JVI.00738-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Novembre JA. Accounting for background nucleotide composition when measuring codon usage bias. Mol Biol Evol. 2002;19:1390–1394. doi: 10.1093/oxfordjournals.molbev.a004201. [DOI] [PubMed] [Google Scholar]
  44. Parkinson H, et al. ArrayExpress update-from an archive of functional genomics experiments to the atlas of gene expression. Nucleic Acids Res. 2009;37:D868–D872. doi: 10.1093/nar/gkn889. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Percudani R, Pavesi A, Ottonello S. Transfer RNA gene redundancy and translational selection in Saccharomyces cerevisiae. J Mol Biol. 1997;268:322–330. doi: 10.1006/jmbi.1997.0942. [DOI] [PubMed] [Google Scholar]
  46. Perriere G, Thioulouse J. Use and misuse of correspondence analysis in codon usage studies. Nucleic Acids Res. 2002;30:4548–4555. doi: 10.1093/nar/gkf565. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Piganeau G, Grimsley N, Moreau H. Genome diversity in the smallest marine photosynthetic eukaryotes. Res Microbiol. 2011;162:570–577. doi: 10.1016/j.resmic.2011.04.005. [DOI] [PubMed] [Google Scholar]
  48. Piganeau G, Vandepoele K, Gourbiere S, Van de Peer Y, Moreau H. Unravelling cis-regulatory elements in the genome of the smallest photosynthetic eukaryote: phylogenetic footprinting in Ostreococcus. J Mol Evol. 2009;69:249–259. doi: 10.1007/s00239-009-9271-0. [DOI] [PubMed] [Google Scholar]
  49. Qian W, Yang JR, Pearson NM, Maclean C, Zhang J. Balanced codon usage optimizes eukaryotic translational efficiency. PLoS Genet. 2012;8:e1002603. doi: 10.1371/journal.pgen.1002603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Qiu S, Bergero R, Zeng K, Charlesworth D. Patterns of codon usage bias in Silene latifolia. Mol Biol Evol. 2011;28:771–780. doi: 10.1093/molbev/msq251. [DOI] [PubMed] [Google Scholar]
  51. Rocha E, Danchin A. Base composition bias might result from competition for metabolic resources. Trends Genet. 2002;18:291–294. doi: 10.1016/S0168-9525(02)02690-2. [DOI] [PubMed] [Google Scholar]
  52. Rocha EP. Codon usage bias from tRNA's point of view: redundancy, specialization, and efficient decoding for translation optimization. Genome Res. 2004;14:2279–2286. doi: 10.1101/gr.2896904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Shackelton LA, Parrish CR, Holmes EC. Evolutionary basis of codon usage and nucleotide composition bias in vertebrate DNA viruses. J Mol Evol. 2006;62:551–563. doi: 10.1007/s00239-005-0221-1. [DOI] [PubMed] [Google Scholar]
  54. Sharp PM, Li WH. Codon usage in regulatory genes in Escherichia coli does not reflect selection for “rare” codons. Nucleic Acids Res. 1986;14:7737–7749. doi: 10.1093/nar/14.19.7737. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Steele EJ. Mechanism of somatic hypermutation: critical analysis of strand biased mutation signatures at A:T and G:C base pairs. Mol Immunol. 2009;46:305–320. doi: 10.1016/j.molimm.2008.10.021. [DOI] [PubMed] [Google Scholar]
  56. Stoletzki N, Eyre-Walker A. Synonymous codon usage in Escherichia coli: selection for translational accuracy. Mol Biol Evol. 2007;24:374–381. doi: 10.1093/molbev/msl166. [DOI] [PubMed] [Google Scholar]
  57. Sugahara J, et al. SPLITS: a new program for predicting split and intron-containing tRNA genes at the genome level. In Silico Biol. 2006;6:411–418. [PubMed] [Google Scholar]
  58. Thioulouse J, Dray S. Interactive multivariate data analysis in R with the ade4 and ade4TkGUI packages. J Stat Softw. 2007;22:1–14. [Google Scholar]
  59. Tuller T, et al. An evolutionarily conserved mechanism for controlling the efficiency of protein translation. Cell. 2010;141:344–354. doi: 10.1016/j.cell.2010.03.031. [DOI] [PubMed] [Google Scholar]
  60. van Ooijen G, Knox K, Kis K, Bouget FY, Millar AJ. Genomic transformation of the picoeukaryote Ostreococcus tauri. J Vis Exp. 2012:e4074. doi: 10.3791/4074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Vaulot D, et al. Metagenomes of the picoalga Bathycoccus from the Chile coastal upwelling. PLoS One. 2012;7:e39648. doi: 10.1371/journal.pone.0039648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Viprey M, Guillou L, Ferreol M, Vaulot D. Wide genetic diversity of picoplanktonic green algae (Chloroplastida) in the Mediterranean Sea uncovered by a phylum-biased PCR approach. Environ Microbiol. 2008;10:1804–1822. doi: 10.1111/j.1462-2920.2008.01602.x. [DOI] [PubMed] [Google Scholar]
  63. Weynberg KD, Allen MJ, Ashelford K, Scanlan DJ, Wilson WH. From small hosts come big viruses: the complete genome of a second Ostreococcus tauri virus, OtV-1. Environ Microbiol. 2009;11:2821–2839. doi: 10.1111/j.1462-2920.2009.01991.x. [DOI] [PubMed] [Google Scholar]
  64. Whittle CA, Sun Y, Johannesson H. Evolution of synonymous codon usage in Neurospora tetrasperma and Neurospora discreta. Genome Biol Evol. 2011;3:332–343. doi: 10.1093/gbe/evr018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Worden AZ, et al. Green evolution and dynamic adaptations revealed by genomes of the marine picoeukaryotes Micromonas. Science. 2009;324:268–272. doi: 10.1126/science.1167222. [DOI] [PubMed] [Google Scholar]
  66. Wright F. The “effective number of codons” used in a gene. Gene. 1990;87:23–29. doi: 10.1016/0378-1119(90)90491-9. [DOI] [PubMed] [Google Scholar]
  67. Yutin N, Koonin EV. Hidden evolutionary complexity of nucleo-cytoplasmic large DNA viruses of eukaryotes. Virol J. 2012;9:161. doi: 10.1186/1743-422X-9-161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Zhou T, Weems M, Wilke CO. Translationally optimal codons associate with structurally sensitive sites in proteins. Mol Biol Evol. 2009;26:1571–1580. doi: 10.1093/molbev/msp070. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data
supp_5_5_848__index.html (1.4KB, html)

Articles from Genome Biology and Evolution are provided here courtesy of Oxford University Press

RESOURCES