Abstract
C photosynthesis is a complex trait that sustains fast growth and high productivity in tropical and subtropical conditions and evolved repeatedly in flowering plants. One of the major C lineages is Andropogoneae, a group of 1200 grass species that includes some of the world’s most important crops and species dominating tropical and some temperate grasslands. Previous efforts to understand C evolution in the group have compared a few model C plants to distantly related C species so that changes directly responsible for the transition to C could not be distinguished from those that preceded or followed it. In this study, we analyze the genomes of 66 grass species, capturing the earliest diversification within Andropogoneae as well as their C relatives. Phylogenomics combined with molecular dating and analyses of protein evolution show that many changes linked to the evolution of C photosynthesis in Andropogoneae happened in the Early Miocene, between 21 and 18 Ma, after the split from its C sister lineage, and before the diversification of the group. This initial burst of changes was followed by an extended period of modifications to leaf anatomy and biochemistry during the diversification of Andropogoneae, so that a single C origin gave birth to a diversity of C phenotypes during 18 million years of speciation events and migration across geographic and ecological spaces. Our comprehensive approach and broad sampling of the diversity in the group reveals that one key transition can lead to a plethora of phenotypes following sustained adaptation of the ancestral state. [Adaptive evolution; complex traits; herbarium genomics; Jansenelleae; leaf anatomy; Poaceae; phylogenomics.]
One of the major goals of evolutionary biology is to understand the origins of key innovations underlying the ecological success of particular groups. This requires the study of adaptive traits in a phylogenetic context, using comparisons of species differing in character states (e.g., Bond et al. 2014; Rainford et al. 2014; McGee et al. 2015; Sánchez-García and Matheny 2017). Because species differ in numerous ways, such comparisons must capture the diversity that emerged after the transition in addition to the diversity that preceded it, to precisely identify the properties directly involved in the origin of the trait of interest (e.g., Sprent 2007; Yukawa et al. 2009; Endress 2011; Puttick et al. 2014; Marek and Moore 2015; Clark et al. 2018). Among flowering plants, C photosynthesis represents an adaptive novelty with significant ecological consequences (Sage 2004; Edwards et al. 2010; Christin and Osborne 2014).
The C physiology results from multiple anatomical and biochemical modifications of the ancestral C photosynthetic metabolism, which include 1) the confinement of the primary enzyme of the photosynthetic carbon reduction pathway, ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco), to a compartment isolated from the atmosphere, and 2) increased and cell-specific activity of several enzymes that concentrate CO at the site of Rubisco (Hatch 1987; von Caemmerer and Furbank 2003; Sage 2004). The concentration of CO around Rubisco boosts photosynthetic efficiency, and therefore, growth, particularly in high-light, warm and dry conditions (Long 1999; Atkinson et al. 2016).
Although the C trait requires the modification of multiple components, it has evolved at least 62 times independently during the diversification of flowering plants (Sage et al. 2011). The grass family (Poaceae) encompasses almost half of the C origins, including some with particular ecological and economic relevance, such as the Andropogoneae (Sage 2017). The roughly 1200 species of this tribe are all C, making it the second-most speciose (Sage et al. 2011) and the most dominant C lineage (Lehmann et al. 2019). Andropogoneae include some of the world’s most important cereal and fuel crop plants, such as maize, sorghum, sugarcane, and Miscanthus spp. (e.g., silver grass), as well as numerous species that dominate tropical savannas and some temperate grasslands, including the tallgrass prairies of the Great Plains (Hartley 1958; Bond et al. 2008; Edwards et al. 2010; Kellogg 2015). Besides generating some of the most productive plants in the world, their C trait also increased the diversification of Andropogoneae, which in turn has shaped ecosystems around the world (Osborne 2008; Edwards et al. 2010; Forrestel et al. 2014; Spriggs et al. 2014; Sage and Stata 2015). Because 1) they are separated from other C grass lineages in the phylogeny by several C branches (Grass Phylogeny Working Group II GPWG II 2012 and 2) the different C lineages differ in the underlying genetic changes, Andropogoneae are accepted as a C origin independent from those in other groups of grasses (Sinha and Kellogg 1996; Christin et al. 2008, Christin et al. 2010; Vicentini et al. 2008; Edwards and Smith 2010; Sage et al. 2011; GPWG II 2012; Emms et al. 2016; Gallaher et al. 2019; Niklaus and Kelly 2019).
Due to their economic and ecological importance, Andropogoneae have been included in most studies addressing the evolutionary origins of C photosynthesis in grasses. In particular, efforts to determine the genomic changes involved in the transition to C photosynthesis have focused on comparisons between the two C model Andropogoneae species—maize and sorghum—and distantly related C model grasses (e.g., rice and Dichanthelium; Paterson et al. 2009; Wang et al. 2009; Emms et al. 2016; Studer et al. 2016; Huang et al. 2017). Such a narrow taxon sampling neither covers the diversity of anatomical and biochemical components observed among C grasses (e.g., Hattersley 1984; Christin et al. 2013; Lundgren et al. 2014) nor among C grasses within the Andropogoneae themselves (Renvoize 1982a; Ueno 1995; Sinha and Kellogg 1996). Yet, sampling this diversity is crucial for distinguishing those modifications involved in the early origin of the C pathway in the group as opposed to its subsequent diversification (Christin et al. 2010; Christin and Osborne 2014; Dunning et al. 2017b; Heyduk et al. 2019). Differentiating these scenarios is necessary to determine whether changes for C evolution were concentrated during the initial history of large C groups, or whether they were sustained throughout the diversification of large C clades, as suggested for young C lineages (Dunning et al. 2017b) and hypothesized based on previously available evidence (Christin and Osborne 2014; Heyduk et al. 2019).
The first divergence within Andropogoneae sensuKellogg (2015) separates the subtribe Arundinellinae from Andropogoneae s.s. (tribes Arundinelleae and Andropogoneae, respectively, in Soreng et al. 2017), the latter of which includes the model species maize and sorghum. Until recently, the closest known C relative of Andropogoneae belonged to a different tribe that also included several C groups, and the branch separating them was consequently long (Christin et al. 2008; Vicentini et al. 2008; GPWG II 2012; Kellogg 2015; Soreng et al. 2017). However, the C genera Jansenella and Chandrasekharania have been recently suggested as the sister group of Andropogoneae based on individual chloroplast or nuclear markers (Besnard et al. 2018; Hackel et al. 2018). High-quality genomes are not available for species from these genera or from Arundinellinae, but low-coverage genome data have recently provided insights into the evolution of the nuclear genome in other nonmodel grasses (Besnard et al. 2014, 2018; Olofsson et al. 2016; Bianconi et al. 2018; Dunning et al. 2019). Capitalizing on the availability of such genomic data sets as a side-product of plastome sequencing (e.g., Washburn et al. 2015; Burke et al. 2016; Arthan et al. 2017; Piot et al. 2018), we are now able to phylogenetically track the modifications underlying one of the major innovations of flowering plants.
In this study, we analyze genome-skimming data for 66 grass species covering the diversity of C relatives of Andropogoneae, as well as the earliest diversification within the group, to test the hypothesis that C evolution was sustained throughout the history of old, large C lineages. First, we generate plastome and genome-wide nuclear phylogenetic trees of grasses to confirm the relationships between Andropogoneae and their C relatives and estimate the age of C photosynthesis in the group from a time-calibrated plastome phylogeny. Anatomical traits are then mapped onto the time-calibrated phylogeny to infer the timing of leaf structural transitions in the group. Finally, we look for signatures of adaptive evolution in key C enzymes, testing whether shifts in selective pressures on protein sequences occurred 1) in a C context, and therefore, predated the origin of Andropogoneae, 2) at the base of the clade, during a short period of time representing the initial transition from C to C photosynthesis, or 3) were sustained throughout the history of the group, representing a prolonged period of gradual innovation within the monophyletic C Andropogoneae. Overall, our study presents a comprehensive approach to dissecting a complex adaptive trait and inferring the tempo of key phenotypic transitions in a large group of ecological importance.
Materials and Methods
Species Sampling and Sequencing
A data set of whole-genome sequences of varied sequencing depth was assembled that covers: the main lineages of Andropogoneae including the subtribe Arundinellinae and the Andropogoneae s.s. (sensuKellogg 2015), which represents the earliest known split within this C group (GPWG II 2012); their putative closest C relatives; a variety of other C and C Panicoideae; and representatives of the other grass subfamilies (Supplementary Table S1 available on Dryad at https://doi.org/10.5061/dryad.j6q573n7x). In total, genomic data for 59 grass species were retrieved from previous studies (Besnard et al. 2013, 2018; Lundgren et al. 2015; Burke et al. 2016; Arthan et al. 2017; Dunning et al. 2017a, 2019; Olofsson et al. 2016; Piot et al. 2018; Silva et al. 2017) and similar data for seven species were generated here (Supplementary Table S1 available on Dryad). For the latter, low-coverage sequencing was performed using Illumina technology. Genomic DNA (gDNA) was isolated from ca. 5–10 mg of leaf material using the BioSprint 15 DNA Plant Kit (Qiagen). Four herbarium samples were sequenced at the Genopole platform (Toulouse, France) while the three remaining samples were extracted from silica-preserved material and sequenced at the Genoscope platform (Evry, France). For all samples analyzed at the Genopole, between 100 and 500 ng of double-stranded DNA were used to construct sequencing libraries with the Illumina TruSeq Nano DNA LT Sample Prep kit (Illumina, San Diego, CA, USA), following the manufacturer’s instructions (for more details, see Besnard et al. 2018). Each sample was multiplexed with samples from the same or different projects and paired-end sequenced on 1/24th of an Illumina HiSeq3000 lane (Supplementary Table S1 available on Dryad). At the Genoscope, libraries were constructed using 250 ng of sonicated gDNA. Fragments were end-repaired and 3’-adenylated. NextFlex DNA barcodes (Bioo Scientific Corporation, Austin, TX, USA) were then added using the NEBNext DNA Modules Products (New England Biolabs, MA, USA) followed by clean up with 1 Agencourt AMPure XP (Beckman Coulter, Brea, CA, USA). The ligated product was amplified with 12 PCR cycles using Kapa HiFi HotStart NGS library Amplification kit (Kapa Biosystems, Wilmington, MA) followed by a 0.6 Agencourt AMPure XP purification. Each sample was multiplexed with samples from a different project and paired-end sequenced on 1/48th of an Illumina HiSeq2000 lane (Illumina, USA; Supplementary Table S1 available on Dryad).
Plastome Analyses
A total of 51 plastome sequences were retrieved from NCBI and another 15 were assembled in this study using the genomic data sets (Supplementary Table S1 available on Dryad). For those assembled here, published plastomes of closely related species (same genus) were retrieved from NCBI and used as references for read mapping using Geneious v.9.1.8 (Kearse et al. 2012; Biomatters Ltd., Auckland, NZ, USA) with the Geneious Read Mapper and default sensitivity. A consensus sequence was then called using the highest-quality base criterion, with indels being manually extended/reduced by the assembly of iteratively mapped reads. In cases where no plastome models were available for congeners, a de novo strategy was applied using the software Org.Asm v.1.0 (https://git.metabarcoding.org/org-asm/org-asm) with default parameters. Potential errors in the de novo assembly were corrected by mapping the genomic reads to the assembled sequence using Geneious following the strategy described above.
The 66 plastome sequences were aligned with MAFFT v.7.13 (Katoh and Standley 2013), after excluding the second inverted repeat region to avoid representing the same sequence twice (alignments are available on Dryad). Plastome phylogenetic trees were inferred independently for coding and noncoding regions of the alignment using MrBayes v.3.2.6 (Ronquist et al. 2012) with the GTR+G model. Two analyses were run in parallel and were stopped after reaching a standard deviation of splits 0.01. A consensus tree was obtained after a burn-in period of 25% and trees were rooted on the BOP clade (Bambusoideae, Oryzoideae, and Pooideae), which is sister to the large PACMAD clade that contains the Andropogoneae (GPWG II 2012). Plastome sequences assembled here were deposited in NCBI (see Supplementary Table S1 available on Dryad for accession numbers).
Genome-Wide Nuclear Analyses
Because many of the genomic data sets used here have sequencing depths below the minimum required by existing software (Supplementary Tables S1 and S2 available on Dryad; Bertels et al. 2014; Allen et al. 2018), we have adapted pipelines previously used to obtain nuclear phylogenetic trees from genome-skimming data (Olofsson et al. 2016, 2019; Dunning et al. 2017a, 2019). The general approach consists of extracting nucleotide sequences from each genomic data set by mapping reads onto a reference, which is analogous to existing approaches (Allen et al. 2018). However, all positions receiving mapped reads were considered here regardless of the coverage, to allow genome-skimming samples to be incorporated. Sets of coding sequences (CDSs) were used as references, as they represent the portion of the genome that is sufficiently conserved to allow mapping among distant relatives (Olofsson et al. 2016, 2019), and are present at low copy numbers.
A genome-wide reference data set of putative orthologous sequences of grasses was prepared using the complete CDS data sets of three model grasses representing different degrees of divergence from the Andropogoneae: Sorghum bicolor (part of the focus group), Setaria italica (different tribe in the same subfamily as the focus group), and Brachypodium distachyon (different subfamily). These CDSs were retrieved from Phytozome v.12 (Goodstein et al. 2012). Putative one-to-one orthologs were identified using the BLAST reciprocal best hits (RBH) tool as implemented in Galaxy (Cock et al. 2015). Only CDSs that corresponded to the intersection of the RBH among the three species and which were 500 bp were retained. Genes potentially transferred from organelles to the nuclear genome were identified via BLAST searches (-value = 10 using S. bicolor organellar genomes as reference and subsequently removed from this data set. The nuclear genome-wide reference data set consisted of 9161 putative orthologs. Each of these genes is expected to descend from a single gene in the common ancestor of the two main groups of grasses, the BOP and PACMAD clades, but might have been lost or duplicated in some derived groups. Collapsing such duplicates allows the extraction of phylogenetically useful markers. Downstream analyses were conducted using the S. italica sequence for each group of orthologs, which is closely related, yet outside of the focus group, and therefore, maximizes the evenness of the coverage in the group (Supplementary Fig. S1 available on Dryad).
To minimize the amount of missing data, the nuclear analyses focused on the species of Andropogoneae, Jansenelleae, and Paspaleae for which the estimated sequencing depth was above 1. Arundinella nepalensis was added despite an estimated depth of 0.8 since it is one of only three Arundinellinae for which sequence data were available. Four species outside of Panicoideae for which high-coverage data were available were added to root the tree. Gene models corresponding to each of the 9161 putative orthologs were assembled independently for each of the 37 grass species included in this reduced data set. First, raw genomic data sets were filtered using the NGSQC Toolkit v.2.3.3 (Patel and Jain 2012) to retain only high-quality reads (i.e., 80% of the bases with Phred quality score 20), and to remove adaptor contamination and reads with ambiguous bases. The retained reads were subsequently trimmed from the 3’ end to remove bases with Phred score 20. The cleaned genomic data sets were then mapped as unpaired reads to the genome-wide CDS reference using Bowtie2 v.2.3.2 (Langmead and Salzberg 2012) with default parameters, which map reads identical on 90% or more bases, independently of the read length. Consensus sequences were called based on variant call format files from read alignments with mapping quality score 20 using the mpileup function of Samtools v.1.5 (Li et al. 2009) implemented in a bash-scripted pipeline, modified from Olofsson et al. (2016, 2019; available on the Dryad data repository). Sites with nucleotide variation among mapped reads were coded as ambiguous bases following IUPAC codes. Consensus sequences shorter than 200 bp were removed from the data set. Sites within each alignment with more than 50% missing data were then trimmed using trimAl v1.4 (Capella-Gutiérrez et al. 2009). Only gene alignments 300 bp (with 200 bp per individual sequence) and containing 50% of the total number of species after trimming were retained for subsequent analyses. Maximum-likelihood (ML) trees were then inferred for each gene alignment using RAxML v.8.2.4 (Stamatakis 2014), with a GTR+CAT substitution model and 100 bootstrap pseudoreplicates. To remove poorly informative markers, gene trees with 50% of branches with bootstrap support 50% were discarded. A multigene coalescent tree using Astral v.5.6.2 (Mirarab and Warnow 2015) was then inferred using the filtered set of gene trees after collapsing branches with bootstrap support values 50. Because phylogenomic analyses can be biased by the reference and the amount of missing data (Bertels et al. 2014; Xi et al. 2016; Olofsson et al. 2019), we repeated the mapping and filtering with different filtering stringencies and an alternative reference species (S. bicolor; Supplementary Fig. S1 available on Dryad).
In addition to the genome-wide data set, eight individual nuclear markers previously used to infer grass phylogenies (GPWG 2001; Bomblies and Doebley 2005; Doust et al. 2007; Christin et al. 2012b; Estep et al. 2012, 2014) were investigated, namely aberrant panicle organization 1 (apo1), the gene encoding arogenate dehydrogenase (arodeh), the DELLA protein-encoding gene dwarf 8 (dwarf8), floricaula/leafy-like (floricaula), knotted 1 (kn1), the gene encoding phytochrome B (phyB), retarded palea 1 (rep1), and the gene encoding granule-bound starch synthase 1 (GBSSI or waxy). Sequences of these genes were manually assembled for Garnotia stricta var. longiseta (Arundinellinae) and the putative C sister group of Andropogoneae (i.e., Jansenella and Chandrasekharania) using the reference-guided approach from Besnard et al. (2018). In brief, CDSs from S. bicolor were used as seeds to map reads, which were assembled into contigs by recursively incorporating pairs of reads that overlapped with the assembly on at least 30 bp. All sequences of nuclear markers assembled here were deposited in NCBI (see Supplementary Table S3 available on Dryad for accession numbers). Preliminary visualization of read alignments for Jansenella neglecta suggested two divergent copies for all genes, but the low sequencing depth for this accession prevented phasing the reads into distinct copies. We, therefore, did not include J. neglecta in the phylogenetic analyses of nuclear markers. It is worth mentioning, however, that the short segments that were recovered indicated that one of the copies was very similar to the sequence of J. griffithiana, suggesting a hybrid (e.g., allopolyploid) origin of J. neglecta. The assembled genomic sequences were aligned with additional data retrieved from NCBI nucleotide databases using MAFFT. Phylogenetic trees were inferred for each of the eight markers using MrBayes, running two parallel chains for 40,000,000 generations. Run convergence and appropriateness of the burn-in period were verified using Tracer v.1.6 (Drummond and Rambaut 2007). The burn-in period was then set to 10,000,000, and a majority-rule consensus was inferred from the posterior trees.
Molecular Dating
Divergence times were estimated for the plastome data set using a relaxed molecular clock as implemented in BEAST v.1.8.4 (Drummond and Rambaut 2007). The plastome alignment was reduced to CDSs (57,239 bp), to remove intergenic spacers that undergo a large number of insertions and deletions, and are more difficult to align. The phylogenetic tree was time-calibrated by fixing the age of the split between the PACMAD and BOP clades to 51.2 Ma (based on Christin et al. 2014), using a normal distribution with standard deviation of 0.0001. This age represents the scenario based on macrofossils only, but we also report the equivalent ages from a dating scenario including phytoliths (82.4 Ma for the same node; Christin et al. 2014). These microfossils are abundant in the fossil record, but assigning them to modern lineages of grasses is complicated by their restricted number of characters (Prasad et al. 2005, 2011; Christin et al. 2014; Strömberg et al. 2018; for a discussion on the fossil record of grasses, see Kellogg 2015). The GTR+G substitution model was used, with the Yule model as speciation prior and a lognormal uncorrelated relaxed clock (Drummond et al. 2006). Three MCMC chains were run in parallel for at least 250 million generations, sampling every 10,000 generations. The runs were monitored using Tracer v.1.6 (Rambaut et al. 2013) checking for convergence and effective sample sizes 100 for all parameters. The burn-in period was set to the point of convergence of the runs (25%) and all trees sampled after that were combined. Median ages were summarized on the maximum clade credibility tree.
Carbon Isotopes and Leaf Anatomy
Photosynthetic types for most species were retrieved from the literature (Osborne et al. 2014; Supplementary Table S4 available on Dryad). The photosynthetic type of J. griffithiana was verified through analysis of carbon isotopes. Leaf fragments from the sequenced herbarium specimen were analyzed using an ANCA GSL preparation module coupled to a Sercon 20–20 stable isotope ratio mass spectrometer (PDZ Europa, Cheshire, UK). The carbon isotopic ratio (C, in ‰) was reported relative to the standard Pee Dee Belemnite (PDB). Values of C ranging from ‰ to ‰ are typical of C plants, and values higher than ‰ indicate that the plants grew using a C pathway (O’Leary 1988).
Leaf anatomical phenotypes were recorded for members of Andropogoneae and their C relatives, using data from the literature (Supplementary Table S4 available on Dryad; Renvoize 1982a, Renvoize 1982b, Renvoize 1982c, Renvoize 1985; Watson et al. 1992; Ueno 1995; Zuloaga et al. 2000; Christin et al. 2013). In addition, new leaf cross-sections were prepared for the herbarium samples of J. griffithiana and G. stricta used for genome sequencing. A leaf fragment (ca. 2 cm) was rehydrated by warming the sample in dHO up to 60 C followed by immersion in 1% KOH overnight. The rehydrated fragment was then dehydrated through an ethanol series from 10% to 100% EtOH, with steps of 30 min, and resin-infiltrated with Technovit 7100 (Heraeus Kulzer GmbH, Wehrheim, Germany). Cross-sections of 9 m were obtained using a microtome (Leica RM 2245, Leica Biosystems Nussloch GmbH, Nussloch, Germany) and stained with Toluidine Blue O (Sigma-Aldrich, St. Louis, MO, USA). Micrographs were obtained using an Olympus BX51 microscope coupled with an Olympus DP71 camera (Olympus Corporation, Tokyo, Japan). A number of qualitative and quantitative leaf characters related to the C function were measured on the cross-sections following Christin et al. (2013): number of bundle sheath layers, distance between the centers of consecutive veins (interveinal distance), minimal distance between the bundle sheaths of consecutive veins (bundle sheath distance), fraction of the mesophyll plus bundle sheath area represented by the inner bundle sheath (% inner sheath area), presence/absence of distinctive cells (sensuTateoka 1958; Renvoize 1982b), and localization of starch production.
Analyses of Protein Sequence Evolution
To test for episodes of adaptive evolution of C enzymes during different periods of the history of Andropogoneae, branch model tests using the ratio of nonsynonymous to synonymous substitutions rates (dN/dS; Yang 1998; Yang and Nielsen 1998) were conducted on alignments of five genes encoding proteins known to play important roles in the C pathway (Hatch 1987; Huang et al. 2017): NADP-malate dehydrogenase (NADP-MDH; gene nadpmdh-1P1), NADP-malic enzyme (NADP-ME; gene nadpme-1P4), phosphoenolpyruvate carboxykinase (PCK; gene pck-1P1), phosphoenolpyruvate carboxylase (PEPC; gene ppc-1P3), and pyruvate, phosphate dikinase (PPDK; gene ppdk-1P2). To test whether shifts in selective pressures could be related to processes other than C evolution, an alternative set of 12 genes not known to be involved in C photosynthesis were used as negative controls. These included some paralogs of the same core C genes for which sequences were available in NCBI database and the individual nuclear markers used for phylogenetic analyses (see above; except apo1, rep1, and floricaula, for which no C PACMAD species besides Jansenella and Chandrasekharania was available). For each of these 17 genes, complete or partial CDSs for the putative C sister group of Andropogoneae and G. stricta were manually assembled using the approach described above. Additional sequences were extracted using BLAST (e-value = 10 from 1) the CDS data set of seven published genomes (S. bicolor, Zea mays, S. italica, Panicum hallii, Panicum virgatum, B. distachyon, and Oryza sativa) retrieved from Phytozome 2) the NCBI nucleotide database, and 3) the transcriptomes of 34 PACMAD species retrieved from Washburn et al. (2017). All sequences from each gene were aligned using MAFFT and the alignment was visually inspected. Low-confidence alignment regions containing indels were removed to avoid erroneously inflating estimates of nonsynonymous substitutions. Sequences from multiple accessions of the same species, paralogs, and sequences containing stop codons or frameshift mutations were also removed before the analysis. The third positions of codons were used for phylogenetic inference to decrease biases due to adaptive evolution (Christin et al. 2012a). Phylogenetic trees were obtained using Bayesian inference with MrBayes as described above for the individual nuclear markers. Branch model tests were conducted using the consensus gene trees without collapsing unsupported nodes. These tests were repeated on the species tree obtained from the multigene coalescent analysis (see above), after pruning species for which sequences of the gene were not available. The duplication events inferred from the gene tree were incorporated into this phylogeny by duplicating the corresponding branches. C species outside Andropogoneae were pruned from all trees before analyses to avoid either 1) inflating the dN/dS estimate for the background branches as a result of independent selection signals in other C groups, or 2) underestimating dN/dS in the foreground branches by misidentifying the paralog used for C photosynthesis in these other taxa.
A number of branch models were optimized using codeml as part of PAML v. 4.9 (Yang 2007). The null model, which assumes a single dN/dS ratio for all branches, was compared to several branch models that hypothesized a different dN/dS ratio (i.e., shift in the selective pressure) in a set of foreground branches defined a priori: 1) the branch leading to Andropogoneae and its C sister group (shifts in selective pressures before the transition to C; 2) the branch leading to Andropogoneae (shifts in selective pressures during the transition to C; and 3) the branches leading to each of the two main Andropogoneae groups Arundinellinae and Andropogoneae s.s. (two independent shifts in selective pressures just after the transition to C). Each model was repeated with a sustained shift in selective pressures from the selected branches to all descendants. The best model was selected using the Akaike Information Criterion, after verifying that it was significantly better than the null model (at a significance level of 5%) as assessed via a likelihood ratio test, with a -value adjusted for multiple testing using the Bonferroni correction.
The number of amino acid substitutions through time was assessed by estimating via ML the branch lengths on the amino acid alignment while constraining the topology to that obtained on third positions of codons. This was performed for all genes analyzed here, using IQ-tree v.1.6.1 (Nguyen et al. 2015) with an automated selection of the model of protein sequence evolution.
Results
Plastome and Nuclear Data sets
The plastome alignment of 66 species was 140,427 bp long, of which 56,991 bp corresponded to CDS. The mean estimated sequencing depth for the plastomes ranged from 90 to 4602 reads per site across species. The nuclear data set consisted of 37 species and 365 genes. The alignments were on average 701 bp long (95% range = 370–1532 bp, total = 255,870 bp) and 70% complete (95% range = 63–80%), with an average of 64 parsimony informative sites (95% range = 27–163, total = 23,367; Supplementary Fig. S1 available on Dryad). As expected, less stringent filtering parameters allowed more genes to be retained, and more parsimony informative sites per gene, but resulted in higher amounts of missing data (Supplementary Fig. S1 available on Dryad). The number of genes producing resolved phylogenetic trees was drastically reduced when using the full set of 66 species (Supplementary Fig. S1f–j available on Dryad). The number of genes retained after filtering when using an alternative reference species (S. bicolor) was on average 62% higher, but similar patterns of missing data across data sets were observed.
Plastome and Nuclear Phylogenetic Trees
The phylogenetic trees inferred from plastomes and nuclear genomes were largely congruent with previous studies, with discrepancies between the two types of markers as previously reported (GPWG II 2012; Washburn et al. 2017; Moreno-Villena et al. 2018; Dunning et al. 2019). The multigene coalescent tree was generally congruent with the plastid phylogeny. This nuclear analysis revealed gene discordance for many nodes, which indicates incomplete lineage sorting and possibly hybridization in some parts of the family (Dunning et al. 2019). The relationship between Andropogoneae and its C relatives was however consistent among the plastome and nuclear data sets, with the C genera Jansenella and Chandrasekharania forming a strongly supported group sister to Andropogoneae (Fig. 1 and Supplementary Figs S2–S4 available on Dryad). This relationship was also highly supported by all nuclear data sets obtained with different filtering thresholds (Supplementary Table S5 and Fig. S3 available on Dryad), as well as in all trees inferred from individual nuclear markers (Supplementary Fig. S4 available on Dryad), except for two genes, in which Jansenella and Chandrasekharania formed a paraphyletic group (apo1; Supplementary Fig. S4a available on Dryad) or were nested within Andropogoneae (floricaula; Supplementary Fig. S4d available on Dryad). Our data and analyses, therefore, provide strong evidence that the clade formed by Jansenella and Chandrasekharania (hereafter Jansenelleae) is the extant C lineage most closely related to the Andropogoneae grasses.
Within Andropogoneae, the genera Garnotia and Arundinella (subtribe Arundinellinae) either formed a group that was sister to Andropogoneae s.s. (Supplementary Figs S2–S4f and h and S5b and c available on Dryad), or were paraphyletic (Fig. 1 and Supplementary Figs S4c, d, and g and S5a available on Dryad). Short internal branches, incongruence in the multigene coalescent trees (Fig. 1 and Supplementary Fig. S3 available on Dryad), and low posterior probability support values (Supplementary Fig. S4 available on Dryad) within Andropogoneae . are associated with high incongruence between nuclear and plastome trees, suggesting a complex history for the group, which might be related to a rapid radiation and frequent hybridization (Estep et al. 2014). In particular, nuclear and plastome trees identify different taxa as sister to the rest of Andropogoneae . (Zea mays in the multigene coalescent trees, Lasiurus scindicus in the plastome trees; Fig. 1 and Supplementary Figs S2 and S3 available on Dryad).
Divergence Time Estimates
The confirmation of the sister relationship between Jansenelleae and Andropogoneae allows for refined divergence time estimates, as the most recent divergence from a C relative (stem group node) represents the upper bound for the origin of a trait that could have evolved at any point along the branch leading to the most recent common ancestor of all species with the trait of interest (crown group node). Divergence times were estimated using the plastome data set (CDSs). Based on a secondary calibration considering only macrofossils, the divergence between Andropogoneae and its C sister lineage was estimated at 21.1 (95% HPD = 14.6–27.6) Ma (Table 1; Supplementary Fig. S5 available on Dryad). These dates would be pushed back to 34 (95% HPD = 23.5–44.4) Ma if a microfossil dating scenario was followed. The first split within Andropogoneae was estimated at 17.9 (95% HPD = 12.2–23.7) Ma (28.8 Ma under a microfossil dating scenario).
Table 1.
Clade | Macrofossils | Microfossils |
---|---|---|
BOP crown | 34.7 (24.8–45.6) | 55.8 (39.9–73.4) |
PACMAD crown | 43.4 (34.6–51.1) | 69.8 (55.7–82.2) |
Panicoideae crown | 36.4 (26.3–46.6) | 58.6 (42.3–75.0) |
Jansenelleae/Andropogoneae split | 21.1 (14.6–27.6) | 34.0 (23.5–44.4) |
Andropogoneae crown | 17.9 (12.2–23.7) | 28.8 (19.6–38.1) |
Andropogoneae crown | 11.9 (8.0–16.0) | 19.2 (12.9–25.8) |
Median ages are given in million years ago (Ma), with 95% Highest Posterior Density (HPD) intervals in parentheses.
Secondary calibration using Christin et al. (2014) estimates based only on macrofossils.
Secondary calibration using Christin et al. (2014) estimates based on macrofossils plus microfossils.
Anatomical Changes During the Early Diversification of Andropogoneae
Anatomical and biochemical characters linked to C photosynthesis were recorded based on the literature and on new measurements for G. stricta and J. griffithiana (Table 2 and Supplementary Table S4 available on Dryad). Our carbon isotope analysis confirmed that J. griffithiana is a C plant (C = -27.28%). Its leaf anatomy is typical of C grasses, with two layers of bundle sheath cells (Fig. 2a), which contradicts previous reports of a single sheath (Metcalfe 1960; Türpe 1970). As with other C species, it has a large distance between consecutive bundle sheaths, and no minor veins or distinctive cells (Fig. 2b). In addition, the proportion of the leaf occupied by the inner bundle sheath falls within the range observed for other C grasses (Christin et al. 2013). The leaf anatomy of G. stricta was similar to that previously reported for other Arundinellinae (Renvoize 1982b, 1982c; Watson et al. 1992). Its veins are surrounded by a single bundle sheath and are separated by a large number of mesophyll cells (Fig. 2b). Multiple distinctive cells separate the veins, and staining suggests starch production in both bundle sheaths and distinctive cells (Fig. 2b). Similar anatomical structures are observed in other Arundinellinae (Supplementary Table S4 available on Dryad; Renvoize 1982b, 1982c; Watson et al. 1992) but also in the genus Arthraxon (Ueno 1995), which is an Andropogoneae . representative that diverged early from the rest of the group (e.g., GPWG II 2012; Estep et al. 2014). By contrast, most Andropogoneae lack distinctive cells and decrease the distance between consecutive veins via the proliferation of minor veins (Fig. 2b; Table 2; Supplementary Table S4 available on Dryad).
Table 2.
Lineage/species | Bundle sheath layers | Interveinal distance (m) | % Inner sheath area | Bundle sheath distance (m) | Distinctive cells | Starch in BSC |
---|---|---|---|---|---|---|
Jansenelleae (C | ||||||
Jansenella griffithiana | 2 | 471 | 0.01 | 403 | A | A |
Arundinellinae (C | ||||||
Arundinella nepalensis | 1 | 215 | 0.27 | 101 | P | P |
Garnotia stricta | 1 | 191 | 0.11 | 187 | P | P |
Andropogoneae s.s. (C | ||||||
Arthraxon sp. | 1 | — | — | — | P | P |
Chrysopogon pallidus | 1 | 112 | 0.23 | 29 | A | P |
Heteropogon contortus | 1 | 80 | 0.21 | 32 | A | P |
Ischaemum afrum | 1 | 109 | 0.24 | 52 | A | P |
Sorghum halepense | 1 | 119 | 0.20 | 53 | A | P |
Positive Selection in C Enzymes
Phylogenetic trees for genes encoding C enzymes inferred from third positions of codons were compatible with plastome and nuclear genome trees, with most of the variation being observed within Andropogoneae s.s. and Paniceae (Supplementary Fig. S6 available on Dryad). In all cases, Jansenelleae was sister to Andropogoneae, except in pck-1P1, where Jansenelleae formed a group with Arundinellinae that was sister to Andropogoneae s.s. (Supplementary Fig. S6c available on Dryad). Arundinellinae and Andropogoneae . represented the first split within Andropogoneae in nadpmdh-1P1, nadpme-1P4, and ppc-1P3, but not in ppdk-1P2, in which Arundinellinae is paraphyletic (Supplementary Fig. S6e available on Dryad) and pck-1P1, as mentioned above. Lineage-specific duplications are observed within Andropogoneae and Arundinellinae species for nadpme-1P4, and only in Andropogoneae . for nadpmdh-1P1 (Supplementary Fig. S6a and b available on Dryad), as previously reported (Rondeau et al. 2005; Christin et al. 2009b; Wang et al. 2009).
The inferred trees were used to track shifts in selective pressures in Andropogoneae, independently for each gene. In three core C genes analyzed (nadpmdh-1P1, nadpme-1P4, and ppc-1P3), the best model inferred an increase of the dN/dS ratio after the split between Andropogoneae and Jansenelleae, which was sustained in the descendant branches (Table 3). As compared to the background branches, the estimated dN/dS ratio was 2–3.7 times higher in Andropogoneae for these genes. In pck-1P1 and ppdk-1P2, the best model indicates two independent three- and four-fold increases of the dN/dS ratio at the base of each of Arundinellinae and Andropogoneae s.s., which was sustained in the descendant branches in pck-1P1, but not in ppdk-1P2.
Table 3.
Branch models | |||||||||
---|---|---|---|---|---|---|---|---|---|
Scenarios of adaptive evolution | Single episode preceding C evolution | Single episode during C evolution, at the base of Andropogoneae | Two episodes during C evolution, at the base of Arundinellinae and Andropogoneae s.s. | ||||||
Gene | N | Null model | Basal branch | Sustained | Basal branch | Sustained | Basal branches | Sustained | dN/dS |
Core C genes | |||||||||
nadpmdh-1P1 (NADP-MDH) | 28 | 13.53 | 14.82 | 5.90 | 14.74 | 0.00* | 14.55 | 1.46* | 0.07–0.14 |
nadpme-1P4 (NADP-ME) | 63 | 205.29 | 202.70 | 42.20* | 205.26 | 0.00* | 202.73 | 2.89* | 0.08–0.30 |
pck-1P1 (PCK) | 41 | 36.77 | 38.11 | 5.40* | — | — | 38.77 | 0.00* | 0.02–0.06 |
ppc-1P3 (PEPC) | 51 | 105.82 | 107.82 | 7.83* | 82.81* | 0.00* | 85.87* | 21.78* | 0.03–0.09 |
ppdk-1P2 (PPDK) | 30 | 100.33 | 97.23 | 51.07* | 83.78* | 37.69* | 0.00* | 53.91* | 0.10–0.44 |
Paralogs of C genes | |||||||||
nadpme-1P1 | 30 | 1.52 | 1.56 | 0.00 | 1.28 | 1.16 | 3.33 | 1.73 | 0.09 |
nadpme-1P2 | 21 | 0.00 | 0.72 | 1.59 | 2.00 | 0.97 | 0.55 | 0.93 | 0.09 |
nadpme-1P3 | 24 | 10.58 | 0.00* | 4.73 | 4.77 | 7.75 | 9.97 | 10.05 | 0.05–0.17 |
ppc-1P4 | 30 | 3.92 | 4.08 | 5.90 | 5.23 | 5.30 | 0.00 | 3.84 | 0.07 |
ppc-1P5 | 30 | 0.00 | 2.00 | 1.84 | 1.83 | 1.49 | 1.84 | 1.80 | 0.06 |
ppc-1P7 | 19 | 11.33 | 13.29 | 5.59 | — | — | 12.65 | 0.00* | 0.09–0.04 |
ppdk-1P1 | 12 | 6.19 | 8.17 | 6.31 | 7.39 | 0.37 | 0.00 | 0.63 | 0.18 |
Other nuclear genes | |||||||||
arodeh | 30 | 0.00 | 1.99 | 1.70 | — | — | 1.85 | 1.77 | 0.11 |
dwarf8 | 39 | 46.51 | 45.75 | 23.99* | 47.23 | 2.35* | 48.34 | 0.00* | 0.06–0.16 |
knotted1 | 13 | 4.15 | 0.00 | 5.45 | — | — | 3.60 | 5.94 | 0.07 |
phyB | 55 | 7.95 | 4.67 | 8.35 | 0.00 | 9.86 | 9.71 | 9.85 | 0.09 |
waxy | 55 | 0.00 | 1.74 | 1.70 | — | — | 1.90 | 1.91 | 0.05 |
Note: The best-fit model is highlighted in bold. Asterisks indicate significant likelihood ratio tests (LRT) against the null model after Bonferroni correction. Two hypotheses of potential enzyme adaptation were tested for each scenario, the first assuming a shift in selective pressure only in the basal branch(es) of the group specified (‘Basal branch’), the second assuming a sustained shift from the basal branch including all descendant branches (‘Sustained’). Missing values correspond to trees in which Andropogoneae was not monophyletic.
Delta-AIC values relative to the best-fit model for each gene are shown.
C gene annotation following Moreno-Villena et al. (2018).
Number of sequences in the alignment.
dN/dS ratios of background and foreground branches, respectively, estimated for the best-fit model, except in cases where the null was the best-fit model, for which there was a single dN/dS estimate for all branches.
A shift of the dN/dS ratio was identified in three of the 12 other genes used as negative controls (Table 3). In the case of nadpme-1P3, a gene encoding a NADP-ME isoform not involved in the C cycle of any of the previously screened species (Moreno-Villena et al. 2018), the best model indicates an increase in dN/dS in the branch leading to the most recent common ancestor of Andropogoneae and Jansenelleae (Table 3). In dwarf8, an increase of dN/dS occurred on branches leading to each of the Arundinellinae and Andropogoneae s.s. and was sustained in the descendant branches. Finally, the best model for ppc-1P7, a gene encoding a PEPC isoform also not co-opted for the C cycle in any species previously analyzed (Moreno-Villena et al. 2018), assumed a decrease of dN/dS in the two branches leading to each of the Arundinellinae and Andropogoneae s.s. and their descendants. Therefore, out of the 12 negative controls, only dwarf8 presents an increase of dN/dS that coincides with C evolution in the group. This gene, which affects flowering time (Thornsberry et al. 2001), has been linked to the adaptation of some Andropogoneae to varying climates (Camus-Kulandaivelu et al. 2006).
All positive selection tests were repeated using the multigene coalescent species tree topology (Fig. 1) instead of the trees inferred from third positions of codons (Supplementary Table S6 available on Dryad). The results mostly confirmed those reported above, except that no shift of selective pressure was observed in the core C genes nadpmdh-1P1 and pck-1P1, nor in nadpme-1P3 and dwarf8. Because sequences of the genes were available for species not included in the species tree, the sampling was reduced in these analyses compared to those based on the gene trees. The reduced evidence for positive shifts might, therefore, reflect a lower statistical power of the tests based on the species tree.
To visualize the amount of amino acid substitutions during different periods of the Andropogoneae history, we estimated branch lengths from amino acid sequences after excluding C species outside of Andropogoneae. Overall, numerous substitutions occurred in nadpmdh-1P1 and ppc-1P3 at the base of Andropogoneae, and increased rates compared to non-C species on these genes were sustained throughout Andropogoneae (Figs 3 and 4). By contrast, bursts of amino acid substitutions in nadpme-1P4 and ppdk-1P2 occurred at the base of both Arundinellinae and Andropogoneae lineages, with the first coinciding with events of gene duplication. An increased number of substitutions is also observed in pck-1P1, but it was restricted to a few branches within these groups (Figs 3 and 4). The same patterns were observed when C species outside Andropogoneae were included in the analyses (Supplementary Fig. S7 available on Dryad), as increased rates of amino acid substitution in all five genes characterize most C grasses, which highlights the highly convergent nature of C evolution in grasses. Similar bursts of amino acid substitutions were observed in the negative controls nadpme-1P3 (on the branch leading to Jansenelleae and Andropogoneae) and dwarf8 (in a few derived groups within Andropogoneae s.s.; Supplementary Fig. S7 available on Dryad).
Discussion
A Single Origin of the New C Physiology Followed by Continued Anatomical Changes
In previous grass phylogenetic trees, Andropogoneae formed a large clade entirely composed of C species, and its closest known C relatives belonged to a different group containing multiple independent C lineages (GPWG II 2012; Gallaher et al. 2019). The branch leading to Andropogoneae was, therefore, long, preventing the precise inference of changes leading to C evolution in this group. We confirm here that Jansenella and Chandrasekharania form the sister group of Andropogoneae, both based on plastomes and on markers spread across the nuclear genomes (Fig. 1 and Supplementary Figs S2–S5 available on Dryad). This, combined with a distinctive morphology, supports their recognition as a separate tribe, Jansenelleae (Appendix 1). We further confirm that the group is C, as previously suggested (Türpe 1970; Renvoize 1982a, 1985), providing a shorter branch connecting the last known C ancestor of Andropogoneae (most recent ancestor shared with Jansenelleae) and the first split within the group. The anatomy of Jansenella is typical of C grasses, with a large distance between consecutive veins, a double bundle sheath, and no minor veins or distinctive cells (Fig. 2). In addition, the genes encoding C-related enzymes from Jansenella and Chandrasekharania are similar to those of other C grasses, with no trace of positive selection or increased rates of amino acid replacement (Figs 3 and 4 and Supplementary Fig. S7 available on Dryad; Table 3). We, therefore, conclude that the last common ancestor of Jansenelleae and Andropogoneae was a typical C plant, with the anatomical and genetic characteristics common to all PACMAD grasses (Christin et al. 2013; Emms et al. 2016; Moreno-Villena et al. 2018). The changes responsible for the emergence of a C pathway, therefore, happened after the divergence between Andropogoneae and Jansenelleae. Previous studies comparing C and C anatomical traits or genomes typically sampled only a few Andropogoneae species, preventing them from assigning changes to different phases of C evolution (Christin et al. 2013; Emms et al. 2016; Huang et al. 2017), as enabled here thanks to our denser species sampling.
The comparison of anatomical types suggests multiple modifications during the early diversification of Andropogoneae. All species in this group have a single bundle sheath (Renvoize 1982a), which is ontogenetically equivalent to the inner sheath of C grasses (i.e., the mestome sheath; Dengler et al. 1985). The large distance between consecutive veins, as observed in Jansenella (Table 2), is reduced in Arundinellinae by the insertion of one or multiple distinctive cells, where Rubisco can be segregated (Fig. 2; Dengler and Dengler 1990; Sinha and Kellogg 1996). While these distinctive cells are shared by some Andropogoneae (Ueno 1995), most use a different strategy to reduce the distance between consecutive veins, which consists of the proliferation of minor veins (Table 2; Lundgren et al. 2014, 2019). Distinctive cells and minor veins have similar developmental patterns (Dengler et al. 1996), and the former could be precursors of the latter, in which case minor veins could represent the specialization of ancestral distinctive cells after the split of Andropogoneae from Arundinellinae. Alternatively, the ancestral state of the group could be minor veins that later degenerated in Arundinellinae and some Andropogoneae s.s., or else these specializations evolved multiple times during the early diversification of the group. In all cases, the phylogenetic distribution of distinctive cells and minor veins shows that changes following the initial transition to C led to diverse anatomical solutions for the effective segregation of biochemical reactions.
Modifications of C Enzymes Occurred Throughout the Diversification of Andropogoneae
The emergence of a C pathway generally requires the co-option of multiple enzymes already existing in the C ancestor via their massive upregulation (Hibberd and Covshoff 2010; Moreno-Villena et al. 2018). This is followed by adaptation of their kinetics for the new catalytic context through numerous amino acid replacements (Bläsing et al. 2002; Tausta et al. 2002; Christin et al. 2007; Huang et al. 2017). Tests of shifts in selective pressures conducted here for multiple C-encoding genes from Andropogoneae and other grasses confirm that the evolution of C genes in this group involved an increased fixation of nonsynonymous mutations (Table 3; Christin et al. 2007, 2009b; Wang et al. 2009; Huang et al. 2017). Genes for the key enzyme of the C pathway, PEPC, underwent convergent changes in numerous groups of grasses, and most were shared between Arundinella and Andropogoneae (Christin et al. 2007). However, only a fraction of the changes is also observed in Garnotia stricta (data not shown), indicating that the enzyme underwent adaptive changes both before and after the diversification of major Andropogoneae lineages. Consistent with this, the branch models did not favor increased amino acid replacements solely at the base of the whole clade, but a sustained increase throughout the diversification of the group (Table 3). A sustained shift in selective pressure on the branch leading to Andropogoneae after the split from Jansenelleae is also supported for genes encoding NADP-MDH and NADP-ME, while independent shifts are observed at the base of Arundinellinae and Andropogoneae s.s. for PCK (Table 3). A comparison of branch lengths indeed shows increased rates of amino acid replacements at the respective branches for all five core C genes (Fig. 3). Our analyses, therefore, confirm that massive changes happened at the base of Andropogoneae, and models assuming that increased fixation of nonsynonymous mutations persisted after early shifts in selective pressures are strongly favored for four out of the five core C genes analyzed (Table 3). In addition, increased rates of sustained amino acid replacements are observed on many branches within the group (Figs 3 and 4). Increased rates of amino acid replacements were also detected in genes not known to be directly involved in C biochemistry, such as the NADP-ME paralog nadpme-1P3, and dwarf8. While the selective drivers for changes in nadpme-1P3 are not known, the branches with elevated rates of nonsynonymous mutations do not strictly coincide with the C phenotype (Supplementary Fig. S7 available on Dryad; Table 3). The gene dwarf8 is linked to flowering time in maize, and selective sweeps in the genomic region including dwarf8 have been associated with climatic adaptations in maize (Camus-Kulandaivelu et al. 2006, 2008). We conclude that, while other genes undergo elevated rates of amino acid substitutions for different reasons, important alterations of enzymes for the initial build-up of a C cycle at the base of Andropogoneae were followed by continued adaptation throughout the diversification of the group.
While some enzymes participate in all biochemical variants of the C cycle (Kanai and Edwards 1999), the identity of the enzyme(s) responsible for the decarboxylation of CO in the bundle sheath varies among C lineages (Prendergast et al. 1987; Sage et al. 2011). Our analyses concordantly indicate that the decarboxylating enzyme PCK underwent rounds of amino acid replacements only in some derived groups within Andropogoneae (Figs 3 and 4), without evidence of positive selection at the base of the whole group (Table 3). This conclusion was reached previously (Christin et al. 2009a) and supports later additions of a PCK-catalyzed decarboxylation reaction in some of the Andropogoneae (Gutierrez et al. 1974; Walker et al. 1997; Wingler et al. 1999). However, our data also indicate that NADP-ME, which is the main decarboxylating enzyme in all Andropogoneae, similarly acquired its C properties relatively late in the history of the group. Again, the best model assumed adaptive evolution throughout Andropogoneae (Table 3). The gene nadpme-1P4 for NADP-ME was duplicated independently in Andropogoneae , Garnotia and Arundinella, and amino acid replacements are especially prevalent in one of the copies in each group (Fig. 3 and Supplementary Fig. S7 available on Dryad; Christin et al. 2009b). These observations point to independent adaptation of the enzyme kinetics, but the expression patterns also likely evolved independently in Andropogoneae and Arundinellinae. Indeed, modifications of the promoter regions allowing the C-specific binding of a transcription factor are restricted to one of the Andropogoneae duplicates that fulfills the C function (Borba et al. 2018), which evolved after the split from Arundinellinae. We, therefore, hypothesize that the common ancestor of the Andropogoneae performed a C cycle based on several decarboxylating enzymes relatively abundant in many C grasses (Moreno-Villena et al. 2018), with some amino acid changes in the other C enzymes. Further modifications, which canalized the use of the NADP-ME encoded by nadpme-1P4, added a PCK shuttle and/or improved the action of PEPC, PPDK and NADP-MDH happened later during the diversification of the group, so that its numerous C species represent a diversity of realizations of the C pathway. Similar conclusions were reached for small groups that evolved the C trait more recently (Dunning et al. 2017b), but we show here for the first time that the continuous adaptation of the C trait can be sustained over long evolutionary periods, leaving traces even within one of the largest C groups.
Physiology Evolved During the Early Miocene in Andropogoneae
Besides inferring the changes underlying C evolution in Andropogoneae, our plastome phylogeny encompassing a diversity of Andropogoneae and their closest C relatives shed new light on the age of C photosynthesis in the group. Our molecular dating estimated the split between Jansenelleae and Andropogoneae at roughly 21 Ma, with the first split within Andropogoneae at 18 Ma. While older ages would be inferred if disputed microfossils dates are considered (see Results), these dates represent the interval in which C most likely evolved in this group, and are consistent with those obtained from previous studies (Christin et al. 2008; Christin and Osborne 2014; Vicentini et al. 2008; Estep et al. 2014; Spriggs et al. 2014; Dunning et al. 2017a).
Reconstructing the ancient biogeography of Andropogoneae is complicated by their diversity and presumably numerous long-distance dispersals, but India represents the center of diversity of both Andropogoneae and Jansenelleae (Bor 1955; Hartley 1958; Nair et al. 1982; Yadav et al. 2010), suggesting an origin on the subcontinent. The three species of Jansenelleae occur in open habitats (Bor 1955; Nair et al. 1982; Yadav et al. 2010), including some that regularly burn (Shilla and Tiwari 2015), calling for more research to establish which ecological traits now typical for Andropogoneae had already emerged before the C to C transition and which only appeared afterwards.
The contrast between the sister groups Jansenelleae and Andropogoneae is striking. While the former has only three known species, two of them restricted to small regions of India, the latter encompasses roughly 1200 species spread around the world, many of which are dominant in savanna ecosystems (Hulbert 1988; Solbrig 1996; Bond et al. 2003; Kellogg 2015). This difference is partially explained by the divergence of photosynthetic types, but the expansion of C grasslands happened 10–18 Myr after C originated in Andropogoneae (Edwards et al. 2010), and increased diversification occurred only in some of its subclades (Spriggs et al. 2014). While the initial C trait might have played the role of a key innovation broadening the niche of early Andropogoneae (Lundgren et al. 2015; Aagesen et al. 2016), the later diversification and dominance of some subgroups, their rapid dispersal across large distances (Dunning et al. 2017a) and into different ecosystems (Watcharamongkol et al. 2018) were likely enabled by the acquisition of additional attributes. Traits only partially related or entirely unrelated to C photosynthesis, such as frequent allopolyploidy, herbivore resistance, and fire tolerance have previously been used to explain the success of some Andropogoneae (Stebbins 1975; Bond et al. 2003; Edwards et al. 2010; Visser et al. 2012; Estep et al. 2014; Forrestel et al. 2014; Ripley et al. 2015; Linder et al. 2018). We suggest that the diversity of C phenotypes revealed here might also contribute to variation among Andropogoneae. For instance, the addition of a PCK shuttle, which happened recurrently in some derived Andropogoneae, is predicted to increase tolerance to fluctuating light conditions (Bellasio and Griffiths 2014; Wang et al. 2014). Other anatomical and biochemical variations observed here might alter the hydraulic efficiency and growth rates of the different Andropogoneae (Osborne and Sack 2012). Overall, we conclude that, because of continuous adaptive reinforcement following a key physiological transition, descendants of a lineage sharing the derived trait should not all be considered as functionally equivalent.
Conclusion
Using plastome and nuclear phylogenomics, we confirmed a rare Asian C lineage, Jansenelleae, as sister to the CAndropogoneae grasses. This opens new avenues for comparative analyses of C evolution, which were explored here. The C pathway in Andropogoneae most likely evolved in the Early Miocene between roughly 21 and 18 Ma, and many adaptive changes in C enzymes happened during this 3-Myr period, while many more occurred during the next 18 million years of lineage diversification. The group including Andropogoneae apparently originated on the Indian subcontinent, and the evolutionary diversification of the C phenotype after its origin might have facilitated the spread of Andropogoneae into novel niches and to different regions of the globe, contributing to the success of this emblematic group of savanna grasses.
Supplementary Material
Data available from the Dryad Digital Repository: https://doi.org/10.5061/dryad.j6q573n7x.
Funding
This work was supported by the Brazilian Research Council (CNPq) through a ”Science without Borders“ scholarship (grant number 201873/2014-1) and the European Research Council (grant ERC-2014-STG-638333) to M.E.B. J.H. and G.B. received support from the French excellence projects Labex CEBA (ANR-10-LABX-25-01) and Labex TULIP (ANR-10-LABX-0041). This work was performed within the framework of the PhyloAlps project, whose sequencing was funded by France Génomique (ANR-10-INBS-09-08). P.A.C. is funded by a Royal Society University Research Fellowship (URF120119). M.R.M. was supported by NSF grant DEB-11456884, M.R.D. by Dimensions NASA grant DEB-1342782, and DEB-1457748 to E.A.K. S.L. received funding from the LabexOSUG@2020 (ANR10 LABX56) and the project Origin-Alps (ANR-16-CE93-0004). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.
Acknowledgments
We thank Jacob Washburn, Jeffrey Bennetzen, and Minkyu Park for providing data sets for this study; Luke Dunning, Jill Olofsson, Daniel Wood, and Pierre Solbès for bioinformatic support; Heather Walker for support with mass spectrometry analysis; Sophie Manzi for lab support; Hans-Joachim Esser (Munich Botanical Gardens) for providing herbarium samples; Andrew Fleming for access to the plant histology facility; and Simone de Padua Teixeira, Marjorie Lundgren, and Lamiaa Munshi for suggestions on the leaf anatomy preparation.
Appendix 1
Jansenelleae Voronts. tribus nov.
Type: Jansenella Bor, Kew Bull. 10: 96 (1955)
Included genera: Chandrasekharania V.J. Nair, V.S. Ramach. & Sreek., Jansenella Bor
Description: Annuals with erect culms. Ligule shortly membranous. Leaf-blades lanceolate. Inflorescence shortly branched, appearing capitate. Spikelets laterally compressed, 2-flowered. Glumes 2, apically acuminate to shortly awned. Lower floret sterile, staminate, or bisexual. Lower lemma shortly awned, either entire (Jansenella) or awn arising between two erose apical lobes (Chandrasekharania). Lower palea present. Upper floret bisexual. Upper lemma awned from a bidentate apex, but variable in its shape and indumentum: either with two hair tufts, twisted dehiscent awn arising between long-acuminate lobes (Jansenella) or without hair tufts, short straight awn arising between two erose apical lobes (Chandrasekharania). Stamens 3. Grain ellipsoid; hilum punctiform.
Leaf anatomy: Outer and inner bundle sheaths present. More than four cells between consecutive veins. No distinctive cells. Starch storage in the chlorenchyma.
Distribution: India (including Assam and Sri Lanka), Myanmar and Thailand.
References
- Aagesen L., Biganzoli F., Bena J., Godoy-Bürki A.C., Reinheimer R., Zuloaga F.O.. 2016. Macro-climatic distribution limits show both niche expansion and niche specialization among C Panicoids. PLoS One. 11:e0151075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allen J.M., LaFrance R., Folk R.A., Johnson K.P., Guralnick R.P.. 2018. aTRAM 2.0: an improved, flexible locus assembler for NGS data. Evol. Bioinform. 14:1–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arthan W., McKain M.R., Traiperm P., Welker C.A.D., Teisher J.K., Kellogg E.A.. 2017. Phylogenomics of Andropogoneae (Panicoideae: Poaceae) of mainland Southeast Asia. Syst. Bot. 42:418–431. [Google Scholar]
- Atkinson R.R.L., Mockford E.J., Bennett C., Christin P.A., Spriggs E.L., Freckleton R.P., Thompson K., Rees M., Osborne C.P.. 2016. C photosynthesis boosts growth by altering physiology, allocation and size. Nat. Plants. 2:16038. [DOI] [PubMed] [Google Scholar]
- Bellasio C., Griffiths H.. 2014. The operation of two decarboxylases, transamination, and partitioning of C metabolic processes between mesophyll and bundle sheath cells allows light capture to be balanced for the maize C pathway. Plant Physiol. 164:466–480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bertels F., Silander O.K., Pachkov M., Rainey P.B., van Nimwegen E.. 2014. Automated reconstruction of whole-genome phylogenies from short-sequence reads. Mol. Biol. Evol. 31:1077–1088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Besnard G., Bianconi M.E., Hackel J., Manzi S., Vorontsova M.S., Christin P.-A.. 2018. Herbarium genomics retraces the origins of C-specific carbonic anhydrase in Andropogoneae (Poaceae). Bot. Lett. 165:419–433. [Google Scholar]
- Besnard G., Christin P.-A., Malé P.-J.G., Coissac E., Lhuillier E., Lauzeral C., Vorontsova M.S.. 2014. From museums to genomics: old herbarium specimens shed light on a C to Ctransition. J. Exp. Bot. 65:6711–6721. [DOI] [PubMed] [Google Scholar]
- Besnard G., Christin P.-A., Malé P.-J.G., Coissac E., Ralimanana H., Vorontsova M.S.. 2013. Phylogenomics and taxonomy of Lecomtelleae (Poaceae), an isolated panicoid lineage from Madagascar. Ann. Bot. 112:1057–1066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bianconi M.E., Dunning L.T., Moreno-Villena J.J., Osborne C.P., Christin P.-A.. 2018. Gene duplication and dosage effects during the early emergence of C photosynthesis in the grass genus Alloteropsis. J. Exp. Bot. 69:1967–1980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bläsing O.E., Ernst K., Streubel M., Westhoff P., Svensson P.. 2002. The non-photosynthetic phosphoenolpyruvate carboxylases of the C dicot Flaveria trinervia—implications for the evolution of C photosynthesis. Planta. 215:448–456. [DOI] [PubMed] [Google Scholar]
- Bomblies K., Doebley J.F.. 2005. Molecular evolution of FLORICAULA/LEAFY orthologs in the Andropogoneae (Poaceae). Mol. Biol. Evol. 22:1082–1094. [DOI] [PubMed] [Google Scholar]
- Bond J.E., Garrison N.L., Hamilton C.A., Godwin R.L., Hedin M., Agnarsson I.. 2014. Phylogenomics resolves a spider backbone phylogeny and rejects a prevailing paradigm for orb web evolution. Curr. Biol. 24:1765–1771. [DOI] [PubMed] [Google Scholar]
- Bond W.J., Midgley G.F., Woodward F.I.. 2003. What controls South African vegetation—climate or fire? S. Afr. J. Bot. 69:79–91. [Google Scholar]
- Bond W.J., Silander J.A., Ranaivonasy J., Ratsirarson J.. 2008. The antiquity of Madagascar’s grasslands and the rise of C grassy biomes. J. Biogeogr. 35:1743–1758. [Google Scholar]
- Bor N.L. 1955. Notes on Asiatic grasses: XXIII. Jansenella Bor, a new genus of Indian grasses. Kew Bull. 10:93–99. [Google Scholar]
- Borba A.R., Serra T.S., Górska A., Gouveia P., Cordeiro A.M., Reyna-Llorens I., Kneřová J., Barros P.M., Abreu I.A., Oliveira M.M., Hibberd J.M., Saibo N.J.M.. 2018. Synergistic binding of bHLH transcription factors to the promoter of the maize NADP-ME gene used in C photosynthesis is based on an ancient code found in the ancestral C state. Mol. Biol. Evol. 35:1690–1705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burke S.V., Wysocki W.P., Zuloaga F.O., Craine J.M., Pires J.C., Edger P.P., Mayfield-Jones D., Clark L.G., Kelchner S.A., Duvall M.R.. 2016. Evolutionary relationships in panicoid grasses based on plastome phylogenomics (Panicoideae; Poaceae). BMC Plant Biol. 16:140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Camus-Kulandaivelu L., Chevin L.-M., Tollon-Cordet C., Charcosset A., Manicacci D., Tenaillon M.I.. 2008. Patterns of molecular evolution associated with two selective sweeps in the tb1-dwarf8 region in maize. Genetics. 180:1107–1121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Camus-Kulandaivelu L., Veyrieras J.B., Madur D., Combes V., Fourmann M., Barraud S., Dubreuil P., Gouesnard B., Manicacci D., Charcosset A.. 2006. Maize adaptation to temperate climate: relationship between population structure and polymorphism in the dwarf8 gene. Genetics. 172:2449–2463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Capella-Gutiérrez S., Silla-Martínez J.M., Gabaldón T.. 2009. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 25:1972–1973. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christin P.-A., Besnard G., Edwards E.J., Salamin N.. 2012a. Effect of genetic convergence on phylogenetic inference. Mol. Phylogenet. Evol. 62:921–927. [DOI] [PubMed] [Google Scholar]
- Christin P.-A., Besnard G., Samaritani E, Duvall M.R., Hodkinson T.R., Savolainen V., Salamin N.. 2008. Oligocene CO decline promoted C photosynthesis in grasses. Curr. Biol. 18:37–43. [DOI] [PubMed] [Google Scholar]
- Christin P.-A., Edwards E.J., Besnard G., Boxall S.F., Gregory R., Kellogg E.A., Hartwell J., Osborne C.P.. 2012b. Adaptive evolution of C photosynthesis through recurrent lateral gene transfer. Curr. Biol. 22:445–449. [DOI] [PubMed] [Google Scholar]
- Christin P.-A., Freckleton R.P., Osborne C.P.. 2010. Can phylogenetics identify C origins and reversals? Trends Ecol. Evol. 25:403–409. [DOI] [PubMed] [Google Scholar]
- Christin P.-A., Osborne C.P.. 2014. The evolutionary ecology of C plants. New Phytol. 204:765–781. [DOI] [PubMed] [Google Scholar]
- Christin P.-A., Osborne C.P., Chatelet D.S., Columbus J.T., Besnard G., Hodkinson T.R., Garrison L.M., Vorontsova M.S., Edwards E.J.. 2013. Anatomical enablers and the evolution of C photosynthesis in grasses. Proc. Natl. Acad. Sci. USA. 110:1381–1386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christin P.-A., Petitpierre B., Salamin N., Büchi L., Besnard G.. 2009a. Evolution of C phosphoenolpyruvate carboxykinase in grasses, from genotype to phenotype. Mol. Biol. Evol. 26:357–365. [DOI] [PubMed] [Google Scholar]
- Christin P.-A., Salamin N., Savolainen V., Duvall M.R., Besnard G.. 2007. C photosynthesis evolved in grasses via parallel adaptive genetic changes. Curr. Biol. 17:1241–1247. [DOI] [PubMed] [Google Scholar]
- Christin P.-A., Samaritani E., Petitpierre B., Salamin N., Besnard G.. 2009b. Evolutionary insights on Cphotosynthetic subtypes in grasses from genomics and phylogenetics. Genome Biol. Evol. 1:221–230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christin P.-A., Spriggs E., Osborne C.P., Strömberg C.A.E., Salamin N., Edwards E.J.. 2014. Molecular dating, evolutionary rates, and the age of the grasses. Syst. Biol. 63:153–165. [DOI] [PubMed] [Google Scholar]
- Clark C.J., McGuire J.A., Bonaccorso E., Berv J.S., Prum R.O.. 2018. Complex coevolution of wing, tail, and vocal sounds of courting male bee hummingbirds. Evolution. 72:630–646. [DOI] [PubMed] [Google Scholar]
- Cock P.J.A., Chilton J.M., Grüning B., Johnson J.E., Soranzo N.. 2015. NCBI BLAST+ integrated into Galaxy. GigaScience. 4:39. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dengler N.G., Dengler R.E., Hattersley P.W.. 1985. Differing ontogenetic origins of PCR (“Kranz”) sheaths in leaf blades of C grasses (Poaceae). Am. J. Bot. 72:284–302. [Google Scholar]
- Dengler N.G., Donnelly P.M., Dengler R.E.. 1996. Differentiation of bundle sheath, mesophyll, and distinctive cells in the C grass Arundinella hirta (Poaceae). Am. J. Bot. 83:1391–1405. [Google Scholar]
- Dengler R.E., Dengler N.G.. 1990. Leaf vascular architecture in the atypical C NADP—malic enzyme grass Arundinella hirta. Can. J. Bot. 68:1208–1221. [Google Scholar]
- Doust A.N., Penly A.M., Jacobs S.W.L., Kellogg E.A.. 2007. Congruence, conflict, and polyploidization shown by nuclear and chloroplast markers in the monophyletic “bristle clade” (Paniceae, Panicoideae, Poaceae). Syst. Bot. 32:531–544. [Google Scholar]
- Drummond A.J., Ho S.Y.W., Phillips M.J., Rambaut A.. 2006. Relaxed phylogenetics and dating with confidence. PLoS Biol. 4:699–710. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Drummond A.J., Rambaut A.. 2007. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol. Biol. 7:214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dunning L.T., Liabot A.-L., Olofsson J.K., Smith E.K., Vorontsova M.S., Besnard G., Simpson K.J., Lundgren M.R., Addicott E., Gallagher R.V., Chu Y., Pennington R.T., Christin P.-A., Lehmann C.E.R.. 2017a. The recent and rapid spread of Themeda triandra. Bot. Lett. 164:327–337. [Google Scholar]
- Dunning L.T., Lundgren M.R., Moreno-Villena J.J., Namaganda M., Edwards E.J., Nosil P., Osborne C.P., Christin P.-A.. 2017b. Introgression and repeated co-option facilitated the recurrent emergence of C photosynthesis among close relatives. Evolution. 71:1541–1555. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dunning L.T., Olofsson J.K., Parisod C., Choudhury R.R., Moreno-Villena J.J., Yang Y., Dionora J., Quick W.P., Park M., Bennetzen J.L., Besnard G., Nosil P., Osborne C.P., Christin P.-A.. 2019. Lateral transfers of large DNA fragments spread functional genes among grasses. Proc. Natl. Acad. Sci. USA. 116:4416–4425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edwards E.J., Osborne C.P., Stromberg C.A.E., Smith S.A., Bond W.J., Christin P.-A., Cousins A.B., Duvall M.R., Fox D.L., Freckleton R.P., Ghannoum O., Hartwell J., Huang Y., Janis C.M., Keeley J.E., Kellogg E.A., Knapp A.K., Leakey A.D.B., Nelson D.M., Saarela J.M., Sage R.F., Sala O.E., Salamin N., Still C.J., Tipple B.. 2010. The origins of C grasslands: integrating evolutionary and ecosystem science. Science. 328:587–591. [DOI] [PubMed] [Google Scholar]
- Edwards E.J., Smith S.A.. 2010. Phylogenetic analyses reveal the shady history of C grasses. Proc. Natl. Acad. Sci. USA. 107:2532–2537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Emms D.M., Covshoff S., Hibberd J.M., Kelly S.. 2016. Independent and parallel evolution of new genes by gene duplication in two origins of C photosynthesis provides new insight into the mechanism of phloem loading in C species. Mol. Biol. Evol. 33:1796–1806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Endress P.K. 2011. Evolutionary diversification of the flowers in angiosperms. Am. J. Bot. 98:370–396. [DOI] [PubMed] [Google Scholar]
- Estep M.C., Diaz D.M.V., Zhong J., Kellogg E.A.. 2012. Eleven diverse nuclear-encoded phylogenetic markers for the subfamily Panicoideae (Poaceae). Am. J. Bot. 99:e443–e446. [DOI] [PubMed] [Google Scholar]
- Estep M.C., McKain M.R., Vela Diaz D., Zhong J., Hodge J.G., Hodkinson T.R., Layton D.J., Malcomber S.T., Pasquet R., Kellogg E.A.. 2014. Allopolyploidy, diversification, and the Miocene grassland expansion. Proc. Natl. Acad. Sci. USA. 111:15149–15154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Forrestel E.J., Donoghue M.J., Smith M.D.. 2014. Convergent phylogenetic and functional responses to altered fire regimes in mesic savanna grasslands of North America and South Africa. New Phytol. 203:1000–1011. [DOI] [PubMed] [Google Scholar]
- Gallaher T.J., Adams D.C., Attigala L., Burke S.V., Craine J.M., Duvall M.R., Klahs P.C., Sherratt E., Wysocki W.P., Clark L.G.. 2019. Leaf shape and size track habitat transitions across forest–grassland boundaries in the grass family (Poaceae). Evolution. 73:927–946. [DOI] [PubMed] [Google Scholar]
- Goodstein D.M., Shu S., Howson R., Neupane R., Hayes R.D., Fazo J., Mitros T., Dirks W., Hellsten U., Putnam N., Rokhsar D.S.. 2012. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 40:D1178–D1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grass Phylogeny Working Group (GPWG). 2001. Phylogeny and subfamilial classification of the grasses (Poaceae). Ann. MO. Bot. Gard. 88:373–457. [Google Scholar]
- Grass Phylogeny Working Group II (GPWG II). 2012. New grass phylogeny resolves deep evolutionary relationships and discovers C origins. New Phytol. 193:304–312. [DOI] [PubMed] [Google Scholar]
- Gutierrez M., Gracen V.E., Edwards G.E.. 1974. Biochemical and cytological relationships in C plants. Planta. 119:279–300. [DOI] [PubMed] [Google Scholar]
- Hackel J., Vorontsova M.S., Nanjarisoa O.P., Hall R.C., Razanatsoa J., Malakasi P., Besnard G.. 2018. Grass diversification in Madagascar: In situ radiation of two large C shade clades and support for a Miocene to Pliocene origin of C grassy biomes. J. Biogeogr. 45:750–761. [Google Scholar]
- Hartley W. 1958. Studies on the origin, evolution and distribution of the Gramineae. I. The tribe Andropogoneae. Aust. J. Bot. 6:116–128. [Google Scholar]
- Hatch M.D. 1987. C photosynthesis: a unique blend of modified biochemistry, anatomy and ultrastructure. Biochim. Biophys. Acta. 895:81–106. [Google Scholar]
- Hattersley P.W. 1984. Characterization of C type leaf anatomy in grasses (Poaceae). Mesophyll: bundle sheath area ratios. Ann. Bot. 53:163–180. [Google Scholar]
- Heyduk K., Moreno-Villena J.J., Gilman I., Christin P.-A., Edwards E.J.. 2019. The genetics of convergent evolution: insights from plant photosynthesis. Nat. Rev. Genet. 20:485–493. [DOI] [PubMed] [Google Scholar]
- Hibberd J.M., Covshoff S.. 2010. The regulation of gene expression required for C photosynthesis. Annu. Rev. Plant Biol. 61:181–207. [DOI] [PubMed] [Google Scholar]
- Huang P., Studer A.J., Schnable J.C., Kellogg E.A., Brutnell T.P.. 2017. Cross species selection scans identify components of C photosynthesis in the grasses. J. Exp. Bot. 68:127–135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hulbert L.C. 1988. Causes of fire effects in tallgrass prairie. Ecology. 69:46–58. [Google Scholar]
- Kanai R., Edwards G.E.. 1999. The biochemistry of C photosynthesis. In: Sage R.F., Monson R.K., editors. C4 plant biology. San Diego (CA): Academic Press; p. 49–87. [Google Scholar]
- Katoh K., Standley D.M.. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30:772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kearse M., Moir R., Wilson A., Stones-Havas S., Cheung M., Sturrock S., Buxton S., Cooper A., Markowitz S., Duran C., Thierer T., Ashton B., Meintjes P., Drummond A.. 2012. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 28:1647–1649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kellogg E.A. 2015. Flowering Plants. Monocots: Poaceae; Heidelberg: Springer; p. 416. [Google Scholar]
- Langmead B., Salzberg S.L.. 2012. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 9:357–359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lehmann C.E.R, Griffith D.M., Simpson K.J., Anderson T.M., Archibald S., Beerling D.J., Bond W.J., Denton E., Edwards E.J., Forrestel E.J., Fox D.L., Georges D., Hoffmann W.A., Kluyver T., Mucina L., Pau S., Ratnam J., Salamin N., Santini B., Smith M.D., Spriggs E.L., Westley R., Still C.J., Strömberg C.A.E., Osborne C.P.. 2019. Functional diversification enabled grassy biomes to fill global climate space. bioRxiv [Preprint] doi: 10.1101/583625. [DOI] [Google Scholar]
- Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R.. 2009. The sequence alignment/map format and SAMtools. Bioinformatics. 25:2078–2079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Linder H.P., Lehmann C.E.R., Archibald S., Osborne C.P., Richardson D.M.. 2018. Global grass (Poaceae) success underpinned by traits facilitating colonization, persistence and habitat transformation. Biol. Rev. 93:1125–1144. [DOI] [PubMed] [Google Scholar]
- Long S.P. 1999. Environmental responses. In: Sage R.F., Monson R.K., editors. C4 plant biology. San Diego (CA): Academic Press. p. 215–249. [Google Scholar]
- Lundgren M.R., Besnard G., Ripley B.S., Lehmann C.E.R., Chatelet D.S., Kynast R.G., Namaganda M., Vorontsova M.S., Hall R.C., Elia J., Osborne C.P., Christin P.-A.. 2015. Photosynthetic innovation broadens the niche within a single species. Ecol. Lett. 18:1021–1029. [DOI] [PubMed] [Google Scholar]
- Lundgren M.R., Dunning L.T., Olofsson J.K., Moreno-Villena J.J., Bouvier J.W., Sage T.L., Khoshravesh R., Sultmanis S., Stata M., Ripley B.S., Vorontsova M.S., Besnard G., Adams C., Cuff N., Mapaura A., Bianconi M.E., Long C.M., Christin P.A., Osborne C.P.. 2019. C anatomy can evolve via a single developmental change. Ecol. Lett. 22:302–312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lundgren M.R., Osborne C.P., Christin P.-A.. 2014. Deconstructing Kranz anatomy to understand C evolution. J. Exp. Bot. 65:3357–3369. [DOI] [PubMed] [Google Scholar]
- Marek P.E., Moore W.. 2015. Discovery of a glowing millipede in California and the gradual evolution of bioluminescence in Diplopoda. Proc. Natl. Acad. Sci. USA. 112:6419–6424. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McGee M.D., Borstein S.R., Neches R.Y., Buescher H.H., Seehausen O., Wainwright P.C.. 2015. A pharyngeal jaw evolutionary innovation facilitated extinction in Lake Victoria cichlids. Science. 350:1077–1079. [DOI] [PubMed] [Google Scholar]
- Metcalfe C.R. 1960. Anatomy of the monocotyledons—I. Gramineae. London:Oxford; p. 267. [Google Scholar]
- Mirarab S., Warnow T.. 2015. ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes. Bioinformatics. 31:i44–i52. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moreno-Villena J.J., Dunning L.T., Osborne C.P., Christin P.-A.. 2018. Highly expressed genes are preferentially co-opted for C photosynthesis. Mol. Biol. Evol. 35:94–106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nair V.J., Ramachandran V.S., Sreekumar P.V.. 1982. Chandrasekharania: a new genus of Poaceae from Kerala, India. Proc. Indian Acad. Sci. (Plant Sci.) 91:79–82. [Google Scholar]
- Nguyen L.T., Schmidt H.A., Von Haeseler A., Minh B.Q.. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32:268–274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Niklaus M., Kelly S.. 2019. The molecular evolution of C photosynthesis: opportunities for understanding and improving the world’s most productive plants. J. Exp. Bot. 70:795–804. [DOI] [PubMed] [Google Scholar]
- O’Leary M.H. 1988. Carbon isotopes in photosynthesis. BioScience. 38:328–336. [Google Scholar]
- Olofsson J.K., Bianconi M., Besnard G., Dunning L.T., Lundgren M.R., Holota H., Vorontsova M.S., Hidalgo O., Leitch I.J., Nosil P., Osborne C.P., Christin P.-A.. 2016. Genome biogeography reveals the intraspecific spread of adaptive mutations for a complex trait. Mol. Ecol. 25:6107–6123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Olofsson J.K., Cantera I., Van de Paer C., Hong-Wa C., Zedane L., Dunning L.T., Alberti A., Christin P.-A., Besnard G.. 2019. Phylogenomics using low-depth whole genome sequencing: a case study with the olive tribe. Mol. Ecol. Resour. 19:877–892. [DOI] [PubMed] [Google Scholar]
- Osborne C.P. 2008. Atmosphere, ecology and evolution: what drove the Miocene expansion of C grasslands? J. Ecol. 96:35–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Osborne C.P., Sack L.. 2012. Evolution of C plants: a new hypothesis for an interaction of CO and water relations mediated by plant hydraulics. Philos. Trans. R. Soc. B Biol. Sci. 367:583–600. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Osborne C.P., Salomaa A., Kluyver T.A., Visser V., Kellogg E.A., Morrone O., Vorontsova M.S., Clayton W.D., Simpson D.A.. 2014. A global database of C photosynthesis in grasses. New Phytol. 204:441–446. [DOI] [PubMed] [Google Scholar]
- Patel R.K., Jain M.. 2012. NGS QC Toolkit: a toolkit for quality control of next generation sequencing data. PLoS One. 7:e30619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paterson A.H., Bowers J.E., Bruggmann R., Dubchak I., Grimwood J., Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A., Schmutz J., Spannagl M., Tang H., Wang X., Wicker T., Bharti A.K., Chapman J., Feltus F.A., Gowik U., Grigoriev I.V., Lyons E., Maher C.A., Martis M., Narechania A., Otillar R.P., Penning B.W., Salamov A.A., Wang Y., Zhang L., Carpita N.C., Freeling M., Gingle A.R., Hash C.T., Keller B., Klein P., Kresovich S., McCann M.C., Ming R., Peterson D.G., Mehboob-Ur-Rahman, Ware D., Westhoff P., Mayer K.F.X., Messing J., Rokhsar D.S.. 2009. The Sorghum bicolor genome and the diversification of grasses. Nature. 457:551–556. [DOI] [PubMed] [Google Scholar]
- Piot A., Hackel J., Christin P.-A., Besnard G.. 2018. One third of the plastid genes evolved under positive selection in PACMAD grasses. Planta. 247:255–266. [DOI] [PubMed] [Google Scholar]
- Prasad V., Strömberg, C.A.E., Leaché, A.D., Samant B., Patnaik R., Tang L., Mohabey D.M., Ge S., Sahni A.. 2011. Late Cretaceous origin of the rice tribe provides evidence for early diversification in Poaceae. Nat. Commun. 2:480. [DOI] [PubMed] [Google Scholar]
- Prasad V., Strömberg C.A.E., Alimohammadian H., Sahni A.. 2005. Dinosaur coprolites and the early evolution of grasses and grazers. Science 310:1177–1180. [DOI] [PubMed] [Google Scholar]
- Prendergast H.D. V, Hattersley P.W., Stone N.E.. 1987. New structural/biochemical associations in leaf blades of C grasses (Poaceae). Funct. Plant Biol. 14:403–420. [Google Scholar]
- Puttick M.N., Thomas G.H., Benton M.J.. 2014. High rates of evolution preceded the origin of birds. Evolution. 68:1497–1510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rainford J.L., Hofreiter M., Nicholson D.B., Mayhew P.J.. 2014. Phylogenetic distribution of extant richness suggests metamorphosis is a key innovation driving diversification in insects. PLoS One. 9:e109085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rambaut A., Suchard M.A., Xie W., Drummond A.J.. 2013. Tracer v1.6. Available from: http://tree.bio.ed.ac.uk/software/tracer/. [Google Scholar]
- Renvoize S.A. 1982a. A survey of leaf-blade anatomy in grasses I. Andropogoneae. Kew Bull. 37:315–321. [Google Scholar]
- Renvoize S.A. 1982b. A survey of leaf-blade anatomy in grasses II. Arundinelleae. Kew Bull. 37:489–495 [Google Scholar]
- Renvoize S.A. 1982c. A survey of leaf-blade anatomy in grasses III. Garnotieae. Kew Bull. 37:497. [Google Scholar]
- Renvoize S.A. 1985. A note on Jansenella (Gramineae). Kew Bull. 40: 470. [Google Scholar]
- Ripley B., Visser V., Christin P.-A., Archibald S., Martin T., Osborne C.. 2015. Fire ecology of C and C grasses depends on evolutionary history and frequency of burning but not photosynthetic type. Ecology. 96:2679–2691. [DOI] [PubMed] [Google Scholar]
- Rondeau P., Rouch C., Besnard G.. 2005. NADP-malate dehydrogenase gene evolution in Andropogoneae (Poaceae): gene duplication followed by sub-functionalization. Ann. Bot. 96:1307–1314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ronquist F., Teslenko M., Van Der Mark P., Ayres D.L., Darling A., Höhna S., Larget B., Liu L., Suchard M.A., Huelsenbeck J.P.. 2012. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst. Biol. 61:539–542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sage R.F. 2004. The evolution of C photosynthesis. New Phytol. 161:341–370. [DOI] [PubMed] [Google Scholar]
- Sage R.F. 2017. A portrait of the C photosynthetic family on the 50th anniversary of its discovery: species number, evolutionary lineages, and Hall of Fame. J. Exp. Bot. 68:e11–e28. [DOI] [PubMed] [Google Scholar]
- Sage R.F., Christin P.-A., Edwards E.J.. 2011. The C plant lineages of planet Earth. J. Exp. Bot. 62:3155–3169. [DOI] [PubMed] [Google Scholar]
- Sage R.F., Stata M.. 2015. Photosynthetic diversity meets biodiversity: the C plant example. J. Plant Physiol. 172:104–119. [DOI] [PubMed] [Google Scholar]
- Sánchez-García M., Matheny P.B.. 2017. Is the switch to an ectomycorrhizal state an evolutionary key innovation in mushroom-forming fungi? A case study in the Tricholomatineae (Agaricales). Evolution. 71:51–65. [DOI] [PubMed] [Google Scholar]
- Shilla U., Tiwari B.K.. 2015. Impact of fire and grazing on plant diversity of a grassland ecosystem of Cherrapunjee. Keanean J. Sci. 4: 67–78. [Google Scholar]
- Silva C., Besnard G., Piot A., Razanatsoa J., Oliveira R.P., Vorontsova M.S.. 2017. Museomics resolve the systematics of an endangered grass lineage endemic to north-western Madagascar. Ann. Bot. 119:339–351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sinha N.R., Kellogg E.A.. 1996. Parallelism and diversity in multiple origins of C photosynthesis in the grass family. Am. J. Bot. 83:1458–1470. [Google Scholar]
- Solbrig O.T. 1996. The diversity of the savanna ecosystem. In: Solbrig O.T., Medina E., Silva J.F., editors. Biodiversity and savanna ecosystem processes. Ecological Studies Vol. 121 Berlin, Heidelberg: Springer-Verlag; p. 1–27. [Google Scholar]
- Soreng R.J., Peterson P.M., Romaschenko K., Davidse G., Teisher J.K., Clark L.G., Barberá P., Gillespie L.J., Zuloaga F.O.. 2017. A worldwide phylogenetic classification of the Poaceae (Gramineae) II: an update and a comparison of two 2015 classifications. J. Syst. Evol. 55:259–290. [Google Scholar]
- Sprent J.I. 2007. Evolving ideas of legume evolution and diversity: a taxonomic perspective on the occurrence of nodulation. New Phytol. 174:11–25. [DOI] [PubMed] [Google Scholar]
- Spriggs E.L., Christin P.-A., Edwards E.J.. 2014. C photosynthesis promoted species diversification during the Miocene grassland expansion. PLoS One. 9:e97722. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 30:1312–1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stebbins G.L. 1975. The role of polyploid complexes in the evolution of North American grasslands. Taxon. 24:91–106. [Google Scholar]
- Strömberg C.A.E., Dunn R.E., Crifò C., Harris E.B.. 2018. Phytoliths in paleoecology: analytical considerations, current use, and future directions. In: Croft D.A., Su D., Simpson S.W., editors. Methods in paleoecology: reconstructing cenozoic terrestrial environments and ecological communities, vertebrate paleobiology and paleoanthropology. Chapter 12 Cham: Springer; p. 235–287. [Google Scholar]
- Studer A.J., Schnable J.C., Weissmann S., Kolbe A.R., McKain M.R., Shao Y., Cousins A.B., Kellogg E.A., Brutnell T.P.. 2016. The draft genome of the C panicoid grass species Dichanthelium oligosanthes. Genome Biol. 17:223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tateoka T. 1958. Notes on some grasses. VIII. On leaf structure of Arundinella and Garnotia. Bot. Gaz. 120:101–109. [Google Scholar]
- Tausta S.L., Miller Coyle H., Rothermel B., Stiefel V., Nelson T.. 2002. Maize C and non-C NADP-dependent malic enzymes are encoded by distinct genes derived from a plastid-localized ancestor. Plant Mol. Biol. 50:635–652. [DOI] [PubMed] [Google Scholar]
- Thornsberry J.M., Goodman M.M., Doebley J., Kresovich S., Nielsen D., Buckler E.S. IV.. 2001. Dwarf8 polymorphisms associate with variation in flowering time. Nat. Genet. 28:288–289. [DOI] [PubMed] [Google Scholar]
- Türpe A.M. 1970. Sobre la anatomía foliar de Jansenella griffithiana (C. Mueller) Bor (Poaceae: Arundinelleae). Senckenberg. Biol. 51:277–285. [Google Scholar]
- Ueno O. 1995. Occurrence of distinctive cells in leaves of C species in Arthraxon and Microstegium (Andropogoneae-Poaceae) and the structural and immunocytochemical characterization of these cells. Int. J. Plant Sci. 156:270–289. [Google Scholar]
- Vicentini A., Barber J.C., Aliscioni S.S., Giussani L.M., Kellogg E.A.. 2008. The age of the grasses and clusters of origins of C photosynthesis. Glob. Change Biol. 14:2963–2977. [Google Scholar]
- Visser V., Woodward F.I., Freckleton R.P., Osborne C.P.. 2012. Environmental factors determining the phylogenetic structure of C grass communities. J. Biogeogr. 39:232–246. [Google Scholar]
- von Caemmerer S., Furbank R.T.. 2003. The C pathway: an efficient CO pump. Photosynth. Res. 77:191–207. [DOI] [PubMed] [Google Scholar]
- Walker R.P., Acheson R.M., Técsi L.I., Leegood R.C.. 1997. Phosphoenolpyruvate carboxykinase in C plants: its role and regulation. Aust. J. Plant Physiol. 24:459–468. [Google Scholar]
- Wang X., Gowik U., Tang H., Bowers J.E., Westhoff P., Paterson A.H.. 2009. Comparative genomic analysis of C photosynthetic pathway evolution in grasses. Genome Biol. 10:R68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Y., Bräutigam A., Weber A.P.M., Zhu X.-G.. 2014. Three distinct biochemical subtypes of Cphotosynthesis? A modelling analysis. J. Exp. Bot. 65:3567–3578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Washburn J.D., Schnable J.C., Conant G.C., Brutnell T.P., Shao Y., Zhang Y., Ludwig M., Davidse G., Pires J.C.. 2017. Genome-guided phylo-transcriptomic methods and the nuclear phylogenetic tree of the Paniceae grasses. Sci. Rep. 7:13528. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Washburn J.D., Schnable J.C., Davidse G., Pires J.C.. 2015. Phylogeny and photosynthesis of the grass tribe Paniceae. Am. J. Bot. 102:1493–1505. [DOI] [PubMed] [Google Scholar]
- Watcharamongkol T., Christin P.-A., Osborne C.P.. 2018. C photosynthesis evolved in warm climates but promoted migration to cooler ones. Ecol. Lett. 21:376–383. [DOI] [PubMed] [Google Scholar]
- Watson L., Macfarlane T.D., Dallwitz M.J.. 1992. The grass genera of the world: descriptions, illustrations, identification, and information retrieval; including synonyms, morphology, anatomy, physiology, phytochemistry, cytology, classification, pathogens, world and local distribution, and references. Version: 11th December 2017. delta-intkey.com [Google Scholar]
- Wingler A., Walker R.P., Chen Z.-H., Leegood R.C.. 1999. Phosphoenolpyruvate carboxykinase is involved in the decarboxylation of aspartate in the bundle sheath of maize. Plant Physiol. 120:539–546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xi Z., Liu L., Davis C.C.. 2016. The impact of missing data on species tree estimation. Mol. Biol. Evol. 33:838–860. [DOI] [PubMed] [Google Scholar]
- Yadav S.R., Chivalkar S.A., Gosavi K.V.C.. 2010. On the identity of Jansenella griffithiana (Poaceae) with a new species from Western Ghats, India. Rheedea. 20:38–43. [Google Scholar]
- Yang Z. 1998. Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Mol. Biol. Evol. 15:568–573. [DOI] [PubMed] [Google Scholar]
- Yang Z. 2007. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24:1586–1591. [DOI] [PubMed] [Google Scholar]
- Yang Z., Nielsen R.. 1998. Synonymous and nonsynonymous rate variation in nuclear genes of mammals. J. Mol. Evol. 46:409–418. [DOI] [PubMed] [Google Scholar]
- Yukawa T., Ogura-Tsujita Y., Shefferson R.P., Yokoyama J.. 2009. Mycorrhizal diversity in Apostasia (Orchidaceae) indicates the origin and evolution of orchid mycorrhiza. Am. J. Bot. 96:1997–2009. [DOI] [PubMed] [Google Scholar]
- Zuloaga, F.O., Morrone, O. and Giussani, L.M.. 2000. A cladistic analysis of the Paniceae: a preliminary approach. In: Jacobs S.W.L., Everett J., editors. Grasses: systematics and evolution Collingwood, Victoria, Australia: CSIRO; p. 123–135. [Google Scholar]