Abstract
The repeated evolution of the same trait in different lineages provides powerful natural experiments to study the phenotypic and genotypic predictability of trait gain and loss. A fascinating example is the repeated evolution of hummingbird pollination in plant lineages in the Americas, a widespread and often unidirectional phenomenon. The spiral gingers in the genus Costus are ancestrally bee pollinated, and hummingbird pollination has evolved multiple times independently in the tropical Americas. These pollinator transitions are accompanied by predictable morphological and color changes, but the changes in floral scent have not been described. In this study, we describe the floral scent composition of 30 species of Costus sampled across the phylogeny to understand how floral scent has evolved across the genus with respect to pollinator transitions. We then combine transcriptomics and genomics to identify gene expression differences and gene family evolution associated with pollinator transitions. We show that hummingbird-pollinated species have mostly lost their floral scent, whereas bee-pollinated species exhibit either floral scent maintenance or, in some cases, gains of more diverse scent profiles. We find the floral scent loss appears to be due to gene downregulation rather than pseudogenization. The remarkable consistency of scent loss in hummingbird-pollinated species highlights the shared strong selection pressures experienced by these lineages. Even species with more recent transitions from bee to hummingbird pollination exhibit scent loss, highlighting the rapid breakdown of scent emission following pollinator transitions. This research highlights the capacity for rapid changes when selection pressures are strong through downregulation of floral scent genes.
Keywords: convergent evolution, floral scent, gene family evolution, hummingbird, orchid bee, pollination, spiral gingers, transcriptomics, trait evolution
Introduction
The repeated evolution of the same traits in different lineages is thought to reflect shared selection pressures. These cases of repeated evolution apply not only to the gain of traits, such as the evolution of antifreeze proteins in fish (Chen et al. 1997) or carnivory in plants (Albert et al. 1992), but also the loss of traits, as seen in the eye regression of cavefish (Sifuentes-Romero et al. 2023), or reduction of attractive traits such as flower size or floral scent in self-fertilizing plants (Tsuchimatsu and Fujii 2022). Repeated evolution provides us with systems where we can study the process of adaptation and ask questions about the predictability of evolution and the extent of constraint, or flexibility, of the underlying molecular mechanisms (Orr 2005; Losos 2011; Sackton and Clark 2019).
One such example of repeated evolution is pollinator transitions in flowering plants. For example, the evolution of hummingbird pollination has occurred many times across flowering plants and often involves convergence in a suite of traits. Flowers pollinated by hummingbirds are characterized by traits thought to attract and fit hummingbirds (such as a narrow shape and large amounts of nectar) and, in addition, traits thought to deter bees (red coloration, lack of nectar guides, and lack of scent) (Schemske and Bradshaw 1999; Castellanos et al. 2004; Bergamo et al. 2016). Interestingly, the evolution of hummingbird pollination has primarily been documented in one direction, from ancestral bee pollination to derived hummingbird pollination (Rosas-Guerrero et al. 2014; Abrahamczyk and Renner 2015; Kay and Grossenbacher 2022). However, there is clade-specific variation in this directionality, with the reasons for pollinator transitions remaining unclear (Tripp and Manos 2008; Rosas-Guerrero et al. 2014; Abrahamczyk and Renner 2015; Kay and Grossenbacher 2022; Barreto et al. 2024).
Hummingbird-pollinated flowers often lack floral scent and therefore provide an excellent system to study repeated trait loss (Knudsen et al. 2004; Wessinger 2024). Genomic studies have shown that, even for birds, hummingbirds have a low number of olfactory receptors (Driver and Balakrishnan 2021; Driver 2022), suggesting at least a reduced investment in this sensory modality. Behavioral experiments have also suggested that hummingbirds rely more on visual than olfactory cues when foraging (Goldsmith and Goldsmith 1982). In fact, floral scent analyses of hummingbird-pollinated flowers have shown either complete lack of floral scent or low emission rates of floral scent compounds (Knudsen et al. 2004; Byers et al. 2014; Amrad et al. 2016). The lack of floral scent in hummingbird-pollinated species suggested that there is a cost to emitting scent for these species. This could be due to energetic costs of compound production, but this is predicted to be minimal (Raguso 2016; Pichersky and Raguso 2018). Alternatively, there could be an ecological cost for hummingbird-pollinated species attracting bees, also called the bee avoidance hypothesis (Castellanos et al. 2004; Bergamo et al. 2016). Finally, floral scent reduction could also evolve as an adaptation to prevent the attraction of florivores or nectar robbers, which could be attracted by the scent (Galen et al. 2011; Theis and Adler 2012).
A fascinating group in which to study the evolution of hummingbird pollination and potential trait loss is the genus Costus (Costaceae). The genus contains over 80 species of perennial understory herbaceous monocots, the majority of which have diverged rapidly within the last 3 million years in Central and South America (Maas 1972, 1977; Vargas et al. 2020). Since establishing in the American tropics, hummingbird pollination has evolved repeatedly and independently in Costus from an ancestral state of bee pollination (Kay and Grossenbacher 2022). Specifically, bee-pollinated species are pollinated primarily by orchid bees (Apidae: Euglossini), with females visiting flowers for nectar and pollen and males visiting for nectar (Kay and Schemske 2003). Hummingbird-pollinated species are primarily pollinated by nonterritorial hermit hummingbirds (Kay and Schemske 2003). While different species were initially classified as hummingbird or bee pollinated (Maas 1972, 1977), many species do not fit the classic paradigm of long tubular red flowers associated with hummingbird pollination (Grant and Grant 1968). For example, many bee-pollinated flowers have some red coloration (Kay and Schemske 2003), and hummingbird-pollinated flowers are short (Yost and Kay 2009). Despite this variation, a single phenotypic optimum is shared across most of the independent transitions to hummingbird pollination in the clade (Kay and Grossenbacher 2022). Traits including small flower size, the absence of nectar guides, and brightly colored floral bracts are all predictive of hummingbird pollination (Kay and Grossenbacher 2022). Furthermore, Costus species show extensive sympatry between different pollination syndromes (Kay and Schemske 2003; Vargas et al. 2020; Kay and Grossenbacher 2022), and even the specialist herbivores are shared across Costus species (García-Robledo et al. 2013), allowing us to isolate the effects of pollination syndrome from geographic location or herbivory. The combination of recent genomic resources, a well-resolved phylogeny, documented pollinators, and the possibility of collecting samples both in the field and in greenhouses, makes Costus a powerful system to study the evolution of convergence on hummingbird pollination (Vargas et al. 2020; Kay and Grossenbacher 2022; Harenčár et al. 2023).
While the morphological and color changes involved in the evolution of hummingbird pollination have been well studied in Costus, the expected scent changes have not been characterized. In this study, we describe floral scent in 30 species of Costus sampled across the phylogeny and including eight independent origins of hummingbird pollination (as determined in Kay and Grossenbacher 2022). We test for differences in floral scent between bee-pollinated and hummingbird-pollinated species and investigate how floral scent has evolved across the genus. We then focus on one clade of Costus with variation in floral scent emission and generate floral transcriptomic data. We use this data to identify gene expression differences associated with floral scent production to better understand the genetic changes underlying floral scent evolution in this group. Finally, we annotated a family of genes important for the production of terpenoid floral scent compounds (the terpene synthase [TPS] genes) in three previously available Costus genomes, and a high-quality genome assembly for Costus allenii generated in this study, encompassing two independent transitions to hummingbird pollination. High-quality annotations allow us to analyze the evidence of gene loss or pseudogenization following transition to hummingbird pollination in this gene family that plays an important role in floral scent evolution. Our study, therefore, combines multiple approaches to characterize repeated phenotypic evolution and the molecular mechanisms of these changes with shifts to hummingbird pollination in Costus.
Results
Bee-Pollinated Species Emit More Compounds and More Diverse Floral Scents
We collected 100 samples from 30 Costus species including species previously described as both bee and hummingbird pollinated from the field and the greenhouse (Fig. 1a; supplementary table S1, Supplementary Material online). Using multivariate techniques, we visualized the floral scent profiles to identify the broad patterns in our data set. Before filtering our data, we found that samples from hummingbird-pollinated species did not differ from ambient control samples (Fig. 1b). In contrast, samples from bee-pollinated species spread across a wider section of chemospace, with many samples visually separated from both ambient and hummingbird-pollinated samples (Fig. 1b). We found that both pollination group and species (nested within pollination group) explained variation in floral scent (permutational multivariate analysis of variance [PERMANOVA], pollination group: F2,167 = 8.0, P < 0.001; species: F28,167 = 2.5, P < 0.001). Interestingly, bee-pollinated species were significantly different from both hummingbird-pollinated species (pairwise PERMANOVA, P = 0.003) and ambient control samples (pairwise PERMANOVA, P = 0.003), whereas hummingbird-pollinated species did not differ from ambient control samples (pairwise PERMANOVA, P = 0.07).
Fig. 1.
a) Representative images of two pairs of closely related bee- and hummingbird-pollinated species: C. allenii and C. lasius and C. erythrophyllus and C. spiralis. b) NMDS plot illustrating in the variation in the unfiltered data set (stress = 0.12). Points are labeled as bee-pollinated, hummingbird-pollinated, or ambient samples. Ambient samples are control samples of the air taken in the same area as the focal plant individual. Ellipses are plotted for species with more than two samples to show distributions. c) Differences in bee-pollinated and hummingbird-pollinated Costus species in terms of Compound richness (ANOVA, pollination group: F1,70 = 60.4, P < 0.0001; species: F28,70 = 9.4, P < 0.0001). d) Functional Hill Diversity Index (ANOVA, pollination group: F1,52 = 6.8, P = 0.012; species: F26,52 = 2.9, P < 0.001). e) Total compound amount emitted (measured by proxy of total ion abundance) (ANOVA, pollination group: F1,70 = 109, P = <0.0001; species: F28,70 = 2.8, P < 0.001). Fifteen individuals with the highest compound emission were removed from E for visualization.
After filtering likely contaminants from the ambient air, we obtained a final data set with 60 floral scent compounds (supplementary fig. S1 and table S2, Supplementary Material online). The majority of these compounds were terpenoid (87%, 52/60), with a few aromatics and compounds of unknown class. The data set included both monoterpenes and sesquiterpenes. No one compound class was restricted to a specific pollinator group. After removing the ambient compounds, we found that one bee-pollinated sample and 18 hummingbird-pollinated samples had no detectable floral scent, confirming the similarity of the hummingbird-pollinated samples to the ambient air samples. The unscented hummingbird-pollinated samples were from six species. For two species, Costus montanus and Costus stenophyllus, all individuals emitted no detectable floral scent (4/4 individuals). Furthermore, most Costus chartaceus (4/5), Costus lasius (4/7), and Costus wilsonii (5/8) and a single Costus productus individual (1/5) lacked floral scent. To facilitate visualization of our final filtered data set using nonmetric multidimensional scaling (NMDS), we removed the nineteen samples with no detectable floral scent. When this filtered data set was plotted, we observed more overlap between the hummingbird- and bee-pollinated samples (supplementary fig. S2, Supplementary Material online). This suggests that there is little difference in the scent composition of bee-pollinated samples and those samples from hummingbird-pollinated species, which did emit floral scent, although they did still show a statistically significant difference.
We found that bee-pollinated flowers emitted a richer floral scent, averaging 9 (SD 10) compounds in comparison to only 2 (SD 4) for hummingbird-pollinated flowers (Fig. 1c; supplementary fig. S3, Supplementary Material online). We confirmed that this was the case for both plants raised in a greenhouse and those sampled in the field (supplementary fig. S4, Supplementary Material online). Similar to the compound richness, we used a measure of biochemical and structural diversity of the compounds present in a sample to calculate the overall chemical diversity. The Functional Hill Diversity Index considers not only the total number of compounds (richness) but also their biosynthetic complexity. We found that the bee-pollinated floral scents had a higher chemical diversity (Fig. 1d). Bee-pollinated flowers also emitted a higher abundance of floral scent, as measured by total ion abundance in each sample, a proxy for compound amount (Fig. 1e). We validated this proxy using absolute amounts in nanograms for a subset of the data set, again finding that the bee-pollinated species (C. allenii) has significantly higher compound emission when compared to the hummingbird-pollinated species (C. lasius) (supplementary fig. S5, Supplementary Material online).
To see how individual compounds varied between bee- and hummingbird-pollinated species, we calculated the percentage of individuals in each pollination group that emitted the most common floral scent compounds in the data set. We found that these compounds are present in a higher proportion of bee-pollinated flowers than hummingbird-pollinated flowers (supplementary fig. S6, Supplementary Material online). In particular, (E)-beta-ocimene and beta-caryophyllene show the largest percentage differences between the two pollination groups (supplementary fig. S6, Supplementary Material online). We used generalized linear models (GLMs) with a binomial error distribution and logit link function to identify compounds, which significantly differed between pollinator groups (compound presence ∼ pollinator group/species). Significance was determined with likelihood ratio tests. Of the 60 compounds in our data set, 28 were significantly different between pollinator groups (supplementary table S2, Supplementary Material online), including all of the most abundant compounds, except alpha- and beta-pinene (supplementary fig. S6 and table S2, Supplementary Material online). Some of these compounds do not vary independently between samples. We find evidence for six modules: an aromatic module, a terpenoid module, a monoterpene module, two sesquiterpene modules, and a large sesquiterpene-dominated module (supplementary fig. S7, Supplementary Material online). We found that the compound that showed the biggest difference between pollinator groups, (E)-beta-ocimene, was not part of any these large modules of compounds.
Phylogenetic Analyses Show That Hummingbird-Pollinated Species Have Lost Floral Scent
To understand the evolution of floral scent in Costus, we carried out phylogenetic analyses using a previously published phylogeny (Vargas et al. 2020). To examine scent differences between bee- and hummingbird-pollinated species while accounting for phylogenetic relationships, we used a method of randomizing residuals in a permutation procedure (Adams and Collyer 2018). We found that compound richness, compound diversity, compound amount, and (E)-beta-ocimene emission were all significantly different between bee-pollinated and hummingbird-pollinated species (compound richness, F1,26 = 7.2, P = 0.002; compound diversity, F1,26 = 5.8, P = 0.008; compound amount, F1,26 = 627.5, P = 0.003; (E)-beta-ocimene production, F1,26 = 4.7, P = 0.03). (E)-beta-caryophyllene production did not differ between pollinator groups (F1,26 = 1.0, P = 0.4). Although phylogenetic ANOVA with simulations based on Brownian motion is a more common approach to these types of comparisons, it was inappropriate for our data set. We found a strong correlation between pollination group and the phylogenetic covariance matrix using two-block partial least squares (PLS) analysis (mean correlation [rPLS] = 0.56; P = 0.01). This reduces the statistical power of testing group differences using phylogenetic ANOVA due to the high aggregation of pollinator groups on the phylogeny (Adams and Collyer 2018). Overall, our comparative analyses complement our previous nonphylogenetic analyses, again showing that bee-pollinated species emit higher amounts of more diverse and complex floral scents and have higher (E)-beta-ocimene emission.
We found no evidence of phylogenetic signal in compound richness (λ = 7.4e−05, P = 1), compound diversity (λ = 4.15e−05, P = 1), compound amount (total area of floral scent peaks) (λ = 7.4e−05, P = 1), (E)-beta-ocimene emission (λ = 7.3e−05, P = 1), and (E)-beta-caryophyllene emission (λ = 7.4e−05, P = 1). We also tested for evidence of phylogenetic signal in the multivariate floral scent data set, detecting nonsignificant almost zero phylogenetic signal (Kmult = 0.43, P = 0.68).
To further investigate how floral scent has evolved across the Costus phylogeny, we carried out maximum likelihood ancestral state reconstruction of mean compound emission. We find the ancestral state of all Costus species found in tropical America is to emit more compounds than average hummingbird-pollinated species (4.5 vs. 2). In this reconstruction, the ancestral Costus population emitted an average floral scent—more compounds than hummingbird-pollinated species but less than the average bee-pollinated species (Fig. 2). Many hummingbird-pollinated species have since completely lost floral scent, as shown by the white circles (Fig. 2). We see evidence for low or no scent emission in representatives of all eight origins of hummingbird pollination included in our data set. In contrast, some bee-pollinated species, such as C. allenii, have evolved much stronger floral scents than the ancestral state.
Fig. 2.
Ancestral state reconstruction of floral scent in the genus Costus. Pollinator for each species is illustrated by the right-hand dots. Mean compound richness emitted by each species is shown by the left-hand dots with each shade of gray representing a range of means (where 0 to 2 includes means up to 2.99, 3 to 6 up to 6.99, etc.). Darker dots indicate more compounds are emitted. Predicted compound emission at each node is shown following the same color scale. Tip labels following (Vargas et al. 2020).
Loss of Floral Scent Is Correlated with Downregulation of Floral TPS Genes
To analyze the genetic basis of floral scent evolution, we first chose a clade of closely related species with variation in floral scent emission and pollination systems. We chose the clade containing C. lasius due to the variation in floral scent emission and the availability of a high-quality genome. In this clade, C. lasius (hummingbird-pollinated) emits on average three floral scent compounds, whereas C. allenii (bee-pollinated) emits 24 (one of the highest in our data set), and Costus villosissimus (bee-pollinated) emits seven compounds (Fig. 3a).
Fig. 3.
Differences in floral scent and gene expression between C. allenii, C. lasius, and C. villosissimus. Species differ significantly in a) compound richness (ANOVA, F2,19 = 20.9, P < 0.001) and b) expression of TPS transcripts (ANOVA, F2,16 = 508.4, P < 0.001). In b), TPS transcripts are summed for all TPS loci and then the average expression calculated by dividing by the number of TPSs expressed in that species (20 for C. allenii, 5 for C. villosissimus, and 1 for C. lasius).
We first produced floral transcriptomes of all three species. After clustering transcripts with over 95% similarity, we found 112,998 transcripts present in C. allenii, 80,208 in C. lasius, and 96,886 in C. villosissimus. Remapping rates were high in all three species: 99.2% in C. allenii, 99% in C. lasius and 99.1% in C. villosissimus. All three also had Benchmarking Universal Single-Copy Orthologs (BUSCO) scores of more than 80% (supplementary table S3, Supplementary Material online), suggesting high-quality transcriptomes with most single-copy orthologs from the Embryophyta database present in all species.
Due to the dominance of terpenoid compounds in the floral scent, we focused on the expression of TPS genes. TPS genes are responsible for producing terpenoid compounds. We found that 20 TPS genes were expressed in the floral transcriptome of C. allenii, compared to five in C. villosissimus and only one in C. lasius, when expression was defined as one transcript per million (TPM) in at least one sample of the species. Furthermore, we found that counts of TPS expression per TPS gene expressed were much higher in both C. allenii and C. villosissimus when compared with C. lasius (Fig. 3b). In fact, in C. allenii, a TPS is the highest expressed gene in the floral transcriptome, and TPSs are also the fourth and sixth most expressed genes in the transcriptome. On average, 12% of total expression can be attributed to TPSs in C. allenii, 0.17% in C. villosissimus, and 0.00014% in C. lasius, highlighting the orders of magnitude differences between species.
Floral Scent Loss Is Not Driven by Gene Loss in Costus
While the transcriptome assemblies can give us insight into levels of TPS expression, it is hard to determine gene copy number due to the redundancies in the assemblies. For this, we need genome data to allow us to determine the number of TPS genes in each species. There are currently three publicly available Costus genomes: Costus bracteatus (bee), C. lasius (hummingbird), and Costus pulverulentus (hummingbird) (Valderrama et al. 2022; Harenčár et al. 2023). Costus lasius and C. pulverulentus represent two independent transitions to hummingbird pollination (Kay and Grossenbacher 2022). To generate a data set of two hummingbird- and two bee-pollinated species, we assembled an additional high-quality reference genome for bee-pollinated C. allenii with a BUSCO score of 98.8% (supplementary table S4, Supplementary Material online for assembly statistics; supplementary fig. S8, Supplementary Material online to see alignment with C. lasius genome). We generated high-quality whole-genome annotations for each species with high completeness and consistency with all BUSCO scores over 90% (supplementary table S5, Supplementary Material online).
We annotated between 63% and 75% of each genome as repeat elements (supplementary tables S6 to S9, Supplementary Material online). We found that, of the elements that could be classified, retrotransposable elements represented the largest percentage of the genome. These are the most abundant form of repeat elements found in angiosperm genomes (Oliver et al. 2013). Many repeats were not classified into previously known families, perhaps suggesting the existence of new repeat elements.
We annotated a total of 64 TPSs in C. allenii, 44 in C. bracteatus, 40 in C. lasius, and 43 in C. pulverulentus. The TPSs belonged to previously described TPS families TPS-a, TPS-b, TPS-c, TPS-e/f, and TPS-g (Fig. 4; supplementary fig. S9, Supplementary Material online). Those TPSs responsible for floral scent biosynthesis are primarily found in the TPS-a, TPS-b, and TPS-g families. TPS-a members tend to produce monoterpenes, TPS-b members sesquiterpenes, and TPS-g members either monoterpenes or sesquiterpenes (Chen et al. 2011).
Fig. 4.
Phylogenetic analysis of TPS gene families. We found the highest rates of gene gain and lowest rates of gene loss for C. allenii with the other three species exhibiting mixes of loss and gains across the gene tree. The phylogeny was constructed in IQ-TREE with 1,000 bootstraps and model JTT + F + R4. TPS family classifications based on A. thaliana are illustrated.
We used computational analysis of gene family evolution (CAFE5) to model gene gain and loss across the species phylogeny for each TPS clade identified in Fig. 4 (Mendes et al. 2020). For the bee-pollinated species, we found that C. allenii has gains in four clades with losses in one while C. bracteatus shows gains in one clade and losses in four clades. For the hummingbird-pollinated species, we found that C. lasius has gains in two clades and losses in three, while C. pulverulentus exhibits no gains and losses in two clades. There is no clear correlation between pollination group and gene losses or gains with some evidence that C. allenii shows enhanced gene duplication. We also estimated gene duplications and losses for each species using a gene tree-species tree reconciliation algorithm, reconcILS (Mishra et al. 2024). The algorithm reconciles the gene tree and species tree considering not only duplication and loss events but also incomplete lineage sorting (ILS), which has been documented in Costus (Uckele et al. 2024). Using this approach, we identified the greatest number of lineage-specific duplication events for C. allenii (30), with 9 in C. bracteatus, 11 in C. lasius, and 5 in C. pulverulentus. We found similar numbers of lineage-specific loss events in each species with 7 in C. bracteatus, 8 in C. allenii, 9 in C. pulverulentus, and 14 in C. lasius. The lineage-specific duplications in C. allenii correspond to tandem duplicates in close proximity within the genome. We find support for increased lineage-specific duplications in C. allenii; however, our gene family analyses do not find evidence for increased gene losses in hummingbird-pollinated species.
We mapped the RNA-seq reads both to the C. allenii genome and the C. lasius genome. In C. allenii, we found a consistent percentage of successfully mapped reads attributed to TPSs as with the transcriptome (12% transcriptome, 13% mapped to either genome). We found some differences for both C. lasius and C. villosissimus. For the C. villosissimus transcriptome mapping, we found 0.17% of reads were attributed to TPSs, in comparison to 0.009% when mapped to either genome. Due to a lack of genome for C. villosissimus, it is unclear whether this could be due to genetic changes or rearrangements not captured by the available genomes. For C. lasius, we found only 0.00014% of reads mapped to the transcriptome were due to TPSs, in contrast to 0.011% of the genome-mapped reads.
When C. allenii reads were mapped to the C. allenii genome, we found high levels of multimapping. This is due to recent TPS duplications in C. allenii, which are highly expressed. To illustrate this, the highest gene expressed in C. allenii is TPS49, and the third most expressed “gene” consists of reads that multimap to TPS49 and TPS52. This clade is represented by only one gene in each of the other species. Other recent lineage-specific duplications include TPS2, TPS3, TPS4, and TPS5 where the majority of reads multimap due to sequence similarity between duplicates. Both of these clades have only one member in each of the other species with an available genome sequence (supplementary fig. S9, Supplementary Material online). This highlights the complexity of mapping to gene families with recent duplications, especially when they are highly expressed, meaning that the typical choice of ignoring multimapped reads will lose large percentages of mapped reads from these duplications. We, therefore, decided to carry out our differential expression analysis using the C. lasius genome.
By mapping all reads to the C. lasius genome, we identified 4,790 genes differentially expressed between C. allenii and C. lasius, 7,678 between C. allenii and C. villosissimus, and 6,191 between C. lasius and C. villosissimus. Of the 40 TPS genes found in the C. lasius genome, 15 were differentially expressed between C. allenii and C. lasius, with 13 of these upregulated in C. allenii. This includes TPS33, orthologous to the TPS49/52 clade, the most highly expressed genes in C. allenii. We found 19 TPSs that were differentially expressed between C. allenii and C. villosissimus, with 14 upregulated in C. allenii and 5 in C. villosissimus. Ten TPS genes were differentially expressed between C. lasius and C. villosissimus, with 7 upregulated in C. villosissimus and 3 in C. lasius. Overall, more TPSs were upregulated in C. allenii when compared with either C. lasius or C. villosissimus.
Discussion
Repeated evolutionary transitions provide unique opportunities to study the extent of convergence both at the phenotypic and molecular level, shedding light on the predictability of evolution. Transitions to hummingbird pollination are widespread in the Americas and are predicted to be associated with loss of floral scent; however, this has not been studied in a comparative framework across multiple independent transitions within the same genus. In this study, we combine comparative chemical ecology with genomics and transcriptomics and demonstrate that floral scent in Costus is dominated by terpenes and its evolution is highly associated with pollinator type. We found that hummingbird-pollinated Costus species have lost or reduced their floral scent across multiple independent evolutionary origins, whereas bee-pollinated species in the Americas exhibit either maintenance of floral scent or gain of a more diverse scent profile. We find that this is related to downregulation of genes associated with floral scent emission (in this case TPSs) in floral tissues of hummingbird-pollinated species but is not accompanied by loss of TPS genes.
Overall, we found that bee-pollinated species emitted richer and more diverse floral scents at higher abundances. We did not find any unique compounds associated with either pollinator group, suggesting that the hummingbird-pollinated species emitted a subset of that emitted by bee-pollinated species, at lower amounts. We found the floral scent was mainly composed of terpenoids, with a few aromatic compounds. The most prevalent compounds detected in our data set, such as beta-ocimene, are typical floral-scent compounds found in insect-pollinated flowers (Farré-Armengol et al. 2017). Interestingly, bee-pollinated Costus species showed much greater variation in their floral scents than hummingbird-pollinated species, with some having very diverse floral scents, yet others emitting far fewer compounds. Bee-pollinated species also show greater morphological variation, possibly reflecting variation in orchid bee diversity (Kay and Grossenbacher 2022). Furthermore, not all species in our data set follow the expected patterns. For example, Costus arabicus and Costus vinosus (bee-pollinated) emit little scent. It is unclear whether the variation in floral scent between different bee-pollinated species is related to pollinator preference, other abiotic or biotic factors, or historical contingency.
There are multiple hypotheses for why floral scent would be lost in hummingbird-pollinated species. Firstly, it could be because there is simply no selection to maintain the production due to poor sense of smell, or lack of preference shown for floral scent, by hummingbirds (Goldsmith and Goldsmith 1982; Knudsen et al. 2004; Driver and Balakrishnan 2021). If this were the case, we might expect to find a correlation between length of time following pollinator transition and extent of floral scent loss. However, our data show that downregulation of floral scent can occur on relatively quick evolutionary timescales. The Latin American radiation of Costus is dated at 3 million years (95% confidence interval [CI], 1.50 to 4.87) and so all transitions to hummingbird-pollination have occurred within this timeframe (Vargas et al. 2020). Costus wilsonii, C. lasius, and Costus spiralis are all hummingbird-pollinated species that are most closely related to bee-pollinated species, representing more recent evolutionary shifts to hummingbird pollination. Yet all three species emit little to no floral scent, suggesting there can be quick breakdown of floral scent emission following pollinator transitions. This observation is consistent with previous studies that have shown that floral scent often evolves rapidly in response to selection (Ramos and Schiestl 2019; Zu et al. 2020; Liu et al. 2024).
The rapid and repeated loss of scent in hummingbird-pollinated species suggests a shared strong selection pressure against floral scent emission. One potential mechanism is the energetic cost of terpene production. A similar argument has been made in the case of eye regression in cavefish. If vision is energetically costly to maintain, then it should be selected against when no longer useful (Moran et al. 2015; Sifuentes-Romero et al. 2023). This is supported by evidence of the energetic costs of maintaining a visual system (Moran et al. 2015). The metabolic cost of terpene production is unknown but is thought to be minimal (Raguso 2016; Pichersky and Raguso 2018). In fact, it has been argued that the complex network of terpene metabolism, in which individual enzymes produce multiple products, makes terpene pathways more efficient, compared with a linear pathway with separate enzymes and regulators for each step (Lanier et al. 2023).
An alternative scenario to the energetic cost is that terpene production has ecological costs in hummingbird-pollinated species. Floral scent may attract florivores and floral larcenists in some systems (Galen et al. 2011). In Costus, most floral larceny is by small hummingbirds (e.g. Phaethornis striigularis and Eupherusa eximia) that pierce the base of the flower under the tough floral bract to obtain nectar, and they rob bee- and hummingbird-pollinated Costus at similar rates (KMK, unpublished data). Weevils (likely Cholus sp.) have also been observed occasionally consuming flowers of both pollination syndromes (although consumption rates were not recorded). Thus, it is unlikely that floral scent loss functions to deter florivory. We propose that the selection for floral scent reduction is most likely the result of strong selection against bee attraction (Castellanos et al. 2004; Bergamo et al. 2016). Birds do not consume pollen, leading to less pollen loss, and birds can travel longer distances between plants and promote higher gene flow, which is beneficial to combat inbreeding depression (Thomson and Wilson 2008; Krauss et al. 2017; Dellinger et al. 2022). One caveat is that comparisons in pollination effectiveness between bees and hummingbirds have not been carried out in orchid bees. Both male and female orchid bees tend to forage over long distances, perhaps resulting in a more “bird-like” pollen transport (Janzen 1971, 1981; Wikelski et al. 2010; Opedal et al. 2017; Kay and Grossenbacher 2022; Gamba and Muchhala 2023). While pollinator efficiency is unknown in Costus, rapid floral scent loss is congruent with the hypothesis of “bee avoidance” following the establishment of hummingbird pollination.
A final piece of evidence to support bee repellence is the large reduction in beta-ocimene emission in hummingbird-pollinated flowers relative to bee-pollinated flowers, as this compound is a common insect pollinator attractant (Farré-Armengol et al. 2017). In Mimulus, the loss of monoterpenes, including beta-ocimene, maintains reproductive isolation between bee- and hummingbird-pollinated species (Byers et al. 2014). Interestingly, we found that α- and β-pinene were emitted by both bee- and hummingbird-pollinated species. One potential reason is that pinenes do not repel orchid bees but do repel other types of bees. Terpenoid-dominated scents have been found to repel honey bees (Larue et al. 2016), and in particular, α-pinene has a repellent effect on honey bees (Fernandes et al. 2019). Furthermore, α-pinene has been found in other hummingbird-pollinated species with a suggested role as a moth pollinator deterrent (Bischoff et al. 2014). We speculate that bee-pollinated species emit a mix of compounds, some of which potentially attract orchid bees, and some of which potentially repel other pollinators to narrow attraction to orchid bees, specifically. Then, when floral scent is reduced in hummingbird-pollinated flowers, the compounds that are attractive to orchid bees are lost but not those that are general bee repellants. Further experimental work is needed to test these hypotheses.
We found that the floral scents of Costus are dominated by terpene compounds. Although few Zingiberales have described floral scents, our results are consistent with previous reports from the genus Hedychium that also found terpene-dominated floral scents (Báez et al. 2011; Zhou et al. 2022). Terpene diversity in a given organism is mainly determined by the TPS gene family, which carries out crucial steps in terpene formation (Tholl 2006). We did not find evidence that hummingbird-pollinated species with an available genome sequence (C. lasius and C. pulverulentus) exhibited increased rates of TPS gene loss than the bee-pollinated species. There are multiple reasons why trait loss would be associated with changes in gene regulation rather than gene loss. While C. lasius represents a more recent transition to hummingbird pollination than C. pulverulentus, both transitions have occurred within the last ∼3 million years (95% CI, 1.50 to 4.87) as Costus radiated in Latin America (Vargas et al. 2020). There may not have been enough time for pseudogenization to occur, a process estimated to take 0.5 to 6 million years (Marshall et al. 1994; Esfeld et al. 2018). Potentially, the genes will eventually be lost but are currently in the first phase, reduction in expression, which according to the “step-wise” model to pseudogenization is then followed by relaxed selection and then loss-of-function mutations (Graham et al. 2023).
Alternatively, we might not expect any TPS genes to be lost even in the long term. Terpenes, and therefore TPSs, are important for many other roles in plants. Plants use terpenes to deter herbivores, to attract parasitoids or predators of herbivores, and as antifungal and antibacterial compounds to prevent disease (Gershenzon and Dudareva 2007). These pleiotropic functions of TPSs could explain why we observe downregulation in floral tissues specifically rather than any pseudogenization. One argument against this is that in other systems TPS pseudogenization does occur following pollinator transitions. In Mimulus, loss of monoterpene production (specifically beta-ocimene) is not due to expression differences but loss-of-function mutations (Peng et al. 2017). It is unclear whether pseudogenization will occur at some point in Costus.
Although we did not observe gene loss in hummingbird-pollinated lineages, we did observe extensive lineage-specific duplications in the bee-pollinated species C. allenii. Costus allenii was one of the species with the highest scent diversity in our data set, and we found multiple TPS genes to exhibit several recent lineage-specific duplication events. We also found extremely high TPS expression, accounting for approximately 12% of expression in floral tissue. The gene with the highest expression in floral tissue was found to be a TPS-a family member, with high expression also found for several TPS-b members. These two families produce sesquiterpenes and monoterpenes, respectively, both of which are found in the floral scent of C. allenii. In C. lasius and C. villosissimus, we found reduced TPS expression compared to C. allenii, as expected based off their reduced floral scent emission. We also saw high levels of phenotypic variation in C. villosissimus, however, there is no publicly available genome, making interpretation of the results more difficult. We speculate that C. allenii has higher floral scent emission than closely related C. villosissimus because C. allenii grows as isolated individuals in deeply shaded, perennially wet forest, whereas C. villosissimus grows at forest edges and large treefall gaps and often has conspecifics nearby (Chen and Schemske 2015). Thus, C. allenii may have evolved the emission of stronger floral scent to attract pollinators in dense forests where flowers are less visible and pollinators are less abundant. While the overall trends are clear, we hesitate to overinterpret the gene expression data as we have only sampled at one time point and it is possible that there is floral scent emission happening earlier that is not captured in our data.
We observe more consistency and predictability in the phenotypic patterns than the underlying molecular mechanisms. In many systems, parallel changes in floral scent have accompanied independent pollinator shifts, including the current study (Knudsen et al. 2004; Byers et al. 2014). This can also be seen in other reproductive shifts, such as transitions to self-fertilization (Woźniak et al. 2022). In some systems, the underlying molecular mechanisms of floral scent loss are also convergent, having been shown to involve the same loci in multiple independent evolutionary events. For example, loss of floral scent in different Mimulus lineages is due to changes at the same loci, including frameshifts, deletions, and potentially posttranscriptional regulation (Peng et al. 2017). A similar pattern was found in two different systems that have lost benzaldehyde floral scent production due to either evolution of self-fertilization (Sas et al. 2016) or transition to hummingbird pollination (Amrad et al. 2016). The same locus contains loss-of-function mutations in both cases (Raguso 2016). However, this is not universally true. In Capsella, the loss of beta-ocimene following transition to self-fertilization in one lineage is due to changes in subcellular enzyme localization, but genetic mapping results suggest another locus is responsible for scent loss in a related species (Woźniak et al. 2022). In this study, we saw no evidence for pseudogenization. Instead, we found evidence that gene expression differences are responsible for scent differences between Costus species, with scent loss likely due to changes in, potentially tissue-specific, gene expression.
In summary, we have shown that Costus species exhibit repeated loss of floral scent following transitions from bee to hummingbird pollination, suggesting a shared strong selection pressure favoring scent loss. We also find bee-pollinated species that have recently gained more diverse scents, which, at least in C. allenii, is associated with lineage-specific TPS duplications. We show that scent loss is not associated with loss of TPS genes but rather appears to be due to gene downregulation. This demonstrates the capacity for rapid metabolic changes in response to selection following pollinator transitions. This system provides us with the exciting opportunity to identify the cis- or trans-regulatory changes driving gene expression differences in multiple independent transitions to hummingbird pollination, allowing us to study the extent of genetic convergence.
Materials and Methods
Floral Scent Sample Collection
We collected 100 samples of 30 species of Costus between 2019 and 2023 (supplementary table S1, Supplementary Material online). We sampled an average of 4 individuals per species for bee-pollinated species and 3 for hummingbird-pollinated species. We, therefore, expect that missing compounds due to undersampling will occur to a similar extent for both bee- and hummingbird-pollinated species. All samples for Costus glaucus, Costus laevis, C. montanus, C. stenophyllus, and C. wilsonii were collected in Costa Rica (La Gamba, Las Cruces, Las Alturas, and Monteverde). All other species were sampled in the University of California, Santa Cruz (UCSC) greenhouses in Santa Cruz, California, USA from plants composing the UCSC living Costus collection. For each day of sampling, both floral scent samples and ambient samples were collected (with one exception: a Costus scaber sample collected on 2019 April 8 was missing a control and so we used a control from the nearest date available, March 7). Scent was sampled from a single flower using dynamic headspace sampling from approximately 9 AM until 3 PM (6 h). This time point was chosen as it is the window of most pollinator activity during the lifespan of the flower (Kay and Schemske 2003). Flowers from living plants were placed inside oven bags (Reynold's). A Spectrex PAS-500 air pump with Tygon tubing was used to pull scented air through a volatile trap for 6 h at a flow rate of 200 mL air/min. The volatile trap contained a filter of glass wool and 50 mg Porapak. We conditioned scent traps by passing 5 mL of hexane before usage. Volatiles were eluted from the trap using 400 μL of hexane and stored at −20 °C.
Chemical Analysis
Samples were analyzed using Agilent model 5977A mass-selective detector with an Agilent GC model 7890B. Samples were run on a HP-5 Ultra Inert column (Agilent, 30 m × 0.25 mm, 0.25 μm). For analysis, 1 μL of each sample was injected in splitless mode using the ALS 7694 autosampler with helium as the carrier gas (250 °C injector temperature). The program started at 40 °C for 3 min and then rose at 5 °C/min to 210 °C. The temperature was held at 210 °C for 1 min, before rising at 20 °C/min to 300 °C where it was held for 2.5 min, and 315 °C for a final minute.
Compounds were identified by comparing mass spectra and retention indices with reference libraries. The identity of some compounds was confirmed by comparison to authentic reference standards (see supplementary table S2, Supplementary Material online). Compounds that were not found in at least two samples were removed. A two-step process was used to filter data sets for floral scent components. First, compounds were filtered to include only those that are found in five times higher amounts in floral than ambient samples using the “filter_ambient_ratio” function from the bouquet package (Powers et al. 2023). This resulted in a list of ambient compounds that were not considered floral scent compounds. Second, using the raw data matrices, we subtracted the ambient compound amounts from the corresponding floral scent amount collected on the same date. We then removed the ambient compounds identified by the package bouquet from this subtracted matrix to create the final data matrix of floral scent.
Statistical Analyses of Floral Scent
To visualize divergence in floral scent among samples, we used NMDS using the “metaMDS” function in vegan (Oksanen et al. 2020). To test for differences both between pollination groups, and between floral and ambient samples, we carried out a PERMANOVA using the “adonis2” function in vegan (Bray–Curtis distance matrix, 1,000 permutations). To identify which groups were significantly different, we carried out post hoc pairwise testing using the “pairwise.perm.MANOVA” function in the RVAideMemoire package with a Bonferroni correction (Hervé 2021). We did this for both filtered and unfiltered data sets using the raw peak areas for each compound included in the analysis.
We tested for differences in compound richness emitted by bee-pollinated and hummingbird-pollinated species using ANOVA with a nested model including species within pollinator group. To test for differences in the diversity of compounds emitted by bee-pollinated and hummingbird-pollinated species, we used the package chemodiv (Petrén et al. 2023). This package considers the biochemical and structural diversity of the compounds present in a sample to calculate the overall chemodiversity per sample. First, we used the “calcDiv” function to calculate the functional Hill diversity index for each sample. Then, we carried out an ANOVA test for the diversity indices, using a nested model of species nested within pollinator group to test for differences in chemodiversity. We also tested for differences in compound emission using ANOVA. In this case, a proxy for compound amount was used: the sum of the total area of floral scent peaks in the sample or the total ion abundance in the sample. In addition to this proxy, we also added 82.9 μg of 2-undecanone (10 μL of a 1:100 dilution in hexane) to all C. allenii and C. lasius samples to calculate absolute abundances of each compound for all 15 samples collected for these species.
To test for difference in compound presence between bee- and hummingbird-pollinated samples, we used GLMs with a binomial error distribution and logit link function. The response variable in our model was “presence” or “absence” of a specific compound. The explanatory variables were pollinator group and species with species nested within pollinator group (compound presence ∼ pollinator group/species). Significance was determined with likelihood ratio tests. We adjusted the P-values using the “p.adjust” function in R. To identify groups, or modules, or covarying compounds, we used the R package corrplot (Wei and Simko 2021). We performed hierarchical clustering based on Pearson correlation coefficients.
Phylogenetic Analyses of Floral Scent
To investigate the evolution of floral scent in a phylogenetic context, we used a previously published phylogeny (Vargas et al. 2020). First, we trimmed the phylogeny to include only those species for which we have scent data using the “drop.tip” function in ape (Paradis and Schliep 2019). To test for correlation between pollination group and the phylogenetic covariance matrix, we used a two-block partial least squares analysis using the “two.b.pls” function in the geomorph package (Adams and Otárola-Castillo 2013; Adams and Collyer 2018). A significant result means that our power to test for differences between groups using phylogenetic simulation-based is weakened due to group aggregation on the phylogeny. We then used the “phylosig” function in the package phytools to test for phylogenetic signal in multiple traits (compound richness, compound diversity, compound amount, (E)-beta-ocimene emission, and (E)-beta-caryophyllene emission) (Revell 2012). We also tested if floral scent data overall show phylogenetic signal using the multivariate generalization of Bloomberg's K (Kmult) (Adams 2014). Under Brownian motion, Kmult is expected to be 1. If Kmult > 1, higher phylogenetic signal is detected than the null expectation under Brownian motion. If Kmult < 1, lower phylogenetic signal is detected than the null expectation under Brownian motion. To calculate the Kmult of Costus floral scent, we used the filtered data set with 1,000 iterations to test significance. We carried out the analysis using the “physignal” function in the geomorph package (Adams and Otárola-Castillo 2013). Due to the weak phylogenetic signal detected in both multivariate and univariate analyses, we used the “procD.pgls” function in the RRPP package to test for differences in traits between pollinator groups rather than a simulation-based ANOVA (Adams and Collyer 2018; Collyer and Adams 2018). This function uses a method of randomizing residuals in a permutation procedure to evaluate the significance of phenotypic differences between groups considering the phylogeny and has higher statistical power than methods using phylogenetic simulations (Adams and Collyer 2018). We set lambda as calculated for the univariate variables. We corrected for multiple testing using the “p.adjust” function in R.
To better understand how floral scent has evolved in the genus Costus, we carried out an ancestral state reconstruction of the compound richness. The species-level phylogeny from (Vargas et al. 2020) contains 24 hummingbird-pollinated species, 25 bee-pollinated species, and 3 bee-pollinated African outgroups. Our subsampled phylogeny contains samples in proportion to the full phylogeny (13 hummingbird-pollinated species, 13 bee-pollinated species, and 1 bee-pollinated African outgroup). We note that there are limitations to this subsampled phylogeny but we find value in using it to infer general trends and not values at specific nodes. To do this, we used the function “ace” in the ape package for ancestral character estimation (Paradis and Schliep 2019). We divided the compound richness trait into 12 categories, each of which were assigned a different shade of blue. We plotted both the compound richness for each phylogenetic tip label and internal node to illustrate how floral scent has evolved in Costus.
Sample Collection for RNA-seq
Live individuals of C. allenii and C. villosissimus were collected in Soberanía National Park, Colón Province, Panama. Vouchers are deposited at University of Panama (C. allenii: 0132264 and 0132265; C. villosissimus: 0132270). Live individuals of C. lasius were collected in El Valle de Antón, Cocle Province, Panama. Voucher is deposited in Michigan State University herbarium (Kay 0321). For all three species, the individuals used for RNA-seq analyses were grown in greenhouses at UCSC. Freshly opened flowers of each species were removed at approximately 9 AM and flash frozen immediately in liquid nitrogen before being stored at −80 °C until extraction. Flowers of Costus open early in the morning and fall off by mid-afternoon.
RNA Extraction and Sequencing
Frozen flowers were removed and dissected on dry ice. Tissue from the petals and labella was broken down using a TissueLyser (Qiagen). RNA was then extracted using an RNeasy Mini Kit (Qiagen) with Qiashredder spin columns (Qiagen) following standard protocols. Samples were sent to Novogene for quality control, library preparation, and sequencing. Our final data set consisted of 19 samples: 8 C. allenii (from 4 individual plants), 6 C. lasius (from 2 individual plants), and 5 C. villosissimus (from 1 individual plant). Samples were sequenced on a NovaSeq 6000 using 150-bp paired-end reads. This generated approximately 53 million reads per library (mean = 52.74 million, SD = 12.01 million, N = 19). The raw sequence data were deposited in the SRA under the BioProject accession PRJNA1010186.
Transcriptome Assembly
Before assembling transcriptomes, we trimmed the raw reads using TrimGalore! (Martin 2011). We then assembled transcriptomes for each species using Trinity version 2.5.1 with default settings (Grabherr et al. 2011; Haas et al. 2013). To reduce redundancy, we clustered contigs with a minimum sequence identity of 95% using CD-HIT version 4.6.8 (Fu et al. 2012). We assessed transcriptome completeness and quality using BUSCO (v5.6.1) (embryophyta_odb10 data set) (Simão et al. 2015; Manni et al. 2021; Zdobnov et al. 2021). We also checked assembly quality by mapping RNA-seq reads back to the assembly using Bowtie (v2.4.1) to calculate mapping rates (Langmead and Salzberg 2012).
TPS Gene Expression Analysis
We extracted the longest isoforms for each Trinity “gene” using the Trinity script “get_longest_isoform_seq_per_trinity_gene.pl” and translated these sequences. We then searched for potential TPSs using hmmsearch two HMM profiles: Terpene_syth_C (PF03936) and TPS N-terminal domain (PF01397). We used the R package rhmmer to parse the output from hmmsearch resulting in a list of putative TPS sequences for each species. To reduce the TPSs to only those expressed in the data set, we used a threshold of TPM > 1 (transcripts per million) in at least one individual of a species (Eddy 2011; Arendsee 2017). To calculate expression of each putative TPS, we normalized counts using gene length corrected trimmed mean of M-values (GeTMM), an approach that improves intersample and intrasample comparisons (Smid et al. 2018). To do this, we first calculated fragments per kilobase of transcript per million fragments mapped for each transcript and carried out TMM normalization using the package edgeR (Robinson et al. 2010). We then combined GeTMM values for transcripts that were identified by Trinity to be the same gene. This does not normalize counts between species but is used to determine whether genes are lowly or highly expressed within a species.
Costus allenii Genome Assembly
We selected a wild-collected individual C. allenii from Pipeline Road, Soberanía National Park, Colón Province, Panama for de novo genome assembly. A voucher for this population of C. allenii is deposited at the University of Panama Herbarium (PMA: 0132264). This individual was grown to flowering in the UCSC greenhouses from a cutting, and we harvested 1.5 g fresh tissue from leaf meristems and flower buds. We isolated cell nuclei following Workman (Workman et al. 2019) and then used the Circulomics Nanobind Plant Nuclei Big DNA kit to extract HMW DNA. We assessed DNA concentration with a Qubit Fluorometer (Thermo Fisher Scientific) and purity with 260/280 and 260/230 absorbance ratios from a NanoDrop Spectrophotometer (Thermo Fisher Scientific) and ran a 7% agarose gel at 70 V for over 8 h to assess DNA fragment integrity and size. HMW DNA was sent to the UC Davis Genomics Center for additional cleaning, Pacific Biosciences HiFi library preparation, and sequencing on one PacBio Revio SMRT cell. This produced 74.8 Gb of HiFi yield and 7.9 M HiFi reads. We assembled the PacBio data into a primary and alternative assembly using HiFiasm (v0.19.8) (Cheng et al. 2021). We then used RagTag (v2.1.0) (Alonge et al. 2022) to scaffold C. allenii contigs using the closely related C. lasius as a reference. We used D-GENIES (Cabanettes and Klopp 2018) with Minimap2 (Li 2018) to generate a dotplot comparing the two assemblies both before and after scaffolding. The genome assembly quality was assessed using BUSCO v5.6.1 (Simão et al. 2015). Raw sequence data and the final genome assembly were submitted to the National Center for Biotechnology Information (NCBI) under BioProject PRJNA1091680.
Genome Annotations
Of the three species for which we generated RNA-seq data, an unannotated genome was previously available for C. lasius (Harenčár et al. 2023). In this study, we also generated a genome assembly for C. allenii. In addition, previously published genomes are available for both C. bracteatus and the more distantly related C. pulverulentus (Valderrama et al. 2022; Harenčár et al. 2023). We started by processing RNA-seq data as evidence for annotations. Trimmed reads from C. lasius were mapped using 2pass mapping in STAR to the C. lasius genome and the C. pulverulentus genomes (Martin 2011; Dobin et al. 2013). Bam files were concatenated for use as described below. Trimmed reads from C. allenii were mapped using 2pass mapping in STAR to both the C. allenii and the closely related C. bracteatus genomes and bam files concatenated per species (Martin 2011; Dobin et al. 2013).
We annotated the genomes using BRAKER3 (v3.0.7), which combines RNA-seq and protein data in an automated pipeline (Lomsadze et al. 2005, 2014; Stanke et al. 2006, 2008; Gotoh 2008; Iwata and Gotoh 2012; Buchfink et al. 2015; Hoff et al. 2016, 2019; Kovaka et al. 2019; Brůna et al. 2020, 2021; Pertea and Pertea 2020; Gabriel et al. 2023). This pipeline requires a masked genome. We identified and annotated tandem repeats and transposable elements using RepeatModeler (Flynn et al. 2020) (v2.0.4 for C. lasius and v2.0.5 for the other genomes) and RepeatMasker v4.1.5 (Tarailo-Graovac and Chen 2009). We then used RepeatMasker with the outputs of RepeatModeler to generate softmasked genomes. The masked genomes were then provided to BRAKER along with plant proteins downloaded from OrthoDB (odb10_plants) (Zdobnov et al. 2021). For all species, BRAKER was run in ETP mode that combines evidence from both proteins and RNA-seq (Gabriel et al. 2021). BUSCO v5.6.1 (Simão et al. 2015) and OMArk (Nevers et al. 2025) were used to evaluate the completeness and consistency of the sets of gene models. During the project, the C. lasius genome was updated (direction of arms on chromosome 1 was altered) and so we rescaffolded chromosome 1 of C. allenii and transferred our annotation using liftoff (Shumate and Salzberg 2021).
TPS Gene Family Annotation
Following whole-genome annotation, we manually curated the gene models for TPS genes in the annotation. To do this, we used bitacora (Vizueta et al. 2020), a pipeline that curates an existing annotation and finds additional gene family members. In this pipeline, BLASTP and hmmer are used to search for gene family members of interest within the existing annotation (Altschul et al. 1990; Eddy 2011). Additional regions of the genome are then searched using TBLASTN and are annotated using GeMoMa (Keilwagen et al. 2019). hmmer is used to validate the gene models. As input, we used two HMM profiles: Terpene_syth_C (PF03936) and TPS N-terminal domain (PF01397) downloaded from Pfam. In addition, we provided previously annotated genes from Arabidopsis thaliana and Oryza sativa (Chen et al. 2011; Yu et al. 2020; Jia et al. 2022). Gene models from bitacora and the genome-wide annotation from BRAKER were compared and TPS genes were manually curated using IGV (Robinson et al. 2011). We also searched for additional TPSs by carrying out exonerate searches using the curated Costus gene models (Slater and Birney 2005). We checked all annotated TPSs for complete protein domains using the NCBI conserved domain search (Marchler-Bauer et al. 2015). We excluded proteins with <250 amino acids.
Gene Family Analysis
To classify these TPSs in previously described TPS families, we constructed a phylogenetic tree based on amino acid sequences with TPSs from A. thaliana. The Costus TPS protein sequences, combined with those from A. thaliana, were aligned with MAFFT v7.508 (-maxiterate 1000, using L-INS-I algorithm) (Katoh and Standley 2013). We then trimmed the alignment using trimAl (-gt0.6, sites only included when present in 60% of sequences) (Capella-Gutierrez et al. 2009). We used this alignment to construct a gene tree with IQ-TREE and the ModelFinder function to determine the best-fit model (Nguyen et al. 2015; Kalyaanamoorthy et al. 2017; Hoang et al. 2018). We rooted the tree using the phytools package in R (“midpoint.root” function) (Revell 2012; R Core Team 2023). We plotted the resulting gene trees using the following packages in R: ape (Paradis and Schliep 2019), evobiR (Blackmon and Adams 2015), ggnewscale (Campitelli 2023), ggstar (Xu 2022), ggtree (Yu et al. 2017), ggtreeExtra (Xu et al. 2021), and tidytree (Yu 2022).
To investigate patterns of gene family evolution, we used computational analysis of gene family evolution CAFE (v5.1), which uses a birth–death model of gene family evolution to model gene gain and loss across a species tree (Mendes et al. 2020). The required input files are a species tree which we adapted from (Vargas et al. 2020), and gene counts for each species in each clade (as identified in Fig. 4).
We also took an alternative approach by using reconcILS to reconcile our gene tree and species tree to estimate gene duplications and losses for each species (Mishra et al. 2024). The algorithm considers not only duplication and loss events but also ILS. To create a gene tree, we aligned all Costus TPS nucleotide sequences with MAFFT v7.508 (-maxiterate 1000, using L-INS-I algorithm) (Katoh and Standley 2013). The alignment was trimmed using trimAl (-gt0.6, sites only included when present in 60% of sequences) (Capella-Gutierrez et al. 2009) and used to construct a gene tree with IQ-TREE and the ModelFinder function to determine the best-fit model (Nguyen et al. 2015; Kalyaanamoorthy et al. 2017; Hoang et al. 2018). We rooted the tree using the phytools package in R (“midpoint.root” function) (Revell 2012; R Core Team 2023). The species tree was adapted from (Vargas et al. 2020). We used the default parameter settings.
Differential Expression Analysis
First, we trimmed the reads using TrimGalore! (Martin 2011). We then mapped the reads from all three species to both the C. allenii and C. lasius genome (Harenčár et al. 2023). We ran 2pass mapping using STAR (Dobin et al. 2013). In general, we found higher levels of multimapped C. allenii reads when mapped to C. allenii. To investigate this, we used mmquant (with merge, -m, set to 10) to count the reads, a tool that can account for multimapping reads (Zytnicki 2017). The output from mmquant was read into R for further analysis. To calculate the percentage of gene expression due to TPS expression, we normalized reads by library size using TMM (Robinson and Oshlack 2010). Due to the consistency in TPS expression as calculated by mapping to both genomes, we decided to use the C. lasius genome assembly to identify genes differentially expressed between the three species to remove the complication of multimapping. To do this, we first counted reads using featureCounts (Liao et al. 2014) and again read the output into R. We filtered genes to include those with at least one count per million in at least one library and normalized reads by library size using TMM. Differential gene expression was evaluated using the nonparametric “noiseqbio” function from the NOISeq package (Tarazona et al. 2011, 2015). We used the default cut-off of q = 0.95 in the “degenes” function, which is equivalent to an adjusted P-value of 0.05.
Plotting and Data Manipulation
Additional packages used for general plotting: cowplot (Wilke 2020), ggplot2 (Wickham 2009), ggtree (Yu et al. 2017), gplots (Warnes et al. 2024), scico (Pedersen and Crameri 2023), and tidytree (Yu 2022). Packages used for data transformation and manipulation include data.table (Barrett et al. 2024), dplyr (Wickham et al. 2021), knitr (Xie 2023), magrittr (Bache and Wickham 2023) readr (Wickham et al. 2023), reshape2 (Wickham 2007), and tibble (Müller and Wickham 2022). Analyses were carried out in R version 4.3.2 (R Core Team 2023).
Supplementary Material
Acknowledgments
We thank the Organization for Tropical Studies, Estación Biológica Monteverde, and Estación Tropical La Gamba for facilitating field work in Costa Rica and the Smithsonian Tropical Research Institute for facilitating field work in Panama. All field research was conducted with appropriate research permits in Costa Rica (M-P-SINAC-PNI-ACAT-026-2018, ACC-PI-0272018, MPC-SINAC-PNI-ACLAP-020-2018, INV-ACOSA-076-18, MPC-SINAC-PNI-ACTo-020-18, CONAGEBIO R-056-2019OT, and R-058-2019-OT) and Panama (SE/AP-13-19, 09-99) We thank Julia Harenčár for assistance with lab work, Pedro Juarez and Cecilia Girvin for assistance with field sampling, and Alyssa Kaatmann, Aubrie Tait, Kate Uckele, Melina Sauerman Carrizosa, and Selena Vengco for assistance with greenhouse sampling. We thank J. Velzy and S. Childress for greenhouse care of the UCSC living Costus collection. We thank the Ramírez lab for helpful discussions.
Contributor Information
Kathy Darragh, Department of Evolution and Ecology, University of California, Davis, CA, USA; Department of Biology, Indiana University, Bloomington, IN, USA.
Kathleen M Kay, Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA, USA.
Santiago R Ramírez, Department of Evolution and Ecology, University of California, Davis, CA, USA.
Supplementary Material
Supplementary material is available at Molecular Biology and Evolution online.
Author Contributions
Conceptualization: K.D., K.M.K., and S.R.R. Investigation: K.D. and K.M.K. Formal analysis: K.D. Visualization: K.D. Resources: K.M.K. and S.R.R. Writing—original draft: K.D. Writing—review and editing: K.D., K.M.K., and S.R.R. Supervision: K.M.K. and S.R.R. Project administration: K.M.K. and S.R.R. Funding acquisition: K.M.K. and S.R.R.
Funding
This work was funded by the National Science Foundation by a Dimensions of Biodiversity grant awarded to K.M.K. (DEB-1737889) and S.R.R. (DEB-1737771) and the Jean H. Langenheim Chair in Plant Ecology and Evolution held by K.M.K.
Data Availability
Data and R scripts used for analysis are available from Open Science Framework: https://osf.io/2ap4k/. The raw sequence reads and final genome assembly for Costus allenii are deposited in the SRA (BioProject PRJNA1091680). The raw RNA-seq data are deposited in the SRA (BioProject PRJNA1010186).
References
- Abrahamczyk S, Renner SS. The temporal build-up of hummingbird/plant mutualisms in North America and temperate South America. BMC Evol Biol. 2015:15:104. 10.1186/s12862-015-0388-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Adams DC. A generalized K statistic for estimating phylogenetic signal from shape and other high-dimensional multivariate data. Syst Biol. 2014:63(5):685–697. 10.1093/sysbio/syu030. [DOI] [PubMed] [Google Scholar]
- Adams DC, Collyer ML. Phylogenetic ANOVA: group-clade aggregation, biological challenges, and a refined permutation procedure. Evolution. 2018:72(6):1204–1215. 10.1111/evo.13492. [DOI] [PubMed] [Google Scholar]
- Adams DC, Otárola-Castillo E. Geomorph: an R package for the collection and analysis of geometric morphometric shape data. Methods Ecol Evol. 2013:4(4):393–399. 10.1111/2041-210X.12035. [DOI] [Google Scholar]
- Albert VA, Williams SE, Chase MW. Carnivorous plants: phylogeny and structural evolution. Science. 1992:257(5076):1491–1495. 10.1126/science.1523408. [DOI] [PubMed] [Google Scholar]
- Alonge M, Lebeigle L, Kirsche M, Jenike K, Ou S, Aganezov S, Wang X, Lippman ZB, Schatz MC, Soyk S. Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing. Genome Biol. 2022:23(1):258. 10.1186/s13059-022-02823-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990:215(3):403–410. 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- Amrad A, Moser M, Mandel T, de Vries M, Schuurink RC, Freitas L, Kuhlemeier C. Gain and loss of floral scent production through changes in structural genes during pollinator-mediated speciation. Curr Biol. 2016:26(24):3303–3312. 10.1016/j.cub.2016.10.023. [DOI] [PubMed] [Google Scholar]
- Arendsee Z. rhmmer: utilities parsing “HMMER” results; 2017. [accessed 2023 Sept 12]. https://CRAN.R-project.org/package=rhmmer.
- Bache SM, Wickham H. magrittr: a forward-pipe operator for R; 2023. [accessed 2023 May 17]. https://CRAN.R-project.org/package=magrittr.
- Báez D, Pino JA, Morales D. Floral scent composition in Hedychium coronarium J. Koenig analyzed by SPME. J Essent Oil Res. 2011:23(3):64–67. 10.1080/10412905.2011.9700460. [DOI] [Google Scholar]
- Barreto E, Boehm MMA, Ogutcen E, Abrahamczyk S, Kessler M, Bascompte J, Dellinger AS, Bello C, Dehling DM, Duchenne F, et al. Macroevolution of the plant–hummingbird pollination system. Biol Rev Camb Philos Soc. 2024:99(5):1831–1847. 10.1111/brv.13094. [DOI] [PubMed] [Google Scholar]
- Barrett T, Dowle M, Srinivasan A, Gorecki J, Chirico M, Hocking T. data.table: extension of ‘data.frame’; 2024. [accessed 2024 May 7]. https://r-datatable.com.
- Bergamo PJ, Rech AR, Brito VLG, Sazima M. Flower colour and visitation rates of Costus arabicus support the ‘bee avoidance’ hypothesis for red-reflecting hummingbird-pollinated flowers. Funct Ecol. 2016:30(5):710–720. 10.1111/1365-2435.12537. [DOI] [Google Scholar]
- Bischoff M, Jürgens A, Campbell DR. Floral scent in natural hybrids of Ipomopsis (Polemoniaceae) and their parental species. Ann Bot. 2014:113(3):533–544. 10.1093/aob/mct279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blackmon H, Adams RA. evobiR: tools for comparative analyses and teaching evolutionary biology; 2015. [accessed 2024 Dec 11]. https://zenodo.org/records/30938.
- Brůna T, Hoff KJ, Lomsadze A, Stanke M, Borodovsky M. BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database. NAR Genom Bioinform. 2021:3(1). 10.1093/nargab/lqaa108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brůna T, Lomsadze A, Borodovsky M. GeneMark-EP+: eukaryotic gene prediction with self-training in the space of genes and proteins. NAR Genom Bioinform. 2020:2(2). 10.1093/nargab/lqaa026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015:12(1):59–60. 10.1038/nmeth.3176. [DOI] [PubMed] [Google Scholar]
- Byers KJRP, Bradshaw HD, Riffell JA. Three floral volatiles contribute to differential pollinator attraction in monkeyflowers (Mimulus). J Exp Biol. 2014:217(Pt 4):614–623. 10.1242/jeb.092213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cabanettes F, Klopp C. D-GENIES: dot plot large genomes in an interactive, efficient and simple way. PeerJ. 2018:6:e4958. 10.7717/peerj.4958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Campitelli E. ggnewscale: multiple fill and colour scales in “ggplot2”; 2023. [accessed 2023 Sept 12]. https://CRAN.R-project.org/package=ggnewscale.
- Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T. trimAL: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009:25(15):1972–1973. 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Castellanos MC, Wilson P, Thomson JD. ‘Anti-bee’ and ‘pro-bird’ changes during the evolution of hummingbird pollination in Penstemon flowers. J Evol Biol. 2004:17(4):876–885. 10.1111/j.1420-9101.2004.00729.x. [DOI] [PubMed] [Google Scholar]
- Chen F, Tholl D, Bohlmann J, Pichersky E. The family of terpene synthases in plants: a mid-size family of genes for specialized metabolism that is highly diversified throughout the kingdom. Plant J Cell Mol Biol. 2011:66(1):212–229. 10.1111/j.1365-313X.2011.04520.x. [DOI] [PubMed] [Google Scholar]
- Chen GF, Schemske DW. Ecological differentiation and local adaptation in two sister species of Neotropical Costus (Costaceae). Ecology. 2015:96(2):440–449. 10.1890/14-0428.1. [DOI] [PubMed] [Google Scholar]
- Chen L, DeVries AL, Cheng C-HC. Convergent evolution of antifreeze glycoproteins in Antarctic notothenioid fish and Arctic cod. Proc Natl Acad Sci U S A. 1997:94(8):3817–3822. 10.1073/pnas.94.8.3817. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng H, Concepcion GT, Feng X, Zhang H, Li H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021:18(2):170–175. 10.1038/s41592-020-01056-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Collyer ML, Adams DC. RRPP: an R package for fitting linear models to high-dimensional data using residual randomization. Methods Ecol Evol. 2018:9(7):1772–1779. 10.1111/2041-210X.13029. [DOI] [Google Scholar]
- Dellinger AS, Paun O, Baar J, Temsch EM, Fernández-Fernández D, Schönenberger J. Population structure in Neotropical plants: integrating pollination biology, topography and climatic niches. Mol Ecol. 2022:31(8):2264–2280. 10.1111/mec.16403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013:29(1):15–21. 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Driver RJ. Evolution of olfactory receptors in birds [PhD diss]. East Carolina: East Carolina University; 2022. https://thescholarship.ecu.edu/handle/10342/12266.
- Driver RJ, Balakrishnan CN. Highly contiguous genomes improve the understanding of avian olfactory receptor repertoires. Integr Comp Biol. 2021:61(4):1281–1290. 10.1093/icb/icab150. [DOI] [PubMed] [Google Scholar]
- Eddy SR. Accelerated profile HMM searches. PLoS Comput Biol. 2011:7(10):e1002195. 10.1371/journal.pcbi.1002195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Esfeld K, Berardi AE, Moser M, Bossolini E, Freitas L, Kuhlemeier C. Pseudogenization and resurrection of a speciation gene. Curr Biol. 2018:28(23):3776–3786.e7. 10.1016/j.cub.2018.10.019. [DOI] [PubMed] [Google Scholar]
- Farré-Armengol G, Filella I, Llusià J, Peñuelas J. β-Ocimene, a key floral and foliar volatile involved in multiple interactions between plants and other organisms. Molecules. 2017:22(7):1148. 10.3390/molecules22071148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fernandes NS, Silva FAN, de Aragão FAS, Zocolo GJ, Freitas BM. Volatile organic compounds role in selective pollinator visits to commercial melon types. J Agric Sci. 2019:11(3):193. 10.5539/jas.v11n3p93. [DOI] [Google Scholar]
- Flynn JM, Hubley R, Goubert C, Rosen J, Clark AG, Feschotte C, Smit AF. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci. 2020:117(17):9451–9457. 10.1073/pnas.1921046117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fu L, Niu B, Zhu Z, Wu S, Li W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012:28(23):3150–3152. 10.1093/bioinformatics/bts565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gabriel L, Brůna T, Hoff KJ, Ebel M, Lomsadze A, Borodovsky M, Stanke M. BRAKER3: Fully Automated Genome Annotation Using RNA-Seq and Protein Evidence with GeneMark-ETP, AUGUSTUS and TSEBRA. biorXiv. 2023.. 10.1101/2023.06.10.544449. [DOI] [PMC free article] [PubMed]
- Gabriel L, Hoff KJ, Brůna T, Borodovsky M, Stanke M. TSEBRA: transcript selector for BRAKER. BMC Bioinform. 2021:22(1). 10.1186/s12859-021-04482-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galen C, Kaczorowski R, Todd SL, Geib J, Raguso RA. Dosage-dependent impacts of a floral volatile compound on pollinators, larcenists, and the potential for floral evolution in the alpine skypilot Polemonium viscosum. Am Nat. 2011:177(2):258–272. 10.1086/657993. [DOI] [PubMed] [Google Scholar]
- Gamba D, Muchhala N. Pollinator type strongly impacts gene flow within and among plant populations for six Neotropical species. Ecology. 2023:104(1):e3845. 10.1002/ecy.3845. [DOI] [PubMed] [Google Scholar]
- García-Robledo C, Erickson DL, Staines CL, Erwin TL, Kress WJ. Tropical plant–herbivore networks: reconstructing species interactions using DNA barcodes. PLoS One. 2013:8(1):e52967. 10.1371/journal.pone.0052967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gershenzon J, Dudareva N. The function of terpene natural products in the natural world. Nat Chem Biol. 2007:3(7):408–414. 10.1038/nchembio.2007.5. [DOI] [PubMed] [Google Scholar]
- Goldsmith KM, Goldsmith TH. Sense of smell in the black-chinned hummingbird. Condor. 1982:84(2):237. 10.2307/1367678. [DOI] [Google Scholar]
- Gotoh O. A space-efficient and accurate method for mapping and aligning cDNA sequences onto genomic sequence. Nucleic Acids Res. 2008:36(8):2630–2638. 10.1093/nar/gkn105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, et al. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat Biotechnol. 2011:29(7):644–652. 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Graham AM, Jamison JM, Bustos M, Cournoyer C, Michaels A, Presnell JS, Richter R, Crocker DE, Fustukjian A, Hunter ME, et al. Reduction of paraoxonase expression followed by inactivation across independent semiaquatic mammals suggests stepwise path to pseudogenization. Mol Biol Evol. 2023:40(5):msad104. 10.1093/molbev/msad104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grant KA, Grant V. Hummingbirds and their flowers. New York: Columbia University Press; 1968. [Google Scholar]
- Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, Couger MB, Eccles D, Li B, Lieber M, et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat Protoc. 2013:8(8):1494–1512. 10.1038/nprot.2013.084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harenčár J, Vargas OM, Escalona M, Schemske DW, Kay KM. Genome assemblies and comparison of two Neotropical spiral gingers: Costus pulverulentus and C. lasius. J Hered. 2023:114(3):286–293. 10.1093/jhered/esad018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hervé M. RVAideMemoire: testing and plotting procedures for biostatistics. R package version 0.9-81; 2021. [accessed 2022 Jan 28]. https://CRAN.R-project.org/package=RVAideMemoire.
- Hoang DT, Chernomor O, von Haeseler A, Minh BQ, Vinh LS. UFBoot2: improving the ultrafast bootstrap approximation. Mol Biol Evol. 2018:35(2):518–522. 10.1093/molbev/msx281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoff KJ, Lange S, Lomsadze A, Borodovsky M, Stanke M. BRAKER1: Unsupervised RNA-Seq-Based Genome Annotation with GeneMark-ET and AUGUSTUS. Bioinform. 2016:32(5):767–769. 10.1093/bioinformatics/btv661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoff KJ, Lomsadze A, Borodovsky M, Stanke M. Whole-genome annotation with BRAKER. Methods Mol Biol. 2019:1962:65–95. 10.1007/978-1-4939-9173-0_5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Iwata H, Gotoh O. Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features. Nucleic Acids Res. 2012:40(20):e161. 10.1093/nar/gks708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Janzen DH. Euglossine bees as long-distance pollinators of tropical plants. Science. 1971:171(3967):203–205. 10.1126/science.171.3967.203. [DOI] [PubMed] [Google Scholar]
- Janzen DH. Bee arrival at two Costa Rican female catasetum orchid inflorescences, and a hypothesis on euglossine population structure. Oikos. 1981:36(2):177–183. 10.2307/3544443. [DOI] [Google Scholar]
- Jia Q, Brown R, Köllner TG, Fu J, Chen X, Wong GK-S, Gershenzon J, Peters RJ, Chen F. Origin and early evolution of the plant terpene synthase family. Proc Natl Acad Sci. 2022:119(15). 10.1073/pnas.2100361119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017:14(6):587–589. 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013:30(4):772–780. 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kay KM, Grossenbacher DL. Evolutionary convergence on hummingbird pollination in Neotropical Costus provides insight into the causes of pollinator shifts. New Phytol. 2022:236(4):1572–1583. 10.1111/nph.18464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kay KM, Schemske DW. Pollinator assemblages and visitation rates for 11 species of Neotropical Costus (Costaceae). Biotropica. 2003:35:198–207. 10.1111/j.1744-7429.2003.tb00279.x. [DOI] [Google Scholar]
- Keilwagen J, Hartung F, Grau J. GeMoMa: homology-based gene prediction utilizing intron position conservation and RNA-seq data. Methods Mol Biol. 2019:1962:161–177. 10.1007/978-1-4939-9173-0_9. [DOI] [PubMed] [Google Scholar]
- Knudsen JT, Tollsten L, Groth I, Bergstöm G, Raguso RA. Trends in floral scent chemistry in pollination syndromes: floral scent composition in hummingbird-pollinated taxa. Bot J Linn Soc. 2004:146(2):191–199. 10.1111/j.1095-8339.2004.00329.x. [DOI] [Google Scholar]
- Kollmar M, Hoff KJ, Lomsadze A, Borodovsky M, Stanke M. Whole-genome annotation with BRAKER. Methods Mol Biol. 2019:1962:65–95. 10.1007/978-1-4939-9173-0_5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kovaka S, Zimin AV, Pertea GM, Razaghi R, Salzberg SL, Pertea M. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 2019:20(1). 10.1186/s13059-019-1910-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krauss SL, Phillips RD, Karron JD, Johnson SD, Roberts DG, Hopper SD. Novel consequences of bird pollination for plant mating. Trends Plant Sci. 2017:22(5):395–410. 10.1016/j.tplants.2017.03.005. [DOI] [PubMed] [Google Scholar]
- Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012:9(4):357–359. 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lanier ER, Andersen TB, Hamberger B. Plant terpene specialized metabolism: complex networks or simple linear pathways? Plant J. 2023:114(5):1178–1201. 10.1111/tpj.16177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Larue A-AC, Raguso RA, Junker RR. Experimental manipulation of floral scent bouquets restructures flower–visitor interactions in the field. J Anim Ecol. 2016:85(2):396–408. 10.1111/1365-2656.12441. [DOI] [PubMed] [Google Scholar]
- Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018:34(18):3094–3100. 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014:30(7):923–930. 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
- Liu JW, Milet-Pinheiro P, Gerlach G, Ayasse M, Nunes CEP, Alves-Dos-Santos I, Ramírez SR. Macroevolution of floral scent chemistry across radiations of male euglossine bee-pollinated plants. Evolution. 2024:78(1):98–110. 10.1093/evolut/qpad194. [DOI] [PubMed] [Google Scholar]
- Lomsadze A, Ter-Hovhannisyan V, Chernoff YO, Borodovsky M. Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Res. 2005:33(20):6494–6506. 10.1093/nar/gki937. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lomsadze A, Burns PD, Borodovsky M. Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm. Nucleic Acids Res. 2014:42(15):e119. 10.1093/nar/gku557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Losos JB. Convergence, adaptation, and constraint. Evolution. 2011:65(7):1827–1840. 10.1111/j.1558-5646.2011.01289.x. [DOI] [PubMed] [Google Scholar]
- Maas PJM. Flora neotropica, monograph no. 8. New York: Hafner; 1972. p. 1–139. [Google Scholar]
- Maas PJM. Flora neotropica, monograph no. 18. Bronx, NY: New York Botanical Garden; 1977. p.1–218. [Google Scholar]
- Manni M, Berkeley MR, Seppey M, Simão FA, Zdobnov EM. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol Biol Evol. 2021:38(10):4647–4654. 10.1093/molbev/msab199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marchler-Bauer A, Derbyshire MK, Gonzales NR, Lu S, Chitsaz F, Geer LY, Geer RC, He J, Gwadz M, Hurwitz DI, et al. CDD: NCBI’s conserved domain database. Nucleic Acids Res. 2015:43(D1):D222–D226. 10.1093/nar/gku1221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marshall CR, Raff EC, Raff RA. Dollo's law and the death and resurrection of genes. Proc Natl Acad Sci U S A. 1994:91(25):12283–12287. 10.1073/pnas.91.25.12283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011:17(1):10–12. 10.14806/ej.17.1.200. [DOI] [Google Scholar]
- Mendes FK, Vanderpool D, Fulton B, Hahn MW. CAFE 5 models variation in evolutionary rates among gene families. Bioinformatics. 2020:36(22-23):5516–5518. 10.1093/bioinformatics/btaa1022. [DOI] [PubMed] [Google Scholar]
- Mishra S, Smith ML, Hahn MW. reconcILS: a gene tree-species tree reconciliation algorithm that allows for incomplete lineage sorting. bioRxiv 2023.11.03.565544. https://www.biorxiv.org/content/10.1101/2023.11.03.565544v2, 16 November 2024, preprint: not peer reviewed.
- Moran D, Softley R, Warrant EJ. The energetic cost of vision and the evolution of eyeless Mexican cavefish. Sci Adv. 2015:1(8):e1500363. 10.1126/sciadv.1500363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Müller K, Wickham H. tibble: simple data frames; 2022. [accessed 2024 Dec 11]. https://github.com/tidyverse/tibble.
- Nevers Y, Warwick Vesztrocy A, Rossier V, Train C-M, Altenhoff A, Dessimoz C, Glover NM. Quality assessment of gene repertoire annotations with OMArk. Nat Biotechnol. 2025:43(1): 124–133. 10.1038/s41587-024-02147-w. [DOI] [PMC free article] [PubMed]
- Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015:32(1):268–274. 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oksanen J, Guillaume Blanchet F, Friendly M, Kindt R, Legendre P, McGlinn D, Minchin P, O’Hara R, Simpson G, Solymos P, et al. 2020. vegan: community ecology package. R package version 2.5-7; 2020. [accessed 2022 Jan 28]. https://CRAN.R-project.org/package=vegan.
- Oliver KR, McComb JA, Greene WK. Transposable elements: powerful contributors to angiosperm evolution and diversity. Genome Biol Evol. 2013:5(10):1886–1901. 10.1093/gbe/evt141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Opedal ØH, Falahati-Anbaran M, Albertsen E, Armbruster WS, Pérez-Barrales R, Stenøien HK, Pélabon C. Euglossine bees mediate only limited long-distance gene flow in a tropical vine. New Phytol. 2017:213(4):1898–1908. 10.1111/nph.14380. [DOI] [PubMed] [Google Scholar]
- Orr HA. The probability of parallel evolution. Evolution. 2005:59(1):216–220. 10.1111/j.0014-3820.2005.tb00907.x. [DOI] [PubMed] [Google Scholar]
- Paradis E, Schliep K. Ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019:35(3):526–528. 10.1093/bioinformatics/bty633. [DOI] [PubMed] [Google Scholar]
- Pedersen TL, Crameri F. scico: colour palettes based on the scientific colour-maps; 2023. [accessed 2024 May 7]. https://CRAN.R-project.org/package=scico.
- Peng F, Byers KJRP, Bradshaw HD Jr. Less is more: independent loss-of-function OCIMENE SYNTHASE alleles parallel pollination syndrome diversification in monkeyflowers (Mimulus). Am J Bot. 2017:104(7):1055–1059. 10.3732/ajb.1700104. [DOI] [PubMed] [Google Scholar]
- Pertea G, Pertea M. GFF Utilities: GffRead and GffCompare. F1000Res. 2020(9):304. 10.12688/f1000research.23297.2. [DOI] [PMC free article] [PubMed]
- Petrén H, Köllner TG, Junker RR. Quantifying chemodiversity considering biochemical and structural properties of compounds with the R package chemodiv. New Phytol. 2023:237(6):2478–2492. 10.1111/nph.18685. [DOI] [PubMed] [Google Scholar]
- Pichersky E, Raguso RA. Why do plants produce so many terpenoid compounds? New Phytol. 2018:220(3):692–702. 10.1111/nph.14178. [DOI] [PubMed] [Google Scholar]
- Powers J, Eisen K, Campbell D, Raguso R. bouquet: filter and augment biological GC-MS data; 2023. [accessed 2023 Sept 19]. https://github.com/jmpowers/bouquet.
- Raguso RA. Plant evolution: repeated loss of floral scent—a path of least resistance? Curr Biol. 2016:26(24):R1282–R1285. 10.1016/j.cub.2016.10.058. [DOI] [PubMed] [Google Scholar]
- Ramos SE, Schiestl FP. Rapid plant evolution driven by the interaction of pollination and herbivory. Science. 2019:364(6436):193–196. 10.1126/science.aav6962. [DOI] [PubMed] [Google Scholar]
- R Core Team . R: a language and environment for statistical computing; 2023. [accessed 2024 May 7]. http://www.R-project.org.
- Revell LJ. Phytools: an R package for phylogenetic comparative biology (and other things). Methods Ecol Evol. 2012:3(2):217–223. 10.1111/j.2041-210X.2011.00169.x. [DOI] [Google Scholar]
- Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP. Integrative genomics viewer. Nat Biotechnol. 2011:29( 1): 24.– . 10.1038/nbt.1754. [DOI] [Google Scholar]
- Robinson MD, McCarthy DJ, Smyth GK. Edger: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010:26(1):139–140. 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010:11(3):R25. 10.1186/gb-2010-11-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rosas-Guerrero V, Aguilar R, Martén-Rodríguez S, Ashworth L, Lopezaraiza-Mikel M, Bastida JM, Quesada M. A quantitative review of pollination syndromes: do floral traits predict effective pollinators? Ecol Lett. 2014:17(3):388–400. 10.1111/ele.12224. [DOI] [PubMed] [Google Scholar]
- Sackton TB, Clark N. Convergent evolution in the genomics era: new insights and directions. Philos Trans R Soc Lond B Biol Sci. 2019:374(1777):20190102. 10.1098/rstb.2019.0102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sas C, Müller F, Kappel C, Kent TV, Wright SI, Hilker M, Lenhard M. Repeated inactivation of the first committed enzyme underlies the loss of benzaldehyde emission after the selfing transition in Capsella. Curr Biol. 2016:26(24):3313–3319. 10.1016/j.cub.2016.10.026. [DOI] [PubMed] [Google Scholar]
- Schemske DW, Bradshaw HD Jr. Pollinator preference and the evolution of floral traits in monkeyflowers (Mimulus). Proc Natl Acad Sci U S A. 1999:96(21):11910–11915. 10.1073/pnas.96.21.11910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shumate A, Salzberg SL. Liftoff: accurate mapping of gene annotations. Bioinformatics. 2021:37(12):1639–1643. 10.1093/bioinformatics/btaa1016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sifuentes-Romero I, Aviles AM, Carter JL, Chan-Pong A, Clarke A, Crotty P, Engstrom D, Meka P, Perez A, Perez R, et al. Trait loss in evolution: what cavefish have taught us about mechanisms underlying eye regression. Integr Comp Biol. 2023:63(2):393–406. 10.1093/icb/icad032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015:31(19):3210–3212. 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
- Slater GSC, Birney E. Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics. 2005:6(1):31. 10.1186/1471-2105-6-31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smid M, Coebergh van den Braak RRJ, van de Werken HJG, van Riet J, van Galen A, de Weerd V, van der Vlugt-Daane M, Bril SI, Lalmahomed ZS, Kloosterman WP, et al. Gene length corrected trimmed mean of M-values (GeTMM) processing of RNA-seq data performs similarly in intersample analyses while improving intrasample comparisons. BMC Bioinformatics. 2018:19(1):236. 10.1186/s12859-018-2246-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stanke M, Diekhans M, Baertsch R, Haussler D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics. 2008:24(5):637–644. 10.1093/bioinformatics/btn013. [DOI] [PubMed] [Google Scholar]
- Stanke M, Schöffmann O, Morgenstern B, Waack S. Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources. BMC Bioinformatics. 2006:7(1). 10.1186/1471-2105-7-62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tarailo-Graovac M, Chen N. Using RepeatMasker to Identify Repetitive Elements in Genomic Sequences. Curr Protoc Bioinform. 2009:25(1). 10.1002/0471250953.bi0410s25. [DOI] [PubMed] [Google Scholar]
- Tarazona S, Furió-Tarí P, Turrà D, Pietro AD, Nueda MJ, Ferrer A, Conesa A. Data quality aware analysis of differential expression in RNA-seq with NOISeq R/Bioc package. Nucleic Acids Res. 2015:43(21):e140. 10.1093/nar/gkv711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tarazona S, García-Alcalde F, Dopazo J, Ferrer A, Conesa A. Differential expression in RNA-seq: a matter of depth. Genome Res. 2011:21(12):2213–2223. 10.1101/gr.124321.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Theis N, Adler LS. Advertising to the enemy: enhanced floral fragrance increases beetle attraction and reduces plant reproduction. Ecology. 2012:93(2):430–435. 10.1890/11-0825.1. [DOI] [PubMed] [Google Scholar]
- Tholl D. Terpene synthases and the regulation, diversity and biological roles of terpene metabolism. Curr Opin Plant Biol. 2006:9(3):297–304. 10.1016/j.pbi.2006.03.014. [DOI] [PubMed] [Google Scholar]
- Thomson JD, Wilson P. Explaining evolutionary shifts between bee and hummingbird pollination: convergence, divergence, and directionality. Int J Plant Sci. 2008:169(1):23–38. 10.1086/523361. [DOI] [Google Scholar]
- Tripp EA, Manos PS. Is floral specialization an evolutionary dead-end? Pollination system transitions in Ruellia (Acanthaceae). Evolution. 2008:62(7):1712–1737. 10.1111/j.1558-5646.2008.00398.x. [DOI] [PubMed] [Google Scholar]
- Tsuchimatsu T, Fujii S. The selfing syndrome and beyond: diverse evolutionary consequences of mating system transitions in plants. Philos Trans R Soc Lond B Biol Sci. 2022:377(1855):20200510. 10.1098/rstb.2020.0510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Uckele KA, Vargas OM, Kay KM. Prezygotic barriers effectively limit hybridization in a rapid evolutionary radiation. New Phytol. 2024:244(6):2548–2560. 10.1111/nph.20187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Valderrama E, Landis JB, Skinner D, Maas PJM, Maas-van de Kramer H, André T, Grunder N, Sass C, Pinilla-Vargas M, Guan CJ, et al. The genetic mechanisms underlying the convergent evolution of pollination syndromes in the Neotropical radiation of Costus L. Front Plant Sci. 2022:13:874322. 10.3389/fpls.2022.874322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vargas OM, Goldston B, Grossenbacher DL, Kay KM. Patterns of speciation are similar across mountainous and lowland regions for a Neotropical plant radiation (Costaceae: Costus). Evolution. 2020:74(12):2644–2661. 10.1111/evo.14108. [DOI] [PubMed] [Google Scholar]
- Vizueta J, Sánchez-Gracia A, Rozas J. BITACORA: A comprehensive tool for the identification and annotation of gene families in genome assemblies. Mol Ecol Resour. 2020:20(5):1445–1452. 10.1111/1755-0998.13202. [DOI] [PubMed] [Google Scholar]
- Warnes GR, Bolker B, Lodewijk B, Gentleman R, Huber W, Liam A, Lumley T, Maechler M, Magnusson A, Moeller S, et al. gplots: various R programming tools for plotting data; 2024. [accessed 2024 Dec 11]. https://CRAN.R-project.org/package=gplots.
- Wei T, Simko V. corrplot: visualization of a correlation matrix; 2021. [accessed 2022 Mar 23]. https://github.com/taiyun/corrplot.
- Wessinger CA. How the switch to hummingbird pollination has greatly contributed to our understanding of evolutionary processes. New Phytol. 2024:241(1):59–64. 10.1111/nph.19335. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wickham H. Reshaping data with the reshape package. J Stat Softw. 2007:21(12):1–20. 10.18637/jss.v021.i12. [DOI] [Google Scholar]
- Wickham H. ggplot2: elegant graphics for data analysis. New York: Springer-Verlag; 2009. https://ggplot2.tidyverse.org/. [Google Scholar]
- Wickham H, François R, Henry L, Müller K. dplyr: a grammar of data manipulation. R package version 1.0.7; 2021. [accessed 2022 Jan 28]. https://CRAN.R-project.org/package=dplyr.
- Wickham H, Hester J, Bryan J. readr: read rectangular text data; 2023. [accessed 2023 May 17]. https://CRAN.R-project.org/package=readr.
- Wikelski M, Moxley J, Eaton-Mordas A, López-Uribe MM, Holland R, Moskowitz D, Roubik DW, Kays R. Large-range movements of Neotropical orchid bees observed via radio telemetry. PLoS One. 2010:5(5):e10738. 10.1371/journal.pone.0010738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilke CO. cowplot: streamlined plot theme and plot annotations for “ggplot2”; 2020. [accessed 2022 Jan 28]. https://CRAN.R-project.org/package=cowplot.
- Workman R, Fedak R, Kilburn D, Hao S, Liu K, Timp W. High molecular weight DNA extraction from recalcitrant plant species for third generation sequencing; 2019. [accessed 2024 May 3]. https://www.protocols.io/view/high-molecular-weight-dna-extraction-from-recalcit-4vbgw2n.
- Woźniak NJ, Sartori K, Kappel C, Tran TC, Zhao L, Erban A, Gallinger J, Fehrle I, Jantzen F, Orsucci M, et al. Convergence and molecular evolution of floral fragrance after independent transitions to self–fertilization. Curr Biol. 2024:34(12):2702–2711.e6. 10.1016/j.cub.2024.04.063. [DOI] [PubMed] [Google Scholar]
- Xie Y. knitr: a general-purpose package for dynamic report generation in R; 2023. [accessed 2023 May 17]. https://yihui.org/knitr/.
- Xu S. ggstar: multiple geometric shape point layer for “ggplot2”; 2022. [accessed 2023 Sept 12]. https://CRAN.R-project.org/package=ggstar.
- Xu S, Dai Z, Guo P, Fu X, Liu S, Zhou L, Tang W, Feng T, Chen M, Zhan L, et al. ggtreeExtra: compact visualization of richly annotated phylogenetic data. Mol Biol Evol. 2021:38(9):4039–4042. 10.1093/molbev/msab166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yost JM, Kay KM. The evolution of postpollination reproductive isolation in Costus. Sex Plant Reprod. 2009:22(4):247–255. 10.1007/s00497-009-0113-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu G. Data integration, manipulation and visualization of phylogenetic trees. New York: Chapman & Hal; 2022. https://www.routledge.com/Data-Integration-Manipulation-and-Visualization-of-Phylogenetic-Trees/Yu/p/book/9781032233574. [Google Scholar]
- Yu G, Smith DK, Zhu H, Guan Y, Lam TT-Y. Ggtree: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol Evol. 2017:8(1):28–36. 10.1111/2041-210X.12628. [DOI] [Google Scholar]
- Yu Z, Zhao C, Zhang G, Teixeira da Silva JA, Duan J. Genome-Wide Identification and Expression Profile of TPS Gene Family in Dendrobium officinale and the Role of DoTPS10 in Linalool Biosynthesis. Int J Mol Sci. 2020:21(15):5419. 10.3390/ijms21155419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zdobnov EM, Kuznetsov D, Tegenfeldt F, Manni M, Berkeley M, Kriventseva EV. OrthoDB in 2020: evolutionary and functional annotations of orthologs. Nucleic Acids Res. 2021:49(D1):D389–D393. 10.1093/nar/gkaa1009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou Y, Abbas F, He J, Yan F, Wang Q, Yu Y, Yu R, Fan Y. Floral volatile chemical diversity in Hedychium F1 hybrid population. Ind Crops Prod. 2022:184:115032. 10.1016/j.indcrop.2022.115032. [DOI] [Google Scholar]
- Zu P, Schiestl FP, Gervasi D, Li X, Runcie D, Guillaume F. Floral signals evolve in a predictable way under artificial and pollinator selection in Brassica rapa. BMC Evol Biol. 2020:20(1):127. 10.1186/s12862-020-01692-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zytnicki M. mmquant: how to count multi-mapping reads? BMC Bioinformatics. 2017:18(1):411. 10.1186/s12859-017-1816-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data and R scripts used for analysis are available from Open Science Framework: https://osf.io/2ap4k/. The raw sequence reads and final genome assembly for Costus allenii are deposited in the SRA (BioProject PRJNA1091680). The raw RNA-seq data are deposited in the SRA (BioProject PRJNA1010186).




