Skip to main content
Systematic Biology logoLink to Systematic Biology
. 2024 Nov 1;74(1):141–157. doi: 10.1093/sysbio/syae061

Rapid Evolution of Host Repertoire and Geographic Range in a Young and Diverse Genus of Montane Butterflies

Shifang Mo 1,#, Yaowei Zhu 2,#, Mariana P Braga 3,#, David J Lohman 4,5,6, Sören Nylin 7, Ashraf Moumou 8, Christopher W Wheat 9, Niklas Wahlberg 10, Min Wang 11, Fangzhou Ma 12,, Peng Zhang 13,, Houshuai Wang 14,
Editor: Daniele Silvestro
PMCID: PMC11809587  PMID: 39484941

Abstract

Evolutionary changes in geographic distribution and larval host plants may promote the rapid diversification of montane insects, but this scenario has been rarely investigated. We studied the rapid radiation of the butterfly genus Colias, which has diversified in mountain ecosystems in Eurasia, Africa, and the Americas. Based on a data set of 150 nuclear protein-coding genetic loci and mitochondrial genomes, we constructed a time-calibrated phylogenetic tree of Colias species with broad taxon sampling. We then inferred their ancestral geographic ranges, historical diversification rates, and the evolution of host use. We found that the most recent common ancestor of Colias was likely geographically widespread and originated ~3.5 Ma. The group subsequently diversified in different regions across the world, often in tandem with geographic expansion events. No aspect of elevation was found to have a direct effect on diversification. The genus underwent a burst of diversification soon after the divergence of the Neotropical lineage, followed by an exponential decline in diversification rate toward the present. The ancestral host repertoire included the legume genera Astragalus and Trifolium but later expanded to include a wide range of Fabaceae genera and plants in more distantly related families, punctuated with periods of host range expansion and contraction. We suggest that the widespread distribution of the ancestor of all extant Colias lineages set the stage for diversification by isolation of populations that locally adapted to the various different environments they encountered, including different host plants. In this scenario, elevation is not the main driver but might have accelerated diversification by isolating populations.

Keywords: Biogeography, host use, montane species, rapid diversification, target capture


Mountain ecosystems are home to one-third of terrestrial species (Spehn et al. 2011), with numerous rapidly radiating montane taxa (hereafter called RRMT) such as the American legume Astragalus (Scherson et al. 2008), Oreocharis (Gesneriaceae) in the Hengduan Mountains (Kong et al. 2022), and Andean Scytalopus birds (Cadena et al. 2020). It is thought that many montane radiations evolved recently (Linder 2008). Thus, studying RRMT could inform a global conundrum: What causes the rich diversity of montane species (Rahbek et al. 2019a, 2019b)? Research on RRMT has attracted the attention of an increasing number of researchers (Fjeldså et al. 2012; Hughes and Atchison 2015), but plants (Lagomarsino et al. 2016) and vertebrates (Jetz et al. 2012) have received disproportionate attention. Although some studies have been focused on understanding the diversification of montane insects (Casner and Pyrcz 2010; Ye et al. 2016; Chazot et al. 2019b; He et al. 2023; de Moraes Magaldi et al. 2024; Wang et al. 2024), a consensus has still not been reached on what abiotic and biotic factors have determined rapid global montane radiation on insects.

Changes in host repertoire (i.e., the set of hosts used by a parasite, including herbivorous plant parasites), especially changes in the number of hosts used (often referred to as host range or host breadth) seem to have an important role in insect diversification (Braga et al. 2018, 2021; Braga and Janz 2021). One of the predominant hypotheses of host-associated diversification, the “oscillation hypothesis” proposed by Janz and Nylin (2008), suggests that expansions of the host repertoire may facilitate a larger geographic distribution, followed by local adaptation, host (re-) specialization, and speciation. In other words, the diversification of insects may be increased by evolutionary changes in host repertoire and geographic distribution. However, this scenario has been rarely tested, particularly in an RRMT.

The butterfly genus Colias is an ideal taxon for studying the rapid diversification of widely distributed montane insects. Most of the nearly 80 species in this genus are distributed in the mountains of Eurasia, the Americas, and Africa (Grieshuber et al. 2012; Wu and Hsu 2017), and it is also one of the fastest diversifying genera in the family Pieridae (Carvalho et al. 2024). A reliable phylogenetic framework of Colias is necessary to understand the rapid diversification of the genus and the role that host plants might have played. Most previous studies of this genus focused on descriptions of new taxa and taxonomic reviews based on morphological traits (Verhulst 2000; Grieshuber et al. 2012; Grieshuber 2014; Wu and Hsu 2017). Although some molecular phylogenetic studies have examined Colias evolution (Pollock et al. 1998; Wheat and Watt 2008; Laiho and Stahls 2013; Kir’yanov 2021; Tunström et al. 2023), phylogenetic relationships within this genus remain poorly resolved due to inadequate species sampling or insufficient molecular markers.

Molecular markers derived from next-generation sequencing (NGS) are an established tool for reconstructing robust phylogenetic hypotheses. Many reduced representation methods can be used with NGS (Nunes et al. 2022), including target capture, which is useful for reconstructing phylogenetic relationships in rapidly radiating taxa (Larridon et al. 2020; Thomas et al. 2021). In a solution of fragmented DNA, target capture traps genomic regions of interest by hybridization with specific oligonucleotide probes, allowing non-target genomic regions to be removed (Albert et al. 2007; Hodges et al. 2007; Gnirke et al. 2009; Mamanova et al. 2010). Because the method needs relatively small amounts of DNA (Faircloth et al. 2015), it is suitable for small insect species including those with poor DNA quality represented only by specimens in museum collections (Lemmon and Lemmon 2013; Faircloth et al. 2015; Glenn and Faircloth 2016; Nunes et al. 2022). In recent years, several Lepidoptera probe sets have been developed (Breinholt et al. 2018; Zhang et al. 2019; Tian et al. 2023) for resolving the deep relationships among families (Zhang et al. 2019; Kawahara et al. 2023) and inferring phylogenetic relationships below the genus level (Kawahara et al. 2018; Ma et al. 2020).

In this study, we employed target capture with the probe sets of Zhang et al. (2019) and Tian et al. (2023) to sequence 150 nuclear gene loci and mitochondrial genomes to reconstruct the evolutionary history of Colias. Then, the divergence time of the genus was estimated using secondary calibration points. This time-calibrated molecular phylogeny was used to reconstruct historical features of this genus including its ancestral distribution, diversification rates, biogeographic history, and the evolution of host plant use. We also inferred species trees and species networks to explore putative hybridization events within the major clades of Colias. Combining our phylogenetic results with a Bayesian method for ancestral character reconstruction of host plant breadth, we infer that host repertoire evolution accelerated the rapid diversification of the genus Colias butterflies.

Materials And Methods

Taxon Sampling, DNA Extraction, and Library Preparation

We sequenced 74 samples of 66 Colias species (Supplementary Table S1). This taxon sampling includes 84% of valid Colias species with representatives from each geographic region inhabited by the genus. With the exception of C. ponteni, which is morphologically distinctive but presumed extinct, we sequenced specimens of each morphotype; the missing taxa are unlikely to have a significant impact on the topology of phylogenetic relationships (Supplementary Table S2). Zerene cesonia, in the genus most closely related to Colias (Kawahara et al. 2023), was used as an outgroup taxon. The complete genome data of Z. cesonia (Rodriguez-Caro et al. 2020; GenBank assembly accession: GCA_012273895.2) was downloaded, and all relevant sequences of this species were extracted from its genome.

Genomic DNA was extracted from the legs, thoracic, or abdominal tissues of each sample using the TIANamp Genomic DNA Kit (Tiangen, Guangzhou, China), and the extracted DNA was stored at −20 °C. DNA concentration was quantitated with a NanoDrop™ spectrophotometer. A volume of 100 ng of genomic DNA was sheared using a Scientz18-A ultrasonic processor (SCIENTZ, Zhejiang Province, China) to a size of 300–550 bp. We prepared libraries with the NEBNext Ultra DNA Library Prep Kit (New England Biolabs, Ipswich, MA, USA). Each library was labeled with a unique 7-bp P7 index sequence. Four or 5 libraries were mixed into a pooled library in equal concentrations for subsequent hybrid capture, and a total of 15 pooled libraries were generated.

Probe Preparation, Hybridization, and Sequencing

We used the target capture strategy described by Zhang et al. (2019) and Tian et al. (2023) to collect genetic data. We targeted 150 nuclear protein-coding (NPC) loci of Lepidoptera. The PCR primers for these 150 NPC loci are provided in Supplementary Table S3. We prepared amplicon capture baits for our target capture experiment following the method of Zhang et al. (2019). In brief, we mixed 20 ng of genomic DNA from each of the 74 butterfly samples and this pooled DNA was then used as a template to amplify the 150 NPC loci via polymerase chain interaction (PCR). The 150 PCR products were then mixed together and purified with AMPure XP beads (Beckman Coulter, Brea, CA, USA). The purified amplicon mixture was subjected to end-repair and A-tailing, and then ligated with a biotinylated Bio-T adapter (Zhang et al. 2019). The adapter-ligated amplicons were purified with AMPure XP beads and subsequently immobilized on Dynabeads MyOne streptavidin magnetic beads (Life Technologies, Carlsbad, CA, USA) to obtain bait-coated beads. Detailed experimental protocols are provided by Zhang et al. (2019).

For each hybridization reaction, 500 ng of pooled library and 2.5 μl of bait-coated beads (containing 25 ng of biotinylated amplicons) were used. A touch-down hybridization program was adopted. After denaturation, the hybridization started at 65 °C, decreased by 5 °C every 6 h, and ended at 45 °C, for a total duration of 30 h. The enriched libraries were amplified using Illumina P5 and P7 primers and purified with AMPure XP beads. Finally, all 15 enriched libraries were pooled in equal concentrations and sequenced on an Illumina HiSeqX10 sequencer in the paired-end 150-bp mode. In addition, we repeated the hybridization and sequencing of samples with unsatisfactory initial sequencing results.

Data Processing and Data Set Assembly

Bases were identified from raw image files with bcl2fastq2 (https://emea.support.illumina.com/downloads/bcl2fastq-conversion-software-v2-20.html) to generate raw reads. These sequences were then subjected to quality control and filtering by removing adapter sequences and low-quality reads with more than 5% N bases. Reads were then de-multiplexed and screened with Trimmomatic v0.33 (Bolger et al. 2014), FastQC v0.11.9 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/), and FastUniq v1.1 (Xu et al. 2012). MitoFinder v1.4 (Allio et al. 2020) was used to assemble mitochondrial genomes and each of the 150 targeted nuclear loci using the cleaned reads and the assembled outgroup taxon genome sequences (Rodriguez-Caro et al. 2020). Cd-hit-est v4.0 (Li and Godzik 2006) was used to further filter contigs to reduce redundancy. Then, SAMtools v1.4.1 (Li et al. 2009) was used to calculate the sequencing depth, and sequences of target loci with an average sequencing depth ≥ 5× were retained for further analysis. tBLASTN v2.2.18 (Altschul et al. 1990) was used to identify orthologous sequences from the contigs of each locus in each sample; sequences of our 150 target loci were retrieved from the Danaus plexippus genome as references (GenBank assembly accession: GCA_009731565.1). Potential chimeric sequences were identified by homology comparison and removed. Flanking sequences were identified using reference sequences. The sequences of each targeted locus were then obtained by taking the consensus sequence of each contig of sequencing reads after all quality control steps. MAFFT v7.0.1 (Katoh and Standley 2013) was used to align reads by comparing them to the 150 reference sequences that were used to design the probe set, and the process was optimized using Gblock v0.91B (Castresana 2000). Sequences corresponding to abnormally long branches were deleted. Two phylogenomic data sets were finally made for our samples: 1 was a nuDNA data set of 150 NPC loci, and the other was a mtDNA data set of mitochondrial genomes.

Phylogenetic Analysis

The reading frame of each nuclear and mitochondrial locus was determined using Exonerate v2.2.0 (Slater and Birney 2005). The data sets were partitioned by gene and codon positions, except that the non-protein coding mitochondrial 16S rRNA, 12S rRNA, and tRNA genes were each considered to be single, independent partitions. For each of the nuDNA and mtDNA data sets, the maximum likelihood (ML) gene tree was inferred with IQ-TREE v2.2.0 (Minh et al. 2020a). The best model and partition scheme were calculated using the ModelFinder (Kalyaanamoorthy et al. 2017) in IQ-TREE, with the predefined codon partitions Model Finder Plus (MFP) with the MERGE option. The options (-allnni -nstop 500 -pers 0.5) were employed to achieve a more comprehensive tree search. Node supports were assessed by performing the standard bootstrap test with 500 replicates (-b 500), SH-like approximate likelihood ratio test (SH-aLRT) with 5000 replicates (-alrt 5000), and ultrafast bootstrapping with 5000 replicates (-B 5000). ML support values are regarded to be high when bootstrap support (BS) is ≥ 70% (Hillis and Bull 1993), ultrafast bootstrap values are ≥ 95% (Minh et al. 2013), and SH-aLRT is ≥ 80% (Guindon et al. 2010).

The species trees of nuDNA were inferred using 2 different schemes. For the first scheme, all 150 nuDNA loci were separately used to infer the unrooted gene trees in IQ-TREE v2.0.5 (Minh et al. 2020b). The species tree was reconstructed from the 150 unrooted gene trees in Asteroid v1.0, a method to infer species trees from gene trees using data sets with substantial missing loci (Morel et al. 2023). Bootstrap values were computed from the input gene trees with 50,000 replicates. For the second scheme, a strict filtering criterion applied to the nuDNA data set resulted in a data set of 32 single-gene nuclear loci with all samples, which were used to infer gene trees in IQ-TREE v2.0.5. The species tree was built using the gene trees by ASTRAL-III v5.6.1 (Zhang et al. 2018) with posterior probabilities of local quartet topology (LPP), and the concordance factors (gene concordance factor/sequence concordance factor, Minh et al. 2020a) were calculated comparing the trees by IQ-TREE v2.0.5. With regard to the mtDNA species tree, 13 protein-coding and 2 tRNA loci were selected to infer the gene trees using RAXML v8.0 (Stamatakis 2014) with 200 bootstrap replicates, as the tRNA sequences were too short for gene tree reconstruction. The final species tree of mtDNA was obtained for these gene trees with ASTRAL-III v5.6.1.

Hybridization Inference

Preliminary analyses of our genetic data suggested that introgression between Colias lineages was common. This causes portions of the genomes of two lineages to be similar because of hybridization, not from descent from a recent common ancestor. Species Networks applying Quartets (SNaQ; Solís-Lemus and Ané 2016), implemented in PhyloNetworks (Solís-Lemus et al. 2017), was used to infer the phylogenetic network of each nuclear locus in a maximum pseudolikelihood framework. This method has been recently applied to phylogenetic network analysis of data sets with a large amount of missing data (Blair et al. 2019; Obiol et al. 2021; Calderón-Acevedo et al. 2022). The analysis time of this method increases drastically when there are more than 25 terminal taxa in the data set (Hejase and Liu 2016). To circumvent this limitation, the full data set of 66 unique Colias species was split into 5 data subsets. Each included the outgroup Zerene cesonia and one sample of each species in each of the other major clades in the species tree inferred with Asteroid v1.0. This method can identify hybridization events within each clade, but not between clades (except for hybridization between those taxa and the outgroup). Because the largest clade had 24 species, it was not possible to analyze 2 clades in a single analysis as most of the data sets would surpass the limit of 25 taxa. For each of the 5 data subsets, SNaQ was run with a set of gene trees inferred with IQ-TREE and a starting species tree generated with Asteroid v1.0. For each of the 5 clades, we specified the maximum number of hybridization events (hmax) in SNaQ using values ranging from 0 to 5. The pseudo-likelihoods for each hmax were plotted to select the most likely hmax for each clade. All 5 phylogenetic networks were reassembled into a single tree with the ape R package (Paradis and Schliep 2019).

Divergence Time Estimation

The nuDNA species tree inferred with Asteroid v1.0 was used as the fixed topology for estimating divergence times. In the absence of any fossils that can confidently be assigned to the subfamily Coliadinae (de Jong 2017), we used secondary calibrations to date the tree. The divergence time of each node was estimated with the MCMCTree program in the PAML package (Yang 2007). Two calibration points from previous studies were used: one is 12.13 Ma (10.6–15.71 Ma) for the common ancestor of Colias + Zerene from Kawahara et al. (2023) and the crown age of Colias, which was set as a range based on the minimum and maximum divergence dates from 2 different studies. Ranging from 1.83 Ma (Tunström et al. 2023; Colias crown age estimate 1.83-3.02 Ma) to 6.4 Ma (Carvalho et al. 2024; Colias crown age estimate 4.21–6.4 Ma), which covers the minimum and maximum crown ages of Colias estimated in recent investigations. We adopted the lognormal relaxed clock and General Time Reversible (GTR) + Gamma (G) model with other parameters as defaults. The analysis was run for 15 million generations, sampled every 150 generations with 1.5 million generations discarded as burn-in. The convergence was evaluated in Tracer v1.7 (Rambaut et al. 2018), with effective sample sizes larger than 200 for all parameters.

Ancestral Area Reconstruction

To reconstruct the biogeographic history of the genus Colias, we compiled distribution data for 79 Colias species from the literature (Supplementary Table 4). We then assigned the 66 Colias species in the time-calibrated tree (excluding only Colias hecla viluiensis) to one or a combination of 7 biogeographic regions: Neotropics, Nearctic, Southeast Asia + India, East Palearctic, West Palearctic, Afrotropics, and Pan-Tibetan Highlands. We performed Bayesian inference of historical biogeography using the Multi-feature Feature-Informed GeoSSE (MultiFIG) model (Swiston and Landis 2023). This model allows the estimation of biogeographic processes such as dispersal, extinction, and speciation, in addition to spatial and environmental features of regions that are expected to impact one or several of these processes. We first delimited polygons representing each biogeographic region (following Chazot et al. 2021; Liu and Zhu 2022; Liu et al. 2022). Then, for each region, we retrieved geographic features related to surface area, altitude, and distance to other regions, using the R package regionfeatures (https://bitbucket.org/sswiston/regionfeatures). We extracted data for 8 regional features: quantitative area, categorical area (above/below mean of all regions), quantitative distance (mean distance between a randomly selected point in the first region and a randomly selected point in the second region), categorical distance (adjacent/non-adjacent), quantitative altitude (mean altitude), categorical altitude (above/below mean), quantitative altitudinal difference (difference in mean altitude), and categorical altitudinal difference (whether regions share highland/lowland status or not).

We performed an analysis in RevBayes (Höhna et al. 2016) to infer base rates of dispersal, extinction, within and between-region speciation, the feature effect parameters, and ancestral biogeographical ranges. To tune hyperparameters, we ran 2 independent runs of 50,000 generations, sampling parameters and node histories every 100 cycles, and discarding the first 10% as burn-in. To verify that Markov chain Monte Carlo (MCMC) analyses converged to the same posterior distribution, we applied the Gelman diagnostic (Gelman and Rubin 1992) as implemented in the R package coda (Plummer et al. 2006), with a threshold of 1.1 to confirm convergence. Results presented here are drawn from 1 MCMC run.

Diversification Rate Analysis

To estimate palaeodiversity dynamics and test for heterogeneity of diversification rates across clades within Colias, we used the clade-shift model of Morlon et al. (2011) as implemented in the R package RPANDA (Morlon et al. 2016; Mazet et al. 2023). We tested diversification shifts at the crown age of 3 clades determined by their geographic distribution: (i) the ancestor of the 7 Neotropical species included in our tree; (ii) the ancestor of the sister group to the Neotropical clade, which includes all species in the Northern Hemisphere; and (iii) the ancestor of all species that occur in the Nearctic region. We used the compiled data set of the distribution of 79 Colias species (Supplementary Table S4) as a reference for calculating the sampling fraction for each clade. By doing this, we assumed that unsampled species that occur anywhere but the Neotropics are in Clade B (the non-South American clade) and that unsampled species that occur in the Nearctic are within the Nearctic subclade.

After identifying the best combination of shifts, birth-death models, and parameter values, we tested model adequacy by simulating 3351 trees (from 10,000 attempts) under the best-fit model combination. Following Mazet et al. (2023), we compared simulated trees to the empirical tree in terms of number of lineages through time, species richness per clade, and average leaf depth index, which is a measure of tree imbalance.

Even though this analysis was informed by geographical distribution (for clade selection), diversification rates estimated by the clade-shift model and MultiFIG are not directly comparable because these methods model different types of rate heterogeneity. MultiFIG estimates diversification rates associated with each geographical range and whether spatial and environmental features can explain differences in rates between ranges. Conversely, the clade-shift model infers diversification rates change over time based only on the phylogenetic tree.

Additionally, we tested for elevation-dependent speciation using ES-sim, a simulation-based test that uses a tip-specific metric of speciation rate derived from branch lengths. This approach is similar to the quantitative state speciation and extinction (QuaSSE) method in terms of statistical power to detect trait-dependent diversification and is more robust against false positives (Harvey and Rabosky, 2018). We implemented ES-sim using the R script available at https://github.com/mgharvey/ES-sim, using elevation data for each Colias species (Supplementary Table S4, Supplementary Fig. S1).

Host Repertoire Evolution

To infer the evolutionary history of associations between Colias butterflies and their host plants, we performed Bayesian inference under the 2-state model of host repertoire evolution proposed by Braga et al. (2020). Given a phylogenetic tree for each interacting clade and a matrix containing information about which ecological interactions have been observed between extant lineages, this method models the gains and losses of specific host taxa along the symbiont phylogeny (in our case, the butterflies) that resulted in the observed interactions. The host phylogeny is used to infer whether hosts that are closely related to other hosts already consumed by the species are more easily gained compared to more distantly related hosts. Preliminary analyses demonstrated that phylogenetic distance between host plant genera does not affect host gain probabilities in Colias. Thus, the analysis presented here does not include the host plant phylogeny.

We used information gathered from the literature to compile an interaction matrix between 55 Colias species and 62 host plant genera (Supplementary Table S5). For the butterflies, we used the reconstructed time-calibrated Colias tree and kept only the 55 species with host use data. The joint posterior distribution of model parameters and evolutionary histories were estimated using the inference strategy described in Braga et al. (2020) as implemented in RevBayes (Höhna et al. 2016). We ran 3 independent MCMC analyses for 105 cycles, sampling every 100 cycles. The first 20–50% of samples were discarded as burn-in; burn-in was determined by visual inspection of ESS and likelihood values in Tracer (Rambaut et al. 2018); and then 500 random samples were taken from each independent run. We used the implementation of the Gelman diagnostic (Gelman and Rubin 1992) in the R package coda (Plummer et al. 2006) to verify that MCMC analyses converged to the same posterior distribution. We present results from all MCMC analyses combined.

The posterior distribution of ancestral interactions was summarized in 2 ways using the R package evolnets (www.github.com/maribraga/evolnets): (i) interactions with at least 90% posterior probability at the internal nodes of the butterfly tree; and (ii) weighted summary networks at 3, 2, and 1 Ma where interactions with at least 70% posterior probability are included. This probability was used as a weight for the link between nodes. To facilitate visualization and evaluate whether interactions with groups of hosts tend to evolve together, we identified modules in the extant network and the 3 ancestral networks. We then validated the ancestral modules by calculating how often 2 nodes were placed in the same module across 100 networks sampled during MCMC. For the extant network, in addition to modularity (Beckett 2016) we also calculated nestedness using the Nestedness metric based on Overlap and Decreasing Fill (NODF; Almeida‐Neto et al. 2008; Almeida-Neto and Ulrich 2011) and compared it to 100 networks generated by a null model that maintains the number of interactions in the observed network. This comparison was done by calculating the z-score, which quantifies the position of the observed metric within the null distribution in standard units, revealing how much it deviates from the null expectation (Ulrich et al. 2009). All data, RevBayes and R scripts for the analyses of historical biogeography, clade-shift diversification, and host repertoire evolution are available at www.github.com/maribraga/Colias_hostrep and the Dryad repository https://doi.org/10.5061/dryad.bk3j9kdkz.

Results

Sequence Capture and Data Matrix Statistics

The success of hybrid capture and Illumina sequencing, including attributes of the 150 nuclear loci we targeted, is summarized in Supplementary Table S6. A total of 235,881,658 clean paired-end reads were obtained; the capture success of each sample is summarized in Supplementary Table S7. The number of clean reads per sample ranged from 582,057 to 11,680,568, with an average of 3,187,590 reads per sample. In comparison with the reference sequences of the targeted loci, the percentage of missing sequences for each sample ranged from 2.92% to 74.86%, with an average of 20.48%. The read‐to‐target mapping percentage (on‐target) of the samples with secondary hybridization was significantly higher than those without secondary hybridization. The average sequence coverage of the orthologous contigs was 397×, with a wide range from 1 to 7419× (Supplementary Table S8). The general features of the mitochondrial genome data set are summarized in Supplementary Table S9. Two concatenated matrices were used for phylogenetic analysis: the 150 nuDNA locus data set was 179,022 bp in length and the mitochondrial genome data set was 12,604 bp in length. GenBank accession numbers of the newly sequenced markers were summarized in Supplementary Table S10.

Phylogenetic Analyses

Two characteristics can be seen in all trees: (i) 2 main clades of Colias, hereafter called Clade A and Clade B, were recovered with strong support; and (ii) some species represented by multiple samples, including C. croceus, C. erate, and C. hecla, were not recovered as monophyletic.

The Asteroid species tree of nuDNA (Fig. 1) demonstrates that the genus Colias is monophyletic with full support (BS = 100). The monophyly of Clade A is fully supported (BS = 100). Within this clade, the sister to the remaining South American taxa is C. dimera, which is distributed in northern South America and has a unique male wing color. The monophyly of Clade B is also fully supported (BS = 100), and, while intermediate nodes lack support, stronger node support is found among numerous closely related taxa. Some species pairs were recovered as monophyletic with strong bootstrap values, including C. fieldii and C. tibetana; C. aurorina and C. chlorocoma; and C. diva and C. heos. The highly supported clades of the ASTRAL species tree (Supplementary Fig. S2) are almost identical to those of the Asteroid species tree (Fig. 1).

Figure 1.

Figure 1.

Comparison between 2 Colias trees with representative adult photos of Colias upper sides. Left) A species tree inferred with Asteroid using gene trees of 150 nuclear gene loci. BS is denoted at the node based on 50,000 replications. Right) Reassembled phylogenetic network inferred using 5 SNaQ analyses, one for each clade. The arrows indicate the direction of the hybridization. The numbers near the arrows represent γ values, indicating the proportion of the genome inherited via introgression; larger values of each pair are green. Arrows with solid lines indicate introgression within a clade, and arrows with dashed lines indicate hypothesized introgression between 2 different clades.

The topology of the nuDNA gene tree (Supplementary Fig. S3) differs from the nuDNA species trees (Fig. 1 and Supplementary Fig. S2), but lineages with high support values are coincident among these trees. The mtDNA gene tree (Supplementary Fig. S4) had lower support for Clade A (BS = 68). All South American samples form a strongly supported monophyletic group (BS = 100), which is divided into 2 clades. One clade includes only C. vauthierii, which is found in Chile and Argentina south of the Andes and is restricted to high elevations; the other clade has short branches and is primarily distributed throughout the Andes. The monophyly of the non-South American Clade B is also strongly supported (BS = 100), but the internal branches differ greatly from the relationships in the nuDNA gene tree (Supplementary Fig. S3). The first-diverging clade is the widely distributed C. hyale clade, including C. hyale, C. alta, and C. alfacariensis. All remaining species are grouped into a monophyletic group (BS = 100) and diversified rapidly. The topology of the species tree inferred with 15 mitochondrial loci (Supplementary Fig. S5) is almost identical to the mtDNA gene tree (Supplementary Fig. S4).

Divergence Time Estimation

The genus Colias diverged from the common ancestor of Zerene + Colias in the mid-Miocene ~13.33 Ma (95% Confidence Intervals, CI: 10.63-15.74 Ma) (Supplementary Fig. S6). The common ancestor of the extant Colias species originated ~3.50 Ma (CI: 2.04–5.55 Ma). The crown ages of Clade A and Clade B are ~1.83 Ma (CI: 1.02–3.05 Ma) and ~3.07 Ma (CI: 1.80–4.89 Ma), respectively. Further diversification of this genus occurred at ~0.5–3 Ma.

Inference of Hybridization

The reassembled SNaQ phylogenetic network had a different topology than the Asteroid species tree, even though the topology of the species tree was the starting point for the SNaQ analysis (Fig. 1). Topological differences between the 2 trees occur in lineages with low BS (60% > BS), which result from discordant gene trees that likely result from hybridization. Although the node C. flaveola shares a highly supported node with its sister clade (100% > BS ≥ 90%) in the Asteroid tree, change of its phylogenetic position is found along with hybridization events in the SNaQ tree. Lineages including C. mossi, C. berylla, C. stoliczkana, C. wanda, C. arida, C. croceus, and C. interior have low BS and are inferred to have hybridized.

The 2 hybridization events in Clade 5 demonstrate the 2 types of inference that can result from our approach. The first type is introgression between taxa in 2 distinct lineages. For example, a hybridization event is inferred between C. lesbia and C. vauthierii, albeit with a non-significant value (γ ≈ 0.09). The other type of inference is introgression between a taxon and its ancestral lineage. For instance, Colias flaveola shares nearly half its genome (γ ≈ 0.42) with descendants of its ancestral lineage including C. mossi, C. euxanthe, and C. weberbaueri. In total, we infer 14 hybridization events. Dashed lines in the species network indicate inferred hybridization events between 2 lineages from inside and outside the clade, respectively. Because of computational limitations that led us to analyze each clade separately, we were unable to infer these putative hybridization events more precisely (Fig. 1).

Reconstruction of Ancestral Areas

The MultiFIG analysis estimated that the ancestral range of Colias included East and West Palearctic + Pan-Tibetan Highlands + Nearctic + Neotropics (posterior probability = 0.41, Fig. 2). The range with the second highest posterior probability was much less likely (0.08) and included all regions except for the Afrotropics. The 8 ranges with the highest posterior probability (which together sum to 0.808) included between 4 and all 7 regions (Supplementary Tables S11 and S12). More importantly, among the regions in the most likely ancestral range, the Pan-Tibetan Highlands are included in all 8 ranges, while all other regions are included in 7 of the 8 most likely ranges. Thus, we cannot estimate the specific ancestral range with confidence. The most likely scenario suggests a widespread origin, from which different clades diversified in different regions of the world. A few lineages underwent new expansion phases, colonizing, for example, the 2 remaining regions: Southeast Asia and the Afrotropics (Fig. 2).

Figure 2.

Figure 2.

Historical biogeography of the genus Colias. a) Ancestral range reconstruction done with MultiFIG. The size of circles at internal nodes represents the posterior probabilities for the inferred historical ranges that are indicated with color. Asterisks mark the locations where diversification shifts were tested. b) Delineation of the 7 regions used in this biogeographic analysis.

Of the 8 relationships between-region features (i.e., area, altitude, distance) and processes (i.e., speciation, extinction, dispersal) that we examined, 3 were estimated with high posterior probability when considering the joint probability of categorical and quantitative parameters (Supplementary Fig. S7). First, the model assigned a high probability (0.99) to the scenario where quantitative and/or categorical distance between regions negatively affects the dispersal rate, so that closer regions are more easily colonized. Therefore, we can conclude that distance influences dispersal even if we cannot confidently distinguish between the effects of distance (quantitative parameter) and adjacency (categorical parameter) between regions. Second, there was a high probability (0.95) that quantitative and/or categorical areas negatively affect the extinction rate, meaning that extinction is less likely in larger regions. Third, the model assigned a probability of 0.91 to the scenario where quantitative and/or categorical distance between regions negatively affects the between-region speciation rate. This negative relationship is probably explained by the higher likelihood that widespread ranges include adjacent regions. Variation in diversification rates among biogeographical ranges was mainly determined by range size, where between-region speciation was more likely in ranges including more regions (Supplementary Figs. S8 and S9). Finally, no feature related to elevation was inferred to have an effect on dispersal, speciation, or extinction (0.76 < P < 0.82).

Diversification Rate Analysis

Of the 4 possible combinations of shifts with a single backbone and the 5 combinations with multiple backbones, 2 were detected as the best combinations (ΔAICc = 0.21 between first and second-best models, Fig. 3 and Supplementary Fig. S10). The best combination includes a single shift in diversification regime at the base of Clade B while the second-best combination also includes a shift at the base of the clade of Nearctic Colias taxa. Both subclades are best explained by a model with an exponentially decreasing speciation rate and no extinction while the backbone (Clade A) is best explained by a model of diversification with constant speciation through time (Fig. 3b and Supplementary Fig. S10b). Both combinations result in an increasing species diversity through time, which slows down to the present and reaches the current diversity (Fig. 3 c and Supplementary Fig. S10c). We found no correlation between elevation and tip-diversification rates (min. elevation: ρ = −0.15, P = 0.45; max. elevation: ρ = −0.04, P = 0.80, elevational range (max–min): ρ = 0.15, P = 0.41).

Figure 3.

Figure 3.

Diversification shifts and palaeodiversity dynamics of Colias. a) Colias most likely experienced a single shift in diversification rate, with slower diversification in South America than in the Northern Hemisphere. b) Clade-specific diversification rates over time. Only speciation rates are shown, as the best-fit models do not include extinction. c) Clade-specific species accumulation curves.

Host Repertoire Evolution

The extant network of 55 Colias species and 62 host plant genera reached the highest modularity in a configuration with 5 modules of butterflies and plants that interact more with each other than with the rest of the network (Supplementary Fig. S11a). Because the algorithm searches for the most modular network configuration without taking into account any biological information, modules are not always easy to interpret. Still, there was a general pattern. Module 2 was defined by butterflies with the broadest host repertoires (C. erate and C. eurytheme) and their close relatives, while all other modules were defined by the host plant genera. The host plants with the most interactions in each module were: Astragalus and Medicago in M1; Trifolium and Vaccinium in M3 (even though rarely used together); Oxytropis, Lupinus, and Hedysarum in M4; and Coronilla, Vicia, and Lotus in M5. Interestingly, the extant network is not more modular than expected by the null model (Q = 0.42, z-score = −1.89), but is more nested (NODF = 33.22, z-score = 49.24). Still, the origin of most modules can be traced back to one of the ancestral networks.

Our analysis of host repertoire evolution inferred 343 events (95% highest posterior density [HPD] 296–398) along the Colias tree, 43% of those host gains and 57% of host losses. This is equivalent to approximately 1.5 gain and 2 loss events per million years along each branch of the tree. The most recent common ancestor (MRCA) of all Colias species included in this study likely used Astragalus (module M1) and Trifolium (M3) as host plants (posterior probabilities are 0.99 and 1, respectively). Ancestors of Clade A used combinations of the ancestral hosts (Astragalus and Trifolium) and Medicago (M1), with other hosts being colonized more recently by individual species (Fig. 4). On the other hand, there were several events of host range expansion and contraction within the much larger Clade B. The MRCA of Clade B likely used Oxytropis (M4), in addition to the 2 ancestral hosts (Fig. 4). In the subclades of Clade B, most lineages lost some part of the ancestral host repertoire, except for 1 clade (C. electo and close relatives) that greatly expanded its host range, colonizing hosts from all 5 modules. One of the extant butterflies (C. electo) in this clade (C. electo and close relatives) then dispersed to Africa, while all other butterflies remained in Eurasia. In the clade of Nearctic Colias taxa, several host range expansions were reconstructed. Most of them added 1 or 2 hosts, but larger host range expansions happened in the MRCA of C. alexandra and C. philodice and its descendants. Two lineages went in the opposite direction and reduced their host repertoires to a single host, Vaccinium (M3). Their extant descendants use 1 or 2 host genera in addition to Vaccinium.

Figure 4.

Figure 4.

Evolution of host repertoire in Colias butterflies. a) Asteroid Species Tree with ancestral repertoires at internal nodes with at least 90% posterior probability. Each square represents one host genus and has the same color as in b. b) Interaction network between Colias species and their host plants (see Supplementary figures for taxon names). Interactions are colored by modules, which are groups of butterflies and hosts that interact mainly with each other. Gray squares represent interactions between different modules.

We also reconstructed ancestral butterfly-plant networks at 3, 2, and 1 Ma (Supplementary Fig. S11b–d). We identified modules from summary networks (networks, where interaction weight is given by posterior probability and only interactions with at least 0.7 probability are included) and validated them with networks sampled during MCMC (Supplementary Table S13). At 3 Ma there was only 1 module, including the 3 Colias lineages extant at that time and 3 host plants: Astragalus, Trifolium, and Oxytropis. At 2 Ma, a new module was formed. It included the 3 earliest lineages in the clade of Nearctic Colias taxa and the host plants Hedysarum, Lupinus, and Vaccinium, the first non-Fabaceae plant colonized by Colias. At 1 Ma, M4 expanded and gave origin to M3, which included the Vaccinium specialists. Until this point in time, Trifolium was placed in M1, only switching to M3 in the extant network. This change does not seem to have any biological implications and is simply a stochastic result of the modularity algorithm. Finally, also at 1 Ma, module M5 originated by a host range expansion at the branch leading to the ancestor of C. electo and its sister clade.

Discussion

Rapidly radiating taxa have always posed challenges for inferring phylogenetic relationships with confidence (Larridon et al. 2020). The development of reduced representation NGS methods has enabled phylogenetic reconstruction using hundreds of genetic loci, making evolutionary inferences of recent and ancient divergences more reliable (Grover et al. 2012; Lemmon and Lemmon 2013; Nunes et al. 2022). A robust global phylogeny of Colias has not been recovered in previous studies, largely resulting from the limited genetic data (Kir’yanov 2021) or inadequate taxon sampling (Tunström et al. 2023; Carvalho et al. 2024). Disentangling the rapidly evolving history of geographic range and host repertoire in the montane genus Colias was our motivation for employing a probe set of 150 nuclear loci with mitochondrial genomes to sequence > 190,000 bp/specimen from the most extensive taxon sampling of Colias yet assembled.

Time-calibrated Phylogeny and Species Network

Different phylogenetic analyses of our data all find 2 well-supported main clades within Colias: Clade A includes South American taxa, and Clade B includes non-South American taxa. Hybridization is inferred to have occurred within Clade A and within Clade B, but not between them, as would be expected for these geographically separated lineages. This result further supports the 2-clade division of Tunström et al. (2023). Several lines of evidence suggest that the diversification of Colias was rapid and recent. We infer the Colias crown group to be just ~3.5 Ma old, and introgression between lineages seems to have been rampant, as is often observed in young radiations (Fig. 1; Tunström et al. 2023). The recency of divergence and the frequency of hybridization pose a challenge for phylogenetic inference because of incomplete lineage sorting and an evolutionary history that is not strictly bifurcating. Some species with more than 1 individual in our taxon sample (such as C. croceus, C. erate, and C. hecla) were not monophyletic, which suggests cryptic species-level diversity or frequent, ongoing gene flow. We infer hybridization between C. electo and the lineage that gave rise to C. croceus, and these 2 species are known to hybridize (Berger 1986; Descimon and Mallet 2009). Strong discordance between the trees of mitochondrial genomes (Supplementary Fig. S5) and nuclear loci (Fig. 1) is further evidence of frequent introgression among lineages. Accompanied by hybridization events, the topological uncertainty of many nodes is found in our Colias trees, it should be noted that the presentation of our results focuses mainly on the supported nodes.

Our divergence time estimation of the stem and crown ages of Colias (Supplementary Fig. S6) yields similar results to those of recent studies (Chazot et al. 2019a; Kawahara et al. 2023; Tunström et al. 2023; Carvalho et al. 2024). An older stem age of ~20.1 Ma (10.4–32.3 Ma) was suggested for the initial split of Colias from the MRCA of its sister taxon Zerene, based on a tribal-level dated phylogeny on which only 1 sample was used for each genus of Colias and Zerene (Espeland et al. 2018). This old age is probably caused by the sparce taxon sampling at the genus and species levels.

Diversification of Colias

We found that Clade A has a constant speciation rate, and Clade B has an exponentially decreasing rate in which diversification is higher in the past and exponentially decreases towards the present (Fig. 3). This latter pattern is considered by some to be the main feature of adaptive radiation (Gavrilets and Losos 2009; Moen and Morlon 2014). In addition, Carvalho et al. (2024) found that Colias is one of the most rapidly diversifying lineages within the family Pieridae as a whole. The burst of early diversification may result from recently diverged species filling unoccupied niches after entering new adaptive areas (Kong et al. 2022) until the newly available niche space becomes saturated (Yoder et al. 2010), and the diversification rate begins to slow (Phillimore and Price 2008; Rabosky 2013).

Many factors may lead to niche shifts, including intrinsic factors such as key innovations, hybridization, introgression, and species interactions (López-Fernández et al. 2010; Rabosky and Glor 2010). Extrinsic factors including geologic or (warming) climatic events (Rohde 1992; Carvalho et al. 2024) and mountain uplift (Lagomarsino et al. 2016; Esquerré et al. 2019) may also cause niche shifts. Diversification cannot be explained simply by examining diversification rates, so we will discuss Colias diversification in concert with results from our analyses of historical biogeography and host plant repertoire.

Historical Biogeography

Diversification across different elevations has been considered one of the driving forces in the RRMT (Steinbauer et al. 2013; Halbritter et al. 2018), and likely triggered the diversification of many biological taxa. Even though Colias occurs in montane areas of varying elevation (Supplementary Table S4, Supplementary Fig. S1), our tip rate analysis and the estimates of the relationships between region features and biogeographical processes indicate that elevation has not affected dispersal, speciation, or extinction, which means that elevation may have no effect on the rapid diversification of Colias.

Montane species are profoundly affected by climate change, which causes changes in population size (demography) as well as latitudinal and altitudinal distributions (Hewitt 2000; Vintsek et al. 2022). Most Colias species diversified in the Northern Hemisphere during the past ~3.5 Ma, and geographic expansion events of species in the Western Palearctic were common (Fig. 2). It may be a pattern that repeats with each glacial cycle: (i) During glacial periods, cold-adapted montane species may spread into lowland areas and laterally into other mountain areas (Wallis et al. 2016). (ii) During interglacial periods, their ranges may shrink, isolating populations on mountain peaks (Minter et al. 2020). This distribution pattern associated with glacial cycles may produce genetically isolated populations, which may further promote speciation after the newly separated populations adapt to their isolated mountain homes. The geographic expansions of Colias may also be aided by evolutionary changes in host repertoire and the use of widespread host plants.

Host Repertoire Evolution

Host repertoire evolution in Colias butterflies (Fig. 4) was characterized by many host range expansion and contraction events. From an ancestral host repertoire including the legume genera Astragalus and Trifolium, Colias butterflies colonized a wide range of Fabaceae genera as well as plants in more distantly related families. This variation is mainly seen in Clade B, as extant species in the South American Clade A still only use the ancestral hosts and a few other closely related genera. Among extant species, there are 2 phylogenetically independent groups of more generalist Colias taxa that have colonized many Fabaceae genera (often the same ones, suggesting that they live in similar habitats), including early-diverging Fabaceae genera within this diverse family. While one of these cases can be confidently explained as an extensive host range expansion event by the ancestor of C. philodice and C. eurytheme, the timing of the other event (in the subclade including C. erate, Clade 2) is less clear given the poor branch support for this subclade (Fig. 1), likely caused by high levels of gene flow within it. Still, generalism is seen at the tips of the phylogeny, and generalist species are often closely related to specialist species, which is a general pattern supporting the oscillation hypothesis (Janz and Nylin 2008). This pattern, which is found in phytophagous insects and other parasites (Nylin et al. 2018), demonstrates that species sometimes become more generalist, but then typically specialize again so that only recent host range expansions are observed.

The nestedness observed in the extant butterfly-plant network (with ancestral hosts being the plants with the most interactions) is another line of evidence that oscillations in the host range have occurred across the Colias tree (Braga et al. 2018). The low network modularity and lack of modules with many closely related butterflies on a single host suggest that the interactions between Colias butterflies and their host plant genera cannot be explained by explosive radiations on novel hosts (Braga et al. 2018). These results seem to indicate that the oscillation hypothesis (Janz and Nylin 2008) can operate at various taxonomic and temporal scales, while the escape-and-radiate hypothesis (Ehrlich and Raven 1964) might operate mainly at the host plant family level.

By combining results from the analyses of historical biogeography and host repertoire evolution, we can investigate the role of the host plants in the butterflies’ dispersal across the world. The origin and early diversification of Astragalus likely happened in Asia more than 10 Ma (Azani et al. 2019), and some colonization of the New World likely happened before Colias diversification (~3.5 Ma). Thus, the widespread distribution of Astragalus seems to have facilitated the dispersal of Colias from South America to Asia (likely through North America) and subsequent diversification. Another example of the combined evolution of geographic and host ranges is the dispersal back to North America in one of the subclades of Clade B. Even though phylogenetic uncertainty prevents more confident timing of colonization of specific host genera, early species in this subclade have gradually added new genera into their host repertoires (including genera from distantly related families), which likely facilitated geographic range expansion. One of these genera is Vaccinium, which is mainly found in cooler areas of the Northern Hemisphere (Seider et al. 2022; Hirabayashi et al. 2023). Thus, butterflies could use Vaccinium in North America and Asia, likely helping species achieve widespread distributions, in some cases followed by local adaptation and speciation.

Conclusions

We present a well-sampled, dated phylogeny of Colias and show evidence for recent, rapid radiation during its short evolutionary history. This genus originated from a widespread ancestral range and split into the South American and non-South American clades ~3.5 Ma. Lineages of Colias diversified in many different regions across the world as they expanded geographically, and, surprisingly, we conclude that elevation probably does not affect their diversification. The genus experienced a burst of diversification in its early evolutionary history, followed by an exponential decline in diversification rate until the present. From an ancestral host repertoire including the legume genera Astragalus and Trifolium, the host ranges of Colias butterflies contracted and expanded to include a wide range of Fabaceae genera and plants in more distantly related families. We suggest that the diversity of Colias was facilitated by its geographical expansion and adaptive radiation into new niches, aided by the colonization of new host plant taxa. Our study highlights that the species richness of rapidly radiating montane insect taxa may not necessarily be related to altitude per se, but is likely to benefit from isolation on different peaks, changes in distribution likely related to climate fluctuation, and host repertoire evolution.

Supplementary Material

Data available from the Dryad Digital Repository: https://dx.doi.org/10.5061/dryad.bk3j9kdkz.

Acknowledgements

We are grateful to the chief editor Dr. Isabel Sanmartín, associate editor Dr. Daniele Silvestro, Dr. Pável Matos-Maraví, and an anonymous reviewer for their constructive comments and suggestions. S.M. would like to thank Yuan Zhang and Xiao Tian for their help during molecular data processing.

Contributor Information

Shifang Mo, Department of Entomology, College of Plant Protection, South China Agricultural University, 483 Wushan Road, Tianhe District, Guangzhou, 510000, China.

Yaowei Zhu, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, 135 Xingangxi Road, Haizhu District, Guangzhou, 510275, China.

Mariana P Braga, Department of Ecology, Swedish University of Agricultural Sciences, Ulls väg 16 Uppsala, 75649, Sweden.

David J Lohman, Department of Biology, City College of New York, City University of New York, 160 Convent Ave., New York, NY 10031, USA; PhD Program in Biology, Graduate Center, City University of New York, 365 5th Ave., New York, NY 10016, USA; Entomology Section, National Museum of Natural History, Rizal Park, T.W. Kalaw St., Manila, 1000, Philippines.

Sören Nylin, Department of Zoology, Svante Arrhenius väg 18B, Stockholm University, Stockholm, SE-10691, Sweden.

Ashraf Moumou, Department of Biology, City College of New York, City University of New York, 160 Convent Ave., New York, NY 10031, USA.

Christopher W Wheat, Department of Zoology, Svante Arrhenius väg 18B, Stockholm University, Stockholm, SE-10691, Sweden.

Niklas Wahlberg, Department of Biology, Kontaktvägen 10, Lund University, Lund, SWE-22362, Sweden.

Min Wang, Department of Entomology, College of Plant Protection, South China Agricultural University, 483 Wushan Road, Tianhe District, Guangzhou, 510000, China.

Fangzhou Ma, Nanjing Institute of Environmental Sciences, Ministry of Ecology and Environment of China, 8 Jiangwangmiao Road, Xuanwu District, Nanjing, 210000, China.

Peng Zhang, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, 135 Xingangxi Road, Haizhu District, Guangzhou, 510275, China.

Houshuai Wang, Department of Entomology, College of Plant Protection, South China Agricultural University, 483 Wushan Road, Tianhe District, Guangzhou, 510000, China.

Author Contributions

H.W., P.Z., F.M., and M.W. conceived the study. S.M., Y.Z., and M.P.B. collected and analyzed molecular data. M.P.B. and S.N. made host plant data analysis. A.M. and D.J.L. undertook Asteroid and PhyloNetwork analyses. S.M. wrote the original draft with review and edits from D.J.L., M.P.B., S.N., A.M., C.W.W., N.W., P.Z., and H.W.

Conflict Of Interest

The authors declare no conflict of interest.

Funding

This work was supported by grants from (i) the National Natural Science Foundation of China (No. 32070469 to M.W., No. 32270478 to H.W., and No. 32071611 to P.Z.); (ii) the Swedish Research Council (International Postdoc Grant No. 2020-06422 to M.P.B., Grant No. 2019-03441 to S.N.); (iii) the US National Science Foundation (DEB-1541557 to D.J.L.); and (iv) the China Biodiversity Observation Networks for Butterflies (China BON-Butterfly) to F.M.

Data Availability

Code used for software in this study can be also found from the Zenodo using the link: https://doi.org/10.5281/zenodo.13932428

References

  1. Albert T.J., Molla M.N., Muzny D.M., Nazareth L., Wheeler D., Song X., Richmond T.A., Middle C.M., Rodesch M.J., Packard C.J., Weinstock G.M., Gibbs R.A.. 2007. Direct selection of human genomic loci by microarray hybridization. Nat. Methods 4:903–905. [DOI] [PubMed] [Google Scholar]
  2. Allio R., Schomaker-Bastos A., Romiguier J., Prosdocimi F., Nabholz B., Delsuc F.. 2020. MitoFinder: efficient automated large-scale extraction of mitogenomic data in target enrichment phylogenomics. Mol. Ecol. Resour. 20:892–905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Almeida‐Neto M., Guimarães P., Guimarães Jr P.R., Loyola R.D., Ulrich W. 2008. A consistent metric for nestedness analysis in ecological systems: reconciling concept and measurement. Oikos 117:1227–1239. doi: https://doi.org/ 10.1111/j.0030-1299.2008.16644.x [DOI] [Google Scholar]
  4. Almeida-Neto M., Ulrich W.. 2011. A straightforward computational approach for measuring nestedness using quantitative matrices. Environ. Model. Softw. 26:173–178. [Google Scholar]
  5. Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J.. 1990. Basic local alignment search tool. J. Mol. Biol. 215:403–410. [DOI] [PubMed] [Google Scholar]
  6. Azani N., Bruneau A., Wojciechowski M.F., Zarre S.. 2019. Miocene climate change as a driving force for multiple origins of annual species in Astragalus (Fabaceae, Papilionoideae). Mol. Phylogenet. Evol. 137:210–221. [DOI] [PubMed] [Google Scholar]
  7. Beckett S.J. 2016. Improved community detection in weighted bipartite networks. R. Soc. Open Sci. 3:140536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Berger L.A. 1986. Systématique du genre Colias F. Lepidoptera-Pieridae. Lambillionea 86:1–68. [Google Scholar]
  9. Blair C., Bryson R.W., Linkem C.W., Lazcano D., Klicka J., McCormack J.E. 2019. Cryptic diversity in the Mexican highlands: Thousands of UCE loci help illuminate phylogenetic relationships, species limits and divergence times of montane rattlesnakes (Viperidae: Crotalus). Molecular Ecology Resources 19:349–365. doi: https://doi.org/ 10.1111/1755-0998.12970 [DOI] [PubMed] [Google Scholar]
  10. Bolger A.M., Lohse M., Usadel B.. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Braga M.P., Guimarães P.R., Wheat C.W., Nylin S., Janz N.. 2018. Unifying host-associated diversification processes using butterfly–plant networks. Nat. Commun. 9:1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Braga M.P., Janz N.. 2021. Host repertoires and changing insect-plant interactions. Ecol. Entomol. 46:1241–1253. [Google Scholar]
  13. Braga M.P., Janz N., Nylin S., Ronquist F., Landis M.J.. 2021. Phylogenetic reconstruction of ancestral ecological networks through time for pierid butterflies and their host plants. Ecol. Lett. 24:2134–2145. [DOI] [PubMed] [Google Scholar]
  14. Braga M.P., Landis M.J., Nylin S., Janz N., Ronquist F.. 2020. Bayesian inference of ancestral host–parasite interactions under a phylogenetic model of host repertoire evolution. Syst. Biol. 69:1149–1162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Breinholt J.W., Earl C., Lemmon A.R., Lemmon E.M., Xiao L., Kawahara A.Y.. 2018. Resolving relationships among the megadiverse butterflies and moths with a novel pipeline for anchored phylogenomics. Syst. Biol. 67:78–93. [DOI] [PubMed] [Google Scholar]
  16. Cadena C.D., Cuervo A.M., Céspedes L.N., Bravo G.A., Krabbe N., Schulenberg T.S., Derryberry G.E., Silveira L.F., Derryberry E.P., Brumfield R.T., Fjeldså J.. 2020. Systematics, biogeography, and diversification of Scytalopus tapaculos (Rhinocryptidae), an enigmatic radiation of Neotropical montane birds. Auk 138:ukz077. [Google Scholar]
  17. Calderón-Acevedo C.A., Bagley J.C., Muchhala N.. 2022. Genome-wide ultraconserved elements resolve phylogenetic relationships and biogeographic history among Neotropical leaf-nosed bats in the genus Anoura (Phyllostomidae). Mol. Phylogenet. Evol. 167:107356. [DOI] [PubMed] [Google Scholar]
  18. Carvalho A.P.S., Owens H.L., St Laurent R.A., Earl C., Dexter K.M., Messcher R.L., Willmott K.R., Aduse-Poku K., Collins S.C., Homziak N.T., Hoshizaki S., Hsu Y., Kizhakke A.G., Kunte K., Martins D.J., Mega N.O., Morinaka S., Peggie D., Romanowski H.P., Sáfián S., Vila R., Wang H., Braby M.F., Espeland M., Breinholt J.W., Pierce N.E., Kawahara A.Y., Lohman D.J.. 2024. Comprehensive phylogeny of Pieridae butterflies reveals strong correlation between diversification and temperature. iScience 27:109336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Casner K.L., Pyrcz T.W.. 2010. Patterns and timing of diversification in a tropical montane butterfly genus, Lymanopoda (Nymphalidae, Satyrinae). Ecography 33:251–259. [Google Scholar]
  20. Castresana J. 2000. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol. Biol. Evol. 17:540–552. [DOI] [PubMed] [Google Scholar]
  21. Chazot N., Condamine F.L., Dudas G., Peña C., Kodandaramaiah U., Matos-Maraví P., Aduse-Poku K., Elias M., Warren A.D., Lohman D.J., Penz C.M., DeVries P., Fric Z.F., Nylin S., Müller C., Kawahara A.Y., Silva-Brandão K.L., Lamas G., Kleckova I., Zubek A., Ortiz-Acevedo E., Vila R., Vane-Wright R.I., Mullen S.P., Jiggins C.D., Wheat C.W., Freitas A.V.L., Wahlberg N.. 2021. Conserved ancestral tropical niche but different continental histories explain the latitudinal diversity gradient in brush-footed butterflies. Nat. Commun. 12:5717. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Chazot N., Wahlberg N., Freitas A.V.L., Mitter C., Labandeira C., Sohn J., Sahoo R.K., Seraphim N., De Jong R., Heikkilä M.. 2019a. Priors and posteriors in Bayesian timing of divergence analyses: the age of butterflies revisited. Syst. Biol. 68:797–813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Chazot N., Willmott K.R., Lamas G., Freitas A.V., Piron‐Prunier F., Arias C.F., Mallet J., De-Silva D.L., Elias M.. 2019b. Renewed diversification following Miocene landscape turnover in a Neotropical butterfly radiation. Glob. Ecol. Biogeogr. 28:1118–1132. [Google Scholar]
  24. de Jong R. 2017. Fossil butterflies, calibration points and the molecular clock (Lepidoptera: Papilionoidea). Zootaxa 4270:1–63. doi: https://doi.org/ 10.11646/zootaxa.4270.1.1 [DOI] [PubMed] [Google Scholar]
  25. de Moraes Magaldi L., Gueratto P.E., Ortega-Abboud E., Sobral-Souza T., Joron M., de Souza A.P., Freitas A.V.L., Silva-Brandão K.L.. 2024. Montane diversification as a mechanism of speciation in Neotropical butterflies. Ecol. Evol. 14:e11704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Descimon H., Mallet J.. 2009. Bad species. In: Settele J., Konvicka M., Shreeve T., Dennis R., Van Dyck H., editors. Ecology of butterflies in Europe. Cambridge: Cambridge University Press. p. 203–215. [Google Scholar]
  27. Ehrlich P.R., Raven P.H.. 1964. Butterflies and plants: a study in coevolution. Evolution 18:586–608. [Google Scholar]
  28. Espeland M., Breinholt J., Willmott K.R., Warren A.D., Vila R., Toussaint E.F., Maunsell S.C., Aduse-Poku K., Talavera G., Eastwood R., Jarzyna M.A., Guralnick R., Lohman D.J., Pierce N.E., Kawahara A.Y.. 2018. A comprehensive and dated phylogenomic analysis of butterflies. Current Biology 28:770–778. doi: https://doi.org/ 10.1016/j.cub.2018.01.061 [DOI] [PubMed] [Google Scholar]
  29. Esquerré D., Brennan I.G., Catullo R.A., Torres-Perez F., Keogh J.S.. 2019. How mountains shape biodiversity: the role of the Andes in biogeography, diversification, and reproductive biology in South America’s most species-rich lizard radiation (Squamata, Liolaemidae). Evolution 73:214–230. [DOI] [PubMed] [Google Scholar]
  30. Faircloth B.C., Branstetter M.G., White N.D., Brady S.G.. 2015. Target enrichment of ultraconserved elements from arthropods provides a genomic perspective on relationships among Hymenoptera. Mol. Ecol. Resour. 15:489–501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Fjeldså J., Bowie R.C.K., Rahbek C.. 2012. The role of mountain ranges in the diversification of birds. Annu. Rev. Ecol. Evol. Syst. 43:249–265. [Google Scholar]
  32. Gavrilets S., Losos J.B.. 2009. Adaptive radiation: contrasting theory with data. Science 323:732–737. [DOI] [PubMed] [Google Scholar]
  33. Gelman A., Rubin D.B.. 1992. Inference from iterative simulation using multiple sequences. Stat. Sci. 7:457–472. [Google Scholar]
  34. Glenn T.C., Faircloth B.C.. 2016. Capturing Darwin’s dream. Mol. Ecol. Resour. 16:1051–1058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Gnirke A., Melnikov A., Maguire J., Rogov P., LeProust E.M., Brockman W., Fennell T., Giannoukos G., Fisher S., Russ C., Gabriel S., Jaffe D.B., Lander E.S., Nusbaum C.. 2009. Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nat. Biotechnol. 27:182–189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Grieshuber J. 2014. Guide to the butterflies of the Palaearctic region. Pieridae II. Coliadinae. Milano: Omnes Artes. [Google Scholar]
  37. Grieshuber J., Worthy B., Lamas G.. 2012. The genus Colias Fabricus, 1807: Jan Haugum’s annotated catalogue of the Old World Colias (Lepidoptera, Pieridae). Pardubice: Tshikolovets Publications. [Google Scholar]
  38. Grover C.E., Salmon A., Wendel J.F.. 2012. Targeted sequence capture as a powerful tool for evolutionary analysis. Am. J. Bot. 99:312–319. [DOI] [PubMed] [Google Scholar]
  39. Guindon S., Dufayard J.F., Lefort V., Anisimova M., Hordijk W., Gascuel O.. 2010. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59:307–321. [DOI] [PubMed] [Google Scholar]
  40. Halbritter A.H., Fior S., Keller I., Billeter R., Edwards P.J., Holderegger R., Karrenberg S., Pluess A.R., Widmer A., Alexander J.M.. 2018. Trait differentiation and adaptation of plants along elevation gradients. J. Evol. Biol. 31:784–800. [DOI] [PubMed] [Google Scholar]
  41. Harvey M.G., Rabosky D.L.. 2018. Continuous traits and speciation rates: alternatives to state-dependent diversification models. Methods Ecol. Evol. 9:984–993. [Google Scholar]
  42. He B., Zhao Y., Su C., Lin G., Wang Y., Li L., Ma J., Yang Q., Hao J.. 2023. Phylogenomics reveal extensive phylogenetic discordance due to incomplete lineage sorting following the rapid radiation of alpine butterflies (Papilionidae: Parnassius). Syst. Entomol. 48:585–599. [Google Scholar]
  43. Hejase H.A., Liu K.J.. 2016. A scalability study of phylogenetic network inference methods using empirical data sets and simulations involving a single reticulation. BMC Bioinf. 17:422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Hewitt G.M. 2000. The genetic legacy of the Quaternary ice ages. Nature 405:907–913. [DOI] [PubMed] [Google Scholar]
  45. Hillis D.M., Bull J.J.. 1993. An empirical test of bootstrapping as a method for assessing confidence in phylogenetic analysis. Syst. Biol. 42:182–192. [Google Scholar]
  46. Hirabayashi K., Debnath S.C., Owens G.L.. 2023. Unveiling the evolutionary history of lingonberry (Vaccinium vitis-idaea L.) through genome sequencing and assembly of European and North American subspecies. G3: Genes, Genomes, Genet. 14:jkad294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Hodges E., Xuan Z., Balija V., Kramer M., Molla M.N., Smith S.W., Middle C.M., Rodesch M.J., Albert T.J., Hannon G.J., McCombie W.R.. 2007. Genome-wide in situ exon capture for selective resequencing. Nat. Genet. 39:1522–1527. [DOI] [PubMed] [Google Scholar]
  48. Höhna S., Landis M.J., Heath T.A., Boussau B., Lartillot N., Moore B.R., Huelsenbeck J.P., Ronquist F.. 2016. RevBayes: bayesian phylogenetic inference using graphical models and an interactive model-specification language. Syst. Biol. 65:726–736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Hughes C.E., Atchison G.W.. 2015. The ubiquity of alpine plant radiations: from the Andes to the Hengduan Mountains. New Phytol. 207:275–282. [DOI] [PubMed] [Google Scholar]
  50. Janz N., Nylin S.. 2008. The oscillation hypothesis of host-plant range and speciation. In: Tilmon K.J., editor. Specialization, speciation, and radiation: the evolutionary biology of herbivorous insects. CA: California University Press. p. 203–215. [Google Scholar]
  51. Jetz W., Thomas G.H., Joy J.B., Hartmann K., Mooers A.O.. 2012. The global diversity of birds in space and time. Nature 491:444–448. [DOI] [PubMed] [Google Scholar]
  52. Kalyaanamoorthy S., Minh B.Q., Wong T.K., Haeseler A., Jermiin L.S.. 2017. Modelfinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14:587–589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Katoh K., Standley D.M.. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30:772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Kawahara A.Y., Breinholt J.W., Espeland M., Storer C., Plotkin D., Dexter K.M., Toussaint E.F., St Laurent R.A., Brehm G., Vargas S., Forero D., Pierce N.E., Lohman D.J.. 2018. Phylogenetics of moth-like butterflies (Papilionoidea: Hedylidae) based on a new 13-locus target capture probe set. Mol. Phylogenet. Evol. 127:600–605. [DOI] [PubMed] [Google Scholar]
  55. Kawahara A.Y., Storer C., Carvalho A.P.S., Plotkin D.M., Condamine F.L., Braga M.P., Ellis E.A., St Laurent R.A., Li X., Barve V., Cai L., Earl C., Frandsen P.B., Owens H.L., Valencia-Montoya W.A., Aduse-Poku K., Toussaint E.F.A., Dexter K.M., Doleck T., Markee A., Messcher R., Nguyen Y.-L., Badon J.A.T., Benítez H.A., Braby M.F., Buenavente P.A.C., Chan W., Collins S.C., Childers R.A.R., Dankowicz E., Eastwood R., Fric Z.F., Gott R.J., Hall J.P.W., Hallwachs W., Hardy N.B., Sipe R.L.H., Heath A., Hinolan J.D., Homziak N.T., Hsu Y.F., Inayoshi Y., Itliong M.G.A., Janzen D.H., Kitching I.J., Kunte K., Lamas G., Landis M.J., Larsen E.A., Larsen T.B., Leong J.V., Lukhtanov V., Maier C.A., Martinez J.I., Martins D.J., Maruyama K., Maunsell S.C., Mega N.O., Monastyrskii A., Morais A.B.B., Müller C.J., Naive M.A.K., Nielsen G., Padrón P.S., Peggie D., Romanowski H.P., Sáfián S., Saito M., Schröder S., Shirey V., Soltis D., Soltis P., Sourakov A., Talavera G., Vila R., Vlasanek P., Wang H.S., Warren A.D., Willmott K.R., Yago M., Jetz W., Jarzyna M.A., Breinholt J.W., Espeland M., Ries L., Guralnick R.P., Pierce N.E., Lohman D.J.. 2023. A global phylogeny of butterflies reveals their evolutionary history, ancestral hosts and biogeographic origins. Nat. Ecol. Evol. 7:903–913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Kir’yanov A.V. 2021. DNA barcoding and renewed taxonomy of South American Clouded Sulphurs (Lepidoptera: Pieridae: Colias). Southwest. Entomol. 46:891–928. [Google Scholar]
  57. Kong H., Condamine F.L., Yang L., Harri A.J., Feng C., Wen F., Kang M.. 2022. Phylogenomic and macroevolutionary evidence for an explosive radiation of a plant genus in the Miocene. Syst. Biol. 71:589–609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Lagomarsino L.P., Condamine F.L., Antonelli A., Mulch A., Davis C.C.. 2016. The abiotic and biotic drivers of rapid diversification in Andean bellflowers (Campanulaceae). New Phytol. 210:1430–1442. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Laiho J., Stahls G.. 2013. DNA barcodes identify Central Asian Colias butterflies (Lepidoptera, Pieridae). Zookeys 365:175–196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Larridon I., Villaverde T., Zuntini A.R., Pokorny L., Brewer G.E., Epitawalage N., Fairlie I., Hahn M., Kim J., Maguilla E., Maurin O., Xanthos M., Hipp A.L., Forest F., Baker W.J.. 2020. Tackling rapid radiations with targeted sequencing. Front. Plant Sci. 10:1655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Lemmon E.M., Lemmon A.R.. 2013. High-throughput genomic data in systematics and phylogenetics. Annu. Rev. Ecol. Evol. Syst. 44:99–121. [Google Scholar]
  62. Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R.; 1000 Genome Project Data Processing Subgroup. 2009. The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Li W., Godzik A.. 2006. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22:1658–1659. [DOI] [PubMed] [Google Scholar]
  64. Linder H.P. 2008. Plant species radiations: where, when, why? Philos. Trans. R. Soc. London, Ser. B 363:3097–3105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Liu J., Milne R.I., Zhu G.F., Spicer R.A., Wambulwa M.C., Wu Z.Y., Boufford D.E., Luo Y.H., Provan J., Yi T.S., Cai J., Wang H., Gao L.-M., Li D.-Z.. 2022. Name and scale matters: clarifying the geography of Tibetan Plateau and adjacent mountain regions. Glob. Planet. Change 215:103893. [Google Scholar]
  66. Liu J., Zhu G.F.. 2022. Geographical and geological GIS boundaries of the Tibetan Plateau and adjacent mountain regions (version 2022.1). Available from: https://doi.org/ 10.5281/zenodo.6432940 [DOI] [Google Scholar]
  67. López-Fernández H., Winemiller K.O., Honeycutt R.L.. 2010. Multilocus phylogeny and rapid radiations in Neotropical cichlid fishes (Perciformes: Cichlidae: Cichlinae). Mol. Phylogenet. Evol. 55:1070–1086. [DOI] [PubMed] [Google Scholar]
  68. Ma L., Zhang Y., Lohman D.J., Wahlberg N., Ma F., Nylin S., Janz N., Yago M., Aduse‐Poku K., Peggie D., Wang M., Peng Z., Wang H.S.. 2020. A phylogenomic tree inferred with an inexpensive PCR‐generated probe kit resolves higher‐level relationships among Neptis butterflies (Nymphalidae: Limenitidinae). Syst. Entomol. 45:924–934. [Google Scholar]
  69. Mamanova L., Coffey A.J., Scott C.E., Kozarewa I., Turner E.H., Kumar A., Howard E., Shendure J., Turner D.J.. 2010. Target-enrichment strategies for next-generation sequencing. Nat. Methods 7:111–118. [DOI] [PubMed] [Google Scholar]
  70. Mazet N., Morlon H., Fabre P.H., Condamine F.L.. 2023. Estimating clade‐specific diversification rates and palaeodiversity dynamics from reconstructed phylogenies. Methods Ecol. Evol. 14:2575–2591. [Google Scholar]
  71. Minh B.Q., Hahn M.W., Lanfear R.. 2020a. New methods to calculate concordance factors for phylogenomic data sets. Mol. Biol. Evol. 37:2727–2733. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Minh B.Q., Nguyen M.A.T., von Haeseler A.. 2013. Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol. 30:1188–1195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Minh B.Q., Schmidt H.A., Chernomor O., Schrempf D., Woodhams M.D., Von Haeseler A., Lanfear R.. 2020b. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37:1530–1534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Minter M., Dasmahapatra K.K., Thomas C.D., Morecroft M.D., Tonhasca A., Schmitt T., Siozios S., Hill J.K.. 2020. Past, current, and potential future distributions of unique genetic diversity in a cold‐adapted mountain butterfly. Ecol. Evol. 10:11155–11168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Moen D., Morlon H.. 2014. Why does diversification slow down? Trends Ecol. Evol. 29:190–197. [DOI] [PubMed] [Google Scholar]
  76. Morel B., Williams T.A., Stamatakis A.. 2023. Asteroid: a new algorithm to infer species trees from gene trees under high proportions of missing data. Bioinformatics 39:btac832. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Morlon H., Lewitus E., Condamine F.L., Manceau M., Clavel J., Drury J.. 2016. RPANDA: an R package for macroevolutionary analyses on phylogenetic trees. Methods Ecol. Evol. 7:589–597. [Google Scholar]
  78. Morlon H., Parsons T.L., Plotkin J.B.. 2011. Reconciling molecular phylogenies with the fossil record. Proc. Natl. Acad. Sci. U.S.A. 108:16327–16332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Nunes R., Storer C., Doleck T., Kawahara A.Y., Pierce N.E., Lohman D.J.. 2022. Predictors of sequence capture in a large-scale anchored phylogenomics project. Front. Ecol. Evol. 10:943361. [Google Scholar]
  80. Nylin S., Agosta S., Bensch S., Boeger W.A., Braga M.P., Brooks D.R., Forister M.L., Hamback P.A., Hoberg E.P., Nyman T., Schapers A., Stigall A.L., Wheat C.W., Osterling M., Janz N.. 2018. Embracing colonizations: a new paradigm for species association dynamics. Trends Ecol. Evol. 33:4–14. [DOI] [PubMed] [Google Scholar]
  81. Obiol J.F., James H.F., Chesser R.T., Bretagnolle V., González-Solís J., Rozas J., Riutort M., Welch A.J.. 2021. Integrating sequence capture and restriction site-associated DNA sequencing to resolve recent radiations of pelagic seabirds. Syst. Biol. 70:976–996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Paradis E., Schliep K.. 2019. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 35:526–528. [DOI] [PubMed] [Google Scholar]
  83. Phillimore A.B., Price T.D.. 2008. Density-dependent cladogenesis in birds. PLoS Biol. 6:e71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Plummer M., Best N., Cowles K., Vines K., Sarkar D., Bates D., Almond R., Magnusson A.. 2006. CODA: convergence diagnosis and output analysis for MCMC. R News 6:7–11. [Google Scholar]
  85. Pollock D.D., Watt W.B., Rashbrook V.K., Iyengar E.V.. 1998. Molecular phylogeny for Colias butterflies and their relatives (Lepidoptera: Pieridae). Ann. Entomol. Soc. Am. 91:524–531. [Google Scholar]
  86. Rabosky D.L. 2013. Diversity-dependence, ecological speciation, and the role of competition in macroevolution. Annu. Rev. Ecol. Evol. Syst. 44:481–502. [Google Scholar]
  87. Rabosky D.L., Glor R.E.. 2010. Equilibrium speciation dynamics in a model adaptive radiation of island lizards. Proc. Natl. Acad. Sci. U.S.A. 107:22178–22183. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Rahbek C., Borregaard M.K., Antonelli A., Colwell R.K., Holt B.G., Nogues-Bravo D., Rasmussen C.M., Richardson K., Rosing M.T., Whittaker R.J.. 2019a. Building mountain biodiversity: geological and evolutionary processes. Science 365:1114–1119. [DOI] [PubMed] [Google Scholar]
  89. Rahbek C., Borregaard M.K., Colwell R.K., Dalsgaard B., Holt B.G., Morueta-Holme N., Nogues-Bravo D., Whittaker R.J., Fjeldså J.. 2019b. Humboldt’s enigma: what causes global patterns of mountain biodiversity? Science 365:1108–1113. [DOI] [PubMed] [Google Scholar]
  90. Rambaut A., Drummond A.J., Xie D., Baele G., Suchard M.A.. 2018. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 67:901–904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Rodriguez-Caro L., Fenner J., Benson C., Van Belleghem S.M., Counterman B.A.. 2020. Genome assembly of the dogface butterfly Zerene cesonia. Genome Biol. Evol. 12:3580–3585. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Rohde K. 1992. Latitudinal gradients in species diversity: the search for the primary cause. Oikos 65:514–527. [Google Scholar]
  93. Scherson R.A., Vidal R., Sanderson M.J.. 2008. Phylogeny, biogeography, and rates of diversification of New World Astragalus (Leguminosae) with an emphasis on South American radiations. Am. J. Bot. 95:1030–1039. [DOI] [PubMed] [Google Scholar]
  94. Seider J.H., Lantz T.C., Bone C.. 2022. Tundra shrub expansion in a warming climate and the influence of data type on models of habitat suitability. Arct. Antarct. Alp. Res. 54:488–506. [Google Scholar]
  95. Slater G.S.C., Birney E.. 2005. Automated generation of heuristics for biological sequence comparison. BMC Bioinform. 6:1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Solís-Lemus C., Ané C.. 2016. Inferring phylogenetic networks with maximum pseudolikelihood under incomplete sineage sorting. PLoS Genet. 12:e1005896. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Solís-Lemus C., Bastide P., Ané C.. 2017. PhyloNetworks: a package for phylogenetic networks. Mol. Biol. Evol. 34:3292–3298. [DOI] [PubMed] [Google Scholar]
  98. Spehn E.M., Rudmann-Maurer K., Körner C.. 2011. Mountain biodiversity. Plant Ecol. Divers. 4:301–302. [Google Scholar]
  99. Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30:1312–1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Steinbauer M.J., Irl S.D.H., Beierkuhnlein C.. 2013. Elevation-driven ecological isolation promotes diversification on Mediterranean islands. Acta Oecol. 47:52–56. [Google Scholar]
  101. Swiston S.K., Landis M.J.. 2023. Testing relationships between multiple regional features and biogeographic processes of speciation, extinction, and dispersal. BioRxiv. doi: https://doi.org/ 10.1101/2023.06.19.545613. [DOI] [PubMed] [Google Scholar]
  102. Thomas A.E., Igea J., Meudt H.M., Albach D.C., Lee W.G., Tanentzap A.J.. 2021. Using target sequence capture to improve the phylogenetic resolution of a rapid radiation in New Zealand Veronica. Am. J. Bot. 108:1289–1306. [DOI] [PubMed] [Google Scholar]
  103. Tian X., Mo S., Liang D., Wang H.S., Zhang P.. 2023. Amplicon capture phylogenomics provides new insights into the phylogeny and evolution of alpine Parnassius butterflies (Lepidoptera: Papilionidae). Syst. Entomol. 48:571–584. [Google Scholar]
  104. Tunström K., Woronik A., Hanly J.J., Rastas P., Chichvarkhin A., Warren A.D., Kawahara A.Y., Schoville S.D., Ficarrotta V., Porter A.H., Watt W.B., Martin A., Wheat C.W.. 2023. Evidence for a single, ancient origin of a genus-wide alternative life history strategy. Sci. Adv. 9:eabq371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Ulrich W., Almeida‐Neto M., Gotelli N.J.. 2009. A consumer’s guide to nestedness analysis. Oikos 118:3–17. [Google Scholar]
  106. Verhulst J.T. 2000. Les Colias du Globe. Monograph of the Genus Colias (Lepidoptera: Rhopalocera: Pieridae). Keltern, Germany: Goecke & Evers. [Google Scholar]
  107. Vintsek L., Klichowska E., Nowak A., Nobis M.. 2022. Genetic differentiation, demographic history and distribution models of high alpine endemic vicariants outline the response of species to predicted climate changes in a Central Asian biodiversity hotspot. Ecol. Indic. 144:109419. [Google Scholar]
  108. Wallis G.P., Waters J.M., Upton P., Craw D.. 2016. Transverse alpine speciation driven by glaciation. Trends Ecol. Evol. 31:916–926. [DOI] [PubMed] [Google Scholar]
  109. Wang S., Girardello M., Zhang W.. 2024. Potential and progress of studying mountain biodiversity by means of butterfly genetics and genomics. J. Genet. Genomics 51:292–301. [DOI] [PubMed] [Google Scholar]
  110. Wheat C.W., Watt W.B.. 2008. A mitochondrial-DNA-based phylogeny for some evolutionary-genetic model species of Colias butterflies (Lepidoptera, Pieridae). Mol. Phylogenet. Evol. 47:893–902. [DOI] [PubMed] [Google Scholar]
  111. Wu C., Hsu Y.F.. 2017. Butterflies of China. Fuzhou, China: The Straits Publishing & Distribution Group. [Google Scholar]
  112. Xu H., Luo X., Qian J., Pang X., Song J., Qian G., Chen J., Chen S.. 2012. FastUniq: a fast de novo duplicates removal tool for paired short reads. PLoS One 7:e52249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  113. Yang Z. 2007. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24:1586–1591. [DOI] [PubMed] [Google Scholar]
  114. Ye Z., Chen P., Bu W.. 2016. Terrestrial mountain islands and Pleistocene climate fluctuations as motors for speciation: a case study on the genus Pseudovelia (Hemiptera: Veliidae). Sci. Rep. 6:33625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  115. Yoder J.B., Clancey E., Des Roches S., Eastman J.M., Gentry L., Godsoe W., Hagey T.J., Jochimsen D., Oswald B.P., Robertson J., Sarver B.A.J., Schenk J.J., Spear S.F., Harmon L.J.. 2010. Ecological opportunity and the origin of adaptive radiations. J. Evol. Biol. 23:1581–1596. [DOI] [PubMed] [Google Scholar]
  116. Zhang C., Rabiee M., Sayyari E., Mirarab S.. 2018. ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees. BMC Bioinform. 19:15–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Zhang Y., Deng S.H., Liang D., Zhang P.. 2019. Sequence capture across large phylogenetic scales by using pooled PCR‐generated baits: a case study of Lepidoptera. Mol. Ecol. Resour. 19:1037–1051. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Code used for software in this study can be also found from the Zenodo using the link: https://doi.org/10.5281/zenodo.13932428


Articles from Systematic Biology are provided here courtesy of Oxford University Press

RESOURCES