Abstract
Genomic variation within and among populations is shaped by the interplay between natural selection and the effects of genetic drift and gene flow. Adaptive divergence can be found in small-scale natural systems even when population sizes are small, and the potential for gene flow is high, suggesting that local environments exert selection pressures strong enough to counteract the opposing effects of drift and gene flow. Here, we investigated genomic differentiation in nine moor frog (Rana arvalis) populations in a small-scale network of local wetlands using 16,707 ddRAD-seq SNPs, relating levels of differentiation with local environments, as well as with properties of the surrounding landscape. We characterized population structure and differentiation, and partitioned the effects of geographic distance, local larval environment, and landscape features on total genomic variation. We also conducted gene–environment association studies using univariate and multivariate approaches. We found small-scale population structure corresponding to 6–8 clusters. Local larval environment was the most influential component explaining 2.3% of the total genetic variation followed by landscape features (1.8%) and geographic distance (0.8%), indicative of isolation-by-environment, -by-landscape, and -by-distance, respectively. We identified 1000 potential candidate SNPs putatively under divergent selection mediated by the local larval environment. The candidate SNPs were involved in, among other biological functions, immune system function and development. Our results suggest that small-scale environmental differences can exert selection pressures strong enough to counteract homogenizing effects of gene flow and drift in this small-scale system, leading to observable population differentiation.
Subject terms: Genetic variation, Ecological genetics
Introduction
As evolution takes place at the population level, understanding local adaptation is central to many aspects of evolutionary, population, and conservation biology (Hoban et al. 2016; Delph 2018). The interplay between spatially varying selection, gene flow, and random genetic drift affects the genetic diversity of organisms and determines the ability of a population to adapt to local environmental conditions. Therefore, quantifying the relative contributions of natural selection, genetic drift, and gene flow to population divergence is crucial for understanding the local adaptation process (Savolainen et al. 2007). The relative strength of these processes is tightly linked with the spatial scale of environmental variation (Kawecki and Ebert 2004; Richardson et al. 2014; Bachmann et al. 2020). According to the more traditional view, gene flow is expected to counteract adaptation, and strong selection is needed to maintain beneficial alleles in a population (Haldane 1930). A more current view, however, suggests that gene flow has a multifaceted role in local adaptation (reviewed in Tigano and Friesen 2016). Depending on the relative strengths of gene flow and selection, gene flow can lead to gene swamping, where less beneficial alleles enter a locally adapted population (Balkau and Feldman 1973; Lenormand 2002), and thus serves as an obstacle for local adaptation. Here the effect size of the alleles involved is important as large-effect alleles are more resistant to gene swamping than small-effect alleles (Yeaman and Otto 2011). However, unless complete swamping occurs, the increase in standing genetic variation through gene flow may result in a larger set of available alleles for selection to work on (Barrett, Schluter 2008), and introgression of directly beneficial alleles into a population may aid adaptation (Hedrick 2013).
Multiple factors acting at various spatial scales affect the spatial distribution of genetic variation and rates of gene flow across landscapes. Geographic separation and environmental heterogeneity are two key factors affecting genetic structure among populations (Shafer, Wolf 2013; Sexton et al. 2014), but landscape resistance, partial barriers to dispersal, and differences in timing of reproduction can play important roles as well (Hendry and Day 2005; Van Strien et al. 2015). Geographic separation is directly related to the interplay of genetic drift and movement. Dispersal is often limited by geographic distance, resulting in higher gene flow between more closely located populations where low levels of migration will increase the influence of genetic drift and reinforce population divergence. This pattern is known as genetic isolation or isolation-by-distance (IBD), and is commonly observed as a correlation between neutral genetic differentiation and geographic distance (Wright 1943; Jenkins et al. 2010).
Adaptive local population divergence is typically mediated by abiotic and/or biotic differences in local environments, a pattern known as isolation-by-environment (IBE, Rundle and Nosil 2005). While the definition of IBE does not make any particular claim about the mechanisms shaping it, allele-frequency shifts correlated with environmental differences are often interpreted as the result of local variation in selection pressures (Kawecki and Ebert 2004; Sexton et al. 2014). Environmental differences can also influence the movements of animals, as well as the behavior of seed and pollen disperses, affecting the genetic structure of organisms (Dyer et al. 2012; Wang and Bradburd 2014). In heterogeneous environments, landscape characteristics, such as rivers, roads, habitat fragmentation, or mountain ridges, may promote or resist individual dispersal, which may lead to a pattern of isolation-by-resistance (IBR, McRae 2006). Habitat features can also result in small-scale shifts in phenology, where populations that occupy the same geographic distribution reproduce at different times, resulting in isolation-by-time (Hendry and Day 2005). Genetic divergence and allochronic reproductive isolation have been described in insects, vertebrates, and plants (Maes et al. 2006; Santos et al. 2007; Ribolli et al. 2017; Ismail and Kokko 2019).
Geography and landscape features are often correlated with local environmental variation, making it difficult to separate the relative effects of IBE from those of the others, and thereby understand their roles in genetic differentiation (Wang and Bradburd 2014). The autocorrelation between climatic and environmental gradients with geography is more evident at large spatial scales, which will confound the detection of IBE, and in particular, complicate any attempt to identify genomic regions involved in adaptive divergence (Vasemägi 2006). At small spatial scales, with a weak correlation between local environments and geographic distances, it would be easier to disentangle the genetic signatures of these different types of evolutionary processes. However, until recently, analyses have often suffered from low power to detect genetic differentiation among populations over small spatial scales. Indeed, in many cases where molecular methods have suggested low divergence, phenotypic methods have indicated considerable adaptive divergence (Leinonen et al. 2013; Savolainen et al. 2013). The study of divergence at small spatial scales has become increasingly feasible with modern genomic techniques, genotyping tens of thousands of single-nucleotide polymorphisms (SNPs), providing the resolution to detect population divergence over short spatial scales (Aguillon et al. 2017), and detecting signatures of selection within the genome (Hemmer-Hansen et al. 2013).
Several studies have aimed to unveil the causes of population differentiation at different environmental and spatial resolutions, such as along latitudinal and altitudinal gradients (e.g., Luquet et al. 2019; Yu et al. 2016; Campbell-Staton et al. 2017; Rödin-Mörch et al. 2019; Montero-Mendieta et al. 2019), as well as across land-, sea-, and riverscapes (e.g., Vincent et al. 2013; Whelan et al. 2019). Amphibians provide excellent systems to study adaptive divergence, where species with a metapopulation structure and a relatively low mobility are exposed to different environments even at small spatial scales. Ecoevolutionary models used to explore the dynamics of adaptation in spatially heterogeneous metapopulations show that depending on gene flow and demographic parameters, adaptation may be local at the network scale, and leads to a mosaic specialization (Hanski et al. 2011; Papaïx et al. 2013). Several studies performed on amphibians have found evidence of adaptive divergence at small spatial scales (Skelly 2004; Lind and Johansson 2007; Hangartner et al. 2012; Richter-Boix et al. 2013; Zellmer 2018). For example, earlier studies in moor frogs (Rana arvalis) in central Sweden identified adaptive phenotypic divergence as well as the transcription factor C/EBP-1 as a candidate gene associated with larval development time, and highlighted the roles of temperature, canopy cover, and breeding time in local adaptation of larval development rate (Richter-Boix et al. 2013, 2015). These studies have demonstrated that amphibian metapopulations are good systems to disentangle the genetic signatures of local natural selection, gene drift, and gene flow. However, studies with a higher density of genetic markers, allowing for detailed analyses of gene flow and adaptation, are lacking at small spatial scales.
In this study, we used double-digest restriction-associated DNA sequencing (ddRAD-seq) to characterize small-scale differentiation among nine R. arvalis populations in a network of wetlands in central Sweden. We investigated the importance of different modes of isolation (geographic distance (IBD), environment (IBE), and landscape resistance (IBR)) in shaping population divergence, and identified candidate loci involved in adaptive divergence. We expected that the influence of geographic distance on population divergence would be minor relative to the influence of landscape features and local larval environment and the latter to shape adaptive divergence in this small-scale system. More specifically, we (1) investigated fine-scale population structure in a landscape with high potential for dispersal and gene flow, (2) partitioned the effects of local larval environment, landscape features, and spatial separation on total genetic variation across the wetland network, conducting model selection to test which scenario fits the data the best, and (3) identified loci under putative divergent selection associated with fine-grained local larval environmental variation.
Methods
Sample collection and characterization of local environments
R. arvalis is a widespread frog occurring from western Europe to western Siberia (Sillero et al. 2014). It is an explosive breeder spawning in permanent and temporary ponds, marshes, and lakes in early spring. In late March–April 2016, we collected eggs from 16 to 20 egg masses (each mass representing a separate family) in each of nine populations in a metapopulation-like network of interconnected wetlands located in Uppland in central Sweden (Fig. 1, Table S1). This system consists of temporary and permanent ponds and marshes in a landscape of agricultural fields and mixed forests (Richter-Boix et al. 2013). The eggs were brought to the laboratory at Uppsala University where they hatched. Tadpoles were raised in a common garden experiment in a separate life-history study (Rödin-Mörch 2019). In short, tadpoles were raised in two different temperature treatments (16 and 19 °C), until they reached metamorphosis (stage 42, Gosner 1960) and were euthanized with an overdose of MS222.
The average distance between the study populations was 24.1 (min 1.1, max 42.6) km. Although the maximum pairwise distance between ponds is higher than the maximum known dispersal distance of the species, this system is characterized by numerous adjacent wetlands to the focal ponds that can facilitate dispersal across the landscape (Richter-Boix et al. 2013). Given these aspects of the current study system in terms of geography and environment, we characterize it as a small-scale system as opposed to microgeographic scales or large environmental gradients. We assessed multiple aspects of the local larval environment (Table S1) in order to analyze IBE. We first measured pH for each wetland at the time of egg laying using a multiparameter device (HANNA instruments). We then measured water temperature for 2 months following egg laying using data loggers (HOBO Water temp Pro v2 Data Logger). The loggers were placed in the immediate vicinity of the area where the eggs were laid and set to record every 15 min. Mean temperature for the entire recording period in each wetland was used in the analyses. In early June, the same person (ARB) measured the percentage of canopy cover in each wetland by visually estimating the amount of nonvisible sky into 10% categories (Korhonen et al. 2006). We estimated the percentage of aquatic vegetation cover along a 15-m transect (Palik et al. 2001) and a predation risk index, the latter being measured by counting macroinvertebrates and newts collected using five standardized dip net sweeps at each of five locations in each wetland (Michel 2011). Finally, we defined breeding time as the number of days from January 1st to the date when the first egg was found in the pond. In R. arvalis, the egg-laying period within a wetland is very short (Richter-Boix et al. 2013), and the date of the first egg is a good proxy of breeding date. In the present study, eggs were laid between 29th of March and 18th of April with a maximum difference of 20 days in breeding time.
To investigate the effect of landscape features on population divergence (IBR), we measured four landscape variables around each of the nine sites by summing the areas of each landscape variable within circular buffer zones at a spatial scale of 2000 m (Table S1), as this is the most common movement distance of amphibians (Smith and Green 2005). The amount of arable land was extracted using topographic vector maps converted to raster format. Mature forest (defined as >200 m3 ha−1) was quantified using k-Nearest-Neighbor-raster (Reese et al. 2003) originally at 25 × 25-m resolution and aggregated to 100 × 100 m by averaging due to low-volume accuracy at the original scale (Gjertsen 2007). The total road length was estimated by summing up the total length of roads within the buffer zone. Additionally, a connectivity index in terms of number and size of nearby wetlands was estimated, using a kernel estimation weighted by distance (smoothing curve of 5 km from each pond) and perimeter of all the surrounding ponds, as vegetated shallow shores are important breeding sites for moor frogs. All landscape variables were processed in ArcMap 10.6 (ArcGIS, ESRI, Redlands, CA, USA).
DNA extraction and ddRAD library prep
To minimize confounding effects of potentially including siblings in the analysis, we used one individual that was picked at random from each egg mass. We extracted DNA from a total of 16–20 individuals from each population (163 individuals in total), from the hind leg or head of metamorphosed individuals using a high salt extraction precipitation protocol with an extra ethanol precipitation step (modified from Paxton et al. 1996). DNA concentration and purity were checked using NanoDrop® 2000 spectrophotometer and Qubit® 3.0 fluorometer Quantitation Kit (Invitrogen™). We prepared a ddRAD-seq library following the protocol by Johansson et al. 2017 (modified from Peterson et al. 2012; Mastretta‐Yanes et al. 2015). DNA was digested for 18 h using the restriction enzyme combination SbfI-HF® and MseI, and digestion was confirmed by gel electrophoresis. We ligated 16 uniquely barcoded (6 bp) P1 adapters and one P2 adapter to the digested DNA. The ligation product was cleaned using AMPure XP beads. For PCR amplification, we used custom Illumina primers containing 12 unique indices, and DNA was amplified using Q5 DNA polymerase (New England Biolabs, Massachusetts, USA). In order to correct for any biased amplification, each individual sample was PCR-amplified in four separate reactions. Samples were then pooled in equimolar concentrations, and amplification success was confirmed using gel electrophoresis. For size selection, we performed agarose gel extraction by cutting out the desired fragments between ~350 and 600 bp, and extracted the product using the QIAquick Minielute Gel Extraction Kit. The final library was then pooled, checked for purity, and the concentration (50.6 ng/μl) was measured using NanoDrop® 2000 spectrophotometer and the Qubit®3.0 fluorometer Quantitation Kit (Invitrogen™). The library was sequenced on four lanes of Illumina 2500 (2 × 125 bp) in high-throughput mode by SciLifeLab, Uppsala, Sweden.
ddRAD bioinformatics
Using process_radtags in STACKS v.2.1 (Catchen et al. 2013), we demultiplexed, cleaned, and filtered the raw reads based on quality and base calling. To reconstruct loci and call SNPs, we ran the individual programs included in denovo_map (cstacks, sstacks, tsv2bam, gstacks, and populations) separately for pair-end reads using starting parameter settings similar to the optimal parameter combinations in Rödin-Mörch et al. (2019). We then changed the values of the parameters -M (the number of mismatches allowed between loci when an individual is being processed) and -n (the number of mismatches allowed when constructing the catalog) by increasing or decreasing the parameter value one step. We tried four different parameter combinations (Table S2) and evaluated which of them yielded the maximum number of variant sites, resulting in selection of the combination -M 4 and -n 3. A final filtering step was performed in the program populations, where we filtered SNPs based on a minor allele frequency of 0.02, present in all nine populations and in at least 70% of the individuals. We only retained the first SNP of each RAD locus.
Population structure
We filtered out SNPs suggested to be under putative divergent selection (described below) to obtain a data set evolving under neutral expectations. All the following analyses were conducted in R v.3.5.2 (R Core Team 2019). To analyze and visualize population structure, we first used discriminant analysis of principal components (DAPC) in the adegenet package v.2.1.1 (Jombart 2008; Jombart and Ahmed 2011) and determined the number of PCs to retain by performing stratified cross-validation. We ran cross-validation using the xvalDapc function for 300 repetitions, keeping default settings. Second, we used the spatially informed package tess3r v.1.1.0 (Caye et al. 2018), which estimates ancestry coefficients using geographically constrained nonnegative matrix factorization to characterize population structure, and evaluates the optimal number of clusters fitting the data based on the cross-validation score. Finally, we estimated global and pairwise FST (Weir and Cockerham 1984) with 95% confidence intervals for the total data set in assigner v.0.4.1 (Gosselin et al. 2016).
Identifying differentiation outliers
In order to identify differentiation outliers, we used pcadapt v.4.0.3 (Luu et al. 2017), which utilizes principal component analysis to detect SNPs most associated with the axes related to population structure. The method estimates Mahalanobis distances between SNP z scores and the first K principal components (PCs) capturing population structure. We ran the initial analysis for ten PCs and evaluated the optimal K PCs to retain for the final analysis by graphically inspecting the per-PC decrease in variance in a scree plot, and the individual clustering using a score plot (Fig. S1). We retained three PCs in order to calculate the test statistic, transformed per-SNP p values into q values, in order to correct for multiple comparisons using the R package qvalue v.2.4.2 (Storey et al. 2019), and filtered the final list of outliers using a false-discovery rate (FDR) of 0.05 in order to correct for multiple comparions.
Identifying environmentally based differentiation in the landscape
We estimated the influence of geographical distance, local environmental, and landscape variables on total genetic variation using redundancy analysis (RDA) and partial redundancy analysis (pRDA) in the R package vegan v.2.5.3 (Oksanen et al. 2019). As RDA does not allow for missing data, we imputed the 2% missing genotypes found in the data set using Random Forest on-the-fly imputation (Tang and Ishwaran 2017) from the randomForestSRC package v.2.7 (Ishwaran and Kogalur 2018) implemented in the grur package v. 0.0.11 (Gosselin 2018). RDA uses a multivariate linear regression with the genotype matrix as a response variable and environmental variables as predictors; a PCA is then performed on the new matrix of fitted values from the regression in order to construct constrained axes summarizing the variation explained by the predictor variables (Legendre and Legendre 1998).
As compared to RDA, pRDA differs in that it removes the influence of some predictor variable on the response matrix when trying to estimate the influence of the remaining predictors; here we control for spatial influence (IBD) on the response matrix by incorporating geographical coordinates. We present the raw r2 values, which for pRDA is referred to as semipartial r2 (Legendre et al. 2011), and adjusted r2 for the models. As a response matrix, we used a transposed matrix of allele abundance in pcadapt format (encoded as 0/1/2) and the predictors consisted of geographical coordinates, local larval environmental, and landscape variables that were scaled and centered for the analyses. We performed RDA on geographic coordinates and pRDA on local larval environmental and landscape variables after testing the correlation between variables to remove any potential multicollinearity, by removing one of the correlated variables if |r| > 0.7 (Table S3 a–c). We tested the significance of each model with the anova.cca function using 999 permutations.
To complement the RDA, we used Maximum-likelihood population-effect (MLPE) mixed models (Clarke et al. 2002). MLPE models analyze the effect of one or several predictor distance matrices on the response distance matrix. The method accounts for the nonindependence between the pairwise measurements in the distance matrices by including population origin as a random effect. We fitted six different MLPE models in lme4 (Bates et al. 2015) using the Euclidean distance between individuals in the same genotype matrix as used for the RDA as a response variable. We included Euclidean geographic, local larval environmental, and landscape distance matrices as fixed effects, either separately testing for IBD, IBE, and IBR, but also in combination two by two (IBD and IBE, IBD and IBR, and IBE and IBR), testing their individual effects combined with their interaction. We included population as a random effect where individuals were implicitly nested within each population. We ran the models with the setting REML = FALSE, as this yields an unbiased estimate of Akaike’s information criterion (AIC) for model comparison (Clarke et al. 2002). We compared each model testing IBD, IBE, and IBR, as well as the best-ranked models based on AIC with a null model just containing the intercept term and the population-specific random effects using the Anova function in the car package (Fox and Weisberg 2019). We used the function r2 from the performance package (Lüdecke et al. 2020) to estimate the amount of variance explained by the fixed effects only (marginal R2) and the variance explained by the fixed and random effects together (conditional R2).
Gene–environment association (GEA)
We used two different gene–environment association (GEA) approaches to identify candidates for adaptive divergence using the imputed SNP data. First, we used the univariate method latent factor mixed models (LFMM) implemented in the LEA package v.1.99.2 (Frichot et al. 2013; Frichot, François 2015). LFMM identifies associations between individual SNPs and environmental variables that are included as fixed effects, while at the same time modeling any underlying population structure included as a latent or hidden factor to correct for its influence. We ran LFMM assuming six latent factors based on graphically evaluating the cross-entropy scores, as well as the number of clusters identified by tess3r and DAPC, and the number of environmental variables included (see below). The Gibbs sampling algorithm was initialized with a burn-in of 50,000 iterations followed by 200,000 main iterations for each scaled and centered environmental variable for 30 independent runs. We then combined the resulting z scores across runs from which the median was used. The p values were then calibrated using the genomic inflation factor to account for population structure, and corrected for multiple comparisons using an FDR of 0.05 calculated as above. Second, we followed the approach described in Capblancq et al. (2018) to identify SNPs associated with the local larval environment, and that of Forester et al. (2018) for pre- and postprocessing of data. The association method combines RDA and the outlier detection approach implemented in pcadapt described above, and has the ability to detect multilocus selection imposed by the combined effect of multiple environmental gradients. We used the pRDA model described above, where we removed the spatial influence on the genetic response matrix. We then calculated the Mahalanobis distance for the loci loading on the six retained constrained axes, and adjusted the p values using an FDR of 0.05 calculated as above.
SNP annotation
We conducted BLAST searches with the RAD tags containing the SNPs putatively under divergent selection against the nonredundant protein database (blastx) and nucleotide database (dc-megablast). A match was considered and retained if it passed an e-value threshold of <10−5 (BLASTX) or <10−20 (dc-megablast) and had at least ~70% query coverage and similarity. We retrieved gene ontology terms and performed enrichment analysis in BLAST2GO using Fisher’s exact tests and an FDR of 0.05 using BLAST2GO (Conesa et al. 2005).
Results
Sequencing, ddRAD bioinformatics, and population summary statistics
We obtained 746.2 M reads from four lanes of Illumina sequencing. After quality control, running every individual component of the de novo pipeline separately and filtering, we obtained a total of 5,915,415 sites (average: 30,888 RAD tags, mean length: 191.51 bp, average depth of coverage: 34.4×). This resulted in a final data set of 16,707 SNP loci across populations with the number of polymorphic sites within populations ranging from 14,580 to 15,460. Expected heterozygosity (HE), nucleotide diversity (π), and inbreeding coefficients (FIS) ranged between 0.277 and 0.289, 0.286 and 0.297, and 0.016 and 0.050, respectively (Table S4).
Population structure
Estimating FST revealed overall low levels of genetic differentiation (global FST = 0.0277, 95% CI 0.0274–0.0282). Pairwise FST ranged between 0.0111 and 0.0417 (mean = 0.02735, Table S5), with a cluster of three populations (P2, P3, and P10) showing elevated pairwise FST values compared to the rest of the populations (Fig. 2a). Using only the neutral data, after running cross-validation retaining 40 PCs with the lowest MSE and eight discriminant functions, the DAPC revealed the presence of five to six somewhat discrete clusters (Fig. 2b). The first axis separated P1–P4 and P10 from P14, P18, P23, and P26. The second axis further separated the populations, most notably P2 and P3 from P1, P4, and P10. Using tess3r and neutral SNPs, the cross-validation scores calculated for each K-ancestral populations showed a relatively steep decline with increasing K and a hint of a plateau at around K6–8, signifying the optimal K (Fig. S2). Each wetland represents to a certain extent a discrete cluster with the exception of populations very close to each other (P18–P26: 1.13 km; P2–P3: 1.59 km, but where P2 and P3 start to separate at K7 and K8, Fig. 2c).
Identifying environmentally based differentiation in the landscape
The RDA, including only geographical coordinates, showed a significant effect of spatial separation on total genetic variation, explaining 0.8% of the variation (F2,158 = 1.664, p < 0.001), indicating IBD. Using MLPE, we found that geographic distance explained less variation than in the RDA, although still a significant proportion (0.2%, χ2 = 26.58, Df = 1, p < 0.001). In order to account for the possible confounding effects of spatial autocorrelation when evaluating the effect of the local larval environment, we ran a pRDA controlling for IBD. We found a significant effect of the local larval environment on total genetic variation (2.3% of the variation explained, Table 1), indicating IBE (F6,152 = 1.624, p < 0.001). This would indicate that the impact of spatial correlation was relatively low relative to the influence of the local larval environment. This was supported by the MLPE models where the local larval environmental distance matrix explained 1.2% of the variation (χ2 = 141.98, Df = 1, p < 0.001); this was however less variation explained than by the pRDA. For the pRDA, the two major axes of variation showed clear environmentally based population clustering (variance explained: RDA1 = 25%, RDA2 = 18%, Fig. 3a). The first axis differentiates populations inhabiting ponds with more aquatic vegetation and later breeding time from ponds with higher predation risk and canopy cover. The second axis separates populations from warmer ponds with later breeding from those with higher pH. The pRDA also found a significant effect of landscape features (roads, amount of forest, arable land, and pond connectivity) on total genetic variation (F4,156 = 1.68, p < 0.001), but this explained slightly less of the total genetic variation (1.8%, Table 1) than the local larval environment. This indicates an additional level of IBE, serving as a coarse estimate of IBR. For the first two axes (33 and 25% of the variance explained), we observed clear landscape-influenced population structure. The first axis differentiates populations from ponds surrounded by the highest amount of roads and arable land from populations surrounded by more forest (Fig. 3b). However unlike the RDA, the results from the MLPE models reveal a larger effect of landscape distance compared to local larval environment distance (2% of the variance explained, χ2 = 209.33, Df = 1, p < 0.001). Furthermore, we conducted model selection for the MLPE models and found all three models representing IBD, IBE, and IBR, as well as the more complex models including two fixed effects showing a better fit than the null model (Table 2). Based on the AIC score, we found that the IBR model fits the data slightly better than the strict IBE model. However, two of the three models containing two fixed effects fit the data better than any of the simpler IBD, IBE, and IBR models. The best-fitting model based on AIC included the geographic and local larval environmental distance matrices as well as their interaction (3.6% of the variance explained, IBD: χ2 = 0.363, Df = 1, p = 0.55, IBE: χ2 = 118.97, Df = 1, p < 0.001, and IBE × IBD: χ2 = 304.53, Df = 1, p < 0.001). The second best model included landscape and local larval environmental distance matrices and their interaction (3.5% of the variance explained, IBE: χ2 = 2.07, Df = 1, p = 0.15, IBR: χ2 = 67.47, Df = 1, p < 0.001, and IBE × IBR: χ2 = 205.84, Df = 1, p < 0.001).
Table 1.
RDA | Spatial | Local environment | Landscape |
---|---|---|---|
r2 | 0.026 | 0.059 | 0.042 |
Adjusted r2 | 0.008 | 0.023 | 0.018 |
Table 2.
Conditional R2 | Marginal R2 | Df | AIC | |
---|---|---|---|---|
Null | 0.096 | 0 | 3 | 63193.7 |
IBD | 0.088 | 0.002 | 4 | 63169.1 |
IBE | 0.074 | 0.012 | 4 | 63054.9 |
IBR | 0.075 | 0.020 | 4 | 62989.0 |
IBD and IBE | 0.080 | 0.036 | 6 | 62757.6 |
IBD and IBR | 0.068 | 0.025 | 6 | 62908.6 |
IBE and IBR | 0.077 | 0.035 | 6 | 62786.7 |
Conditional R2 here refers to the proportion of variance explained by the fixed and random effects together, and marginal R2 refers to the variance explained by the fixed effects only.
Identifying differentiation outliers and GEA
We identified 29 differentiation outliers using pcadapt (Fig. S3), the low number of outliers reflecting the overall low level of population differentiation. The univariate GEA with LFMM identified 110 SNPs associated with aquatic vegetation cover (Fig. S4a), 109 with canopy cover (Fig. S4b), 76 with pH (Fig. S4c), 133 with predation risk index (Fig. S4d), 49 with mean temperature (Fig. S4e), and 153 with breeding time (Fig. S4f). Some of the SNPs identified overlapped between environmental variables, LFMM in total identifying 547 candidates. Furthermore, using the pRDA-based GEA approach, we identified in total 576 candidate SNPs associated with the environmental variables. From these, 60 showed the highest correlation with aquatic vegetation cover (|r| = 0.066–0.307), 150 with canopy cover (|r| = 0.056–0.379), 94 with pH at breeding (|r| = 0.080–0.448), 68 with predation risk index (|r| = 0.092–0.376), 127 with mean temperature (|r| = 0.093–0.363), and 77 with breeding time (|r| = 0.075–0.356). The two methods only converged on 123 SNPs, and in total, we identified 1000 candidates putatively under environmentally mediated divergent selection.
SNP annotation
None of the candidates identified by pcadapt could be matched to any known genes or proteins by querying the blastx and megablast databases. For the SNPs associated with local larval environment, the RAD tags containing those SNPs matched 32 genes or genomic regions (Table S6) and 21 proteins, eight of these matching unknown or hypothetical proteins (Table S7). Hits of particular interest for the local larval environment-associated SNPs were the immune genes MHC class 1a, brevinin-2-CE, palustrin-2-CE, TRIM25, and NLRP3, as well as genes potentially related to development and stress, such as ITGB1, Mab21, DAGLB, and TRH, and a protein similar to bifunctional apoptosis regulator. Due to the relatively few hits, no significant GO enrichment could be retrieved.
Discussion
In this study, we attempted to disentangle the relative influence of different layers of the environment on putative adaptive genomic divergence in a system where we have earlier shown strong phenotypic divergence and ample opportunity for dispersal and gene flow across the landscape (Richter-Boix et al. 2013, 2015). Over the past decade, accumulating studies have indicated that geographic distance are geographical barriers that are not the only factors affecting gene flow and thereby population differentiation, and environmental differences may be the key factor underlying effective migration (Shafer, Wolf 2013; Sexton et al. 2014). Using a ddRAD-seq approach, we identified genomic variation that we posit is associated with adaptive divergence in the R. arvalis populations, and detected fine-scale population structure and differentiation. Our results suggest that small-scale adaptive divergence in R. arvalis is mainly driven by the local larval environment, but where the combined effect between local environmental and spatial and landscape features is also very influential. We found that, much like what would be expected (Meirmans 2015), the effect size of local larval environment on total genetic variation was relatively small, which was also true for landscape features and geographic separation. Nevertheless, larval environment explained more total genetic variation than landscape features or geographic distance, the latter explaining the smallest amount of variation. These results suggest that selection is efficient enough to counteract the potential negative effects of gene flow and genetic drift on adaptive divergence in this small-scale system. We identified a number of genes involved in immunity, development, and stress associated with local larval environmental variables, and might thus potentially be involved in adaptive divergence. Our results agree with previous studies that have shown the importance of thermal environment, canopy cover, time constraints, predation, desiccation risk, and pH as important selective agents for larval amphibians (e.g., Skelly 2004; Lind and Johansson 2007; Hangartner et al. 2012; Richter-Boix et al. 2013, 2015; Nunes et al. 2014; Van Buskirk J 2014).
Population structure
In spite of the low levels of population differentiation observed in the landscape, we were able to identify somewhat discrete clusters in the landscape. The tess3r analysis revealed that there are between six and eight population clusters in the landscape. At these levels, most populations represented clusters with various degrees of admixture, lending further support for the occurrence of gene flow. When populations are grouped into an increasing number of clusters, more apparent substructure reveals itself. The DAPC analysis supported five to six somewhat discrete clusters, corresponding to the positions of populations in the landscape. Here, the first axis represented the component explaining a large part of the spatial and landscape variation based on the position of the populations in the landscape, whereas separation along the second axis mainly reflected local variation and smaller-scale dispersal barriers. Taken together, we found evidence of neutral population structure in spite of the assumed high gene flow. However, the absence of very clear population boundaries, and increased admixture among some of the population, could possibly indicate asymmetrical patterns of gene flow in the landscape.
Identifying environmentally based differentiation in the landscape
Over the past few years, numerous studies have presented evidence for small-scale divergence between habitats without physical barriers in organisms from plants and insects to fish and amphibians (Skelly 2004; Leimu and Fischer 2008; Fraser et al. 2011; Watanabe et al. 2014). In the present study, we found that a variety of mechanisms influenced population structure in our study system, identifying three distinct ways. First, as in a previous study using five microsatelite loci involving some of the same populations as the present study (Richter-Boix et al. 2013), we found evidence of differentiation caused by the local larval environment (IBE), detectable as larger allele-frequency differences among populations inhabiting more contrasting environments. The overall FST in the network of wetlands was low, as expected, with moderate-to-high levels of gene flow. However, using FST as a direct proxy for gene flow without additional information is problematic, giving only indications that gene flow might be occurring (Whitlock and McCauley 1999). In the present system, the level of connectivity with ponds surrounding our focal ponds suggests that low FST is the result of high gene flow (Richter-Boix et al. 2013). Local selection would have to be strong to remove migrants to generate divergence between the ponds. Environmentally induced or genotype-dependent dispersal preferences could plausibly generate the observed genetic divergence between different breeding environments. Biased dispersal is an effective cause of reproductive isolation that can reduce migration rates and gene flow over small spatial scales (Bolnick and Otto 2013). Divergent habitat choice has been documented experimentally in sticklebacks (Bolnick et al. 2009), and in amphibians, site fidelity (Semlitsch 2008) as well as breeding pond selection in response to the presence of predators and competitors (Buxton et al. 2017) are well known. The environmental variable contributing most to the IBE pattern on the first two RDA axes was temperature, closely followed by breeding time, predation risk, pH, canopy cover, and aquatic vegetation, differentiating populations into four different clusters along the axis. As opposed to a linear environmental gradient such as temperature, these results highlight the multivariate effect of the local larval environment shaping IBE in this system. The results from the MLPE model selection further corroborate the importance of the local larval environment in shaping divergence in the landscape.
Second, we found that connectivity was affected by landscape complexity, showing a pattern of IBR. Earlier studies demonstrated that natural and anthropogenic landscape features contribute to the population structure in urodeles and anurans (Vos et al. 2001; Lenhardt et al. 2017; Homola et al. 2019). Different urban landscape elements are capable of influencing connectivity in metapopulation-structured species (Graham et al. 2017), and earlier studies on R. arvalis showed that geographical distance and density of roads were best predictors of population structure (Vos et al. 2001; Arens et al. 2007). In our system, the strongest landscape predictors influencing total genetic variation were the total length of roads within 2000 m of the focal pond and the amount of forest surrounding the ponds. Our results are in line with observations that large-scale landscape features can cause population differentiation, depending on the level of human disturbance and its influence on connectivity in the landscape (Storfer et al. 2007; Safner et al. 2011; Youngquist et al. 2017). MLPE results suggested that landscape distance (IBR) fits the data better than local larval environmental distance (IBE), explaining slightly more of genetic variation than what we found in pRDA models. However, R2 in MLPE models has a tendency to increase with model complexity (Row et al. 2017), and R2 values from the more complex models should therefore perhaps be interpreted with some caution. More specifically, an explanation for the discrepancies between the different types of models could be the relative influence of the random effects on local larval environmental and landscape distance. Indeed, a comparison of the conditional R2 accounting for both the fixed and random effects in the IBE and IBR models shows roughly the same variance explained, suggesting that Euclidean local larval environmental distance is more specific to the individual populations than landscape distance. This interpretation is further strengthened by the RDA results: we found a higher proportion of genetic variation explained by the local larval environment than the landscape features when we corrected for the influence of geographical separation, as opposed to population origin in the MLPE models. Furthermore, the best-fitting MLPE model includes both larval environmental and geographic distances as well as their interaction, suggesting that environmental variation and allele frequencies are not distributed homogeneously in the landscape, but are, to a certain extent, spatially structured, which contributes to the divergence.
The third effect we detected was the small influence of geographic distance shaping genetic differentiation. Previous works in the same system found no signature of IBD or IBR, possibly as a result of lower genetic resolution (five microsatellites). Using a larger number of markers, we found that geographical distance explained a small but significant amount of genetic variation, demonstrating that population structure can be more clearly resolved with RAD sequencing than with microsatellites (Rašić et al. 2014; Vendrami et al. 2017), the increased number of markers giving us more power to detect subtle effects of spatial separation. The spatial genetic autocorrelation signature expected under IBD has been observed in other amphibian species at similar spatial scales (Johansson et al. 2005; Zamudio and Wieczorek 2007; Zellmer 2018).
Identifying differentiation outliers and GEA
Given the relatively low levels of population differentiation in the landscape, the low number of differentiation outliers found by pcadapt in comparison to the number of candidate SNPs found associated with the local larval environment was expected. Using the two methods based on associations between SNPs and local larval environmental variation (LFMM and pRDA), we identified a much larger number of candidate SNPs under putative divergent selection. These methods identified a similar number of outlier SNPs, but they only converged on 123 SNPs. This however might be explained by the differences in analytical strength between multivariate and univariate analyses. The highest number of candidates identified by LFMM was for breeding time followed by predation risk index, aquatic vegetation, canopy cover, pH, and mean temperature. The highest numbers of the candidate SNPs obtained by pRDA-based analyses were correlated with canopy cover, followed by mean temperature, pH, breeding time, predation risk index, and aquatic vegetation. The ranking of the relative importance of the environmental variables could also highlight the discrepancy between univariate and multivariate methods used in this study, where associations found at the additional dimensions in the pRDA are potentially lacking in LFMM. Their importance can also be further explained by the relationship between the variables that might more easily be detected with a multivariate approach. Ponds with high forest canopy have more emergent vegetation, lower temperature, and relatively few predators, while open ponds in human landscapes have little canopy cover, few emergent vegetation, higher temperature, and more predators (Richter-Boix et al. 2013). The genome size of R. arvalis is estimated to be ~6 Gb (Vinogradov 1998) and attempts to identify loci under selection in reduced-representation sequencing studies, where the study organism has a large genome, most often suffering from limited power to detect any larger proportion of causal loci. Consequently, our results should be treated with some caution as they likely only represent a relatively smaller part of the whole picture. However, any identified candidate can be compared in studies covering the whole genome as the latter becomes more affordable.
We annotated 4.3% of the SNPs associated with the local larval environment, and none of the differentiation outliers, to known genes and proteins using BLAST. These low numbers reflect the difficulty to obtain functional and biological annotation for nonmodel organisms (Yandell and Ence 2012), and amphibians in particular, given the general scarcity of genomic resources in this group (but see, e.g., Hellsten et al. 2010; Sun et al. 2015; Session et al. 2016; Hammond et al. 2017). Our BLAST hits matched known genes and proteins that had a relatively wide variety of biological functions (Tables S6, S7), suggesting fine-grained and multifaceted selection acting among, as well as within populations. However, there is a potential risk of overinterpreting candidate gene hits from various genome scan methods, in particular in reduced-representation sequencing studies like ours where only a small part of the genome has been covered, and the number of BLAST hits are relatively few. With this in mind, we discuss a few hits below in more detail that align with genes of importance in amphibian development and immune response, as well as hits we have identified to be under putative divergent selection in previous studies.
We obtained two matches with a possible function related to larval development. The first was bifunctional apoptosis regulator most correlated with mean temperature (r = −0.156), which we have previously identified to be under putative divergent selection along a 1700-km latitudinal gradient (Rödin-Mörch et al. 2019). While the exact function of this gene in amphibians is uncertain, the product of this gene suppresses cell death in mammals (Zhang et al. 2000). Common garden studies have revealed large among-population differences in larval life histories (development rate and mass at metamorphosis) in the present system (Richter-Boix et al. 2013, 2015) as well as along the latitudinal gradient (Luquet et al. 2019; Rödin-Mörch 2019). It seems plausible that the regulation of cell death is related to the development rate as amphibian metamorphosis involves large-scale body restructuring, and thyroid hormone-induced cell death plays a major role in this process (Nakajima et al. 2005). We also matched two different RAD tags containing SNPs with thyrotropin-releasing hormone receptor 1 (TRHR1), mostly correlated with breeding time and pH (r = 0.095 and 0.141). Thyrotropin-releasing hormone stimulates the secretion of growth hormone and prolactine (Denver 1997; Galas et al. 2009), which is known to inhibit amphibian metamorphosis by promoting growth (Brown and Frye 1969; Galas et al. 2009).
The second candidate that we also found along a latitudinal gradient was brevinin-2-CE that is part of the innate immune system. This gene codes for an antimicrobial peptide (AMP) secreted in the skin of amphibians, protecting against bacterial, viral, and fungal pathogens (Rollins-Smith 2009). The associated SNP was also most correlated with mean temperature (r = −0.275). We also obtained a match to another AMP gene palustrin-2-CE, which was most correlated to pH (r = 0.067). AMPs protect anurans against pathogens, such as the chytrid fungus Batrachochytridium dendrobatides (Bd) and ranavirus (Rollins-Smith and Conlon 2005; Rollins-Smith 2009). Previous studies have found that Bd prevalence in R. arvalis is moderately high in this area (Meurling et al. 2020), but studies on ranavirus are lacking in Sweden. Furthermore, we identified matches with additional genes related to immune response, such as the major histocompatibility complex class I (MHC class I) correlated with the amount of aquatic vegetation (r = −0.106), MHC class II correlated with predation risk index (r = 0.249), and TRIM25 and NLRP3 (inflammasome) correlated with mean temperature (r = −0.144 and r = 0.252, respectively). Previous studies have revealed complex interactions between local larval environmental characteristics and disease: depending on parasite type, aspects of the local larval environment, such as abundance of predators and canopy cover, can have an effect on disease risk (Holt and Roy 2007; Becker et al. 2012).
Conclusions
In recent years, landscape genomics studies have started to identify the relative effects of the environment, the landscape and distance, in limiting gene flow and shaping genetic differentiation. Here we document modest but significant patterns of IBE, IBR, and IBD between amphibian populations in a small-scale network of breeding ponds. We find that IBE, either alone or in combination with that of the others, is larger than IBR and IBD, supporting the long-standing view of pond-level local adaptation of amphibian populations. We identify outlier SNPs under putative divergent selection associated with specific aspects of the local environment, those SNPs being involved in developmental processes and the immune system. Our analyses show that regardless of ongoing gene flow populations, local adaptation can lead to genomic divergence, and showcase the complexity of different factors acting at different scales in determining these patterns of divergence.
Supplementary information
Acknowledgements
We thank Gunilla Engström for invaluable help in the lab. The eggs were collected with permits from the Uppsala County board. The tadpoles were raised with permit C40/14 from the Ethical committee for animal experiments in Uppsala County. The sequencing was performed by the SNP&SEQ Technology Platform in Uppsala. This facility is part of the National Genomics Infrastructure (NGI) Sweden and Science for Life laboratory. The SNP&SEQ Platform is supported by the Swedish Research Council and the Knut and Alice Wallenberg Foundation. Uppsala Multidisciplinary Center for Advanced Computational Science provided computing resources. This study was financially supported by the Swedish Research Council (project 621-2013-4503 to AL), Zoologiska stiftelsen (PRM and MCC), and Carl Tryggers stiftelse för vetenskaplig forskning (SK).
Data availability
Sequencing raw data is accessible at NCBI SRA under BioProject accession number: PRJNA660937. Relevant input and output files and analysis scripts are accessible at Dryad (10.5061/dryad.b5mkkwhbc).
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.
Footnotes
Associate editor: Rowan Barrett
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version of this article (10.1038/s41437-020-00371-z) contains supplementary material, which is available to authorized users.
References
- Aguillon SM, Fitzpatrick JW, Bowman R, Schoech SJ, Clark AG, Coop G, et al. Deconstructing isolation-by-distance: the genomic consequences of limited dispersal. PLOS Genet. 2017;13:e1006911. doi: 10.1371/journal.pgen.1006911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arens P, van der Sluis T, van’t Westende WPC, Vosman B, Vos CC, Smulders MJM. Genetic population differentiation and connectivity among fragmented Moor frog (Rana arvalis) populations in The Netherlands. Landsc Ecol. 2007;22:1489–1500. [Google Scholar]
- Bachmann JC, van Rensburg AJ, Cortazar-Chinarro M, Laurila A, Buskirk JV. Gene Flow Limits Adaptation along Steep Environmental Gradients. The American Naturalist. 2020;195:E67–E86. doi: 10.1086/707209. [DOI] [PubMed] [Google Scholar]
- Balkau BJ, Feldman MW. Selection for migration modification. Genetics. 1973;74:171–174. doi: 10.1093/genetics/74.1.171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barrett RDH, Schluter D. Adaptation from standing genetic variation. Trends Ecol Evol. 2008;23:38–44. doi: 10.1016/j.tree.2007.09.008. [DOI] [PubMed] [Google Scholar]
- Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67:1–48.. [Google Scholar]
- Becker CG, Rodriguez D, Longo AV, Talaba AL, Zamudio KR. Disease risk in temperate amphibian populations is higher at closed-canopy sites. PLOS ONE. 2012;7:e48205. doi: 10.1371/journal.pone.0048205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolnick D, Otto S. The magnitude of local adaptation under genotype-dependent dispersal. Ecol Evol. 2013;3:4722–4735. doi: 10.1002/ece3.850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolnick DI, Snowberg LK, Patenia C, Stutz WE, Ingram T, Lau OL. Phenotype-dependent native habitat preference facilitates divergence between parapatric lake and stream stickleback. Evolution. 2009;63:2004–2016. doi: 10.1111/j.1558-5646.2009.00699.x. [DOI] [PubMed] [Google Scholar]
- Brown PS, Frye BE. Effects of prolactin and growth hormone on growth and metamorphosis of tadpoles of the frog, Rana pipiens. Gen Comp Endocrinol. 1969;13:126–138. doi: 10.1016/0016-6480(69)90229-9. [DOI] [PubMed] [Google Scholar]
- Buxton VL, Ward MP, Sperry JH. Frog breeding pond selection in response to predators and conspecific cues. Ethology. 2017;123:397–404. [Google Scholar]
- Campbell-Staton SC, Cheviron ZA, Rochette N, Catchen J, Losos JB, Edwards SV. Winter storms drive rapid phenotypic, regulatory, and genomic shifts in the green anole lizard. Science. 2017;357:495–498. doi: 10.1126/science.aam5512. [DOI] [PubMed] [Google Scholar]
- Capblancq T, Luu K, Blum MGB, Bazin E. Evaluation of redundancy analysis to identify signatures of local adaptation. Mol Ecol Resour. 2018;18:1223–1233. doi: 10.1111/1755-0998.12906. [DOI] [PubMed] [Google Scholar]
- Catchen J, Hohenlohe PA, Bassham S, Amores A, Cresko WA. Stacks: an analysis tool set for population genomics. Mol Ecol. 2013;22:3124–3140. doi: 10.1111/mec.12354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caye K, Jay F, Michel O, François O. Fast inference of individual admixture coefficients using geographic data. Ann Appl Stat. 2018;12:586–608. [Google Scholar]
- Clarke RT, Rothery P, Raybould AF. Confidence limits for regression relationships between distance matrices: estimating gene flow with distance. JABES. 2002;7:361. [Google Scholar]
- Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21:3674–3676. doi: 10.1093/bioinformatics/bti610. [DOI] [PubMed] [Google Scholar]
- Delph LF. The study of local adaptation: a thriving field of research. J Heredity. 2018;1:1–2. doi: 10.1093/jhered/esx099. [DOI] [PubMed] [Google Scholar]
- Denver RJ. Proximate mechanisms of phenotypic plasticity in amphibian metamorphosis. Integr Comp Biol. 1997;37:172–184. [Google Scholar]
- Dyer RJ, Chan DM, Gardiakos VA, Meadows CA. Pollination graphs: Quantifying pollen pool covariance networks and the influence of intervening landscapes on genetic connectivity in the North American understory tree, Cornus florida. Landsc Ecol. 2012;27:239–251. [Google Scholar]
- Forester BR, Lasky JR, Wagner HH, Urban DL. Comparing methods for detecting multilocus adaptation with multivariate genotype–environment associations. Mol Ecol. 2018;27:2215–2233. doi: 10.1111/mec.14584. [DOI] [PubMed] [Google Scholar]
- Fox F, Weisberg S (2019) An R Companion to Applied Regression, Third Edition. Sage, Thousand Oaks CA, https://socialsciences.mcmaster.ca/jfox/Books/Companion/
- Fraser DJ, Weir L, Bernatchez L, Hansen MM, Taylor EB. Extent and scale of local adaptation in salmonid fishes: review and meta-analysis. Heredity. 2011;106:4040–4420. doi: 10.1038/hdy.2010.167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frichot E, François O. LEA: An R package for landscape and ecological association studies. Methods Ecol Evol. 2015;6:925–929. [Google Scholar]
- Frichot E, Schoville SD, Bouchard G, François O. Testing for associations between loci and environmental gradients using latent factor mixed models. Mol Biol Evol. 2013;30:1687–1699. doi: 10.1093/molbev/mst063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galas L, Raoult E, Tonon M-C, Okada R, Jenks BG, Castaño JP et al. (2009) TRH acts as a multifunctional hypophysiotropic factor in vertebrates. Gen Comp Endocrinol 164:40–50 [DOI] [PubMed]
- Gjertsen AK. Accuracy of forest mapping based on Landsat TM data and a kNN-based method. Remote Sens Environ. 2007;110:420–430. [Google Scholar]
- Gosselin T (2018) grur: an R package tailored for RADseq data imputations. R package version 0.0.10. https://github.com/thierrygosselin/grur. 10.5281/zenodo.496176
- Gosselin T, Anderson EC, Bradbury I (2016) assigner: Assignment Analysis with GBS/RAD Data using R. R package version 0.4.1. https://github.com/thierrygosselin/assigner. 10.5281/zenodo.51453
- Gosner KL. A simplified table for staging anuran embryos and larvae with notes on identification. Herpetologica. 1960;16:183–190. [Google Scholar]
- Graham LJ, Haines-Young RH, Field R. Metapopulation modeling of long-term urban habitat-loss scenarios. Landsc Ecol. 2017;32:989–1003. doi: 10.1007/s10980-017-0504-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haldane JBS. A mathematical theory of natural and artificial selection. (Part VI, Isolation.) Math Proc Camb Philos Soc. 1930;26:220–230. [Google Scholar]
- Hammond SA, Warren RL, Vandervalk BP, Kucuk E, Khan H, Gibb EA, et al. The North American bullfrog draft genome provides insight into hormonal regulation of long noncoding RNA. Nature. Communications. 2017;8:1433. doi: 10.1038/s41467-017-01316-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hangartner S, Laurila A, Räsänen K. Adaptive divergence in Moor Frog (Rana arvalis) populations along an acidification gradient: inferences from Qst–Fst correlations. Evolution. 2012;66:867–881. doi: 10.1111/j.1558-5646.2011.01472.x. [DOI] [PubMed] [Google Scholar]
- Hanski I, Mononen T, Ovaskainen O. Eco-evolutionary metapopulation dynamics ant the spatial scale of adaptation. Am Naturalist. 2011;177:29–43. doi: 10.1086/657625. [DOI] [PubMed] [Google Scholar]
- Hedrick PW. Adaptive introgression in animals: examples and comparison to new mutation and standing variation as sources of adaptive variation. Mol Ecol. 2013;22:4606–4618. doi: 10.1111/mec.12415. [DOI] [PubMed] [Google Scholar]
- Hellsten U, Harland RM, Gilchrist MJ, Hendrix D, Jurka J, Kapitonov V, et al. The genome of the western clawed frog Xenopus tropicalis. Science. 2010;328:633–636. doi: 10.1126/science.1183670. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hemmer-Hansen J, Nielsen EE, Therkidsen NO, Taylor MI, Ogden R, Geffen AJ, et al. A genomic island linked to ecotype divergence in Atlantic cod. Mol Ecol. 2013;22:2653–2667. doi: 10.1111/mec.12284. [DOI] [PubMed] [Google Scholar]
- Hendry AP, Day T. Population structure attributable to reproductive time: isolation by time and adaptation by time. Mol Ecol. 2005;14:901–916. doi: 10.1111/j.1365-294X.2005.02480.x. [DOI] [PubMed] [Google Scholar]
- Hoban S, Kelley JL, Lottherhos KE, Antolin MF, Bradburd G, Lowry DB, et al. Finding the genomic basis of local adaptation: pitfalls, practical solutions, and future directions. Am Naturalist. 2016;188:379–397. doi: 10.1086/688018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Holt RD, Roy M. Predation Can increase the prevalence of infectious disease. Am Naturalist. 2007;169:690–699. doi: 10.1086/513188. [DOI] [PubMed] [Google Scholar]
- Homola JJ, Loftin CS, Kinnison MT. Landscape genetics reveals unique and shared effects of urbanization for sympatric pool-breeding amphibians. Ecol Evol. 2019;9:17799–17823. doi: 10.1002/ece3.5685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ishwaran H, Kogalur UB (2018) Fast unified random forests for survival, regression, and classification (RF-SRC), R package version 2.7.0
- Ismail SA, Kokko H. An analysis of mating biases in trees. Mol Ecol. 2019;29:1–15. doi: 10.1111/mec.15312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jenkins DG, Carey M, Czerniewska J, Fletchet J, Hether T, Jones A, et al. A meta-analysis of isolation by distance: relic or reference standard for landscape genetics? Ecography. 2010;33:315–320. [Google Scholar]
- Johansson F, Halvarsson P, Mikolajewski DJ, Höglund J. Genetic differentiation in the boreal dragonfly Leucorrhinia dubia in the Palearctic region. Biol J Linn Soc. 2017;121:294–304. [Google Scholar]
- Johansson M, Primmer CR, Sahlsten J, Merila J. The influence of landscape structure on occurrence, abundance and genetic diversity of the common frog, Rana temporaria. Global Change Biol. 2005;11:1664–1679. [Google Scholar]
- Jombart T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24:1403–1405. doi: 10.1093/bioinformatics/btn129. [DOI] [PubMed] [Google Scholar]
- Jombart T, Ahmed I. adegenet 1.3-1: new tools for the analysis of genome-wide SNP data. Bioinformatics. 2011;27:3070–3071. doi: 10.1093/bioinformatics/btr521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kawecki TJ, Ebert D. Conceptual issues in local adaptation. Ecol Lett. 2004;7:1225–1241. [Google Scholar]
- Korhonen L, Korhonen K, Rautiainen M, Stenberg P. Estimation of forest canopy cover: a comparison of field measurement techniques. Silva Fennica. 2006;40:577–588. [Google Scholar]
- Legendre L, Legendre P (1998) Numerical ecology. Second English edition. Elsevier Science BV, Amsterdam, The Netherlands
- Legendre P, Oksanen J, ter Braak CJF. Testing the significance of canonical axes in redundancy analysis. Methods Ecol Evol. 2011;2:269–277. [Google Scholar]
- Leinonen T, McCairns RJS, O’Hara RB, Merilä J. QST–FST comparisons: evolutionary and ecological insights from genomic heterogeneity. Nat Rev Genet. 2013;14:179–190. doi: 10.1038/nrg3395. [DOI] [PubMed] [Google Scholar]
- Leimu R, Fischer M. A meta-analysis of local adaptation in plants. PLoS ONE. 2008;3:e4010. doi: 10.1371/journal.pone.0004010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lenormand T. Gene flow and the limits to natural selection. Trends Ecol Evol. 2002;17:183–189. [Google Scholar]
- Lenhardt PP, Brühl CA, Leeb C, Theissinger K. Amphibian population genetics in agricultural landscapes: does viniculture drive the population structuring of the European common frog (Rana temporaria)? PeerL. 2017;5:e3520. doi: 10.7717/peerj.3520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lind MI, Johansson F. The degree of adaptive phenotypic plasticity is correlated with the spatial environmental heterogeneity experienced by island populations of Rana temporaria. J Evolut Biol. 2007;20:1288–1297. doi: 10.1111/j.1420-9101.2007.01353.x. [DOI] [PubMed] [Google Scholar]
- Lüdecke D, Makowski D, Waggoner P, Patil I (2020) performance: Assessment of regression models performance. R package version 0.4.6. https://CRAN.R-project.org/package=performance
- Luquet E, Mörch PR, Cortázar‐Chinarro M, Meyer‐Lucht Y, Höglund J, Laurila A. Post‐glacial colonization routes coincide with a life‐history breakpoint along a latitudinal gradient. J Evol Biol. 2019;32:356–368. doi: 10.1111/jeb.13419. [DOI] [PubMed] [Google Scholar]
- Luu K, Bazin E, Blum MGB. pcadapt: an R package to perform genome scans for selection based on principal component analysis. Mol Ecol Resour. 2017;17:67–77. doi: 10.1111/1755-0998.12592. [DOI] [PubMed] [Google Scholar]
- Maes GE, Pujolar JM, Hellemans B, Volkaert FA. Evidence for isolation by time in the European eel (Anguilla anguilla) Mol Ecol. 2006;15:2095–2107. doi: 10.1111/j.1365-294X.2006.02925.x. [DOI] [PubMed] [Google Scholar]
- Mastretta‐Yanes A, Arrigo N, Alvarez N, Jorgensen TH, Piñero D, Emerson BC. Restriction site-associated DNA sequencing, genotyping error estimation and de novo assembly optimization for population genetic inference. Mol Ecol Resour. 2015;15:28–41. doi: 10.1111/1755-0998.12291. [DOI] [PubMed] [Google Scholar]
- McRae BH. Isolation by Resistance. Evolution. 2006;60:1551–1561. [PubMed] [Google Scholar]
- Meirmans PG. Seven common mistakes in population genetics and how to avoid them. Mol Ecol. 2015;24:3223–3231. doi: 10.1111/mec.13243. [DOI] [PubMed] [Google Scholar]
- Meurling S, Kärvemo S, Chondrelli N, Cortazar-Chinarro M, Åhlén D, Brookes L et al. (2020) Occurrence of Batrachochytrium dendrobatidis in Sweden: higher infection prevalence in southern species. Dis Aquat Organ in press. [DOI] [PubMed]
- Michel MJ. Spatial dependence of phenotype-environment associations for tadpoles in natural ponds. Evolut Ecol. 2011;25:915–932. [Google Scholar]
- Montero-Mendieta S, Tan K, Christmas MJ, Olsson A, Vilà C, Wallberg A, et al. The genomic basis of adaptation to high-altitude habitats in the eastern honey bee (Apis cerana) Mol Ecol. 2019;28:746–760.. doi: 10.1111/mec.14986. [DOI] [PubMed] [Google Scholar]
- Nakajima K, Fujimoto K, Yaoita Y. Programmed cell death during amphibian metamorphosis. Semin Cell Dev Biol. 2005;16:271–280. doi: 10.1016/j.semcdb.2004.12.006. [DOI] [PubMed] [Google Scholar]
- Nunes AL, Orizaola G, Laurila A, Rebelo R. Rapid evolution of constitutive and inducible defenses against an invasive predator. Ecology. 2014;95:1520–1530. doi: 10.1890/13-1380.1. [DOI] [PubMed] [Google Scholar]
- Oksanen J, Guillaume-Blanchet F, Friendly M, Kindt R, Legendre P, McGlinn D et al. (2019) vegan:Community Ecology Package. R package version 2:5–5. https://CRAN.R-project.org/package=vegan
- Palik B, Batzer DP, Buech R, Nichols D, Cease K, Egeland L, et al. Seasonal pond characteristics across a chronosequence of adjacent forest ages in northern Minnesota, USA. Wetlands. 2001;21:532–542. [Google Scholar]
- Papaïx J, David O, Lannou C, Monod H. Dynamics of adaptation in spatially heterogeneous metapopulations. PLoS ONE. 2013;8:e54697. doi: 10.1371/journal.pone.0054697. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paxton RJ, Thorén PA, Tengö J, Estoup A, Pamilo P. Mating structure and nestmate relatedness in a communal bee, Andrena jacobi (Hymenoptera, Andrenidae), using microsatellites. Mol Ecol. 1996;5:511–519. doi: 10.1111/j.1365-294x.1996.tb00343.x. [DOI] [PubMed] [Google Scholar]
- Peterson BK, Weber JN, Kay EH, Fisher HS, Hoekstra HE. Double Digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PLOS ONE. 2012;7:e37135. doi: 10.1371/journal.pone.0037135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- R Core Team (2019) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/
- Rašić G, Filipović I, Weeks AR, Hoffmann AA. Genome-wide SNPs lead to strong signals of geographic structure and relatedness patterns in the major arbovirus vector, Aedes aegypti. BMC Genomics. 2014;15:275. doi: 10.1186/1471-2164-15-275. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reese H, Nilsson M, Pahén TG, Hagner O, Joyce S, Tingelöf U, et al. Countrywide estimates of forest variables using satellite data and field data from the National Forest Inventory. Ambio. 2003;32:542–548. doi: 10.1579/0044-7447-32.8.542. [DOI] [PubMed] [Google Scholar]
- Ribolli J, Hoeinghaus DJ, Johnson JA, Zaniboni-Filho E, de Freitas PD, Galetti PM. Isolation-by-time population structure in potamodromous Dourado Salminus brasiliensis in southern Brazil. Conserv Genet. 2017;18:67–76. [Google Scholar]
- Richardson JL, Urban MC, Bolnick DI, Skelly DK. Microgeographic adaptation and the spatial scale of evolution. Trends Ecol Evol. 2014;29:165–176. doi: 10.1016/j.tree.2014.01.002. [DOI] [PubMed] [Google Scholar]
- Richter-Boix A, Katzenberger M, Duarte H, Quintela M, Tejedo M, Laurila A. Local divergence of thermal reaction norms among amphibian populations is affected by pond temperature variation. Evolution. 2015;69:2210–2226. doi: 10.1111/evo.12711. [DOI] [PubMed] [Google Scholar]
- Richter-Boix A, Quintela M, Kierczak M, Franch M, Laurila A. Fine-grained adaptive divergence in an amphibian: genetic basis of phenotypic divergence and the role of nonrandom gene flow in restricting effective migration among wetlands. Mol Ecol. 2013;22:1322–1340. doi: 10.1111/mec.12181. [DOI] [PubMed] [Google Scholar]
- Rödin-Mörch P. Population divergence at different spatial scales in a wide-spread amphibian. Sweden: PhD thesis. Uppsala University; 2019. [Google Scholar]
- Rödin-Mörch P, Luquet E, Meyer-Lucht Y, Richter‐Boix A, Höglund J, Laurila A. Latitudinal divergence in a widespread amphibian: contrasting patterns of neutral and adaptive genomic variation. Mol Ecol. 2019;28:2996–3011. doi: 10.1111/mec.15132. [DOI] [PubMed] [Google Scholar]
- Rollins-Smith LA. The role of amphibian antimicrobial peptides in protection of amphibians from pathogens linked to global amphibian declines. Biochim et Biophys Acta. 2009;1788:1593–1599. doi: 10.1016/j.bbamem.2009.03.008. [DOI] [PubMed] [Google Scholar]
- Rollins-Smith LA, Conlon JM. Antimicrobial peptide defenses against chytridiomycosis, an emerging infectious disease of amphibian populations. Dev Comp Immunol. 2005;29:589–598. doi: 10.1016/j.dci.2004.11.004. [DOI] [PubMed] [Google Scholar]
- Row JR, Knick ST, Oyler‐McCance SJ, Lougheed SC, Fedy BC. Developing approaches for linear mixed modeling in landscape genetics through landscape-directed dispersal simulations. Ecol Evol. 2017;7:3751–3761. doi: 10.1002/ece3.2825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rundle HD, Nosil P. Ecological speciation. Ecol Lett. 2005;8:336–352. [Google Scholar]
- Safner T, Miaud C, Gaggiotti O, Decout S, Rioux D, Zundel S, et al. Combining demography and genetic analysis to asses the population structure of an amphibian in a human-dominated landscape. Conserv Genet. 2011;12:161–173. [Google Scholar]
- Santos H, Rousselet J, Magnoux E, Paiva MR, Branco M, Kerdelhué C. Genetic isolation though time: allochronic differentiation of a phonologically atypical population of the pine processionary moth. Proc R Soc B. 2007;274:935–941. doi: 10.1098/rspb.2006.3767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Savolainen O, Lascoux M, Merilä J. Ecological genomics of local adaptation. Nat Rev Genet. 2013;14:807–820. doi: 10.1038/nrg3522. [DOI] [PubMed] [Google Scholar]
- Savolainen O, Pyhäjärvi T, Knürr T. Gene flow and local adaptation in trees. Annu Rev Ecol Evol Syst. 2007;38:595–619. [Google Scholar]
- Semlitsch RD. Differentiating migration and dispersal processes for pond-breeding amphibians. J Wildl Manag. 2008;72:260–267. [Google Scholar]
- Session AM, Uno Y, Kwon T, Chapman JA, Toyoda A, Takahashi S, et al. Genome evolution in the allotetraploid frog Xenopus laevis. Nature. 2016;538:336–343. doi: 10.1038/nature19840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sexton JP, Hangartner SB, Hoffmann AA. Genetic isolation by environment or distance: which pattern of gene flow is most common? Evolution. 2014;68:1–15. doi: 10.1111/evo.12258. [DOI] [PubMed] [Google Scholar]
- Shafer ABA, Wolf JBW. Widespread evidence for incipient ecological speciation: a meta-analysis of isolation-by-ecology. Ecol Lett. 2013;16:940–950. doi: 10.1111/ele.12120. [DOI] [PubMed] [Google Scholar]
- Sillero N, Campos J, Bonardi A, Corti C, Creemers R, Crochet P-A, et al. Updated distribution and biogeography of amphibians and reptiles of Europe. Amphib Reptilia. 2014;35:1–31. [Google Scholar]
- Skelly DK. Microgeographic countergradient variation in the wood frog, Rana Sylvatica. Evolution. 2004;58:160–165. doi: 10.1111/j.0014-3820.2004.tb01582.x. [DOI] [PubMed] [Google Scholar]
- Smith MA, Green DM. Dispersal and the metapopulation paradigm in amphibian ecology and conservation: are all amphibian populations metapopulations? Ecography. 2005;28:110–128. [Google Scholar]
- Storey JD, Bass AJ, Dabney A, Robinson D (2019) qvalue: Q-value estimation for false discovery rate control. R package version 2.16.0. http://github.com/jdstorey/qvalue
- Storfer A, Murphy MA, Evans JS, Goldberg CS, Robinson S, Spear SF, et al. Putting the ‘landscape’ in landscape genetics. Heredity. 2007;98:128–142. doi: 10.1038/sj.hdy.6800917. [DOI] [PubMed] [Google Scholar]
- Sun Y-B, Xiong Z-J, Xiang X-Y, Liu S-P, Zhou W-W, Tu X-L, et al. Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes. Proc Natl Acad Sci USA. 2015;112:E1257–E1262. doi: 10.1073/pnas.1501764112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tang F, Ishwaran H. Random forest missing data algorithms. Stat Anal Data Min. 2017;10:363–377. doi: 10.1002/sam.11348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tigano A, Friesen VL. Genomics of local adaptation with gene flow. Mol Ecol. 2016;25:2144–2164. doi: 10.1111/mec.13606. [DOI] [PubMed] [Google Scholar]
- Van Buskirk J Incipient habitat race formation in an amphibian. J Evolut Biol. 2014;27:585–592. doi: 10.1111/jeb.12327. [DOI] [PubMed] [Google Scholar]
- Van Strien MJ, Holderegger R, van Heck HJ. Isolation-by-distance in landscapes: considerations for landscape genetics. Heredity. 2015;114:27–37. doi: 10.1038/hdy.2014.62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vasemägi A. The adaptive hypothesis of clinal variation revisited: single-locus clines as a result of spatially restricted gene flow. Genetics. 2006;173:2411–2414. doi: 10.1534/genetics.106.059881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vendrami DLJ, Telesca L, Weigand H, Weiss M, Fawcett K, Lehman K, et al. RAD sequencing resolves fine-scale population structure in a benthic invertebrate: implications for understanding phenotypic plasticity. R Soc Open Sci. 2017;4:160548. doi: 10.1098/rsos.160548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vincent B, Dionne M, Kent MP, Lien S, Bernatchez L. Landscape genomics in atlantic salmon (salmo salar): searching for gene–environment interactions driving local adaptation. Evolution. 2013;67:3469–3487. doi: 10.1111/evo.12139. [DOI] [PubMed] [Google Scholar]
- Vinogradov AE. Genome size and GC-percent in vertebrates as determined by flow cytometry: the triangular relationship. Cytometry. 1998;31:100–109. doi: 10.1002/(sici)1097-0320(19980201)31:2<100::aid-cyto5>3.0.co;2-q. [DOI] [PubMed] [Google Scholar]
- Vos CC, Jong AGA-D, Goedhart PW, Smulders MJM. Genetic similarity as a measure for connectivity between fragmented populations of the moor frog (Rana arvalis) Heredity. 2001;86:598–608. doi: 10.1046/j.1365-2540.2001.00865.x. [DOI] [PubMed] [Google Scholar]
- Wang IJ, Bradburd GS. Isolation by environment. Mol Ecol. 2014;23:5649–5662. doi: 10.1111/mec.12938. [DOI] [PubMed] [Google Scholar]
- Watanabe K, Kazama S, Omura T, Monaghan MT. Adaptive genetic divergence along narrow environmental gradients in four stream insects. PLoS ONE. 2014;9:e93055. doi: 10.1371/journal.pone.0093055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weir BS, Cockerham CC. Estimating F-statistics for the analysis of population structure. Evolution. 1984;38:1358–1370. doi: 10.1111/j.1558-5646.1984.tb05657.x. [DOI] [PubMed] [Google Scholar]
- Whelan NV, Galaska MP, Sipley BN, Weber JM, Johnson PD, Halanych KM, et al. Riverscape genetic variation, migration patterns, and morphological variation of the threatened Round Rocksnail, Leptoxis ampla. Mol Ecol. 2019;28:1593–1610. doi: 10.1111/mec.15032. [DOI] [PubMed] [Google Scholar]
- Whitlock MC, McCauley DE. Indirect measures of gene flow and migration: FST ≠ 1/(4 Nm +1) Heredity. 1999;82:117. doi: 10.1038/sj.hdy.6884960. [DOI] [PubMed] [Google Scholar]
- Wright S. Isolation by distance. Genetics. 1943;28:114–138. doi: 10.1093/genetics/28.2.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yandell M, Ence D. A beginner’s guide to eukaryotic genome annotation. Nat Rev Genet. 2012;13:329–342. doi: 10.1038/nrg3174. [DOI] [PubMed] [Google Scholar]
- Yeaman S, Otto SP. Establishment and maintenance of adaptive genetic divergence under migration, selection, and drift. Evolution. 2011;65:2123–2129. doi: 10.1111/j.1558-5646.2011.01277.x. [DOI] [PubMed] [Google Scholar]
- Youngquist M, Inoue K, Berg D, Boone MD. Effects of land use on population presence and genetic structure of an amphibian in an agricultural landscape. Landsc Ecol. 2017;32:147–162. [Google Scholar]
- Yu L, Wang G-D, Ruan J, Chen Y-B, Yang C-P, Cao X, et al. Genomic analysis of snub-nosed monkeys (Rhinopithecus) identifies genes and processes related to high-altitude adaptation. Nat Genet. 2016;48:947–952. doi: 10.1038/ng.3615. [DOI] [PubMed] [Google Scholar]
- Zamudio KR, Wieczorek AM. Fine-scale spatial genetic structure and dispersal among spotted salamander (Ambystoma maculatum) breeding populations. Mol Ecol. 2007;16:257–274. doi: 10.1111/j.1365-294X.2006.03139.x. [DOI] [PubMed] [Google Scholar]
- Zellmer AJ. Microgeographic morphological variation across larval wood frog populations associated with environment despite gene flow. Ecol Evol. 2018;8:2504–2517. doi: 10.1002/ece3.3829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang H, Xu Q, Krajewski S, Krajewska M, Xie Z, Fuess S, et al. BAR: An apoptosis regulator at the intersection of caspases and Bcl-2 family proteins. Proc Natl Acad Sci USA. 2000;97:2597–2602. doi: 10.1073/pnas.97.6.2597. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Sequencing raw data is accessible at NCBI SRA under BioProject accession number: PRJNA660937. Relevant input and output files and analysis scripts are accessible at Dryad (10.5061/dryad.b5mkkwhbc).