Abstract
Small, isolated populations present a challenge for conservation. The dueling effects of selection and drift in a limited pool of genetic diversity make the responses of small populations to environmental perturbations erratic and difficult to predict. This is particularly true at the edge of a species range, where populations often persist at the limits of their environmental tolerances. Populations of cisco, Coregonus artedi, in inland lakes have experienced numerous extirpations along the southern edge of their range in recent decades, which are thought to result from environmental degradation and loss of cold, well‐oxygenated habitat as lakes warm. Yet, cisco extirpations do not show a clear latitudinal pattern, suggesting that local environmental factors and potentially local adaptation may influence resilience. Here, we used genomic tools to investigate the nature of this pattern of resilience. We used restriction site‐associated DNA capture (Rapture) sequencing to survey genomic diversity and differentiation in southern inland lake cisco populations and compared the frequency of deleterious mutations that potentially influence fitness across lakes. We also examined haplotype diversity in a region of the major histocompatibility complex involved in stress and immune system response. We correlated these metrics to spatial and environmental factors including latitude, lake size, and measures of oxythermal habitat and found significant relationships between genetic metrics and broad and local factors. High levels of genetic differentiation among populations were punctuated by a phylogeographic break and residual patterns of isolation‐by‐distance. Although the prevalence of deleterious mutations and inbreeding coefficients was significantly correlated with latitude, neutral and non‐neutral genetic diversity were most strongly correlated with lake surface area. Notably, differences among lakes in the availability of estimated oxythermal habitat left no clear population genomic signature. Our results shed light on the complex dynamics influencing these isolated populations and provide valuable information for their conservation.
Keywords: deleterious mutations, genetic drift, major histocompatibility complex, marginal populations, mutational load, small populations
1. INTRODUCTION
Understanding the nature of resilience and the capacity for evolution in the face of rapidly changing environments is a cornerstone of conservation genomics. Sources of selective pressure are numerous, and studies using genotype–environment association analyses that detect locally adapted populations can provide valuable insight on the environmental forces that influence population viability (Berg et al., 2015; Eizaguirre et al., 2012; Yeaman et al., 2016). Since heritable genetic variation provides the foundation for adaptive capacity under selection, the maintenance of diversity is fundamental to the preservation of healthy populations (Hoban et al., 2013; Sgrò et al., 2011). The mutations that underlie allelic diversity occur on a spectrum that can be broadly reduced to three categories (advantageous, neutral, or deleterious), with the frequency of these categories varying under mutation‐selection balance.
In small populations, the mutation‐selection balance can often be upset, making the frequency of deleterious mutations a conservation concern. Most mutations are mildly deleterious (Fay et al., 2001; Keightley & Lynch, 2003; Ohta, 1995), and in a large, randomly outcrossing population, recombination and segregation are expected to keep the mutational load from significantly affecting absolute mean fitness (Agrawal & Whitlock, 2012; Haag & Roze, 2007). The role of genetic drift increases as population sizes decline, such that natural selection may be ineffective at removing deleterious mutations (Whitlock & Bürger, 2004; Wright, 1931), leading to “mutational meltdown” and eventually extirpation of small populations (Lynch et al., 1995). As new genomic tools have become available to identify putatively deleterious variants, studies of small populations are quantifying deleterious mutations to assess the viability of at‐risk populations (Andrews, 2010) and to inform conservation efforts (e.g., Ferchaud et al., 2018; Grossen et al., 2020; Perrier et al., 2017).
Conservation of small populations is inherently challenging, especially when they are isolated from nearby populations (Pekkala et al., 2012; Ralls et al., 2018). Without the option of being rescued by immigration, isolated populations are especially susceptible to environmental fluctuations and local habitat degradation (Pimm, 1991). These unique attributes mean that conservation principles developed for larger populations or metapopulations may not be suitable for managing small populations. For example, many conservation plans attempt to understand how species will respond to rising global temperatures on relatively large spatial scales (e.g., 100s of kms), but the viability of isolated, small populations depends on the details of how their local environment changes—which may deviate from large‐scale trends. For example, aquatic ecosystems vary widely in the source of water (Lisi et al., 2013), habitat dimensions and nutrient loads (Olden et al., 2001), and precipitation inputs (Bay et al., 2018) even within the same geographic region. Additionally, environmental variation at local scales has been linked to immunocompetence in aquatic populations (Larson et al., 2016). In habitats separated by as little as 1 km, Larson et al. (2016) found differences in water temperature were significantly correlated with diversity in a major histocompatibility complex (MHC) gene involved in immune system response. Understanding how these local factors influence the genetic integrity of small, isolated populations is vital for managing them.
Here, we use cisco (Coregonus artedi) as a model organism to understand how spatial and environmental factors influence genetic integrity of small, isolated populations under a changing climate. Cisco are an important forage fish found in deep, cold lakes across northern North America (Scott & Crossman, 1973). As cold‐water specialists, ciscoes require both cold‐water temperatures and sufficient dissolved oxygen, found within the oxythermal habitat layer, to survive the stratification that occurs in deep lakes during the summer (Cahn, 1927; Frey, 1955; Jacobson et al., 2008; Lyons et al., 2018). Rising global temperatures have been linked to freshwater fish die‐offs (Till et al., 2019), and declines in abundance as well as complete extirpations of inland lake cisco populations have been documented along the southern edge of the species range in Minnesota, Wisconsin, and Indiana (Honsey et al., 2016; Jacobson et al., 2012). Contemporary surveys across 133 Wisconsin lakes that historically contained ciscoes reveal a mosaic pattern of extirpation rather than a clear latitudinal gradient (Lyons et al., 2015); geographic proximity does not predict which populations were extirpated and which continue to thrive, suggesting that local factors mediate population outcomes.
A growing body of research focuses on using oxythermal habitat modeling to predict extirpations of cold‐water fish populations and design conservation strategies (Jacobson et al., 2013; Jiang et al., 2012; Lyons et al., 2018; Magee et al., 2019; Renik et al., 2020), but this perspective has not been fully integrated with genomic analyses of cisco. Previous genetic analyses across the cisco species range using mitochondrial and microsatellite DNA have found evidence for at least two major lineages (Bernatchez & Dodson, 1990; Turgeon & Bernatchez, 2001a, 2001b, 2003). More recently, a study of cisco in inland lakes of Ontario using a genomic method similar to restriction site‐associated DNA (RAD) sequencing detected two lineages and measured high levels of differentiation among lake systems (Piette‐Lauzière et al., 2019), but did not quantify genomic diversity within or among lakes. The southern edge of the species’ range offers opportunities to assess how genetic metrics correlate with both spatial variables and static and climate‐dynamic environmental variables that may be associated with population persistence.
In this study, we applied multiple genomic tools to profile surviving populations of cisco near their southern range limit in Minnesota, Wisconsin, Indiana, and Michigan, and reconstructed the dynamics of oxythermal habitat availability in each lake across a 37‐year period before genetic sampling. We used restriction site‐associated DNA capture (Rapture) sequencing Ali et al. (2016) to survey genomic diversity and differentiation, and to enumerate the presence and frequency of putative deleterious mutations among lakes. We also amplified the MHC IIβ exon 2 to examine the contributions of neutral or pathogen‐mediated adaptive processes to haplotype diversity across these southern inland cisco populations. To understand the spatial and environmental context for genomic variation in ciscoes, we correlate each of these genetic metrics to the latitude, size, depth, and oxythermal habitat availability in each lake.
2. METHODS
2.1. Tissue sampling and DNA extraction
Cisco used in our study were collected from 29 lakes across the southern part of the species’ range using vertical gill nets or from die‐offs during 2012–2019 (Figure 1; Table 1). To our knowledge, all sampled lakes contained the standard inland cisco ecotype except for one lake, Atkins (ATK), where the cisco population is comprised of a dwarf cisco ecotype. Fin clips were provided by Michigan State University (Howard Lake), the Indiana Department of Natural Resources and Purdue University (Indiana lakes), the Wisconsin Department of Natural Resources (Wisconsin lakes), and the Minnesota Department of Natural Resources and the University of Minnesota‐Duluth (Minnesota lakes). DNA from fin tissue was extracted using Qiagen DNeasy® Blood & Tissue Kit reagents.
FIGURE 1.

Map showing the 29 sampled cisco lakes. Inset (bottom left): box indicates the location of the sampling area within the Laurentian Great Lakes region. Inset (top right): sampled lakes in northern Wisconsin. Lake names and site data can be found with site codes in Table 1. We detected two phylogenetic lineages in our sampling area. Populations associated with the Mississippian (MS) lineage are denoted with circles and populations associated with the Great Lakes (GL) lineage are denoted with triangles
TABLE 1.
Location and environmental data for the 29 cisco lakes sampled across the southern periphery of the species range. Complete oxythermal data can be found in Data S2
| State | Lake | Site code | Latitude | Longitude | Max Depth (m) | Surface Area (km2) | TDO3max |
|---|---|---|---|---|---|---|---|
| Minnesota | Lower Hay | LHY | 46.6693 | −94.2843 | 42.06 | 57.75 | 22.5516 |
| Sugar | SGR | 45.3189 | −94.0418 | 21.03 | 4.1 | 22.9099 | |
| Long | LNG | 46.4895 | −93.4938 | 35.36 | 1.76 | 17.4888 | |
| Wisconsin | Bear | BEA | 45.6343 | −91.8248 | 27 | 5.5 | 20.4916 |
| Grindstone | GRD | 45.9353 | −91.4154 | 18 | 12.6 | 22.7022 | |
| Atkins | ATK | 46.2789 | −91.0367 | 24 | 0.7 | 12.5363 | |
| Trude | TRU | 46.1148 | −90.1579 | 15 | 54.8 | 22.1987 | |
| Pike | PIK | 46.1747 | −90.1196 | 25 | 0.8 | 11.4723 | |
| Presque Isle | PRS | 46.2227 | −89.7798 | 24 | 5.2 | 17.8400 | |
| Trout | TRT | 46.0303 | −89.6766 | 36 | 15.4 | 11.4034 | |
| White Sand | WSL | 46.0887 | −89.5941 | 2 | 3 | 12.1445 | |
| Big Muskellunge | MSK | 46.0174 | −89.6149 | 21 | 3.8 | 16.1785 | |
| Pallette | PLT | 46.0665 | −89.6039 | 20 | 0.7 | 15.2114 | |
| Plum | PLM | 46.0033 | −89.5193 | 17 | 4.5 | 22.2666 | |
| Tomahawk | TOM | 45.8304 | −89.6675 | 26 | 14.4 | 14.8864 | |
| Two Sisters | TWS | 45.7716 | −89.5207 | 19 | 2.9 | 13.3695 | |
| Black Oak | BLK | 46.1612 | −89.3155 | 26 | 2.4 | 9.4554 | |
| Long | LGL | 46.0645 | −89.0225 | 29 | 3.5 | 12.5157 | |
| Sevenmile | SVN | 45.8811 | −89.0506 | 13 | 2 | 22.3921 | |
| Rainbow | RBW | 44.3411 | −89.1504 | 29 | 2.4 | 10.6171 | |
| Spencer | SPN | 44.2897 | −89.1043 | 16 | 0.3 | 12.5953 | |
| Gilbert | GIL | 44.2120 | −89.1675 | 20 | 0.6 | 12.3836 | |
| Green | GRN | 43.8186 | −88.9876 | 72 | 29.7 | 13.5789 | |
| Elkhart | ELK | 43.8262 | −88.0240 | 36 | 1.2 | 11.5196 | |
| Okauchee | OKA | 43.1262 | −88.4232 | 29 | 4.8 | 20.5535 | |
| Geneva | GNV | 42.5637 | −88.5179 | 41 | 21.3 | 11.2306 | |
| Michigan | Howard | HOW | 42.0808 | −85.5889 | 16.15 | 1.84 | 14.8317 |
| Indiana | Eve | EVE | 41.5603 | −85.3199 | 12.8 | 0.3 | 17.4626 |
| Crooked | CRD | 41.2634 | −85.4803 | 32.9 | 1.6 | 13.2668 |
2.2. Development of Rapture panel
We developed a Rapture panel using restriction site‐associated DNA (RAD) data from 129 individuals (data previously published in Euclide et al., 2020) to increase genotyping efficiency and the number of individuals that could be multiplexed on each sequencing lane. The 129 individuals were sampled from multiple populations in Minnesota (N = 24), northern Wisconsin (N = 37), central Wisconsin (N = 26), southern Wisconsin (N = 10), Indiana (N = 13), and Michigan (N = 19). RAD library preparation was conducted using the SbfI enzyme according to the methods of Ali et al. (2016), and libraries were sequenced on two lanes of an Illumina HiSeq 4000 at the Michigan State Genomics Core Facility. Sequence data were analyzed with Stacks v1.46 (Catchen et al., 2011, 2013) with the following parameters: process_radtags (flags = c, ‐q, ‐r, ‐t 140), ustacks (flags = ‐m 3, ‐M 5, ‐H ‐‐max_locus_stacks 3, ‐‐model_type bounded, ‐‐bound_high 0.05), cstacks (‐n of 4, 29 individuals with the most data included in the catalog), and populations (flags = ‐r 0.7, ‐‐min_maf 0.05). In addition, since all salmonids including cisco have experienced a recent genome duplication, putatively paralogous loci were identified with the program HDPlot (McKinney et al., 2017), and any loci with heterozygosity >0.6 or a read ratio deviation >10 and <–6 were classified as duplicated. We chose a total of 11,330 tags containing high‐quality singleton SNPs (i.e., not from duplicated regions of the salmonid genome) and 1670 tags containing duplicated SNPs for a total of 13,000 tags. Sequence data for the loci that met our quality standards were sent to Arbor Biosciences (Ann Arbor, MI) for Rapture bait development. Arbor Biosciences conducted additional quality filters to ensure robust performance and synthesized two baits per‐locus when possible, resulting in a final panel of 7753 loci and 14,382 unique baits.
2.3. Rapture loci processing
Rapture libraries were prepared with the RAD protocol used for panel development followed by bait capture using the myBaits protocol v4.01 (Arbor Biosciences) with minor modifications. Libraries were hybridized with the bait mixture for 16 hr at 65°C and amplified with 10 PCR cycles, universal primers, and an annealing temperature of 56°C. Baited Rapture libraries were purified with 1X AMPureXP beads and sequenced on five Illumina HiSeq 4000 lanes at either the Michigan State Genomics Core Facility or Novogene.
Raw sequences were processed with Stacks v2.41 (Rochette et al., 2019). Sequences were demultiplexed by barcode, filtered for presence of the enzyme cut‐site and quality, and trimmed in the subprogram process_radtags (parameter flags: ‐e SbfI ‐c ‐q ‐r ‐t 140 ‐‐filter_illumina ‐‐bestrad). Filtered reads for each individual were aligned to create matching stacks with ustacks (flags: ‐‐disable‐gapped ‐m 3 ‐M 5 ‐H ‐‐max_locus_stacks 4 ‐‐model_type bounded ‐‐bound_high 0.05). A master catalog of consensus loci was built in cstacks from a catalog of 51 C. artedi used in the development of a cisco linkage map (Blumstein et al., 2020) that was appended with five individuals from each lake that retained the highest number of reads after process_radtags (flags: ‐n 3 ‐p 6 –disable_gapped). Locus stacks for individuals were matched to the catalog using sstacks (flag: ‐‐disable_gapped), data were oriented by locus in tsv2bam, and reads were aligned to loci and SNPs were called with gstacks. SNPs genotyped in >30% of individuals (flag: ‐r 0.3) were exported with populations to a variant call format (vcf) file.
SNP filtering was performed with vcftools v0.1.15 (Danecek et al., 2011) and included the following: (1) removing loci genotyped in <70% of individuals; (2) removing individuals missing >50% of loci; and (3) removing loci with a minor allele frequency <0.01. Any loci remaining after primary filtering that were not genotyped in every lake were also removed. Putatively paralogous loci were identified with the program HDPlot (McKinney et al., 2017), and loci with heterozygosity >0.5, a read ratio deviation >15 and < –15, and an allele balance >0.7 and <0.3 were removed. Finally, loci on the same RAD tag are likely to be physically linked. Therefore, only the SNP with the highest minor allele frequency on each tag was included in the final dataset. File format conversions were performed with PGDSpider v2.1.1.5 (Lischer & Excoffier, 2011).
2.4. Genomic differentiation and diversity
We calculated diversity statistics for each population with GenoDive v3.0 (Meirmans, 2020), including observed and expected heterozygosity, an inbreeding coefficient (G IS, a heterozygosity‐based analog of F IS; Nei, 1987), and the proportion of polymorphic alleles present in each population. Effective population size (N e) was estimated in each population with the bias‐corrected linkage disequilibrium method (LDNE; Hill, 1981; Waples, 2006; Waples & Do, 2010) in the software package NeEstimator v2.1 (Do et al., 2014). We restricted pairs of loci used in calculations to those comprised of two loci on different linkage groups of the cisco linkage map (Blumstein et al., 2020) using the LD locus pairing option to correct for the effects of physical linkage on the estimates of N e (Waples et al., 2016). We used a p‐crit of 0.05 (Waples et al., 2016) to reduce potential bias introduced by low frequency alleles. N e calculations using the linkage disequilibrium method can be biased slightly downward when individuals from multiple cohorts are included in the sample due to a slight Wahlund effect (7% downward bias on average; Waples et al., 2014). Nonetheless, this small bias should not greatly affect the interpretation of the N e results.
We also used GenoDive to estimate standardized pairwise genetic differentiation between populations (F ST). Analysis of genetic differentiation across our populations indicated that we likely sampled inland lakes across a putative phylogeographic break. We constructed a population tree from allele frequency data using the software POPTREE2 to identify which populations belonged to which divergent clade. Genetic distance between inland lake populations was measured with DA (Nei et al., 1983), and trees were generated using the neighbor‐joining (NJ) method of Saitou and Nei (1987) with 1000 bootstrap replicates. Correlation between Slatkin's linearized F ST (F ST /(1 − F ST)) (Rousset, 1997) and planar geographic distance between populations was examined using Mantel and partial Mantel tests to test for isolation‐by‐distance (IBD) while accounting for lineage effects (Mantel, 1967; Meirmans, 2012; Sokal, 1979), and the regression between F ST/1 − F ST and geographic distance (km) was plotted. We also examined the relationship of genetic distance and spatial heterogeneity in effective population size by testing the correlation between pairwise genetic differentiation (F ST /(1 − F ST)) and the di metric of Prunier et al. (distance based on the inverse; 2017) with log (surface area) as a proxy for local carrying capacity, as well as the correlation of average pairwise genetic differentiation to log (surface area) as implemented in Perrier et al. (2017). Finally, we used analysis of molecular variance (AMOVA) in GenoDive to test for the relative amounts of variance in our dataset that could be explained by individual, population, or lineage effects.
2.5. Identifying putatively deleterious mutations
We identified putative deleterious mutations in our Rapture data using the program PROVEAN (Choi et al., 2012) and methods similar to Perrier et al. (2017) and Ferchaud et al. (2020), Ferchaud et al. (2018); scripts included with our supplementary material on Dryad). First, we created a fasta file including each allele for each locus from the catalog.fa file output from Stacks and a VCF file using the script make_fasta_with_all_alleles.py. We then aligned sequences for each RAD locus to the protein coding sequence from the Atlantic salmon (Salmo salar) genome (Lien et al., 2016) using BLASTX. Hits with a sequence overlap of >40 amino acids and a percent identity >90% were retained, and if more than one hit for a given locus met these parameters, the hit with the highest % identity was kept (parse_blast_results_best_align_delet_mut_consensus_seqs.py). Next, portions of sequences for each locus that aligned to Atlantic salmon proteins were translated into protein coding sequences with each allele representing a unique sequence (pull_seqs_to_translate_consensus_blast.py). Protein sequences for loci with nonsynonymous mutations were then converted to PROVEAN input files (generate_provean_files.py). We then used PROVEAN linked to the NCBI BLAST database v4 (https://ftp.ncbi.nlm.nih.gov/blast/db/v4/, author note: v5 was available but incompatible with PROVEAN at the time of analysis) to assess whether each mutation was putatively deleterious (PROVEAN score ≤ −2.5) and outputs were parsed with a custom script (parse_provean_output.py). We interpreted the allele that generated the negative PROVEAN score to be the deleterious allele and the alternative allele (≥2.5) to be neutral, following similar approaches in the literature (e.g., Conte et al., 2017). Finally, we calculated two metrics to assess the frequency of deleterious mutations in each population: (1) average allele frequency of deleterious alleles (AFdel mut), and (2) proportion of deleterious mutations (Propdel mut), which was calculated as the number of loci with an allele frequency >0 for a deleterious mutation in that population divided by the total number of deleterious mutations in the dataset.
2.6. MHC processing
We amplified a 300‐bp region of the MHC IIβ exon 2 using primers and reaction conditions described in Pavey et al. (2011). We then attached sequencing barcodes and adapters and normalized the DNA using the genotyping‐in‐thousands by sequencing (GT‐seq) protocol (Campbell et al., 2015) following Bootsma, Gruenthal, et al. (2020). Each plate was barcoded and then batch normalized using SequalPrep DNA Normalization plates (Invitrogen). Normalized products were pooled, then cleaned and size selected using AMPureXP magnetic beads (Beckman Coulter, Brea, CA). Products were run on an agarose gel to verify size ranges and quantified using a Qubit 2.0 fluorometer. Amplicons were sequenced on an Illumina MiSeq 2x300 flowcell at the University of Wisconsin Biotechnology Center in Madison, Wisconsin.
After sequencing, terminal read quality was examined with FastQC (Andrews, 2010), and sequences were trimmed to 176 bp with the command line tool fastx_trimmer from FASTX‐Toolkit (https://github.com/agordon/fastx_toolkit). Forward and reverse reads were merged using the software tool FLASH (‐m 50, ‐M 150; Magoč & Salzberg, 2011), and the final 300 bp reads were quality filtered with the perl tool PRINSEQ‐lite (http://prinseq.sourceforge.net/) using a minimum base quality score (min_qual_score) of 30. The demultiplexing process removed barcodes from sequences so those were extracted from sequence headers and inserted at the beginning of reads (barcodes‐min.py) before importing into the software jMHC (Stuglik et al., 2011). To maximize the number of sequences imported to the jMHC SQLite database, the forward and reverse primers were truncated to 10 bps. Allowing for one mismatch per 12 bp i7+i5 barcode, reads were then searched from the forward orientation, and if both primers and barcode were found, the read was imported to the database. Unique haplotypes with >50 reads were then exported for analysis. An MHC haplotype was present in an individual if the haplotype contained >10% of reads in that individual. MHC haplotypes that were only found once in the dataset were likely errors and were excluded. Many individuals contained >2 MHC haplotypes, indicating that our primers amplified multiple MHC genes, which is an expected result for salmonids (Harstad et al., 2008). However, we were unable to assign dosage (i.e., copy number) for each allele because there were no clear differences in read count distributions among putative genotype classes. Our genotype matrix is therefore coded as presence/absence data rather than complete genotype calls.
We used the generated MHC genotype matrix to investigate diversity, selection, and differences in MHC haplotype frequencies among populations. Estimates of allelic diversity can be highly influenced by sample sizes, but rarefaction methods such as that implemented in the program HPRARE (Kalinowski, 2005) can be used to estimate unbiased allelic richness. Unfortunately, this program can only take diploid data and many of the individuals in our dataset contained >2 MHC alleles. We therefore calculated a rarefied average number of MHC haplotypes present in each population (N hap) by randomly sampling N = 5 individuals (minimum sample size in our study) from each population 1000 times and calculating the mean number of haplotypes found in those samples. We also estimated the proportion of individuals in each population that contain more than one allele (Proppoly) and the proportion of individuals in a population with more than two MHC alleles (Prop>2 alleles). We conducted a codon‐based Z‐test of selection in MEGA 7 using the Nei‐Gojobori method (Nei & Gojobori, 1986) to test whether the ratio of nonsynonymous mutations (DN) was greater than synonymous mutations (DS). Finally, we visualized MHC haplotype frequencies calculated as the number of individuals containing a given haplotype divided by the total number of individuals for each population with a stacked bar plot. This metric was standardized to 1 by dividing by the sum of the frequencies for each population which was >1 as individuals can have >1 haplotype.
2.7. Oxythermal habitat modeling
We used the open source, vertical one‐dimensional hydrodynamic General Lake Model (GLM; Hipsey et al., 2019) v.2.1.8, coupled with the Aquatic EcoDynamics (AED2) model (Hipsey et al., 2020) to develop daily water temperature and dissolved oxygen profiles for each lake over the years 1979–2015. GLM and AED are available online (https://github.com/AquaticEcoDynamics/GLM and https://github.com/AquaticEcoDynamics/libaed2). This model simplifies lakes using a vertical Lagrangian layer approach, where horizontal variability is ignored, and each layer's thickness dynamically changes in response to water density (Hipsey et al., 2019). The water quality module, AED2, was configured to simulate the dynamics of dissolved oxygen, carbon, silica, nitrogen, phosphorus, organic matter, phytoplankton (three functional groups representing cyanobacteria, diatoms, and chlorophytes), and zooplankton (two functional groups representing copepods and Daphnia).
Historic climate drivers for each lake were estimated using data from (Winslow et al., 2017) and expanded following similar methods for lakes that were not available in the original dataset. The model does not consider surface inflow of water into the system, and we assumed that water levels remained constant over the study period. To counteract any long‐term negative change in lake level, we applied a small correction to precipitation in summer as in Winslow et al. (2017). In situ water temperature and dissolved oxygen profiles were collected from the NTL‐LTER program (https://lter.limnology.wisc.edu/), US Geological Service, Michigan DNR, Minnesota DNR, Wisconsin DNR, University of Wisconsin‐Madison, and Purdue University and compiled as supplemental material on Dryad. We performed a manual calibration for each lake based on the full set of temperature and dissolved oxygen profiles available. Model parameters were first calibrated for water temperature and second for dissolved oxygen. Lake‐specific data were only available for water temperature and dissolved oxygen, so default values were used for other model parameters, as is standard practice. Models were run at an hourly timestep starting April 4, 1979, and continuing through December 31, 2015, with output profiles once daily at noon. Model configuration files, input, and output profiles are accessible on Dryad.
2.8. Environmental variables in sampled cisco populations
We obtained data on surface area and maximum depth from Minnesota Department of Natural Resources databases for Minnesota lakes, from Renik et al. (2020) for Wisconsin Lakes, from Honsey (2014) for Indiana lakes, and from an EPA report for Howard Lake in Michigan (EPA, 1975). Environmental variables to estimate cisco oxythermal habitat from the hydrodynamic model were as follows: maximum daily TDO3 (temperature at 3 mg L−1 dissolved oxygen; Jacobson et al., 2010); seven‐day maximum TDO3 (TDO37day); 30‐day maximum TDO3 (TDO330day); COSD (Cumulative Oxythermal Stress Dosage; Magee et al., 2019); minimum yearly oxythermal habitat thickness; and minimum yearly oxythermal habitat volume estimated from model‐derived daily water temperature and dissolved oxygen profiles from 1979 to 2015. Maximum daily TDO3 (TDO3max) values were calculated following Jacobsen et al. (2010), and TDO37day and TDO330day values were calculated following (Jiang et al., 2012) using either a 7‐ or 30‐day moving average. COSD, which considers both the magnitude and duration of oxythermal stress conditions, was calculated as in Magee et al. (2019) using 3 mg L−1 DO and 17°C thresholds. Daily vertical habitat thickness was calculated from profiles by subtracting the depth at which the thermal profile reached 17°C from the depth at which the DO profile reached 3 mg L−1. The minimum yearly vertical habitat thickness was determined to be the minimum daily vertical habitat thickness value during the open‐water period and could be negative in instances where 3 mg L−1 of DO was achieved in the metalimnion or epilimnion. The minimum yearly oxythermal habitat volume was estimated by multiplying the minimum yearly vertical thickness by the lake's surface area. Calculations were performed using Matlab R2019a, and code is available as supplemental material on Dryad. One value for each environmental variable was calculated for each study year, and we also extracted long‐term averages and extreme values that could lead to bottlenecks in habitat availability. Lake surface area and maximum depth were considered as static descriptors of each lake.
2.9. Correlating genetic variables with spatial and environmental parameters
We screened spatial and environmental variables for multicollinearity using pairwise correlations and variance inflation factors (VIF) with the R packages Hmisc (Harrell Jr & others, 2020) and usdm (Naimi et al., 2014). Predictors were pruned iteratively based on a threshold of VIF <5 and pairwise Pearson's correlation coefficients of |r| >0.7 (Dormann et al., 2013). We conducted RDA analysis in the R package vegan (Okansen et al., 2016) to test for a relationship between spatial and environmental variables and genetic variables. For this analysis, the response matrix was composed of genetic diversity metrics and the predictor matrix was composed of spatial and environmental variables. Our choice to conduct RDA instead of canonical correspondence analysis, a similar multivariate method, was informed by a detrended correspondence analysis, which indicated that variables were linear and thus more appropriate for RDA (Legendre & Legendre, 1998). The significance of each predictor (i.e., their influence on variation in genetic metrics) was assessed with ANOVAs. We then used multiple linear regressions to test for relationships between genetic (response) variables and environmental (predictor) variables. The relative importance of each variable was assessed with the lmg method implemented in the R package relaimpo (Groemping, 2006). Finally, we conducted univariate linear regressions with the ‘lm’ function in R (and visualized with ggplot2; Wickham, 2011) to test for individual relationships between genetic variables and spatial and environmental variables that were significant in the RDA.
3. RESULTS
3.1. Genomic differentiation and diversity with Rapture loci
Sequence reads were generated for a total of 850 samples from our 29 lakes, and after processing in Stacks, 543,108 shared variant sites were genotyped in >30% of individuals. A total of 59,865 loci were genotyped in >70% of individuals. Eighty individuals were dropped for missingness >50%. A large percentage of samples dropped were from previously archived DNA extracts, dried fin clips, and/or collections from die‐offs. The loss of these individuals was most likely due to low quality (degraded) DNA in the extracts used for Rapture library preparation, which has been shown to highly impact the percentage of retained tags (Graham et al., 2015). The dataset contained 16,584 loci after filtering for minor allele frequency. Following the removal of putatively paralogous loci (n = 3578), removal of loci that were not genotyped in every lake (n = 59), and filtering to a single SNP per Rapture locus, 7546 SNPs genotyped in 770 individuals (see Table 2 for number of individuals genotyped per population) remained for analysis.
TABLE 2.
Rapture and MHC data for sampled lakes
| Genetic lineage | Site code | Rapture data | MHC data | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | H o | H e | G IS | Proppoly | N e (CI) | AFdel mut | Propdel mut | N | Nhaps | Proppoly | Prop>2 alleles | ||
| Mississippian (MS) | LHY | 26 | 0.236 | 0.273 | 0.136 | 0.917 | 561 (504–631) | 0.72 | 1 | 25 | 7.96 | 0.96 | 0.24 |
| SGR | 23 | 0.125 | 0.173 | 0.276 | 0.781 | 36 (35–37) | 0.71 | 0.97 | 18 | 7.43 | 0.78 | 0.50 | |
| LNG | 18 | 0.177 | 0.254 | 0.302 | 0.866 | 437 (362–552) | 0.73 | 1 | 11 | 6.41 | 0.82 | 0.36 | |
| BEA | 30 | 0.130 | 0.178 | 0.267 | 0.783 | 519 (456–602) | 0.70 | 0.93 | 26 | 5.45 | 0.69 | 0.46 | |
| GRD | 28 | 0.170 | 0.224 | 0.244 | 0.854 | 207 (198–217) | 0.70 | 0.97 | 27 | 5.37 | 0.78 | 0.33 | |
| ATK | 54 | 0.103 | 0.125 | 0.173 | 0.706 | 128 (124–132) | 0.74 | 0.89 | 41 | 2.95 | 0.90 | 0.34 | |
| TRU | 30 | 0.257 | 0.264 | 0.028 | 0.897 | 131 (128–134) | 0.71 | 0.99 | 5 | 8.00 | 0.80 | 0.40 | |
| PIK | 30 | 0.185 | 0.216 | 0.142 | 0.826 | 584 (521–665) | 0.73 | 0.96 | 21 | 2.68 | 0.48 | 0 | |
| PRS | 30 | 0.178 | 0.245 | 0.273 | 0.880 | 155 (151–161) | 0.73 | 0.98 | 27 | 5.25 | 0.67 | 0.33 | |
| TRT | 13 | 0.174 | 0.243 | 0.286 | 0.851 | 231 (201–271) | 0.71 | 0.97 | 15 | 8.29 | 1.00 | 0.47 | |
| WSL | 30 | 0.228 | 0.237 | 0.039 | 0.868 | 1781 (1355–2593) | 0.70 | 0.97 | 28 | 5.64 | 0.79 | 0.39 | |
| MSK | 29 | 0.154 | 0.209 | 0.263 | 0.819 | 157 (151–164) | 0.73 | 0.95 | 24 | 4.36 | 0.83 | 0 | |
| PLT | 30 | 0.176 | 0.192 | 0.084 | 0.809 | 798 (678–970) | 0.73 | 0.95 | 28 | 3.27 | 0.61 | 0 | |
| PLM | 30 | 0.211 | 0.221 | 0.046 | 0.845 | 636 (567–726) | 0.74 | 0.96 | 26 | 5.91 | 0.73 | 0.15 | |
| TOM | 30 | 0.215 | 0.257 | 0.163 | 0.902 | 532 (486–587) | 0.74 | 0.98 | 26 | 8.28 | 0.96 | 0.38 | |
| TWS | 30 | 0.202 | 0.219 | 0.079 | 0.849 | 261 (248–275) | 0.74 | 0.98 | 29 | 5.37 | 0.90 | 0.38 | |
| BLK | 24 | 0.127 | 0.190 | 0.330 | 0.794 | 63 (62–65) | 0.74 | 0.94 | 14 | 4.29 | 0.64 | 0 | |
| LGL | 28 | 0.136 | 0.233 | 0.415 | 0.864 | 821 (684–1025) | 0.74 | 0.98 | 29 | 7.15 | 0.90 | 0.28 | |
| SVN | 29 | 0.162 | 0.185 | 0.125 | 0.798 | 385 (353–423) | 0.73 | 0.92 | 27 | 3.69 | 0.85 | 0 | |
| Great Lakes (GL) | RBW | 19 | 0.211 | 0.234 | 0.101 | 0.851 | 1327 (940–2256) | 0.70 | 0.96 | 18 | 5.02 | 0.78 | 0.06 |
| SPN | 30 | 0.183 | 0.213 | 0.142 | 0.840 | 296 (279–316) | 0.70 | 0.96 | 17 | 6.09 | 0.88 | 0.35 | |
| GIL | 28 | 0.185 | 0.230 | 0.198 | 0.840 | 147 (142–152) | 0.72 | 0.98 | 26 | 6.76 | 0.81 | 0.38 | |
| GRN | 29 | 0.190 | 0.262 | 0.274 | 0.916 | 274 (259–290) | 0.70 | 1 | 18 | 6.63 | 0.94 | 0.50 | |
| ELK | 20 | 0.139 | 0.256 | 0.460 | 0.871 | 282 (255–316) | 0.71 | 0.96 | 18 | 6.38 | 0.83 | 0.28 | |
| OKA | 28 | 0.103 | 0.218 | 0.525 | 0.834 | 355 (317–402) | 0.68 | 0.92 | 27 | 6.01 | 0.93 | 0.07 | |
| GNV | 30 | 0.147 | 0.207 | 0.289 | 0.845 | 391 (358–431) | 0.67 | 0.98 | 27 | 9.15 | 0.96 | 0.33 | |
| HOW | 15 | 0.146 | 0.163 | 0.103 | 0.733 | Inf (Inf‐Inf) | 0.68 | 0.87 | 13 | 6.50 | 0.92 | 0.15 | |
| EVE | 7 | 0.057 | 0.136 | 0.582 | 0.646 | Inf (Inf‐Inf) | 0.66 | 0.80 | 7 | 6.13 | 0.86 | 0.57 | |
| CRD | 22 | 0.164 | 0.192 | 0.145 | 0.800 | 47 (46–48) | 0.67 | 0.91 | 11 | 5.52 | 0.91 | 0.45 | |
N is the number of successfully genotyped individuals (Rapture and MHC), H o/H e is observed and expected heterozygosity, respectively; G IS is an F IS‐analogue inbreeding coefficient; Proppoly for the Rapture data is the proportion of total polymorphic loci found in the populations; N e is effective population size (with corresponding confidence intervals, CI); AFdel mut is the average allele frequency of deleterious alleles; Propdel mut is the proportion of all deleterious mutations identified in the study that are found in a given population; Nhaps is the average number of MHC haplotypes in each population; Proppoly for the MHC data is the proportion of individuals in each population that contain more than one allele; and Prop>2 alleles is the proportion of individuals in a population with more than two MHC alleles.
Differentiation among populations was high (average F ST: 0.361, Figure 2a; Table S1). The lowest F ST was measured between two Wisconsin populations found within 10 km of one another (SPN and GIL), and F ST was comparatively low between all three proximal populations in this region (RBW, SPN, GIL, F ST: 0.076–0.105). This pattern was uncommon across the rest of the sampled populations. For example, many of the populations in northern Wisconsin were tightly clustered and highly differentiated, such as Pallette (PLT) and White Sand (WSL) lake, which are <2 km apart and had an F ST of 0.239 and Plum (PLM) and Big Muskellunge (MSK) lake, which are <8 km apart and had an F ST of 0.201. The highest F ST was measured between Atkins Lake (ATK) in northern Wisconsin and EVE in Indiana (F ST: 0.661); however, overall ATK was more differentiated from other populations than any other lake (Figure 2a, average F ST between ATK and other lakes: 0.511). Since ATK also contained a significantly larger number of samples (n = 54) than other sites, we randomly subsampled 30 individuals from ATK and regenerated pairwise F ST values to test whether sample size could be driving the higher levels of differentiation. After reducing the sample size to 30, ATK still maintained the highest average pairwise differentiation with other sites indicating this pattern is not driven by sample size effects (average F ST between reduced ATK and other lakes: 0.490; see Table S2). Outside of ATK, the largest pairwise F ST value was measured between Sugar Lake in Minnesota (SGR) and EVE in Indiana (F ST: 0.575).
FIGURE 2.

A heatmap of pairwise genetic difference among lakes (a) and neighbor‐joining tree indicating the presence of two lineages across our sampling region (b). Two clades in our Mississippian lineage (root marked with *) almost perfectly associated with the Wisconsin and Chippewa River basins. Bootstrap support values are labeled on major branches
Patterns of differentiation across populations indicated sampling occurred across a biogeographic break (Figure 2a), and the population tree indicated two major genetic lineages (Figure 2b) across the sample region. Lakes in Minnesota and northern Wisconsin belonged to one clade, and the lakes in southeastern Wisconsin, Indiana, and Michigan belonged to the other [termed here the Mississippian (MS) and Great Lakes (GL) lineages]. We named these lineages to reflect current Great Lakes and Mississippi River drainage boundaries. Notably, two subclades within the MS lineage almost perfectly sort populations into the Chippewa and Wisconsin River basins (see top right inset in Figure 1 and the * in Figure 2b), with the exception of Pallette (PLT) and Big Muskellunge (MSK) lake, which are in the Chippewa River basin near the border of the Wisconsin River basin but aligned with the Wisconsin River basin populations.
Mantel and partial Mantel tests indicated significant correlation between F ST/(1‐ F ST) and geographic distance (Figure 3; Table S3). A Mantel test with all sites was significant (r = 0.588, p < 0.001), but likely biased by hierarchical structure between the two lineages. Mantel tests restricted to populations within lineages showed significant IBD) in the GL lineage (r = 0.729, p < 0.001), but nonsignificant IBD in the MS lineage (r = 0.282, p = 0.099). When ATK was removed, the test for IBD in the MS lineage was significant (r = 0.466, p = 0.009), indicating that populations in close proximity to ATK had reduced the strength of the IBD relationship. Partial Mantel tests with ATK removed suggested lineage and geographic distance had a significant correlation to F ST/1 − F ST, but that lineage‐corrected geographic distance was a larger factor (r = 0.596, p < 0.001) than distance corrected‐lineage membership (r = 0.190, p = 0.003). Similarly, an AMOVA indicated that lineages accounted for 13.6% of the variance in the data (p < 0.001), whereas populations accounted for 25.6% of the variance (p < 0.009, Table 3). Given the lineage effects on our estimates of genetic differentiation, we limited our exploration of the relationships of variance in genetic differentiation to variance in population size [as di of log (surface area)] and average pairwise differentiation to log (surface area) to the MS lineage with ATK excluded (a total of 18 lakes, see Figure S1). We found no significant correlation between F ST/(1 − F ST) and di of log (surface area) (adjusted r 2 < 0.01, p = 0.615); however, the average pairwise differentiation for a lake was slightly negatively correlated with log (surface area) (adjusted r 2 = 0.22, p = 0.027).
FIGURE 3.

Isolation‐by‐distance graph. Genetic differentiation values (F ST/(1–F ST)) were generated with Rapture data. Within or among lineage pairwise comparisons are indicated by the color of fill—black circles are comparisons within the Great Lakes (GL) lineage, gray circles are comparisons within the Mississippian (MS) lineage, and white circles are comparisons between lineages. Pairwise comparisons with ATK, a highly differentiated population in the MS lineage, were not included in this plot
TABLE 3.
Analysis of molecular variance (AMOVA) with Rapture loci in 29 inland lake cisco populations
| Source of variation | % variance | F statistic | p‐Values |
|---|---|---|---|
| Within individuals | 0.4860 | F IT | — |
| Among individuals in lakes | 0.1223 | F IS | <0.001 |
| Among lakes in lineages | 0.2557 | F SC | <0.001 |
| Among lineages | 0.1358 | F CT | <0.001 |
Observed and expected heterozygosity (H o, H e) varied substantially (H O: 0.057–0.257, H e: 0.125–0.264) with the lowest H o measured in Eve Lake in Indiana (EVE) and the highest in Trude Lake in northern Wisconsin (TRU). Estimates of inbreeding were similarly broad (G IS: 0.028–0.582) with the same two lakes accounting for the extremes (TRU: 0.028, EVE: 0.582). Estimates of effective population size (N e) were highly variable across populations (36–1781, Table 2). Median N e across populations was 296 with most populations exhibiting an N e between 100 and 1000 (n = 22). The two populations that had an N e > 1000—White Sand (WSL) and Rainbow (RBW) lake—were both in Wisconsin, and the three lakes that had an N e < 100—Sugar Lake in Minnesota (SGR), Black Oak Lake in northern Wisconsin (BLK), and Crooked Lake in Indiana (CRD)—were distributed across the sample range. An additional two populations generated ‘infinite’ estimates, EVE in Indiana and Howard Lake (HOW) in Michigan. An ‘infinite’ N e is more likely a sign that the sample sizes in these populations were too small to generate a finite estimate with the LDNE method rather than a reflection of extremely large population sizes (see Do et al., 2014; Waples & Do, 2010). Due to potential sample size effects on estimates of N e and the inability to generate bounded estimates for all sample sites, we did not include N e in the genetic variables used in correlations.
3.2. Identifying putatively deleterious mutations
A total of 912 of the 7546 loci were successfully aligned to protein sequences from the Atlantic salmon genome. Of these 912 loci, 315 had alleles that coded for different protein sequences (i.e., nonsynonymous mutations), and 173 of these mutations (54.9%) were predicted to be putatively deleterious (PROVEAN score ≤ −2.5). The proportion of all identified deleterious mutations found in each population (i.e., with frequency >0 in a given population) averaged 0.95 and varied from 0.80 to 1 (Table 2), and the mean frequencies of deleterious mutations in each population averaged 0.71 and ranged from 0.66 to 0.74. Deleterious mutation metrics were similar between lineages, with an average frequency of deleterious mutations of 0.72 in the MS lineage and 0.69 in the GL lineage and an average ratio of deleterious mutations of 0.96 in the MS lineage and 0.93 in the GL lineage.
3.3. Haplotype diversity in the MHC
A total of 629 fish were successfully genotyped for MHC IIβ exon 2 (Table 2). High sequencing coverage across individuals was achieved, with an average coverage of 205 (standard deviation 123). Using these data, 73 haplotypes were discovered with >1 occurrence. A Z‐test of selection investigating the DN/DS ratio across all haplotypes indicated that DN >> DS (p‐value < 0.001, test statistic =3.81), providing strong evidence that MHC haplotypes in this system have historically undergone balancing selection. Most individuals in the dataset contained at least two MHC haplotypes (82% on average across populations, Table 2), and an average of 28% of individuals across populations contained >2 haplotypes. The rarefied number of haplotypes varied substantially, with estimates from 2.68 in Pike Lake (PIK) to 9.15 in Lake Geneva (GNV, average = 5.93). MHC haplotype frequencies did not closely reflect geographic proximity; large differences in MHC haplotypes were observed at small spatial scales (Figure S2). This was especially apparent for geographically proximate populations in northern Wisconsin. For example, Big Muskellunge (MSK) and White Sand (WSL) lakes had an almost completely different set of haplotypes despite being located <10 km apart. MHC haplotypes were also decoupled from underlying population genomic structure; for instance, the seq00111 and seq00024 haplotypes were found in high frequencies in populations from both major lineages (see next section, Data S1 and S2).
3.4. Relationships among genetic variables and spatial and environmental parameters
Sampled populations varied substantially in habitat metrics (Table 1; Table S4). For example, lake maximum depth varied from 13 to 72 m and lake size varied from 0.3 to 58 km2. Simple visualizations of genetic and environmental variables revealed that variation was not strictly partitioned by geography (Figure 4). For example, populations in northern Wisconsin near latitude 46° and longitude −90° had substantial variation in size, depth, and genetic metrics (Figure 4, Tables 1 and 2). Predictors chosen for initial screening included latitude, surface area, maximum depth, COSD (average and max), minimum oxythermal habitat thickness (average and min), minimum oxythermal habitat volume (average and min), and TDO330day/TDO37day/TDO3max (average and max). Initial correlations indicated that different variants of the same model parameters were highly correlated, for example, the six TDO3 variables had pairwise correlation coefficients ranging from 0.91–1.00 (average r = 0.95). Of the six remaining variables after iterative removal with a threshold of VIF <5 ‐ latitude, surface area, depth, COSD(max), minimum oxythermal habitat volume(average), and TDO3max(max)—two pairs were correlated above the threshold of |r| >0.70 (depth and minimum oxythermal habitat volume, r > 0.86; TDO3max and COSD, r > 0.80). Of these, we arbitrarily excluded minimum oxythermal habitat volume and COSD from the final list of predictors, but we treat interpretations of the remaining variables as potentially supporting for discarded variables.
FIGURE 4.

Visualization of three genetic variables (H o, N hap (MHC), AFdel mut), and one environmental variable (lake size) across our study area. Abbreviations: observed heterozygosity (H o), number of MHC haplotypes per population [N hap (MHC)], the average allele frequency of putatively deleterious mutations in each population (AFdel mut), and the surface area of each lake. Surface area is km2 and was log transformed to facilitate visualization
Results from the RDA (global p < 0.001, 46% of variation in genetic metrics explained by predictor variables, R 2 adj=0.375) demonstrated that AFdel mut and G IS responded differently to predictors and were not strongly associated with other genetic variables, as compared to H o, Propdel mut, and Proppoly (Rapture), which had similar loadings on RDA1&2 (Figure 5). MHC Proppoly and N hap also displayed similar loadings. Latitude contributed most to differences in genetic variables with high loadings on RDA1 (particularly AFdel mut and G IS), whereas lake surface area and maximum depth were associated with differences in genetic variables with high loadings on RDA2. Populations did not cluster tightly by geography indicating that variation in genetic and environmental variables is at least somewhat spatially decoupled. Analysis of variance on the RDA results indicated that latitude and surface area significantly influenced genetic variables (p < 0.01, Table 4). Depth nearly met the threshold for significance (p = 0.053). To assess whether previously removed environmental variables that were correlated with depth or TDO3max (r > 0.80) might explain a significant portion of genomic variance, we replaced these variables and reran the ANOVA. When depth was replaced with minimum oxythermal habitat volume (average), an even lower proportion of variance was explained (p = 0.549). Similarly, when TDO3max was replaced with COSD, the amount of genomic variance explained remained nonsignificant (p = 0.591).
FIGURE 5.

Redundancy analysis (RDA) investigating the influence of spatial and environmental variables (maximum depth, lake surface area, latitude, and TDO3) on genetic data [H O, Proppoly (Rapture), G IS, AFdel mut, Propdel mut, N hap, and Proppoly (MHC)] for each population. Abbreviations for Rapture‐based metrics: observed heterozygosity (H O), proportion of total polymorphic alleles found in the populations [Proppoly (Rapture)], inbreeding coefficient (G IS), the average allele frequency of putatively deleterious mutations in each population (AFdel mut), and the proportion of deleterious mutations found in a population (Propdel mut). Abbreviations for MHC‐based metrics: number of MHC haplotypes per population (N hap), and proportion of individuals with >1 MHC haplotype [Proppoly (MHC)]. Population abbreviations are in red and are identical to those described in Table 1. Latitude and surface area significantly influenced genetic variables according to an analysis of variance (p < 0.01, Table 4). Other environmental variables had p‐values > 0.05
TABLE 4.
Variance inflation factors (VIF) and results from analysis of variance (ANOVA) on redundancy analysis (RDA) to assess which environmental variables significantly influenced genetic variables
| Environmental variable | VIF | Variance | F | p‐value |
|---|---|---|---|---|
| Latitude | 1.08 | 1.73 | 11.10 | 0.001 |
| Surface area (km2) | 1.61 | 1.02 | 6.56 | 0.002 |
| Depth (m) | 1.50 | 0.38 | 2.42 | 0.053 |
| TDO3max (max) | 1.44 | 0.11 | 0.73 | 0.538 |
ANOVA p‐values < 0.05 are in bold. See Figure 5 for RDA visualization.
Multiple linear regressions conducted with seven genetic metrics (Rapture data: H o, G IS, Proppoly, AFdel mut, Propdel mut; MHC data: Proppoly, N hap) as the response variables and four spatial or environmental metrics (latitude, surface area, depth, TDO3max) as the predictor variables indicated that the predictor with the highest relative importance on average was latitude followed by surface area, and maximum depth (Table 5). However, the relative importance of different predictors varied substantially across regressions, with surface area explaining the largest proportion of variance in Ho , Proppoly (Rapture), and N hap (MHC), and latitude explaining the most variance in G IS, AFdel mut, Propdel mut, and Proppoly (MHC). The average amount of variation explained in each regression was 0.38, with the lowest value (0.18) found for Proppoly (MHC) and the highest (0.68) for AFdel mut. Univariate regression with the significant predictor variables in the RDA (latitude and surface area) revealed significant positive relationships between latitude and H o, AFdel mut, and Propdel mut, and significant positive relationships between surface area (log transformed to aid visualization) and H o, Propdel mut, and N haps (Figure 6). H o and Propdel mut were significantly and positively correlated with latitude and surface area, while the other two genetic variables (AFdel mut and N hap) were significantly correlated for one metric, but not the other. Notably, AFdel mut was not significantly correlated with surface area but was highly correlated with latitude (highest r 2 and lowest p‐value of all eight regressions).
TABLE 5.
Results from multiple linear regressions investigating the relationships between genetic and environmental variables
| Relative importance | |||||||
|---|---|---|---|---|---|---|---|
| Dataset | Genetic variable | r 2 | p‐value | Latitude | Surface Area (km2) | Depth (m) | TDO3max |
| Rapture | H O | 0.33 | 0.008 | 0.35 | 0.57 | 0.04 | 0.04 |
| Proppoly | 0.42 | 0.002 | 0.30 | 0.35 | 0.29 | 0.06 | |
| G IS | 0.21 | 0.046 | 0.56 | 0.14 | 0.24 | 0.06 | |
| AFdel mut | 0.68 | 1.51E‐06 | 0.97 | 0.01 | 0.01 | 0.01 | |
| Propdel mut | 0.47 | 5.27E‐04 | 0.55 | 0.19 | 0.22 | 0.04 | |
| MHC | N hap | 0.30 | 0.013 | 0.18 | 0.66 | 0.14 | 0.02 |
| Proppoly | 0.17 | 0.071 | 0.48 | 0.29 | 0.21 | 0.02 | |
| Average | 0.38 | 0.017 | 0.48 | 0.31 | 0.17 | 0.03 | |
The relative importance of each variable in the regression was calculated using the lmg method implemented in the R package relaimpo. p‐Values <0.05 are in bold.
FIGURE 6.

Relationships among four genetic metrics and latitude and lake surface area which were found to significantly influence genetic data in the redundancy analysis (Figure 5). Genetic metrics presented here include observed heterozygosity (H O), the average allele frequency of putatively deleterious mutations in each population (AFdel mut), the proportion of deleterious mutations found in a population (Propdel mut), and the number of MHC haplotypes per population (N hap). p‐Values and correlation coefficients for each regression are denoted
4. DISCUSSION
Our coupled genomic–environmental approach offers new insights into the factors influencing genetic diversity and differentiation at the southern range boundary for cisco. High levels of genetic differentiation between populations were punctuated by a phylogeographic break and residual patterns of isolation‐by‐distance (IBD). Although the prevalence of deleterious mutations and inbreeding coefficients were significantly correlated with latitude, largely neutral (Rapture) and non‐neutral (MHC) genetic diversity were most strongly correlated with lake surface area. Our results provide valuable information for guiding restoration efforts as well as spatial and environmental factors that may govern the persistence of ciscoes and other cold‐water specialist species as the climate continues to warm.
4.1. Historic vicariance and drift in small, isolated populations
The discovery of two divergent sets of populations in our study that we hypothesize are distinct genetic lineages provides new insights on the phylogeographic history of C. artedi. Turgeon and Bernatchez (2001a, 2003) first found evidence with microsatellites for two cisco lineages hypothesized to be associated with putative glacial refugia (see Eshenroder & Jacobson, 2020), and this phylogenetic break was recently detected again in reduced‐representation sequencing data (Piette‐Lauzière et al., 2019). Only a handful of populations in these studies were sampled across the region where the two lineages meet, and thus, the location of the transition from one lineage to the other has been placed in the general area of Lake Nipigon (Ontario, Canada) and Lake Superior. In our study, we found that what we identified as the MS lineage extends below the southern shore of Lake Superior across northern Wisconsin with the first inland populations associated with the GL lineage occurring in central and southeastern Wisconsin. While we hypothesize that our GL and MS lineages correspond to the two lineages identified in previous studies and believe that our data span a boundary between the lineages, it is important to note that our study covers a relatively small portion of the species range and our findings should be reevaluated in future studies that include additional populations. Within MS lineage cisco populations in northern Wisconsin, genetic clades were almost perfectly associated with the Chippewa and Wisconsin River basins. A similar pattern of genetic association within these river basins has also been found in walleye (Sander vitreus; Bootsma, Miller, et al., 2021), suggesting these drainages maintained corridors of migration between nearby refugia before receding water levels isolated inland lakes.
The drift that followed the isolation of inland lake populations is reflected in the high levels of genetic differentiation we found between our inland cisco populations as well as the pattern of increased average pairwise differentiation with decreased lake size in the MS lineage. These high levels of interpopulation differentiation are similar to those observed in other northern temperate lacustrine species including lake trout (Salvelinus namaycush) and brook trout (Salvelinus fontinalis) (Ferchaud et al., 2020; Perrier et al., 2017). In several cases, this high genetic differentiation was found across very small spatial scales. In the Chippewa River basin, White Sand (WSL), Pallette (PLT), Trout (TRT), and Big Muskellunge (MSK) lakes are all <5 km apart, and the level of pairwise differentiation suggests little or no gene flow has occurred among these lakes for thousands of years. Despite the high levels of interpopulation differentiation, a small but significant pattern of IBD was still present in both lineages when the effect of Atkins Lake (ATK) on the MS lineage was removed. ATK had the lowest genetic diversity and smallest proportion of total alleles in the MS lineage. This could indicate that a small population size in ATK is driving increased amounts of drift, differentiation, and inbreeding (Franklin, 1980); however, ATK did not exhibit a correspondingly elevated inbreeding coefficient as observed in two populations with equal or lower genetic diversity, Okauchee (OKA) and Eve (EVE). In light of this, it is worth considering that ATK is unique among our sampled lakes in that it contains the dwarf cisco ecotype. Dwarf coregonines are smaller‐bodied, spawn earlier, and typically exhibit shorter life spans than their alternative ecotype counterparts (Mann & McCart, 1981; Svardson, 1950). Transplanted dwarf cisco maintain their early spawning behavior (Shields & Underhill, 1993) suggesting heritable differences between ecotypes exist, and since contemporarily ATK is only known to contain the dwarf ecotype, this could lead to a small but significant increase in invariant loci when a panel developed to target polymorphic loci in alternative ecotype cisco is used.
4.2. Spatial and environmental correlates of genetic metrics
Loss of adaptive genetic variation and inbreeding depression via the accumulation of deleterious mutations are major threats to the persistence of small, isolated populations (Charlesworth & Willis, 2009; Hedrick & Garcia‐Dorado, 2016; Hedrick & Kalinowski, 2000; Keller & Waller, 2002; Lande, 1995). Therefore, we used a variety of metrics to quantify neutral (Rapture) and non‐neutral (MHC) genetic diversity, inbreeding, and the frequency of putative deleterious mutations in inland lake cisco populations at the southern extent of their range. Understanding how spatial or environmental factors are correlated with any of these risk factors can provide managers with potential avenues for mitigating problems before they lead to extirpation, as well as identify characteristics of suitable lakes for reintroductions.
Latitude and the frequency of putative deleterious mutations exhibited the strongest correlation between a predictor variable and a genetic metric in our data. Across our sampled lakes, the proportion and frequency of deleterious mutations decreased the further south a population was located. While the proportions of deleterious mutations across populations of inland cisco were similar in range to those found in North American lake trout populations in Ferchaud et al. (2018), the allele frequencies we measured appear much larger than those of widespread continuous populations where most deleterious mutations tend to be a minor allele (Conte et al., 2017; Zhang et al., 2016). The most similar systems to our small, isolated inland cisco populations that have had mutational load evaluated are North American lake trout and brook trout populations; however, differences in methodologies make comparisons challenging. For example, Ferchaud et al. (2018), Ferchaud et al. (2020) used the same PROVEAN threshold we did, but interpreted the minor allele as the deleterious mutation regardless of +/− state due to expectations that strongly deleterious mutations should be purged and in low frequency. The range of average deleterious mutation allele frequencies under this assumption for lake trout was 0.12–0.24 (Ferchaud et al., 2018), and when we analyze our data using the same assumption, our average deleterious mutation allele frequencies similarly range from 0.07 to 0.18, so it is possible that Ferchaud et al. (2018) may have also found high deleterious allele frequencies if they had followed the same interpretation we employ here. A PROVEAN score divides nonsynonymous mutations into binary categories of putatively neutral and putatively deleterious, but within identified putatively deleterious mutations there is no way to distinguish mildly deleterious mutations from strongly deleterious mutations. Our inland cisco populations display patterns indicating a strong influence of drift, which reduces the ability of selection to purge mutational load, especially mildly deleterious mutations, which do not have large fitness effects (Crow, 1993; Glemin, 2003; Kimura et al., 1963). The relatively high average allele frequencies of deleterious mutations we measured could be reflective of an accumulation of mildly deleterious mutations rather than high frequencies of strongly deleterious mutations.
Predicting what patterns of mutational load we might expect to see in inland cisco populations along the southern (trailing) edge of the species range is not straightforward. Several studies have observed increased mutational load on the edges of a species range (Henry et al., 2015; A. Perrier et al., 2020; Willi et al., 2018; Zhang et al., 2016) as a result of increasingly smaller peripheral populations driving increased genetic drift and less effective purifying selection (Crow, 1993; Kimura et al., 1963; Whitlock, 2000). Alternately, many studies have simulated or observed increased mutational load and decreased heterozygosity specifically at the leading edge of a species range as a result of the demographic processes associated with colonization (Lohmueller et al., 2008; Peischl et al., 2013; Rougemont et al., 2020). Our observations of increased heterozygosity and increased mutational load with latitude in southern inland cisco populations do not perfectly align with either of these previously observed patterns of mutational load. The majority of study systems that have been used to examine mutational load across species ranges thus far have exhibited at least some level of migration and interpopulation gene flow (e.g., Rougemont et al., 2020; Willi et al., 2018; Zhang et al., 2016), unlike cisco populations, which were successively isolated in lakes as glaciers receded over thousands of years (Dyke & Prest, 1987). The small but significant correlations for decreased heterozygosity and increased inbreeding we observed the further south a population was found could expose recessive deleterious mutations to selection and lead to purging rather than accumulating mutational load (Hedrick, 1994; Hedrick & Garcia‐Dorado, 2016; Kirkpatrick & Jarne, 2000; Lande & Schemske, 1985; Lynch et al., 1995). Perrier et al. (2017) investigated the proportion of deleterious mutations in isolated populations of lake trout in Quebec, Canada, and although they did not examine whether these proportions correlated with latitude, they similarly detected a weak but significant correlation between latitude and inbreeding, suggesting positive associations between latitude and the frequency of deleterious mutations may exist in other glacial lake populations. Ultimately, extended sampling across a larger portion of the species range is needed to assess the relative roles of drift and selection versus demographic processes such as directional colonization and lineage admixture on the latitudinal pattern of mutational load we observed in southern inland lake cisco populations.
Genetic diversity in inland cisco populations was also linked to lake‐specific environmental variation, reflecting both static and dynamic variables. The number of MHC haplotypes observed in each population, which showed no association with latitude, was positively correlated to lake surface area. No other patterns among populations or correlations with environmental factors were found in the MHC IIβ exon 2, including between lineages, suggesting that diversity in this gene may be linked to population size and drift. Some measures of diversity that were associated with broad latitudinal patterns were also correlated to lake surface area, including neutral diversity and the proportion of deleterious mutations in cisco populations. A positive relationship between genetic diversity and lake size has also been found in several other salmonids, including lake trout (Perrier et al., 2017; Valiquette et al., 2014) and brook trout (Castric et al., 2001), as well as three‐spined stickleback (Gasterosteus aculeatus) (Bolnick & Ballare, 2020; Caldera & Bolnick, 2008). Moreover, Bolnick and Ballare (2020) found that genome‐wide heterozygosity in lacustrine three‐spined stickleback was not correlated with resource diversity and morphological variation, consistent with the expectation that larger population sizes supported by larger lakes experience less drift and therefore increased neutral genetic diversity. Decreasing amounts of drift in larger effective population sizes has also been suspected of driving the positive correlation of proportion of deleterious mutations and lake size observed in lake trout (Perrier et al., 2017).
It is notable that estimates of TDO3 were not significantly correlated with any genetic variables, and this may reflect the complexities of using one‐dimensional modeling to describe three‐dimensional habitat. TDO3 and related metrics are useful, but these metrics and corresponding one‐dimensional modeling and deep‐hole lake monitoring cannot capture isolated pockets of refugia habitat that are often present in large lakes. For example, when using three‐dimensional thermal and oxygen models to simulate Lake Mendota, Wisconsin, results indicated that the majority of cisco habitat in late summer may actually be around the 12–15 m contour of the lake where internal seiche and downwelling events are more likely to occur, bringing pockets of oxygenated water into the hypolimnion, which may act as refugia habitat to sustain the remnant cisco population (see Figure S3). Similarly, by their nature, one‐dimensional models also cannot capture these pockets of refugia within large lakes with multiple deep locations and abrupt changes in bathymetry. In addition, TDO3 only captures one important lake characteristic influencing inland cisco populations. Lake size, which was correlated with genetic diversity, likely encompasses a multitude of factors influencing population size such as prey availability, refuge habitat from predators, and spawning habitat.
4.3. Conserving inland lake cisco populations at the southern extent of their range
Genomic data can provide valuable information to guide decision making in fish conservation (Bernos et al., 2020), and our results provide multiple useful inferences for conserving marginal cisco populations. Most obviously, knowing the extent of the two phylogeographic lineages as well as the relationships between deleterious mutations with latitude and diversity with lake surface area can help inform common freshwater conservation strategies including reintroduction of extirpated populations and the augmentation of diminished or imperiled populations through outcrossing (Anderson et al., 2014; Lamothe & Drake, 2019; Marsh et al., 2005; Weise et al., 2020). Lutz et al. (2021) found that using multiple, compatible source populations for reintroduction of endangered Macquarie perch (Macquaria australasica) benefitted survival and recruitment, and a similar approach may maximize successful reintroductions of inland lake cisco given the apparent influence of population size and drift over selection and adaptation in inland lake populations. In Wisconsin, where historic MS and GL lineages converge, careful consideration should be given before implementing a reintroduction or augmentation plan that involves mixing divergent genetic lineages as the presence of fixed differences in highly divergent populations is a risk factor for outbreeding depression (Frankham et al., 2011). A current reintroduction initiative involving live‐transfers of cisco from WSL to nearby Crystal and Sparkling lakes in fall 2020 will provide important baseline information for the use of regional single‐ and multiple‐source strategies more broadly.
Augmentation through supplementation is rare in inland cisco populations compared to other northern lacustrine salmonids such as lake whitefish (Coregonus clupeaformis) and lake trout, which are more likely to be targeted by recreational fishers (Frey, 1955). A handful of records from cisco stocking events in the early 20th century in Indiana and Michigan exist (Frey, 1955; Homola et al., 2021), and increases in mitochondrial diversity from historic stocking in a few Michigan lakes has been observed (Homola et al., 2021). To the best of our knowledge, none of the lakes included in our study were ever stocked, and Homola et al. (2021) found no evidence of stocking in the one lake we sampled in Michigan (HOW). The broad patterns in IBD and frequency of deleterious mutations we observed across our sampled lakes further support a lack of evidence for stocking events. Contemporarily, conservation measures for inland lake cisco populations focus on mitigating cold‐water habitat degradation and controlling harvest (e.g., Jacobson et al., 2010, 2012; Tingley et al., 2019) rather than supplementation. With little empirical evidence available for the influences of supplementation on inland lake cisco, it is difficult to predict the viability of this strategy to retain beneficial genetic diversity. Genetic rescue of depleted populations through outcrossing has been shown to increase diversity and fitness in many small, isolated populations (Fitzpatrick et al., 2020; Frankham, 2015, 2016). There is some contention, however, on whether ideal source populations used for augmentation should maximize the genetic diversity of target populations (Ralls et al., 2020) or minimize the introduction of strongly deleterious mutations (Kyriazis et al., 2020; Robinson et al., 2020). Ferchaud et al. (2018) found increased neutral diversity and fewer deleterious mutations in stocked versus unstocked populations of lake trout, attributing this trend to a reduction of the influence of drift through supplementation. Given the pattern of decreased mutational load in southern inland lake cisco populations, however, the introduction of new deleterious mutations could be problematic. Additionally, supplementing inland lake cisco populations with fish from the Laurentian Great Lakes could be detrimental regardless of mutational load since these populations have been observed to contain different phenotypic and behavioral adaptations from native inland lake populations (Jacobson et al., 2018), which are also risk factors for outbreeding depression (Frankham et al., 2011). Given the unpredictability of augmentation, it may be best to document interpopulation interactions through cisco reintroduction initiatives before applying this approach to vulnerable extant populations.
The correlation we observed between genetic diversity and lake surface area suggests one key to resilience in southern inland lake cisco populations may be maintaining populations that are large and diverse enough to survive the stochastic demographic, environmental, or genetic factors that drive the extinction of small populations (Frankham, 2008; Pimm, 1991). The predictor variables examined for correlations to genetic metrics in our study are only a subset of the landscape heterogeneity predicted to play a role in the persistence of inland lake cisco populations. While larger lake size was found to be a good predictor of the presence of historic cisco populations in Indiana (Honsey et al., 2016), non‐point‐source nutrient loading, urbanization, and sedimentation from land‐use are all observed or predicted to influence the quality of contemporary cisco habitat in southern lakes (Honsey et al., 2016; Jacobson et al., 2012; Magee et al., 2019). Additional threats are lake‐specific, such as the introduction of invasive species like rainbow smelt Osmerus mordax (Hrabik et al., 1998). The combined interaction of these idiosyncratic factors with fine‐scale heterogeneity in genetic diversity likely explains the lack of a predictable pattern to extirpations (Lyons et al., 2015; Renik et al., 2020). Current recommendations for protecting suitable cold‐water habitat suggest management at the catchment scale (Jacobson et al., 2013) jointly with adapted regional plans to address threats such as climate change (Tingley et al., 2019), and our findings of broad, latitudinal, and local‐scale patterns of maladaptation and genetic diversity support this approach for conserving southern inland lake cisco populations. The mosaic pattern of observed extirpations in this region coupled with the high levels of differentiation we observed at small spatial scales and correlations of lake size and diversity suggest that genetic resilience in inland lake cisco populations is closely associated to levels of genetic drift. The most successful conservation strategies, therefore, may be those that adequately account for the fine‐scale environmental processes that influence population sizes in order to reduce the chance of stochastic natural or anthropogenic events resulting in further extirpations.
CONFLICT OF INTEREST
The authors declare no conflict of interest.
Supporting information
Table S1
Table S2
Table S3
Figure S1
Figure S2
Data S1
Data S2
Table S4
Figure S3
ACKNOWLEDGMENTS
We thank Jeremiah Gorne, Justin Haglund, Martin Jennings, Jeff Kampa, Paul Kanehl, John Lyons, Tim Parks, Dan Walchak, and staff of the Escanaba Lake Research Station with the Wisconsin Department of Natural Resources, Tyler Ahrenstorff with the Minnesota Department of Natural Resources, Tom Hrabik with the University of Minnesota‐Duluth, Tomas Höök and Andrew Honsey from Purdue University, Matthew Linn with the Indiana Department of Natural Resources, and Kim Scribner at Michigan State University for providing the fish tissues used in this study. We thank John Bauman at the Michigan Department of Natural Resources, Zachary Feiner from Purdue University, and Jordan Read at the U.S. Geological Survey for temperature and dissolved oxygen profile information and Rob Mooney, Allison Moody, Austin Milt, and Vince Butitta for assistance with lake data collection. We thank Jared Homola, Julie Turgeon, and three anonymous reviewers for their comments on earlier versions of this manuscript. This research was supported by the Research Computing clusters at Old Dominion University. Funding for this work was provided by the USGS Northeast Climate Adaptation Science Center, Packard Fellowship, US Fish and Wildlife Service, Federal Aid in Sportfish Restoration program, and the Wisconsin of Department of Natural Resources. Any use of trade, product, or firm names is for descriptive purposes only and does not imply endorsement by the U.S. Government.
Ackiss, A. S. , Magee, M. R. , Sass, G. G. , Turnquist, K. , McIntyre, P. B. , & Larson, W. A. (2021). Genomic and environmental influences on resilience in a cold‐water fish near the edge of its range. Evolutionary Applications, 14, 2794–2814. 10.1111/eva.13313
DATA AVAILABILITY STATEMENT
Sequence data for the 129 individuals used in the development of the Rapture panel were previously uploaded to the National Center for Biotechnology Information (NCBI) sequence read archive (BioProject: PRJNA557717). Rapture sequence data used in analyses in this manuscript were georeferenced in the Genomic Observatories MetaDatabase (https://n2t.net/ark:21547/DwZ2) and uploaded to the NCBI sequence read archive (BioProject: PRJNA771945). Scripts and supplemental material for the hydrodynamic model will be uploaded to Dryad upon acceptance.
REFERENCES
- Agrawal, A. F. , & Whitlock, M. C. (2012). Mutation load: The fitness of individuals in populations where deleterious alleles are abundant. Annual Review of Ecology, Evolution, and Systematics, 43(1), 115–135. 10.1146/annurev-ecolsys-110411-160257 [DOI] [Google Scholar]
- Ali, O. A. , O'Rourke, S. M. , Amish, S. J. , Meek, M. H. , Luikart, G. , Jeffres, C. , & Miller, M. R. (2016). RAD capture (Rapture): Flexible and efficient sequence‐based genotyping. Genetics, 202(2), 389–400. 10.1534/genetics.115.183665 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Anderson, J. H. , Pess, G. R. , Carmichael, R. W. , Ford, M. J. , Cooney, T. D. , Baldwin, C. M. , & McClure, M. M. (2014). Planning pacific salmon and steelhead reintroductions aimed at long‐term viability and recovery. North American Journal of Fisheries Management, 34(1), 72–93. 10.1080/02755947.2013.847875 [DOI] [Google Scholar]
- Andrews, S. (2010). FastQC: A quality control tool for high throughput sequence data. Retrieved from http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ [Google Scholar]
- Bay, R. A. , Harrigan, R. J. , Underwood, V. L. , Gibbs, H. L. , Smith, T. B. , & Ruegg, K. (2018). Genomic signals of selection predict climate‐driven population declines in a migratory bird. Science, 359(6371), 83. 10.1126/science.aan4380 [DOI] [PubMed] [Google Scholar]
- Berg, P. R. , Jentoft, S. , Star, B. , Ring, K. H. , Knutsen, H. , Lien, S. , Jakobsen, K. S. , & André, C. (2015). Adaptation to low salinity promotes genomic divergence in Atlantic Cod (Gadus morhua L.). Genome Biology and Evolution, 7(6), 1644–1663. 10.1093/gbe/evv093 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bernatchez, L. , & Dodson, J. J. (1990). Mitochondrial DNA variation among anadromous populations of cisco (Coregonus artedii) as revealed by restriction analysis. Canadian Journal of Fisheries and Aquatic Sciences, 47(3), 533–543. 10.1139/f90-060 [DOI] [Google Scholar]
- Bernos, T. A. , Jeffries, K. M. , & Mandrak, N. E. (2020). Linking genomics and fish conservation decision making: A review. Reviews in Fish Biology and Fisheries, 30(4), 587–604. 10.1007/s11160-020-09618-8 [DOI] [Google Scholar]
- Blumstein, D. M. , Campbell, M. A. , Hale, M. C. , Sutherland, B. J. G. , McKinney, G. J. , Stott, W. , & Larson, W. A. (2020). Comparative genomic analyses and a novel linkage map for cisco (Coregonus artedi) provide insights into chromosomal evolution and rediploidization across salmonids. G3: Genes|Genomes|Genetics, 10(8), 2863. 10.1534/g3.120.401497 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolnick, D. I. , & Ballare, K. M. (2020). Resource diversity promotes among‐individual diet variation, but not genomic diversity, in lake stickleback. Ecology Letters, 23(3), 495–505. 10.1111/ele.13448 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bootsma, M. L. , Gruenthal, K. M. , McKinney, G. J. , Simmons, L. , Miller, L. , Sass, G. G. , & Larson, W. A. (2020). A GT‐seq panel for walleye (Sander vitreus) provides important insights for efficient development and implementation of amplicon panels in non‐model organisms. Molecular Ecology Resources. 20, 1706–1722. 10.1111/1755-0998.13226 [DOI] [PubMed] [Google Scholar]
- Bootsma, M. L. , Miller, L. , Sass, G. G. , Euclide, P. T. , & Larson, W. A. (2021). The ghosts of propagation past: Haplotype information clarifies the relative influence of stocking history and phylogeographic processes on contemporary population structure of walleye (Sander vitreus). Evolutionary Applications, 14, 1124–1144. 10.1111/eva.13186 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cahn, A. R. (1927). An ecological study of southern Wisconsin fishes: The brook silversides (Labidesthes sicculus) and the cisco (Leucichthys artedi) in their relations to the region (Vol. 11). The University of Illinois. [Google Scholar]
- Caldera, E. J. , & Bolnick, D. I. (2008). Effects of colonization history and landscape structure on genetic variation within and among threespine stickleback (Gasterosteus aculeatus) populations in a single watershed. Evolutionary Ecology Research, 10(4), 575–598. [Google Scholar]
- Campbell, N. R. , Harmon, S. A. , & Narum, S. R. (2015). Genotyping‐in‐Thousands by sequencing (GT‐seq): A cost effective SNP genotyping method based on custom amplicon sequencing. Molecular Ecology Resources, 15(4), 855–867. 10.1111/1755-0998.12357 [DOI] [PubMed] [Google Scholar]
- Castric, V. , Bonney, F. , & Bernatchez, L. (2001). Landscape structure and hierarchical genetic diversity in the Brook Charr, Salvelinus fontinalis . Evolution, 55(5), 1016–1028. 10.1111/j.0014-3820.2001.tb00618.x [DOI] [PubMed] [Google Scholar]
- Catchen, J. M. , Amores, A. , Hohenlohe, P. A. , Cresko, W. A. , & Postlethwait, J. H. (2011). Stacks: Building and genotyping loci de novo from short‐read sequences. G3 (Bethesda), 1(3), 171–182. 10.1534/g3.111.000240 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Catchen, J. M. , Hohenlohe, P. , Bassham, S. , Amores, A. , & Cresko, W. A. (2013). Stacks: An analysis tool set for population genomics. Molecular Ecology, 22(11), 3124–3140. 10.1111/mec.12354 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Charlesworth, D. , & Willis, J. H. (2009). The genetics of inbreeding depression. Nature Reviews Genetics, 10(11), 783–796. 10.1038/nrg2664 [DOI] [PubMed] [Google Scholar]
- Choi, Y. , Sims, G. E. , Murphy, S. , Miller, J. R. , & Chan, A. P. (2012). Predicting the functional effect of amino acid substitutions and indels. PLoS One, 7(10), e46688. 10.1371/journal.pone.0046688 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Conte, G. L. , Hodgins, K. A. , Yeaman, S. , Degner, J. C. , Aitken, S. N. , Rieseberg, L. H. , & Whitlock, M. C. (2017). (2017) Bioinformatically predicted deleterious mutations reveal complementation in the interior spruce hybrid complex. BMC Genomics, 18, 970. 10.1186/s12864-017-4344-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crow, J. F. (1993). Mutation, mean fitness, and genetic load. Oxford Series in Evolutionary Biology, 9, 3–42. [Google Scholar]
- Danecek, P. , Auton, A. , Abecasis, G. , Albers, C. A. , Banks, E. , DePristo, M. A. , Handsaker, R. E. , Lunter, G. , Marth, G. T. , Sherry, S. T. , McVean, G. , & Durbin, R. , & Genomes Project Analysis, G (2011). The variant call format and VCFtools. Bioinformatics, 27(15), 2156–2158. 10.1093/bioinformatics/btr330 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Do, C. , Waples, R. S. , Peel, D. , Macbeth, G. , Tillett, B. J. , & Ovenden, J. R. (2014). NeEstimator v2: Re‐implementation of software for the estimation of contemporary effective population size (Ne) from genetic data. Molecular Ecology Resources, 14(1), 209–214. [DOI] [PubMed] [Google Scholar]
- Dormann, C. F. , Elith, J. , Bacher, S. , Buchmann, C. , Carl, G. , Carré, G. , Marquéz, J. R. G. , Gruber, B. , Lafourcade, B. , Leitão, P. J. , Münkemüller, T. , McClean, C. , Osborne, P. E. , Reineking, B. , Schröder, B. , Skidmore, A. K. , Zurell, D. , & Lautenbach, S. (2013). Collinearity: A review of methods to deal with it and a simulation study evaluating their performance. Ecography, 36(1), 27–46. 10.1111/j.1600-0587.2012.07348.x [DOI] [Google Scholar]
- Dyke, A. , & Prest, V. (1987). Late Wisconsinan and holocene history of the Laurentide ice sheet. Géographie Physique et Quaternaire, 41(2), 237–263. 10.7202/032681ar [DOI] [Google Scholar]
- Eizaguirre, C. , Lenz, T. L. , Kalbe, M. , & Milinski, M. (2012). Divergent selection on locally adapted major histocompatibility complex immune genes experimentally proven in the field. Ecology Letters, 15(7), 723–731. 10.1111/j.1461-0248.2012.01791.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- EPA, U. S. (1975). Report on Barton Lake Kalamazoo County Michigan EPA Region V (Working Paper No. 183). Retrieved from https://nepis.epa.gov/ [Google Scholar]
- Eshenroder, R. L. , & Jacobson, P. C. (2020). Speciation in cisco with emphasis on secondary contacts, plasticity, and hybridization. Transactions of the American Fisheries Society, 149(6), 721–740. 10.1002/tafs.10267 [DOI] [Google Scholar]
- Euclide, P. T. , McKinney, G. J. , Bootsma, M. , Tarsa, C. , Meek, M. H. , & Larson, W. A. (2020). Attack of the PCR clones: Rates of clonality have little effect on RAD‐seq genotype calls. Molecular Ecology Resources, 20(1), 66–78. 10.1111/1755-0998.13087 [DOI] [PubMed] [Google Scholar]
- Fay, J. C. , Wyckoff, G. J. , & Wu, C.‐I. (2001). Positive and negative selection on the human genome. Genetics, 158(3), 1227–1234. 10.1093/genetics/158.3.1227 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ferchaud, A. L. , Laporte, M. , Perrier, C. , & Bernatchez, L. (2018). Impact of supplementation on deleterious mutation distribution in an exploited salmonid. Evolutionary Applications, 11(7), 1053–1065. 10.1111/eva.12660 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ferchaud, A.‐L. , Leitwein, M. , Laporte, M. , Boivin‐Delisle, D. , Bougas, B. , Hernandez, C. , & Bernatchez, L. (2020). Adaptive and maladaptive genetic diversity in small populations: Insights from the Brook Charr (Salvelinus fontinalis) case study. Molecular Ecology, 29(18), 3429–3445. 10.1111/mec.15566 [DOI] [PubMed] [Google Scholar]
- Fitzpatrick, S. W. , Bradburd, G. S. , Kremer, C. T. , Salerno, P. E. , Angeloni, L. M. , & Funk, W. C. (2020). Genomic and fitness consequences of genetic rescue in wild populations. Current Biology, 30(3), 517–522.e515. 10.1016/j.cub.2019.11.062 [DOI] [PubMed] [Google Scholar]
- Frankham, R. (2008). Genetic adaptation to captivity in species conservation programs. Molecular Ecology, 17(1), 325–333. 10.1111/j.1365-294X.2007.03399.x [DOI] [PubMed] [Google Scholar]
- Frankham, R. (2015). Genetic rescue of small inbred populations: Meta‐analysis reveals large and consistent benefits of gene flow. Molecular Ecology, 24(11), 2610–2618. 10.1111/mec.13139 [DOI] [PubMed] [Google Scholar]
- Frankham, R. (2016). Genetic rescue benefits persist to at least the F3 generation, based on a meta‐analysis. Biological Conservation, 195, 33–36. 10.1016/j.biocon.2015.12.038 [DOI] [Google Scholar]
- Frankham, R. , Ballou, J. D. , Eldridge, M. D. B. , Lacy, R. C. , Ralls, K. , Dudash, M. R. , & Fenster, C. B. (2011). Predicting the probability of outbreeding depression. Conservation Biology, 25(3), 465–475. 10.1111/j.1523-1739.2011.01662.x [DOI] [PubMed] [Google Scholar]
- Franklin, I. R. (1980). Evolutionary change in small populations. Evolutionary change in small populations. In Soulé M. E. & Wilcox B. A. (Eds.), Conservation biology: An evolutionary‐ecological perspective (pp. 135–150). Sinauer Associates. [Google Scholar]
- Frey, D. G. (1955). Distributional ecology of the cisco (Coregonus artedii) in Indiana. Invest. Indiana Lakes Streams, 4(7), 177–227. [Google Scholar]
- Glémin, S. (2003). How are deleterious mutations purged? Drift versus nonrandom mating. Evolution, 57(12), 2678–2687. [DOI] [PubMed] [Google Scholar]
- Graham, C. F. , Glenn, T. C. , McArthur, A. G. , Boreham, D. R. , Kieran, T. , Lance, S. , & Somers, C. M. (2015). Impacts of degraded DNA on restriction enzyme associated DNA sequencing (RADSeq). Molecular Ecology Resources, 15(6), 1304–1315. 10.1111/1755-0998.12404 [DOI] [PubMed] [Google Scholar]
- Groemping, U. (2007). Relative importance for linear regression in R: The package relaimpo. Journal of Statistical Software, 17(1), 1‐27. 10.18637/jss.v017.i01 [DOI]
- Grossen, C. , Guillaume, F. , Keller, L. F. , & Croll, D. (2020). Purging of highly deleterious mutations through severe bottlenecks in Alpine ibex. Nature Communications, 11(1), 1001. 10.1038/s41467-020-14803-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haag, C. R. , & Roze, D. (2007). Genetic load in sexual and asexual diploids: Segregation, dominance and genetic drift Genetics, 176(3), 1663. 10.1534/genetics.107.073080 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harrell, F. E. , & Dupont, M. C. (2012). Package ‘Hmisc’. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing.
- Harstad, H. , Lukacs, M. F. , Bakke, H. G. , & Grimholt, U. (2008). Multiple expressed MHC class II loci in salmonids; details of one non‐classical region in Atlantic salmon (Salmo salar). BMC Genomics, 9, 193. 10.1186/1471-2164-9-193 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hedrick, P. W. (1994). Purging inbreeding depression and the probability of extinction: Full‐sib mating. Heredity, 73(4), 363–372. 10.1038/hdy.1994.183 [DOI] [PubMed] [Google Scholar]
- Hedrick, P. W. , & Garcia‐Dorado, A. (2016). Understanding inbreeding depression, purging, and genetic rescue. Trends in Ecology & Evolution, 31(12), 940–952. 10.1016/j.tree.2016.09.005 [DOI] [PubMed] [Google Scholar]
- Hedrick, P. W. , & Kalinowski, S. T. (2000). Inbreeding depression in conservation biology. Annual Review of Ecology and Systematics, 31(1), 139–162. 10.1146/annurev.ecolsys.31.1.139 [DOI] [Google Scholar]
- Henry, R. C. , Bartoń, K. A. , & Travis, J. M. J. (2015). Mutation accumulation and the formation of range limits. Biology Letters, 11(1), 20140871. 10.1098/rsbl.2014.0871 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hill, W. G. (1981). Estimation of effective population size from data on linkage disequilibrium. Genetical Research, 38, 7. 10.1017/s0016672300020553 [DOI] [Google Scholar]
- Hipsey, M. R. , Boon, C. , Paraska, D. , Bruce, L. C. , & Huang, P. (2020). Modelling aquatic eco‐dynamics: Overview of the AED modular simulation platform. Retrieved from https://aquaticecodynamics.github.io/aed‐science/ [Google Scholar]
- Hipsey, M. R. , Bruce, L. C. , Boon, C. , Busch, B. , Carey, C. C. , Hamilton, D. P. , Hanson, P. C. , Read, J. S. , de Sousa, E. , Weber, M. , & Winslow, L. A. (2019). A General Lake Model (GLM 3.0) for linking with high‐frequency sensor data from the Global Lake Ecological Observatory Network (GLEON). Geoscientific Model Development, 12(1), 473–523. 10.5194/gmd-12-473-2019 [DOI] [Google Scholar]
- Hoban, S. M. , Hauffe, H. C. , Pérez‐Espona, S. , Arntzen, J. W. , Bertorelle, G. , Bryja, J. , Frith, K. , Gaggiotti, O. E. , Galbusera, P. , Godoy, J. A. , Hoelzel, A. R. , Nichols, R. A. , Primmer, C. R. , Russo, I.‐R. , Segelbacher, G. , Siegismund, H. R. , Sihvonen, M. , Vernesi, C. , Vilà, C. , & Bruford, M. W. (2013). Bringing genetic diversity to the forefront of conservation policy and management. Conservation Genetics Resources, 5(2), 593–598. 10.1007/s12686-013-9859-y [DOI] [Google Scholar]
- Homola, J. J. , Robinson, J. D. , Kanefsky, J. , Stott, W. , Whelan, G. E. , & Scribner, K. T. (2021). Coalescent methods reconstruct contributions of natural colonization and stocking to origins of Michigan inland Cisco (Coregonus artedi). Journal of Great Lakes Research, 10.1016/j.jglr.2021.08.008 [DOI] [Google Scholar]
- Honsey, A. E. (2014). The decline of Cisco Coregonus artedi at its southern range extent: Stock biology and management implications. Purdue University. [Google Scholar]
- Honsey, A. E. , Donabauer, S. B. , & Höök, T. O. (2016). An analysis of lake morphometric and land‐use characteristics that promote persistence of cisco in Indiana. Transactions of the American Fisheries Society, 145(2), 363–373. 10.1080/00028487.2015.1125949 [DOI] [Google Scholar]
- Hrabik, T. R. , Magnuson, J. J. , & McLain, A. S. (1998). Predicting the effects of rainbow smelt on native fishes in small lakes: Evidence from long‐term research on two lakes. Canadian Journal of Fisheries and Aquatic Sciences, 55(6), 1364–1371. 10.1139/f98-032 [DOI] [Google Scholar]
- Jacobson, P. C. , Cross, T. K. , Zandlo, J. , Carlson, B. N. , & Pereira, D. (2012). The effects of climate change and eutrophication on cisco Coregonus artedi abundance in Minnesota lakes. Advances in Limnology, 63, 417–427. 10.1127/advlim/63/2012/417 [DOI] [Google Scholar]
- Jacobson, P. C. , Emms, S. K. , Cruise, J. L. , Zimmer, K. D. , Grow, R. C. , Nielsen, J. , Tipp, A. R. , & Miller, L. M. (2018). Retention of morphological and ecological traits of Lake Superior cisco Coregonus artedi after translocation into inland lakes. Journal of Great Lakes Research, 44(2), 289–298. 10.1016/j.jglr.2017.12.005 [DOI] [Google Scholar]
- Jacobson, P. C. , Fang, X. , Stefan, H. G. , & Pereira, D. L. (2013). Protecting Cisco (Coregonus artedi Lesueur) oxythermal habitat from climate change: Building resilience in deep lakes using a landscape approach. Advances in Limnology, 64, 323–333. [Google Scholar]
- Jacobson, P. C. , Jones, T. S. , Rivers, P. , & Pereira, D. L. (2008). Field estimation of a lethal Oxythermal Niche boundary for adult ciscoes in Minnesota lakes. Transactions of the American Fisheries Society, 137(5), 1464–1474. 10.1577/T07-148.1 [DOI] [Google Scholar]
- Jacobson, P. C. , Stefan, H. G. , & Pereira, D. L. (2010). Coldwater fish oxythermal habitat in Minnesota lakes: Influence of total phosphorus, July air temperature, and relative depth. Canadian Journal of Fisheries and Aquatic Sciences, 67(12), 2002–2013. 10.1139/F10-115 [DOI] [Google Scholar]
- Jiang, L. , Fang, X. , Stefan, H. G. , Jacobson, P. C. , & Pereira, D. L. (2012). Oxythermal habitat parameters and identifying cisco refuge lakes in Minnesota under future climate scenarios using variable benchmark periods. Ecological Modelling, 232, 14–27. 10.1016/j.ecolmodel.2012.02.014 [DOI] [Google Scholar]
- Kalinowski, S. T. (2005). hp‐rare 1.0: A computer program for performing rarefaction on measures of allelic richness. Molecular Ecology Notes, 5(1), 187–189. 10.1111/j.1471-8286.2004.00845.x [DOI] [Google Scholar]
- Keightley, P. D. , & Lynch, M. (2003). Toward a realistic model of mutations affecting fitness. Evolution, 57(3), 683–685. 10.1111/j.0014-3820.2003.tb01561.x [DOI] [PubMed] [Google Scholar]
- Keller, L. F. , & Waller, D. M. (2002). Inbreeding effects in wild populations. Trends in Ecology & Evolution, 17(5), 230–241. 10.1016/S0169-5347(02)02489-8 [DOI] [Google Scholar]
- Kimura, M. , Maruyama, T. , & Crow, J. F. (1963). The mutation load in small populations. Genetics, 48(10), 1303–1312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirkpatrick, M. , & Jarne, P. (2000). The effects of a bottleneck on inbreeding depression and the genetic load. The American Naturalist, 155(2), 154–167. 10.1086/303312 [DOI] [PubMed] [Google Scholar]
- Kyriazis, C. C. , Wayne, R. K. , & Lohmueller, K. E. (2020). Strongly deleterious mutations are a primary determinant of extinction risk due to inbreeding depression. Evolution Letters, 5(1), 33–47. 10.1002/evl3.209 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lamothe, K. A. , & Drake, D. A. R. (2019). Moving repatriation efforts forward for imperilled Canadian freshwater fishes. Canadian Journal of Fisheries and Aquatic Sciences, 76(10), 1914–1921. 10.1139/cjfas-2018-0295 [DOI] [Google Scholar]
- Lande, R. (1995). Mutation and conservation. Conservation Biology, 9(4), 782–791. 10.1046/j.1523-1739.1995.09040782.x [DOI] [Google Scholar]
- Lande, R. , & Schemske, D. W. (1985). The evolution of self‐fertilization and inbreeding depression in plants. I. Genetic Models Evolution, 39(1), 24–40. 10.1111/j.1558-5646.1985.tb04077.x [DOI] [PubMed] [Google Scholar]
- Larson, W. A. , Lisi, P. J. , Seeb, J. E. , Seeb, L. W. , & Schindler, D. E. (2016). Major histocompatibility complex diversity is positively associated with stream water temperatures in proximate populations of sockeye salmon. Journal of Evolutionary Biology, 29(9), 1846–1859. 10.1111/jeb.12926 [DOI] [PubMed] [Google Scholar]
- Legendre, L. , & Legendre, P. (1998). Numerical ecology. Elsevier. [Google Scholar]
- Lien, S. , Koop, B. F. , Sandve, S. R. , Miller, J. R. , Kent, M. P. , Nome, T. , Hvidsten, T. R. , Leong, J. S. , Minkley, D. R. , Zimin, A. , Grammes, F. , Grove, H. , Gjuvsland, A. , Walenz, B. , Hermansen, R. A. , von Schalburg, K. , Rondeau, E. B. , Di Genova, A. , Samy, J. K. A. , … Davidson, W. S. (2016). The Atlantic salmon genome provides insights into rediploidization. Nature, 533, 200. 10.1038/nature17164 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lischer, H. E. , & Excoffier, L. (2011). PGDSpider: An automated data conversion tool for connecting population genetics and genomics programs. Bioinformatics, 28(2), 298–299. 10.1093/bioinformatics/btr642 [DOI] [PubMed] [Google Scholar]
- Lisi, P. J. , Schindler, D. E. , Bentley, K. T. , & Pess, G. R. (2013). Association between geomorphic attributes of watersheds, water temperature, and salmon spawn timing in Alaskan streams. Geomorphology, 185, 78–86. 10.1016/j.geomorph.2012.12.013 [DOI] [Google Scholar]
- Lohmueller, K. E. , Indap, A. R. , Schmidt, S. , Boyko, A. R. , Hernandez, R. D. , Hubisz, M. J. , & Nielsen, R. (2008). Proportionally more deleterious genetic variation in European than in African populations. Nature, 451(7181), 994–997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lutz, M. L. , Tonkin, Z. , Yen, J. D. L. , Johnson, G. , Ingram, B. A. , Sharley, J. , & Pavlova, A. (2021). Using multiple sources during reintroduction of a locally extinct population benefits survival and reproduction of an endangered freshwater fish. Evolutionary Applications, 14(4), 950–964. 10.1111/eva.13173 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lynch, M. , Conery, J. , & Burger, R. (1995). Mutational meltdowns in sexual populations. Evolution, 49(6), 1067–1080. 10.2307/2410432 [DOI] [PubMed] [Google Scholar]
- Lyons, J. , Kampa, J. , Parks, T. , & Sass, G. (2015). The whitefishes of Wisconsin’s lakes: The 2011–2014 Wisconsin Department of Natural Resources cisco and lake whitefish survey. :Fisheries and Aquatic Research Section. Wisconsin Department of Natural Resources. [Google Scholar]
- Lyons, J. , Parks, T. P. , Minahan, K. L. , & Ruesch, A. S. (2018). Evaluation of oxythermal metrics and benchmarks for the protection of cisco (Coregonus artedi) habitat quality and quantity in Wisconsin lakes. Canadian Journal of Fisheries and Aquatic Sciences, 75(4), 600–608. [Google Scholar]
- Magee, M. R. , McIntyre, P. B. , Hanson, P. C. , & Wu, C. H. (2019). Drivers and management implications of long‐term cisco oxythermal habitat decline in Lake Mendot, WI. Environmental Management, 63(3), 396–407. 10.1007/s00267-018-01134-7 [DOI] [PubMed] [Google Scholar]
- Magoč, T. , & Salzberg, S. L. (2011). FLASH: Fast length adjustment of short reads to improve genome assemblies. Bioinformatics, 27(21), 2957–2963. 10.1093/bioinformatics/btr507 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mann, G. J. , & McCart, P. J. (1981). Comparison of sympatric dwarf and normal populations of least cisco (Coregonus sardinella) inhabiting Trout Lake, Yukon Territory. Canadian Journal of Fisheries and Aquatic Science, 38, 240–244. [Google Scholar]
- Mantel, N. (1967). The detection of disease clustering and a generalized regression approach. Cancer Research, 27(2 Part 1), 209–220. [PubMed] [Google Scholar]
- Marsh, P. C. , Kesner, B. R. , & Pacey, C. A. (2005). Repatriation as a management strategy to conserve a critically imperiled fish species. North American Journal of Fisheries Management, 25(2), 547–556. 10.1577/M04-123.1 [DOI] [Google Scholar]
- McKinney, G. J. , Waples, R. K. , Seeb, L. W. , & Seeb, J. E. (2017). Paralogs are revealed by proportion of heterozygotes and deviations in read ratios in genotyping‐by‐sequencing data from natural populations. Molecular Ecology Resources, 17(4), 656–669. 10.1111/1755-0998.12613 [DOI] [PubMed] [Google Scholar]
- Meirmans, P. G. (2012). The trouble with isolation by distance. Molecular Ecology, 21(12), 2839–2846. 10.1111/j.1365-294X.2012.05578.x [DOI] [PubMed] [Google Scholar]
- Meirmans, P. G. (2020). genodive version 3.0: Easy‐to‐use software for the analysis of genetic data of diploids and polyploids. Molecular Ecology Resources, 20(4), 1126–1131. 10.1111/1755-0998.13145 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Naimi, B. , Hamm, N. A. S. , Groen, T. A. , Skidmore, A. K. , & Toxopeus, A. G. (2014). Where is positional uncertainty a problem for species distribution modelling? Ecography, 37(2), 191–203. 10.1111/j.1600-0587.2013.00205.x [DOI] [Google Scholar]
- Nei, M. (1987). Molecular evolutionary genetics. Columbia University Press. [Google Scholar]
- Nei, M. , & Gojobori, T. (1986). Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Molecular Biology and Evolution, 3(5), 418–426. 10.1093/oxfordjournals.molbev.a040410 [DOI] [PubMed] [Google Scholar]
- Nei, M. , Tajima, F. , & Tateno, Y. (1983). Accuracy of estimated phylogenetic trees from molecular data. Journal of Molecular Evolution, 19(2), 153–170. 10.1007/BF02300753 [DOI] [PubMed] [Google Scholar]
- Ohta, T. (1995). Synonymous and nonsynonymous substitutions in mammalian genes and the nearly neutral theory. Journal of Molecular Evolution, 40(1), 56–63. 10.1007/BF00166595 [DOI] [PubMed] [Google Scholar]
- Okansen, J. , Blanchet, F. G. , Kindt, R. , Legendre, P. , Minchin, P. R. , O'Hara, R. B. , & Wagner, H. (2016). vegan: Community Ecology Package. R package version 2.3‐5. Retrieved from https://CRAN.R‐project.org/package=vegan
- Olden, J. D. , Jackson, D. A. , & Peres‐Neto, P. R. (2001). Spatial isolation and fish communities in drainage lakes. Oecologia, 127(4), 572–585. 10.1007/s004420000620 [DOI] [PubMed] [Google Scholar]
- Pavey, S. A. , Lamaze, F. C. , Garant, D. , & Bernatchez, L. (2011). Full length MHC IIβ exon 2 primers for salmonids: A new resource for next generation sequencing. Conservation Genetics Resources, 3(4), 665–667. 10.1007/s12686-011-9428-1 [DOI] [Google Scholar]
- Peischl, S. , Dupanloup, I. , Kirkpatrick, M. , & Excoffier, L. (2013). On the accumulation of deleterious mutations during range expansions. Molecular Ecology, 22(24), 5972–5982. 10.1111/mec.12524 [DOI] [PubMed] [Google Scholar]
- Pekkala, N. , E. Knott, K. , Kotiaho, J. S. , Nissinen, K. , & Puurtinen, M. (2012). The benefits of interpopulation hybridization diminish with increasing divergence of small populations. Journal of Evolutionary Biology, 25(11), 2181–2193. 10.1111/j.1420-9101.2012.02594.x [DOI] [PubMed] [Google Scholar]
- Perrier, A. , Sánchez‐Castro, D. , & Willi, Y. (2020). Expressed mutational load increases toward the edge of a species’ geographic range. Evolution, 74(8), 1711–1723. 10.1111/evo.14042 [DOI] [PubMed] [Google Scholar]
- Perrier, C. , Ferchaud, A. L. , Sirois, P. , Thibault, I. , & Bernatchez, L. (2017). Do genetic drift and accumulation of deleterious mutations preclude adaptation? Empirical investigation using RAD seq in a northern lacustrine fish. Molecular Ecology, 26(22), 6317–6335. 10.1111/mec.14361 [DOI] [PubMed] [Google Scholar]
- Piette‐Lauzière, G. , Bell, A. H. , Ridgway, M. S. , & Turgeon, J. (2019). Evolution and diversity of two cisco forms in an outlet of glacial Lake Algonquin. Ecology and Evolution, 9(17), 9654–9670. 10.1002/ece3.5496 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pimm, S. L. (1991). The balance of nature? Ecological Issues in the conservation of species and communities. The University of Chicago Press. [Google Scholar]
- Prunier, J. G. , Dubut, V. , Chikhi, L. , & Blanchet, S. (2017). Contribution of spatial heterogeneity in effective population sizes to the variance in pairwise measures of genetic differentiation. Methods in Ecology and Evolution, 8(12), 1866–1877. 10.1111/2041-210X.12820 [DOI] [Google Scholar]
- Ralls, K. , Ballou, J. D. , Dudash, M. R. , Eldridge, M. D. B. , Fenster, C. B. , Lacy, R. C. , Sunnucks, P. , & Frankham, R. (2018). Call for a paradigm shift in the genetic management of fragmented populations. Conservation Letters, 11(2), e12412. 10.1111/conl.12412 [DOI] [Google Scholar]
- Ralls, K. , Sunnucks, P. , Lacy, R. C. , & Frankham, R. (2020). Genetic rescue: A critique of the evidence supports maximizing genetic diversity rather than minimizing the introduction of putatively harmful genetic variation. Biological Conservation, 251, 108784. 10.1016/j.biocon.2020.108784 [DOI] [Google Scholar]
- Renik, K. M. , Jennings, M. J. , Kampa, J. M. , Lyons, J. , Parks, T. P. , & Sass, G. G. (2020). Status and distribution of Cisco (Coregonus artedi) and Lake Whitefish (Coregonus clupeaformis) in Inland Lakes of Wisconsin. Northeastern Naturalist, 27(3), 469–484. 10.1656/045.027.0307 [DOI] [Google Scholar]
- Robinson, Z. L. , Bell, D. A. , Dhendup, T. , Luikart, G. , Whiteley, A. R. , & Kardos, M. (2020). Evaluating the outcomes of genetic rescue attempts. Conservation Biology, 35(2), 666–677, 10.1111/cobi.13596 [DOI] [PubMed] [Google Scholar]
- Rochette, N. C. , Rivera‐Colón, A. G. , & Catchen, J. M. (2019). Stacks 2: Analytical methods for paired‐end sequencing improve RADseq‐based population genomics. Molecular Ecology, 28(21), 4737–4754. 10.1111/mec.15253 [DOI] [PubMed] [Google Scholar]
- Rougemont, Q. , Moore, J.‐S. , Leroy, T. , Normandeau, E. , Rondeau, E. B. , Withler, R. E. , Van Doornik, D. M. , Crane, P. A. , Naish, K. A. , Garza, J. C. , Beacham, T. D. , Koop, B. F. , & Bernatchez, L. (2020). Demographic history shaped geographical patterns of deleterious mutation load in a broadly distributed Pacific Salmon. PLOS Genetics, 16(8), e1008348. 10.1371/journal.pgen.1008348 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rousset, F. (1997). Genetic differentiation and estimation of gene flow from F‐statistics under Isolation by distance. Genetics, 145(4), 1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saitou, N. , & Nei, M. (1987). The neighbor‐joining method: A new method for reconstructing phylogenetic trees. Molecular Biology and Evolution, 4(4), 406–425. [DOI] [PubMed] [Google Scholar]
- Scott, W. , & Crossman, E. (1973). Freshwater fishes of Canada. Galt House Publishing. [Google Scholar]
- Sgrò, C. M. , Lowe, A. J. , & Hoffmann, A. A. (2011). Building evolutionary resilience for conserving biodiversity under climate change. Evolutionary Applications, 4(2), 326–337. 10.1111/j.1752-4571.2010.00157.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shields, B. A. , & Underhill, J. C. (1993). Phenotypic plasticity of a transplanted population of dwarf cisco, Coregonus artedii . Environmental Biology of Fishes, 37, 9–23. 10.1007/BF00000708 [DOI] [Google Scholar]
- Sokal, R. R. (1979). Testing statistical significance of geographic variation patterns. Systematic Zoology, 28(2), 227–232. 10.2307/2412528 [DOI] [Google Scholar]
- Stuglik, M. T. , Radwan, J. , & Babik, W. (2011). jMHC: Software assistant for multilocus genotyping of gene families using next‐generation amplicon sequencing. Molecular Ecology Resources, 11(4), 739–742. 10.1111/j.1755-0998.2011.02997.x [DOI] [PubMed] [Google Scholar]
- Svardson, G. (1950). The coregonid problem. II. Morphology of two coregonid species in different environments. Report of the Institute of Freshwater Research Drottningholm, 31, 151–162. [Google Scholar]
- Till, A. , Rypel, A. L. , Bray, A. , & Fey, S. B. (2019). Fish die‐offs are concurrent with thermal extremes in north temperate lakes. Nature Climate Change, 9(8), 637–641. 10.1038/s41558-019-0520-y [DOI] [Google Scholar]
- Tingley, R. W. , Paukert, C. , Sass, G. G. , Jacobson, P. C. , Hansen, G. J. A. , Lynch, A. J. , & Shannon, P. D. (2019). Adapting to climate change: Guidance for the management of inland glacial lake fisheries. Lake and Reservoir Management, 35(4), 435–452. 10.1080/10402381.2019.1678535 [DOI] [Google Scholar]
- Turgeon, J. , & Bernatchez, L. (2001a). Clinal variation at microsatellite loci reveals historical secondary intergradation between glacial races of Coregonus artedi (Teleostei: Coregoninae). Evolution, 55(11), 2274–2286. 10.1111/j.0014-3820.2001.tb00742.x [DOI] [PubMed] [Google Scholar]
- Turgeon, J. , & Bernatchez, L. (2001b). Mitochondrial DNA phylogeography of lake Cisco (Coregonus artedi): Evidence supporting extensive secondary contacts between two glacial races. Molecular Ecology, 10(4), 987–1001. [DOI] [PubMed] [Google Scholar]
- Turgeon, J. , & Bernatchez, L. (2003). Reticulate evolution and phenotypic diversity in North American ciscoes, Coregonus ssp. (Teleostei: Salmonidae): Implications for the conservation of an evolutionary legacy. Conservation Genetics, 4(1), 67–81. [Google Scholar]
- Valiquette, E. , Perrier, C. , Thibault, I. , & Bernatchez, L. (2014). Loss of genetic integrity in wild lake trout populations following stocking: Insights from an exhaustive study of 72 lakes from Québec. Canada. Evolutionary Applications, 7(6), 625–644. 10.1111/eva.12160 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waples, R. S. (2006). A bias correction for estimates of effective population size based on linkage disequilibrium at unlinked gene loci*. Conservation Genetics, 7(2), 167–184. 10.1007/s10592-005-9100-y [DOI] [Google Scholar]
- Waples, R. S. , Antao, T. , & Luikart, G. (2014). Effects of overlapping generations on linkage disequilibrium estimates of effective population size. Genetics, 197(2), 769–780. 10.1534/genetics.114.164822 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waples, R. S. , & Do, C. (2010). Linkage disequilibrium estimates of contemporary Ne using highly variable genetic markers: A largely untapped resource for applied conservation and evolution. Evolutionary Applications, 3(3), 244–262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waples, R. K. , Larson, W. A. , & Waples, R. S. (2016). Estimating contemporary effective population size in non‐model species using linkage disequilibrium across thousands of loci. Heredity, 117(4), 233. 10.1038/hdy.2016.60 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weise, E. M. , Sard, N. M. , Nehfer, A. , & Scribner, K. T. (2020). Adaptive genetic management: Assessing the benefits of translocations. Conservation Genetics, 21(2), 277–287. 10.1007/s10592-020-01249-7 [DOI] [Google Scholar]
- Whitlock, M. C. (2000). Fixation of new alleles and the extinction of small populations: Drift load, beneficial alleles, and sexual selection. Evolution, 54(6), 1855–1861. 10.1111/j.0014-3820.2000.tb01232.x [DOI] [PubMed] [Google Scholar]
- Whitlock, M. C. , & Bürger, R. (2004). Fixation of new mutations in small populations. In Ferriere R., Dieckmann U., & Couvet D. (Eds.), Evolutionary Conservation Biology (pp. 155–170). Cambridge University Press. [Google Scholar]
- Wickham, H. (2011). ggplot2. WIREs Computational Statistics, 3(2), 180–185. 10.1002/wics.147 [DOI] [Google Scholar]
- Willi, Y. , Fracassetti, M. , Zoller, S. , & Van Buskirk, J. (2018). Accumulation of mutational load at the edges of a species range. Molecular Biology and Evolution, 35(4), 781–791. 10.1093/molbev/msy003 [DOI] [PubMed] [Google Scholar]
- Winslow, L. A. , Hansen, G. J. A. , Read, J. S. , & Notaro, M. (2017). Large‐scale modeled contemporary and future water temperature estimates for 10774 Midwestern U.S. Lakes. Scientific Data, 4(1), 170053. 10.1038/sdata.2017.53 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wright, S. (1931). Evolution in Mendelian populations. Genetics, 16(2), 97. Retrieved from http://www.genetics.org/content/16/2/97.abstract [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yeaman, S. , Hodgins, K. A. , Lotterhos, K. E. , Suren, H. , Nadeau, S. , Degner, J. C. , Nurkowski, K. A. , Smets, P. , Wang, T. , Gray, L. K. , Liepe, K. J. , Hamann, A. , Holliday, J. A. , Whitlock, M. C. , Rieseberg, L. H. , & Aitken, S. N. (2016). Convergent local adaptation to climate in distantly related conifers. Science, 353(6306), 1431. 10.1126/science.aaf7812 [DOI] [PubMed] [Google Scholar]
- Zhang, M. , Zhou, L. , Bawa, R. , Suren, H. , & Holliday, J. A. (2016). Recombination rate variation, hitchhiking, and demographic history shape deleterious load in poplar. Molecular Biology and Evolution, 33(11), 2899–2910. 10.1093/molbev/msw169 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Table S1
Table S2
Table S3
Figure S1
Figure S2
Data S1
Data S2
Table S4
Figure S3
Data Availability Statement
Sequence data for the 129 individuals used in the development of the Rapture panel were previously uploaded to the National Center for Biotechnology Information (NCBI) sequence read archive (BioProject: PRJNA557717). Rapture sequence data used in analyses in this manuscript were georeferenced in the Genomic Observatories MetaDatabase (https://n2t.net/ark:21547/DwZ2) and uploaded to the NCBI sequence read archive (BioProject: PRJNA771945). Scripts and supplemental material for the hydrodynamic model will be uploaded to Dryad upon acceptance.
