Skip to main content
Evolutionary Applications logoLink to Evolutionary Applications
. 2024 Jun 28;17(7):e13703. doi: 10.1111/eva.13703

Genetic and epigenetic differentiation in response to genomic selection for avian lay date

Melanie Lindner 1,2,, Irene Verhagen 3, A Christa Mateman 1, Kees van Oers 1,4, Veronika N Laine 5, Marcel E Visser 1,2
PMCID: PMC11211926  PMID: 38948539

Abstract

Anthropogenic climate change has led to globally increasing temperatures at an unprecedented pace and, to persist, wild species have to adapt to their changing world. We, however, often fail to derive reliable predictions of species' adaptive potential. Genomic selection represents a powerful tool to investigate the adaptive potential of a species, but constitutes a ‘blind process’ with regard to the underlying genomic architecture of the relevant phenotypes. Here, we used great tit (Parus major) females from a genomic selection experiment for avian lay date to zoom into this blind process. We aimed to identify the genetic variants that responded to genomic selection and epigenetic variants that accompanied this response and, this way, might reflect heritable genetic variation at the epigenetic level. We applied whole genome bisulfite sequencing to blood samples of individual great tit females from the third generation of bidirectional genomic selection lines for early and late lay date. Genomic selection resulted in differences at both the genetic and epigenetic level. Genetic variants that showed signatures of selection were located within genes mostly linked to brain development and functioning, including LOC107203824 (SOX3‐like). SOX3 is a transcription factor that is required for normal hypothalamo‐pituitary axis development and functioning, an essential part of the reproductive axis. As for epigenetic differentiation, the early selection line showed hypomethylation relative to the late selection line. Sites with differential DNA methylation were located in genes important for various biological processes, including gonadal functioning (e.g., MSTN and PIK3CB). Overall, genomic selection for avian lay date provided insights into where within the genome the heritable genetic variation for lay date, on which selection can operate, resides and indicates that some of this variation might be reflected by epigenetic variants.

Keywords: avian breeding time, climate change adaptation, DNA methylation, genomic selection, Parus major, SNPs

1. INTRODUCTION

Climate change has led to environmental changes at an unprecedented pace and, to persist, wild species need to adapt to their changing world. Data from long‐term study populations of wild animals have enabled the quantification of selection pressures and additive genetic variation for various phenotypes in many species and, based on these estimates, predictions of the potential for micro‐evolutionary responses to selection were derived (Grant & Grant, 1995; Kruuk et al., 2008). However, despite many examples of directional selection in the wild (Kruuk et al., 2002; Marrot et al., 2018; Nussey et al., 2005; Ramakers et al., 2019; Sheldon et al., 2003), micro‐evolutionary responses to selection that match the estimated predictions are rare (Charmantier & Gienapp, 2014; Merilä et al., 2001). Predicting micro‐evolutionary responses relies on accurate estimates of selection and genetic variation, but the complex nature of wild study systems and focal traits (and hence statistical frameworks) in combination with the limited data availability almost unavoidably leads to biased estimates of such parameters (but see the review by Pujol et al., 2018).

Advances in genomics provide us with the opportunity to zoom into the genomic architecture of phenotypes with the aim to identify the causal genes and genomic mechanisms that translate the relevant genes into phenotypes. This way, we can predict the potential for a micro‐evolutionary change to take place at specific genetic variants and derive predictions that take the genomic architecture of phenotypes into account. Indeed, loci that explained a moderate proportion of the genetic variation of phenotypes have been identified for some phenotypes, such as birth weight and recombination rate in ungulates (Johnston et al., 2016; Slate et al., 2002), wing length or bill morphology in birds (Bosse et al., 2017; Tarka et al., 2010) or age at maturity in salmon (Barson et al., 2015). Many phenotypes, however, are thought to be determined by a large number of genetic variants which each have a small effect on the phenotype (e.g., Husby et al., 2015; Santure et al., 2015). In combination with rather small sample sizes in most evolutionary studies of wild populations, small effects sizes lead to low power in detecting genetic variants, especially when phenotypes are phenotypically plastic, that is there is an environment dependent association between genetic variants and phenotypes (e.g., Gienapp et al., 2017). As a consequence, genetic variants only explain a small proportion of the heritable phenotypic variation for many phenotypes (e.g., references in table S1 of Gienapp et al., 2017), leaving a large proportion of the heritable phenotypic variation unexplained (or ‘missing’).

Epigenetics marks, that is chemical modifications of the DNA sequences or DNA‐associated histones, are suggested to contribute to heritable phenotypic variation (Trerotola et al., 2015). Epigenetic marks are involved in the regulation of gene expression and, as such, contribute to the expression of phenotypes. Especially if epigenetic marks are inherited independently of genetic variants or are involved in mediating genotype–environment interactions, they could, in theory, contribute to heritable phenotypic variation. While there are sources of epigenetics marks that are thought to be independent of genetic variation, such as the environmental induction of epigenetic modifications and spontaneous epi‐mutations (Sepers et al., 2019), it is under debate whether epigenetic variants that are independent of genetic variants can be inherited (especially in vertebrates). Nevertheless, the identification of epigenetic variants can provide insights into the genomic architecture of phenotypes, especially for phenotypes that are dependent on the environment. In maize, phenotypic plasticity in plant growth is suggested to be regulated by epigenetic processes that are driven by a genetic variant in the rice plasticity 1 (RPL1) gene. Individuals that harbour the genetic variant in RPL1 showed specific epigenetic signatures and increased levels of phenotypic plasticity (Zhang et al., 2012). In this case, epigenetic processes are responsive to the environment, but the degree of responsiveness is dependent on a genetic variant, indicating that the observed phenotypic plasticity is facilitated by genotype‐dependent epigenetic variation. Hence, specifically those epigenetic variants that are dependent on genetic variants are of particular interest for understanding how phenotypic variation is shaped.

Artificial selection is a promising and powerful tool for studying the genes and genomic mechanisms underlying natural variation of quantitative traits as it facilitates the comparison between individuals of extreme phenotypes at different molecular levels (Hill & Caballero, 1992). In contrast to phenotype‐based selection, genomic selection relies on the marker‐based prediction of genomic breeding values (GEBVs), the additive effect of an individual's genotype on the phenotype relative to the population mean phenotype (Charmantier et al., 2014). Despite its promising advantages and common application in animal and plant breeding (Jannink et al., 2010; Meuwissen et al., 2016), the only application of genomic selection in a wild animal population stems from the long‐term study population of great tits (Parus major) at the Hoge Veluwe National Park (Gienapp et al., 2019; Verhagen, Gienapp, et al., 2019).

In the Dutch study population of great tits, bidirectional genomic selection was applied to avian lay date, a trait with a heritability of about 0.2 (Gienapp et al., 2019). For this, GEBVs for lay date were estimated (Gienapp et al., 2019) and individuals with GEBVs for extremely early and late lay date were selected to establish the genomic selection lines (Verhagen, Gienapp, et al., 2019). The genomic selection experiment allows us to study the genetic and genomic basis underlying lay dates, a complex phenotype that is phenotypically plastic to spring temperature and of high evolutionary and ecological relevance for the population due to its effect on reproductive success (Schaper et al., 2012; Verhagen et al., 2020; Visser et al., 1998). The preparations for egg laying on a molecular and physiological level are initiated 6–8 weeks prior to the initiation of egg laying when increasing photoperiods in early spring induce a neuro‐endocrine cascade along the hypothalamic–pituitary–gonadal–liver axis (HPGL axis) that activates the onset of gonadal growth (Dawson et al., 2001; Williams, 2012). The final ‘decision’ on when to initiate egg laying is regulated much later in spring downstream of the underlying neuro‐endocrine cascade and depends on supplementary cues such as increasing temperatures (Caro et al., 2013; Schaper et al., 2012; Verhagen, Laine, et al., 2019). This complex regulation of lay dates across all levels of the HPGL axis in interaction with various environmental variables and the highly polygenetic nature of the trait have proven challenging for genome‐wide association studies (GWAS) on avian lay dates in wild populations. Even when moderate sample sizes are used (>2000 great tit females) and genotype–environment interactions are accounted for, GWAS failed to detect genome‐wide significant genetic variants for lay dates (Gienapp et al., 2017). As a consequence, the genetic and genomic basis underlying lay dates remain mostly unknown.

Genomic selection for lay dates has led to a clear differentiation in GEBVs accompanied by phenotypic differentiation when lay dates were recorded in aviaries (Verhagen, Gienapp, et al., 2019) and in the wild (Lindner et al., 2023). However, genomic selection based on GEBVs is a ‘blind process’ that does not provide any information on the genetic variants that responded to genomic selection or the epigenetic marks that accompanied the genetic response. Genetic signatures of selection within the genomic selection experiments were previously identified at a number of loci, in the form of single nucleotide polymorphisms (SNPs) (Verhagen, Gienapp, et al., 2019) and we here expand on these findings by exploring epigenetic signatures of the genomic selection line experiment. For this, we used a whole genome bisulfite sequencing (WGBS) approach on blood samples of female great tits from the third generation of the genomic selection lines for early and late lay dates to simultaneously profile heritable genetic variants and epigenetic marks in the form of SNPs and CpG site methylation (i.e., the methylation status of bi‐nucleotides within the DNA sequence that consist of a potentially methylated cytosine and an unmethylated guanine). We found differences at both the genetic and epigenetic level between females from the early and late selection line. Epigenetic variants were related to general biological processes, while genetic variants were related to brain development and functioning. This way, our findings provide insights into where within the genome heritable genetic variation for lay date resides and point towards complementary functions of genetic and epigenetic variants for avian lay dates.

2. METHODS

2.1. Genomic selection lines for avian lay date

The great tit, is a well‐known model species in ecology and evolution with a reference genome (Laine et al., 2016) and whole transcriptome and methylome for various tissues (Derks et al., 2016; Laine et al., 2016; Santure et al., 2011). The females included here originated from a bidirectional genomic selection experiment, in which great tits were selected based on GEBVs for early and late lay dates (full methods are provided in Gienapp et al., 2019; Verhagen, Gienapp, et al., 2019). We define a female's lay date as the date a female initiates egg laying within a year. In contrast to traditional methods for artificial selection where individuals are selected based on their expressed phenotype, genomic selection is based on genomic breeding values (GEBVs), the additive effect of an individual's genotype on the phenotype relative to the population mean phenotype (Charmantier et al., 2014).

For the genomic selection experiment, a wild training population of >2000 great tit females from the Hoge Veluwe National Park (The Netherlands) with known lay dates and genotyped at >500,000 single nucleotide polymorphisms (SNPs) was used to estimate genomic breeding values (GEBVs) using the ‘genomic best linear unbiased prediction’ (GBLUP) approach (Gienapp et al., 2019). Within this approach, the pedigree‐based relatedness matrix within the animal model was replaced by a SNP‐based relatedness matrix. The animal model constitutes a specific form of a mixed‐effect model frequently used in quantitative genetic studies (Wilson et al., 2010). Within this framework GEBVs of genotyped individuals with unknown lay date can be predicted based on the SNP‐based relatedness between individuals. To initiate the selection lines for early and late lay dates, 28 breeding pairs from the Hoge Veluwe study site were selected in 2014 as ‘parental’ generation based on their breeding values for lay dates (Verhagen, Gienapp, et al., 2019). All nestlings produced by the wild parental generation (i.e., the first‐generation offspring) were brought to aviary facilities at the NIOO‐KNAW on the tenth day after hatching (d10), where they were hand raised until independence (Drent et al., 2003). First‐generation offspring were genotyped (n = 163 first‐generation genotyped nestlings) to estimate their GEBVs and individuals with extreme early GEBVs and extreme late GEBVs were selected for breeding the selection lines for early and late lay dates in captivity (n = 37 first‐generation breeding pairs). Eggs produced by the first‐generation offspring in captivity were moved into nests of wild foster parents in spring 2015. Second‐generation offspring hatched in these foster nests and foster parents undertook the brood care until d10. Then, the nestlings were brought to the aviary facilities at the NIOO‐KNAW for hand rearing and genotyping (n = 189 second‐generation genotyped nestlings) and, based on the predicted GEBVs for lay date, selected into breeding pairs (n = 33 second‐generation breeding pairs) for spring 2016. This procedure was repeated for the third‐generation (n = 280 third‐generation genotyped nestlings). Overall, average GEBVs for third‐generation individuals (−0.50 and 0.61 for the early and late selection line, respectively) corresponded reasonably well to the cumulative predictive response to genomic selection of −0.72 days for the early selection line and 0.84 days for the late selection line (Verhagen, Gienapp, et al., 2019). The response to genomic selection at the level of GEBVs translated to a response at the phenotypic level when birds were breeding in aviaries (Verhagen, Gienapp, et al., 2019) as well as in the wild (Lindner et al., 2023). The experiment was performed under the approval by the Animal Experimentation Committee of the Royal Academy of Sciences (DEC‐KNAW), Amsterdam, The Netherlands, protocol NIOO 14.10.

2.2. Sample selection and DNA extraction

For the WGBS libraries, we selected 19 individual blood samples that included 10 samples from third‐generation females of the early selection line and nine samples from third‐generation females of the late selection line as well as one toe sample of a third‐generation female of the late selection line (Table S1). The toe sample was excluded from the DNA methylation analyses, but included for analyses on SNP data. We extracted DNA from blood samples (whole blood or blood of which plasma was removed) that were taken closest to the first of June in the first year of breeding. DNA was extracted from the samples using FavorPrep DNA extraction kit (Bio‐Connect, The Netherlands) following the manufacturer's instructions.

2.3. Whole genome bisulfite sequencing

Library preparation and sequencing was performed at the Roy J. Carver Biotechnology Centre (University of Illinois at Urbana‐Champaign, USA). WGBS data showed high rates of technical duplicates (Table S2) which can bias the methylation status and coverage of CpG sites. In contrast to RRBS, where the non‐random fragmentation complicates the differentiation between technical (e.g., PCR) and biological duplicates, the random fragmentation in WGBS allows for the removal of technical duplicates (for more details on technical duplicates see Laine et al., 2023). However, when duplication rates are high, the removal of technical duplicates results in a drastic loss of data. To circumvent data loss, we performed library preparation and sequencing twice for all libraries and a third time for eight libraries with particularly high duplication rates (Table S2). For the first two sequencing runs DNA from the same extraction was used and library preparation and sequencing were performed in the same way. Shotgun genomic libraries (with read length of 150 nucleotides) were prepared with the Hyper Library construction kit from kapa Biosystems (Roche) and treated with the EZ DNA Methylation‐Lightning kit from Zymo Research. Libraries were quantitated by qPCR and sequenced for 151 cycles from each end of the fragment (i.e., paired‐end) in two lanes of a S4 flow cell on a NovaSeq 6000. See Table S3 for how libraries were divided across lanes. For the third run, we extracted more DNA from eight of the previously sequenced samples using the same extraction protocol. Libraries were prepared as described above, pooled into one pool and sequenced for 151 cycles from each end of the fragment (i.e., paired‐end) in one lane of a SP flow cell on a NovaSeq 6000.

2.4. Bioinformatics processing

We ran the bioinformatics pipelines with snakemake v5.17.0 (Koster & Rahmann, 2012). We used R v4.0.1 (R Core Team, 2021) for additional scripts used within the pipeline and, in addition to base R packages, we used dplyr v1.0.0 (Wickham et al., 2020), tidyr v1.1.0 (Wickham & Henry, 2020), stringr v1.4.0 (Wickham, 2019), ggplot2 v3.3.2 (Wickham, 2016, p. 2), cowplot v1.1.0 (Wilke, 2020) and RColorBrewer v1.1.2 (Neuwirth, 2014) for data formatting and visualization. Environments were build and managed with conda v4.8.4 (Anaconda Software Distribution, 2016).

2.5. Quality control of whole genome bisulfite sequencing data

For the initial quality control, we used FastQC v0.11.9 (Andrew, 2010), FastQ Screen v0.11.1 (Wingett & Andrews, 2018), and MultiQC v1.7 (Ewels et al., 2016) in default settings but allowed parallel processing of samples by FastQC. Results are presented in Table S2. We trimmed the data and removed adapters using TrimGalore v0.6.5 (https://github.com/FelixKrueger/TrimGalore) in settings for paired‐end data and set a NovaSeq specific quality cut‐off of 20 (by specifying –2coulor 20) accounting for NovaSeq specific over‐representation of Gs (poly‐G). We repeated the quality control by running FastQC and MultiQC for the trimmed data.

2.6. Methylation calling

We used Bismark v0.22.3 (Krueger & Andrews, 2011) for alignment and methylation calling. First, the great tit reference genome build 1.1 (https://www.ncbi.nlm.nih.gov/assembly/GCF_001522545.3) was in silico bisulfite converted and indexed with default settings using Bismark's genome build function. We aligned the reads with default settings for paired‐end reads and set the number of parallel instances to be run concurrently to eight. We used the percentage of CHH methylation from the Bismark alignment reports to calculate the minimal bisulfite conversion efficiency (Table S4). We deduplicated the alignments with default settings for paired‐end reads using Bismark. Using Picard v2.23.3 (https://github.com/broadinstitute/picard) we added read groups to the alignments and merged the alignment files for either the first two sequencing runs or all three sequencing runs of the same sample depending on whether a sample was run two or three times. We assessed the number of mapped reads, average coverage depth and breadth of coverage using samtools (Table S5). The breadth of coverage was calculated as the number of bases with a minimum coverage of 10 divided by the total number of bases within the great tit genome, that is genome length; calculated using Bowtie2 v2.3.5.1 (Langmead & Salzberg, 2012). We used Bismark to call methylation from the deduplicated and merged alignments using default setting for paired‐end reads ignoring the first two and three bases from the 5′ end (of both reads) and the 3′ end (of both reads), respectively.

In addition to methylation called from the merged alignments (which are analysed in this study), we also called methylation from the alignments prior to merging, that is from alignment files for either the first two sequencing runs or all three sequencing runs of the same sample, using the pipeline described above. We used the derived methylation calls to assess whether our findings are robust in regard to the use of multiple sequencing runs.

2.7. Processing and analysis of methylation data

We used R package MethylKit v1.16.1 (Akalin et al., 2012) to import the raw methylation counts (Bismark output; CpG reports) into R. We excluded CpG sites with a CpG site coverage higher than the 99.9 percentile from further analyses. We combined methylation data of all samples into one data frame using MethylKit while merging cytosines from both strands of a CpG site into one CpG site by calculating the sum of methylated and unmethylated cytosines. We used custom R code to format the data for downstream analysis in which we removed CpG sites that did not have a coverage of 10x in all samples and removed CpG sites with 0% or 100% methylation across all samples (resulting in a total of 1,334,373 CpG sites).

2.8. Differential methylation analysis on CpG sites

We used the R package MethylKit to perform a differential methylation analysis on each CpG site to test for differences in DNA methylation between third‐generation females of the early (n = 10) and late (n = 9) selection line. To calculate the differential methylation statistics, we use the calculateDiffMeth function which calculates differential methylation using a Fisher's Exact test. Within the function call we applied overdispersion correction (overdispersion set to ‘MN’) as proposed by (McCullagh & Nelder, 1989). We validated the model by visually inspecting the p‐value distribution, the relationship of p‐values as ‐log10(p‐value) with CpG site coverage, the correlation of observed p‐values with the expected p‐values (both as ‐log10(p‐value), QQ‐plot) and the relationship of p‐values as ‐log10(p‐value) with the differences in mean methylation level between the selection lines (volcano plot). To identify differentially methylated sites (DMS), we corrected for multiple testing by using two adjustment methods. We (i) applied Benjamini and Hochberg's method implemented within MethylKit that controls the false discovery rate (FDR), the expected proportion of false discoveries within the rejected hypotheses (Benjamini & Hochberg, 1995), using a q‐value threshold of 0.05. We (ii) applied the Bonferroni approach using a corrected α‐threshold that was calculated as the initial α‐threshold (of 0.05) divided by the number of tests performed (i.e., 1,334,373 CpG sites). Validation plots of the differential methylation analysis show that, overall, the model was well‐suited for the methylation data (Figure S1a–d), although the QQ‐plot indicates some p‐value inflation (λ = 1.15; Figure S1c). Moreover, for DMS that passed the Bonferroni α‐threshold we showed that the identification of those DMS is not a statistical artefact of combining multiple sequencing runs by repeating the differential methylation analysis for the first and second sequencing run separately which, in both cases, confirmed our findings (Table S6).

In addition to the CpG site‐based analysis, we tested for a difference in overall methylation level across CpG sites between females from the early and late selection line using the CpG sites that passed the q‐value threshold. For this, we used the following generalized linear mixed model with binomially distributed errors implemented with the R package lme4 v1.1.27 (Bates et al., 2015);

y=μ+βxLine+Si+Fj+eij (1)

with y as dependent variable, a two‐column matrix of methylated and unmethylated counts to account for variation in CpG site coverage (corresponding to methylation levels weighted by the total number of counts; Lea et al., 2017; Zhang et al., 2016), with μ for the intercept term, with β for the selection line regression coefficient, with S i and F j for the random effect terms of CpG site and female identity respectively (for i = 1.37 and j = 1.19) and with e ij for the residual term. Significance was assessed by comparing the full model to its corresponding null model, that is a model equivalent to the full model but without the selection line regression coefficient, using an ANOVA.

2.9. Differential methylation analysis on regions

To identify differentially methylated regions, we predicted CGIs using cpgplot (Larsen et al., 1992), a software part of the EMBOSS package v6.6.0.0, in default settings. We predicted a total number of 33,142 CGIs and matched the 1,334,373 CpG sites that passed data processing to GCIs using BEDtools v.2.26.0 (Quinlan & Hall, 2010). We applied a filter threshold of 10 or more CpG sites (with 10x coverage in all samples) per CGI, resulting in a total of 596 CGIs. We used MethylKit's regionCounts function to sum up methylated and unmethylated base counts for each CGI and tested for differences in DNA methylation between the early (n = 10) and late (n = 9) selection line using MethylKit's calculateDiffMeth function. We used the same settings in the function call and the same approach for multiple testing correction as described above for the CpG site‐based differential methylation analysis.

2.10. SNP calling

Based on our recent evaluation of SNP calling tools for bisulfite sequencing data (Lindner et al., 2022), we used the Bayesian wildcard strategy of CGmapTools v0.1.2 (Guo et al., 2018) for SNP calling. Due to the bisulfite treatment (conversion of C to T if C is unmethylated), the presence of Ts might indicate either Ts or Cs in the unconverted genome resulting in ambiguous genotypes. Wildcards are used to denote this ambiguity in predicted genotypes with Y referring to either T or C and R referring to either A or G. When both strands have high coverage, this ambiguity can be resolved and an exact genotype can be computed (Guo et al., 2018). Prior to SNP calling, however, we were required to repeat the alignments as Bismark implemented new flag values (from v0.8.3 onwards) while CGmapTools requires the previous flag values for SNP calling. For the Bismark alignments with ‘old’ flag values we aligned the reads with the settings described above and additionally specified ‘–old_flag’ and ‘–no_dovetail’. We deduplicated the alignments, added read groups and merged alignments as described above (see Methylation calling). As females are the heterogametic sex, there is only one copy of the Z chromosome while autosomes have two copies. Here, we split the merged alignments such that the Z chromosome and mitochondrial DNA were removed and only autosomes and scaffolds were kept. We converted the alignments into ATCGmaps while removing the overlap of read pairs using CGmapTools. We called SNPs from the ATCGmaps using CGmapTools' Bayesian wildcard strategy in default settings. We removed ambiguous genotypes and applied a filter for minimum and maximum coverage using GATK v4.2.0 (DePristo et al., 2011; McKenna et al., 2010). We set the threshold for minimum coverage to 10 and the maximum coverage threshold to the 99th percentile of coverage within a sample averaged across sequenced samples. We merged samples, selected only positions that were SNPs and created unique SNP identifiers using GATK and BCFtools v1.11 (Danecek et al., 2021). The merged SNP data sets included 10,546,938 SNP positions prior to quality control.

2.11. Processing and analysis of SNP data

We used Plink v1.9 (Purcell et al., 2007) to manage SNP data and GenABEL v1.8.0 (Aulchenko et al., 2007) to perform the quality control. We discarded SNP positions that were present in less than 90% of samples per selection line and had a minor allele frequency below 0.125 (corresponding to five alleles), reducing the number of SNP positions to 451,600. We calculate the identity‐by‐state‐based genetic distance matrix for all samples over all SNPs which we then used for multi‐dimensional scaling (MDS, also known as principal coordinates analysis; Gower, 1966) to identify outliers and clusters of genetically similar individuals. No clear outlier was detected and samples clustered by selection line alongside the second PC (explaining for 8.84% of SNP variation; Figure S2 and Table S7).

The calculation of GEBVs for the genomic selection line experiment (Gienapp et al., 2019) was based on SNPs from a high‐density SNP chip for the great tit that covers >500,000 SNPs (Kim et al., 2018). SNPs from the SNP chip of females from all generations in the genomic selection line experiment (parents: n = 40, first‐generation offspring: n = 158, second‐generation offspring: n = 184, third‐generation offspring: n = 277), that after quality filter included 437,271 SNPs, were previously used for fixation index (F st) outlier analysis (Verhagen, Gienapp, et al., 2019). There was an overlap of less that 15% (n = 31,423) between the SNPs covered on the SNP chip (n = 437,271) and SNPs called from WGBS data (n = 451,600) and hence we consider our F st outlier analysis complementary to the F st outlier analysis performed in Verhagen, Gienapp, et al. (2019).

2.12. SNP outlier detection

To identify potential SNPs that differentiated in response to genomic selection for early and late lay dates, we performed three analytical approaches for SNP outlier detection described below; (i) fixation index (Fst) outlier analysis, (ii) a genome scan for selection based on principle component analysis (PCA) and (iii) redundancy analysis. For SNPs without missing data, we defined outlier SNPs as SNPs that were detected as outliers in all three approaches. The redundancy analysis required to exclude SNPs with missing genotypes. Hence, for SNPs with missing genotypes we defined outlier SNPs as SNPs that were detected as outliers in the F st outlier analysis and the PCA‐based genome scan for selection.

2.13. Fixation index (F st) outlier analysis

We estimated SNP‐specific F st coefficients for SNPs called from WGBS data (n = 451,600) using BayeScan v2.1 (Foll & Gaggiotti, 2008), a method that aims to directly estimate the locus‐specific probability that SNPs are subject to selection using a Bayesian method. While the use of BayeScan can result in a high number of false positives, this is not the case when a large number of neutral loci (relative to selected loci) is included in the analysis (Lotterhos & Whitlock, 2014), which is the case here. We first converted SNP data into a BayeScan‐readable format using R packages adegenet v2.1.5 (Jombart, 2008; Jombart & Ahmed, 2011) and dartR v1.9.9.1 (Gruber et al., 2018). We implemented BayeScan with a few adjustments relative to the default settings. We (i) increased the prior odds to 100 (default: 10) as low prior odds lead to false positives when testing a large number of markers, we (ii) increased the number of iterations to 25,000 (default 5000) and the number of the thinning interval size to 50 (default: 10) to decrease autocorrelation between iterations while keeping the same number of iterations as in default settings, and we (iii) set the number of threads used to 60. Trace plots indicate good convergence of chains (Figure S3) and effective sample sizes to estimate the posterior distributions were sufficiently large (n eff > 7000 for loglikelihood and n eff > 13,000 for F st coefficients). We considered SNPs with q‐value < 0.05 as F st outliers. In addition to BayeScan, we estimated F st using Arlequin and plink. For F st outlier analysis with Arlequin v3.5.2 (Excoffier & Lischer, 2010) we followed the analysis described in Verhagen, Gienapp, et al. (2019) and considered SNPs with a p‐value < 0.01 as potential F st outlier. For estimating locus‐specific F st coefficients with plink we set the –fst and –within flags in combination with a file linking sample IDs to the selection line. Overall F st coefficients estimated with BayeScan were lower than estimated with Arlequin and plink (Figure S4). However, the order of SNPs in regard to F st coefficients is quite comparable between tools, specifically for SNPs with the highest F st coefficients that correspond to F st outliers.

2.14. Genome scan for selection based on principle component analysis (PCA)

We used the R package pcadapt v4.3.3 (Luu et al., 2017) to implement a genome scan for selection based on PCA under the assumption that SNPs that excessively relate to population structure are candidate SNPs for local adaptation. pcadapt performs two successive tasks, (i) a PCA on the centred and scaled SNP matrix and (ii) the computation of test statistics and p‐values based on the correlations between SNPs and the first K principal components (PCs). For our data set (n = 451,600 SNPs), we set K = 1, because only the first PC was significantly associated with selection line (p‐value = 2.04E‐10, all other PCs had p‐value > 0.05) otherwise and used default settings. SNPs with p‐values smaller that the Bonferroni‐based threshold (calculated as the initial α‐threshold of 0.05 divided by the number of tests performed, that is 451,600 SNPs) are considered PCA‐based outliers.

2.15. Redundancy analysis

We implemented a redundancy analysis (Forester et al., 2016) that aims to detect SNP outlier that constitute footprints of divergent selection typically expected in polygenetic adaptation when traits are in complex interactions with the environment (Caizergues et al., 2022). Redundancy analysis requires complete data and hence we removed SNPs with missing genotypes from the data set, reducing the data set to 164,959 SNPs with genotypes in all samples. We implemented the redundancy analysis with R package vegan v2.5.7 (Oksanen et al., 2019) and specified selection line as explanatory variable. We considered a relationship between SNP data and selection line with p‐value < 0.05 as significant and extracted the SNP‐specific loadings in the ordination space to identify SNPs involved in local adaptation. SNPs that are located in the tails of the distribution of the SNP‐specific loadings (using a cut‐off of three standard deviations; Forester et al., 2016) are considered redundancy analysis outliers.

2.16. Annotation of CpG sites and SNPs

We annotated genomic regions using R packages GenomicFeatures v1.42.3 (Lawrence et al., 2013) and rtracklayer v1.50.0 (Lawrence et al., 2009). The genomic regions included the TSS region (300 bp upstream to 50 bp downstream of the annotated transcription start site), promoter region (2000 bp upstream to 200 bp downstream of the annotated transcription start site), gene body (exons and introns) and 10 kb up‐ and downstream regions (respective 10 kb regions adjacent to the gene body). CpG sites and SNPs were matched to the above‐described annotated genomic regions of genes with BEDtools or assigned to the intergenic region (if no match was found).

After pre‐processing the methylation data sets, we retained 1,334,373 CpG sites that were associated to 17,917 annotated genes. Mostly CpG sites were located within the gene body (n = 773,206) of annotated genes or in intergenic regions (n = 421,872; Figure S5 and Table S8). Moreover, 209,466 CpG sites were located in the 10 k upstream region, 186,300 CpG sites in the 10 k downstream region, 58,770 CpG sites in the promoter region and 8738 CpG sites in the transcription start site region.

2.17. Gene enrichment analysis

We performed gene ontology (GO) analyses for gene lists identified with the statistical analysis of the methylation data using the ClueGO v2.5.7 (Bindea et al., 2009) plug‐in for Cytoscape v3.8.2 (Shannon, 2003). We used the human (v20.01.2022) and chicken (v26.01.2022) annotations, GO categories ‘biological process’, ‘cellular components’, ‘molecular function’, ‘immune system process’ and KEGG pathways, and custom background lists of all annotated genes within the methylation data (n = 17,916 genes). We specified the selection criteria for GO terms such that >=5% of the genes associated with a GO term and >=2 genes associated with the GO term had to be present in the input genes. We used a two‐sided enrichment/depletion test, p‐value correction for multiple testing via Bonferroni step down and set the network specificity to ‘medium’ ranging from the third to tenth GO level. The great tit annotation contains LOC genes and we checked whether the LOC genes were categorized as predicted genes, uncharacterized genes or small nuclear RNA and assessed the reliability of the gene prediction using the NCBI genome browser and blast (Table S9).

3. RESULTS

3.1. Differential methylation analysis

Of the 1,334,373 CpG sites included in the CpG site‐based differential methylation analysis, we identified 37 differentially methylated CpG sites (DMS) when using the FDR‐based significance threshold (Figure 1). However, three DMS showed less than 10% difference in CpG methylation between selection lines (Table S10). Overall, identified DMS were hypomethylated in females of the early selection line relative to the late selection line (LRT: Chisq = 37.44, Df = 1, p‐value < 0.001; Figure 2). When applying a more stringent significance threshold (i.e., Bonferroni) and, this way, decreasing the number of expected false positives, we identified six DMS in four genes with at least 19% difference in CpG methylation between selection lines (Figure 2 and Table 1). These DMS were located within the gene body (n = 3), the 10 kb upstream region (n = 1) and the promoter region (n = 1) of annotated genes (Table 1), whose potential functional relevance for avian lay dates is described in the discussion.

FIGURE 1.

FIGURE 1

Manhattan plot of the differential methylation analysis. Data points are p‐values (on the ‐log10 scale) of individuals CpG sites (n = 1,334,373) that correspond to the significance of differential CpG site methylation between females from the early (n = 10) and late (n = 9) selection line. Black dashed lines refer to the significance threshold with Bonferroni (top) and FDR (bottom) correction for multiple testing. Furthermore, CpG sites that pass the FDR‐based significance thresholds are highlighted in orange and plotted with increased plot symbol size. ‘Sc’ refers to unplaced scaffolds (x‐axis).

FIGURE 2.

FIGURE 2

CpG site methylation level (in %) of 37 CpG sites with differential methylation after FDR correction for multiple testing for females from the early and late selection line. Data point and error bars represent the selection line‐specific mean and 95%‐confidence interval. The difference between selection lines was assessed using a likelihood ratio test (Chisq = 37.44, Df = 1, p‐value = 9.43*10−10).

TABLE 1.

CpG sites with differential methylation that pass the Bonferroni significance threshold.

Site p‐Value Methylation difference (%, late‐early) Genomic region Gene symbol
chr2_23971207 1.99E‐10 23.43 Gene body CASD1
chr2_27152785 6.96E‐10 30.23 Intergenic NA
chr4_14314088 1.70E‐08 −28.00 Intergenic NA
chr7_7319671 a 2.32E‐08 −22.00 10 kb upstream MSTN
chr7_7319671 a 2.32E‐08 −22.00 Gene body C7H2orf88
chr9_3916309 a 2.63E‐08 19.34 Promoter PIK3CB
chr9_3916309 a 2.63E‐08 19.34 Gene body PIK3CB
Scaffold101_1793295 1.74E‐08 19.75 Intergenic NA

Note: The CpG site identifier (chromosome and position on the chromosome), p‐value, difference in CpG site methylation, genomic region and gene symbol of the annotated gene are shown. A positive difference in CpG site methylation corresponds to hypomethylation in females from the early selection line relative to females from the late selections line and vice versa. Full name of genes: CAS1 domain containing 1 (CASD1); myostatin (MSTN); chromosome 7 C2orf88 homologue (C7H2orf88); phosphatidylinositol‐4,5‐bisphosphate 3‐kinase catalytic subunit beta (PIK3CB).

a

Marks CpG sites that are annotated to more than one genomic region.

Enrichment analysis of genes that overlap DMS (n = 25) provided limited insights, likely due to the low number of genes. Only if we decreased the number of genes required per GO term to be included in the enrichment analysis to two genes, we found enriched GO terms and hence findings should be interpreted with care. We found nine enriched GO terms using human GOs (Table S11A) and seven enriched GO terms using chicken GOs (Table S11B) with three GO terms shared between species‐specific GO terms. GO terms were linked to general biological processes, including protein kinase B (PKB) signalling, that is linked to two genes encoding for the phosphatidylinositol‐4,5‐bisphosphate 3‐kinase catalytic subunit beta (PIK3CB, an intracellular signal transducer enzyme) and myostatin (MSTN, a growth differentiation factor). DMS in the promoter region of PIK3CB and in the 10 kb upstream region of MSTN also pass the more stringent significance threshold.

We performed a region‐based differential methylation analysis on 596 CGIs and did not identify any differentially methylated CGIs.

3.2. SNP outlier analyses

We implemented three analytical approaches to identify potential candidate SNPs underlying genomic selection for early and late lay dates and focused on the overlap between approaches; (i) Fst outlier analysis, (ii) a genome scan for selection based on PCA, and (iii) redundancy analysis (Table S12).

We estimated the Fst coefficients for 451,600 SNPs with BayeScan and identified four Fst outliers located on chromosomes 4 and 4A (q‐value < 0.05; Figure 3 and Table 2). While the two SNPs on chromosome 4 were located within intergenic regions, the two SNPs on chromosome 4A were located within the 10 kb upstream region of the transcription factor SOX‐3‐like gene (LOC107203824; SOX3‐like) and the gene body of the nik related kinase gene (NRK). F st outliers and DMS showed no overlap in physical location on the genome or associated genes.

FIGURE 3.

FIGURE 3

Manhattan plots for the SNP outlier analyses. F st coefficients estimated with BayeScan (a), PCA‐based genome scan SNP loadings (b) and redundancy analysis SNP loadings (c) between females from the early (n = 10) and late (n = 10) selection line for each SNP (n = 451,600 for a and b, n = 164,959 for c). All SNPs above the significance thresholds are highlighted by increased plot symbol size. Outlier SNPs detected as outliers with all three approaches (or with BayeScan and pcadapt for SNPs with missing genotypes) are highlighted in orange. Tool‐specific outlier SNPs are highlighted in blue. SNPs detected as outliers with pcadapt and redundancy analysis are highlighted in yellow. ‘Sc’ (x‐axis) refers to unplaced scaffolds.

TABLE 2.

Ooutlier SNPs.

SNP F st q‐Value Allele frequencies Genomic region Gene symbol
chr4_39145992_T:C 0.24 2.40E‐04 E = 0.00:1.00, L = 0.85:0.15 Intergenic NA
chr4_39145990_T:C 0.23 6.20E‐04 E = 0.00:1.00, L = 0.83:0.17 Intergenic NA
chr4A_16892597_C:T 0.21 5.52E‐03 E = 0.25:0.75, L = 1.00:0.00 10 kb upstream LOC107203824 (SOX3‐like)

Note: For SNPs without missing data we defined outlier SNPs as SNPs that were detected as outliers in all three approaches. For SNPs with missing genotypes we defined outlier SNPs as SNPs that were detected as outliers in the F st outlier analysis and the PCA‐based genome scan. The SNP identifier (chromosome, position and reference:alternative allele), F st coefficient, F st q‐value, allele frequencies (reference:alternative) for early (E) and late (L) selection line females, genomic region and gene symbol of the annotated gene are shown. Full name of gene: transcription factor SOX‐3‐like (LOC107203824; SOX3‐like).

We found a larger number of F st outliers with Arlequin (n = 2230 with p‐value < 0.01), which is in line with the large number of F st outliers (n = 4786 with p‐value < 0.01) previously identified in Verhagen, Gienapp, et al. (2019). When focusing on SNPs that are shared in both data sets (n = 31,423), 156 and 391 Arlequin F st outlier SNPs remained for SNPs used here and SNPs used in Verhagen, Gienapp, et al. (2019), of which 46 SNPs are shared. This indicates that the significance threshold used (p‐value < 0.01) is rather tolerant for the number of SNPs tested. However, F st outlier analysis with Arlequin and BayeScan performed here identified the same set of SNPs as outlier SNPs with the largest effect sizes (Figure S4).

The PCA‐based genome scan for selection on 451,600 SNPs implemented with pcadapt revealed that only the first PC was significantly associated with selection line (p‐value = 2.04E‐10, all other PCs p‐value > 0.05). Based on correlations between SNPs and the first PC, we identified 47 SNPs with outlier loadings.

In contrast to the F st outlier analysis and the PCA‐based genome scan for selection, redundancy analysis cannot handle missing data which reduced the SNP data set to 164,959 SNPs. Redundancy analysis showed that selection line was indeed a significant explanatory variable (F‐statistic = 1.56, Df = 1, p‐value = 0.001) that explained 8.0% of the total variance in the SNP data. Based on the SNP‐specific loadings on the first axis, we identified 14 SNPs with outlier loading score (significance threshold: mean ± 3sd with mean = 1.20*10−7 and sd = 0.029, Figure 3).

When focusing on the overlap between approaches, we identified three outlier SNPs (SNP without missing genotypes: chr4A_16892597_C:T; SNPs with missing genotypes: chr4_39145992_T:C and chr4_39145990_T:C, Table 2) While the two SNPs on chromosome 4 were intergenic, the SNP on chromosome 4A was located within the 10 kb upstream region of LOC107203824 (SOX3‐like).

4. DISCUSSION

Identifying the genetic variants and genomic pathways underlying complex quantitative traits is a major challenge in modern evolutionary biology. As many highly polygenic traits are expressed in interaction with the environment (e.g., Gienapp et al., 2017), this has proven challenging in wild populations. A unique genomic selection experiment for early and late avian lay date (Gienapp et al., 2019; Verhagen, Gienapp, et al., 2019) enabled us to identify both genetic and epigenetic differentiation between females of the early and late lay selection line and, this way, gain insights into the genetic variants and genomic pathways that might be involved in adaptation of a wild bird population.

4.1. Biological relevance of identified DMS and outlier SNPs

When comparing females from the early and late genomic selection lines for lay date, we found evidence for both epigenetic and genetic differentiation in response to genomic selection. Overall, differentially methylated CpG sites (DMS) and outlier SNPs were located within genes that have known functions for reproduction. The biological relevance of genes with DMS mostly concerned ovarian functioning (e.g., PIK3CB, C7H2orf88 and MSTN), while the biological relevance of genes with outlier SNPs mostly concerned brain development and functioning (LOC107203824; SOX3‐like).

Genes with differential DNA methylation in response to genomic selection for lay dates mostly function downstream in the neuro‐endocrine system along the HPGL axis (i.e., in the ovary), indicating that genetic variation in DNA methylation potentially acts to fine‐tune lay dates (Lindner, Laine, et al., 2021; Verhagen, Laine, et al., 2019). Genetic differentiation, in contrast, was located within genes that mostly function at the stage where the neuro‐endocrine cascade is activated (i.e., upstream in the neuro‐endocrine system along the HPGL axis).

Genes with DMS have predicted functions in a variety of biological processes including ovarian and reproductive functioning, especially when focusing on DMS that passed the Bonferroni‐corrected significance threshold (CASD1, MSTN, C7H2orf88 and PIK3CB). GO analysis pointed towards two genes encoding for an intracellular signal transducer enzyme (PIK3CB) and a growth differentiation factor (MSTN), that are of relevance for the protein kinase B (PKB) signalling pathway, a pathway associated with the survival of follicle cells. For example, inactivation of the PKB signalling pathway in chicken granulosa cells of the three largest preovulatory follicles led to oligonucleosome formation, which characterizes ovarian follicle atresia (Johnson et al., 2001). Although follicle atresia is a common process in pre‐hierarchical follicles, follicles that have been selected into the preovulatory hierarchy are committed to ovulate and rarely undergo follicle atresia (Johnson & Woods, 2009), highlighting the essential role for PKB‐mediated inhibition of follicle atresia in preovulatory follicles for ovarian functioning.

PIK3CB (p100β), together with PIK3CA (p100α) and PIK3CD (p100δ), builds the catalytic p110 subunit that characterizes a class of phosphatidylinositol 3‐kinase (PI3K) proteins that are primarily responsible for the production of phosphatidylinositol in response to growth factors (Cantley, 2002; Zhang et al., 2020). In general, PI3K signalling pathways play crucial roles in cell growth, cell survival and cell movement (Cantley, 2002), but are known to have more specific roles in reproductive functioning such as ovarian follicle development (Li et al., 2021). For example, PI3K signalling pathways in neuronal leptin receptors, that harbour catalytic subunits PIK3CA (p100α) and PIK3CB (p100β), are crucial for pubertal maturation and reproductive functioning in mice (Garcia‐Galiano et al., 2017, 2019) and the PIK3 and cAMP/protein kinase A (PKA) signalling is involved in the crosstalk between the transforming growth factor (TGF) β1 and follicle stimulating hormone (FSH) that mediates steroidogenesis in ovarian granulosa cells in rats (Chen et al., 2007). Furthermore, C7H2orf88 encodes for the chromosome 7 C2orf88 homologue that is also known as smAKAP, encoding for the small membrane A‐kinase anchor protein. smAKAP constitutes a small protein that is directly anchored to membranes by acyl motifs and almost exclusively interacts with the type I regulatory subunits of cAMP/protein kinase A (PKA) signalling (Burgers et al., 2016), a key signalling pathway during the ovulation process (Shimada & Yamashita, 2011).

MSTN, also known as growth and differentiation factor 8 (GDF8) encodes a transcriptional growth factor that actively represses skeletal muscle growth (Kollias & McDermott, 2008; McPherron et al., 1997) and as such has been of much interest in livestock research (Bellinge et al., 2005). Genetic manipulation of MSTN in livestock resulted in increased muscle mass (e.g., the double‐muscled phenotype in cattle; Grobet et al., 1997), but compromised fertility, calf viability and stress susceptibility (Han et al., 2021). In quail, a three base‐pair deletion in MSTN resulted in amino acid deletion in the MSTN peptide and led to increased body weight and muscle mass (Lee et al., 2020) as well as delayed lay dates, higher egg weight and lower number of eggs produced during the active laying period (Lee et al., 2021) in homozygous mutant quail relative to heterozygous mutant and wild‐type quail. These studies in quail exemplify the relevance of MSTN for reproductive traits in seasonally breeding birds, including lay dates in quail.

CASD1 (encoding for CAS1 domain containing 1) is also known as sialate O‐acetyltransferase (SOAT) that catalyses the 9‐O‐acetylation of sialic acids that constitute sugars at the reducing end of glycoproteins and glycolipids (Arming et al., 2011). SOAT has known to have an important role in lipid trafficking and utilization during embryonic development. For example, zebrafish embryos injected with a SOAT inhibitor showed a lower rate of yolk consumption indicating that SOAT is catalytically active in the yolk cholesterol trafficking during embryogenesis (Chang et al., 2016). In the chicken embryo, SOAT is responsible for the rapid esterification of a large proportion of yolk cholesterol, a process essential for lipid uptake from the yolk by the yolk sac membrane during embryonic development (Shand et al., 1993; Wang et al., 2017). While the role of SOAT in lipid trafficking and utilization during embryonic development within the egg is well understood for oviparous species, including birds, it is unclear whether SOAT is of any relevance for lipid deposition during the production of eggs.

For outlier SNPs, we focus on outliers detected with all three approaches and found one gene with an outlier SNP; LOC107203824 (SOX3‐like). The transcription factor SOX‐3 encoded by LOC107203824 (SOX3‐like) is crucial for the development and functioning of the hypothalamo‐pituitary axis (Rizzoti et al., 2004; Szeliga et al., 2021) which is an essential component of the hypothalamic–pituitary–gonadal–liver axis (HPGL axis). The HPGL axis mediates the neuro‐endocrine cascade that, in seasonally breeding birds, activates the onset of gonadal growth in response to increasing photoperiods (Dawson et al., 2001; Williams, 2012). Consequently, reproductive functioning is critically dependent on normal development and functioning of the hypothalamo‐pituitary axis. For example, a genetic variant in SOX3 in humans was linked to normosmic idiopathic hypogonadotropic hypogonadism, a form of isolated gonadotropin‐releasing hormone (GnRH) deficiency (Kim et al., 2019). Next to the brain, SOX3 is also expressed in the ovary where it is required for gonadal functioning. For example, double SOX3 deletion in female mice led to excess follicular atresia, ovulation of defective oocytes, and severely reduced fertility (Weiss et al., 2003). SOX‐3 was also identified as one of eight master transcription factors driving folliculogenesis in mice (Bian et al., 2021) and was found to inhibit apoptosis during follicle development in zebrafish leading to improved fecundity (Hong et al., 2019). In general, SOX3 constitutes a gene of high relevance for reproductive functioning, that is crucial for normal development and functioning of the hypothalamo‐pituitary axis.

4.2. Hypomethylation in the early selection line relative to the late selection line

The majority of DMS showed hypomethylation in females from the early selection line relative to females from the late selection line. However, the functional relevance of a general hypomethylation in females from the early selection line is unclear as CpG sites are located within many genes and within different genomic locations of genes. The association of CpG site methylation and gene expression is likely to differ between genomic features (Laine et al., 2016), making it difficult to derive general conclusions on the functional relevance. Considering a previously reported association between changes in DNA methylation throughout the breeding season and the onset of lay dates in great tit (Lindner, Laine, et al., 2021), it is possible that the observed hypomethylation in the early selection line is related to differences in seasonal changes in DNA methylation between the early and late selection line. However, the CpG sites at which DNA methylation is known to change with seasonality account for a small proportion of CpG sites (e.g., 35 of 5097 CpG sites (Lindner, Laine, et al., 2021)) and hence is unlikely to account for the overall hypomethylation in the early selection line observed here. Due to the lack of replicated selection lines, it is not possible for us to exclude the possibility that the overall hypomethylation in the early selection line is a consequence of genetic drift.

4.3. Epigenetic differentiation is hypothesized to be a correlated response to genetic differentiation following genomic selection

We here hypothesize that the observed differentiation in DNA methylation was accompanying the genetic differentiation induced by the genomic selection experiment. In other words, we hypothesize that the variants in DNA methylation were inherited with genetic variants that arose following genomic selection rather than inherited independently of genetic variants. While we were not able to explicitly test for an association between DNA methylation and genetic variants, this hypothesis is in line with findings by van Oers et al. (2020) where differences in DNA methylation between selection lines for exploratory behaviour in great tits were explained by genetic differences rather than spontaneous epi‐mutations. Furthermore, many recent studies have shown that a large proportion of variants in DNA methylation is indeed dependent on genetic variants. For example, in Arabidopsis thaliana the variation in CHH methylation in transposable elements was strongly associated with cis‐ and trans‐acting genetic variants (Dubin et al., 2015), in inter‐crosses between wild derived red junglefowl and domestic chickens >45% of mapped trait loci were controlled by five trans‐acting loci mainly associated with an increase in hypothalamic DNA methylation in red junglefowl genotypes (Höglund et al., 2020) and in great tit nestlings DNA methylation in early life is largely determined by genetic effects (common origin) rather than environmental effects (common rearing environment) (Sepers et al., 2023). This dependency of DNA methylation on genetic variation is also supported by studies showing that more closely related individuals are more similar in their methylation patterns than unrelated individuals (Lea et al., 2017; van Oers et al., 2020; Viitaniemi et al., 2019). There are, however, other sources of DNA methylation that concern the environmental induction of DNA methylation and spontaneous epi‐mutations. While such epi‐mutations are indeed established in plants (Feil & Fraga, 2012), there is limited evidence for their stable inheritance in vertebrates, or more specifically, in avian species (e.g., Sepers et al., 2019). In vertebrates, it is unclear how such epi‐mutations would escape the extensive reprogramming of DNA methylation that, for example in mammals, takes place during the fertilization of the zygote and in the primordial germ cell (the progenitors of sperm cells and oocytes; Seisenberger et al., 2013). Nevertheless, there are some comparative studies that imply a role for epigenetic variation in evolution, assuming that epigenetic variants are inherited independently of genetic variants. For example, epigenetic variants were more common among several closely related species of Darwin's finches than genetic variants in the form of copy number variation and there was no apparent overlap between epigenetic variants and genetic variants suggesting that epigenetic variants are distinct and correlate with the evolutionary history of Darwin's finches (Skinner & Guerrero‐Bosagna, 2009). Although it cannot be excluded that epigenetic variants are dependent on genetic variants, as this was not formally assessed, it is likely that epigenetic changes contribute to the molecular basis of the evolution of Darwin's finches. Hence, the study shows that DNA methylation might be inherited and contributes to evolution in its own right, that is independently of genetic variants, indicating that our interpretation of the observed differentiation in DNA methylation as a response that accompanied the genetic differentiation might be too simplistic.

4.4. Caveats and future challenges

We designed our study with great care, but there are some limitations that we need to address here. Firstly, our experimental design did not include any replicate lines (Verhagen, Gienapp, et al., 2019), limiting our ability to differentiate selection as the underlying cause of the reported epigenetic and genetic differentiation from genetic drift. We, however observe increased differentiation over generations between selection lines in GEBVs (genetic level) and lay dates (phenotypic level) (Lindner et al., 2023; Verhagen, Gienapp, et al., 2019), indicating that the genomic selection lines offer a suitable contrast between early and late avian lay date for the study of genetic variants and genomic pathways that might be involved in mediating avian lay date.

We succeeded to identify DMS and outlier SNPs of genome‐wide significance, but our statistical power was limited with a sample size of 9–10 females per selection line. This means that we were only able to identify DMS and outlier SNPs with very large effect sizes, resulting in an incomplete list of candidates DMS and outlier SNPs.

We hypothesize that the observed epigenetic differentiation arose as a consequence of genetic differentiation in response to genomic selection for lay date. While, in theory, it is possible to test for an association between genetic variants and epigenetic variants and estimate the proportion of variation in DNA methylation that is explained by genetic variation (Dubin et al., 2015; Höglund et al., 2020), our limited sample size did not provide us with the statistical power needed for such an analysis. To increase sample size, we could have chosen an alternative and more cost‐efficient sequencing approach, such as reduced representation bisulfite sequencing (RRBS) (e.g., Gu et al., 2011; Meissner et al., 2008). However, we did not have an a priori expectation as of where within the genome the genomic selection experiment would induce differentiation in DNA methylation and, as such, did not want to limit our analysis to a reduced and bias representation of the genome.

We used blood samples, rather than tissues of obvious relevance for reproduction (e.g., tissues within the HPGL axis). While information on DNA methylation in blood per se is of little relevance for reproductive functioning, DNA methylation in blood can reflect DNA methylation in other tissues, when DNA methylation is induced in a tissue‐general manner (Lindner, Verhagen, et al., 2021). In red blood cells and liver samples of great tits, (between‐individual) change in DNA methylation over time was established in a tissue‐general and tissue‐specific manner, indicating that, at specific CpG sites, DNA methylation can indeed change in a tissue‐general manner (Lindner, Verhagen, et al., 2021). However, DNA methylation is not exclusively established in a tissue‐general manner which means that these findings cannot be generalized and that more between‐tissue comparisons are needed to establish in which context(s) DNA methylation patterns in blood can reflect DNA methylation patterns in other tissues. Furthermore, the functional relevance for DNA methylation on altering gene expression is time‐ and context‐dependent (Laine et al., 2016; Stevenson & Prendergast, 2013). In the great tit, low levels of CpG site methylation (~20%) close to the transcription start site of genes are sufficient to shut down gene expression, while moderate levels of CpG site methylation (up to 60%) within the gene body allow for high gene expression (up to 1000 FPKM; Laine et al., 2016). Furthermore, DNA methylation can be variable over time (Lindner, Laine, et al., 2021; Stevenson & Prendergast, 2013; Viitaniemi et al., 2019) and as such could lead to temporal variation in the expression of affected genes. However, specifically in the case of temporarily variable DNA methylation, DNA methylation may not exclusively act as a cause of gene expression, but can be a result of downstream consequences of gene expression or phenotypes. For example, Mycobacterium tuberculosis infection of human dendritic cells is accompanied by changes in CpG site methylation overlapping distal enhancer elements, but changes in CpG site methylation are preceded by changes in gene expression, indicating that identified changes in CpG site methylation might be a downstream consequence of transcriptional activation (Pacis et al., 2019). Due to the complexity underlying the effects of DNA methylation on gene expression (and likely vice versa), we do not know whether the identified DMS indeed affect the expression of genes and in which tissues they may have an effect. On a similar note, identified outlier SNPs can have cis‐ and/or trans‐effects on gene expression and above we discussed the identified SNP in the context of cis‐effects. Assessing the functional relevance of DMS and outlier SNPs on gene expression would require samples and gene expression profiling of the relevant tissues (e.g., tissues within the HPGL axis) which was outside the scope of this study. Furthermore, establishing causal links between DMS or outlier SNPs and lay dates would require experimental validation using functional tools, but, at this point, the application of such functional tools is not feasible in vertebrate non‐model organisms such as the great tit.

5. CONCLUSION

We observed genetic and epigenetic differentiation between females from genomic selection lines for early and late avian lay date. Biological functions of genes with such differentiation hint towards a complementary function of DNA methylation and genetic variants, such that DNA methylation might act downstream of the neuro‐endocrine system underlying lay dates, while genetic variants might act at the activation of the neuro‐endocrine system. However, functional validation is required to establish whether identified genetic and epigenetic differentiation is affecting gene expression and reproductive functioning. Nevertheless, the genomic selection experiment for avian lay dates provides insights into where within the genome heritable genetic variation for lay date resides and shows that a part of this variation might be reflected by epigenetic variants.

CONFLICT OF INTEREST STATEMENT

The authors have no conflict of interest to declare.

Supporting information

Data S1.

EVA-17-e13703-s001.zip (70.2MB, zip)

ACKNOWLEDGEMENTS

We thank Agata Pijl and colleagues in the molecular laboratory at NIOOK‐NAW for support with the extraction of DNA, Bart van Lith and Ruben de Wit for assistance during the experiments and the animal carers at the NIOO‐KNAW for taking care of the birds.

Lindner, M. , Verhagen, I. , Mateman, A. C. , van Oers, K. , Laine, V. N. , & Visser, M. E. (2024). Genetic and epigenetic differentiation in response to genomic selection for avian lay date. Evolutionary Applications, 17, e13703. 10.1111/eva.13703

DATA AVAILABILITY STATEMENT

WGBS data are available at the NCBI BioProject database (http://www.ncbi.nlm.nih.gov/bioproject/) under BioProject PRJNA208335 and accession numbers SRR15410225 to SRR15410232 and SRR21783914 to SRR21783973. Scripts for the bioinformatic pipelines and statistical analyses are published on GitHub (https://github.com/MLindner0/GenomicSelection_AvianLayDate_WGBS).

REFERENCES

  1. Akalin, A. , Kormaksson, M. , Li, S. , Garrett‐Bakelman, F. E. , Figueroa, M. E. , Melnick, A. , & Mason, C. E. (2012). methylKit: A comprehensive R package for the analysis of genome‐wide DNA methylation profiles. Genome Biology, 13, R87. 10.1186/gb-2012-13-10-r87 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Anaconda Software Distribution . (2016). Conda. Version 4.8.4, Anaconda.
  3. Andrew, S. (2010). FastQC: A quality control tool for high throughput sequence data [online].
  4. Arming, S. , Wipfler, D. , Mayr, J. , Merling, A. , Vilas, U. , Schauer, R. , Schwartz‐Albiez, R. , & Vlasak, R. (2011). The human Cas1 protein: A sialic acid‐specific O‐acetyltransferase? Glycobiology, 21, 553–564. 10.1093/glycob/cwq153 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Aulchenko, Y. S. , Ripke, S. , Isaacs, A. , & van Duijn, C. M. (2007). GenABEL: An R library for genome‐wide association analysis. Bioinformatics, 23, 1294–1296. 10.1093/bioinformatics/btm108 [DOI] [PubMed] [Google Scholar]
  6. Barson, N. J. , Aykanat, T. , Hindar, K. , Baranski, M. , Bolstad, G. H. , Fiske, P. , Jacq, C. , Jensen, A. J. , Johnston, S. E. , Karlsson, S. , Kent, M. , Moen, T. , Niemelä, E. , Nome, T. , Næsje, T. F. , Orell, P. , Romakkaniemi, A. , Sægrov, H. , Urdal, K. , … Primmer, C. R. (2015). Sex‐dependent dominance at a single locus maintains variation in age at maturity in salmon. Nature, 528, 405–408. 10.1038/nature16062 [DOI] [PubMed] [Google Scholar]
  7. Bates, D. , Mächler, M. , Bolker, B. , & Walker, S. (2015). Fitting linear mixed‐effects models using lme4. Journal of Statistical Software, 67, 1–48. 10.18637/jss.v067.i01 [DOI] [Google Scholar]
  8. Bellinge, R. H. S. , Liberles, D. A. , Iaschi, S. P. A. , O'Brien, P. A. , & Tay, G. K. (2005). Myostatin and its implications on animal breeding: A review. Animal Genetics, 36, 1–6. 10.1111/j.1365-2052.2004.01229.x [DOI] [PubMed] [Google Scholar]
  9. Benjamini, Y. , & Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society: Series B: Methodological, 57, 289–300. 10.1111/j.2517-6161.1995.tb02031.x [DOI] [Google Scholar]
  10. Bian, X. , Xie, Q. , Zhou, Y. , Wu, H. , Cui, J. , Jia, L. , & Suo, L. (2021). Transcriptional changes of mouse ovary during follicle initial or cyclic recruitment mediated by extra hormone treatment. Life Sciences, 264, 118654. 10.1016/j.lfs.2020.118654 [DOI] [PubMed] [Google Scholar]
  11. Bindea, G. , Mlecnik, B. , Hackl, H. , Charoentong, P. , Tosolini, M. , Kirilovsky, A. , Fridman, W.‐H. , Pagès, F. , Trajanoski, Z. , & Galon, J. (2009). ClueGO: A Cytoscape plug‐in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics, 25, 1091–1093. 10.1093/bioinformatics/btp101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bosse, M. , Spurgin, L. G. , Laine, V. N. , Cole, E. F. , Firth, J. A. , Gienapp, P. , Gosler, A. G. , McMahon, K. , Poissant, J. , Verhagen, I. , Groenen, M. A. M. , van Oers, K. , Sheldon, B. C. , Visser, M. E. , & Slate, J. (2017). Recent natural selection causes adaptive evolution of an avian polygenic trait. Science, 358, 365–368. 10.1126/science.aal3298 [DOI] [PubMed] [Google Scholar]
  13. Burgers, P. P. , Bruystens, J. , Burnley, R. J. , Nikolaev, V. O. , Keshwani, M. , Wu, J. , Janssen, B. J. C. , Taylor, S. S. , Heck, A. J. R. , & Scholten, A. (2016). Structure of smAKAP and its regulation by PKA‐mediated phosphorylation. The FEBS Journal, 283, 2132–2148. 10.1111/febs.13726 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Caizergues, A. E. , Le Luyer, J. , Grégoire, A. , Szulkin, M. , Senar, J. , Charmantier, A. , & Perrier, C. (2022). Epigenetics and the city: Non‐parallel DNA methylation modifications across pairs of urban‐forest great tit populations. Evolutionary Applications, 15, 149–165. 10.1111/eva.13334 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Cantley, L. C. (2002). The phosphoinositide 3‐kinase pathway. Science, 296, 1655–1657. 10.1126/science.296.5573.1655 [DOI] [PubMed] [Google Scholar]
  16. Caro, S. P. , Schaper, S. V. , Hut, R. A. , Ball, G. F. , & Visser, M. E. (2013). The case of the missing mechanism: How does temperature influence seasonal timing in endotherms? PLoS Biology, 11, e1001517. 10.1371/journal.pbio.1001517 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Chang, N.‐Y. , Chan, Y.‐J. , Ding, S.‐T. , Lee, Y.‐H. , HuangFu, W.‐C. , & Liu, I.‐H. (2016). Sterol O‐acyltransferase 2 contributes to the yolk cholesterol trafficking during zebrafish embryogenesis. PLoS One, 11, e0167644. 10.1371/journal.pone.0167644 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Charmantier, A. , Garant, D. , & Kruuk, L. E. B. (2014). Quantitative genetics in the wild. Oxford University Press. [Google Scholar]
  19. Charmantier, A. , & Gienapp, P. (2014). Climate change and timing of avian breeding and migration: Evolutionary versus plastic changes. Evolutionary Applications, 7, 15–28. 10.1111/eva.12126 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Chen, Y.‐J. , Hsiao, P.‐W. , Lee, M.‐T. , Mason, J. I. , Ke, F.‐C. , & Hwang, J.‐J. (2007). Interplay of PI3K and cAMP/PKA signaling, and rapamycin‐hypersensitivity in TGFβ1 enhancement of FSH‐stimulated steroidogenesis in rat ovarian granulosa cells. Journal of Endocrinology, 192, 405–419. 10.1677/JOE-06-0076 [DOI] [PubMed] [Google Scholar]
  21. Danecek, P. , Bonfield, J. K. , Liddle, J. , Marshall, J. , Ohan, V. , Pollard, M. O. , Whitwham, A. , Keane, T. , McCarthy, S. A. , Davies, R. M. , & Li, H. (2021). Twelve years of SAMtools and BCFtools. GigaScience, 10, giab008. 10.1093/gigascience/giab008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Dawson, A. , King, V. M. , Bentley, G. E. , & Ball, G. F. (2001). Photoperiodic control of seasonality in birds. Journal of Biological Rhythms, 16, 365–380. 10.1177/074873001129002079 [DOI] [PubMed] [Google Scholar]
  23. DePristo, M. A. , Banks, E. , Poplin, R. , Garimella, K. V. , Maguire, J. R. , Hartl, C. , Philippakis, A. A. , del Angel, G. , Rivas, M. A. , Hanna, M. , McKenna, A. , Fennell, T. J. , Kernytsky, A. M. , Sivachenko, A. Y. , Cibulskis, K. , Gabriel, S. B. , Altshuler, D. , & Daly, M. J. (2011). A framework for variation discovery and genotyping using next‐generation DNA sequencing data. Nature Genetics, 43, 491–498. 10.1038/ng.806 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Derks, M. F. L. , Schachtschneider, K. M. , Madsen, O. , Schijlen, E. , Verhoeven, K. J. F. , & van Oers, K. (2016). Gene and transposable element methylation in great tit (Parus major) brain and blood. BMC Genomics, 17, 332. 10.1186/s12864-016-2653-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Drent, P. J. , van Oers, K. , & van Noordwijk, A. J. (2003). Realized heritability of personalities in the great tit (Parus major). Proceedings of the Royal Society of London B, 270, 45–51. 10.1098/rspb.2002.2168 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Dubin, M. J. , Zhang, P. , Meng, D. , Remigereau, M.‐S. , Osborne, E. J. , Paolo Casale, F. , Drewe, P. , Kahles, A. , Jean, G. , Vilhjálmsson, B. , Jagoda, J. , Irez, S. , Voronin, V. , Song, Q. , Long, Q. , Rätsch, G. , Stegle, O. , Clark, R. M. , & Nordborg, M. (2015). DNA methylation in Arabidopsis has a genetic basis and shows evidence of local adaptation. eLife, 4, e05255. 10.7554/eLife.05255 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Ewels, P. , Magnusson, M. , Lundin, S. , & Käller, M. (2016). MultiQC: Summarize analysis results for multiple tools and samples in a single report. Bioinformatics, 32, 3047–3048. 10.1093/bioinformatics/btw354 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Excoffier, L. , & Lischer, H. E. L. (2010). Arlequin suite ver 3.5: A new series of programs to perform population genetics analyses under Linux and windows. Molecular Ecology Resources, 10, 564–567. 10.1111/j.1755-0998.2010.02847.x [DOI] [PubMed] [Google Scholar]
  29. Feil, R. , & Fraga, M. F. (2012). Epigenetics and the environment: Emerging patterns and implications. Nature Reviews. Genetics, 13, 97–109. 10.1038/nrg3142 [DOI] [PubMed] [Google Scholar]
  30. Foll, M. , & Gaggiotti, O. (2008). A genome‐scan method to identify selected loci appropriate for both dominant and codominant markers: A Bayesian perspective. Genetics, 180, 977–993. 10.1534/genetics.108.092221 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Forester, B. R. , Jones, M. R. , Joost, S. , Landguth, E. L. , & Lasky, J. R. (2016). Detecting spatial genetic signatures of local adaptation in heterogeneous landscapes. Molecular Ecology, 25, 104–120. 10.1111/mec.13476 [DOI] [PubMed] [Google Scholar]
  32. Garcia‐Galiano, D. , Borges, B. C. , Allen, S. J. , & Elias, C. F. (2019). pi 3K signalling in leptin receptor cells: Role in growth and reproduction. Journal of Neuroendocrinology, 31, e12685. 10.1111/jne.12685 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Garcia‐Galiano, D. , Borges, B. C. , Donato, J. , Allen, S. J. , Bellefontaine, N. , Wang, M. , Zhao, J. J. , Kozloff, K. M. , Hill, J. W. , & Elias, C. F. (2017). PI3Kα inactivation in leptin receptor cells increases leptin sensitivity but disrupts growth and reproduction. JCI Insight, 2, e96728. 10.1172/jci.insight.96728 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Gienapp, P. , Calus, M. P. L. , Laine, V. N. , & Visser, M. E. (2019). Genomic selection on breeding time in a wild bird population. Evolution Letters, 3, 142–151. 10.1002/evl3.103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Gienapp, P. , Laine, V. N. , Mateman, A. C. , van Oers, K. , & Visser, M. E. (2017). Environment‐dependent genotype‐phenotype associations in avian breeding time. Frontiers in Genetics, 8, 102. 10.3389/fgene.2017.00102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Gower, J. C. (1966). Some distance properties of latent root and vector methods used in multivariate analysis. Biometrika, 53, 325–338. [Google Scholar]
  37. Grant, P. R. , & Grant, B. R. (1995). Predicting microevolutionary responses to directional selection on heritable variation. Evolution, 49, 241–251. 10.1111/j.1558-5646.1995.tb02236.x [DOI] [PubMed] [Google Scholar]
  38. Grobet, L. , Royo Martin, L. J. , Poncelet, D. , Pirottin, D. , Brouwers, B. , Riquet, J. , Schoeberlein, A. , Dunner, S. , Ménissier, F. , Massabanda, J. , Fries, R. , Hanset, R. , & Georges, M. (1997). A deletion in the bovine myostatin gene causes the double–muscled phenotype in cattle. Nature Genetics, 17, 71–74. 10.1038/ng0997-71 [DOI] [PubMed] [Google Scholar]
  39. Gruber, B. , Unmack, P. J. , Berry, O. F. , & Georges, A. (2018). dartr: An r package to facilitate analysis of SNP data generated from reduced representation genome sequencing. Molecular Ecology Resources, 18, 691–699. 10.1111/1755-0998.12745 [DOI] [PubMed] [Google Scholar]
  40. Gu, H. , Smith, Z. D. , Bock, C. , Boyle, P. , Gnirke, A. , & Meissner, A. (2011). Preparation of reduced representation bisulfite sequencing libraries for genome‐scale DNA methylation profiling. Nature Protocols, 6, 468–481. 10.1038/nprot.2010.190 [DOI] [PubMed] [Google Scholar]
  41. Guo, W. , Zhu, P. , Pellegrini, M. , Zhang, M. Q. , Wang, X. , & Ni, Z. (2018). CGmapTools improves the precision of heterozygous SNV calls and supports allele‐specific methylation detection and visualization in bisulfite‐sequencing data. Bioinformatics, 34, 381–387. 10.1093/bioinformatics/btx595 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Han, S. , Li, Z. , Paek, H. , Choe, H. , Yin, X. , & Quan, B. (2021). Reproduction traits of heterozygous myostatin knockout sows crossbred with homozygous myostatin knockout boars. Reproduction in Domestic Animals, 56, 26–33. 10.1111/rda.13845 [DOI] [PubMed] [Google Scholar]
  43. Hill, W. G. , & Caballero, A. (1992). Artificial selection experiments. Annual Review of Ecology and Systematics, 23, 287–310. [Google Scholar]
  44. Höglund, A. , Henriksen, R. , Fogelholm, J. , Churcher, A. M. , Guerrero‐Bosagna, C. M. , Martinez‐Barrio, A. , Johnsson, M. , Jensen, P. , & Wright, D. (2020). The methylation landscape and its role in domestication and gene regulation in the chicken. Nature Ecology & Evolution, 4, 1713–1724. 10.1038/s41559-020-01310-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Hong, Q. , Li, C. , Ying, R. , Lin, H. , Li, J. , Zhao, Y. , Cheng, H. , & Zhou, R. (2019). Loss‐of‐function of sox3 causes follicle development retardation and reduces fecundity in zebrafish. Protein & Cell, 10, 347–364. 10.1007/s13238-018-0603-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Husby, A. , Kawakami, T. , Rönnegård, L. , Smeds, L. , Ellegren, H. , & Qvarnström, A. (2015). Genome‐wide association mapping in a wild avian population identifies a link between genetic and phenotypic variation in a life‐history trait. Proceedings of the Royal Society B, 282, 20150156. 10.1098/rspb.2015.0156 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Jannink, J.‐L. , Lorenz, A. J. , & Iwata, H. (2010). Genomic selection in plant breeding: From theory to practice. Briefings in Functional Genomics, 9, 166–177. 10.1093/bfgp/elq001 [DOI] [PubMed] [Google Scholar]
  48. Johnson, A. L. , Bridgham, J. T. , & Swenson, J. A. (2001). Activation of the Akt/protein kinase B signaling pathway is associated with granulosa cell survival. Biology of Reproduction, 64, 1566–1574. 10.1095/biolreprod64.5.1566 [DOI] [PubMed] [Google Scholar]
  49. Johnson, A. L. , & Woods, D. C. (2009). Dynamics of avian ovarian follicle development: Cellular mechanisms of granulosa cell differentiation. General and Comparative Endocrinology, 163, 12–17. 10.1016/j.ygcen.2008.11.012 [DOI] [PubMed] [Google Scholar]
  50. Johnston, S. E. , Bérénos, C. , Slate, J. , & Pemberton, J. M. (2016). Conserved genetic architecture underlying individual recombination rate variation in a wild population of Soay sheep (Ovis aries). Genetics, 203, 583–598. 10.1534/genetics.115.185553 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Jombart, T. (2008). Adegenet: A R package for the multivariate analysis of genetic markers. Bioinformatics, 24, 1403–1405. 10.1093/bioinformatics/btn129 [DOI] [PubMed] [Google Scholar]
  52. Jombart, T. , & Ahmed, I. (2011). Adegenet 1.3‐1: New tools for the analysis of genome‐wide SNP data. Bioinformatics, 27, 3070–3071. 10.1093/bioinformatics/btr521 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Kim, J. H. , Seo, G. H. , Kim, G.‐H. , Huh, J. , Hwang, I. T. , Jang, J.‐H. , Yoo, H.‐W. , & Choi, J.‐H. (2019). Targeted gene panel sequencing for molecular diagnosis of Kallmann syndrome and Normosmic idiopathic hypogonadotropic hypogonadism. Experimental and Clinical Endocrinology & Diabetes, 127, 538–544. 10.1055/a-0681-6608 [DOI] [PubMed] [Google Scholar]
  54. Kim, J. M. , Santure, A. W. , Barton, H. J. , Quinn, J. L. , Cole, E. F. , Great Tit HapMap Consortium , Visser, M. E. , Sheldon, B. C. , Groenen, M. A. M. , van Oers, K. , & Slate, J. (2018). A high‐density SNP chip for genotyping great tit (Parus major) populations and its application to studying the genetic architecture of exploration behaviour. Molecular Ecology Resources, 18, 877–891. 10.1111/1755-0998.12778 [DOI] [PubMed] [Google Scholar]
  55. Kollias, H. D. , & McDermott, J. C. (2008). Transforming growth factor‐β and myostatin signaling in skeletal muscle. Journal of Applied Physiology, 104, 579–587. 10.1152/japplphysiol.01091.2007 [DOI] [PubMed] [Google Scholar]
  56. Koster, J. , & Rahmann, S. (2012). Snakemake–a scalable bioinformatics workflow engine. Bioinformatics, 28, 2520–2522. 10.1093/bioinformatics/bts480 [DOI] [PubMed] [Google Scholar]
  57. Krueger, F. , & Andrews, S. R. (2011). Bismark: A flexible aligner and methylation caller for bisulfite‐seq applications. Bioinformatics, 27, 1571–1572. 10.1093/bioinformatics/btr167 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Kruuk, L. E. B. , Slate, J. , Pemberton, J. M. , Brotherstone, S. , Guinness, F. , & Clutton‐Brock, T. (2002). Antler size in red deer: Heritability and selection but no evolution. Evolution, 56, 1683–1695. 10.1111/j.0014-3820.2002.tb01480.x [DOI] [PubMed] [Google Scholar]
  59. Kruuk, L. E. B. , Slate, J. , & Wilson, A. J. (2008). New answers for old questions: The evolutionary quantitative genetics of wild animal populations. Annual Review of Ecology, Evolution, and Systematics, 39, 525–548. 10.1146/annurev.ecolsys.39.110707.173542 [DOI] [Google Scholar]
  60. Laine, V. N. , Gossmann, T. I. , Schachtschneider, K. M. , Garroway, C. J. , Madsen, O. , Verhoeven, K. J. F. , de Jager, V. , Megens, H.‐J. , Warren, W. C. , Minx, P. , Crooijmans, R. P. M. A. , Corcoran, P. , The Great Tit HapMap Consortium , Sheldon, B. C. , Slate, J. , Zeng, K. , van Oers, K. , Visser, M. E. , & Groenen, M. A. M. (2016). Evolutionary signals of selection on cognition from the great tit genome and methylome. Nature Communications, 7, 10474. 10.1038/ncomms10474 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Laine, V. N. , Sepers, B. , Lindner, M. , Gawehns, F. , Ruuskanen, S. , & Oers, K. (2023). An ecologist's guide for studying DNA methylation variation in wild vertebrates. Molecular Ecology Resources, 23, 1488–1508. 10.1111/1755-0998.13624 [DOI] [PubMed] [Google Scholar]
  62. Langmead, B. , & Salzberg, S. L. (2012). Fast gapped‐read alignment with bowtie 2. Nature Methods, 9, 357–359. 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Larsen, F. , Gundersen, G. , Lopez, R. , & Prydz, H. (1992). CpG islands as gene markers in the human genome. Genomics, 13, 1095–1107. 10.1016/0888-7543(92)90024-M [DOI] [PubMed] [Google Scholar]
  64. Lawrence, M. , Gentleman, R. , & Carey, V. (2009). Rtracklayer: An R package for interfacing with genome browsers. Bioinformatics, 25, 1841–1842. 10.1093/bioinformatics/btp328 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Lawrence, M. , Huber, W. , Pagès, H. , Aboyoun, P. , Carlson, M. , Gentleman, R. , Morgan, M. T. , & Carey, V. J. (2013). Software for computing and annotating genomic ranges. PLoS Computational Biology, 9, e1003118. 10.1371/journal.pcbi.1003118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Lea, A. J. , Vilgalys, T. P. , Durst, P. A. P. , & Tung, J. (2017). Maximizing ecological and evolutionary insight in bisulfite sequencing data sets. Nature Ecology & Evolution, 1, 1074–1083. 10.1038/s41559-017-0229-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Lee, J. , Kim, D.‐H. , Brower, A. M. , Schlachter, I. , & Lee, K. (2021). Effects of myostatin mutation on onset of laying, egg production, fertility, and hatchability. Animals, 11, 1935. 10.3390/ani11071935 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Lee, J. , Kim, D.‐H. , & Lee, K. (2020). Muscle hyperplasia in Japanese quail by single amino acid deletion in MSTN Propeptide. International Journal of Molecular Sciences, 21, 1504. 10.3390/ijms21041504 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Li, L. , Shi, X. , Shi, Y. , & Wang, Z. (2021). The signaling pathways involved in ovarian follicle development. Frontiers in Physiology, 12, 730196. 10.3389/fphys.2021.730196 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Lindner, M. , Gawehns, F. , te Molder, S. , Visser, M. E. , Oers, K. , & Laine, V. N. (2022). Performance of methods to detect genetic variants from bisulphite sequencing data in a non‐model species. Molecular Ecology Resources, 22, 834–846. 10.1111/1755-0998.13493 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Lindner, M. , Laine, V. N. , Verhagen, I. , Viitaniemi, H. M. , Visser, M. E. , van Oers, K. , & Husby, A. (2021). Rapid changes in DNA methylation associated with the initiation of reproduction in a small songbird. Molecular Ecology, 30, 3645–3659. 10.1111/mec.15803 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Lindner, M. , Ramakers, J. J. , Verhagen, I. , Tomotani, B. M. , Mateman, A. C. , Gienapp, P. , & Visser, M. E. (2023). Genotypes selected for early and late avian lay date differ in their phenotype, but not fitness, in the wild. Science Advances, 9, eade6350. 10.1126/sciadv.ade6350 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Lindner, M. , Verhagen, I. , Viitaniemi, H. M. , Laine, V. N. , Visser, M. E. , Husby, A. , & van Oers, K. (2021). Temporal changes in DNA methylation and RNA expression in a small song bird: Within‐ and between‐tissue comparisons. BMC Genomics, 22, 36. 10.1186/s12864-020-07329-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Lotterhos, K. E. , & Whitlock, M. C. (2014). Evaluation of demographic history and neutral parameterization on the performance of f st outlier tests. Molecular Ecology, 23, 2178–2192. 10.1111/mec.12725 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Luu, K. , Bazin, E. , & Blum, M. G. B. (2017). Pcadapt: An r package to perform genome scans for selection based on principal component analysis. Molecular Ecology Resources, 17, 67–77. 10.1111/1755-0998.12592 [DOI] [PubMed] [Google Scholar]
  76. Marrot, P. , Charmantier, A. , Blondel, J. , & Garant, D. (2018). Current spring warming as a driver of selection on reproductive timing in a wild passerine. The Journal of Animal Ecology, 87, 754–764. 10.1111/1365-2656.12794 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. McCullagh, P. , & Nelder, J. A. (1989). Generalized Linear Models. Chapman and Hall. [Google Scholar]
  78. McKenna, A. , Hanna, M. , Banks, E. , Sivachenko, A. , Cibulskis, K. , Kernytsky, A. , Garimella, K. , Altshuler, D. , Gabriel, S. , Daly, M. , & DePristo, M. A. (2010). The genome analysis toolkit: A MapReduce framework for analyzing next‐generation DNA sequencing data. Genome Research, 20, 1297–1303. 10.1101/gr.107524.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. McPherron, A. C. , Lawlert, A. M. , & Lee, S.‐J. (1997). Regulation of skeletal muscle mass in mice by a new TGF‐fl superfamily member. Nature, 387, 83–90. [DOI] [PubMed] [Google Scholar]
  80. Meissner, A. , Mikkelsen, T. S. , Gu, H. , Wernig, M. , Hanna, J. , Sivachenko, A. , Zhang, X. , Bernstein, B. E. , Nusbaum, C. , Jaffe, D. B. , Gnirke, A. , Jaenisch, R. , & Lander, E. S. (2008). Genome‐scale DNA methylation maps of pluripotent and differentiated cells. Nature, 454, 766–770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Merilä, J. , Sheldon, B. C. , & Kruuk, L. E. B. (2001). Explaining stasis: Microevolutionary studies in natural populations. In Microevolution Rate, Pattern, Process, Contemporary Issues in Genetics and Evolution (Vol. 8, pp. 199–222). Springer Science + Business Media. 10.1007/978-94-010-0585-2_13 [DOI] [PubMed] [Google Scholar]
  82. Meuwissen, T. , Hayes, B. , & Goddard, M. (2016). Genomic selection: A paradigm shift in animal breeding. Animal Frontiers, 6, 6–14. 10.2527/af.2016-0002 [DOI] [Google Scholar]
  83. Neuwirth . (2014). RColorBrewer: ColorBrewer palettes. R Package Version 1.1–2. https://CRAN.R‐project.org/package=RColorBrewer.
  84. Nussey, D. H. , Postma, E. , Gienapp, P. , & Visser, M. E. (2005). Selection on heritable phenotypic plasticity in a wild bird population. Science, 310, 304–306. [DOI] [PubMed] [Google Scholar]
  85. Oksanen, J. , Blanchet, F. G. , Friendly, M. , Kindt, R. , Legendre, P. , McGlinn, D. , Minchin, P. R. , O'Harra, R. B. , Simpson, G. L. , Solymos, P. , Stevens, H. H. , Szoecs, E. , & Wagner, H. (2019). Vegan: Community ecology package.
  86. Pacis, A. , Mailhot‐Léonard, F. , Tailleux, L. , Randolph, H. E. , Yotova, V. , Dumaine, A. , Grenier, J.‐C. , & Barreiro, L. B. (2019). Gene activation precedes DNA demethylation in response to infection in human dendritic cells. Proceedings of the National Academy of Sciences of the United States of America, 116, 6938–6943. 10.1073/pnas.1814700116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Pujol, B. , Blanchet, S. , Charmantier, A. , Danchin, E. , Facon, B. , Marrot, P. , Roux, F. , Scotti, I. , Teplitsky, C. , Thomson, C. E. , & Winney, I. (2018). The missing response to selection in the wild. Trends in Ecology & Evolution, 33, 337–346. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Purcell, S. , Neale, B. , Todd‐Brown, K. , Thomas, L. , Ferreira, M. A. R. , Bender, D. , Maller, J. , Sklar, P. , de Bakker, P. I. W. , Daly, M. J. , & Sham, P. C. (2007). PLINK: A tool set for whole‐genome association and population‐based linkage analyses. The American Journal of Human Genetics, 81, 559–575. 10.1086/519795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Quinlan, A. R. , & Hall, I. M. (2010). BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics, 26, 841–842. 10.1093/bioinformatics/btq033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. R Core Team . (2021). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R‐project.org/ [Google Scholar]
  91. Ramakers, J. J. C. , Gienapp, P. , & Visser, M. E. (2019). Phenological mismatch drives selection on elevation, but not on slope, of breeding time plasticity in a wild songbird. Evolution, 73, 175–187. 10.1111/evo.13660 [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Rizzoti, K. , Brunelli, S. , Carmignac, D. , Thomas, P. Q. , Robinson, I. C. , & Lovell‐Badge, R. (2004). SOX3 is required during the formation of the hypothalamo‐pituitary axis. Nature Genetics, 36, 247–255. 10.1038/ng1309 [DOI] [PubMed] [Google Scholar]
  93. Santure, A. W. , Gratten, J. , Mossman, J. A. , Sheldon, B. C. , & Slate, J. (2011). Characterisation of the transcriptome of a wild great tit Parus major population by next generation sequencing. BMC Genomics, 12, 283. 10.1186/1471-2164-12-283 [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Santure, A. W. , Poissant, J. , De Cauwer, I. , van Oers, K. , Robinson, M. R. , Quinn, J. L. , Groenen, M. A. M. , Visser, M. E. , Sheldon, B. C. , & Slate, J. (2015). Replicated analysis of the genetic architecture of quantitative traits in two wild great tit populations. Molecular Ecology, 24, 6148–6162. 10.1111/mec.13452 [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Schaper, S. V. , Dawson, A. , Sharp, P. J. , Gienapp, P. , Caro, S. P. , & Visser, M. E. (2012). Increasing temperature, not mean temperature, is a Cue for avian timing of reproduction. The American Naturalist, 179, E55–E69. 10.1086/663675 [DOI] [PubMed] [Google Scholar]
  96. Seisenberger, S. , Peat, J. R. , Hore, T. A. , Santos, F. , Dean, W. , & Reik, W. (2013). Reprogramming DNA methylation in the mammalian life cycle: Building and breaking epigenetic barriers. Philosophical Transactions of the Royal Society B, 368, 20110330. 10.1098/rstb.2011.0330 [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Sepers, B. , Chen, R. S. , Memelink, M. , Verhoeven, K. J. F. , & van Oers, K. (2023). Variation in DNA methylation in avian nestlings is largely determined by genetic effects. Molecular Biology and Evolution, 40, msad086. 10.1093/molbev/msad086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Sepers, B. , van den Heuvel, K. , Lindner, M. , Viitaniemi, H. , Husby, A. , & van Oers, K. (2019). Avian ecological epigenetics: Pitfalls and promises. Journal für Ornithologie, 160, 1183–1203. 10.1007/s10336-019-01684-5 [DOI] [Google Scholar]
  99. Shand, J. H. , West, D. W. , McCartney, R. J. , Noble, R. C. , & Speake, B. K. (1993). The esterification of cholesterol in the yolk sac membrane of the chick embryo. Lipids, 28, 621–625. 10.1007/BF02536056 [DOI] [PubMed] [Google Scholar]
  100. Shannon, P. (2003). Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Research, 13, 2498–2504. 10.1101/gr.1239303 [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Sheldon, B. C. , Kruuk, L. E. B. , & Merila, J. (2003). Natural selection and inheritance of breeding time and clutch size in the collared flycatcher. Evolution, 57, 406–420. 10.1111/j.0014-3820.2003.tb00274.x [DOI] [PubMed] [Google Scholar]
  102. Shimada, M. , & Yamashita, Y. (2011). The key signaling cascades in granulosa cells during follicular development and ovulation process. Journal of Mammalian Ova Research, 28, 25–31. 10.1274/jmor.28.25 [DOI] [Google Scholar]
  103. Skinner, M. K. , & Guerrero‐Bosagna, C. (2009). Environmental signals and transgenerational epigenetics. Epigenomics. Epigenomics, 1, 111–117. 10.2217/epi.09.11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Slate, J. , Visscher, P. M. , MacGregor, S. , Stevens, D. , Tate, M. L. , & Pemberton, J. M. (2002). A genome scan for quantitative trait loci in a wild population of Red Deer (Cervus elaphus). Genetics, 162, 1863–1873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Stevenson, T. J. , & Prendergast, B. J. (2013). Reversible DNA methylation regulates seasonal photoperiodic time measurement. Proceedings of the National Academy of Sciences of the United States of America, 110, 16651–16656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Szeliga, A. , Kunicki, M. , Maciejewska‐Jeske, M. , Rzewuska, N. , Kostrzak, A. , Meczekalski, B. , Bala, G. , Smolarczyk, R. , & Adashi, E. Y. (2021). The genetic backdrop of hypogonadotropic hypogonadism. International Journal of Molecular Sciences, 22, 13241. 10.3390/ijms222413241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Tarka, M. , Åkesson, M. , Beraldi, D. , Hernández‐Sánchez, J. , Hasselquist, D. , Bensch, S. , & Hansson, B. (2010). A strong quantitative trait locus for wing length on chromosome 2 in a wild population of great reed warblers. Proceedings of the Royal Society B, 277, 2361–2369. 10.1098/rspb.2010.0033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Trerotola, M. , Relli, V. , Simeone, P. , & Alberti, S. (2015). Epigenetic inheritance and the missing heritability. Human Genomics, 9, 17. 10.1186/s40246-015-0041-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. van Oers, K. , Sepers, B. , Sies, W. , Gawehns, F. , Verhoeven, K. J. F. , & Laine, V. N. (2020). Epigenetics of animal personality: DNA methylation cannot explain the heritability of exploratory behavior in a songbird. Integrative and Comparative Biology, 606, 1517–1530. 10.1093/icb/icaa138 [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Verhagen, I. , Gienapp, P. , Laine, V. N. , Grevenhof, E. M. , Mateman, A. C. , Oers, K. , & Visser, M. E. (2019). Genetic and phenotypic responses to genomic selection for timing of breeding in a wild songbird. Functional Ecology, 33, 1708–1721. 10.1111/1365-2435.13360 [DOI] [Google Scholar]
  111. Verhagen, I. , Laine, V. N. , Mateman, A. C. , Pijl, A. , de Wit, R. , van Lith, B. , Kamphuis, W. , Viitaniemi, H. M. , Williams, T. D. , Caro, S. P. , Meddle, S. L. , Gienapp, P. , van Oers, K. , & Visser, M. E. (2019). Fine‐tuning of seasonal timing of breeding is regulated downstream in the underlying neuro‐endocrine system in a small songbird. The Journal of Experimental Biology, 222, jeb202481. 10.1242/jeb.202481 [DOI] [PubMed] [Google Scholar]
  112. Verhagen, I. , Tomotani, B. M. , Gienapp, P. , & Visser, M. E. (2020). Temperature has a causal and plastic effect on timing of breeding in a small songbird. The Journal of Experimental Biology, 223, jeb218784. 10.1242/jeb.218784 [DOI] [PubMed] [Google Scholar]
  113. Viitaniemi, H. M. , Verhagen, I. , Visser, M. E. , Honkela, A. , van Oers, K. , & Husby, A. (2019). Seasonal variation in genome‐wide DNA methylation patterns and the onset of seasonal timing of reproduction in great tits. Genome Biology and Evolution, 11, 970–983. 10.1093/gbe/evz044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  114. Visser, M. E. , van Noordwijk, A. J. , Tinbergen, J. M. , & Lessells, C. M. (1998). Warmer springs lead to mistimed reproduction in great tits (Parus major). Proceedings of the Royal Society of London B, 265, 1867–1870. 10.1098/rspb.1998.0514 [DOI] [Google Scholar]
  115. Wang, S.‐H. , Lin, H.‐J. , Lin, Y.‐Y. , Chen, Y.‐J. , Pan, Y.‐H. , Tung, C.‐T. , Mersmann, H. J. , & Ding, S.‐T. (2017). Embryonic cholesterol esterification is regulated by a cyclic AMP‐dependent pathway in yolk sac membrane‐derived endodermal epithelial cells. PLoS One, 12, e0187560. 10.1371/journal.pone.0187560 [DOI] [PMC free article] [PubMed] [Google Scholar]
  116. Weiss, J. , Meeks, J. J. , Hurley, L. , Raverot, G. , Frassetto, A. , & Jameson, J. L. (2003). Sox3 is required for gonadal function, but not sex determination, in males and females. Molecular and Cellular Biology, 23, 8084–8091. 10.1128/MCB.23.22.8084-8091.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Wickham, H. (2016). ggplot2: Elegant graphics for data analysis. Springer‐Verlag. [Google Scholar]
  118. Wickham, H. (2019). Stringr: Simple, consistent wrappers for common string operations. R package version 1.4.0. https://CRAN.R‐project.org/package=stringr
  119. Wickham, H. , François, R. , Henry, L. , & Müller, K. (2020). Dplyr: A grammar of data manipulation. R Package Version 1.0.0. https://CRAN.R‐project.org/package=dplyr
  120. Wickham, H. , & Henry, L. (2020). Tidyr: Tidy messy data. R Package Version 1.1.0. https://CRAN.R‐project.org/package=tidyr
  121. Wilke, C. O. (2020). cowplot: Streamlined Plot Theme and Plot Annotations for “ggplot2.”. R package version 1.1.0. https://CRAN.R‐project.org/package=cowplot
  122. Williams, T. D. (2012). Physiological adaptations for breeding in birds. Princeton University Press. [Google Scholar]
  123. Wilson, A. J. , Réale, D. , Clements, M. N. , Morrissey, M. M. , Postma, E. , Walling, C. A. , Kruuk, L. E. B. , & Nussey, D. H. (2010). An ecologist's guide to the animal model. Journal of Animal Ecology, 79, 13–26. 10.1111/j.1365-2656.2009.01639.x [DOI] [PubMed] [Google Scholar]
  124. Wingett, S. W. , & Andrews, S. (2018). FastQ screen: A tool for multi‐genome mapping and quality control. F1000Research, 7, 1338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  125. Zhang, C.‐C. , Yuan, W.‐Y. , & Zhang, Q.‐F. (2012). RPL1, a gene involved in epigenetic processes regulates phenotypic plasticity in Rice. Molecular Plant, 5, 482–493. 10.1093/mp/ssr091 [DOI] [PubMed] [Google Scholar]
  126. Zhang, Y. , Baheti, S. , & Sun, Z. (2016). Statistical method evaluation for differentially methylated CpGs in base resolution next‐generation DNA sequencing data. Briefings in Bioinformatics, 19, 374–386. 10.1093/bib/bbw133 [DOI] [PubMed] [Google Scholar]
  127. Zhang, Z. , Zhang, L. , Wang, B. , Wei, R. , Wang, Y. , Wan, J. , Zhang, C. , Zhao, L. , Zhu, X. , Zhang, Y. , Chu, C. , Guo, Q. , Yin, X. , & Li, X. (2020). MiR‐337–3p suppresses proliferation of epithelial ovarian cancer by targeting PIK3CA and PIK3CB. Cancer Letters, 469, 54–67. 10.1016/j.canlet.2019.10.021 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data S1.

EVA-17-e13703-s001.zip (70.2MB, zip)

Data Availability Statement

WGBS data are available at the NCBI BioProject database (http://www.ncbi.nlm.nih.gov/bioproject/) under BioProject PRJNA208335 and accession numbers SRR15410225 to SRR15410232 and SRR21783914 to SRR21783973. Scripts for the bioinformatic pipelines and statistical analyses are published on GitHub (https://github.com/MLindner0/GenomicSelection_AvianLayDate_WGBS).


Articles from Evolutionary Applications are provided here courtesy of Wiley

RESOURCES