Abstract
Despite still being a matter of debate, there is growing evidence that pollutant-induced epigenetic changes can be propagated across generations. Whereas such modifications could have long-lasting effects on organisms and even on population, environmentally relevant data from long-term exposure combined with follow-up through multiple generations remain scarce for non-mammalian species. We performed a transgenerational experiment comprising four successive generations of zebrafish. Only fish from the first generation were exposed to an environmentally realistic concentration of cadmium (Cd). Using a whole methylome analysis, we first identified the DNA regions that were differentially methylated in response to Cd exposure and common to fish of the first two generations. Among them, we then focused our investigations on the exon 3 (ex3) of the cep19 gene. We indeed recorded transgenerational growth disorders in Cd-exposed fish, and a mutation in this exon is known to cause morbid obesity in mammals. Its methylation level was thus determined in zebrafish from all the four generations by means of a targeted and base resolution method. We observed a transgenerational inheritance of Cd-induced DNA methylation changes up to the fourth generation. However, these changes were closely associated with genetic variations, mainly a single nucleotide polymorphism. This single nucleotide polymorphism was itself at the origin of the creation or deletion of a methylation site and deeply impacted the methylation level of neighboring methylation sites. Cd-induced epigenetic changes were associated with different mRNA transcripts and an improved condition of Cd fish. Our results emphasize a tight relationship between genetic and epigenetic mechanisms and suggest that their interplay and pre-existing diversity can allow rapid adaptation to anthropogenic environmental changes.
Keywords: transgenerational growth disorders, DNA methylation, SNP, pollution, rapid adaptation
Introduction
The fact that stress-induced changes in epigenetic marks can not only have long-lasting effects on organisms but also on their offspring has prompted renewed and growing interest of biologists for epigenetics in the last decades [1, 2]. Especially, the fact that these changes can be inherited across several generations raises the possibility that epigenetic mutations could contribute not only to heritable phenotypic traits and transgenerational diseases but also to evolution by acting as precursors of genetic mutations [3, 4]. However, even if this mode of inheritance is well recognized in plants and despite the fact that examples of epigenetic inheritance in different taxa have multiplied in the last years, there is still a controversy as to whether transgenerational inheritance is widespread, particularly in vertebrates [5–7]. By controlling the activity of genes, epigenetic mechanisms play a crucial role in cell differentiation during development [1, 3, 8]. Cells differentiate, acquire and maintain identity by expressing a particular transcriptome [9]. Thus, the epigenome must be re-programmed at least to ensure totipotency after fertilization. In vertebrates, particularly in mammals, the epigenome does not persist throughout life, but rather undergoes precise and coordinated changes at precise stages of development. The two rounds of DNA methylation erasure that occurs in early embryo and germline leave little chance to the inheritance of stress-induced changes in epigenetic marks [6, 10]. Furthermore, another critical argument against epigenetic inheritance comes from the fact that genetic variations are rarely or even never taken into account in transgenerational epigenetic studies; i.e. some examples of transgenerational epigenetic inheritance could be explained by changes in the DNA sequence itself. Finally, true epigenetic inheritance can only be determined after several generations and reported examples of epigenetic inheritance in vertebrates often concern inter—rather than true transgenerational effects [2, 5, 6].
Most of the current knowledge relating to pollutant-induced epigenetic modifications comes from rodent models and experimental investigations during which exposure to pollutants was carried out during embryogenesis. Data from long-term exposure, which are environmentally more relevant, for non-mammalian species and for at least three consecutive generations remain scarce, although such modifications could have long-lasting effects on organisms and even on the evolution trajectories of populations [3, 5, 11–15]. In view of these considerations, recent studies showed that non-mammalian vertebrate species such as fish can be more prone to transgenerational inheritance of stress-induced DNA methylation marks [16, 17]. Although DNA methylation is present in different phyla, it is in vertebrates and plants that this epigenetic mechanism has the most prominent role in transcriptional regulation and the greatest potential to contribute to transgenerational inheritance [1].
In this context, the aims of the present study were to test whether (i) pollutant exposure can induce DNA methylation changes in zebrafish and (ii) such pollutant-induced changes can be transmitted to the unexposed offspring at least until the fourth generation. To test these hypotheses, one generation of zebrafish (F0) was exposed throughout its life, from fertilization until sexual maturity, to dissolved cadmium (Cd) at an environmentally relevant concentration of 1 µg.L−1 (see Pierron et al., 2021 [15]). Cd is a widespread nonessential metal with endocrine disrupting effects [15, 18]. Zebrafish were lifelong exposed to Cd to encompass the remodeling events of DNA methylation that occur in germline not only during the embryo–larval stages but also later in adult gonads at the onset of “puberty” [11, 16, 17, 19]. We first used a high-throughput pyrosequencing approach to identify without a priori the regions of DNA that were differentially methylated (called differentially methylated regions [DMRs]) in response to Cd exposure and common to fish of the F0 and F1 generation. Considering the Cd-induced transgenerational effects on zebrafish growth and condition, we then focused our investigations on a DMR identified in the ex3 of the gene encoding for the ciliary protein CEP19 [20, 21]. Its methylation level was determined in zebrafish from all the four generations by means of a targeted and base resolution method.
Material and Methods
General Rearing and Breeding Program
The experiment is presented in Fig. 1 and described in details in text S1. At the beginning of the experiment, 40 randomly mating pairs (one male and one female) of wild-type (domesticated) zebrafish were placed for one night in small breeding tanks; 20 tanks were filled with clean water (Control, C) and 20 tanks were filled with Cd-contaminated water (1 µg.L−1). Their offspring (F0 generation) were, respectively, reared in clean or Cd-contaminated water throughout their life. For DNA methylation and transcription analyses, C and Cd fish were sampled at 5 days post-fertilization (dpf) (larvae) and at 124 dpf (adult mature fish). Whole larvae and tissue samples of adults were fixed in RNAlater solution before being stored at −20°C. At 170 dpf, 20 pairs per condition were constituted to produce the next generation (F1). Mating was carried out in Cd-free water for both groups (C or Cd). Pairs were not randomly selected. For each condition, fish were reared in four different tanks. As all individuals from one clutch (i.e. from one pair) were raised in the same large tank, one male from one tank was mated with a female from another tank to limit inbreeding. With the exception of the fish that were used as genitors, all remaining individuals after mating were measured (total and standard (from snout to the base of the caudal fin, length) and weighted to determine the Fulton condition factor [K = (total weight (g))/(total length (cm))3) × 105] and investigate the length (L)–weight (W) relationship (LWR). After mating, genitors were conserved in one large tank under Cd-free condition where C and Cd fish were separated by a plastic net. Throughout the experiment, fish were fed three times daily. The food quantity delivered per aquarium was adjusted to fish density in order to not induce bias in growth. The same protocol as previously described was used for mating and sampling for the F1, F2 and F3 generations. However, for both groups of animals (C and Cd), spawning and rearing were carried out in Cd-free water. Thus, only fish of the F0 generation were directly exposed to dissolved Cd. The total duration of the experiment was 643 days.
Figure 1:

Synoptic of the experimental design. Only fish of the F0 generation were exposed to cadmium (Cd) throughout their life (0–169 dpf). At each generation, 20 mating pairs [one pair per breeding tank (2 L)] and 4 large tanks (75 L) were used per condition (control or Cd). Each large tank was filled with larvae from 3 to 5 different mating pairs. While random mating was used to produce the F0 generation, males from one tank were mated with females from another tank (closed diamond) to produce the generations F1, F2 and F3 in order to limit inbreeding
This work was conducted in accordance with the EU Directive 2010/63/EU for animal experiments and received approval for research ethics from the Aquitaine Fish-Birds Ethics Committee (APAFIS#7535-2016111009351504).
RNA/DNA Extraction
For each generation, a total of six biological replicates were used per condition. We used six larvae (5 dpf) from six different mating pairs and six adult males (124 dpf) and six adult females (124 dpf) from three different tanks in order to analyze individuals from different families. Only larval samples for the methylated DNA immunoprecipitation coupled with high-throughput pyrosequencing (MeDip-Seq) analysis were differently treated. Indeed, the amount of DNA required for MeDip-Seq compelled us to pool the DNA from two larvae (n = 6 pools per condition for a total of 12 larvae per condition). For all other samples and all other analyses (Bisulfite sequencing and quantitative reverse transcription PCR (RT-qPCR)), no pool was used. Total RNA and DNA were purified from whole larvae or liver, brain and gonads of adults (males and females) using the AllPrep DNA/RNA kit (Qiagen) as described in Pierron et al. (2019) [22].
Identification of Intergenerational Cd-induced DMRs
Libraries Preparation and Sequencing
The MeDip-Seq approach was carried out on DNA samples from larvae and adult zebrafish of the F0 and F1 generation, for a total of 168 samples. Libraries were prepared according to the Bioo Scientific’s protocol using the Bioo Scientific NEXTflex™ Methyl-seq Library Prep Kit for Illumina Sequencing. Briefly, DNA was fragmented by sonication on a Covaris M220 (400–500 bp), size selection was then performed using AMPure XP beads and adapters were ligated. Libraries were then immunoprecipitated using the MagMeDIP Kit (Diagenode) according to the manufacturer’s instructions. Finally, 12 cycles of PCR were performed. Library quality was assessed using an Advanced Analytical Fragment Analyzer, and libraries were quantified by qPCR using the Kapa Library Quantification Kit. Following quantification, libraries were randomly pooled in equal amounts and sequenced on two lanes of an Illumina NovaSeq 6000 using a paired-end read length of 2 × 150 bp using the Illumina NovaSeq Reagent Kits. Sequencing was performed at the GeT-PlaGe core facility, INRAE Toulouse.
Data Preprocessing and Analysis
Sequencing files were quality checked using fastp (version 0.19.5). Filtered sequences were then mapped to the Danio rerio reference genome Genome Reference Consortium Zebrafish 11 (GRCz11) using the mem algorithm of the BWA software (version v 0.7.17) with default settings. The output files (*.sam) were then combined and converted to bam format (*.bam) by Samtools (version 1.4). Output files were grouped in 28 sets corresponding to the developmental stage, sex, generation (F0 or F1), organs studied and exposure. Analysis was carried out using the MEDIPS package (version 1.46.0) in R (version 3.5.1), following the MEDIPS R script. CpG density-dependent normalization was performed on the mapping files for each set. No set was considered as a reference. The parameters specified in this analysis were as follows: ‘extend’ set to 300, ‘shift’ set to 0, ‘window size’ set to 100 and ‘uniq’ set to 1. Differentially methylated windows of 100 bp were identified between sets using the ‘edgeR’ plug-in function. For all comparisons, genomic windows that were covered by a minimal count threshold of 10 reads were kept for further statistical analysis. Statistically significant results were selected using criteria for the adjusted P ≤ 0.01. Contiguous significant windows were merged into one DMR. Based on the biological questions, the DMRs were compared between the relevant sets. Since the final merging step can produce different DMRs between each pair of set, to enable pairwise comparisons, DMRs have been split in shared regions of size 300 or more. All DMRs having a positive or a negative fold change were retained.
Analysis of Single Base Methylation Levels and SNP in cep19
Genomic sequences were obtained from the GRCz11. Specific primers used for PCR amplification and bisulfite sequencing were designed in the promoter—ex1 region and ex3 of the cep19 gene by means of the PyroMark Assay Design Software (Table S1).
Quantification of single cytosine percent methylation at specific sites was performed as previously described in Pierron et al. (2019) by bisulfite pyrosequencing using the PyroMark Q48 Autoprep Instrument (Qiagen). SNP detection was carried out during the run of bisulfite pyrosequencing by modifying the sequence analyzed (AQ mode, Fig. S1). In order to assess the conversion efficiency of the bisulfite treatment, the average methylation level of one CpA site was measured in ex3. The methylation level of this control site was <7%, indicating very good conversion efficiency. The methylation level of 10 CpG sites was quantified in the ex3. The mean methylation level of these 10 CpG sites was used for statistical analyses. Examples of pyrograms are provided in Fig. S1. To confirm our results, the same protocol, with different primers (Table S1), was used to sequence the ex3 by using native, untreated, and genomic DNA as template (Fig. S2).
Cloning and Sequencing of Genetic/Transcript Variants
In order to identify potential transcript variants, several primers overlapping different exons were designed using the Primer3Plus software (Table S1 [23]). Additional variants of the cep19 mRNA were identified using 3ʹRACE. First-strand cDNA was synthesized using the GoScript™ Reverse Transcription System (Promega) and an anchored oligo-dT (480 nM final, Table S1). A touch-down strategy was used as follows: 94°C for 15 min; 12 cycles of 94°C for 30 s, 64.8–60.4 for 30 s and 72°C for 3 min; 35 cycles of 94°C for 30s, 60°C for 30 s and 72°C for 3 min and final extension at 72°C for 10 min using specific forward primers and a reverse primer specific to the anchored oligo-dT primer (Table S1). Products were separated and visualized by gel electrophoresis or by means of a Qiaxcell instrument using the DNA high-resolution kit (Qiagen). Amplified products of interest were cloned into pGEM-T Easy vector according to the manufacturer’s instructions (Promega) and were sequenced (Sanger, GATC Biotech).
Quantitative Analysis of Transcription Levels
RT-qPCR analyses were carried out as previously described in Pierron et al. (2019) [22] with few modifications. First-strand cDNA was synthesized from 250 ng of total RNA from individual whole larvae and was diluted 20-fold. Relative quantification of gene transcription was achieved by concurrent amplification of the eef1a1a and rpl13a endogenous controls [24]. As the same quantity of total RNA was used for each sample, the output cycle of the endogenous controls was statistically examined. No significant difference was observed among conditions and generations, demonstrating the relevance of the two genes as reference in our conditions. As the Ct value of the eef1a1a gene showed the lowest variability among samples, this gene was finally used as reference. Amplification efficiencies for all primer sets were calculated and used to determine transcription levels of targets. Primers used are available in Table S1.
Statistical Analyses
Comparisons among fish groups were performed by two-ways analysis of variance, after checking assumptions of normality (Kolmogorov–Smirnov) and homoscedasticity of the error terms (Levene). If significant effects were detected, the least significant difference test was used to determine whether means between pairs of samples were significantly different from one another. When the assumptions were not met as deduced from ad-hoc tests, we used Box-Cox data transformations or the nonparametric Kruskal–Wallis test. Using Kruskal–Wallis, the post-hoc Conover-Iman test was used. Comparisons of proportions were performed using the chi-square test and the Monte Carlo method using 5000 permutations followed by the Marascuilo procedure to compare proportions among the different conditions. Comparisons of allelic frequencies between groups were carried out by means of the homogeneity chi-square test. Comparisons of slopes were realized using a Student’s t-test followed by the Bonferroni correction. Computations were performed using STATISTICA version 6.1 software (StatSoft) and XLSTAT (Addinsoft version 2020.1.1). Numerical results are provided as mean ± SE.
Results and Discussion
Direct and Transgenerational Effect of Cd on Fish Growth
Cd exposure triggered an increase in fish growth in F0 generation. At 170 dpf, fish directly exposed to Cd were significantly heavier (Fig. 2A) and longer (Fig. 2B) than control specimens. However, they presented a Fulton condition factor that was significantly lower in comparison to controls (Fig. 2C). Fulton is an index frequently used in ecology to assess the relative plumpness (i.e. energy reserves) of fish and corresponds to a weight-to-length ratio comparable to the body mass index in humans. After mating (170 dpf), C and Cd genitors were placed in one large tank under Cd-free condition where fish from the two conditions were separated by a plastic net. In the case of Cd fish, the weight gain strongly increased after 170 dpf at a rate that was almost exponential (Fig. 2D). However, as it can be observed for controls, the growth rate of zebrafish normally slows down with age [25]. At 400 dpf, Cd fish were almost obese (Fig. 2E) and were significantly 1.7-fold heavier than controls. At the opposite, in the subsequent generations (F1–F3), fish ancestrally exposed to Cd presented a body weight and length that were significantly lower than controls (Fig. 2A and B). However, from the generation F2, they presented a Fulton condition factor that was equivalent to controls (Fig. 2C). This highlights that Cd triggered transgenerational growth disorders. Whereas Cd exposure at the F0 generation initially boosted the growth of fish, the growth of their descendants (up to the fourth generation) was significantly lower in comparison to controls. However, their condition (i.e. Fulton) reached that of the controls starting from the third generation.
Figure 2:

Cadmium exposure induced transgenerational growth disorders. Change across generations (F0–F3) in the mean (±SE, n = 177–582) (A) body weight, (B) total length and (C) Fulton condition factor of control (C, blue) or Cd-exposed (Cd, red) fish sampled at 170 dpf. Means designated by different letters are significantly different (Conover-Iman, P < 0.05). (D) Change in the weight of fish of the F0 generation according to their age. Cd exposure stopped at 170 dpf. Asterisk denotes a significant effect of Cd at each sampling time (Mann–Whitney or Student’s t test, P < 0.05, n = 20–190). (E) Males and females of the F0 generation at 400 dpf
Cd Exposure Triggered Intergenerational Changes in the Methylation Level of cep19
In order to identify without a priori the DNA regions that could be differentially methylated in response to Cd exposure and to maximize the number of samples analyzed, we first used a MeDip-Seq approach on both larvae and adult zebrafish of the F0 and F1 generations. Only few DMRs were common to the two generations, and only 13 Cd-induced DMRs were common between larvae of the two generations (Table S2). Eleven of them were localized in non-coding regions, including intergenic and intronic regions. Only one DMR was found purely exonic. This DMR that was localized on the ex3 of the gene encoding for the ciliary protein CEP19 retained our attention. Indeed, a homozygous nonsense mutation in this exon causes morbid obesity in humans and mice [20, 21]. As Cd exposure was associated with transgenerational growth disorders that were initially marked by an increased growth leading to signs of obesity in fish of the F0 generation, we then decided to investigate the methylation level of this gene throughout the four generations by means of a targeted and base resolution method, i.e. bisulfite pyrosequencing.
Transgenerational Changes in the Methylation Level of the ex3 of cep19 Was Associated with a SNP
Significant differences in the methylation level of the ex3 of the cep19 gene were observed between C and Cd larvae of the F1 and F3 generations (Fig. 3A). Methylation levels were significantly higher in Cd larvae in comparison to C larvae. Generally, whatever the generation, higher methylation levels were observed in Cd fish in comparison to controls. However, high variations were observed among individuals, and methylation levels ranged from 4.3% to 90.4%. No effect was observed on this region. We also identified an SNP (position 2941538 in NC_007133.7, GRCz11) during sequencing runs, where the A base that is referenced in the RefSeq genome (GRCz11) was replaced by a G base. Interestingly, the presence of the alternative G base is at the origin of an additional site of methylation as this SNP is preceded by a C (CA→CG), methylated cytosines being primarily found at CpG dinucleotides [26]. Presence of this SNP deeply impacted the methylation level of neighboring CpG sites at both the larval and adult stages. Combining data from all analyzed individuals (larvae and adults of the four generations), the mean methylation level of the ex3 reached 7.08% ± 0.41% (mean ± SE, n = 84) in fish that were homozygous for the A base (A/A genotype), 41.68% ± 0.64% (mean ± SE, n = 47) in heterozygous A/G individuals and 83.48% ± 1.86% (mean ± SE, n = 13) in homozygous G/G animals (Fig. 3B; raw data for the 10 CpG sites analyzed are available in Fig. S3). Considering that the detection limit of the instrument is of 5%, methylation level in A/A individuals can be considered as negligible. Thus, our results show that CpG sites of the A allele are completely demethylated, whereas CpG sites of the G allele are hypermethylated (Fig. 3C). In addition, as it can be observed in Fig. 3B, if highly significant changes in DNA methylation levels were observed among fish genotypes, no significant difference was observed between C and Cd fish of each genotype. These results highlighted a very strong relationship between the observed SNP and the methylation level of the exon. As promoter DNA methylation is an important regulatory component of vertebrate gene expression, we also measured the methylation level of the promoter region of cep19. No significant variation was observed in this region (Fig. S4). It is worth noting that the base change in the ex3 does not induce change in the protein sequence as the two different codons encode for the same amino acid. We then used the PROMO software (version 3.02, dissimilarity rate 5% [27, 28]) in order to identify potential binding sites for transcription factors (TFs) in the coding region of the ex3. Indeed, despite the widespread assumption that such binding in coding regions is non-functional, there are increasing evidences that coding exons are commonly bound by enhancer-related and histone modifying TFs and that many genes, mainly developmental genes, contain enhancers in coding regions that drive tissue-specific gene expression [29]. In addition, many studies have reported changes in gene expression in relation to synonymous SNPs in coding regions [30]. Finally, the chromatin structure of the genome is defined by a tight and dynamic interplay between histone modifications and DNA methylation [31]. In our case, the A→G transition triggered the loss of the binding site for the glucocorticoid receptor (GR, also known as NR3C1). Interestingly, several papers have reported that binding of GR leads to the demethylation of its target genes [32–34]. Emerging evidences also suggest that TFs could hinder the methylation of their bound loci during global re-methylation that arises during embryogenesis [35]. In our case, the ex3 was effectively demethylated in the presence of the GR binding site (A allele). GR mediates the diverse physiologic effects of glucocorticoids (GCs). GCs of which cortisol is the best known are steroid hormones that are important for maintenance of basal and stress-related homeostasis with significant effects on growth and weight maintenance [36, 37]. Indeed, certain cases of obesity in humans have clear features of hypercortisolism [38, 39]. Moreover, Cd exposure during pregnancy in mammals is well known to reduce birth weight, an effect mediated by an increase in maternal and fetal GCs [40]. In zebrafish, Wilson et al. (2016) [36] investigated whether embryo–larval GC exposure (0–5 dpf) affects adult zebrafish phenotypes. They experimentally induced an excess (by pharmacological treatment) or inversely a reduced (using an antisense GR morpholino) GC activity in zebrafish from 0 to 5 dpf. These treatments triggered, respectively, an increase and a decrease in the Fulton condition factor of fish at the adult stage (84 dpf).
Figure 3:

Change in the methylation level of the exon 3 of cep19 was tightly associated with an SNP. (A) Change in the methylation level of cep19 (mean ± SE, n = 6) in control (C) or Cd-exposed (Cd) larvae (5 dpf) across generations (F0–F3). (B) Methylation level of cep19 (Bars, mean ± SE, n = 4–52) and distribution of animals (Curves, % of fish of each genotype) according to the genotype (A/A, A/G or G/G) and experimental condition (Control, blue; and Cadmium, red). All analyzed animals were used, i.e. 6 larvae, 6 males and 6 females per generation and condition, for a total number of 72 individuals per condition. For adults, data from gonad samples were used. Values with different letters [uppercase for bars (Conover-Iman) and lowercase for curves (Marascuilo procedure)] are significantly different (P < 0.05). (C) Schematic representation of the methylation pattern of the two alleles
This tight association between genetic and epigenetic changes can explain the high variations observed in DNA methylation levels (Fig. 3A). Each group was indeed composed of fish of different genotypes, i.e. of fish presenting differing levels of methylation. We then investigated the proportion of fish of each genotype per condition (Fig. 3B; i.e. for each condition, the number of fish of each genotype divided per the total number of fish analyzed × 100). The great majority of C fish (72.2%) presented the A/A genotype. In contrast, Cd fish were equally comprised by fish of A/A and A/G genotypes (45.1 and 43.7%, respectively). Moreover, the proportion of fish presenting the A/A genotype was significantly lower in the Cd in comparison to the C condition. The proportion of homozygous G/G individuals remained quite low, reaching 5.6% and 12.7% for C and Cd, respectively (for the distribution according to sex, stage of development and conditions see Fig. S5).
In order to get more insight into the effect of Cd on the genotype of fish, we then calculated the frequency of the G allele according to the condition (C or Cd), stage of development and generation (Fig. 4A). Despite the low number of biological replicates, some significant differences were observed. In accordance to the differences observed at the methylation level, the G allele frequency was significantly higher in the Cd larvae of the F1 and F3 generation in comparison to controls. In addition, considering all individuals (both larvae and adults, n = 18), the frequency of the G allele was significantly higher at the fourth generation (F3) in Cd fish in comparison to controls. Thus, after three generations, Cd and C fish were genetically distinct populations for cep19. We also noticed a significant difference between Cd adults of the F0 generation and Cd larvae of the F1 generation, i.e. between parents and their offspring. In only one generation, this frequency significantly increased from 0.21 in F0 adults to 0.58 in F1 larvae. Such a result suggests a deviation from the Hardy–Weinberg equilibrium and thus an effect of selection and/or genetic drift. Genetic drift is commonly associated with loss of genetic polymorphism [41]. However, in our case, the percentage of heterozygous individuals increased between the two generations in response to Cd. We must add that a particular effort was made to limit inbreeding. The F1 generation was generated by using a total of 16 (considering that some pairs did not spawn) different mating pairs, these pairs being composed by a male and a female from different families. Concerning selection, no difference was observed in terms of fecundity or reproductive success between Cd and C fish at each generation (the fecundity was however almost significantly lower in Cd fish in comparison to controls at the F1 generation (P = 0.082); Fig. S6). Moreover, no significant difference was observed in mortality rates at each generation between Cd and C animals, both at the embryo–larval and adult stage. In general, the highest mortalities were observed during early development (0–14 dpf, Fig. S6). Observed mortality did not exceed 1% at the juvenile-adult stage, but cannibalism was observed until 60 dpf (personal observation). However, we must note that an equal magnitude of mortality in the two conditions does not exclude an effect of selection and selection can induce temporally (i.e. according to the stage of development) dynamic genotype-dependent mortalities [42]. Whatever the mechanisms involved, Cd favored the rapid increase (in only one generation) of the hypermethylated G allele in the experimental population. Considering all animals analyzed by condition (larvae and adults of the four generations, n = 72), the frequency of the G allele was significantly higher with a 2-fold increase (Homogeneity chi-square test, X2 = 1147 with a X2 0.05 = 5.99) in the Cd (0.34) in comparison to the control population (0.17).
Figure 4:

Frequency of the G allele was affected by Cd and fish density. (A) Change in the frequency of the G allele (control, blue or Cd, red) at each life stage [larvae (5 dpf) or adult (124 dpf) or both] across generations (F0–F3). Asterisk denotes a significant effect of Cd exposure for a given life stage and generation; Hashtag indicates a significant variation between parents (adults of the generation F) and their offspring (larvae of the generation F+1). Homogeneity chi-square test (P < 0.05). (B) Adult fish (124 dpf) were grouped into three classes (low, medium and high) according to the density (number of fish per liter) of the aquarium in which they were reared by using the 33.33 and 66.66 percentiles. Bars refer to the mean frequency of the G allele (±SE, n = 32 fish; 8 tanks with 4 fish per aquarium) and the curve to the mean fish density (±SE) determined for each density class. Means with different letters [uppercase for bars (Homogeneity chi-square test) and lowercase for the curve (LSD)] are significantly different (P < 0.05)
Concerning controls, despite no significant variation in the G allele frequency was observed, some fluctuations were observed across generations. The highest frequency (0.33) was observed in adults of the F2 generation. Such a high frequency could explain, at least in part, why no effect of Cd was observed at this generation. The only factor that was not similar among generations was fish density. The density of fish was significantly 2-fold higher in fish of the F2 generation in comparison to the other generations (see Pierron et al., 2021 for details [15]). As presented in Fig. 4B, a significant increase in fish density was associated with a significant increase in the G allele frequency. This suggests that, in addition to Cd, fish density can also affect, to a lesser extent, the proportion of fish presenting the G allele. However, in the present case, high densities did not lead to change in the frequency of the G allele in the subsequent generation (F3).
Impacts of Epigenetic Changes Induced by Cd on cep19 Transcripts
As methylation of exons can induce alternative splicing events [43], we initially amplified by PCR two regions overlapping the different exons of the cep19 gene (that comprises four exons) using cDNA as template (Fig. S7). If no difference was observed among genotypes for the upstream part (ex1–3) of the mRNA, amplification of the ex3–4 portion was unsuccessful for homozygous A/A individuals (i.e. for the demethylated “version”). We then amplified the ex3–ex4 using genomic DNA as template. Amplicons of different sizes were obtained for A/A and G/G individuals, while heterozygous animals presented the two products (Fig. 5A). After sequencing, we found that the intron 3 (in3) was totally absent in G/G individuals. To confirm our results, all individuals (n = 144) were tested and PCR products were visualized by capillary electrophoresis. This analysis confirmed that the A allele was systematically associated with the in3. On the contrary, when the G base was present, the in3 was absent. We also identified two additional SNPs in the non-coding part (i.e. 3ʹ-UTR) of the ex3 (position 2941886 and 2941888, CGCA→TGTA). Finally, we identified an indel in the 5ʹUTR region (ex1, Fig. S8). Note that this indel did not induce significant methylation changes in the promoter-ex1 region as no significant difference in methylation levels was observed among genotypes (Fig. S8). With the exception of intron loss, all these changes were already described as they are present in the mRNA sequence of cep19 (sequence NM_001033734.1). Indeed, the RefSeq transcript (NM_001033734.1) has four substitutions, one non-frameshifting indel compared to the RefSeq genomic sequence (GRCz11, NC_007133.7, position 2 937 485-2 942 954). To summarize, at the genomic level, individuals with the G/G genotype presented the exonic sequences of the RefSeq transcript and lacked in3. Results are summarized in Fig. 5B. In addition, the two alleles encode for different transcripts. When the in3 is present (A allele), the ex4 is not transcribed. Through 3ʹRace on cDNA, we found that the transcript encoded by the A allele retains a portion of the in3 and ends with a region rich in CpG sites (end of the cDNA: base position 2 942 037 in GRCz11, sequence CGCTCGCGGCATAACAGA). Remarkably, all genetic variations identified between the two alleles involved CpG sites. In this view, we must note that the in3 is fully constituted by a transposable element (TE) called Harbinger-5N1 (also known as DNA-1-4_DR). Generally, TEs are fully methylated to prevent their activation [22]. Accumulating evidences suggest that TEs can act as “methylation centers” after insertion into host DNA, whereby methylation subsequently spreads into the flanking DNA [4]. However, in our case, the presence of the TE was associated with the demethylation of the ex3, reinforcing the role of the SNP (A/G) in the methylation state of the exon.
Figure 5:

The two alleles encode for two different mRNA transcripts. (A) The alternative G allele lacks intron 3. PCR were realized with primers overlapping the exon 3 and 4 using genomic DNA (gDNA) or complementary DNA (cDNA) as templates (see B for details). (B) Schematic representation of the two different alleles and their relative transcripts. Light gray boxes correspond to coding regions and dark gray boxes to UTR regions, and introns are schematized by a black line. Brown boxes correspond to the regions amplified by PCR or rt-qPCR. Methylation levels are indicated in green. Vertical thick red line segment, SNP; vertical thick red line, indel. (C) Mean (±SE, n = 4–26) transcription levels of the different targeted regions of the cep19 gene in larvae according to their genotype. Means designated by different letters are significantly different (Conover-Iman, P < 0.05)
We then analyzed the transcription levels of three different portions of the cep19 mRNA corresponding to the ex1, ex3–in3 and ex3–ex4 in the larvae (Fig. 5C). As expected from our previous analyses, the amplicon ex3–in3 was not detected in G/G individuals while the amplicon ex3–ex4 was not detected in A/A individuals. We also found that the transcription level of ex3–ex4 was significantly lower in comparison to the transcription level of ex3–in3 in heterozygous animals. This is in accordance with the consensus view that DNA methylation leads to “gene silencing,” i.e. transcriptional repression [1]. In this sense, the methylation level of ex1 was significantly lower in fish with a G/G genotype in comparison to other genotypes. Moreover, contrary to A/A individuals, for which no difference was observed between the transcription level of ex1 and ex3–in3, the transcription level of ex3–ex4 was significantly lower than that of ex1 in G/G individuals. This suggests that the methylation of ex3 is not only responsible for the downregulation of cep19 but also for some copies, for a precocious arrest in mRNA transcription.
Impacts of Cd-induced Epigenetic Changes on Fish Growth
In order to investigate a potential link between Cd-induced epigenetic changes and fish growth, we then calculated the Fulton condition factor of adult fish (124 dpf) according to their genotype and condition. Whereas Cd fish presenting a A/A genotype had a Fulton that was significantly lower than controls, heterozygous Cd fish showed a Fulton that did not differ from C fish (Fig. 6A). In order to deepen and confirm these results, the same analyses were carried out on juvenile fish (35 dpf) of the F0 generation, i.e. in fish for which the weight or condition is not influenced by sex and gonadal weight. The same results were obtained. Heterozygous Cd fish presented a Fulton that did not differ from controls, whereas A/A Cd fish presented a condition that was significantly lower than controls (Fig. 6B). In order to gain additional insight, we then investigated the LWR in adults (Fig. 6C). In fish, the LWR is expressed by using the equation: W = aLb, where b is a growth coefficient used to assess the growth trajectory of fish, indicating whether fish body length increases at the same relative rate as weight. Using the linear regression of the log-transformed equation:
, the growth coefficient b represents the slope of the relationship. A positive allometric growth (i.e. short and stout) is observed when b is >3.1, and b < 2.9 is associated with a negative allometric growth (i.e. long and thin) [44]. This coefficient was inferior to three only in the case of Cd A/A fish. Moreover, this coefficient was significantly lower in homozygous A/A Cd fish in comparison to A/A control or to heterozygous A/G Cd fish (data from homozygous G/G fish were not taken into account, see Fig. 6C). Thus, heterozygosity seems to allow Cd fish to reach a growth pattern similar to controls by increasing fish weight gain (rather than decreasing growth in length).
Figure 6:

Heterozygosis was associated with an improved condition in Cd-exposed fish. (A and B) Change in the Fulton condition factor of control (C, blue) or Cd-exposed (Cd, red) fish according to their genotype (A/A, A/G and G/G) for (A) adult fish (124 dpf) of the four generations or (B) juvenile fish (35 dpf) of the F0 generation. Number of fish per condition is indicated above the bars. Means designated by different letters are significantly different (Conover-Iman and LSD tests, P < 0.05). (C) Length–weight relationship for adult fish (124 dpf) according to the condition and genotype. The Pearson coefficients (r2) and the growth coefficients (b) obtained after log transformation of the data are given in the right panel. Significant correlations are underlined (P < 0.001). The growth coefficients indicated by different letters are significantly different (Student’s t-test with Bonferroni correction, P < 0.01). Note that data from homozygous G/G animals were not statistically exploited due to the low number of replicates (see Fig. A) and the absence of significant correlations
Our results strongly support an adaptive role of the epigenetic variations and associated changes in relative transcript variants abundance in response to Cd exposure. In Cd fish, heterozygosis was associated with an improved condition. Whereas Cd fish of the F0 and F1 generation presented a condition that was significantly lower than controls (Fig. 2C), epigenetic changes induced by Cd seem to participate to the rapid recovery of the condition of fish, mainly by increasing their weight gain. In other words, the increase in the frequency of the G allele and concomitantly in the number of heterozygous animals allowed the Cd population to reach a condition similar to the control one after only two generations.
Conclusion
Our results highlight a complex and tight interplay between genetic and epigenetic mechanisms. Mainly, we found that change in one base (SNP) can strongly influence the methylation levels of flanking CpG sites, potentially by affecting the sensitivity of the DNA sequence to TFs. We also found that this CpG-SNP (A/G) was accompanied by several other genetic changes throughout the cep19 sequence. All these variations were found in the RefSeq transcript. Thus, the SNP was not just a point mutation but rather corresponded to the presence of two different alleles. Cd exposure significantly favored the hypermethylated G allele. In this view, we must underline that the RefSeq transcript record has not yet been subject to final NCBI review. Whereas the RefSeq transcript sequence was derived from the ancient mRNA sequence (BC097138.1) published in 2006, the RefSeq genomic sequence (GRCz11) was published in 2018. This suggests that the emergence of the in3 results from a TE insertion that has arisen recently during zebrafish domestication and/or that the domestication has favored the hypomethylated A allele. This appears to be consistent with the fact that (i) we do not observe transposon footprints in the G allele (i.e. in most cases, when a TE is excised it leaves small rearrangements [45]) and (ii) a loss of genetic polymorphism and a severe reduction in heterozygosity was observed in domesticated zebrafish in the last years [46]. In our case, Cd exposure thus led to an increase in the frequency of a more ancient allele leading to an increased heterozygosity in the experimental population. This “reversion” was associated with an improved condition across generations of fish ancestrally exposed to Cd, suggesting an adaptive process in response to Cd exposure. As the ancient alternative G allele pre-existed in the experimental zebrafish population (at low frequency, reaching 0.17 in controls), our results suggest that standing epigenetic variation fueled the rapid adaptation of the population to Cd [42, 47]. Considering the rapidity and the magnitude of this increase (particularly from the F0 to the F1 generation), the underlying mechanisms require further investigations. Our work emphasizes the role of the interplay between genetics and epigenetics as well as the exploitation of their standing variation [42, 47] in the rapid transgenerational adaptation to anthropogenic environmental stresses.
Supplementary Material
Acknowledgements
Christian Portier, Kévin Lebaillif and Jacques Martin are thanked for their technical help with the experimental setup.
Contributor Information
Fabien Pierron, University of Bordeaux, CNRS, Bordeaux INP, EPOC, UMR 5805, Pessac 33600, France.
Débora Heroin, University of Bordeaux, CNRS, Bordeaux INP, EPOC, UMR 5805, Pessac 33600, France.
Guillemine Daffe, University of Bordeaux, CNRS, INRAE, La Rochelle University, UMS 2567 POREA, Pessac 33615, France.
Flore Daramy, University of Bordeaux, CNRS, Bordeaux INP, EPOC, UMR 5805, Pessac 33600, France.
Aurélien Barré, University of Bordeaux, Bordeaux Bioinformatics Center, Bordeaux, 33076, France.
Olivier Bouchez, INRAE, US 1426, GeT-PlaGe, Genotoul, Castanet-Tolosan 31326, France.
Alicia Romero-Ramirez, University of Bordeaux, CNRS, Bordeaux INP, EPOC, UMR 5805, Pessac 33600, France.
Patrice Gonzalez, University of Bordeaux, CNRS, Bordeaux INP, EPOC, UMR 5805, Pessac 33600, France.
Macha Nikolski, University of Bordeaux, Bordeaux Bioinformatics Center, Bordeaux, 33076, France; University of Bordeaux, CNRS, IBGC, UMR 5095, Bordeaux 33077, France.
Supplementary data
Supplementary data are available at EnvEpig online.
Data availability
Raw sequencing data were deposited in the European Nucleotide Archive with accession number PRJEB52137. All other data are included in the article and/or supporting information.
Conflict of interest statement
None declared.
Funding
This work was supported by the Agence Nationale de la Recherche of France (ANR JCJC TRACE ANR-16-CE34-0008). This Work was supported by the France Génomique National infrastructure, funded as part of “Investissement d’avenir” program managed by the Agence Nationale pour la Recherche (contrat ANR-10-INBS-09).
Author Contribution
F.P. conceived the study. F.P. and D.H. performed the experiments. F.P., G.D., F.D., A.B., O.B. and A.R.R. acquired and analyzed the data. F.P. wrote the paper, and P.G. and M.N. edited and approved the manuscript.
References
- 1. Feil R, Fraga MF. Epigenetics and the environment: emerging patterns and implications. Nat Rev Genet 2012;13:97–109. [DOI] [PubMed] [Google Scholar]
- 2. Allis C, Jenuwein T. The molecular hallmarks of epigenetic control. Nat Rev Genet 2016;17:487–500. [DOI] [PubMed] [Google Scholar]
- 3. Sarkies P. Molecular mechanisms of epigenetic inheritance: possible evolutionary implications. Semin Cell Dev Biol 2020;97:106–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Zhou W, Liang G, Molloy PL. et al. DNA methylation enables transposable element-driven genome expansion. Proc Natl Acad Sci USA 2020;117:19359–66. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Fitz-James MH, Cavalli G. Molecular mechanisms of transgenerational epigenetic inheritance. Nat Rev Genet 2022;23:325–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Heard E, Martienssen RA. Transgenerational epigenetic inheritance: myths and mechanisms. Cell 2014;157:95–109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Horsthemke B. A critical view on transgenerational epigenetic inheritance in humans. Nat Commun 2018;9:2973. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Boland MJ, Nazor KL, Loring JF. Epigenetic regulation of pluripotency and differentiation. Circ Res 2014;115:311–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Piferrer F. Epigenetics of sex determination and gonadogenesis. Dev Dyn 2013;242:360–70. [DOI] [PubMed] [Google Scholar]
- 10. Zeng Y, Chen T. DNA methylation reprogramming during mammalian development. Genes (Basel) 2019;10:257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Head JA. Patterns of DNA methylation in animals: an ecotoxicological perspective. Integr Comp Biol 2014;54:77–86. [DOI] [PubMed] [Google Scholar]
- 12. Skinner MK. Environmental stress and epigenetic transgenerational inheritance. BMC Med 2014;12:153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Rey O, Danchin E, Mirouze M. et al. Adaptation to global change: a transposable element-epigenetics perspective. Trends Ecol Evol 2016;31:514–26. [DOI] [PubMed] [Google Scholar]
- 14. Šrut M. Ecotoxicological epigenetics in invertebrates: emerging tool for the evaluation of present and past pollution burden. Chemosphere 2021;282:131026. [DOI] [PubMed] [Google Scholar]
- 15. Pierron F, Lorioux S, Heroin D. et al. Transgenerational epigenetic sex determination: environment experienced by female fish affects offspring sex ratio. Environ Pollut 2021;277:116864. [DOI] [PubMed] [Google Scholar]
- 16. Ortega-Recalde O, Day RC, Gemmell NJ. et al. Zebrafish preserve global germline DNA methylation while sex-linked rDNA is amplified and demethylated during feminisation. Nat Commun 2019;10:3053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Skvortsova K, Tarbashevich K, Stehling M. et al. Retention of paternal DNA methylome in the developing zebrafish germline. Nat Commun 2019;10:3054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Henson MC, Chedrese PJ. Endocrine disruption by cadmium, a common environmental toxicant with paradoxical effects on reproduction. Exp Biol Med 2004;229:383–92. [DOI] [PubMed] [Google Scholar]
- 19. Kelsey G, Feil R. New insights into establishment and maintenance of DNA methylation imprints in mammals. Phil Trans R Soc B 2013;368:1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Shalata A, Ramirez MC, Desnick RJ. et al. Morbid obesity resulting from inactivation of the ciliary protein CEP19 in humans and mice. Am J Hum Genet 2013;93:1061–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Yıldız Bölükbaşı E, Mumtaz S, Afzal M. et al. Homozygous mutation in CEP19, a gene mutated in morbid obesity, in Bardet-Biedl syndrome with predominant postaxial polydactyly. J Med Genet 2018;55:189–97. [DOI] [PubMed] [Google Scholar]
- 22. Pierron F, Daffe G, Lambert P. et al. Retrotransposon methylation and activity in wild fish (A. anguilla): a matter of size. Environ Pollut 2019;245:494–503. [DOI] [PubMed] [Google Scholar]
- 23. Untergasser A, Nijveen H, Rao X. et al. Primer3Plus, an enhanced web interface to Primer3. Nucleic Acids Res 2007;35:W71–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. McCurley AT, Callard GV. Characterization of housekeeping genes in zebrafish: male-female differences and effects of tissue type, developmental stage and chemical treatment. BMC Mol Biol 2008;9:102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Ribas L, Piferrer F. The zebrafish (Danio rerio) as a model organism, with emphasis on applications for finfish aquaculture research. Rev Aquacult 2014;6:209–40. [Google Scholar]
- 26. Yong WS, Hsu FM, Chen PY. Profiling genome-wide DNA methylation. Epigenetics Chromatin 2016;9:26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Messeguer X, Escudero R, Farré D. et al. PROMO: detection of known transcription regulatory elements using species-tailored searches. Bioinformatics 2002;18:333–4. [DOI] [PubMed] [Google Scholar]
- 28. Farré D, Roset R, Huerta M. et al. Identification of patterns in biological sequences at the ALGGEN server: PROMO and MALGEN. Nucleic Acids Res 2003;31:3651–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Ritter DI, Dong Z, Guo S. et al. Transcriptional enhancers in protein-coding exons of vertebrate developmental genes. PLoS One 2012;7:e35202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Hunt R, Sauna ZE, Ambudkar SV. et al. Silent (synonymous) SNPs: should we care about them? Methods Mol Biol 2009;578:23–39. [DOI] [PubMed] [Google Scholar]
- 31. Velasco G, Ulveling D, Rondeau S. et al. Interplay between histone and DNA methylation seen through comparative methylomes in rare mendelian disorders. Int J Mol Sci 2021;22:3735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Thomassin H, Flavin M, Espinás ML. et al. Glucocorticoid-induced DNA demethylation and gene memory during development. EMBO J 2001;20:1974–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Klengel T, Mehta D, Anacker C. et al. Allele-specific FKBP5 DNA demethylation mediates gene–childhood trauma interactions. Nat Neurosci 2013;16:33–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Wiechmann T, Röh S, Sauer S. et al. Identification of dynamic glucocorticoid-induced methylation changes at the FKBP5 locus. Clin Epigenet 2019;11:83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Kremsky I, Corces VG. Protection from DNA re-methylation by transcription factors in primordial germ cells and pre-implantation embryos can explain trans-generational epigenetic inheritance. Genome Biol 2020;21:118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Wilson KS, Tucker CS, Al-Dujaili EAS. et al. Early-life glucocorticoids programme behaviour and metabolism in adulthood in zebrafish. J Endocrinol 2016;230:125–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Faught E, Vijayan MM. Loss of the glucocorticoid receptor in zebrafish improves muscle glucose availability and increases growth. Am J Physiol Endocrinol Metab 2019;316:E1093–104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Jackson SE, Kirschbaum C, Steptoe A. Hair cortisol and adiposity in a population-based sample of 2,527 men and women aged 54 to 87 years. Obesity 2017;25:539–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Björntorp P, Rosmond R. Obesity and cortisol. Nutrition 2000;16:924–36. [DOI] [PubMed] [Google Scholar]
- 40. Ronco AM, Urrutia M, Montenegro M. et al. Cadmium exposure during pregnancy reduces birth weight and increases maternal and foetal glucocorticoids. Toxicol Lett 2009;188:186–91. [DOI] [PubMed] [Google Scholar]
- 41. Masel J. Genetic drift. Curr Biol 2011;21:R837–8. [DOI] [PubMed] [Google Scholar]
- 42. Bitter MC, Kapsenberg L, Gattuso JP. et al. Standing genetic variation fuels rapid adaptation to ocean acidification. Nat Commun 2019;10:5821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Shayevitch R, Askayo D, Keydar I. et al. The importance of DNA methylation of exons on alternative splicing. RNA 2018;24:1351–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Forster CE, Norcross BL, Spies I. Documenting growth parameters and age in Arctic fish species in the Chukchi and Beaufort seas. Deep Sea Res Part II 2020;177:104779. [Google Scholar]
- 45. Muñoz-López M, García-Pérez JL. DNA transposons: nature and applications in genomics. Curr Genomics 2010;11:115–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Suurväli J, Whiteley AR, Zheng Y. et al. The laboratory domestication of zebrafish: from diverse populations to inbred substrains. Mol Biol Evol 2020;37:1056–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Lai YT, Yeung CKL, Omland KE. et al. Standing genetic variation as the predominant source for adaptation of a songbird. Proc Natl Acad Sci USA 2019;116:2152–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Raw sequencing data were deposited in the European Nucleotide Archive with accession number PRJEB52137. All other data are included in the article and/or supporting information.
