Abstract
The sizes of Astyanax mexicanus blind cavefish populations of North-East Mexico are demographic parameters of great importance for investigating a variety of ecological, evolutionary, and conservation issues. However, few estimates have been obtained. For these mobile animals living in an environment difficult to explore as a whole, methods based on capture-mark-recapture are appropriate, but their feasibility and interpretation of results depend on several assumptions that must be carefully examined. Here, we provide evidence that minimally invasive genetic identification from captures at different time intervals (three days and three years) can give insights into cavefish population size dynamics as well as other important demographic parameters of interest. We also provide tools to calibrate sampling and genotyping efforts necessary to reach a given level of precision. Our results suggest that the El Pachón cave population is currently very small, of an order of magnitude of a few hundreds of individuals, and is distributed in a relatively isolated area. The probable decline in population size in the El Pachón cave since the last census in 1971 raises serious conservation issues.
Keywords: Cavefish, Population size, Conservation, Swabbing, Genetic identification
INTRODUCTION
The Mexican tetra, or Astyanax mexicanus, is an outstanding freshwater fish model to study evolution. This species exists under two forms represented by cave-adapted and surface dwelling populations that can interbreed despite striking differences in morphology, physiology and behavior (Keene et al., 2016). At present, there are 34 described cave locations which host Astyanax cavefish populations (Elliott, 2018; Espinasa et al., 2018; Espinasa et al., 2020; Mitchell et al., 1977). Genetic evidence suggests a recent origin of the cave populations of a few thousand years (<20000 years) according to some authors (Fumey et al., 2018; Policarpo et al., 2021) or up to about 150000 years according to others (Herman et al., 2018). Whatever the date when extant troglomorphic populations first settled in caves and how many independent events were involved, the genetic divergence between surface and cavefish is very low (Avise & Selander, 1972; Bradic et al., 2012; Fumey et al., 2018; Herman et al., 2018). The previous enlisted characteristics make it possible to use genetic methods to search for loci, and eventually to identify mutations, involved in the evolution of cavefish phenotypes (Casane & Rétaux, 2016; Protas et al., 2006). However, the environmental and demographic context should also be considered to better understand the evolutionary mechanisms involved in the fixation of these mutations, that is, the relative roles of selection and genetic drift. For a given subterranean population, the main demographic parameters of interest are its size and the migration rates between this population and other subterranean and surface populations. These parameters can vary across the distribution area and can change through time. Estimates of population size, dispersal potential and level of hybridization with surface fish, and their variations, are also important for conservation purposes.
However, only two capture-mark-recapture (CMR) studies have been performed to estimate the size of Astyanax mexicanus cavefish populations (Elliott, 2018). In March 1971, Elliot caught and marked 201 fish in Sótano de Yerbaniz and 230 in Cueva de El Pachón. A caudal fin clip was used to mark captured fish. One day and three days later, respectively, 226 and 242 fish were caught, among which 4 and 3 were marked. In El Pachón cave, he could see about three to five cavefish per m2, but the water was murky deeper than 60–90 cm, and he assumed there were more fish deeper. The visual estimate alone would have provided a minimal estimate of 950–1600 cavefish, but we now know that the full extent of the El Pachón “Main pool” is not visible to humans because the so-called “Maryland extension” is not accessible when water level is high (Figure 1A). He kept 20 marked fish in a 19 L aquarium in the cave for the duration of the work to gauge the deleterious effects of the fin clip. In Yerbaniz, one control fish died and was eaten by the others after 48 hours. In El Pachón, five control fish died after 24 hours and one was nearly dead, but none showed obvious signs of attack by their fellows. Taking these death rates into account (1/20 and 6/20), the numbers of marked fish were corrected (191 and 161, respectively) and using equation (3) and (4) reported here in materials and methods, population size was estimated to be 9781 and 95% confidence interval (CI) 1179–18283 () in El Pachón and 8671 (95% CI 1810–15534) in Yerbaniz. In 2009, Reynoso et al. captured and marked, by clipping the lower lobule of the caudal fin, 50 individuals in the El Pachón cave and 36 individuals in the Chica cave, but no marked fish were found among the 54 and 14 captured two days later, respectively, making it impossible to estimate the population sizes (Elliott, 2018). The absence of marked fish in the second sample in El Pachón and Chica could be the result of too small samples compared to the population sizes, but also the consequence of a high mortality of marked fish.
Undoubtedly, there is room to improve our knowledge of the demography of these subterranean fish as well as to develop less invasive methods for CMR. Here, we examined the demography of the population in the El Pachón cave using a modified CMR method to better fit some assumptions and identify the most appropriate population model.
MATERIALS AND METHODS
Fish sampling
Since 2004, we have maintained a laboratory stock of Astyanax mexicanus cavefish originating from the El Pachón cave (Sierra de El Abra, Mexico), initially obtained from W. R. Jeffery (University of Maryland, USA). In 2018 and 2022, 16 fish were sampled at random in our breeding facility for genotyping.
During field expeditions in the El Pachón cave, we sampled 20 individuals in 2016 and 35 individuals in 2019. In 2022, we sampled 30 and 29 individuals, respectively, at three days of interval (22nd and 25th February 2022), collected from the Main pool (Figure 1 and Figure 2). In addition, in 2016, we sampled 38 surface fish individuals in the river Tampemole (thereafter called Arroyo Tampemole) and 8 surface fish individuals in a water extraction well close to the El Pachón cave (thereafter called Pozo Pachón Praxedis Guerrero) (Figure 2).
All animals were treated according to the French and European regulations for the handling and care of animals in research. SR’s authorization for use of animals in research including Astyanax mexicanus is 91–116 and the Paris Centre-Sud Ethic Committee protocol authorization number related to this work is 2012–0052.
Fish genotyping
Up to 2022, all fish were fin-clipped and DNA extracted later from the fin fragments conserved in ethanol. Instead of this invasive fin clip procedure, which is the traditional way to get DNA, in 2022 we used a less invasive procedure. Gentle swabbings were performed on each flank of the fish in order to obtain two independent samples per individual. This procedure does not generate any physical damage to the fish and probably reduces stress significantly as compared to fin clipping. In all cases, fin-clipped or swabbed individuals were rapidly released in their natural pond after sampling.
The procedure of sample storage and DNA extraction was optimized to take into account the conditions of fieldwork, as follows.
DNA was extracted from swabs (FLOQSwabsR, COPAN Diagnostics Inc., USA) stored in tubes with silica gel beads at room temperature. Lysis with a mix of buffer (Tris HCl pH 8 100 mmol/L, EDTA 2 mmol/L, triton 0.2%) and proteinase K (250 μg/mL) overnight at 55 °C was followed by inactivation of proteinase K for 10 min at 98 °C. Aliquots of DNA were transferred in new tubes and stored at −20 °C.
Among 26 microsatellite loci that proved to be highly polymorphic in this species (Bradic et al., 2012), we selected 18 loci on the basis that 1) they were polymorphic in El Pachón cave, 2) they could be easily amplified by using polymerase chain reaction (PCR), 3) the amplicon sizes allow the amplification of these 18 loci through only three multiplexed PCR (Supplementary Table S1). Before setting up the PCR reactions, we prepared 10X primer mix with 2 µmol/L of each primer (Multiplex1, Multiplex2 and Multiplex3). The PCR reactions were carried out in 10 µL of final volume with: 1 µL of template DNA, 5 µL 2X PlatinumTM Multiplex PCR Master Mix (ThermoFisher Scientific, USA), 1 µL 10X primer mix and 3 µl H2O. The program used was: 2 min 95 °C, followed by 30 cycles of 30 sec 95 °C, 90 sec 60 °C, 60 sec 72 °C), and a final extension for 30 min at 60 °C.
Genotypes were scored using an ABI 3130 XL Genetic Analyzer, with GeneScanTM 500 LIZTM size standard (ThermoFisher Scientific, USA) and GeneMapperTM software v4.1 (Applied BiosystemsTM, USA). For each locus, several alleles with different sizes were sequenced using homozygous specimens. These sequences allowed us to translate allele relative sizes obtained with GeneMapper into real allele sizes.
For each specimen captured in 2022, two independent samples (swabs) were genotyped. Globally, with most DNA samples (obtained by fin clip or swabbing, and from cave, river, well or lab individuals) we could obtain the genotype at each of the 18 loci, but for eight fish the genotype at one or several loci was missing. These incomplete genotypes were not used in statistical analyses (Supplementary Table S2).
For statistical analyses, only complete genotypes were used. This included 16 samples from our breeding facility (2022); 19, 35, 29 and 29 samples from El Pachón cave collected in the field (2016, 2019, 22nd and 25th February 2022, respectively); 34 samples from Arroyo Tampemole, (2022), and six samples from Pozo Pachón Praxedis Guerrero (2022).
Statistical analyses
Estimation of population size
The Pachón cave undergoes cycles where during the dry season, water level goes down and isolated pools are formed. During the rainy season, water level goes up, a stream of moving current is formed, and the different pools can be joined into a single body of water. During the dry season, fish are segregated and isolated into the different pools (Figure 1). For this study, we conducted a capture recapture study to estimate the population size by sampling fish in the Main pool during a dry season. This represents but a fraction of the actual population that can exchange individuals, not only from the four pools accessible to humans, but also from any unexplored pool beyond the sumps. To get an estimate of the actual population size, a capture recapture study was performed between the years of 2016, 2019 and 2022, allowing for a minimum of two flooding events of the cave between captures, during two rainy seasons. Since in a minimum of two-yearly events, both human accessible and non-human accessible pools exchanged individuals, an estimate can be obtained for the number of individuals that yearly get intermixed for cave locality visited by scientists. This accounts for the subpopulation of Astyanax that can readily interchange individuals during normal years. It may be that perched or distant pools are only accessible during strong hurricanes or once in a hundred years flooding events, but these, being infrequent events, are not accounted for in our experimental design.
For these small, mobile and relatively numerous animals that cannot be counted directly over their whole distribution area, population size can be estimated using capture-mark-recapture (CMR) methods (Bailey, 1951). The principle is as follows: we capture, mark and release a animals out of a total population of size x. After the marked animals have freely mingled with the unmarked, we re-catch a random sample of size n, of which r are found to be marked. Bailey obtained the maximum likelihood estimate of x and its variance:
1 |
2 |
If r = 0 (no recapture of marked individuals), the population size cannot be estimated as has an infinite expectation. If r is very small (a few marked individuals are recaptured), the variance is very large and we can get the order of magnitude of the population size rather than a precise estimate.
Excluding the case r = 0, Bailey found that the expectation of is biased:
where is the expectation of . In order to take into account this bias when m is small, he proposed an adjusted estimate and gave its variance:
3 |
4 |
Equation (3) should not be used as a mathematical trick to solve the issue that a population size cannot be estimated if no marked animal is recaptured. Often, equation (3) and (4) are used instead of equations (1) and (2), not for obtaining precise and less biased estimates, but for getting an estimate with a smaller variance. However, the only valid solution to have a precise estimate, that is with a small variance, is to capture a large fraction of the total population in order to recapture many marked individuals. Moreover, the validity of the estimate depends on several assumptions that should be verified: 1) the population is closed, 2) all animals are equally likely to be captured, 3) capture and marking do not affect catchability, 4) marks are not lost.
The last two assumptions are a matter of the methodology. Researchers should use permanent marks and marking should be as little invasive as possible to limit effect on catchability. Whether the studied population is closed and restricted in a well-defined distribution area in which there is neither immigration nor emigration is often an unsettled issue. This is particularly true for cave animals for which we often can explore only a small part of their subterranean world. As marked and unmarked animals must have freely mingled before recapture, the animals must be mobile enough and the interval between sampling must be long enough to allow mixture. The validity of these assumptions depends on the model of the distribution of cavefish.
Polymorphism variation through time
Changes in allele frequencies at each locus between each pair of sampling dates were tested using the Fisher’s exact test, using the fisher.test function from the stats package in R 4.1.0 (R Core Team, 2021). As 180 tests were performed, we used the Bonferroni correction (threshold α=0.05/180) and Benjamini-Hochberg procedure in order to decrease the false discovery rate using the p.adjust function from the stats package in R (R Core Team, 2021).
Multidimensional scaling
To represent the matrix of genetic distances, the genetic distance between two individuals being the number of allelic differences, as a two-dimensional scatter plot, we performed a metric multidimensional scaling (MDS) using the cmdscale function from the stats package in R (R Core Team, 2021). We used only individuals for which a complete genotype was available.
Distribution of pairwise genetic distances according to kinship
First, we estimated the probability that two unrelated individuals would have the same genotype (Puni) using the following formula:
5 |
where Pid is the probability that two unrelated individuals have the same genotype at a given locus, n is the number of alleles at a given locus, fi is the frequency of allele i, and L is the number of loci analyzed.
Although it is simple to compute the probability that two unrelated individuals have the same genotype, it is more difficult to compute the expected distribution of the genetic distance, that is the number of different alleles between two individuals, according to their kinship (i.e. unrelated, parent-descendant, full siblings, half siblings). These distributions were obtained using simulations. Pairs of genotypes were generated for different kinships. For two unrelated individuals, at each locus, the genotype of each individual was generated by randomly sampling two alleles, considering current allele frequencies in the population. For a parent and its descendant, the genotype of the parent was generated as described above, and the genotype of the descendant was generated by sampling at random one allele from this parent and the other allele from an unknown parent, that is from the population, considering allele frequencies. For full siblings, two unrelated parents were first generated; then, each descendant was generated by randomly sampling one allele from each parent. For half siblings, one mother and two unrelated fathers were first generated; then, one descendant was generated by randomly sampling one allele from the mother and one allele from one father. The other descendant was generated by randomly sampling one allele from the same mother and one allele from the other father. For each kind of relationship, one thousand simulations were performed to estimate the empirical distribution of pairwise distances according to a given kinship. Simulations and data analyses were carried out in the R statistical environment (R Core Team, 2021). The R script “GenerateIndividual.r” can be found in GitHub “jmorode/Genetics”.
Estimation of genealogical relationships
The software ML Relate (Kalinowski et al., 2006) was used to find evidence of relatedness between cavefish. The accuracy of inferred genealogical relationships by the software was evaluated using simulated families of individuals of known genotypes. One thousand families were generated using the R script described above. Each family was composed by a mother, two unrelated fathers, two full siblings and two pairs of half siblings. This family composition allowed us to test the four relationships assessed by ML Relate: parent-offspring, full siblings, half siblings and unrelated. The genotypes of the members of these 1000 families were written in an input file for ML Relate. For each pair of individuals in each family, the known relationship was compared with the one inferred by ML Relate. In each family, six unrelated individuals, one pair of full siblings, two pairs of half siblings and six pairs of parent-offspring were expected. The percentage of known relationships found by ML Relate over the 1000 families was interpreted as an estimation of the accuracy of this software when identifying genealogical relationships in our observed population of cavefish.
RESULTS
Genetic polymorphism in the El Pachón cave and nearby surface locations
Individuals collected in the El Pachón cave from 2016 to 2022 were genotyped at 18 microsatellite loci (Figure 1A’). In addition, El Pachón individuals from our lab stock, derived from El Pachón cavefish collected before 2004 (probably in the 1990’s), and surface fish from two locations, a well and a river close to the El Pachón cave, were also genotyped (Figure 2). All genotypes are reported in Supplementary Table S2.
The number of alleles at each locus was smaller in the El Pachón cave samples than in the two surface samples (Table 1). In total, among the 18 microsatellite loci analysed, only 54 alleles were recovered from 113 El Pachón cavefish whereas 203 alleles were recovered from 43 surface fish. The mean heterozygosity in El Pachón cave (0.2) is about 4 times lower than in river-dwelling fish (0.66 and 0.80 in Pozo Pachón Praxedis Guerrero and Arroyo Tampemole, respectively).
Table 1. Allelic diversity in surface locations and El Pachón cave.
H-c5 | I-b1 | V-f3 | A8-g5 | X-a3 | W-f6 | A13-e5 | L-b9 | A2-a7 | V-c10 | A13-f8 | A6-f1 | A6-h6 | A14-d8 | A4-g11 | W-d11 | W-c12 | A5-f9 |
The names of the 18 microsatellite loci analyzed are indicated in the first line. For each locus, allele sizes are indicated in columns. Alleles in red are found in El Pachón, those highlighted in green are unique to El Pachón. | |||||||||||||||||
154 | 303 | 201 | 150 | 198 | 273 | 242 | 213 | 106 | 228 | 107 | 257 | 103 | 115 | 161 | 154 | 169 | 194 |
156 | 305 | 205 | 160 | 200 | 277 | 244 | 215 | 108 | 232 | 109 | 265 | 105 | 121 | 165 | 162 | 171 | 196 |
158 | 307 | 207 | 162 | 202 | 281 | 246 | 223 | 110 | 234 | 113 | 267 | 107 | 123 | 167 | 166 | 173 | 198 |
160 | 309 | 209 | 164 | 204 | 282 | 248 | 225 | 112 | 236 | 115 | 269 | 111 | 125 | 169 | 170 | 177 | 200 |
162 | 311 | 211 | 166 | 206 | 283 | 250 | 227 | 114 | 238 | 117 | 271 | 113 | 127 | 171 | 172 | 179 | 204 |
164 | 313 | 213 | 168 | 208 | 285 | 252 | 229 | 116 | 240 | 119 | 273 | 115 | 129 | 173 | 174 | 185 | 206 |
166 | 315 | 215 | 170 | 210 | 287 | 254 | 233 | 118 | 242 | 121 | 121 | 131 | 177 | 176 | 193 | 207 | |
168 | 317 | 172 | 212 | 289 | 256 | 259 | 120 | 244 | 123 | 123 | 133 | 179 | 178 | ||||
170 | 319 | 174 | 291 | 258 | 271 | 122 | 246 | 125 | 125 | 135 | 183 | 180 | |||||
172 | 329 | 176 | 293 | 262 | 124 | 248 | 127 | 127 | 137 | 182 | |||||||
174 | 178 | 295 | 264 | 126 | 129 | 129 | 143 | 184 | |||||||||
178 | 297 | 272 | 128 | 131 | 131 | 145 | 186 | ||||||||||
130 | 133 | 135 | 147 | 188 | |||||||||||||
134 | 135 | 139 | 190 | ||||||||||||||
138 | 137 | 192 | |||||||||||||||
140 | 139 | 194 | |||||||||||||||
142 | 143 | 196 | |||||||||||||||
144 | 147 | 200 | |||||||||||||||
148 | 149 | 202 | |||||||||||||||
150 | 161 | 204 | |||||||||||||||
156 | |||||||||||||||||
164 | |||||||||||||||||
172 |
Allele frequencies are reported in Table 2 for El Pachón cave samples and Supplementary Table S3 for surface populations.
Table 2. Allele frequencies in El Pachón cave.
H-c5 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) | I-b1 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) | V-f3 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) |
For each of the 18 loci analyzed, allele frequencies are shown for each session of sampling in the lab (2018+2021) and in the cave (2016, 2019 and 2022). Yellow highlighting indicates the frequency of dominant alleles over the years. Lab stock was obtained with fish sampled before 2004. He: Expected heterozygosity. | |||||||||||||||||
154 | 1,00 | 0.97 | 0.99 | 0.98 | 0.98 | 305 | 0.10 | 0.06 | 0.08 | 0.10 | 213 | 0.01 | 0.02 | ||||
172 | 0.03 | 0.01 | 0.02 | 0.02 | 311 | 0.09 | 0.03 | 0.04 | 0.02 | 0.02 | 215 | 1,00 | 1,00 | 0.99 | 0.98 | 1,00 | |
313 | 0.91 | 0.87 | 0.90 | 0.90 | 0.88 | ||||||||||||
He | 0.00 | 0.05 | 0.03 | 0.03 | 0.03 | He | 0.17 | 0.23 | 0.18 | 0.19 | 0.22 | He | 0.00 | 0.00 | 0.03 | 0.03 | 0.00 |
A8-g5 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) | X-a3 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) | W-f6 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) |
160 | 0.45 | 0.31 | 0.33 | 0.29 | 202 | 0.81 | 0.74 | 0.81 | 0.67 | 0.71 | 289 | 1,00 | 1,00 | 0.97 | 0.98 | 1,00 | |
168 | 1,00 | 0.55 | 0.69 | 0.67 | 0.67 | 206 | 0.03 | 0.03 | 0.02 | 295 | 0.03 | 0.02 | |||||
176 | 0.04 | 208 | 0.19 | 0.23 | 0.16 | 0.31 | 0.29 | ||||||||||
He | 0.00 | 0.49 | 0.43 | 0.44 | 0.46 | He | 0.30 | 0.40 | 0.31 | 0.45 | 0.41 | He | 0.00 | 0.00 | 0.06 | 0.03 | 0.00 |
A13-e5 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) | L-b9 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) | A2-a7 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) |
242 | 0.03 | 0.02 | 0.03 | 213 | 0.34 | 0.09 | 0.05 | 0.05 | 106 | 0.02 | |||||||
246 | 0.03 | 0.02 | 225 | 0.66 | 1,00 | 0.91 | 0.95 | 0.95 | 108 | 0.56 | 0.24 | 0.13 | 0.12 | 0.19 | |||
252 | 0.03 | 112 | 0.37 | 0.32 | 0.33 | 0.40 | |||||||||||
254 | 1,00 | 0.97 | 0.97 | 0.96 | 0.91 | 114 | 0.28 | 0.34 | 0.44 | 0.45 | 0.36 | ||||||
256 | 0.03 | 124 | 0.16 | 0.05 | 0.11 | 0.10 | 0.03 | ||||||||||
He | 0.00 | 0.05 | 0.06 | 0.07 | 0.16 | He | 0.45 | 0.00 | 0.16 | 0.10 | 0.10 | He | 0.58 | 0.69 | 0.68 | 0.67 | 0.67 |
V-c10 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) | A13-f8 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) | A6-f1 | Lab | 2016 | 2019 | 2022 (1) | 2022 (2) |
228 | 0.04 | 0.02 | 0.03 | 121 | 0.05 | 0.02 | 265 | 0.16 | 0.41 | 0.14 | 0.20 | 0.31 | |||||
246 | 0.03 | 0.04 | 0.05 | 125 | 1,00 | 0.95 | 0.88 | 0.93 | 0.98 | 267 | 0.05 | 0.02 | |||||
248 | 1,00 | 0.97 | 0.92 | 0.93 | 0.97 | 127 | 0.06 | 0.02 | 271 | 0.84 | 0.59 | 0.81 | 0.78 | 0.69 | |||
135 | 0.06 | 0.03 | 0.02 | ||||||||||||||
He | 0.00 | 0.05 | 0.16 | 0.13 | 0.07 | He | 0.00 | 0.10 | 0.21 | 0.13 | 0.03 | He | 0.26 | 0.49 | 0.31 | 0.35 | 0.43 |
A6-h6 | lab | 2016 | 2019 | 2022 (1) | 2022 (2) | A14-d8 | lab | 2016 | 2019 | 2022 (1) | 2022 (2) | A4-g11 | lab | 2016 | 2019 | 2022 (1) | 2022 (2) |
103 | 1,00 | 0.97 | 0.99 | 0.98 | 0.97 | 129 | 1,00 | 1,00 | 1,00 | 1,00 | 1,00 | 165 | 1,00 | 1,00 | 0.94 | 0.98 | 0.98 |
115 | 0.03 | 0.01 | 0.02 | 0.03 | 171 | 0.06 | 0.02 | 0.02 | |||||||||
He | 0.00 | 0.05 | 0.03 | 0.03 | 0.07 | He | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | He | 0.00 | 0.00 | 0.11 | 0.03 | 0.03 |
W-d11 | lab | 2016 | 2019 | 2022 (1) | 2022 (2) | W-c12 | lab | 2016 | 2019 | 2022 (1) | 2022 (2) | A5-f9 | lab | 2016 | 2019 | 2022 (1) | 2022 (2) |
162 | 0.03 | 0.16 | 0.19 | 0.14 | 0.22 | 173 | 0.03 | 0.01 | 0.03 | 0.07 | 198 | 1,00 | 0.97 | 0.96 | 1,00 | 0.97 | |
166 | 0.18 | 0.33 | 0.26 | 0.21 | 177 | 0.01 | 0.02 | 200 | 0.03 | 0.04 | 0.03 | ||||||
172 | 0.08 | 185 | 1,00 | 0.97 | 0.98 | 0.97 | 0.91 | ||||||||||
176 | 0.19 | 0.24 | 0.09 | 0.22 | 0.09 | ||||||||||||
186 | 0.75 | 0.32 | 0.37 | 0.38 | 0.45 | ||||||||||||
188 | 0.03 | 0.02 | 0.01 | ||||||||||||||
200 | 0.01 | 0.03 | |||||||||||||||
He | 0.40 | 0.78 | 0.71 | 0.72 | 0.70 | He | 0.00 | 0.05 | 0.06 | 0.07 | 0.16 | He | 0.00 | 0.05 | 0.08 | 0.00 | 0.07 |
The polymorphism in El Pachón cave appears stable since 2004, as all alleles recovered from lab individuals (i.e., derived from fish taken from the cave before 2004) are still present in the cave in 2022. In the lab stock, genetic drift may have led to the loss of some rare alleles (Table 2; Figure 3). Indeed, Fisher’s exact tests, with no correction for multiple tests (Supplementary Figure S1A), with a Bonferroni correction (Supplementary Figure S1B) and applying the Benjamini-Hochberg procedure (Supplementary Figure S1C) indicate that significant allele frequencies differences observed between pairs of El Pachón samples involve most often the lab sample. However, these differences are minimal as compared to those observed between El Pachón samples and surface fish samples, or even between the two different surface fish locations (Figure 3, orange and green dots).
Of note, the polymorphism observed in our El Pachón cavefish samples is different from the one estimated in individuals collected in 2008 at the same place (Bradic et al., 2012) (Supplementary Table S4). The lengths of the alleles are different, and even doing a translation, we could not align the data. Moreover, higher polymorphism was found in Bradic et al. (2008). They often found two alleles differing by one repeat and with similar frequencies, while we find that for most loci one allele had a high frequency (>80%). It is currently difficult to understand this discrepancy and these data were not further considered in our analyses.
Genetic evidence of recaptures and population size estimation in El Pachón cave
The genetic capture-recapture method is based on the assumption that two fish have a very low probability of having the same genotype. The probability that, in our samples, two unrelated individuals have the same genotype was estimated using equation (5). For surface populations which display high allelic diversity, the probability is extremely low, 2.1×10−15 and 1.5×10−23 in Pozo Pachón Praxedis Guerrero and Arroyo Tampemole, respectively. For El Pachón cave, the probability is 2.1×10−4, 2.8×10−4, 2.2×10−4 and 2.2×10−4 for samples obtained in 2016, 2019 and 22 Feb 2022 and 25 Feb 2022, respectively. Thus, it is virtually impossible to catch unrelated surface fish with the same genotype, and it is unlikely for two unrelated cavefish to have the same genotype - despite the lower genetic polymorphism in the El Pachón cave population. One must also take into account that closely related fish, like full sibs, have more similar genotypes than unrelated individuals. Simulations of the genotypes of pairs of individuals with various relatedness were realized in order to estimate the probability of finding a given genetic distance, in the range of 0 to 36, according to kinship (Supplementary Figure S2). For full sibs (the most closely related individuals), in our simulations, we did not observe identical genotypes in surface populations, but about 2% in the El Pachón population. These results indicate that even if there were some closely related individuals in our samples of cavefish, the probability is low that they share the same genotype. In accordance with this assumption, we found four pairs of identical genotypes from different El Pachón samples, but none within a given sample. We therefore considered that identical genotypes in different samples corresponded to the same individual that had been recaptured.
On the 22nd February 2022, 29 fish were sampled by swabbing in El Pachón Main pool and genotyped, and 29 fish were sampled 3 days later in the same place and genotyped. We identified 3 pairs of identical genotypes that we assumed to be recaptured individuals (Supplementary Table S5). Using equation (3) and (4), the estimated population size for the Main pool in El Pachón cave during the dry season was therefore of 218 cavefish (95% CI 40–395).
We also analysed the multi-year data to obtain population size across pools, both accessible and non-accessible to humans that readily exchange individuals during rainy seasons. Among 55 genotypes observed in 2022, we found three genotypes that were identical to genotypes observed in 2019 (Figure 1B), suggesting that after a minimum of two events when the pools get joined into a single body of water, three individuals have been recaptured three years after their first capture (Supplementary Table S5).
The estimation of their age from their size (Simon et al., 2017) suggests that these three individuals were 3–4 years old in 2019, thus 6–7 years old in 2022, i.e., largely in the range of cavefish lifespan. Taking into account that 35 were captured in 2019 and assuming a low mortality between 2019 and 2022 of the fishes captured in 2019, using equation (3) and (4), we obtained another estimate of the population size in El Pachón cave, 490 (95% CI 76–904). Both estimates point to a small population consisting of a few hundreds of individuals.
Genealogical Relatedness
Individual genotypes from a small population give an opportunity to examine kinship. However, the reliability of kinship inferences depends on the level of polymorphism. We assessed the possibility to infer kinships among surface fish on the one hand and El Pachón cavefish on the other hand using the software ML Relate. First, pairs of individuals with different relatedness (unrelated individuals, offspring-parent, full sibs and half sibs) were simulated taking into account allele frequencies. Then the true kinship was compared to the one inferred by ML Relate. For surface populations, the accuracy was 81% and 92% for Pozo Pachon Praxedis Guerrero and Arroyo Tampemole respectively, suggesting that the polymorphism is sufficient to infer relatedness with a good confidence (Supplementary Table S6). However, for cavefish the accuracy was only about 50% (Supplementary Table S6). The polymorphism at the 18 microsatellite loci studied herein is thus not sufficient to identify related individuals in El Pachón cave.
DISCUSSION
We can imagine several possibilities for the distribution of Astyanax cavefish populations, the two extremes being called here the “Oasis” and “Sea” models, and an intermediate “Lake” model (Figure 4). Moreover, one model can be the most appropriate for a given region in the distribution area, but another model for another region. Moreover, in a given region, the best model can change through time. Even if models are oversimplifications of the reality, they help deciphering what is plausible from what is unlikely in a given case.
The “Oasis” model states that the fish distribution is very patchy, with few small isolated populations that are not permanently connected. It fits well the definition of closed populations. Some populations are accessible, so that population size and its variation through time can be estimated. In addition, if these populations have a relatively high extinction rate and if some empty areas are sporadically invaded, this system has the dynamic of a metapopulation. At the other extreme stands the “Sea” model. There, we have access to a few subterranean pools that are well connected to a single network over a large distribution area. In this case, the population of cavefish is large and homogeneous, and it is almost impossible to study its size using a CMR approach. The intermediate “Lake” model proposes the existence of clusters of caves, with caves well connected within a cluster, but clusters poorly interconnected. A cluster of caves fits the definition of a closed population, but a cave only gives access to a fraction of the population. Yet not impossible, it is difficult to study the demography of such large population. This discouraging outlook may have limited the research activity on cavefishes in this domain (Bichuette & Trajano, 2021; Elliott, 2018).
Genetic tags for long-term population surveys
Two assumptions about the experimental conditions, which should be verified for CMR analyses, are that capturing and marking have a minimal effect on the probability of recapturing. Several reports indicate that fin clip can affect the behaviour and be lethal for a substantial percentage of fish (Elliott, 2018). To reduce as much as possible this problem that is both technical and ethical, in 2022 we used a method with swabbing instead of fin-clipping to collect DNA samples, which has been successfully applied for population estimates in other species including the flagship cave species, Proteus anguinus (Prunier et al., 2012; Trontelj & Zaksek, 2016). This approach is likely much less invasive (Tilley et al., 2020), if we do not take into account the stress induced by net capture and human handling during the swabbing procedure – which was performed as gently as possible. Yet it does “mark” captured fish genetically. Genotyping is then necessary to identify captured individuals. Although more time-consuming and costly, the great advantage of marking by genotyping is that we have access to a unique and stable tag that allows the identification of each animal over its entire life. However, the uniqueness of a genotype depends on the genetic diversity in the population. Microsatellite loci are particularly useful because their high mutation rate leads to several alleles at a given locus when the population size is relatively large, and individuals can be identified based on the combination of a small number of loci. Conversely, in small populations, the polymorphism can be low and genotyping more loci might be necessary to associate a genotype with a unique individual. Here, we found that a combination of 18 microsatellite loci is sufficient to identify each El Pachón cavefish and compute population size, but the population is not polymorphic enough to infer kinship. The more reliable kinships inferred with surface fish polymorphism, which is about four times larger than in El Pachón cave, suggest that the use of four times more microsatellite markers would be necessary to get reliable inferences of kinship between cavefish.
Size and isolation of the cavefish population in the El Pachón cave
There are four pools in El Pachón cave with sizable numbers of fish, but only two are usually accessible to humans (Figure 1): the Main pool, which measures about 25 m×5 m and in which the study was conducted, and a small pool on a side lateral gallery, about 3 m×1.5 m in size. These two pools connect during the rainy season. Fish are concentrated in the small pool at higher densities. We have visited the El Pachón cave multiple times in the past 14 years and with photos and in situ observations, we estimate that in the small Lateral pool there are usually about 50–150 individuals. Often, a large part of the Main pool is hidden to humans as water continues into the “Maryland extension” under a sump (Elliott, 2018). Only in 2003, 2009 and in February 2022 when we performed the genetic mark and recapture experiment have we seen the water level so extremely low as to give access to the Maryland extension galleries (Figure 1B). In 2022, we visited the Maryland extension galleries and in particular the perched pool named UAEM pool (Elliott, 2018). This pool is higher and does not connect normally with the Main pool. We did not do genetic mark and recapture experiments at the UAEM pool, but observations in situ suggest that the density of fish is similar to the Main pool. The fourth pool was not visited in 2022, but in 2003 there may have been an estimate of about 50 individuals based on in situ observations. Given that during extreme rainy seasons all four pools may exchange individuals, the population size obtained for the Main pool (218 fish) could be multiplied by ~3 for the four human accessible pools.
Our analysis therefore suggests that for these four pools, El Pachón cavefish population size in 2022 was in the order of magnitude of a few hundreds. Our second study attempted to resolve the total population size that readily exchanges individuals in normal years, including pools not accessible to humans. The capture in 2022 of three fish that were first captured in 2019 (with samples of 35 and 55 fish, respectively) gives a population size of 490 fish, which is in line with the estimate 654 (3×218) obtained with the CMR in 2022. This indicates that few individuals, if any, outside the known pools at El Pachón contribute in exchanging individuals during normal years. Together, these results suggest that the El Pachón population corresponds to the Oasis model, i.e., a small population, perched and poorly connected with other subterranean populations. Indeed, large migrations of fish between this known population and other unknown populations would prevent the recapture of fish after several years. Moreover, and importantly, the low genetic diversity and the stability of allele frequencies over time support the absence of a gene flow from surface or other genetically differentiated cave populations between ~1990/2000 and 2022. In 1986 and 1988, some individuals with variable eye sizes and melanin pigmentation were observed in the albino and eye-reduced El Pachón population (Langecker et al., 1991). Thus, it is possible that sporadic migrations of surface fish occurred but the low and stable genetic polymorphism in the cave during the last 20 years when compared to the high polymorphism in nearby surface populations shows that surface fish migrations had an undetectable impact on the genetic diversity in the cave during the last few decades.
Population genetic issues
Several researchers used genetic polymorphism to estimate the effective population size (Ne) and the migration rate in El Pachón cave. The first study, examining the allozyme variations at 17 loci, found no polymorphism in this population, suggesting a small Ne and isolation from surface populations (Avise & Selander, 1972). Later on, the analysis of 26 microsatellite loci also pointed out a small Ne (<1000) and limited gene flow (Bradic et al., 2012). A re-analysis of this data set, combined with transcriptomic data and using other statistical approaches led to the same conclusion (Fumey et al., 2018). Finally, a study using genomic data estimated that Ne could be higher (median=32000, min=3000 and max=46000) (Herman et al., 2018). Ne is a population genetic parameter that depends on the long term census population size (Nc) but also on a series of other biological and demographic parameters (Charlesworth, 2009). In the case of Astyanax cavefish, two parameters could lead to a Ne much lower than Nc, potentially several order of magnitude lower. First, episodes of low population size are known to have a disproportionate effect on the overall value of Ne (Charlesworth, 2009). Moreover, when females have the potential to lay thousands of eggs during their reproductive lifespan, like in this species, this can lead to a much larger variance in offspring number than expected with purely random variation, reducing Ne much below Nc. Indeed, if many individuals from only a few egg laying events survive during exceptional and favorable environmental conditions whereas most individuals from most egg laying die, then the variance of reproductive success can be very large, a process known as sweepstakes reproductive success (Hedgecock, 1994). In a small population, this could result in many full- or half-siblings in a cohort sample. The hypothesis might be tested using genetic polymorphism. Even if the variance of reproductive success is not large, we can assume that Ne is at best in the order of magnitude of the smallest Nc, that is a few hundreds.
Conservation issues
In 1971, Elliott estimated the El Pachón cavefish population to ~10000 individuals (Elliott, 2018). Our 2022 estimate of the number of cavefishes in El Pachón cave to a few hundreds of fish, i.e., an order of magnitude lower, is very worrying, and questions the long-term maintenance of this population. The causes of the apparent decline in population size in this cave may be manifold. Theats might include variations in water quality parameters or water levels due to pumping, human impacts like phreatic contamination, habitat disturbing like paintings of the cave walls above the Main pool and plastic waste inside the caves, as well as too frequent and too important samplings involving lethal experiments or removal of specimens from the cave by researchers, which are all facilitated by the ease of access to the El Pachón cave. The effect by researchers should be easily and collectively handled and corrected by the Astyanax cavefish research community. Given all the intellectual benefits they have received from this single cavefish population, scientists should be actively and collectively engaged in its conservation. The swabbing procedure used here will be an important tool in this direction and should be implemented for genetic sampling in the future. A longitudinal monitoring of the El Pachón cave population and the ecological parameters in the cave will also be paramount to counteract population decline and avoid extinction. Unfortunately and sadly, the El Pachón cavefish population, which has been the most studied and emblematic since its first scientific description in 1946 as “Astyanax antrobius” (Elliott, 2018) could well be the victim of its own success.
SUPPLEMENTARY DATA
SCIENTIFIC FIELD SURVEY PERMISSION INFORMATION
Astyanax mexicanus: fin clips from surface and cave morphs of A. mexicanus were sampled during field expeditions, under the field permits SGPA/DGVS/02438/16, SGPA/DGVS/1893/19, and SGPA/DGVS/03334/22 provided to P.O.G and S.R. by the Secretaría de Medio Ambiente y Recursos Naturales of Mexico (SEMARNAT).
DATA AVAILABILITY
The R script “GenerateIndividual.r” can be found in GitHub “jmorode/Genetics”.
COMPETING INTERESTS
The authors declare that they have no competing interests.
AUTHORS’ CONTRIBUTIONS
L.L., S.R., and D.C. conceived and designed the research; L.L., J.R., M.P., M.P., J.L., S.P., J.F., C.H., P.O.G., L.E., S.R., and D.C. conducted field surveys and collected samples; I.G., M.P., and C.Q. performed the molecular work; J.R., M.P., J.F., S.R., and D.C. analyzed the data; L.L., J.R., P.O.G., L.E., S.R., and D.C. wrote the manuscript. All authors read and approved the final version of the manuscript.
Funding Statement
This work was supported by a CNRS MITI (Mission pour les Initiatives Transverses et Interdisciplinaires) grant “Expérimentation en Milieux Extrêmes” to S.R. and collaborative grants from Agence Nationale de la Recherche (BLINDTEST) and Institut Diversité Ecologie et Evolution du Vivant to S.R. and D.C.
Contributor Information
Sylvie Rétaux, Email: sylvie.retaux@cnrs.fr.
Didier Casane, Email: didier.casane@universite-paris-saclay.fr.
References
- Avise JC, Selander RK. 1972. Evolutionary genetics of cave-dwelling fishes of the genus Astyanax. Evolution, 26(1): 1–19.
- Bailey NTJ On estimating the size of mobile populations from recapture data. Biometrika. 1951;38(3/4):293–306. doi: 10.2307/2332575. [DOI] [Google Scholar]
- Bichuette ME, Trajano E Monitoring Brazilian cavefish: ecology and conservation of four threatened catfish of genus Ituglanis (Siluriformes: Trichomycteridae) from Central Brazil. Diversity. 2021;13(2):91. doi: 10.3390/d13020091. [DOI] [Google Scholar]
- Bradic M, Beerli P, García-de León FJ, et al Gene flow and population structure in the Mexican blind cavefish complex (Astyanax mexicanus) BMC Evolutionary Biology. 2012;12:9. doi: 10.1186/1471-2148-12-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Casane D, Rétaux S. 2016. Evolutionary genetics of the cavefish Astyanax mexicanus. Advances in Genetics, 95: 117–159.
- Charlesworth B Effective population size and patterns of molecular evolution and variation. Nature Reviews Genetics. 2009;10(3):195–205. doi: 10.1038/nrg2526. [DOI] [PubMed] [Google Scholar]
- Elliott W The Astyanax caves of Mexico. Cavefishes of Tamaulipas, San Luis Potosi, and Guerrero. Association for Mexican Cave Studies. Bulletin. 2018;26:1–325. [Google Scholar]
- Espinasa L, Legendre L, Fumey J, et al A new cave locality for Astyanax cavefish in Sierra de El Abra, Mexico. Subterranean Biology. 2018;26:39–53. doi: 10.3897/subtbiol.26.26643. [DOI] [Google Scholar]
- Espinasa L, Ornelas-García CP, Legendre L, et al Discovery of two new astyanax cavefish localities leads to further understanding of the species biogeography. Diversity. 2020;12(10):368. doi: 10.3390/d12100368. [DOI] [Google Scholar]
- Fumey J, Hinaux H, Noirot C, et al Evidence for late Pleistocene origin of Astyanax mexicanus cavefish. BMC Evolutionary Biology. 2018;18(1):43. doi: 10.1186/s12862-018-1156-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hedgecock D. 1994. Does variance in reproductive success limit effective population sizes of marine organisms? In: Beaumont AR. Genetics and Evolution of Aquatic Organisms. London: Chapman & Hall, 122–134.
- Herman A, Brandvain Y, Weagley J, et al The role of gene flow in rapid and repeated evolution of cave-related traits in Mexican tetra. Astyanax mexicanus. Molecular Ecology. 2018;27(22):4397–4416. doi: 10.1111/mec.14877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kalinowski ST, Wagner AP, Taper ML ML-RELATE: a computer program for maximum likelihood estimation of relatedness and relationship. Molecular Ecology Notes. 2006;6(2):576–579. doi: 10.1111/j.1471-8286.2006.01256.x. [DOI] [Google Scholar]
- Keene AC, Yoshizawa M, McGaugh SE. 2016. Biology and Evolution of the Mexican Cavefish. San Diego: Academic Press.
- Langecker TG, Wilkens H, Junge P Introgressive hybridization in the Pachon Cave population of Astyanax fasciatus (Teleostei: Characidae) Ichthyological Exploration of Freshwaters. 1991;2(3):209–212. [Google Scholar]
- Mitchell RW, Russell WH, Elliott WR Mexican eyeless characin fishes, genus Astyanax: environment, distribution, and evolution. Special Publications, the Museum Texas Tech University. 1977;12:1–89. [Google Scholar]
- Policarpo M, Fumey J, Lafargeas P, et al Contrasting gene decay in subterranean vertebrates: insights from cavefishes and fossorial mammals. Molecular Biology and Evolution. 2021;38(2):589–605. doi: 10.1093/molbev/msaa249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Protas ME, Hersey C, Kochanek D, et al Genetic analysis of cavefish reveals molecular convergence in the evolution of albinism. Nature Genetics. 2006;38(1):107–111. doi: 10.1038/ng1700. [DOI] [PubMed] [Google Scholar]
- Prunier J, Kaufmann B, Grolet O, et al Skin swabbing as a new efficient DNA sampling technique in amphibians, and 14 new microsatellite markers in the alpine newt (Ichthyosaura alpestris) Molecular Ecology Resources. 2012;12(3):524–531. doi: 10.1111/j.1755-0998.2012.03116.x. [DOI] [PubMed] [Google Scholar]
- R Core Team. 2021. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing.
- Simon V, Elleboode R, Mahé K, et al Comparing growth in surface and cave morphs of the species Astyanax mexicanus: insights from scales. EvoDevo. 2017;8(1):23. doi: 10.1186/s13227-017-0086-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tilley CA, Carreño Gutierrez H, Sebire M, et al Skin swabbing is a refined technique to collect DNA from model fish species. Scientific Reports. 2020;10(1):18212. doi: 10.1038/s41598-020-75304-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trontelj P, Zakšek V Genetic monitoring of Proteus populations. Natura Sloveniae. 2016;18(1):53–54. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.