Abstract
The genomic architecture of barriers to gene exchange during the speciation process is poorly understood. The genomic islands model suggests that loci associated with barriers to gene exchange prevent introgression of nearby genomic regions via linkage disequilibrium. But few analyses of the actual genomic location of non-introgressing loci in closely related species exist. In a previous study Maroja et al. showed that in the hybridizing field crickets, Gryllus firmus and G. pennsylvanicus, 50 non-introgressing loci are localized on two autosomal regions and the X chromosome, but they were not able to map the loci along the X chromosome because they used a male informative cross. Here, we localize the introgressing and non-introgressing loci on the X chromosome, and reveal that all X-linked non-introgressing loci are restricted to a 50-cM region with 10 of these loci mapped to a single location. We discuss the implications of this finding to speciation.
Introduction
Although the basic characteristics of barriers to gene exchange are relatively well understood, the underlying genomic architecture of such barriers is not. Investigation of the localization of alleles or chromosomal rearrangements associated with reduced gene flow can give us insight into the genomic architecture of barriers to gene exchange and help us better understand the speciation process. For instance, are highly differentiated loci spread evenly across the genome, or are they concentrated in few areas following an “islands of speciation” model? Are there certain genes or chromosomes that often show signatures of restricted introgression between hybridizing species? What is the nature of loci unable to cross species barriers? The field crickets Gryllus firmus and Gryllus pennsylvanicus are an excellent system in which to study the genetic architecture of introgressing and non-introgressing loci. These crickets form an extensive, well-characterized hybrid zone with multiple barriers to gene exchange including many prezygotic barriers and a one-way reproductive incompatibility, likely a post-mating, prezygotic barrier [1, 2, 3, 4, 5]. The pair of species diverged recently (~200,000 years) and 110 loci have been characterized in two locations in the hybrid zone [5, 6].
Larson et al. [5, 6] showed that most of the loci are able to freely flow between species, but 50 (out of 110) display sharp genomic and geographical clines in both the Pennsylvania and Connecticut hybrid zones. Maroja et al. [7] determined that of these non-introgressing loci, 9 and 2 loci were mapped to small regions of the autosomal linkage groups LG14 and LG2, respectively. The rest of the non-introgressing loci (n = 39 out of 50) were found to be X-linked [7]. However, as the backcross used by Maroja et al. [7] was male-informative (an F1 male backcrossed to a G. pennsylvanicus female), the X-linked loci could not be ordered along the X chromosome. Here we built a new genetic map focusing only on the X-linked loci using a female informative cross (F1 female backcrossed to a G. pennsylvanicus male).
Materials and methods
All steps in the data acquisition and analyses closely followed the protocol from Maroja et al. [7]. We collected pure species crickets from Ithaca, NY (42°24’, -76°31’) (Gryllus pennsylvanicus, GP) and Guilford, CT (41°15’, -72°42’) (Gryllus firmus, GF) in August 2014. These crickets are not an endangered or protected species and thus do not required collection permits. Virgin GP females were crossed to GF males and resulting F1 female hybrids were mated to captive hatched GP males. Eggs were incubated at 4°C for 2 months prior to hatching at room temperature. We preserved two-week-old backcross offspring from the single, most fertile female in 100% ethanol in -20°C.
DNA extraction and library preparation
We used Agencourt DNAdvance Genomic DNA Isolation Kit (Beckman Coulter, Inc.) to extract DNA from 163 backcross nymphs and their parents. A total of four offspring failed to generate enough reads for successful analyses and were eliminated.
We sequenced individuals in three Illumina Miseq lanes (Cornell Life Sciences Core Laboratory Center for first batch of individuals, Genewiz for second and third batch). For each individual, we first performed 5 PCR reactions with 24 primer pairs in each using the Qiagen Multiplex PCR Kit (Qiagen) to amplify a total of 120 SNP markers, the 165 individuals were divided into two Illumina lanes. To each individual we added Illumina barcodes (N501-520/S701-729) via PCR. All barcoded individuals from each batch (lane) were combined in a single mix and cleaned up using Agencourt Ampure XL beads (Beckman Coulter, Inc.). The third Illumina lane was used to get data for loci that failed to generate enough reads in the previous runs (23 loci).
Analyses of sequences
We analyzed sequences using Geneious R9 (Biomatters, https://www.geneious.com). All reads were mapped to a library containing reference sequences of all amplified loci. Consensus sequences were generated for each amplified locus/individual using a threshold of 75%. All consensus sequences were then grouped into a single list, which was then mapped to the reference sequence library with medium sensitivity and up to five iterations. Of the 120 sequenced loci (58 X-linked loci), we were able to generate usable data for nearly all analyzed individuals at 91 loci (44 X-linked loci), the remaining loci did not generate enough reads in many individuals and were thus eliminated.
Construction of linkage map
We analyzed genotype data using JoinMap 4.1 [8]. Linkage groups were determined using logarithm of odds (LOD) calculations with a threshold of 4.0. Multiple regression mappings were done on X chromosome linkage group, yielding similar maps. A total of three remaining X-linked loci did not follow Mendelian expectations and were excluded from these final maps (Chi-square, P<0.01). Data used for mapping and unformatted SNPs can be found in Dryad doi: 10.5061/dryad.h0g0kf0.
Results and discussion
Our map revealed a single linkage group of 92cM representing the 44 X linked loci. This single X linkage group (LG) merged the two separate LG representing the X chromosome in Maroja et al. [7] (S1 Table). The X chromosome was shorter than that reported in Maroja et al. [7], in which the largest X LG was 106cM. The underestimated distance between loci is likely due to the reduced number of loci (we did not include a new ddRAD library) in our analyses leading to a reduced number of 3-point crosses. Of the 41 previously-analyzed X linked loci, 33 were non-introgressing. All of these 33 non-introgressing loci were located within a 50-cM region but 10 of them mapped to a single location (Fig 1). A similar clustering of non-introgressing loci was previously observed on autosome LG14 [7]. Interestingly, some introgressing loci mapped to areas of the X chromosome that were highly enriched for non-introgressing loci. This was an unexpected result because loci in the same position are expected to be under high linkage disequilibrium and thus experience similar rates of introgression or lack thereof.
Fig 1. Map of introgressing, non-introgressing loci on X chromosome.
Red boxes, ticks = non-introgressing loci, blue boxes, ticks = introgressing loci, grey boxes, ticks = not analyzed by Larson et al. (2013, 2014). Height of box stacks indicates number of SNPs mapped to a given location. Distances measured in cM from leftmost locus.
The observed clustering of non-introgressing loci into short regions of LG14 and the X chromosome follows the genomic pattern predicted by the genomic islands of speciation model. This model states that alleles that act as barriers to gene exchange will, via linkage disequilibrium, ensure that genomic regions in close proximity also fail to be exchanged between species. Eventually, mutations will accumulate within these regions and the regions’ boundaries will expand, further reducing gene flow [9] to eventually encompass the entire genome. Critics of this model argue that observed examples of genomic islands could instead be caused by a combination of ancestral variation and selection in areas of the genome with restricted recombination [10, 11], leading to the observation of “islands of differentiation” even in the complete absence of introgression. Either of these explanations could account for the island pattern of non-introgressing loci observed in Gryllus but would imply very different speciation histories. In the first case introgression would have been important in homogenizing genomes and non-introgressing areas would be free to accumulate mutations. In the second case hybridization (while certainly happening) did not play a role in introgression and homogenization, instead selection led to the differentiation of genomes that were initially similar due to common ancestry.
In Maroja et al. [7], the Gryllus X chromosome was found to be enriched both for highly differentiated loci and for non-introgressing loci. This might be due to several factors. First, the SNPs analyzed across the hybrid zone and mapped in this study were derived from a male accessory gland RNAseq library, and the chosen SNPs were those with a relatively strong differentiation between allopatric populations of the two species, near fixation [5, 6, 12]. Thus, if male-specific genes expressed in the accessory gland are overrepresented on the X chromosome, we would likely see a large number of SNPs localizing to the X. Furthermore, the larger differentiation of the X-linked loci could be due to its smaller effective population size (3/4Ne), which is predicted to increase genetic drift and thus accelerate population differentiation [13]. Additionally, male-hemizygous recessive alleles on the X are subjected to more intense selection [14], which may again lead to higher differentiation if differential selection exists between populations.
Similar to our observation that there is a specific region with reduced introgression on the Gryllus X chromosome, the human X chromosome displays large stretches with greatly reduced levels of Neanderthal ancestry [15]. A similar region of reduced introgression is also seen in the X chromosome of house mice [14]. In these two mammalian species, it has been shown that the X chromosome is enriched for loci associated with decreased fertility or male hybrid sterility, suggesting that the X chromosome might often harbor genes responsible for barriers to gene flow or species differentiation [15, 16]. For example, mammalian spermatogenesis is a process known to have elements under positive selection, and in mice the X chromosome is significantly enriched for genes uniquely expressed in spermatogonia, the pre-meiotic germ cells of the testis from which sperm arise [17, 18]. As is the case in mammals, the orthopteran X chromosome seems to be inactive in the male meiotic germline [19, 20, 21, 22]; it is thus possible that the Orthoptera X is not enriched for spermatogenesis genes. However, it is still possible that X-linked accessory gland genes are expressed and, in other Drosophila, accessory gland proteins have been shown to be essential in fertilization, fecundity and female postmating behavior (e.g. [23, 24]). Furthermore it is possible that X-linked gene products accumulate prior to meiosis thus, even if the meiotic X is inactive, its products could still play a role in sperm production (as might be the case with mammals). Collectively, these factors indicate that contributions from both the male-hemizygosity of the X chromosome and the genomic trend for localization of sterility factors to the X may both contribute to the observed enrichment of non-introgressing loci to the X chromosome.
It must be noted that the 50-cM region in which non-introgressing loci are spread on the X chromosome is far larger than the autosomal region with non-introgressing loci (only 7-cM in LG14, [7]). This would be consistent with a larger number of X-linked loci acting as barriers to gene exchange compared to autosomal loci. Alternatively, the large size of this highly differentiated region may simply be due to the higher level of linkage disequilibrium on the X chromosome, which has been associated with the lack of recombination and increased selection pressure of the X chromosome in hemizygous males [25]. Thus, even if only a few loci were unable to cross species barriers, many other loci that do not directly affect fitness of the other species could hitchhike with them due to the lower levels of recombination, a phenomenon that would be less prevalent on autosomes.
It is also possible that one or more chromosomal inversions are behind the clustering of many loci in the X chromosome. This would provide an alternative explanation for the shorter length of our X chromosome map in relation to that in Maroja et al. [7]; the previous map was a result of recombination between the two G. pennsylvanicus chromosomes while our current map is a result of recombination between chromosomes from different species, potentially involving inversions. Chromosomal inversions are known to play a role in population and morph differentiation, such as in "social" chromosome of fire ants [26] and the ruff mating system [27]. Inversions also play a role in speciation and ecotypification, such as in Drosophila pseudoobscura and D. persimilis [28], in Anopheles gambiae [29], and in Gadus morhua [30]. Of particular relevance, a recent study showed that the number of inversion differences between species of passerine birds is associated with range overlap and that in passerines inversions are more likely to be fixed on the Z chromosome than on the average autosome [31]. It is thus possible that one or more inversions on the G. firmus/G. pennsylvanicus X chromosome may have played a role in the reduction of gene exchange in the X chromosome and might potentially have contributed to the further development of barriers to gene exchange between the two species.
Supporting information
Distance is measured in cM from leftmost locus (seq1412) on the X chromosome. Data can be found in Dryad: doi: 10.5061/dryad.h0g0kf0.
(XLSX)
Acknowledgments
We thank late Dr. Richard Harrison for helpful discussions and advice and Steve Bogdanowicz for help with the nextera illumina sequencing. This work was partially funded by a NSF grant 1650887 to L. S. Maroja and by Williams College.
Data Availability
Data has been submitted to DRYAD doi:10.5061/dryad.h0g0kf0.
Funding Statement
This work was funded by grant 1650887 from the National Science Foundation (US) [https://www.nsf.gov] and Williams College to LM. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Harrison R. G., 1985. Barriers to gene exchange between closely related cricket species. ii. life cycle variation and temporal isolation. Evolution 39: 244–259. 10.1111/j.1558-5646.1985.tb05664.x [DOI] [PubMed] [Google Scholar]
- 2.Maroja L. S., Andrés J. A., and Harrison R. G., 2009. Genealogical discordance and patterns of introgression and selection across a cricket hybrid zone. Evolution 63: 2999–3015. 10.1111/j.1558-5646.2009.00767.x [DOI] [PubMed] [Google Scholar]
- 3.Maroja L. S., McKenzie Z. M., Hart E., Jing J., Larson E. L., et al. , 2014. Barriers to gene exchange in hybridizing field crickets: the role of male courtship effort and cuticular hydrocarbons. BMC Evolutionary Biology 14: 65 10.1186/1471-2148-14-65 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Larson E. L., Hume G. L., Andrés J. A., and Harrison R. G., 2012. Post-mating prezygotic barriers to gene exchange between hybridizing field crickets. Journal of Evolutionary Biology 25: 174–186. 10.1111/j.1420-9101.2011.02415.x [DOI] [PubMed] [Google Scholar]
- 5.Larson E. L., Andrés J. A., Bogdanowicz S. M., and Harrison R. G., 2013. Differential introgression in a mosaic hybrid zone reveals candidate barrier genes. Evolution 67: 3653–3661. 10.1111/evo.12205 [DOI] [PubMed] [Google Scholar]
- 6.Larson E. L., White T. A., Ross C. L., and Harrison R. G., 2014. Gene flow and the maintenance of species boundaries. Molecular Ecology 23: 1668–1678. [DOI] [PubMed] [Google Scholar]
- 7.Maroja L. S., Larson E. L., Bogdanowicz S. M., and Harrison R. G., 2015. Genes with restricted introgression in a field cricket (Gryllus firmus/Gryllus pennsylvanicus) hybrid zone are concentrated on the x chromosome and a single autosome. G3 5: 2219–2227. 10.1534/g3.115.021246 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Van Ooijen, J., 2006 JoinMap 4. Software for the calculation of genetic linkage maps in experimental populations. Wageningen: Kyazma BV.
- 9.Noor M. A. F. and Bennett S. M., 2009a. Divergent selection and heterogeneous genomic divergence. Molecular Ecology 18: 375–402. 10.1111/j.1365-294X.2008.03946.x [DOI] [PubMed] [Google Scholar]
- 10.Noor M. A. F. and Bennett S. M., 2009b. Islands of speciation or mirages in the desert? examining the role of restricted recombination in maintaining species. Heredity 103: 439–444. 10.1038/hdy.2009.151 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cruickshank T. E. and Hahn M. W., 2014. Reanalysis suggests that genomic islands of speciation are due to reduced diversity, not reduced gene flow. Molecular Ecology 23: 3133–3157. 10.1111/mec.12796 [DOI] [PubMed] [Google Scholar]
- 12.Andrés J. A., Larson E. L., Bogdanowicz S. M., and Harrison R. G., 2013. Patterns of transcriptome divergence in the male accessory gland of two closely related species of field crickets. Genetics 193: 501–513. 10.1534/genetics.112.142299 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Charlesworth B., 2009. Effective population size and patterns of molecular evolution and variation. Nature Reviews Genetics 10: 195–205. 10.1038/nrg2526 [DOI] [PubMed] [Google Scholar]
- 14.Payseur B. A., Krenz J. G., and Nachman M. W., 2004. Differential patterns of introgression across the X chromosome in a hybrid zone between two species of house mice. Evolution 58: 2064–2078. [DOI] [PubMed] [Google Scholar]
- 15.Sankararaman S., Mallick S., Dannemann M., Prüfer K., Kelso J., et al. , 2014. Neanderthal ancestry in present-day humans. Nature 507: 354–357. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Good J. M., Dean M. D., and Nachman M. W., 2008. A complex genetic basis to X-linked hybrid male sterility between two species of house mice. Genetics 179: 2213–2228. 10.1534/genetics.107.085340 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wang P. J., McCarrey J. R., Yang F., and Page D. C., 2001. An abundance of X-linked genes expressed in spermatogonia. Nature Genetics 27: 422–426. 10.1038/86927 [DOI] [PubMed] [Google Scholar]
- 18.Torgerson D. G., Kulathinal R. J., and Singh R. S., 2002. Mammalian sperm proteins are rapidly evolving: Evidence of positive selection in functionally diverse genes. Molecular Biology and Evolution 19: 1973–1980. 10.1093/oxfordjournals.molbev.a004021 [DOI] [PubMed] [Google Scholar]
- 19.Jablonka E. and Lamb M. J., 1988. Meiotic Pairing Constraints and the Activity of Sex Chromosomes. Journal of Theoretical Biology 133: 23–36. [DOI] [PubMed] [Google Scholar]
- 20.Viera A., Santos J. L., Page J., Parra M. T., Calvente A., Cifuentes M., Gomez R., Lira R., Suja J. A., and Rufas J. S. 2004. DNA double-strand breaks, recombination and synapsis: the timing of meiosis differs in grasshoppers and flies. EMBO Rep 5:385–391. 10.1038/sj.embor.7400112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Cabrero J., Teruel M., Carmona F. D., and Camacho J. P. 2007a. Histone H2AX phosphorylation is associated with most meiotic events in grasshopper. Cytogenetic and Genome Research 116:311–315. 10.1159/000100416 [DOI] [PubMed] [Google Scholar]
- 22.Cabrero J., Teruel M., Carmona F. D., Jimenez R., and Camacho J. P. 2007b. Histone H3 lysine 9 acetylation pattern suggests that X and B chromosomes are silenced during entire male meiosis in a grasshopper. Cytogenetic and Genome Research 119:135–142. 10.1159/000109630 [DOI] [PubMed] [Google Scholar]
- 23.Ravi Ram K., Ji S., and Wolfner M. F. 2005. Fates and targets of male accessory gland proteins in mated female Drosophila melanogaster. Insect Biochem Mol Biol 35:1059–1071. 10.1016/j.ibmb.2005.05.001 [DOI] [PubMed] [Google Scholar]
- 24.Sitnik J. L., Gligorov D., Maeda R. K., Karch F., and Wolfner M. F. 2016. The Female Post-Mating Response Requires Genes Expressed in the Secondary Cells of the Male Accessory Gland in Drosophila melanogaster. Genetics 202:1029–1041. 10.1534/genetics.115.181644 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Tapper W. A., Collins A., Gibson J., Maniatis N., Ennis S., et al. , 2005. A map of the human genome in linkage disequilibrium units. Proc. Natl. Acad. Sci. USA 102: 11835–11839. 10.1073/pnas.0505262102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wang J., Wurm Y., Mipitwattanaphon M., Riba-Grognuz O., Huang Y.C., et al. , 2013. A Y-like social chromosome causes alternative colony organization in fire ants. Nature 493: 664–668. 10.1038/nature11832 [DOI] [PubMed] [Google Scholar]
- 27.Lamichhaney S., Fan G., Widemo F., Gunnarsson U., Thalmann D. S., et al. , 2016. Structural genomic changes underlie alternative reproductive strategies in the ruff (Philomachus pugnax). Nature Genetics 48: 84–88. 10.1038/ng.3430 [DOI] [PubMed] [Google Scholar]
- 28.Noor M. A. F., Grams K. L., Bertucci L. A., Almendarez Y., Reiland J., et al. , 2001. The genetics of reproductive isolation and the potential for gene exchange between Drosophila psuedoobscura and D. persimilis via backcross hybrid males. Evolution 55: 512–521. [DOI] [PubMed] [Google Scholar]
- 29.Manoukis N. C., Powell J. R., Touré M. B., Sacko A., Edillo F. E., et al. , 2008. A test of the chromosomal theory of ecotypic speciation in Anopheles gambiae. Proc. Natl. Acad. Sci. USA 105: 2940–2945. 10.1073/pnas.0709806105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Berg P. R., Star B., Pampoulie C., Bradbury I. R., Bentzen, et al. , 2017. Trans-oceanic genomic divergence of Atlantic cod ecotypes is associated with large inversions. Heredity (Edinb). 10.1038/hdy.2017.54 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hooper D. M. and Price T. D., 2017. Chromosomal inversion differences correlate with range overlap in passerine birds. Nature Ecology & Evolution 1: 1526–1534. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Distance is measured in cM from leftmost locus (seq1412) on the X chromosome. Data can be found in Dryad: doi: 10.5061/dryad.h0g0kf0.
(XLSX)
Data Availability Statement
Data has been submitted to DRYAD doi:10.5061/dryad.h0g0kf0.

