Emergence and spread of the barley net blotch pathogen coincided with crop domestication and cultivation history

Demetris Taliadoros; Alice Feurtey; Nathan Wyatt; Benoit Barrès; Pierre Gladieux; Timothy L Friesen; Eva H Stukenbrock

doi:10.1371/journal.pgen.1010884

. 2024 Jan 29;20(1):e1010884. doi: 10.1371/journal.pgen.1010884

Emergence and spread of the barley net blotch pathogen coincided with crop domestication and cultivation history

Demetris Taliadoros ^1,², Alice Feurtey ^1,^3,⁴, Nathan Wyatt ^5,⁶, Benoit Barrès ⁷, Pierre Gladieux ⁸, Timothy L Friesen ⁵, Eva H Stukenbrock ^1,^2,^*

Editor: Gregory P Copenhaver⁹

PMCID: PMC10852282 PMID: 38285729

Abstract

Fungal pathogens cause devastating disease in crops. Understanding the evolutionary origin of pathogens is essential to the prediction of future disease emergence and the potential of pathogens to disperse. The fungus Pyrenophora teres f. teres causes net form net blotch (NFNB), an economically significant disease of barley. In this study, we have used 104 P. teres f. teres genomes from four continents to explore the population structure and demographic history of the fungal pathogen. We showed that P. teres f. teres is structured into populations that tend to be geographically restricted to different regions. Using Multiple Sequentially Markovian Coalescent and machine learning approaches we demonstrated that the demographic history of the pathogen correlates with the history of barley, highlighting the importance of human migration and trade in spreading the pathogen. Exploring signatures of natural selection, we identified several population-specific selective sweeps that colocalized with genomic regions enriched in putative virulence genes, and loci previously identified as determinants of virulence specificities by quantitative trait locus analyses. This reflects rapid adaptation to local hosts and environmental conditions of P. teres f. teres as it spread with barley. Our research highlights how human activities can contribute to the spread of pathogens that significantly impact the productivity of field crops.

Author summary

Population genetic and genomics studies of several crop pathogens have revealed that human activities, such us domestication, trade, and migration have played a pivotal role in the emergence and spread of plant diseases. In this study, we employed cutting-edge genetic analysis techniques and machine learning tools to shed light on the population structure and historical dispersal patterns of a major fungal pathogen of barley, Pyrenophora teres f. teres. We found that barley domestication during the Neolithic era potentially gave rise to the pathogen which has since co-evolved with barley to become one of the most devastating barley diseases we face today. In addition, we identified a large number of genomic regions evolving under strong positive selection that were specific to different populations. This finding suggests that populations are evolving fast, becoming well adapted to their local host availability and environmental conditions.

Introduction

Fungi cause devastating diseases in crop plants and can be dispersed across continents by agricultural trade [1,2]. Understanding the evolutionary history of fungal pathogens and the mechanisms underlying their emergence and spread is essential in preventing future epidemics in agroecosystems. Notably, information on historic and current evolutionary trajectories can be key in the development of regulation for re-engineer crops and agroecosystem to improve epidemiological surveillance and prevent potential outbreaks [3].

Pyrenophora teres f. teres is a widespread fungal pathogen of barley causing the disease “net form net blotch” (NFNB). Infection of susceptible barley leaves occurs by the production of specialized infection structures, appressoria, whereby the fungus penetrates the cuticle and cell wall of epidermal cells. The characteristic net blotch symptoms arise through necrosis which is induced rapidly after initial pathogen invasion of the host leaf [4]. Net form net blotch disease occurs in all barley-producing regions of the world, and has been reported in several African countries [5–7], West and East Asia [8,9], Europe [10], North and South America [11,12], and Australia [13]. The pathogen has been known to humans for centuries, initially described as Helminthosporium teres (Sacc.), however insights into the population biology and demographic history of P. teres f. teres are scarce.

Barley was domesticated in the Fertile Crescent approximately 10,000 years ago and was later introduced to North Africa and Eurasia by Neolithic farmers [14]. North Africa has been proposed as a center of diversity of wild barley [15–18]. In the past few centuries, barley has further been dispersed with European migrants to the Americas, Australia, and South Africa [19]. Altogether, the dispersal history of barley is associated with human activities and reflects the spread of cereal cultivation through historical and modern trading routes.

Plant domestication has been associated with the emergence of new fungal pathogens [20]. Population genetic and evolutionary studies have been used to track down the origin and dispersal history of important crop pathogens. The fungal wheat pathogen Zymoseptoria tritici emerged at the onset of wheat domestication and was dispersed with wheat farming during the Neolithic and much later with European migrants [21,22]. Likewise, the center of origin of the maize infecting smut fungus Ustilago maydis likely also coincide with the center of maize domestication in Central and South America [23]. Several other examples underline the importance of domestication and agricultural trade in shaping the evolution and dispersal of crop pathogens [24–26]. In modern times, breeding of new crop species has also driven the rapid evolution of new pathogen species, such as mildew pathogens affecting the hybrid crop triticale [27].

The dispersal of pathogens across continents may be accompanied by local adaptation to distinct environmental conditions and/or to specific management practices such as local crop varieties or, in more recent times, fungicides [22,28]. Signatures of adaptation can be identified in population genomic data as “selective sweeps”, which are genomic regions with low genetic variation and elevated linkage disequilibrium [29,30]. Several statistical methods have been developed to distinguish signatures of selective sweeps from other scenarios that can influence patterns of variation along genomes, such as demography, recombination rate variation, and population structure [30]. Prime candidates for signatures of strong and recent positive selection in plant pathogens are genes that encode effector proteins [31–34], which are small proteins secreted by pathogens to manipulate their host’s physiology. Effector proteins play determining roles in the suppression of plant immune responses and have been a major focus in molecular plant pathology research [35]. Effector genes can be predicted from genome sequences as they typically encode a signal peptide targeting them for secretion into the plant apoplast or translocation into the host cytoplasm. Moreover, effectors are typically cysteine-rich and specifically expressed during host invasion [36]. Several effector genes have been identified by quantitative trait locus analysis (QTL) or genome-wide-association studies (GWAS) underlining the importance of genome data in the discovery of virulence mechanisms [37].

In the present study, we analyzed the haploid genomes of 104 P. teres f. teres isolates derived from different barley fields worldwide. Based on single nucleotide polymorphisms (SNPs), we characterized the population structure and inferred the demographic history of the pathogen. We specifically investigated patterns of early lineage divergence using different methods for demographic inference. Our analyses provide strong evidence for a recent origin and dispersal of P. teres f. teres, likely coinciding with the domestication and dispersal of barley by Neolithic farmers. We, moreover, investigated signatures of recent natural selection and found an overlap between signatures of selective sweeps and putative virulence factors previously identified by quantitative trait locus (QTL) analyses [38,39]. Our study reveals the recent emergence of an important crop pathogen along the domestication and dispersal of its host and underlines the impact of human activities and agriculture on the evolution and spread of new diseases.

Results

Generation of a population genomic dataset of P. teres f. teres

To study the geographical population genetic structure of P. teres f. teres and to infer the recent history of this emerging barley pathogen, we generated a population genomic dataset comprising sequence data of 104 isolates from cultivated barley in six countries across four continents (Africa, America, Central Asia, and Europe) (Table A in S1 Table). The genomes were sequenced with Illumina technology and sequencing reads were mapped to the reference genome of P. teres f. teres [40] to identify SNPs. The average read coverage of genomes was 21X, with a minimum coverage of 8.3X and a maximum of 68.8X. In total, we identified 1,092,635 high-quality SNPs among the 104 isolates. Further summary statistics related to the read mapping and variant calling are summarized in Table B in S1 Table.

Phylogenetic relationship of Pyrenophora species from barley and other grass hosts

Because barley can be infected by other closely related Pyrenophora species, including P. teres f. maculata, we first reconstructed the phylogenetic relationships between the isolates included in our study and other barley infecting Pyrenophora species to ensure the species identity of our isolates. Our analyses included a set of isolates collected from wild barley exhibiting spot lesions in the Monterey Peninsula (California), which allowed us to compare genetic diversity in P. teres f. teres populations of cultivated and wild barley.

Using sequence information from four gene loci (ITS, LSU, tub2, and tef1-a) [41,42], we found that the Californian population from wild barley represents a separate lineage or species of Pyrenophora that is more closely related to P. graminea than P. teres (Fig 1A). The Californian Pyrenophora population thereby provided us with an ideal outgroup for further analyses of the P. teres f. teres populations.

Fig 1 — A) Inference of the phylogenetic relationship of closely related *Pyrenophora* species, including isolates of different species originating from barley. The tree was built with nucleotide sequence alignments of the ITS, tub2, LSU, and tef1 regions obtained from five *Pyrenophora* species (Maximum likelihood inference, loglikelihood: -12,351.458). Numbers reflect maximum likelihood and maximum parsimony bootstrap, respectively. *Alternaria alternata* was defined as root of the tree. B) Nucleotide diversity of P. *teres* f. *teres* populations in each geographic region. A Kruskal-Wallis test with post-hoc pairwise Wilcoxon was used to identify significant differences (p < 0.05) between the groups (Table C in S1 Table). C) Linkage disequilibrium decay for each population. D) Percentage of the two mating types occurring at each location. Asterisk indicates significant departure from the 1:1 ratio (chi-squared test, p-value: 0.05).

The Caucasian population of P. teres f. teres exhibits higher nucleotide diversity

The amount and distribution of genetic variation in a population can give insight into its demographic history. We compared the nucleotide diversity, π, among the six geographical P. teres f. teres populations, and found significant differences among populations (Kruskal-Wallis test, p-value < 2.2e-16) (Fig 1B). Our analysis showed that the Caucasus population comprises significantly higher levels of genetic diversity (mean π = 0.0518) than every other population (pairwise Wilcoxon test, p-values shown in Table C in S1 Table). Furthermore, even though the Middle Eastern and North African populations comprise significantly lower genetic diversity (mean π: 0.0477 and 0.0464, respectively) than the Caucasian population, they still have more diversity than the European and the North American population (mean π: 0.031 and 0.0301, respectively) (Fig 1B).

Similar to genetic diversity, we compared values of Tajima´s D reflecting the relationship between the observed and expected allele frequencies (Table D in S1 Table and S1 Fig). In all populations we find a Tajimas D value > 0 reflecting an excess of common alleles. In general, Tajimas D>0 can be indicative of a recent population contraction. We found however that Tajima´s D values were significantly different between P. teres f. teres populations (Kruskal-Wallise test, p-value < 2.2e-16) possibly reflecting different demographic scenarios. The North African population showed a lower Tajima’s D value (D = 0.62) compared to the Middle Eastern, European, and North American populations. But intriguingly, the Caucasian population had a Tajimas D value close to 0 (D = 0.18), reflecting a mutation–drift equilibrium. Jointly, the higher genetic diversity and the Tajimas D value close to 0 could reflect that the Caucasian population is older than the other populations in our dataset.

Varying extent of linkage disequilibrium (LD) among populations can inform about the age of populations, and in the case of fungi with mixed reproductive modes, also provide insights into the frequency of sexual reproduction. More recently founded populations, and populations with lower frequencies of sexual reproduction, will typically exhibit a greater extent of LD compared to older or sexually recombining populations. We found considerably longer linkage blocks in the North American population, for which the LD statistic r² was < 0.25 at 9,28 Kbp (Fig 1C, Table 1). These long LD blocks in the North American population may reflect that the population is more recently founded and/or undergoes a large extent of clonal propagation. The latter is in agreement with a skewed distribution of mating types (Fig 1D).

Table 1. Summary statistics of the genetic clusters of P. teres f. teres.

Cluster name	No of individuals	Isolate origin	π	Θw	Tajima	Dist r2 < 0.25 (Kbp)	Ne (π)	Ne (θw)
Middle East	7	Iran	0,003	0,001	1,02	3,57	2621,053	1087,316
North Africa	23	Morocco	0,006	0,002	0,62	5,01	4840,351	1915,079
Caucasus	18	Azerbaijan (11), Denmark (3), Iran (2), Morocco (2)	0,005	0,002	0,18	4,28	4607,895	1852,807
Cenral and North Europe	20	France(18), Denmark (1), Azerbaijan (1)	0,005	0,002	0,90	3,80	4414,912	1622,167
North America	20	ND	0,004	0,002	0,82	9,28	3674,561	1450,482

Open in a new tab

[1] π: Estimator of genetic diversity based on the pairwise nucleotide differences of a genetic region (Nei and Li, 1979)

[2] Θw: Waterson’s Θ (Waterson, 1975)

[3] Tajima’s D (Tajima, 1989)

[4] Correlation coefficient

[5]Ne (π): π/2μ

[6]Ne (θw): θw/2μ

Pyrenophora teres f. teres has a heterothallic mating system implying that mating only occurs between individuals of opposite mating types, Mat1-1 and Mat1-2 [43]. To further investigate geographic variation in the frequency of sexual reproduction, we examined the mating type ratio, which is expected to be equal to one under random mating. We used the software SPAdes [44] to de novo assemble genomes and thereby validate and compare the frequency of mating type loci. A null hypothesis of 1:1 mating type ratio could not be rejected for the P. teres f. teres populations except the North American population, for which we found a significant departure from the expected 1:1 ratio of mating types (Mat1-1:Mat1-2 = 3.5, Chi-squared test, p = 0.0184) (Fig 1D, Table E in S1 Table). These analyses suggest that that P. teres f. teres is regularly undergoing sexual reproduction throughout most of its range. The skewed mating type frequency in the North American population can reflect a more pronounced contribution of asexual reproduction.

Populations of P. teres f. teres are geographically structured

We characterized the population genetic structure of P. teres f. teres based on complementary methods using genome-wide SNP data. Firstly, we investigated the extent of clustering using a principal component analysis (PCA) (Fig 2A). The PCA mostly separated isolates according to their geographical origin. We further explored population structure by generating a Neighbour-net network with SPLITSTREE v. 4 and by inferring the extent of shared ancestry using an ADMIXTURE analysis [45]. In the ADMIXTURE analysis, the Cross-Validation error used to select the most appropriate number of clusters (K) was minimized at K = 6 (Fig 2B, Table F in S1 Table). The genetic clusters inferred from the ADMIXTURE analysis corresponded to five clusters mostly circumscribed to North America, North Africa, the Middle East, Europe, and Caucasus (Fig 2B), and a cluster of three individuals restricted to the Caucasus referred to as the Caucasus-2 cluster. The European cluster, referred to as Europe+, was also present in the Caucasus (one isolate), and the Caucasus cluster, referred to as Caucasus+ was also present in the Middle East, in North Africa, and in Northern Europe. The ADMIXTURE analysis also revealed 15 individuals with shared ancestry in multiple clusters, suggesting admixture. Most of the individuals showing mixed ancestry (9/15) were derived from the Caucasus, where multiple clusters coexist. Furthermore, three isolates with mixed ancestry were derived from Europe, two from North Africa, and one from the Middle East.

Fig 2 — A) Principal Coordinate Analysis, where shape reflects the origin of the isolate and colour reflects the genetic cluster set with ADMIXTURE at K = 6. B) The program ADMIXTURE was used to compute population structure between the six geographical populations. The most fit number of hypothetical ancestral groups was identified as six based on the cross-validation method (Table F in S1 Table). Here we present patterns of four, five, and six hypothetical ancestral groups. C) Neighbour-Net tree generated from SNP data from the P. *teres* f. *teres* populations. The branch colour reflects the genetic cluster set with ADMIXTURE at K = 6. D) World map shows the distribution and contribution of the genetic clusters identified by ADMIXTURE at K = 6 at the sampling sites. Underlying map based on OpenStreetMap (OpenStreetMap contributors) data (retrieved from https://www.openstreetmap.org/#map=2/19.1/54.7), freely available under Open Data Licence (https://www.openstreetmap.org/copyright).

The Neighbour-net phylogenetic network essentially revealed the same clusters as the ADMIXTURE analysis. All clusters were connected by reticulations indicating homoplasic mutations caused by incomplete lineage sorting or historical gene flow (Fig 2C). The Caucasus-2 cluster was connected to other lineages by a long, non-reticulated branch, consistent with a relatively long history of isolation from other clusters (S2 Fig).

We applied a Mantel test to determine if genetic distance, simply measured as pairwise messma3tes across the genome, is correlated with geographic distance between the isolates [46], and indeed confirm that geography explains some of the variation between clusters as spatial and genetic distances are correlated (S3 Fig).

Phylogenomic analysis suggests an ancient split of the North African P. teres f. teres population and a Caucasian origin of the North American population

The Middle East, Caucasus, and North Africa are the regions that have the longest history of barley cultivation [16]. Domesticated barley was introduced later to Europe and then to America. To test the hypothesis that early dispersal of P. teres f. teres occurred simultaneously with the spread of barley cultivation we inferred the evolutionary relationships between the populations of the pathogen. To this end, we have constructed a population tree, using polymorphism-aware models in IQ-TREE, using the Californian population as root for the P. teres f. teres populations (Fig 3). In this analysis, based on the full complement of polymorphisms, we found two major population splits: one lineage comprising the North African, Middle Eastern, and European population and another lineage comprising the Caucasus and North American populations. Within the former lineage, the branching harbouring the North African population diverged earlier than the branches harbouring the Middle Eastern and European populations. The clustering of Caucasus and North American populations suggested a Caucasian origin of the North American population.

Fig 3 — Phylogenetic tree using polymorphism-aware models (PoMo) (Maximum likelihood inference, loglikelihood: -857128.545) to assess the evolutionary relationship between P. *teres* f. *teres* populations. Branch numbers reflect maximum likelihood bootstrap values. The tree was rooted using the Californian population. The scale bar represents the expected number of substitutions per site.

The origin and dispersal of P. teres f. teres correlates with the early history of barley cultivation

We further investigated the demographic history of P. teres f. teres populations using Approximate Bayesian computations (ABC) with supervised machine learning implemented in the DIYABC Random Forest software (DIYABC-RF) [47]. As the ABC framework requires populations not connected by continuous geneflow, we excluded isolates with shared ancestry in multiple clusters and considered five, non-admixed geographic populations for the analysis. Non-Caucasian and non-European isolates were excluded from the Caucasus+ and European clusters. We compared invasion scenarios in which the origin of each derived population was associated with a demographic bottleneck (see Materials and Methods) [48,49]. For each scenario, we assessed the compatibility of the simulated datasets with the observed data using linear discriminant analysis (LDA), by simultaneously projecting simulated and observed data on the first two LDA axes. The overlap between the simulated datasets and the observed data indicated the compatibility of the simulated scenarios and the observed data (S4 and S5 Figs).

Demographic inference with ABC was performed in three consecutive steps, each step corresponding to a different family of invasion scenarios. The first step considered simple scenarios of the three populations (Caucasus, N. Africa, and Middle East) which we found to be most distant from each other in the split tree analysis (Fig 2), suggesting a more ancient divergence of these populations. The scenario complexity was then gradually increased as we assessed the evolutionary relationships of the additional populations in each consecutive step. For each family of scenarios, LDA confirmed that the chosen conditions were suitable for the random forest analysis. To select the most probable hypothetical scenario from each family, we used a random forest classifier with 1,000 trees. Detailed results of DIY-ABC analyses are provided in Table G in S1 Table and S4–S6 Figs.

Scenarios of family 1: Early divergence of Middle East, Caucasus, North African populations

To elucidate the most ancestral splits, we tested a total of 49 invasion scenarios with different ancestries and branching orders among populations from the Middle East, Caucasus and North Africa, regions where barley was first cultivated. Nine distinct categories of scenarios with similar topologies were considered. The most probable category of scenarios was “group 9” and scenario 45 (posterior probability of 0.778 and 0.401, respectively) (Table H in S1 Table). Scenario 45 modelled an initial divergence of P. teres f. teres populations from the Middle East and the Caucasus, and a subsequent emergence of the North African population from the Middle Eastern population (Fig 4A).

Fig 4 — A) Development of speciation scenarios across three analyses steps and using approximate Bayesian computation and Random Forest analyses implemented in DIY ABC-RF version 1.0. B) The most probable hypothetical evolutionary scenario of the migration routes of P. *teres* f. *teres* across the Middle East, North Africa, Europe, and North America based on the results of three sequential DIY ABC-RF analyses. The parameter “P” indicates the posterior probability of the most probable scenario. In paratheses are shown the probabilities of the different group containing the most probable demographic scenario (see Methods). We considered a classic invasion scenario for the topology building where each derived population passes through a bottleneck as it gives rise to a new population. Predicted time over the current Ne of the most ancestral population (N_Azb) values inferred with random forest are shown. The 90% CI values for each parameter is provided in parentheses. C) Changes in effective population size (Ne) for all P. *teres* f. *teres* populations were estimated with MSMC2. The axes were scaled with a mutation rate of 4.5 x 10⁻⁷ per site per generation, and one generation per year. The estimated time of barley domestication is indicated with an arrow. D) Relative cross-coalescence rate (RCCR) for all pairs of populations. Five runs of seven randomly selected individuals per population per run were performed. Shown here, is the trend line fit between the five runs. Gray area around the line indicates the 95% confidence interval. The arrow indicates the estimated time of barley domestication.

Scenarios of family 2: Founding of the European population through migration from the Middle East

Having determined the branching order among populations from areas of more ancient barley cultivation, we proceeded to examine the more recent history of P. teres f. teres populations through a second family of scenarios. We compared 17 evolutionary scenarios modeling the relationships among the Central and Northern European populations, and other populations. In line with the population tree presented above, the most probable scenario category was “group 2” and scenario 2 (posterior probability of 0.701 and 0.412, respectively), suggesting that the Central and North European population derived from the Middle Eastern population (Fig 4A).

Scenarios of family 3: Origin of the North American population

The third family of scenarios modeled the origin of the North American population. According to Scenario 9 from group 5, which had the highest posterior probabilities (0.628 and 0.990, respectively), the North American population was established through admixture between the Caucasus population and an unknown "ghost" population (Fig 4A). This analysis also revealed that the earliest divergence was between the Caucasus and ghost population, which suggest that the Caucasus population is the oldest, and that the Middle East population emerged following admixture between the Caucasus and ghost populations.

Parameter inference analysis with DIY-ABC

To estimate the demographic parameters of the scenarios with highest posterior probabilities, we used a random forest with 1000 trees. Time estimates were estimated as the ratio of time over the current effective population size of the predicted most ancestral population, Caucasus population (N_Az) [47]. Our maximum posterior probability estimate of the divergence time between the Caucasus and unsampled ghost populations was 0.5640 generations/N_Az (credibility interval [CI] 0.3090–0.6200). The North African population split from the Caucasus population 0.4862 generations/ N_Az (CI 0.2127–0.6043), and the Middle East population emerged by the admixture of the Caucasus population and the ghost population around 0.2137 generations/ N_Az (CI 0.1333–0.5474). The Central and Northern European population arose from the Middle Eastern population 0.0982 generations/ N_Az (CI 0.0334–0.1903), and the North American population emerged 0.05193 generations/ N_Az (CI 0.0053–0.1296) ago (Table 2, Fig 4B).

Table 2. Results for DIYABC-RF estimation of population divergence times normalized over the most ancestral sampled population N5 under scenario 9 detailed in Fig 4.

Parameter	Prediction^*	90% CI	Global (prior) NMAE	Local (posterior) NMAE
t1/N5	0,052	0,005–0,13	0,437	0,316
t2/N5	0,098	0,033–0,19	0,440	0,311
t3/N5	0,214	0,133–0,547	0,320	0,275
t4/N5	0,486	0,213–0,604	0,245	0,210
t5/N5	0,564	0,310–0,62	0,200	0,190

Open in a new tab

*RF analysis included 20,000 simulated data sets and the number of trees was set to 1000. Global (prior) and local (posterior) error rates were estimated using out-of-bag estimators from a sample of 10,000 data randomly chosen in a training set. CI, credibility interval; NMAE, normalized mean absolute error.

A severe demographic bottleneck in the history of the crop pathogen coincided with the domestication of barley

To further investigate population size variation through time and population split, we applied a Multiple Sequentially Markovian Coalescent (MSMC2) approach [50]. Inferences of population size changes were performed using all available individuals (Fig 4C). Furthermore, we estimated the divergence time between populations using five independent runs of 14 randomly selected isolates per population pair (seven isolates per population) (Fig 4D). We found that the effective population size of the Caucasus population was initially the largest, but that the population subsequently experienced a demographic bottleneck. In agreement with the ABC analyses, we also found evidence for an early divergence between Caucasus and the North African population. Considering a mutation rate (μ) of 5.7 x 10⁻⁷ per base pair [51] and assuming here on average one sexual generation per year [4], these events coincide with the domestication of barley and subsequent migration waves of neolithic farmers to Northwest Africa [52] about 7,000 to 14,500 years ago (Table I in S1 Table).

We computed the Relative Cross Coalescence Rate (RCCR) for pairwise combinations of populations to estimate splitting times (Fig 4D). These analyses provided further support for the close evolutionary relationship between Central Europe and Middle East populations, as also observed in the PCA and NeighborNet tree analyses. Furthermore, the RCCR of the North America and Caucasus populations decays slower, which indicates extensive amounts of geneflow after divergence of the populations. The geneflow and late split between North America and Caucasus populations interfered with the RCCR, is in line with the emergence of the North American population from the Caucasian population as inferred by the ABC analysis.

We want to underline that our inference of actual coalescence times is based on assumptions that we considered reasonable for some unknown parameters. For example, the number of sexual cycles of P. teres f. teres is not known, and might even have varied throughout evolutionary times and host shift events. Nevertheless, the relative estimates of population divergence with two independent methods suggest that the Caucasus has been the center of origin of P. teres f. teres. Moreover, both methods applied here, provide evidence for an early divergence of a pathogen lineage in North Africa. In summary, our inference of pathogen population history suggests a parallel dispersal of the pathogen alongside its host and emphasizes the fundamental importance of early agriculture on pathogen evolution.

Recent positive selection has shaped genomic regions encoding putative virulence-related genes

To identify genomic regions that may have experienced selective sweeps during the spread of P. teres f. teres, we used three methods (SweeD, OmegaPlus, and RAisD) which combine information from the site-frequency-spectrum (SFS) and patterns of LD and π along the genome [53–55]. We conducted analyses on each population separately to identify population-specific selective sweeps. These selective sweeps may indicate local adaptation in the pathogen populations.

Demography can greatly impact the distribution of genetic variants along the genome and thereby bias inference of selective sweeps [56]. We therefore combined the selective sweep analyses with simulations of genetic variation under different demographic scenarios (see Material and Methods). Multiple regions exhibiting signatures of selective sweeps were identified with the three methods (Table J–O in S1 Table). We compared and combined selective sweeps maps of the three methods to get a final list of candidate regions exhibiting signatures of recent positive selection.

Our final list of sweeps includes a total of 109 regions across all P. teres f. teres populations, with 20 to 27 selective sweeps per population (Fig 5A, D, Table 3). We identified 42 putative effector genes [40] colocalizing with the 109 selective sweep regions, suggesting that genes encoding virulence related traits such as effectors have been most prone to experience recent positive selection. Some selective sweep regions overlapped while others were unique to distinct populations, possibly representing adaptation to different resistance genes in barley or other local environmental conditions. For example, we found one selective sweep region on chromosome 6 (position 2,856,396–2,964,731) that is present in all P. teres f. teres populations, except the Middle Eastern population. This region includes a gene encoding a predicted effector, which represents a candidate for future functional studies (Fig 5B, Table 3).

Table 3. Summary of total and unique selective sweep regions, as well as, genes and effectors located in the regions per populations. The number of sweep regions, total genes, and effectors only identified in one population are characterized as unique.

Population	Total no of sweep regions	Total no of genes	Total number of predicted effectors	Population-specific sweep regions	Population-specific genes	Population-specific effectors	Selective sweep—QTL overlap	QTLs name	QTL Reference
Middle East	27	253	14	11	198	13	1	VK2	40
N. America	22	107	9	8	75	8	3	VK1, VK2, AvrHar	40, 54
N. Africa	29	91	5	19	67	3	2	VK2, PttTif1	40, 39
C.&N. Europe	34	180	5	21	136	5	1	PttPin1	39
Caucasus	20	101	7	11	69	6	2	VR1, PttCell1	40, 39
California (P. sp)	24	155	5	12	108	4	3	VK1, PttCell1, Pttcell2	40, 39

Open in a new tab

We performed a permutation test to assess if effector genes were significantly enriched in selective sweep regions. To this end, we performed a permutation test to assess the relative abundance of predicted effector genes in the selective sweep regions compared to the rest of the genome. Indeed, we found that the abundance of effector genes in selective sweep regions is higher compared to randomly sampled regions along the genome (Fig 5C).

We furthermore explored previously generated lists of candidate virulence determinants in P. teres f. teres. Previous studies have used quantitative trait locus (QTL) analyses to identify determinants of virulence on different barley cultivars [38,39]. We found that 14 putative virulence related genes (QTL candidates) co-localized with selective sweep regions [38,39]. (Fig 5E, Table 3). The QTL candidate region VK2 on chromosome 6 [39] overlapped with a selective sweep region, which was found in each of the populations in the Middle East, North America, and North Africa. The QTL candidate regions VK1 [39] and AvrHar [57] on chromosomes 3 and 5, respectively, co-localized with selective sweep regions predicted in the North American population. Two putative effectors are in the VK1 region on chromosome 3 (Table 3).

We predict that the selective sweeps in P. teres f. teres reflect recent adaptation to barley and local agricultural environments. We further addressed divergent adaptation in Pyrenophora pathogens on different hosts by comparing selective sweep maps of P. teres f. teres and the Pyrenophora population obtained from wild barley in California. To this end, we considered the windows that showed a composite likelihood ratio (CLR), ω, and μ higher than 99,95% for significant outliers.

We identified 24 selective sweeps along the genome of the Californian population, including five genes predicted to encode effector genes. Approximately half of the selective sweeps predicted in the Californian population were shared with the domesticated barley-infecting populations suggesting that the same suite of genes is important for virulence on wild and cultivated hosts. This hypothesis is further supported by the fact that some P. teres f. teres QTL candidates [39] overlap with selective sweep regions in the wild barley pathogen (Table 3). Hereby, also the QTL locus VK1 co-localized with selective sweeps in the wild-barley pathogen.

In summary, the selective sweep analyses identify multiple loci in P. teres f. teres that have experienced recent positive selection. Functional analyses of candidate genes in these regions may shed light on the adaptation of the pathogen to different barley cultivars.

Discussion

Understanding the evolutionary origin of crop pathogens is crucial to predict future epidemics. In this study, we addressed the history of the globally occurring pathogen of barley, P. teres f. teres. We used a global population sample and extensive genome sequencing to assess the population structure and demographic history of the crop pathogen. Our analyses were based on the hypothesis that P. teres f. teres could have emerged and co-evolved with barley during early crop domestication. Extensive sampling of the pathogen in geographical regions representing the most ancestral history of barley domestication and cultivation [58] allowed us to dissect the early history of P. teres f. teres. We also characterized the population structure and demography from present-day barley-producing countries, including France, Denmark and the USA. Our detailed population genomic analyses provide evidence for a scenario where P. teres f. teres emerged in the Middle East at the onset of barley domestication, and subsequently dispersed with Neolithic farmers to North Africa and Europe.

We compared measures of nucleotide diversity among the different P. teres f. teres populations and observed higher diversity in the Caucasian and North African populations. Notably the North American population represented an overall low nucleotide diversity indicating either a more recent origin of the population or a recent bottleneck.

Next, we applied two independent methods to infer the population histories of P. teres f. teres. Both methods provide evidence for a scenario where the most ancestral populations of P. teres f. teres have originated in the Fertile Crescent region. We note that our inferences of population histories based on the Middle East, Caucasus, and North Africa populations, may have been affected by sampling bias as only ten isolates were available from the Middle East, in contrast to 21 and 27 from Caucasus and North Africa, respectively. For the parameter inference of ABC-RF, we have calculated time over the effective population size to reduce the error of parameter inference, as suggested in [47]. We computed current effective populations sizes based on the SNP data to rescale parameters with a mutation rate of 5.7 x 10⁻⁷ [25] and one generation per year (see Materials and methods). Using these values, we estimated that the Caucasian population was founded around 13,000 years ago (CI: 7,267–14,546).

The history of P. teres f. teres not only parallels the evolution of barley. It also parallels the history of a small number of other prominent crop pathogens which have emerged and co-evolved with their host during domestication. Other important pathogens that emerged with their host during domestication include the wheat pathogenic fungus Zymoseptoria tritici causing the disease septoria tritici blotch [59], the rice blast fungus Magnaporthe oryzae [60], and the corn smut fungus Ustilago maydis [61]. In these studies, coalescence analyses were used to infer the divergence time between wild and crop-infecting populations of the pathogen and to infer major demographic events, such as bottlenecks that coincide with the domestication of the host. The emergence of P. teres f. teres was also associated with a considerable population bottleneck probably reflecting strong selection on pathogen individuals with the right gene combination necessary to invade a new host niche.

Interestingly, we find evidence for the early emergence of a distinct P. teres f. teres population in North Africa. Using the above-mentioned scaling the divergence between pathogen populations in North Africa and Caucasus occurred around 11,400 years ago (CI: 4,991–14,181). This scenario is in agreement with the introduction of barley into North Africa by neolithic farmers and the early development of distinct barley varieties [62].

Archaeological remains suggest that barley was cultivated in Central Europe from approximately 6,000 [63] to 4,800 years ago [64]. We find evidence that the P. teres f. teres pathogen accompanied the introduction of barley as our population genomic data suggest the emergence of the European pathogen population occurred around 2,300 years ago (CI: 785–4,465). More recently, the North American population has split from all other populations. Our analyses suggest the emergence of the North American population approximately 1,200 years ago (CI: 125–3,044), which conflicts with the much later introduction of barley to North America by European migrants. We speculate that this inconsistency reflects the uncertainty of our parameter scaling and note that the confidence interval of our estimates still concurs with a European-based introduction to North America a few centuries ago [19]. Moreover, our data strongly corroborate a late divergence between the North American and Caucasian populations. While we included a large sample of isolates, covering a broad geographic region, it is still possible that some pathogen variation was not collected in Europe. Consequently, an intriguing hypothesis emerges: what we recognize as the North American population may have originated from a distinct population in Europe which was not sampled here.

In conclusion, the demographic history of P. teres f. teres recovered using ABC and MSMC2 reflects the introduction of its host, barley, in different locations, highlighting the significant role of historic trading in the dispersal of crop pathogens (Fig 6).

Fig 6 — Different colours represent the approximate period when the proposed events occurred. (1) The Fertile Crescent is the most plausible center of origin of the pathogen. (2) Ancestral divergence of the North African population is consistent with an early migration of the pathogen to North Africa, possibly with early barley cultivation by neolithic farmers in North Africa. (3) More recent populations have emerged in Europe and (4) North America. Underlying map based on OpenStreetMap (OpenStreetMap contributors) data (retrieved from https://www.openstreetmap.org/#map=2/19.1/54.7), freely available under Open Data Licence (https://www.openstreetmap.org/copyright).

We used the P. teres f. teres SNP data to compute overall nucleotide diversity in the pathogen populations. Interestingly, P. teres f. teres showed higher nucleotide diversity (π) compared to other prominent pathogens such as the wheat pathogen Zymoseptoria tritici [65], the wheat powdery mildew pathogen Blumeria graminis f.sp. tritici [25], and the rice blast fungus Magnaporthe oryzae [66]. Genetic variation is instrumental in rapid adaptation of pathogens and the high nucleotide diversity in P. teres f. teres may be an important factor in the successful spread of the pathogen. Differences between species may be explained by different extent of sexual recombination and gene flow and can also be highly impacted by past population bottlenecks. Based on our LD analyses and the distribution of mating type frequencies, we find evidence for frequently occurring sexual recombination in P. teres f. teres [67]. An exception is the North American population that exhibits a higher extent of clonality. The population genetic structure of the North American population is in agreement with a recent founder event of the population, but may also reflect the large-scale monocropping systems in North America that may favour clonal spread of the pathogen over large spatial scales.

We identify six genetic clusters that largely correlate with geographic origin. However, some isolates appear to co-exist and pertain to distinct clusters with little evidence of introgression. This observation may indicate local adaptation and possibly some limits to gene flow, for example between isolates adapted to distinct barley cultivars.

To explore signatures of local adaptation, we used different methods to identify selective sweeps. In total we identify 109 selective sweep regions. We asked how many, and which selective sweeps would be common among the barley-infecting populations; considering that these loci could represent important host specificity loci. Intriguingly, most selective sweeps are population-specific or shared among a small number of populations. Only 23 regions are shared among all P. teres f. teres populations. Several selective sweeps are shared with the Californian population of P. teres occurring on wild grasses. Possibly, genes in these “conserved” sweep regions represent fundamentally important virulence traits.

For the population-specific sweeps, we speculate that the pathogen undergoes strong selection from its local environment, including barley cultivars utilized in different countries. We find that selective sweeps are enriched with predicted effector genes in our regions, which may highlight how host genetics is a main driver of rapid evolution in this pathogen. In addition, as many as 15 out of the 109 selective sweep regions were previously identified as candidate loci in QTL studies aiming to identify virulence determinants in P. teres f. teres [38,39].

Conclusion

A growing body of evidence suggests that human activities play a major role in the emergence and dispersal of plant pathogens [25,61,68]. Here, we employed population genetic approaches with statistical and simulation tools to unravel the population structure and dispersal history of a major fungal barley pathogen. Our results sediment the conclusion that crop domestication in the Neolithic was accompanied by the emergence of several new plant pathogens; pathogens which co-evolved and spread with their hosts and presently represent some of the most important crop diseases we have. Wild relatives of domesticated plants represent important resources of genetic resistances. Likewise, they may be hosts to “wild populations” of pathogens. Exploring genetic variation in natural plant-pathogen systems holds a large potential for the discovery of new crop resistances as well as pathogen virulence determinants.

Material and methods

Genome data

124 P. teres whole genomes were sequenced using Illumina technology (PRJNA923641) [69]. The sequenced isolates were sampled from barley fields on four different continents. Twenty isolates were obtained from two North Dakota State University experimental fields in Fargo and Langdon, North Dakota, USA. 27 isolates were collected from six locations in Morocco, North Africa. Ten strains were isolated in Iran, and 21 isolates were sampled from five locations in Azerbaijan, South Caucasus. Twenty-one isolates were sampled in Central and Northern Europe and five in Denmark, Europe. Finally, we included a collection of isolates from a wild barley species collected in California, USA. The Californian isolates were included with the purpose of identifying recently diverged genomic features in the barley-infecting populations of P. teres f. teres (Table A in S1 Table).

Read mapping and variant calling

A pipeline was developed to filter and map Illumina reads to a reference genome and extract high-quality single nucleotide polymorphisms (SNPs). In brief, the program Trimomatic version 0.38 [70] was used to filter and trim sequencing adapters, nucleotide bases, based on sequencing quality (PHRED 33), and read length (reads shorter than 30 bp were discarded). Overlapping reads were merged using PEAR version 0.9.11 [71]. Burrows-Wheeler Aligner (BWA) version 0.7.17 [72] and Stampy v. 1.0.20 were used [73] to map individual reads to the reference genome of P. teres f. teres (0–1 P. teres f. teres genome GCA_000166005.1) obtained from the NCBI [40] (Table B in S1 Table). Haplotyping and genotyping procedures were performed with the GATK HaplotypeCaller version 4.2.18 [74], providing a final VCF file with the raw SNP calls.

We next conducted filtering of SNPs based on the following criteria: (1) The call quality divided by the depth of sample reads should be larger than 2, (2) the depth per position should be higher than 8, (3) mapping quality of reads supporting each SNP should be higher than 40, (4) allele-specific rank sum test for mapping qualities of the reference (REF) versus alternative (ALT) reads should be higher than -12.5, (5) allele-specific rank sum test for relative positioning of REF versus ALT allele within reading must be higher than -8. (6) each genome has to have an average read coverage of at least 2. For the application of these filters, GATK VariantFiltration version 4.0.11 was used [74]. After applying these hard-filtering criteria, 1,092,635 SNPs were kept, and we further refer to this dataset as the “full high-quality dataset”.

We note that additional subsets or filtering steps were added for specific analyses. Most clustering analyses assume that the markers used are independent. Therefore, for these analyses, we filtered the full high-quality dataset based on the linkage disequilibrium (LD) decay patterns considering a distance of at least 3,12 Kbp (distance of r²/2 averaged across populations) between SNPs (Table 1). After filtering for LD, a dataset of 465,963 SNPs was retained. We refer to this dataset as the “independent SNP dataset”.

We also generated a dataset of SNPs exclusively located in non-coding, presumably neutrally evolving genome regions. For this, we excluded all SNPs located in predicted gene regions and in 500 bp gene-flanking regions, both upstream and downstream, in a third filtering step based on LD. We obtained the coordinates of the genes from the 0–1 reference annotation file [40]. After this filtering step, 160,472 high-quality independent and presumably neutrally evolving biallelic SNPs were kept. We refer to this as the “neutral dataset, which we used to infer the demographic history of the species.

Population genetic structure

Population genetic structure was inferred using three different approaches: a principal component analysis (PCA), ADMIXTURE version 1.3 [45], and Neighbour-Net analyses [75], each of them based on the “independent SNP dataset.” The PCA was here applied to reveal genetic clustering among the isolates and was created using the R package SNPRelate v. 1.6.4 and visualized with the R package ggplot2 [76]. Population structure was further characterized using a maximum likelihood approach implemented in ADMIXTURE. Ten replicate runs for a range of K-values (1–10) were performed. The best K value was determined using the ADMIXTURE software to estimate the cross-validation error [45] (Table F in S1 Table). Finally, we explored the genetic structure and reticulation patterns among lineages, using the distance-based method for constructing Neighbour-Net networks as implemented in the program Splitstree 4 version 4.15.1 [77]. A mantel test was performed to assess the correlation of geographic and genetic distance using the R package ape v. 5.6–2 [78]

Genetic diversity, neutrality tests and linkage disequilibrium

We further used the “full high-quality dataset” to compute and compare genetic variation among populations. ANGSD v.0.939 [79] was used to estimate the genetic diversity for each population as the nucleotide diversity (π) and the number of segregating sites (Wθ), as well as values of Tajima’s D. Furthermore, to assess the effects of sampling bias on the genetic diversity estimations, we recalculated and visualize π if five independent rans using from two to seven individuals per population (S7 Fig). The tool PopLDdecay [80] was used with default parameters to estimate the linkage disequilibrium (LD) decay for each genetic cluster. VCFtools v. 0.1.17 [81] was used to calculate the fixation index Fst [82] between the populations. A Kruskal-Wallis test with post-hoc pairwise Wilcoxon was used to identify significant differences (p < 0.05).

Mating types

MAT1-1and MAT1-2 mating type sequences were obtained from GenBank (accession no. HM121994 and HM122006, respectively) [83]. Assemblies were created using SPAdes [44] with default parameters for each isolate. Subsequently, the mating type of each assembly was assessed by blasting the mating type sequences against them. To that end, blastn [84] with default parameters was used.

Phylogenetic reconstruction

A Maximum likelihood approach was applied to assess phylogenetic relationships between the population isolated from wild barley in California and six others Pyrenophora species. Sequences of four DNA loci (ITS, LSU, tub2, and tef1-a) were extracted from Alternaria alternata (SRC11RK2F), Pyrenophora teres f. maculata (SAMN15340022), Pyrenophora teres f. teres (SRS084801), Pyrenophora graminea (CBS 336.29), Pyrenophora tritici-repentis (V0001), and Pyrenophora seminiperda (SAMN02981545) from NCBI. Californian isolate CAWB5, showing the highest raw read count among the Californian isolates was selected to represent the group. Subsequently, it was assembled with the software SPAdes [44], following the process described under the “Mating types” section, and the sequences of ITS, LSU, tub2, and tef1-a were extracted. Consensus sequences of the four individual loci were aligned with MAFFT v 7.490 [85] using default parameters, manually adjusted using Unipro Ugene v. 43.0 [86], and concatenated using SeqKit [87]. The concatenated alignment was then subjected to maximum-likelihood (ML) analysis using Iqtree version 2.0.3 [88]. The best-fitting substitution model was chosen based on the Bayesian Information Criterion (BIC) using the ModelFinder algorithm implemented in Iqtree version 2.0.3 [89]. Moreover, 1000 bootstrap replicates were performed to obtain branch support values using the bootstrap approximation option of Iqtree [90]. Further support for the phylogenetic inference was provided by a maximum-parsimony (MP) analysis using MPBoot [91]. Similar to ML, MP analysis was performed, including 1000 bootstrap replicates. Alternaria alternata (GCF_001642055.1) was selected as the outgroup taxon for both ML and MP analyses. The resulting trees were edited in FigTree 1.4.4.

IQ-TREE polymorphism-aware models (PoMo) [92] were used to reconstruct relationships between P. teres f. teres populations. For the preparation of the input file, the FastaVCFtoCount.py script provided with the PoMo software was used. Similar to the previous phylogenetic analysis, the best-fitting substitution model was chosen based on the Bayesian Information Criterion (BIC) using the ModelFinder algorithm implemented in Iqtree version 2.0.3. Again, 1000 bootstrap replicates were performed to obtain branch support values using the bootstrap approximation option of Iqtree. The Californian P. teres population was used as an outgroup in this analysis.

Inference of the demographic history of P. teres f. teres populations

The demographic history of P. teres f. teres populations was inferred using approximate Bayesian computation (ABC) with a supervised machine learning algorithm implemented in DIYABC-RF version 1.0 [47]. Since the ABC framework requires populations without continuous gene flow, the five non-admixed, well-populated clusters revealed by the population structure analyses were used in this analysis. Due to the limited number of individuals in the cluster (only three isolates from Caucasus), this second Caucasus cluster was excluded from the inference. Out of the five remaining clusters, three were entirely consistent with the geographical origin of the populations: North Africa, Middle East, and North America. For the remaining two genetic clusters, the composition of isolates did not reflect on a single geographic location but rather a mixture of isolates from different locations, although these clusters originated primarily from Europe and Azerbaijan, Caucasus (Table 1). Most of the isolates (19/20) of the fourth cluster originated from Europe (France and Denmark). Similarly, the majority (12/18) of the fifth cluster isolates originated from Caucasus. For the inference of the demographic history, we only kept the 19 isolates from Europe, representing the fourth genetic cluster, and the 12 isolates originated from Caucasus, representing the fifth cluster.

Since the records about P. teres f. teres invasion history are scarce, the phylogenetic analyses obtained with PoMo were incorporated as the starting point to construct hypothetical evolutionary scenarios. First, the three populations (North Africa, Middle East, Caucasus) that were further apart from each other on the tree, indicating ancient split and isolation among these, were selected as a starting point. Three sequential DIYABC-RF analyses were performed as follows: For the first analysis, 49 scenarios were tested, describing cases where (1) either of the single population or (2) an admixture event between two populations gave rise to the other populations (S4 Fig).

Considering a wider geographical distribution of P. teres f. teres not covered by our sampling, we also included scenarios that tested an unsampled (referred to as a “ghost”) population as the putative ancestral population. We included scenarios where either a present-day sampled population was derived from a ghost population or the present-day population emerged by admixture from a ghost population with another sampled population. These scenarios were analyzed individually and combined in groups of similar scenarios [47]. For the combined groups, scenarios were joined into thirteen groups based on the population that was most ancestral (Table H in S1 Table): Groups 1,2, and 3 consist of scenarios considering the Caucasus, North Africa, and Middle East population as the origin, respectively. Groups 4,5 and 6 consider North America, Middle East, and Caucasus to have been established through an admixture event of the other two populations, respectively. Group 7 considers that all three populations diverged at the same time. Groups 8 to 13 consider a ghost population to be parental to one of the sampled populations. A detailed description of the scenarios and scenario families can be found in the Supplementary materials: S4 Fig and Table G in S1 Table.

In the second analysis, 17 scenarios were considered to assess the emergence and relationship of the European population in relation to the putatively ancestral populations from Caucasus, Middle East, and North America (S5 Fig). Group 1 consisted of scenarios where Europe and Caucasus have the same ancestral population (scenarios 1,8,9,16). Group 2 considers Europe and Middle East share a common ancestor (scenarios 2,13). Group 3 considers Europe and North Africa share a common ancestor (scenarios 3,6,14). Group 4 considers that Europe and the “ghost” population share a common ancestor (scenarios 4,7). Group 5 (scenarios 10, 11, 12) consider the European population to be the product of admixture of two other sampled populations. Group 6 considers scenario 17, where the Europe population is the “ghost” population identified in step 1. The best scenario, selected by random forest in analysis two, was used as the base for analysis 3.

In the third analysis, we assessed the relationship of the population originating from the North America with the rest of the population (S6 Fig). As many as 26 scenarios were tested. As in the previous steps, group 1 consisted of scenarios where North America and Caucasian have the same ancestral population (scenarios 1,8,9,16). Group 2 considers North America and Middle East to share a common ancestor (scenarios 2,13). Group 3 considers North America and Europe to share a common ancestor (scenarios 3,7,10,23,24). Group 4 considers North America and North Africa share a common ancestor (scenarios 4,8,11,13,25). Group 5 considers the North America and the “ghost” population to share a common ancestor (scenarios 5,9,12,14,15). Group 6, consisting of scenarios 6,7,8,10,11,13 considers the North America population to be the product of admixture of two other sampled populations. Group 7, consisting of scenario 26, considers the North America population as the “ghost” population identified in step 1.

A uniform prior distribution for all effective population sizes in the three analyses ranged from 500 and 5,000,000. The changes in effective population sizes were split into recent changes, where the uniform prior distribution was between 10 and 20,000 years ago, and ancient, where the distribution was between 10 and 1,000,000 years ago. The later distribution was used for all splitting and admixture times with uniform probability. For the scenarios that consider the origin of a population through admixture, the prior distribution of the contribution of each parental population was set to be between 0.01 and 0.99. The random forest analysis for the model choice and the parameter inference was performed with 1000 decision trees and default values of DIYABC-RF. Furthermore, DIYABC-RF uses Hudson’s ms simulator [93] with the “-s” parameter to introduce a fixed number of segregating sites under each scenario. Mutation parameters are, in this case, not needed for the simulations [47]. Eventually we have scaled the time values by estimating the current effective population sizes based on Watterson’s theta and a mutation rate of 4,5 x 10⁻⁷ (25) (Table I in S1 Table).

MSMC2 v2.1.1 [50] was applied to infer changes in the effective population size through time. MSMC2 uses a backward-in-time algorithm to build back genome lineages. The MSMCtools bamCaller script was used for the preparation of mask files for the low-coverage regions and to “diploidize” the haploid vcf files. After that, the script generate_multihetsep.py included in the MSMC2 software package was used to create the input files for the analysis. Changes in effective population size were inferred using all the isolates available for each population (from 7 for Middle East to 21 for North Africa). Subsequently, the cross-coalescent rate between the populations was estimated using 100 iterations. To this end, five independent runs of 14 randomly selected isolates per population pair (seven isolates per population) were performed for each pair of populations. The coalescence rate within populations and the cross-coalescence rate between populations was re-calculated based on the 14 randomly selected haplotypes in each run.

Genome scans for selective sweeps

Three independent approaches were applied to identify signatures of selective sweeps along the P. teres f. teres genome. Hereby we used the programs SweeD [53], 2), OmegaPlus v. 3.0.3 [54], and 3) RaiSD v 2.9 [55]. The analyses were conducted with the full high-quality dataset. SweeD v. 3.0 uses the Site Frequency Spectrum (SFS) patterns of SNPs to estimate a composite likelihood ratio (CLR) test for detecting complete sweeps (Pavlidis et al., 2013). We used SweeD, OmegaPlus, and RaisD individually for each genetic cluster and each of the 12 chromosomes and a grid size equal to the number of SNPs present in each chromosome (28,698–77,617 points). OmegaPlus is a scalable implementation of the ω statistic [94] that can be applied to whole-genome data. It uses a maximum likelihood framework and utilizes information on the LD between SNPs. For OmegaPlus, the minimum and maximum window sizes were set to 1 kb and 100 kb, respectively. RaiSD computes the μ statistic, a composite evaluation test that scores genomic regions by quantifying changes in the SFS, the levels of LD, and the amount of genetic diversity along the chromosome [55]. We used RaiSD with the default window size of 50 kb.

Changes in genetic variation and LD along the genome are also influenced by demography. To account for the effect of demography and determine the significance level of the identified selective sweeps, we simulated 10,000 datasets under the best neutral demographic scenario using the program ms [93], to mimic a population evolving under the same conditions as P. teres f. teres, but without any effect of selection. We then computed the ω and μ statistics on this data and used the highest ω and μ value obtained under the best demographic scenario without selection as the threshold for our selective sweep analyses. Setting a significance threshold for the deviation of the ω and μ statistics based on the simulated data sets allowed us to control for the effect of the demographic history of the population on the SFS, LD, and genetic diversity along the genome [53,95]. Subsequently, we only kept the selective sweep regions with evidence of selection from at least two methods to control for false positives. Genome-wide maps of the sweep regions were created using Circos v. 0.69–9 [96].

To test if the abundance of effector genes was different for the predicted selective sweep regions compared to the rest of the genome, we used a permutation test (based on a custom script available in GitHub: https://github.com/Jimi92/Population-genomics-Pyrenophora-teres). In brief, the abundance of predicted effectors was counted in regions of the same size and number equal to the predicted sweep regions. As many as 10,000 replicate runs of random resampled region were performed.

Co-localization of selective sweep regions, QTLs and predicted effectors

QTLs associated with P teres teres virulence were published in previous works [38,39,97]. In addition, effector prediction was performed in a previous work [40] We have compared the candidate selective sweep regions obtained through our analyses to the reported QTL and predicted effector coordinates using BEDTools version 2.27.1 [98]

Supporting information

S1 Table. Supplementary Tables.

Table A: Sampling sites of NFNB. Table B: Read counts. Number of raw reads before and after filtering for quality, length and PCR duplicates. Also shown is percentage of reads mapped to the reference genome and mean coverage per isolate across the genome. Table C: Pairwise Wilcoxon test to assess significant differences in genetic diversity levels between P. teres f. teres populations. Table D: Pairwise Wilcoxon test to assess significant differences in Tajima’s D between P. teres f. teres populations. Table E: Proportion of each mating type per population. Table F: Cross-validation error over 10 replicate runs, the average error per K-value and the standard deviation. Table G: Overview of scenario groups used in the three DIYABC-RF analyses. Table H: Overview of random forest vote results for each of the three ABC-RF analyses performed for individual scenarios and scenario groups. Table I: Estimations of splitting time between P. teres f. teres populations. Table J: Genomic regions that have undergone a recent selective sweep for the Middle Eastern population. The reported regions have been identified by three independent methods (see Methods). Table K: Genomic regions that have undergone a recent selective sweep for the North American population. The reported regions have been identified by three independent methods (see Methods). Table L: Genomic regions that have undergone a recent selective sweep for North African population. The reported regions have been identified by three independent methods (see Methods). Table M: Genomic regions that have undergone a recent selective sweep for the Caucasian population. The reported regions have been identified by three independent methods (see Methods). Table N: Genomic regions that have undergone a recent selective sweep for the Californian population. The reported regions have been identified by three independent methods (see Methods. Table O: Genomic regions that have undergone a recent selective sweep for the North and Central European population. The reported regions have been identified by three independent methods (see Methods). Table P: Pairwise population divergence estimates (Fst) of P. teres f. teres populations.

(XLSX)

Click here for additional data file.^{(47.5KB, xlsx)}

S1 Fig. Boxplot showing values of Tajima’s D of P. teres f. teres populations in each geographic region.

Kruskal-Wallis test with post-hoc pairwise Wilcoxon was used to identify significant differences (p < 0.05) between the groups (Table D in S1 Table).

(PDF)

Click here for additional data file.^{(64.4KB, pdf)}

S2 Fig. NeighbourNet tree shows the evolutionary relationship between P.teres f. teres, P.teres f. maculata and P. tritici-repentis.

(PDF)

Click here for additional data file.^{(98KB, pdf)}

S3 Fig. Mantel test shows significant corelation between geographic and genetic distance (Number of permutations: 9,999, Pearson corelation coefficient).

(PDF)

Click here for additional data file.^{(283.3KB, pdf)}

S4 Fig. Models tested in DIYABC-RF analysis 1.

(PDF)

Click here for additional data file.^{(4.4MB, pdf)}

S5 Fig. Models tested in DIYABC-RF analysis 2.

(PDF)

Click here for additional data file.^{(2.6MB, pdf)}

S6 Fig. Models tested in DIYABC-RF analysis 3.

(PDF)

Click here for additional data file.^{(3.5MB, pdf)}

S7 Fig. The effect of sample size on the genetic diversity estimations.

(PDF)

Click here for additional data file.^{(64.9KB, pdf)}

Acknowledgments

Asieh Vasighzadeh kindly provided isolates from Iran which were used in this study. The authors are grateful to Idalia Rojas Barrera, Danilo Pereira, Wagner Fagundes and other members of the Environmental Genomics group. The PhD research of DT was conducted in the framework of the DFG Research Training Group RTG 2501.

Data Availability

DT’s work was supported by the German Science Foundation (DFG) Research Training Group RTG 2501. The research was further supported by the U.S. Department of Agriculture, Agricultural Research Service through USDA project 3060-22000-051-000D (Friesen). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Funding Statement

References

1.Fellers JP, Sakthikumar S, He F, McRell K, Bakkeren G, Cuomo CA, et al. Whole-genome sequencing of multiple isolates of Puccinia triticina reveals asexual lineages evolving by recurrent mutations. G3: Genes, Genomes, Genetics. 2021;11. doi: 10.1093/g3journal/jkab219 [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Islam MT, Croll D, Gladieux P, Soanes DM, Persoons A, Bhattacharjee P, et al. Emergence of wheat blast in Bangladesh was caused by a South American lineage of Magnaporthe oryzae. BMC Biol. 2016;14: 1–11. doi: 10.1186/s12915-016-0309-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Ali S, Gladieux P, Leconte M, Gautier A, Justesen AF, Hovmøller MS, et al. Origin, Migration Routes and Worldwide Population Genetic Structure of the Wheat Yellow Rust Pathogen Puccinia striiformis f.sp. tritici. PLoS Pathog. 2014;10: 1–13. doi: 10.1371/journal.ppat.1003903 [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Liu Z, Ellwood SR, Oliver RP, Friesen TL. Pyrenophora teres: Profile of an increasingly damaging barley pathogen. Mol Plant Pathol. 2011;12: 1–19. doi: 10.1111/j.1364-3703.2010.00649.x [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Douiyssi A, Rasmusson DC, Roelfs AP. Responses of barley cultivars and lines to isolates of Pyrenophora teres. Plant Dis. 1998;82: 316–321. doi: 10.1094/PDIS.1998.82.3.316 [DOI] [PubMed] [Google Scholar]
6.Louw JPJ, Victor D, Crous PW, Holz G, Janse BJH. Characterization of Pyrenophora Isolates Associated with Spot and Net Type Lesions on Barley in South Africa. Journal of Phytopathology. 1995;143: 129–134. doi: doi.org/10.1111/j.1439-0434.1995.tb00245.x [Google Scholar]
7.Zeleke T. Evaluation of host reaction and yield performance of malt barley cultivars to net blotch, Pyrenophora teres in Bale Highlands, Ethiopia. Journal of Plant Sciences. 2017;5: 43–47. doi: 10.11648/j.jps.20170501.16 [DOI] [Google Scholar]
8.Arabi MIE, Al-Safadi B, Charbaji T. Pathogenic variation among isolates of Pyrenophora teres, the causal agent of barley net blotch. Journal of Phytopathology. 2003. pp. 376–382. doi: 10.1046/j.1439-0434.2003.00734.x [DOI] [Google Scholar]
9.Sato K, Takeda K. Net blotch resistance in wild species of Hordeum. Euphytica. 1997;95: 179–185. doi: 10.1023/A:1002958924439 [DOI] [Google Scholar]
10.ARABI MI BARRAULT G, SARRAFI A, ALBERTINI L. Variation in the resistance of barley cultivars and in the pathogenicity of Drechslera teres f. sp. maculata and D. teres f. sp. teres isolates from France. Plant Pathol. 1992;41: 180–186. doi: 10.1111/j.1365-3059.1992.tb02336.x [DOI] [Google Scholar]
11.Tekauz A. Characterization and distribution of pathogenic variation in Pyrenophora teres f. teres and P. teres f. maculata from western Canada. Canadian Journal of Plant Pathology. 1990;12: 141–148. doi: 10.1080/07060669009501017 [DOI] [Google Scholar]
12.Moya P, Girotti JR, Toledo AV, Sisterna MN. Antifungal activity of Trichoderma VOCs against Pyrenophora teres, the causal agent of barley net blotch. J Plant Prot Res. 2018;58: 45–53. doi: 10.24425/119115 [DOI] [Google Scholar]
13.Cromey MG, Parkes RA. Pathogenic variation in Drechslera teres in New Zealand. New Zealand Plant Protection. 2003;56: 251–256. doi: 10.30843/nzpp.2003.56.6020 [DOI] [Google Scholar]
14.van Zeist W, Bakker. View of Archaeobotanical studies in the Levant: I. Neolithic sites in the Damascus Basin: Aswad, Ghoraife, Ramad. Palaeohistoria. 1982;24: 165–256. [Google Scholar]
15.Åberg E. Hordeum agriocrithon nova sp., a wild six-rowed barley. Ann R Agric Col Swed. 1938;6: 159–216. [Google Scholar]
16.Molina-Cano JL, Fra-Mon P, Salcedo G, Aragoncillo C, de Togores FR, García-Olmedo F. Morocco as a possible domestication center for barley: biochemical and agromorphological evidence. Theoretical and Applied Genetics. 1987;73: 531–536. doi: 10.1007/BF00289190 [DOI] [PubMed] [Google Scholar]
17.Molina-Cano JL, Moralejo M, Igartua E, Romagosa I. Further evidence supporting Morocco as a centre of origin of barley. Theoretical and Applied Genetics. 1999;98: 913–918. doi: 10.1007/s001220051150 [DOI] [Google Scholar]
18.Tanno K, Taketa S, Takeda K, Komatsuda T. A DNA marker closely linked to the vrs1 locus (row-type gene) indicates multiple origins of six-rowed cultivated barley (Hordeum vulgare L.). Theoretical and Applied Genetics. 2002;104: 54–60. doi: 10.1007/s001220200006 [DOI] [PubMed] [Google Scholar]
19.Riehl S. Barley in Archaeology and Early History. Oxford Research Encyclopedia of Environmental Science. 2019. doi: 10.1093/acrefore/9780199389414.013.219 [DOI] [Google Scholar]
20.McDonald BA, Stukenbrock EH. Rapid emergence of pathogens in agro-ecosystems: Global threats to agricultural sustainability and food security. Philosophical Transactions of the Royal Society B: Biological Sciences. 2016;371. doi: 10.1098/rstb.2016.0026 [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Banke S, Mcdonald BA. Migration patterns among global populations of the pathogenic fungus Mycosphaerella graminicola. Mol Ecol. 2005;14: 1881–1896. doi: 10.1111/j.1365-294X.2005.02536.x [DOI] [PubMed] [Google Scholar]
22.Feurtey A, Lorrain C, McDonald MC, Milgate A, Solomon PS, Warren R, et al. A thousand-genome panel retraces the global spread and adaptation of a major fungal crop pathogen. Nat Commun. 2023;14. doi: 10.1038/s41467-023-36674-y [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Munkacsi AB, Stoxen S, May G. Ustilago maydis populations tracked maize through domestication and cultivation in the Americas. Proceedings of the Royal Society B: Biological Sciences. 2008;275: 1037–1046. doi: 10.1098/rspb.2007.1636 [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Stukenbrock EH, Mcdonald BA. The Origins of Plant Pathogens in Agro-Ecosystems. Annu Rev Phytopathol. 2008;46: 75–100. doi: 10.1146/annurev.phyto.010708.154114 [DOI] [PubMed] [Google Scholar]
25.Sotiropoulos AG, Arango-Isaza E, Ban T, Barbieri C, Bourras S, Cowger C, et al. Global genomic analyses of wheat powdery mildew reveal association of pathogen spread with historical human migration and trade. Nat Commun. 2022;13: 1–14. doi: 10.1038/s41467-022-31975-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Santini A, Liebhold A, Migliorini D, Woodward S. Tracing the role of human civilization in the globalization of plant pathogens. ISME Journal. 2018;12: 647–652. doi: 10.1038/s41396-017-0013-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Menardo F, Praz CR, Wyder S, Ben-David R, Bourras S, Matsumae H, et al. Hybridization of powdery mildew strains gives rise to pathogens on novel agricultural crop species. Nat Genet. 2016;48: 201–205. doi: 10.1038/ng.3485 [DOI] [PubMed] [Google Scholar]
28.Thierry M, Charriat F, Milazzo J, Adreit H, Ravel S, Cros-Arteil S, et al. Maintenance of divergent lineages of the Rice Blast Fungus Pyricularia oryzae through niche separation, loss of sex and post-mating genetic incompatibilities. PLoS Pathog. 2022;18. doi: 10.1371/journal.ppat.1010687 [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Derbyshire MC. Bioinformatic Detection of Positive Selection Pressure in Plant Pathogens: The Neutral Theory of Molecular Sequence Evolution in Action. Front Microbiol. 2020;11: 1–14. doi: 10.3389/fmicb.2020.00644 [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Tellier A, Moreno-Gamez S, Stephan W. Speed of adaptation and genomic signatures in arms race and trench warfare models of host- parasite coevolution. Evolution (N Y). 2014;68: 73–78. [DOI] [PubMed] [Google Scholar]
31.Ebert MK, Rangel LI, Spanner RE, Taliadoros D, Wang X, Friesen TL, et al. Identification and characterization of Cercospora beticola necrosis-inducing effector CbNip1. Molecular Plant Pathology. 2021. pp. 301–316. doi: 10.1111/mpp.13026 [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Duan G, Bao J, Chen X, Xie J, Liu Y, Chen H, et al. Large-Scale Genome Scanning within Exonic Regions Revealed the Contributions of Selective Sweep Prone Genes to Host Divergence and Adaptation in Magnaporthe oryzae Species Complex. Microorganisms. 2021;9: 562. doi: 10.3390/microorganisms9030562 [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Richards JK, Stukenbrock EH, Carpenter J, Liu Z, Cowger C, Faris JD, et al. Local adaptation drives the diversification of effectors in the fungal wheat pathogen Parastagonospora nodorum in the United States. PLoS Genetics. 2019. doi: 10.1371/journal.pgen.1008223 [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Pereira D, Oggenfuss U, McDonald BA, Croll D. Population genomics of transposable element activation in the highly repressive genome of an agricultural pathogen. Microb Genom. 2021;7. doi: 10.1099/mgen.0.000540 [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Rovenich H, Boshoven JC, Thomma BPHJ. Filamentous pathogen effector functions: Of pathogens, hosts and microbiomes. Curr Opin Plant Biol. 2014;20: 96–103. doi: 10.1016/j.pbi.2014.05.001 [DOI] [PubMed] [Google Scholar]
36.Sperschneider J, Dodds PN, Gardiner DM, Singh KB, Taylor JM. Improved prediction of fungal effector proteins from secretomes with EffectorP 2. 0. Mol Plant Pathol. 2018;9: 2094–2110. doi: 10.1111/mpp.12682 [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Demirjian C, Vailleau F, Berthomé R, Roux F. Genome-wide association studies in plant pathosystems: success or failure? Trends Plant Sci. 2022;xx: 1–15. doi: 10.1016/j.tplants.2022.11.006 [DOI] [PubMed] [Google Scholar]
38.Koladia VM, Richards JK, Wyatt NA, Faris JD, Brueggeman RS, Friesen TL. Genetic analysis of virulence in the Pyrenophora teres f. teres population BB25 × FGOH04Ptt-21. Fungal Genetics and Biology. 2017;107: 12–19. doi: 10.1016/j.fgb.2017.07.003 [DOI] [PubMed] [Google Scholar]
39.Shjerve RA, Faris JD, Brueggeman RS, Yan C, Zhu Y, Koladia V, et al. Evaluation of a Pyrenophora teres f. teres mapping population reveals multiple independent interactions with a region of barley chromosome 6H. Fungal Genetics and Biology. 2014;70: 104–112. doi: 10.1016/j.fgb.2014.07.012 [DOI] [PubMed] [Google Scholar]
40.Wyatt N, Richards J, Brueggeman R, Friesen T. Reference Assembly and Annotation of the Pyrenophora teres f. teres Isolate 0–1. GENES, GENOMES, GENETICS. 2018;8: 1–8. doi: 10.1534/g3.117.300196 [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Bakonyi J, Justesen AF. Genetic relationship of Pyrenophora graminea, P. teres f. maculata and P. teres f. teres assessed by RAPD analysis. Journal of Phytopathology. 2007;155: 76–83. doi: 10.1111/j.1439-0434.2007.01192.x [DOI] [Google Scholar]
42.Wingfield BD, Berger DK, Coetzee MPA, Duong TA, Martin A, Pham NQ, et al. IMA genome-F17: Draft genome sequences of an Armillaria species from Zimbabwe, Ceratocystis colombiana, Elsinoë necatrix, Rosellinia necatrix, two genomes of Sclerotinia minor, short-read genome assemblies and annotations of four Pyrenophora teres isolate. IMA Fungus. 2022;13. doi: 10.1186/s43008-022-00104-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Ni M, Feretzaki M, Sun S, Wang X, Heitman J. Sex in Fungi. Annu Rev Genet. 2011;45: 405–430. doi: 10.1146/annurev-genet-110410-132536 [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. Journal of Computational Biology. 2012;19: 455–477. doi: 10.1089/cmb.2012.0021 [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19: 1655–1664. doi: 10.1101/gr.094052.109 [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Mantel N. The Detection of disease clustering and a generalized regression approach. Cancer Reaserch. 1967;27: 209–220. doi: 10.1016/0002-9343(48)90434-3 [DOI] [PubMed] [Google Scholar]
47.Collin FD, Durif G, Raynal L, Lombaert E, Gautier M, Vitalis R, et al. Extending approximate Bayesian computation with supervised machine learning to infer demographic history from genetic polymorphisms using DIYABC Random Forest. Mol Ecol Resour. 2021;21: 2598–2613. doi: 10.1111/1755-0998.13413 [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Nei M, Maruyama T, Chakraborty R. The bottleneck effect and genetic variability in populations. Evolution (N Y). 1975;29: 1–10. doi: 10.1111/j.1558-5646.1975.tb00807.x [DOI] [PubMed] [Google Scholar]
49.Slatkin M, Excoffier L. Serial founder effects during range expansion: A spatial analog of genetic drift. Genetics. 2012;191: 171–181. doi: 10.1534/genetics.112.139022 [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Malaspinas AS, Westaway MC, Muller C, Sousa VC, Lao O, Alves I, et al. A genomic history of Aboriginal Australia. Nature. 2016;538: 207–214. doi: 10.1038/nature18299 [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Komluski J, Habig M, Stukenbrock EH. Repeat-Induced Point Mutation and Gene Conversion Coinciding with Heterochromatin Shape the Genome of a Plant-Pathogenic Fungus. mBio. 2023;14. doi: 10.1128/mbio.03290-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Simões LG, Günther T, Martínez-Sánchez RM, Vera-Rodríguez JC, Iriarte E, Rodríguez-Varela R, et al. Northwest African Neolithic initiated by migrants from Iberia and Levant. Nature. 2023;618: 550–556. doi: 10.1038/s41586-023-06166-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Stamatakis A, Alachiotis N, Pavlidis P, Daniel Z. SweeD: Likelihood-Based Detection of Selective Sweeps in Thousands of Genomes. Mol Biol Evol. 2013;30: 2224–2234. doi: 10.1093/molbev/mst112 [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Alachiotis N, Stamatakis A, Pavlidis P. OmegaPlus: a scalable tool for rapid detection of selective sweeps in whole-genome datasets. Bioinformatics. 2012;28: 2274–2275. doi: 10.1093/bioinformatics/bts419 [DOI] [PubMed] [Google Scholar]
55.Alachiotis N, Pavlidis P. RAiSD detects positive selection based on multiple signatures of a selective sweep and SNP vectors. Commun Biol. 2018;1. doi: 10.1038/s42003-018-0085-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Stephan W. Selective Sweeps. 2019;211: 5–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
57.Weiland JJ, Steffenson BJ, Cartwright RD, Webster RK. Identification of molecular genetic markers in Pyrenophora teres f. teres associated with low virulence on “Harbin” barley. Phytopathology. 1999;89: 176–181. doi: 10.1094/PHYTO.1999.89.2.176 [DOI] [PubMed] [Google Scholar]
58.Poets AM, Fang Z, Clegg MT, Morrell PL. Barley landraces are characterized by geographically heterogeneous genomic origins. Genome Biol. 2015;16: 1–11. doi: 10.1186/s13059-015-0712-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Stukenbrock EH, Banke S, Javan-Nikkhah M, McDonald BA. Origin and domestication of the fungal wheat pathogen Mycosphaerella graminicola via sympatric speciation. Mol Biol Evol. 2007;24: 398–411. doi: 10.1093/molbev/msl169 [DOI] [PubMed] [Google Scholar]
60.Couch BC, Fudal I, Lebrun MH, Tharreau D, Valent B, Van Kim P, et al. Origins of host-specific populations of the blast pathogen Magnaporthe oryzae in crop domestication with subsequent expansion of pandemic clones on rice and weeds of rice. Genetics. 2005;170: 613–630. doi: 10.1534/genetics.105.041780 [DOI] [PMC free article] [PubMed] [Google Scholar]
61.Schweizer G, Haider MB, Barroso G V., Rössel N, Münch K, Kahmann R, et al. Population Genomics of the Maize Pathogen Ustilago maydis: Demographic History and Role of Virulence Clusters in Adaptation. Genome Biol Evol. 2021;13: 1–17. doi: 10.1093/gbe/evab073 [DOI] [PMC free article] [PubMed] [Google Scholar]
62.Fuller QD, Weisskopf A. Encyclopedia of Global Archaeology, Barley: Origins and Development. 2014. doi: 10.1007/978-1-4419-0465-2_2168 [DOI] [Google Scholar]
63.Jones H, Civá P, Cockram J, Leigh FJ, Smith LMJ, Jones MK, et al. Evolutionary history of barley cultivation in Europe revealed by genetic analysis of extant landraces. BMC Evol Biol. 2011;11: 1–12. doi: 10.1186/1471-2148-11-320 [DOI] [PMC free article] [PubMed] [Google Scholar]
64.Kirleis W, Klooß S, Kroll H, Müller J. Crop growing and gathering in the northern German Neolithic: A review supplemented by new results. Veg Hist Archaeobot. 2012;21: 221–242. doi: 10.1007/s00334-011-0328-9 [DOI] [Google Scholar]
65.Hartmann FE, McDonald BA, Croll D. Genome-wide evidence for divergent selection between populations of a major agricultural pathogen. Molecular Ecology. 2018. pp. 2725–2741. doi: 10.1111/mec.14711 [DOI] [PMC free article] [PubMed] [Google Scholar]
66.Zhong Z, Chen M, Lin L, Han Y, Bao J, Tang W, et al. Population genomic analysis of the rice blast fungus reveals specific events associated with expansion of three main clades. ISME Journal. 2018;12: 1867–1878. doi: 10.1038/s41396-018-0100-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
67.Dahanayaka BA, Vaghefi N, Knight NL, Bakonyi J, Prins R, Seress D, et al. Population Structure of Pyrenophora teres f. teres Barley Pathogens from Different Continents. Phytopathology. 2021;111: 2118–2129. doi: 10.1094/PHYTO-09-20-0390-R [DOI] [PubMed] [Google Scholar]
68.Stukenbrock EH, Banke S, McDonald BA. Global migration patterns in the fungal wheat pathogen Phaeosphaeria nodorum. Mol Ecol. 2006;15: 2895–2904. doi: 10.1111/j.1365-294X.2006.02986.x [DOI] [PubMed] [Google Scholar]
69.Li J, Wyatt NA, Skiba RM, Kariyawasam GK, Effertz K, Rehman S, et al. Pathogen genetics identifies avirulence/virulence loci associated with barley 1 chromosome 6H resistance in the Pyrenophora teres f. teres-barley interaction Biodiversity and Crop Improvement Program, International Center for Agricultural Orcid-IDs. doi: 10.1101/2023.02.10.527674 [DOI] [Google Scholar]
70.Bolger AM, Lohse M, Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30: 2114–2120. doi: 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
71.Zhang J, Kobert K, Flouri T, Stamatakis A. PEAR: A fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics. 2014;30: 614–620. doi: 10.1093/bioinformatics/btt593 [DOI] [PMC free article] [PubMed] [Google Scholar]
72.Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010;26: 589–595. doi: 10.1093/bioinformatics/btp698 [DOI] [PMC free article] [PubMed] [Google Scholar]
73.Lunter G, Goodson M. Stampy: a statistical algorithm for sensitive and fast mapping of Illumina sequence reads. Genome Res. 2011. Genome Res. 2011;21:936–939: 936–939. doi: 10.1101/gr.111120.110 [doi] [DOI] [PMC free article] [PubMed] [Google Scholar]
74.McKenna A, Hanna M, Banks E, Sivachenko AY, Cibulskis K, Kernytsky AM, et al. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20: 1297–1303. doi: 10.1101/gr.107524.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
75.Bryant D, Moulton V. Neighbor-Net: An Agglomerative Method for the Construction of Phylogenetic Networks. Mol Biol Evol. 2004;21: 255–265. doi: 10.1093/molbev/msh018 [DOI] [PubMed] [Google Scholar]
76.Wickham H. ggplot2: Elegant Graphics for Data Analysis. Journeal of Statistical Software. 2017;80: 1–4. doi: 10.18637/jss.v080.b01 [DOI] [Google Scholar]
77.Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2006;23: 254–267. doi: 10.1093/molbev/msj030 [DOI] [PubMed] [Google Scholar]
78.Paradis E, Schliep K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019;35: 526–528. doi: 10.1093/bioinformatics/bty633 [DOI] [PubMed] [Google Scholar]
79.Korneliussen TS, Albrechtsen A, Nielsen R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinformatics. 2014;15: 1–13. doi: 10.1186/s12859-014-0356-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
80.Zhang C, Dong SS, Xu JY, He WM, Yang TL. PopLDdecay: A fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. 2019;35: 1786–1788. doi: 10.1093/bioinformatics/bty875 [DOI] [PubMed] [Google Scholar]
81.Danecek P, Auton A, Abecasis G, Albers CA, Banks E, Depristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27: 2156–2158. doi: 10.1093/bioinformatics/btr330 [DOI] [PMC free article] [PubMed] [Google Scholar]
82.Weir BS, Cockerham CC. Estimating F-Statistics for the Analysis of Population Structure. Evolution (N Y). 1984;38: 1358–1370. doi: 10.1111/j.1558-5646.1984.tb05657.x [DOI] [PubMed] [Google Scholar]
83.Lu S, Platz GJ, Edwards MC, Friesen TL. Mating type locus-specific polymerase chain reaction markers for differentiation of Pyrenophora teres f. teres and P. teres f. maculata, the causal agents of barley net blotch. Phytopathology. 2010;100: 1298–1306. doi: 10.1094/PHYTO-05-10-0135 [DOI] [PubMed] [Google Scholar]
84.Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: Architecture and applications. BMC Bioinformatics. 2009;10: 1–9. doi: 10.1186/1471-2105-10-421 [DOI] [PMC free article] [PubMed] [Google Scholar]
85.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol Biol Evol. 2013;30: 772–780. doi: 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
86.Okonechnikov K, Golosova O, Fursov M, Varlamov A, Vaskin Y, Efremov I, et al. Unipro UGENE: A unified bioinformatics toolkit. Bioinformatics. 2012;28: 1166–1167. doi: 10.1093/bioinformatics/bts091 [DOI] [PubMed] [Google Scholar]
87.Shen W, Le S, Li Y, Hu F. SeqKit: A cross-platform and ultrafast toolkit for FASTA/Q file manipulation. PLoS One. 2016;11: 1–10. doi: 10.1371/journal.pone.0163962 [DOI] [PMC free article] [PubMed] [Google Scholar]
88.Nguyen LT, Schmidt HA, Von Haeseler A, Minh BQ. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32: 268–274. doi: 10.1093/molbev/msu300 [DOI] [PMC free article] [PubMed] [Google Scholar]
89.Kalyaanamoorthy S, Minh BQ, Wong TKF, Von Haeseler A, Jermiin LS. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14: 587–589. doi: 10.1038/nmeth.4285 [DOI] [PMC free article] [PubMed] [Google Scholar]
90.Hoang DT, Chernomor O, Von Haeseler A, Minh BQ, Vinh LS. UFBoot2: Improving the ultrafast bootstrap approximation. Mol Biol Evol. 2018;35: 518–522. doi: 10.1093/molbev/msx281 [DOI] [PMC free article] [PubMed] [Google Scholar]
91.Hoang DT, Vinh LS, Flouri T, Stamatakis A, Von Haeseler A, Minh BQ. MPBoot: Fast phylogenetic maximum parsimony tree inference and bootstrap approximation. BMC Evol Biol. 2018;18: 1–11. doi: 10.1186/s12862-018-1131-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
92.Schrempf D, Minh BQ, De Maio N, von Haeseler A, Kosiol C. Reversible polymorphism-aware phylogenetic models and their application to tree inference. J Theor Biol. 2016;407: 362–370. doi: 10.1016/j.jtbi.2016.07.042 [DOI] [PubMed] [Google Scholar]
93.Hudson RR. Generating samples under a Wright–Fisher neutral model ofgenetic variation. Bioinformatics. 2002;18: 337–338. doi: 10.1093/bioinformatics/18.2.337 [DOI] [PubMed] [Google Scholar]
94.Kim Y, Neilsen R. Linkage disequilibrium as a signature of selective sweeps. Genetics. 2004;167: 1513–1524. doi: 10.1534/genetics.103.025387 [DOI] [PMC free article] [PubMed] [Google Scholar]
95.Nielsen R, Williamson S, Kim Y, Hubisz MJ, Clark AG, Bustamante C. Genomic scans for selective sweeps using SNP data. Genome Rese. 2005;15: 1566–1575. doi: 10.1101/gr.4252305 [DOI] [PMC free article] [PubMed] [Google Scholar]
96.Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, et al. Circos: An information aesthetic for comparative genomics. Genome Res. 2009;19: 1639–1645. doi: 10.1101/gr.092759.109 [DOI] [PMC free article] [PubMed] [Google Scholar]
97.Clare SJ, Wyatt NA, Brueggeman RS, Friesen TL. Research advances in the Pyrenophora teres–barley interaction. Mol Plant Pathol. 2020;21: 272–288. doi: 10.1111/mpp.12896 [DOI] [PMC free article] [PubMed] [Google Scholar]
98.Quinlan AR, Hall IM. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26: 841–842. doi: 10.1093/bioinformatics/btq033 [DOI] [PMC free article] [PubMed] [Google Scholar]

PLoS Genet. doi: 10.1371/journal.pgen.1010884.r001

Decision Letter 0

Gregory P Copenhaver

7 Sep 2023

Dear Eva,

I apologize for the length of time it has taken us to return our decision to you. We had secured two external peer-reviewers but despite several reminders from the journal, and from me personally, one of them is unresponsive. The review we have is thoughtful constructive, so using that together with our own judgment we have decided to move forward with a single review. Below please find the standard letter inviting a revised manuscript.

-cheers

GPC

Dear Dr Stukenbrock,

Thank you very much for submitting your Research Article entitled 'Emergence and spread of the barley net blotch pathogen coincided with crop domestication and cultivation history' to PLOS Genetics.

The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review a much-revised version. We cannot, of course, promise publication at that time.

Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions.

Yours sincerely,

Gregory P. Copenhaver

Editor-in-Chief

PLOS Genetics

Gregory Barsh

Editor-in-Chief

PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The Authors reconstruct the demographic history of the Barley net blotch pathogen using genomic data collected across continents, and they find signatures of selection at genomic regions enriched for effector genes likely involved in host-pathogen interactions.

I find the manuscript well written, I find the amount of work considerable and the analyses generally robust and that the results align with the authors claims, which are interesting and novel.

However, some parts were for me hard to follow, for example the discussion on the different models tested with ABC, and some results did not really behave as I would have expected, like the msmc2-ccr analysis, or where not really consistent across the manuscript, like for Ne estimated from theta (table 1) and with msmc2 (Fig 4c).

More specific comments below:

Table 1: for all populations, theta pi is larger than theta W. As a consequence, Tajima’s D values should all be positive, but in the table they are not. Maybe the authors can clarify this.

Results:

Line 163: “indicating” a bottleneck seems too strong. While a positive tajima’s D value is consistent with a bottleneck, other scenarios may result in a positive Tajima’s D. I would edit to “suggesting” a bottleneck scenario

The section on ABC inference of population splits was for me difficult to follow. I understand the authors tested many different scenarios, but I found myself wondering which these scenarios are, why they were chosen, and what was the rational behind inferring this history in three steps, and including the European and North American populations only at the second and third steps. I think it would help the reader to try to restructure this part and discuss which scenarios were tested and why

Line 310: this sentence refers to fig 4D, on the cross-coalescence rate, but my impression from the text was that the authors are still talking about inference of effective population size (would be fig 4c). This sentence could be restructured to clarify when the authors are estimating Ne and when split times.

Effective population size for the Caucasus has an interesting spike at very ancient times. The authors also showed earlier in the manuscript that in the Caucasus there are two diverged genetic clusters (from the neighbour-net-tree). I was wondering what part may the two clusters have in potentially creating this spike. Population structure results in lower coalescence rates, hence high inferred Ne, but this effect is normally seen in more recent times rather than in more ancient times. However the two clusters in the Caucasus seem to be highly diverged. I wonder if the authors included these two clusters in msmc2, and if they tried running msmc2 with the main cluster only to see if the spike in Ne remains or disappears.

The authors infer with msmc2 a trend in Ne that almost never goes higher than Ne=1000 for the majority of these populations histories, rather stays mostly under that. How do the authors interpret this in comparison to their table 1, where Ne is rather always higher than 13000 and sometimes up to 60000? Are the same parameters used in both analyses (e.g. mutation rate, 2*Ne*mu versus 4*Ne*mu for theta, 1/2*Ne versus 1/Ne for coalescence rates in diploids versus haploids)?

The trend lines for the cross-coalescence rate analyses seem to level around CCR=0.5 going backwards in time, starting around 10kya into the ancient past. These lines are supposed to level at ccr=1 plus minus some noise, when the populations merge into the ancestral population. I suggest the authors check their parameterisation of CCR, for whether there may be a factor of 2 set differently compared to other studies? This will probably not affect the authors claims in the text, since they focus on the relative age of splits, but it will affect the absolute values for split times and the clarity of the results

Selection analysis:

Line 368: I found myself here asking why the authors did not make a GO analysis before this point, because this section sounds like the permutation test was focused on effector genes. However the authors mention a GO analysis in methods, and this appears only after they discuss selection signatures at effector genes. This is to me a bit of an odd structure for this section of results, where I would have expected a GO analysis right after the selection scans, and only after that a discussion of candidate genes under selection, including the effector genes. It makes me wonder, does a GO analysis pick effector genes?

Methods:

Line 534-539: It was not clear to me what is the difference between point 2 and 6

In the methods for msmc2, I would expect to see some information on how many genomes were used and in which combinations (2,4,8 haplotypes mode, and what configuration for CCR). Some of this is mentioned in results and not in methods, while I would think this belongs to methods

Discussion:

The authors propose a link between the Caucasus and North America. Is a similar link known for Barley? I was missing some discussion of the history of Barley introduction to North America and whether this aligns with the genomic data of the fungus

Minor comments:

1) Line 153: typo, that -> than

Line 189 has a typo at “In”

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: None

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

PLoS Genet. 2024 Jan 29;20(1):e1010884. doi: 10.1371/journal.pgen.1010884.r002

Author response to Decision Letter 0

18 Oct 2023

Attachment

Submitted filename: Taliadoros_ResponseToReviewers.docx

Click here for additional data file.^{(19.2KB, docx)}

PLoS Genet. doi: 10.1371/journal.pgen.1010884.r003

Decision Letter 1

Gregory P Copenhaver

7 Nov 2023

Dear Eva,

Thank you very much for submitting your Research Article entitled 'Emergence and spread of the barley net blotch pathogen coincided with crop domestication and cultivation history' to PLOS Genetics.

The manuscript was fully evaluated at the editorial level and by the original independent peer reviewer. The reviewer appreciated your revisions but identified some concerns that we ask you address in a revised manuscript. I think most of the reviewer comments can be handled by carefully and thoughtfully amending the text, but some re-analysis of data and supporting statistical analysis may also be necessary.

We therefore ask you to modify the manuscript according to the review recommendations. Your revisions should address the specific points made by each reviewer.

In addition we ask that you:

1) Provide a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

2) Upload a Striking Image with a corresponding caption to accompany your manuscript if one is available (either a new image or an existing one from within your manuscript). If this image is judged to be suitable, it may be featured on our website. Images should ideally be high resolution, eye-catching, single panel square images. For examples, please browse our archive. If your image is from someone other than yourself, please ensure that the artist has read and agreed to the terms and conditions of the Creative Commons Attribution License. Note: we cannot publish copyrighted images.

We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

Please let us know if you have any questions while making these revisions.

Yours sincerely,

Gregory P. Copenhaver

Editor-in-Chief

PLOS Genetics

Gregory Barsh

Editor-in-Chief

PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have responded to most of my previous comments, but their new version of the manuscript raised some more questions:

Line 148: I would add mention to the range of coverage. Information on the minimum is important to know what was the sequencing quality of the worse sample included in the analyses

Line 164: “significantly higher” need mention of parameters and results of a statistical test

Lines 164-169: I find it weird to mention diversity levels for some populations but not for all

Line 176: a Tajima’s D of 0.62 is hardly evidence for a bottleneck, especially in comparison to even higher Tajima’s D values in other populations

Line 178: this sentence has no background: “with a frequency of genetic variants that reflects a mutation – drift equilibrium ”. Assuming that this is based on Tajima’s D, I find the text very unclear without a direct mention to a Tajima’s D value and statistical test. The mention to table one is missing from this paragraph.

Lines 180-187: The authors messages are probably true, but I find the way they are presented weird. The North American population is discussed as having very long linkage blocks, and the value for r2/2 of 3.9 in Table 1 is mentioned as a reference. Except the population from the Caucasus has a much higher r2/2 of 7.3. When looking in more detail at figure 1c, this is likely due to a lower maximum r2 for Caucasus, which suggests that the measure reported is not the most relevant. I can see in the figure what the authors mean, that the North American population has more LD, but the measures used to argue for this are unclear. Furthermore, long LD blocks for the North American population are interpreted as a recent founding of this population. However, Caucasus has much longer LD blocks (by the measure used by the authors), but is proposed to be a very ancient population.

Line 195: How is random mating the null hypothesis here? The two mating types can only mate between each other, so that they should maintain a mating type ratio of 1:1. This is very far from random mating, which would not result in a 1:1 ratio

Line 199: typo at “in”

Line 331: It is unclear what “these events” refer to. Since the authors are talking about population splits, I hardly see how “these events” coincide with domestication of barley, since they barely overlap. They seem to rather have followed domestication of barley, I.e. be more recent than domestication of barley, which is also a perfectly plausible biological scenario.

Line 383: This sentence should be supported by the mention of a statistical test of significance

Lines 404-407: the simple overlap between QTL regions and selective sweeps demonstrates very little. Is the overlap larger than expected by chance?

Line 744: It is unclear what significance threshold the authors use for simulations

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Reviewer #1: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

PLoS Genet. 2024 Jan 29;20(1):e1010884. doi: 10.1371/journal.pgen.1010884.r004

Author response to Decision Letter 1

7 Dec 2023

Attachment

Submitted filename: Taliadoros_2ndReviewerResponse.pdf

Click here for additional data file.^{(39.1KB, pdf)}

PLoS Genet. doi: 10.1371/journal.pgen.1010884.r005

Decision Letter 2

Gregory P Copenhaver

11 Dec 2023

Dear Dr Stukenbrock,

We are pleased to inform you that your manuscript entitled "Emergence and spread of the barley net blotch pathogen coincided with crop domestication and cultivation history" has been editorially accepted for publication in PLOS Genetics. Congratulations!

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Gregory P. Copenhaver

Editor-in-Chief

PLOS Genetics

Gregory Barsh

Editor-in-Chief

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Comments from the reviewers (if applicable):

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly:

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-23-00842R2

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

PLoS Genet. doi: 10.1371/journal.pgen.1010884.r006

Acceptance letter

Gregory P Copenhaver

16 Jan 2024

PGENETICS-D-23-00842R2

Emergence and spread of the barley net blotch pathogen coincided with crop domestication and cultivation history

Dear Dr Stukenbrock,

We are pleased to inform you that your manuscript entitled "Emergence and spread of the barley net blotch pathogen coincided with crop domestication and cultivation history" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Zsofi Zombor

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Table. Supplementary Tables.

(XLSX)

Click here for additional data file.^{(47.5KB, xlsx)}

S1 Fig. Boxplot showing values of Tajima’s D of P. teres f. teres populations in each geographic region.

Kruskal-Wallis test with post-hoc pairwise Wilcoxon was used to identify significant differences (p < 0.05) between the groups (Table D in S1 Table).

(PDF)

Click here for additional data file.^{(64.4KB, pdf)}

S2 Fig. NeighbourNet tree shows the evolutionary relationship between P.teres f. teres, P.teres f. maculata and P. tritici-repentis.

(PDF)

Click here for additional data file.^{(98KB, pdf)}

S3 Fig. Mantel test shows significant corelation between geographic and genetic distance (Number of permutations: 9,999, Pearson corelation coefficient).

(PDF)

Click here for additional data file.^{(283.3KB, pdf)}

S4 Fig. Models tested in DIYABC-RF analysis 1.

(PDF)

Click here for additional data file.^{(4.4MB, pdf)}

S5 Fig. Models tested in DIYABC-RF analysis 2.

(PDF)

Click here for additional data file.^{(2.6MB, pdf)}

S6 Fig. Models tested in DIYABC-RF analysis 3.

(PDF)

Click here for additional data file.^{(3.5MB, pdf)}

S7 Fig. The effect of sample size on the genetic diversity estimations.

(PDF)

Click here for additional data file.^{(64.9KB, pdf)}

Attachment

Submitted filename: Taliadoros_ResponseToReviewers.docx

Click here for additional data file.^{(19.2KB, docx)}

Attachment

Submitted filename: Taliadoros_2ndReviewerResponse.pdf

Click here for additional data file.^{(39.1KB, pdf)}

Data Availability Statement

[pgen.1010884.ref001] 1.Fellers JP, Sakthikumar S, He F, McRell K, Bakkeren G, Cuomo CA, et al. Whole-genome sequencing of multiple isolates of Puccinia triticina reveals asexual lineages evolving by recurrent mutations. G3: Genes, Genomes, Genetics. 2021;11. doi: 10.1093/g3journal/jkab219 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref002] 2.Islam MT, Croll D, Gladieux P, Soanes DM, Persoons A, Bhattacharjee P, et al. Emergence of wheat blast in Bangladesh was caused by a South American lineage of Magnaporthe oryzae. BMC Biol. 2016;14: 1–11. doi: 10.1186/s12915-016-0309-7 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref003] 3.Ali S, Gladieux P, Leconte M, Gautier A, Justesen AF, Hovmøller MS, et al. Origin, Migration Routes and Worldwide Population Genetic Structure of the Wheat Yellow Rust Pathogen Puccinia striiformis f.sp. tritici. PLoS Pathog. 2014;10: 1–13. doi: 10.1371/journal.ppat.1003903 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref004] 4.Liu Z, Ellwood SR, Oliver RP, Friesen TL. Pyrenophora teres: Profile of an increasingly damaging barley pathogen. Mol Plant Pathol. 2011;12: 1–19. doi: 10.1111/j.1364-3703.2010.00649.x [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref005] 5.Douiyssi A, Rasmusson DC, Roelfs AP. Responses of barley cultivars and lines to isolates of Pyrenophora teres. Plant Dis. 1998;82: 316–321. doi: 10.1094/PDIS.1998.82.3.316 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref006] 6.Louw JPJ, Victor D, Crous PW, Holz G, Janse BJH. Characterization of Pyrenophora Isolates Associated with Spot and Net Type Lesions on Barley in South Africa. Journal of Phytopathology. 1995;143: 129–134. doi: doi.org/10.1111/j.1439-0434.1995.tb00245.x [Google Scholar]

[pgen.1010884.ref007] 7.Zeleke T. Evaluation of host reaction and yield performance of malt barley cultivars to net blotch, Pyrenophora teres in Bale Highlands, Ethiopia. Journal of Plant Sciences. 2017;5: 43–47. doi: 10.11648/j.jps.20170501.16 [DOI] [Google Scholar]

[pgen.1010884.ref008] 8.Arabi MIE, Al-Safadi B, Charbaji T. Pathogenic variation among isolates of Pyrenophora teres, the causal agent of barley net blotch. Journal of Phytopathology. 2003. pp. 376–382. doi: 10.1046/j.1439-0434.2003.00734.x [DOI] [Google Scholar]

[pgen.1010884.ref009] 9.Sato K, Takeda K. Net blotch resistance in wild species of Hordeum. Euphytica. 1997;95: 179–185. doi: 10.1023/A:1002958924439 [DOI] [Google Scholar]

[pgen.1010884.ref010] 10.ARABI MI BARRAULT G, SARRAFI A, ALBERTINI L. Variation in the resistance of barley cultivars and in the pathogenicity of Drechslera teres f. sp. maculata and D. teres f. sp. teres isolates from France. Plant Pathol. 1992;41: 180–186. doi: 10.1111/j.1365-3059.1992.tb02336.x [DOI] [Google Scholar]

[pgen.1010884.ref011] 11.Tekauz A. Characterization and distribution of pathogenic variation in Pyrenophora teres f. teres and P. teres f. maculata from western Canada. Canadian Journal of Plant Pathology. 1990;12: 141–148. doi: 10.1080/07060669009501017 [DOI] [Google Scholar]

[pgen.1010884.ref012] 12.Moya P, Girotti JR, Toledo AV, Sisterna MN. Antifungal activity of Trichoderma VOCs against Pyrenophora teres, the causal agent of barley net blotch. J Plant Prot Res. 2018;58: 45–53. doi: 10.24425/119115 [DOI] [Google Scholar]

[pgen.1010884.ref013] 13.Cromey MG, Parkes RA. Pathogenic variation in Drechslera teres in New Zealand. New Zealand Plant Protection. 2003;56: 251–256. doi: 10.30843/nzpp.2003.56.6020 [DOI] [Google Scholar]

[pgen.1010884.ref014] 14.van Zeist W, Bakker. View of Archaeobotanical studies in the Levant: I. Neolithic sites in the Damascus Basin: Aswad, Ghoraife, Ramad. Palaeohistoria. 1982;24: 165–256. [Google Scholar]

[pgen.1010884.ref015] 15.Åberg E. Hordeum agriocrithon nova sp., a wild six-rowed barley. Ann R Agric Col Swed. 1938;6: 159–216. [Google Scholar]

[pgen.1010884.ref016] 16.Molina-Cano JL, Fra-Mon P, Salcedo G, Aragoncillo C, de Togores FR, García-Olmedo F. Morocco as a possible domestication center for barley: biochemical and agromorphological evidence. Theoretical and Applied Genetics. 1987;73: 531–536. doi: 10.1007/BF00289190 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref017] 17.Molina-Cano JL, Moralejo M, Igartua E, Romagosa I. Further evidence supporting Morocco as a centre of origin of barley. Theoretical and Applied Genetics. 1999;98: 913–918. doi: 10.1007/s001220051150 [DOI] [Google Scholar]

[pgen.1010884.ref018] 18.Tanno K, Taketa S, Takeda K, Komatsuda T. A DNA marker closely linked to the vrs1 locus (row-type gene) indicates multiple origins of six-rowed cultivated barley (Hordeum vulgare L.). Theoretical and Applied Genetics. 2002;104: 54–60. doi: 10.1007/s001220200006 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref019] 19.Riehl S. Barley in Archaeology and Early History. Oxford Research Encyclopedia of Environmental Science. 2019. doi: 10.1093/acrefore/9780199389414.013.219 [DOI] [Google Scholar]

[pgen.1010884.ref020] 20.McDonald BA, Stukenbrock EH. Rapid emergence of pathogens in agro-ecosystems: Global threats to agricultural sustainability and food security. Philosophical Transactions of the Royal Society B: Biological Sciences. 2016;371. doi: 10.1098/rstb.2016.0026 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref021] 21.Banke S, Mcdonald BA. Migration patterns among global populations of the pathogenic fungus Mycosphaerella graminicola. Mol Ecol. 2005;14: 1881–1896. doi: 10.1111/j.1365-294X.2005.02536.x [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref022] 22.Feurtey A, Lorrain C, McDonald MC, Milgate A, Solomon PS, Warren R, et al. A thousand-genome panel retraces the global spread and adaptation of a major fungal crop pathogen. Nat Commun. 2023;14. doi: 10.1038/s41467-023-36674-y [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref023] 23.Munkacsi AB, Stoxen S, May G. Ustilago maydis populations tracked maize through domestication and cultivation in the Americas. Proceedings of the Royal Society B: Biological Sciences. 2008;275: 1037–1046. doi: 10.1098/rspb.2007.1636 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref024] 24.Stukenbrock EH, Mcdonald BA. The Origins of Plant Pathogens in Agro-Ecosystems. Annu Rev Phytopathol. 2008;46: 75–100. doi: 10.1146/annurev.phyto.010708.154114 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref025] 25.Sotiropoulos AG, Arango-Isaza E, Ban T, Barbieri C, Bourras S, Cowger C, et al. Global genomic analyses of wheat powdery mildew reveal association of pathogen spread with historical human migration and trade. Nat Commun. 2022;13: 1–14. doi: 10.1038/s41467-022-31975-0 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref026] 26.Santini A, Liebhold A, Migliorini D, Woodward S. Tracing the role of human civilization in the globalization of plant pathogens. ISME Journal. 2018;12: 647–652. doi: 10.1038/s41396-017-0013-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref027] 27.Menardo F, Praz CR, Wyder S, Ben-David R, Bourras S, Matsumae H, et al. Hybridization of powdery mildew strains gives rise to pathogens on novel agricultural crop species. Nat Genet. 2016;48: 201–205. doi: 10.1038/ng.3485 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref028] 28.Thierry M, Charriat F, Milazzo J, Adreit H, Ravel S, Cros-Arteil S, et al. Maintenance of divergent lineages of the Rice Blast Fungus Pyricularia oryzae through niche separation, loss of sex and post-mating genetic incompatibilities. PLoS Pathog. 2022;18. doi: 10.1371/journal.ppat.1010687 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref029] 29.Derbyshire MC. Bioinformatic Detection of Positive Selection Pressure in Plant Pathogens: The Neutral Theory of Molecular Sequence Evolution in Action. Front Microbiol. 2020;11: 1–14. doi: 10.3389/fmicb.2020.00644 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref030] 30.Tellier A, Moreno-Gamez S, Stephan W. Speed of adaptation and genomic signatures in arms race and trench warfare models of host- parasite coevolution. Evolution (N Y). 2014;68: 73–78. [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref031] 31.Ebert MK, Rangel LI, Spanner RE, Taliadoros D, Wang X, Friesen TL, et al. Identification and characterization of Cercospora beticola necrosis-inducing effector CbNip1. Molecular Plant Pathology. 2021. pp. 301–316. doi: 10.1111/mpp.13026 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref032] 32.Duan G, Bao J, Chen X, Xie J, Liu Y, Chen H, et al. Large-Scale Genome Scanning within Exonic Regions Revealed the Contributions of Selective Sweep Prone Genes to Host Divergence and Adaptation in Magnaporthe oryzae Species Complex. Microorganisms. 2021;9: 562. doi: 10.3390/microorganisms9030562 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref033] 33.Richards JK, Stukenbrock EH, Carpenter J, Liu Z, Cowger C, Faris JD, et al. Local adaptation drives the diversification of effectors in the fungal wheat pathogen Parastagonospora nodorum in the United States. PLoS Genetics. 2019. doi: 10.1371/journal.pgen.1008223 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref034] 34.Pereira D, Oggenfuss U, McDonald BA, Croll D. Population genomics of transposable element activation in the highly repressive genome of an agricultural pathogen. Microb Genom. 2021;7. doi: 10.1099/mgen.0.000540 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref035] 35.Rovenich H, Boshoven JC, Thomma BPHJ. Filamentous pathogen effector functions: Of pathogens, hosts and microbiomes. Curr Opin Plant Biol. 2014;20: 96–103. doi: 10.1016/j.pbi.2014.05.001 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref036] 36.Sperschneider J, Dodds PN, Gardiner DM, Singh KB, Taylor JM. Improved prediction of fungal effector proteins from secretomes with EffectorP 2. 0. Mol Plant Pathol. 2018;9: 2094–2110. doi: 10.1111/mpp.12682 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref037] 37.Demirjian C, Vailleau F, Berthomé R, Roux F. Genome-wide association studies in plant pathosystems: success or failure? Trends Plant Sci. 2022;xx: 1–15. doi: 10.1016/j.tplants.2022.11.006 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref038] 38.Koladia VM, Richards JK, Wyatt NA, Faris JD, Brueggeman RS, Friesen TL. Genetic analysis of virulence in the Pyrenophora teres f. teres population BB25 × FGOH04Ptt-21. Fungal Genetics and Biology. 2017;107: 12–19. doi: 10.1016/j.fgb.2017.07.003 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref039] 39.Shjerve RA, Faris JD, Brueggeman RS, Yan C, Zhu Y, Koladia V, et al. Evaluation of a Pyrenophora teres f. teres mapping population reveals multiple independent interactions with a region of barley chromosome 6H. Fungal Genetics and Biology. 2014;70: 104–112. doi: 10.1016/j.fgb.2014.07.012 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref040] 40.Wyatt N, Richards J, Brueggeman R, Friesen T. Reference Assembly and Annotation of the Pyrenophora teres f. teres Isolate 0–1. GENES, GENOMES, GENETICS. 2018;8: 1–8. doi: 10.1534/g3.117.300196 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref041] 41.Bakonyi J, Justesen AF. Genetic relationship of Pyrenophora graminea, P. teres f. maculata and P. teres f. teres assessed by RAPD analysis. Journal of Phytopathology. 2007;155: 76–83. doi: 10.1111/j.1439-0434.2007.01192.x [DOI] [Google Scholar]

[pgen.1010884.ref042] 42.Wingfield BD, Berger DK, Coetzee MPA, Duong TA, Martin A, Pham NQ, et al. IMA genome-F17: Draft genome sequences of an Armillaria species from Zimbabwe, Ceratocystis colombiana, Elsinoë necatrix, Rosellinia necatrix, two genomes of Sclerotinia minor, short-read genome assemblies and annotations of four Pyrenophora teres isolate. IMA Fungus. 2022;13. doi: 10.1186/s43008-022-00104-3 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref043] 43.Ni M, Feretzaki M, Sun S, Wang X, Heitman J. Sex in Fungi. Annu Rev Genet. 2011;45: 405–430. doi: 10.1146/annurev-genet-110410-132536 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref044] 44.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. Journal of Computational Biology. 2012;19: 455–477. doi: 10.1089/cmb.2012.0021 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref045] 45.Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19: 1655–1664. doi: 10.1101/gr.094052.109 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref046] 46.Mantel N. The Detection of disease clustering and a generalized regression approach. Cancer Reaserch. 1967;27: 209–220. doi: 10.1016/0002-9343(48)90434-3 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref047] 47.Collin FD, Durif G, Raynal L, Lombaert E, Gautier M, Vitalis R, et al. Extending approximate Bayesian computation with supervised machine learning to infer demographic history from genetic polymorphisms using DIYABC Random Forest. Mol Ecol Resour. 2021;21: 2598–2613. doi: 10.1111/1755-0998.13413 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref048] 48.Nei M, Maruyama T, Chakraborty R. The bottleneck effect and genetic variability in populations. Evolution (N Y). 1975;29: 1–10. doi: 10.1111/j.1558-5646.1975.tb00807.x [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref049] 49.Slatkin M, Excoffier L. Serial founder effects during range expansion: A spatial analog of genetic drift. Genetics. 2012;191: 171–181. doi: 10.1534/genetics.112.139022 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref050] 50.Malaspinas AS, Westaway MC, Muller C, Sousa VC, Lao O, Alves I, et al. A genomic history of Aboriginal Australia. Nature. 2016;538: 207–214. doi: 10.1038/nature18299 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref051] 51.Komluski J, Habig M, Stukenbrock EH. Repeat-Induced Point Mutation and Gene Conversion Coinciding with Heterochromatin Shape the Genome of a Plant-Pathogenic Fungus. mBio. 2023;14. doi: 10.1128/mbio.03290-22 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref052] 52.Simões LG, Günther T, Martínez-Sánchez RM, Vera-Rodríguez JC, Iriarte E, Rodríguez-Varela R, et al. Northwest African Neolithic initiated by migrants from Iberia and Levant. Nature. 2023;618: 550–556. doi: 10.1038/s41586-023-06166-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref053] 53.Stamatakis A, Alachiotis N, Pavlidis P, Daniel Z. SweeD: Likelihood-Based Detection of Selective Sweeps in Thousands of Genomes. Mol Biol Evol. 2013;30: 2224–2234. doi: 10.1093/molbev/mst112 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref054] 54.Alachiotis N, Stamatakis A, Pavlidis P. OmegaPlus: a scalable tool for rapid detection of selective sweeps in whole-genome datasets. Bioinformatics. 2012;28: 2274–2275. doi: 10.1093/bioinformatics/bts419 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref055] 55.Alachiotis N, Pavlidis P. RAiSD detects positive selection based on multiple signatures of a selective sweep and SNP vectors. Commun Biol. 2018;1. doi: 10.1038/s42003-018-0085-8 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref056] 56.Stephan W. Selective Sweeps. 2019;211: 5–13. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref057] 57.Weiland JJ, Steffenson BJ, Cartwright RD, Webster RK. Identification of molecular genetic markers in Pyrenophora teres f. teres associated with low virulence on “Harbin” barley. Phytopathology. 1999;89: 176–181. doi: 10.1094/PHYTO.1999.89.2.176 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref058] 58.Poets AM, Fang Z, Clegg MT, Morrell PL. Barley landraces are characterized by geographically heterogeneous genomic origins. Genome Biol. 2015;16: 1–11. doi: 10.1186/s13059-015-0712-3 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref059] 59.Stukenbrock EH, Banke S, Javan-Nikkhah M, McDonald BA. Origin and domestication of the fungal wheat pathogen Mycosphaerella graminicola via sympatric speciation. Mol Biol Evol. 2007;24: 398–411. doi: 10.1093/molbev/msl169 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref060] 60.Couch BC, Fudal I, Lebrun MH, Tharreau D, Valent B, Van Kim P, et al. Origins of host-specific populations of the blast pathogen Magnaporthe oryzae in crop domestication with subsequent expansion of pandemic clones on rice and weeds of rice. Genetics. 2005;170: 613–630. doi: 10.1534/genetics.105.041780 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref061] 61.Schweizer G, Haider MB, Barroso G V., Rössel N, Münch K, Kahmann R, et al. Population Genomics of the Maize Pathogen Ustilago maydis: Demographic History and Role of Virulence Clusters in Adaptation. Genome Biol Evol. 2021;13: 1–17. doi: 10.1093/gbe/evab073 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref062] 62.Fuller QD, Weisskopf A. Encyclopedia of Global Archaeology, Barley: Origins and Development. 2014. doi: 10.1007/978-1-4419-0465-2_2168 [DOI] [Google Scholar]

[pgen.1010884.ref063] 63.Jones H, Civá P, Cockram J, Leigh FJ, Smith LMJ, Jones MK, et al. Evolutionary history of barley cultivation in Europe revealed by genetic analysis of extant landraces. BMC Evol Biol. 2011;11: 1–12. doi: 10.1186/1471-2148-11-320 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref064] 64.Kirleis W, Klooß S, Kroll H, Müller J. Crop growing and gathering in the northern German Neolithic: A review supplemented by new results. Veg Hist Archaeobot. 2012;21: 221–242. doi: 10.1007/s00334-011-0328-9 [DOI] [Google Scholar]

[pgen.1010884.ref065] 65.Hartmann FE, McDonald BA, Croll D. Genome-wide evidence for divergent selection between populations of a major agricultural pathogen. Molecular Ecology. 2018. pp. 2725–2741. doi: 10.1111/mec.14711 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref066] 66.Zhong Z, Chen M, Lin L, Han Y, Bao J, Tang W, et al. Population genomic analysis of the rice blast fungus reveals specific events associated with expansion of three main clades. ISME Journal. 2018;12: 1867–1878. doi: 10.1038/s41396-018-0100-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref067] 67.Dahanayaka BA, Vaghefi N, Knight NL, Bakonyi J, Prins R, Seress D, et al. Population Structure of Pyrenophora teres f. teres Barley Pathogens from Different Continents. Phytopathology. 2021;111: 2118–2129. doi: 10.1094/PHYTO-09-20-0390-R [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref068] 68.Stukenbrock EH, Banke S, McDonald BA. Global migration patterns in the fungal wheat pathogen Phaeosphaeria nodorum. Mol Ecol. 2006;15: 2895–2904. doi: 10.1111/j.1365-294X.2006.02986.x [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref069] 69.Li J, Wyatt NA, Skiba RM, Kariyawasam GK, Effertz K, Rehman S, et al. Pathogen genetics identifies avirulence/virulence loci associated with barley 1 chromosome 6H resistance in the Pyrenophora teres f. teres-barley interaction Biodiversity and Crop Improvement Program, International Center for Agricultural Orcid-IDs. doi: 10.1101/2023.02.10.527674 [DOI] [Google Scholar]

[pgen.1010884.ref070] 70.Bolger AM, Lohse M, Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30: 2114–2120. doi: 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref071] 71.Zhang J, Kobert K, Flouri T, Stamatakis A. PEAR: A fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics. 2014;30: 614–620. doi: 10.1093/bioinformatics/btt593 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref072] 72.Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010;26: 589–595. doi: 10.1093/bioinformatics/btp698 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref073] 73.Lunter G, Goodson M. Stampy: a statistical algorithm for sensitive and fast mapping of Illumina sequence reads. Genome Res. 2011. Genome Res. 2011;21:936–939: 936–939. doi: 10.1101/gr.111120.110 [doi] [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref074] 74.McKenna A, Hanna M, Banks E, Sivachenko AY, Cibulskis K, Kernytsky AM, et al. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20: 1297–1303. doi: 10.1101/gr.107524.110 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref075] 75.Bryant D, Moulton V. Neighbor-Net: An Agglomerative Method for the Construction of Phylogenetic Networks. Mol Biol Evol. 2004;21: 255–265. doi: 10.1093/molbev/msh018 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref076] 76.Wickham H. ggplot2: Elegant Graphics for Data Analysis. Journeal of Statistical Software. 2017;80: 1–4. doi: 10.18637/jss.v080.b01 [DOI] [Google Scholar]

[pgen.1010884.ref077] 77.Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2006;23: 254–267. doi: 10.1093/molbev/msj030 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref078] 78.Paradis E, Schliep K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019;35: 526–528. doi: 10.1093/bioinformatics/bty633 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref079] 79.Korneliussen TS, Albrechtsen A, Nielsen R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinformatics. 2014;15: 1–13. doi: 10.1186/s12859-014-0356-4 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref080] 80.Zhang C, Dong SS, Xu JY, He WM, Yang TL. PopLDdecay: A fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. 2019;35: 1786–1788. doi: 10.1093/bioinformatics/bty875 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref081] 81.Danecek P, Auton A, Abecasis G, Albers CA, Banks E, Depristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27: 2156–2158. doi: 10.1093/bioinformatics/btr330 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref082] 82.Weir BS, Cockerham CC. Estimating F-Statistics for the Analysis of Population Structure. Evolution (N Y). 1984;38: 1358–1370. doi: 10.1111/j.1558-5646.1984.tb05657.x [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref083] 83.Lu S, Platz GJ, Edwards MC, Friesen TL. Mating type locus-specific polymerase chain reaction markers for differentiation of Pyrenophora teres f. teres and P. teres f. maculata, the causal agents of barley net blotch. Phytopathology. 2010;100: 1298–1306. doi: 10.1094/PHYTO-05-10-0135 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref084] 84.Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: Architecture and applications. BMC Bioinformatics. 2009;10: 1–9. doi: 10.1186/1471-2105-10-421 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref085] 85.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol Biol Evol. 2013;30: 772–780. doi: 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref086] 86.Okonechnikov K, Golosova O, Fursov M, Varlamov A, Vaskin Y, Efremov I, et al. Unipro UGENE: A unified bioinformatics toolkit. Bioinformatics. 2012;28: 1166–1167. doi: 10.1093/bioinformatics/bts091 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref087] 87.Shen W, Le S, Li Y, Hu F. SeqKit: A cross-platform and ultrafast toolkit for FASTA/Q file manipulation. PLoS One. 2016;11: 1–10. doi: 10.1371/journal.pone.0163962 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref088] 88.Nguyen LT, Schmidt HA, Von Haeseler A, Minh BQ. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32: 268–274. doi: 10.1093/molbev/msu300 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref089] 89.Kalyaanamoorthy S, Minh BQ, Wong TKF, Von Haeseler A, Jermiin LS. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14: 587–589. doi: 10.1038/nmeth.4285 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref090] 90.Hoang DT, Chernomor O, Von Haeseler A, Minh BQ, Vinh LS. UFBoot2: Improving the ultrafast bootstrap approximation. Mol Biol Evol. 2018;35: 518–522. doi: 10.1093/molbev/msx281 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref091] 91.Hoang DT, Vinh LS, Flouri T, Stamatakis A, Von Haeseler A, Minh BQ. MPBoot: Fast phylogenetic maximum parsimony tree inference and bootstrap approximation. BMC Evol Biol. 2018;18: 1–11. doi: 10.1186/s12862-018-1131-3 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref092] 92.Schrempf D, Minh BQ, De Maio N, von Haeseler A, Kosiol C. Reversible polymorphism-aware phylogenetic models and their application to tree inference. J Theor Biol. 2016;407: 362–370. doi: 10.1016/j.jtbi.2016.07.042 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref093] 93.Hudson RR. Generating samples under a Wright–Fisher neutral model ofgenetic variation. Bioinformatics. 2002;18: 337–338. doi: 10.1093/bioinformatics/18.2.337 [DOI] [PubMed] [Google Scholar]

[pgen.1010884.ref094] 94.Kim Y, Neilsen R. Linkage disequilibrium as a signature of selective sweeps. Genetics. 2004;167: 1513–1524. doi: 10.1534/genetics.103.025387 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref095] 95.Nielsen R, Williamson S, Kim Y, Hubisz MJ, Clark AG, Bustamante C. Genomic scans for selective sweeps using SNP data. Genome Rese. 2005;15: 1566–1575. doi: 10.1101/gr.4252305 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref096] 96.Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, et al. Circos: An information aesthetic for comparative genomics. Genome Res. 2009;19: 1639–1645. doi: 10.1101/gr.092759.109 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref097] 97.Clare SJ, Wyatt NA, Brueggeman RS, Friesen TL. Research advances in the Pyrenophora teres–barley interaction. Mol Plant Pathol. 2020;21: 272–288. doi: 10.1111/mpp.12896 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pgen.1010884.ref098] 98.Quinlan AR, Hall IM. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26: 841–842. doi: 10.1093/bioinformatics/btq033 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Emergence and spread of the barley net blotch pathogen coincided with crop domestication and cultivation history

Demetris Taliadoros

Alice Feurtey

Nathan Wyatt

Benoit Barrès

Pierre Gladieux

Timothy L Friesen

Eva H Stukenbrock

Roles

Abstract

Author summary

Introduction

Results

Generation of a population genomic dataset of P. teres f. teres

Phylogenetic relationship of Pyrenophora species from barley and other grass hosts

Fig 1. Collection of Pyrenophora teres isolates across continents for the inference of pathogen population structure and dispersal.

The Caucasian population of P. teres f. teres exhibits higher nucleotide diversity

Table 1. Summary statistics of the genetic clusters of P. teres f. teres.

Populations of P. teres f. teres are geographically structured

Fig 2. Global population structure of P. teres f. teres.

Phylogenomic analysis suggests an ancient split of the North African P. teres f. teres population and a Caucasian origin of the North American population

Fig 3. Evolutionary relationship between P. teres f. teres populations.

The origin and dispersal of P. teres f. teres correlates with the early history of barley cultivation

Scenarios of family 1: Early divergence of Middle East, Caucasus, North African populations

Fig 4. Inference of the demographic history of P. teres f. teres populations shows an ancient split coinciding with barley domestication and early migration.

Scenarios of family 2: Founding of the European population through migration from the Middle East

Scenarios of family 3: Origin of the North American population

Parameter inference analysis with DIY-ABC

Table 2. Results for DIYABC-RF estimation of population divergence times normalized over the most ancestral sampled population N5 under scenario 9 detailed in Fig 4.

A severe demographic bottleneck in the history of the crop pathogen coincided with the domestication of barley

Recent positive selection has shaped genomic regions encoding putative virulence-related genes

Fig 5. Distribution of selective sweeps across the genome in five P. teres f. teres. populations.

Table 3. Summary of total and unique selective sweep regions, as well as, genes and effectors located in the regions per populations. The number of sweep regions, total genes, and effectors only identified in one population are characterized as unique.

Discussion

Fig 6. Invasion scenario of P. teres f. teres based on our population and phylogenomic analyses and the known history of the host.

Conclusion

Material and methods

Genome data

Read mapping and variant calling

Population genetic structure

Genetic diversity, neutrality tests and linkage disequilibrium

Mating types

Phylogenetic reconstruction

Inference of the demographic history of P. teres f. teres populations

Genome scans for selective sweeps

Co-localization of selective sweep regions, QTLs and predicted effectors

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Gregory P Copenhaver

Roles

Author response to Decision Letter 0

Decision Letter 1

Gregory P Copenhaver

Roles

Author response to Decision Letter 1

Decision Letter 2

Gregory P Copenhaver

Roles

Acceptance letter

Gregory P Copenhaver

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases