Abstract
Population genetic structure of agricultural pests can be impacted not only by geographic distance and the broader ecological and physical barriers but also by patterns related to where crops are produced and how they are moved after harvest. Stored-product pests, for instance, specialize in exploiting grains such as wheat and rice from on-farm storage through transportation to final processing at often geographically distant locations; therefore human-aided movement may impact their dispersal. Although stored product insects are associated with stored grain, they can also exploit resources in the surrounding environments so different ecological regions where the grain is grown and stored may also influence population structure. Here we used 1,156 SNP markers to investigate how geographic distance, ecological and agricultural variables can impact the genetic structure and gene flow of the stored food pest beetle Rhyzopertha dominica. We found a substantial degree of admixture between weakly structured populations in the US. Ecological regions were more important in explaining R. dominica population structure than crop type, suggesting insect movement between wheat and rice grain distribution channels. We have also found a significant correlation between the genetic and geographical distance (i.e., isolation by distance). However, our modeling approach combining the ecological and management variables has highlighted the importance of the volume of grain received by a location in the dispersal dynamics of the pest. The first-generation migrant analysis offered additional supported to movement over great distances that are likely associated with grain movement. Our data suggest that a multitude of factors play small but significant parts in the movement dynamics of the pest. The beetles can take advantage of the source-sink dynamic of grain movement in the US, but also engage in a high rate of movement at the local scale. Understanding population structure for R. dominica will provide insights into the potential for local processes of adaptation and broader patterns of movement that will impact management programs and the potential for spread of resistance genes.
Keywords: population genetics, GBS, dispersal facilitation, stored pest, Rhyzopertha dominica
Population structure of agricultural pests is likely impacted not only by geographic distance and the broader ecological and physical barriers but also by patterns related to where crops are produced and how they are moved after harvest. For example, grain commodities such as wheat and rice are typically grown in specific geographic regions where proper agronomic and environmental conditions can be found for the plant to develop. After harvest, the grain is often transported and stored multiple times, often over considerable geographic distances. Unwantedly, there is a community of organisms, including a number of significant stored product insect pests that can exploit the sizeable artificial resource patches created by storage of grain after harvest and are effective at finding and infesting these patches. The movement and storage of grain as it moves from producers to consumers generates a complex network of sources and receivers of grain that has the potential to facilitate the movement of associated pests (Dias 1996; Nopsa et al. 2015).
Source-sink dynamic predictions are primarily based on the differences in patch quality and availability, with higher quality and more available resources found in the source locations and lower quality and less available supply in the sink locations (Dias 1996). In theory, individuals in a source population have a higher rate of population growth due to more opportunities to find suitable habitat where they can feed and reproduce, and the opposite is expected for sink populations (Diffendorfer 1998). Over time, the source-sink model predicts that the source habitat is a net exporter of individuals and the sink is a net importer, with the assumption that dispersal is constrained so that individuals cannot be free to disperse to any or all possible locations (Holt 1997; Dias 1996; Diffendorfer 1998). This concept of source-sink has been applied to the network of grain storage and transportation and its potential impact on pests and pathogens associated with the grain (Nopsa et al., 2015), but the actual implications of grain production and transportation on pest populations has not been evaluated. Areas with large amounts of grain production and storage are likely to be sources of grain to areas that have limited grain production and storage (i.e., sink locations). The human designed grain production system and its network of storage and movement can facilitate the long-range dispersal of some insects (Drury et al. 2009; Semeao et al. 2012; Nopsa et al. 2015). The impact of this network can be extended to local adaptation, allelic diversity, development of resistance to pesticides, and the range of geographic distribution of the pest insects. This network is also fundamentally different for wheat and rice crops. Wheat is generally produced over a larger area and transported much further distances before processing compared with rice, which tends to be stored and milled within the same local area. Since there is limited overlap in the production areas and transportation network, populations of insects associated with each of these grains may be less likely to overlap.
One of the major insect pests of stored grain throughout the world is Rhyzopertha dominica, also known as the lesser grain borer (Potter 1935). Grain such as wheat and rice typically become colonized by R. dominica after it is harvested and stored, with infestations coming either from remaining populations on-site or movement of beetles into the storage by their flight dispersal or transportation in grain and grain handling equipment. This species can also be found in natural areas feeding on fruit seeds, shrub, dry-wood, and timber (Potter 1935; Wright et al. 1990; Jia et al. 2008); however, the influence of natural habitats on dynamics in grain storage is not well understood (Mahroof and Phillips 2007; Ching’Oma 2006). This small beetle is a strong flier, and cross-infestation among storage sites or from natural areas into storage sites is possible (Edde et al. 2012; Ching’Oma 2006), but little is known about their long-range dispersal ability (Sinclair and Haddrell 1985; Dowdy and McGaughey 1994; Toews et al. 2006).
One can find R. dominica throughout the United States, and it is established in all major wheat and rice production areas, although less abundant and less severe in the northern parts of the continent due to its limited cold tolerance (Edde 2012). Due to the broad geographical distribution of grain production, R. dominica is present in many different ecoregions (i.e., geographically defined regions that are ecologically and environmentally similar) within the United States, including diverse habitats such as prairies, plains, mountains, valley, highlands, hills, and lowlands (EPA 1997). While the environment within grain storages, where insects can potentially spend many generations, is very homogeneous across these ecoregions, leading to more homogeneity in populations, local population genotype and dispersal ability may still be impacted by the surrounding environment. Ecoregions may also confound the effects of geographic distance and selection due to the influence of local conditions.
Our primary objective in this study was to estimate the relative importance of natural features of the environment (ecoregions), elements of the crop production system (wheat vs. rice), geographical distance, and human-aided movement (rail transportation network) on R. dominica population structure within the United States. To accomplish that, we collected beetles from different wheat and rice areas within the same season and conducted a hierarchical approach to test for possible structure-causing factors (Johnson and Omland 2004; Wagenmakers and Farrell 2004). As the association between R. dominica and stored grain is clearly undesirable, our results may help in the design of better monitoring tools and management strategies.
Materials and Methods
Sampling
We used R. dominica beetles collected at 11 distinct locations within wheat, rice, and wheat + rice production areas and within ten different ecoregions (EPA 1997) in the United States (Figure 1, Table 1). Ecoregions are geographical and ecological units that share similar features of environment, fauna, and flora (Bailey 2005). Sampling was done between July and November 2013, which is the period of the year when most flight activity is reported for this species (Edde et al. 2006; Toews et al. 2006). Six pheromone-baited delta traps (Scentry Biologicals Inc., Billing, MT) were deployed at each location targeting flying insects in the field near storage sites. The insect traps consisted of a cardboard sheet folded in a triangle shape with inner surfaces coated with sticky glue and containing a rubber septum lure impregnated with synthetic pheromone (Trece Inc., Adair OK) inside the trap. The traps were placed at least 10 m apart from each other and where possible close to grain storage sites such as grain bins, grain elevators, and warehouses. The pheromone lure within the traps can attract both male and female adults flying outdoors (Leos-Martinez et al. 1986; Toews et al. 2006). The traps were placed in the field for a week before being collected and for most locations shipped back to our lab where beetles were processed. For processing, the beetles were first carefully removed from the sticky glue inside the trap with forceps and transferred to a 15 ml tube containing 95% EtOH. A histological cleaning agent (Histo-Clear II, National Diagnostics; Atlanta, GA) was added to the tube and tube vortexed to remove residual glue from the beetles. The beetles were then transferred to a 1.5 ml tube with 95% EtOH and stored at -80° until DNA extraction. The number of beetles collected at each location varied due to both the number captured in traps and the integrity of the body after processing the sample. Due to variation in time of beetle capture and environmental conditions during the week the traps were deployed, some beetles became dehydrated which can decrease DNA extraction yield. Beetles that did not remain intact after processing were discarded to help reduce this effect.
Figure 1.
Rhyzopertha dominica sampling locations in the United States. (A) Ecoregions (legend refer only to ecoregions sampled according to EPA classification) and (available at https://www.epa.gov/eco-research/ecoregions-north-america) (B) wheat (green) or rice (blue) crop region. Map displaying ranges of all wheat and rice harvested for grain as a percent (%) of harvested cropland in 2012. Darker colors indicate a higher percent of the cropland acreage as all wheat and rice harvested for grain (source: NASS - USDA National Agricultural Statistic Service 2014). The locations LA and AR overlap wheat and rice areas. Source for the winter wheat and rice production are available at www.nass.usda.gov/Charts_and_Maps/Crops_County/.
Table 1. Sampling location information on ecoregion, crop type and number of individuals successfully genotyped (NGEN). R= rice, W= wheat.
| Ecological regions | Locations | Code | Lat | Long | Crop | NGEN |
|---|---|---|---|---|---|---|
| South Central Plains | Alexandria, LA | LA | 31.2928 | −92.4592 | R|W | 7 |
| Western Gulf Coastal Plain | Beaumont, TX | TX | 30.0649 | −94.3325 | R | 29 |
| Flint Hills | Manhattan, KS | MKS | 39.1917 | −96.5917 | W | 7 |
| South central semiarid | Hudson, KS | HKS | 38.0608 | −97.9297 | W | 11 |
| Western Corn Belt Plains | Fairview, KS | FKS | 39.8464 | −95.7337 | W | 5 |
| Central California Valley | Parlier, CA | CA | 36.6117 | −119.526 | W | 42 |
| Mississippi V. Loess Plains | Jonesboro, AR | AR | 35.8281 | −90.6942 | R|W | 30 |
| South central semiarid | Stillwater, OK | OK | 36.1157 | −97.0586 | W | 11 |
| Lake Agassiz Plain | Fargo, ND | ND | 46.8772 | −96.7894 | W | 43 |
| Southeastern Plains | Orangeburg, SC | SC | 33.4969 | −80.8622 | W | 21 |
| Columbia Plateau | Spokane, WA | WA | 47.6589 | −117.425 | W | 4 |
DNA extraction and library preparation
Genomic DNA was extracted from individual R. dominica using the DNeasy kit (Qiagen, Valencia, CA) following the manufacturer recommended protocol. We eluted the DNA using distilled water and use a vacuum concentrator process (Speed Vac SC110, Savant) to concentrate samples to 50 μl DNA. RAD-sequencing (restriction site-associated DNA sequencing) libraries were prepared following Saintenac et al. (2013). Two libraries containing 80 individuals and one containing 96 individuals barcoded with unique nucleotide sequences were used. Complexity-reduced genomic libraries were prepared using the combination of two restriction endonucleases (RE), PstI (CTGCAG) and MseI (AATT), to create the genomic DNA fragments. After size selection targeting ∼300bp DNA fragment length, the size distribution of DNA fragments in the genomic libraries and the presence of contaminating adaptor peaks were tested on a Bioanalyzer (Agilent 2100), and real-time PCR was used to quantify libraries. Three libraries (pool of 80, 80, and 96 individuals) were diluted to 10 nM concentrations and sequenced on three lanes of Illumina HiSeq2000 (100 bp single-end read run) at the University of Kansas Genome Sequencing Core facility.
Genotyping
Rhyzopertha dominica libraries were demultiplexed, separating individual beetles using the process_rad-tags script in STACKS v.1.44 (Catchen et al. 2013). The SNPs were identified on the 100 bp barcoded reads then filtered for overall quality (Ilut et al. 2014). A maximum of two nucleotide mismatches (M = 2) among reads with potentially variable sequences was the parameter used for the formation of RAD loci (Hohenlohe et al. 2011, Ilut et al. 2014; Benestan et al. 2015). The following filtering steps were taken using the population module of STACKS. We used RAD tags with a minimum stack depth (m) of 3 as this value has shown to perform well under other combinations of stack parameters (Hohenlohe et al. 2010, Hohenlohe et al. 2011, Rochette and Catchen 2017). Filtering was set to retain SNPs genotyped that were at least 85% of the individuals within locations and 100% of the among sampling locations (11 locations). After performing a series of preliminary tests combining different parameters for missing data allowance between and within the population (-p 11 and -r 0.85), the selected parameters were shown to be best in minimizing overall missing data and maximizing SNP retention for downstream analyses. We have also removed markers showing heterozygosity greater than 0.5 within samples to avoid potential paralogous loci (Hohenlohe et al. 2011; Benestan et al. 2015, Garcia-Elfring et al. 2017). The minor allele frequency (MAF) filtered out alleles with a frequency less than 5% (MAF< 0.05) to remove rare variants that have been shown to produce a biased estimate for population connectivity and population structure (Roesti et al. 2012; Linck and Battey 2017) (Table S2). Details of the number of SNPs kept after the filtering step were stored in variant call format (VCF), genepop, and structure files. Output files were converted when necessary into other file formats using PGDSPIDER 2.0 (Lischer and Excoffier 2012).
Nucleotide diversity, population history, and migration inferences
Standard diversity indices for each location using SNP sets consisting of variant and invariant sites and genetic diversity analysis were calculated using the population module in STACKS. The number of variant sites, the number of polymorphic sites, effective number of alleles, the observed (Ho) and expected heterozygosity (He), inbreeding coefficient (FIS), and nucleotide diversity (π) across the genome were estimated for each sampled location. We have calculated allelic richness (AR) and private allelic richness (PAR) for each genetic cluster using rarefaction with standardized sample size of 8, equivalent to 4 diploid samples, with a tolerance of 15% missing data using ADZE 1.0 (Szpiech et al. 2008).
An assignment analysis was conducted, and the origin of each beetle was inferred by the calculation of the likelihood of a specific genotype to be found in a given population according to allele population frequencies using GENODIVE (Paetkau et al. 1995; Cornuet et al. 1999). Because the migration estimation methods based on Nm derived from FST can lack precision (Whitlock and McCauley 1999), the assignment method can be a viable alternative to infer migration using Monte Carlo resampling, in particular, first-generation migrants (Cornuet et al. 1999; Paetkau et al. 2004). The individual‐based assignment tests were performed on the multilocus genotype of each beetle using the assembly of individuals collected at the same place as a reference. We used the same set of markers as for the other analyses (1156 SNPs) that were shown to be within the optimum range for population assignment analysis (Benestan et al. 2015). We used a replacement rate of 0.005 (Paetkau et al. 2004) and Monte Carlo resampling for 10,000 permutations (Cornuet et al. 1999) and tested using the likelihood ratio threshold of all 11 locations at 0.002 alpha value.
Population clusters:
Population structure was tested using nested analysis of molecular variance (AMOVA) using 99,999 permutations with ARLEQUIN software in two hierarchical levels (Excoffier et al. 1992). The first hierarchical level tested for differences between crop group regions (rice or wheat) and the second level tested for differences between ecoregion within the crop group (Table 1). To visualize the level of gene flow between sampled locations and test number of clusters using genetic partitioning we used the software STRUCTURE v2.3.4 (Pritchard et al. 2000; Falush et al. 2003) using Bayesian clustering method with 1156 markers in STACKS. We used 1.5 × 10−5 burn-in iterations followed by 5 × 10−6 Markov Chain Monte Carlo (MCMC) steps. The analysis considered admixture model (i.e., individuals may have mixed ancestry) and correlated allele frequencies (i.e., frequencies in the different populations are likely to be similar). No prior information on the sampling location was included in the analysis. We have tested clusters from K = 1 to K = 12 with 10 replications. STRUCTURE output was analyzed using the program CLUMPP 1.1.2 (Jakobsson and Rosenberg 2007) and visualized using DISTRUCT (Rosenberg 2004). Most likely K values were selected with the Evanno method implemented in Structure Harvester (Earl and von Holdt 2012). We additionally performed a Principal Component Analysis (PCA) that is a non-model-based method to visualize possible factors that could explain grouping patterns (Zheng et al. 2012).
FST and Isolation by Distance
Pair-wise FST analysis in Arlequin 3.5.2.2 was used to measure levels of genetic differentiation between two populations. We also assessed the population average pairwise matrix of Slatkin linearized FST, population average pairwise differences, and population average pairwise matrix of M values (M = 2Nm). We used RColorBrewer and igraph packages to visualize the FST relations. We performed Mantel test with 10,000 permutations to correlate the genetic distances (FST) matrix with the geographic distance matrix using the R libraries ecodist (Goslee and Urban 2007) and ade4 (Dray and Dufour 2007).
Model Selection
We used a model selection approach based on the Akaike’s Information Criterion (AIC, Wagenmakers and Farrell 2004) that allows several competing hypotheses to be simultaneously tested to find the single best model or a small set of very likely models (Johnson and Omland 2004). We selected and examined explanatory variables that can help us to predict the mean effective migrants exchanged by locations. Explanatory variables included geographical, bioclimatic, crop, and grain transport network variables. For geographic variables, we tested the relative effect of the average distance to the other locations (‘distance’) (Table S11). For the bioclimate variables, the PCA scores from the 19 WorldClim variables were used (Hijmans et al. 2005) (Table S12). For agriculture variables, we included crop production information such as the average wheat or rice acreage within a 50 km radius buffer from each sampling point. Buffer radius was selected based on the relative size of the zoned producing region and the ability of the insect to disperse (∼5 km) (Edde 2012). For instance, 50 km is large enough to encompass most regional rice-producing clusters but small enough to prevent significant overlap between landscapes. Landscapes were extracted from the wheat, and rice crop maps using ARGIS ESRI ArcMap v.10.0 and the average yield production for the state where the samples were collected (NASS - USDA National Agricultural Statistic Service 2014) (Table S13). For grain transportation variables, we tested how grain movement by railroads can potentially explain the observed variation in genetic parameters. We tested the volume of wheat received from other locations (‘received’), the volume of wheat shipped to other locations (‘shipped’), incoming and outgoing degree for each state, and betweenness centrality (‘centrality’). States with high incoming node degree are potentially important sinks (‘sink’) whereas those with high outgoing node degree are potentially important sources (‘source’). Betweenness centrality is the number of shortest paths going through a node in the rail network as a measurement of connectivity. A node with high betweenness centrality has a significant influence on the transfer of items through the system (Prater et al. 2013; Nopsa et al. 2015). Because available data on grain transportation is often summarized by state, we performed the analysis combining the three Kansas locations into a single value.
We standardized response and predictor variables (i.e., converting to z-scores) prior to analysis so the beta could be interpreted as the standardized partial regression coefficient or beta weight (Abdi 2004). Beta weights can be compared and account for the relative contribution of each covariate present in the model. For each response variable, we tested the individual and additive effect of all covariates within a group of variables and then between groups of variables comparing the 200 top candidates. We selected models whose Corrected Akaike’s Information Criterion (AICc) values were less than two units away from the best model. The two-unit deviation criterion used to eliminate less plausible models from a smaller set of likely ones is somewhat arbitrary and therefore should not be used as an automatic cutoff. (Anderson 2007). In addition to the AICc ranked values, we have also chosen the best candidates according to model probabilities or Akaike weights (wi) (Johnson and Omland 2004). To separate the most import variables from less important ones, we used three criteria: a cutoff of 0.8 under the weight criterium, the frequency the variable appears in the top models, and beta weights (Burnham and Anderson 2003; Anderson 2008). We used glmulti (Calcagno and de Mazancourt 2010) package in R to perform model selection analyses using the exhaustive screening algorithm (Table S14).
Data availability
A detailed supplementary document was prepared and made available at FigShare and GitHub at github.com/cordeiroemg. Tables and figures contain all the necessary information to reproduce the results presented here. Also, a fasta, genepop and VCF file containing all SNP information was made available. Supplemental material available at Figshare: https://doi.org/10.25387/g3.7242893.
Results
SNP discovery and data processing
The average number of sequence reads among the 3 libraries was 232.3 million (Library-1: 233.94, Library-2: 237.7, and Library-3: 225.19, Table S1) and the average percentage of quality-filtered reads (≥Q30) per library was 91.13% (Library-1: 92.3, Library-2: 90.8, and Library-3: 90.3), giving an average depth of coverage per individual over all SNPs of 35x (Figure S1). Average yield per library was 20.13 GB. Eighteen beetles (7% of the total) had insufficient mean coverage (<5x) and were removed from further analysis. After filtering steps, 45,7018 SNPs were retained before population module in STACKS when it was filtered down to 1,156 SNP markers.
Nucleotide diversity, population history, and migration inferences
We had not found significant differences in nucleotide diversity among sampled locations when diversity parameters were calculated using only variant positions (Table 2) or using all positions (variant and fixed) (Table S4). Similar trend has been found for the rarefied allelic diversity and the rarefied private allelic richness indicating a lack of difference in genetic diversity among populations (Table 2). Considering all locations, the observed heterozygosity (H0) was 0.234 ± 0.137, the expected heterozygosity (HE) was 0.259 ± 0.136, and inbreeding coefficient (FIS) was 0.133 indicating fewer heterozygotes than expected (Supplementary material: Table S3 to S5).
Table 2. Genetic diversity statistics of Rhyzopertha dominica population from nine United States locations estimated from RADseq data (209 individuals and 1156 loci included) for only variant nucleotide positions; site polymorphisms, H0 observed heterozygosity, HE expected heterozygosity, inbreeding coefficient (FIS) (mean ± 95% C.I.), nucleotide diversity (π) (mean ± 95% C.I.), rarefied allelic richness (AR) and rarefied private allelic richness (PAR) using a maximum standardized sample size of 8.
| Sample location | H0 | HE | FIS | π | AR | PAR |
|---|---|---|---|---|---|---|
| Louisiana | 0.227 | 0.225 | 0.041 ± 0.02 | 0.243 ± 0.01 | 1.640 ± 0.011 | 0.002 ± 0.001 |
| Texas | 0.219 | 0.234 | 0.062 ± 0.06 | 0.239 ± 0.01 | 1.616 ± 0.009 | 0.003 ± 0.001 |
| Kansas | 0.228 | 0.232 | 0.055 ± 0.01 | 0.250 ± 0.01 | 1.673 ± 0.008 | 0.004 ± 0.002 |
| California | 0.195 | 0.236 | 0.156 ± 0.08 | 0.239 ± 0.01 | 1.626 ± 0.009 | 0.003 ± 0.001 |
| Arkansas | 0.217 | 0.246 | 0.114 ± 0.07 | 0.251 ± 0.01 | 1.658 ± 0.008 | 0.003 ± 0.001 |
| Oklahoma | 0.219 | 0.238 | 0.091 ± 0.02 | 0.250 ± 0.01 | 1.660 ± 0.010 | 0.002 ± 0.001 |
| North Dakota | 0.228 | 0.251 | 0.090 ± 0.07 | 0.254 ± 0.01 | 1.667 ± 0.008 | 0.003 ± 0.001 |
| South Carolina | 0.235 | 0.245 | 0.054 ± 0.05 | 0.251 ± 0.01 | 1.655 ± 0.009 | 0.003 ± 0.001 |
| Washington | 0.226 | 0.208 | 0.028 ± 0.00 | 0.238 ± 0.01 | 1.589 ± 0.014 | 0.002 ± 0.001 |
The pair WA and TX showed the greatest pairwise nucleotide difference between populations (πXY - (πX + πY)/2) followed by WA and LA and WA and SC (Table S6). The population pair with the smallest overall number of differences (i.e., more similar to all other locations) was KS and OK. The population pair with the largest number of pairwise differences within the population (πX) (i.e., more heterogeneous) was KS and WA whereas the population with the least amount of within population differences (i.e., more homogenous) were CA and TX (Table S6). The estimated number of effective migrants Nm varied from 35.6 beetles between KS and ND to 5.1 beetles between CA and WA. The places that in average exchanged most migrants per generation were KS (24.5 beetles) and OK (23.2 beetles) whereas the places that exchanged the least were WA (5.9 beetles) and TX (9.1 beetles) (Table S9). The assignment analysis successfully assigned the majority of beetles to their correct collection location (78%), but there was considerable variation among locations (range: 0% LA to 98% CA). Beetles from SC, ND, TX, and CA were more often assigned to their correct locations, while LA and one of the KS populations (FKS) did not have any beetles assigned correctly. The high heterogeneity (high values for pairwise nucleotide difference within the population, πX) of central locations such as KS and OK can explain the low assignment rate found in those populations whereas TX, CA, SC were more homogenous. Two beetles were flagged as putative first-generation migrants: one beetle collected in Beaumont, TX, migrating from Jonesboro, AR, (Lh/Lmax (Threshold)= 173 (28.7)) and one collected in Orangeburg, SC, migrating from Parlier, CA (Lh/Lmax (Threshold)= 222.3 (65.5)). This result gives us strong evidence of a long-distance movement that is likely to be human-aided.
Population structure and clustering
The Analysis of Molecular Variance (AMOVA) revealed a small but significant degree of genetic differentiation among ecoregion (Table 1) within the crop groups (FSC= 0.035, P = 0.001; Table 3), and between crops groups of wheat and rice (FCT= 0.006, P = 0.001; Table 3), using 1,156 SNP markers. The more substantial portion of the variance was allocated to the difference within groups (ecoregions) compared to differences between groups (crop type).
Table 3. FST variance within populations, FSC variance among populations within crop groups, FCT variance among crop groups (wheat vs. rice). The AR and LA location were assigned as rice location as that was the predominant crop.
| Source of variation | d.f. | SS | VC | % |
|---|---|---|---|---|
| Among group | 1 | 269.487 | 0.42117 | 0.64 |
| Among populations within groups | 7 | 1117.794 | 2.26467 | 3.45 |
| Within populations | 409 | 25759.637 | 62.98200 | 95.91 |
| Total | 417 | 27146.919 | 65.66783 |
Model-based and non-model-based cluster analyses agreed that optimum partition is two (Figure 2). The Evanno method estimated K = 2 as the most likely number of genetic clusters and the second most probably partition is K = 7 indicating a weaker but a possible level of substructure (Figure 2, Figure S2 and Table S10). Furthermore, evaluating other K values, it is possible to differentiate sampled locations as we increase K values (Figure S3). Genetic differentiation between wheat and rice regions was not apparent in the STRUCTURE analysis (Figure 2, Figure S3). The PCA analysis also reveals 2 clusters with a small intersection that could be linked to crop region (Figure S4) or the grain transport dynamic (Figure S5) even though the variance captured by the first principal components was relatively small.
Figure 2.
Structure plot of R. dominica across the United States. Vertical bars represent individuals whose genotype have been portioned into 2 and 7 clusters. Below each location is the predominate crop at the sampled region. Colors in pie charts represent the percentage of assignment (Q value) of each group).
FST and Isolation by Distance
There was a significant association between genetic and geographic distances (Mantel’s r = 0.6, P = 0.009) considering all pairwise comparisons (Figure 3) and therefore evidence for isolation by distance (IBD). The level of genetic differentiation in pairwise comparisons varied 11-fold between the smallest and the highest degree of differentiation (FST= 0.0101-0.108, Figure 4, Table S7, Table S8). The overall pattern also shows the formation of the two groups. The first group was formed by LA, SC, OK, ND, AR, and KS indicating that most locations the ‘Central Plains’ and ‘Coastal Plains’ are highly connected (Figure 4). The locations TX, CA formed the second group, and WA showing more differentiation in locations in the extremes of the range. By IBD showed in previous analyses, network analysis also reveals the importance of producers and consumers of grain in the population dynamics. Central nodes, which represent the largest wheat producers (OK, ND, and KS) and therefore sources of grain for other locations, are very close genetically to one another (Figure 5). Locations showing great genetic distance, on the other hand, tended to be associated with smaller producers and receivers of grain from source locations (Figure 5).
Figure 3.
Isolation by distance (IBD) of R. dominica population. Analysis was based upon the correlation between genetic distance (FST/1-FST) and the geographic distance (km).
Figure 4.
FST population dendogram and heatmap based on FST values Analysis compared a total of 11 R. dominica sampling locations. Darker color represents greater degree of differentiation. The three Kansas location were grouped together and for that reason considered a single location here.
Figure 5.
Network analysis of wheat producer and consumers. Network showing FST distance (links), wheat production (node size), and role as source (green) or sink (red) according to rail transport of wheat in the United States. Source locations are greater producers and tended be less isolated. Sink location are small producers and tented to be more isolated. KS, ND, OK are source populations of wheat whereas WA, CA, LA, SC, AR, TX are sink populations of wheat.
Model selection
A single model was selected to explain the average number of effective migrants (M) at each location (Figure S6). The top model elected volume of grain received and geographical distance as best predictors according to AICc criteria (top model AICc = 24.57 vs. null model AICc= 30.48) (Table 4). The two predictors were consistent across the 100 models analyzed (Figure S7). Both variables were negatively associated with the average number of migrants, which means that locations closer to each other with the lower rates of grain receipts (i.e., sources) tended to exchange more migrants with other locations. On the other hand, locations farther apart with a high rate of grain receipts (i.e., sinks) tended to exchange fewer migrants with other locations. The volume of grain received at each location alone has a significant effect on the average number of effective migrants (M) and explains a significant portion of the variance (Figure 6).
Table 4. Global generalized linear model results (predictor variables included including intercept) modeling M (effective migrants).
| M = Intercept + Distance + Receipts | Estimate | SE | t-value | Pr(>|t|) |
|---|---|---|---|---|
| Intercept | −2.18e-18 | 1.423e-01 | 0.000 | 1.0 |
| Average geographic distance | −6.103e-01 | 1.532e-01 | −3.984 | 0.00725*** |
| Volume of grain received from other locations | −6.045e-01 | 1.532e-01 | −3.946 | 0.00757*** |
**significant at 0.001.
Figure 6.
Probability of migrant exchanged according to wheat trade. Mean of effective migrant Nm as function of the volume of grain received (millions of tons) at each location.
Discussion
Here we used hundreds of markers to examine and extract genetic diversity information from local populations and found correlations with geographic distances, agriculture practices, and the movement of commodities across vast distances. For instance, agriculture-related variables such as the volume of grain received by a particular location helped to explain the degree of genetic isolation, and the number of effective migrants exchanged between populations. Here, we have not found statistically significant differences in genetic diversity among populations including nucleotide diversity, allele richness, and expected heterozygosity, which could be attributed to the high gene flow between areas. An interesting finding is that the most heterogeneous and better-connected populations are also located in places with the largest grain production in the United States. KS was the most heterogeneous location showing the highest value for pairwise nucleotide differences within population and lowest values of pairwise nucleotide differences between populations. More heterogenous areas tended to be large hubs of the grain transport network and highly connected to other areas. Our results support the notion that important production areas are support larger and heterogeneous populations (Nopsa et al. 2015).
Heterozygosity deficiency
All populations showed lower values of observed heterozygosity than expected. Similar low heterozygosity has been described in the literature for other stored product insects (Demuth et al. 2007; Drury et al. 2009) and other beetle species (Brouat et al. 2003; Schrey et al. 2008). Heterozygosity deficiency may be associated with the way this species colonizes and disperses. Flying females outside of grain storage structures are often mated (Edde 2012, Ridley et al. 2016); thus, when infesting a large mass of grain such as a grain elevator or a grain bin, they may behave like a colonizing propagule. It is possible that the founding event that starts an infestation leads to a more inbred population (Wade and McCauley 1988). This effect can be substantial if the colonizing propagules are also coming from a single population or a small group of relatives (kin-structured colonization; Wade and McCauley 1988; Whitlock and McCauley 1990; Wade et al. 1994, Drury et al. 2009). Outside storage traps might reflect different patterns depending on the local dynamic at the specific sites where they were placed; e.g., more significant differences among individuals flying toward the storage structure (colonization) and a higher relatedness for individuals flying away from a storage structure(dispersing).
Further research at the landscape scale level is needed to determine the genetic variability of R. dominica colonizers. For instance, it would be useful to determine if they tend to have higher levels of genetic similarity among themselves (originate from similar local sources), or if they will exhibit a higher level of genetic differentiation from one another (arise from different sources) as observed in Tribolium populations (Drury et al. 2009). The reunion of dispersing inbred colonizers from different locations can temporarily create substructure in the local population (Wade and McCauley 1988; Whitlock and McCauley 1990; Zhivotovsky 2015) that is likely to disappear within few generations of random mating (Berthier et al. 2006).
Heterozygous deficiency is not the only pattern that has been found in stored-product insects, other species have shown an excess of heterozygotes that was attributed to bottlenecks caused by fumigation or heat control tactics (Fields and White 2002; Semeao et al. 2012; Coelho-Bortolo et al. 2016; Blanc et al. 2006). Nonetheless, we must be careful interpreting and generalizing conclusions from inbreeding coefficients such as FIS as such coefficients are more related to properties of the mating system within the population rather than evolutionary processes that lead to divergence among populations such as FST coefficients (Holsinger and Weir 2009).
Population sub-division
Rhyzopertha dominica is a synanthropic insect that was likely first introduced into North America with grains brought by European settlers several hundred years ago, with many reintroductions likely to have occurred, and is now widely distributed across the US. However, this trans-continental distribution is not uniform throughout the range of this species, and we expected to encounter different degrees of isolation (Ellstrand and Elam 1993). Physical barriers such the Rocky Mountains and deserts, historical processes of human colonization, and characteristics of the insect dispersal behavior have an impact on gene flow and consequently on population structure (Lowe and Allendorf 2010). We have found a significant degree of isolation in populations from CA, WA, and TX, and because those locations also have the most substantial average distance to other places, isolation by distance (IBD) can be implied. Other studies on stored product insects have found a significant degree of structure but lack of IBD (Bas et al. 2000; Drury et al. 2009; Semeao et al. 2012; Coelho-Bortolo et al. 2016; Thangaraj et al. 2016), which led to the prevalent hypothesis that human-aided movement was a more critical factor than flight dispersal in defining population structure (Drury et al. 2009). Here we found that geographic distance can play a significant role in population structure even in stored-product insects with a high degree of predicted anthropogenic movement, at least at a large geographic scale. Distance can be considered an essential factor in determining the structure of stored product pests. When evaluating effective migrants Nm, the most significant number of migrants would be expected between KS and ND whereas KS was the place with the higher number of pairwise migrant exchange. When evaluating possible migrants in our samples, our assignment analysis tagged one insect from AR to TX and one from CA to SC as putative first-generation immigrants, which suggests both short and long longitudinal displacement between producing areas in the US. These findings strongly support the notion that beetles are dispersing with the grain using the transportation system (i.e., train but potential not only) to engage in cross-continental travels.
Data presented here demonstrate that R. dominica populations experience a considerable amount of gene flow that was reflected in the high degree of population admixture illustrated in the structure plots. Rhyzopertha dominica populations seem to exhibit a combination of patterns, both continuous genetic differentiation and discrete population structure. The subtle levels of structure can give us clues of earlier demographic processes and ongoing evolutionary dynamics and therefore should be evaluated carefully. Keeping that in mind, we explored the population genetic structure using a wide range of techniques that indicated the influence of multiple and concurrent structuring factors. It appears that structuring factors might be to some extent correlated.
We presented two estimations for the number of clusters suggesting potential substructure. According to the Evanno method, K = 2 was the most likely number of groups, but there was also a small peak at K = 7. The smaller peak is much less important here, but it is nonetheless interesting to see that sampled location can still be identified despite the extremely high levels of gene flow. The PCA analysis, a non-model-based method, supported those two levels of structure even though it is not clear if crop type, source-sink dynamic, or another non-tested factor was responsible for the observed pattern. Scrutinizing K = 7, we confirm that KS and OK are indeed the most heterogeneous locations and CA, WA, and TX are the most homogenous. We can also spot the putative first-generation migrant in SC coming from CA. The FST values allowed the separation of the locations into two groups, one composed of relatively well-connected locations such as KS, ND, and OK and one composed by more structured locations (i.e., higher FST) such as WA, CA, and TX. Interestingly, TX is very close geographically to LA but seemed to be very distant genetically suggesting the importance of other processes and dynamics more complicated than just geographic distance.
Crop vs. ecoregion
We found very little structure caused by crop region (i.e., 0.63% of the total variance was explained by crop type). Given that rough rice tends not to be transported over very long distances prior to processing and is less likely to be cross-contaminated with wheat in transportation, we had predicted populations from rice-growing regions would be more isolated from wheat growing regions. However, our results are not conclusive regarding the impact of the crop because only the TX location could be considered an exclusive rice production region according to our classification (crop type within a 50 km radius). Our other rice-growing regions in AR and LA, while they can be considered rice regions predominantly, did also have some wheat grown within the established perimeter, making them mixed crop regions. That pattern of mixing areas was also apparent in the PCA plot. Texas presented a substantial degree of differentiation compared to a close location such as LA. These few cases of crop-related variation suggest that there may be a crop effect that was not detectable using our sample locations. The hypothesis that crop type could be a factor causing population structure has been investigated in the past in wheat and rice mills for Tribolium (Semeao et al. 2012), but in that study the variation within commodity type or region grouping was equal to or greater than that between groups, showing a weak relation at best. A finer scale sampling effort including more locations that are exclusively rice and wheat is necessary to definitively answer the question of whether crop type can be an essential factor structuring populations of R. dominica.
Source-sink dynamics
Because distance has not been found as a major factor affecting population structure in stored-product insect populations in the past, grain transportation is often evoked as the best explanation for the lack of IBD (Bas et al. 2000; Blanc et al. 2006; Ryne and Bensch 2008). In the present work, we found a significant correlation between the geographic distance (km) and genetic distance (FST/1-FST) using the Mantel test, and a high rate of admixture (Figure 2). However, clear signs of other structuring factors related to agriculture activity and grain transportation suggest that they are potentially significant in impacting genetic parameters of the tested populations. Model selection allows us to test several competing hypotheses by weighting and establishing relationships between variables (Johnson and Omland 2004). We used geographic distance, crop information, bioclimatic, and transportation variables to assess the relative importance of each covariant for explaining genetic isolation and the number of migrants. The prediction was that larger geographic range over which grain is grown and the more grain shipped should be associated with greater number of migrants and less geographic isolation (source population, as in Semeao et al. 2012), and places that tend to receive more grain will have fewer migrants going out and are more isolated (sink population). Our prediction was supported because the covariant ‘volume of grain received’ was elected as most important considering all 100 models evaluated. This variable defines whether a location is a sink or a source (i.e., low values of grain received are associated with source population, and high values of grain received are associated with sink population).
Conclusions
Here we have shown how the geographic distance, features of the environment in which local populations are present, the agriculture management, as well as storage, and transportation of commodity are likely to affect different aspects of a stored grain insect pest population. This is the first time that population structure and IBD have been shown for a stored product insect. Inbreeding coefficients gave us insights on how populations infest and develop within grain storage sites; however, the observed heterozygosity deficiency needs further evaluation to determine if it is due to sample bias, such as timing before or after fumigation or being nearer to stored grain vs. natural habitat locations, or if it truly represents a pattern of substructure or kin-structure. Most heterogeneous and well-connected populations were in regions with the greatest wheat production. Given that after harvest wheat is typically stored locally and then transported through a series of successively larger and more distant storage locations, large production areas offer more opportunities for R. dominica to breed and these regions can potentially be a source of migrants to other less productive locations. In addition, dispersal by flight behavior and displacement from one location to another by human-aided means using trucks and railroads connecting populations can largely explain population differentiation patterns in R. dominica in the United States. This information also has a potential application to pest management and the management of insecticide resistance that needs further exploration.
Acknowledgments
We thank K. Leonard for technical support, Dr. K. J. Jordan, and Dr. S. Wang of helping with library preparation and bioinformatics support. We also would like to express gratitude to the network of collaborators across the country who helped us successfully sample R. dominica in the major wheat and rice regions of the United States. Thank you Dr. T. McKay (AR), Dr. Y. Yang (TX), Dr. D. Weaver (MT), Dr. R. Mahroof (SC), Dr. L. Mason (IN), Dr. J. Prasifka (ND), Dr. J. Beuzelin (LA), Dr. G. Opit (OK), Dr. C. Burks (CA), and K. Wenger and C. Klein (Fairview KS). We thank the National Council of Scientific and Technological Development (CNPq) for financial support (number of the process: 201878/2011-9, EMGC). We also acknowledge the United States Environmental Protection Agency (EPA) for the shapefiles. Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U. S. Department of Agriculture or Kansas State University. USDA ARS is an equal opportunity employer. This work was supported in part by the Kansas Agricultural Experiment Station, and represents contribution number 19-115-J.
Footnotes
Supplemental material available at Figshare: https://doi.org/10.25387/g3.7242893.
Communicating editor: P. Morrell
Literature Cited
- Abdi H., 2004. Partial regression coefficients, pp. 1–4 in Encyclopedia of Social Sciences Research Methods, edited by Lewis-Beck M., Bryman A., Futing T. SAGE Publications, Thousand Oaks. [Google Scholar]
- Anderson D. R., 2007. Model based inference in the life sciences: A primer on evidence, Springer, New York. [Google Scholar]
- Anderson D. R., 2008. Information Theory and Entropy, pp. 51–82 in Model Based Inference in the Life Sciences: A Primer on Evidence, edited by Anderson D. R. Springer-Verlag, New York: 10.1007/978-0-387-74075-1_3 [DOI] [Google Scholar]
- Bas B., Dalkilic Z., Peever T. L., Nigg H. N., Simpson S. E., et al. , 2000. Genetic relationships among Florida Diaprepes abbreviatus (Coleoptera: Curculionidae) populations. Ann. Entomol. Soc. Am. 93: 459–467. 10.1603/0013-8746(2000)093[0459:GRAFDA]2.0.CO;2 [DOI] [Google Scholar]
- Bailey R. G., 2005. Identifying Ecoregion Boundaries. Environ. Manage. 34: S14–S26. 10.1007/s00267-003-0163-6 [DOI] [PubMed] [Google Scholar]
- Benestan L., Gosselin T., Perrier C., Sainte-Marie B., Rochette R., et al. , 2015. RAD genotyping revels fine-scale genetic structuring and provides powerful populations assignment in a widely distributed marine species, the American lobster (Homarus americanus). Mol. Ecol. 24: 3299–3315. Erratum: 1626–1629. 10.1111/mec.13245 [DOI] [PubMed] [Google Scholar]
- Berthier K., Charbonnel N., Galan M., Chaval Y., Cosson J.-F., 2006. Migration and recovery of the genetic diversity during the increasing density phase in cyclic vole populations. Mol. Ecol. 15: 2665–2676. 10.1111/j.1365-294X.2006.02959.x [DOI] [PubMed] [Google Scholar]
- Blanc M. P., Lugon-Moulin N., Panighini C., Pijnenburg H., Rossi L., 2006. Structure of worldwide populations of Lasioderma serricorne (Coleoptera: Anobiidae) as revealed by amplified fragment length polymorphism profiles. Bull. Entomol. Res. 96: 111–116. 10.1079/BER2005405 [DOI] [PubMed] [Google Scholar]
- Brouat C., Sennedot F., Audiot P., Leblois R., Rasplus J.-Y., 2003. Fine-scale genetic structure of two carabid species with contrasted levels of habitat specialization. Mol. Ecol. 12: 1731–1745. 10.1046/j.1365-294X.2003.01861.x [DOI] [PubMed] [Google Scholar]
- Burnham K. P., Anderson D. R., 2003. Model selection and multimodel inference: a practical information-theoretic approach, Springer Science & Business Media, Berlin, Germany. [Google Scholar]
- Calcagno V., de Mazancourt C., 2010. glmulti: An R package for easy automated model selection with (generalized) linear models. J. Stat. Softw. 34: 1–29. 10.18637/jss.v034.i12 [DOI] [Google Scholar]
- Catchen J., Hohenlope P. A., Bassham S., Amores A., Cresko W. A., 2013. Stacks: an analysis tool set for population genomics. Mol. Ecol. 22: 3124–3140. 10.1111/mec.12354 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coelho-Bortolo T., Mangolin C. A., Lapenta A. S., 2016. Genetic Variability in the natural populations of Lasioderma serricorne (F.) (Coleoptera: Anobiidae), detected by RAPD markers and the esterase isozymes. Bull. Entomol. Res. 106: 47–53. 10.1017/S0007485315000723 [DOI] [PubMed] [Google Scholar]
- Cornuet J.-M., Piry S., Luikart G., Estoup A., Solignac M., 1999. New methods employing multilocus genotypes to select or exclude populations as origins of individuals. Genetics 153: 1989–2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ching’Oma, G. P., 2006 Spatial distribution and movement of the lesser grain borer, Rhyzopertha dominica (F.). Ph.D. dissertation, Kansas State University, Manhattan, KS. [Google Scholar]
- Demuth J. P., Drury D. W., Peter M. L., Van Dyken J. D., Priest N. K., et al. , 2007. Genome-wide survey of Tribolium castaneum microsatellites and description of 509 polymorphic markers. Mol. Ecol. 7: 1189–1195. 10.1111/j.1471-8286.2007.01826.x [DOI] [Google Scholar]
- Dias P. C., 1996. Sources and sinks in population biology. Trends Ecol. Evol. 11: 326–330. 10.1016/0169-5347(96)10037-9 [DOI] [PubMed] [Google Scholar]
- Diffendorfer J. E., 1998. Testing models of source-sink dynamics and balanced dispersal. Oikos 81: 417–433. 10.2307/3546763 [DOI] [Google Scholar]
- Dowdy A. K., McGaughey W. H., 1994. Seasonal activity of stored-product insects in and around farm-stored wheat. J. Econ. Entomol. 87: 1351–1358. 10.1093/jee/87.5.1351 [DOI] [Google Scholar]
- Dray S., Dufour A. B., 2007. The ade4 package: implementing the duality diagram for ecologists. J. Stat. Softw. 22: 1–20. 10.18637/jss.v022.i04 [DOI] [Google Scholar]
- Drury D. W., Sinard A. L., Wade M. J., 2009. Genetic differentiation among wild populations of Tribolium castaneum estimated using microsatellite markers. J. Hered. 100: 732–741. 10.1093/jhered/esp077 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Earl D. A., von Holdt B. M., 2012. STRUCTURE HARVESTER: a website and program for visualization STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 4: 359–361. 10.1007/s12686-011-9548-7 [DOI] [Google Scholar]
- Edde P. A., Phillips T. W., Nansen C., Payton M. E., 2006. Flight activity of the lesser grain borer, Rhyzopertha dominica F. (Coleoptera: Bostrichidae), in relation to weather. Environ. Entomol. 35: 616–624. 10.1603/0046-225X-35.3.616 [DOI] [Google Scholar]
- Edde, P. A., 2012 A review of the biology and control of Rhyzopertha dominica (F.) the lesser grain borer. J. Stored Prod. Res. 48: 1–18. [Google Scholar]
- Ellstrand N. C., Elam D. R., 1993. Population genetic consequences of small population-size - implications for plant conservation. Annu. Rev. Ecol. Evol. Syst. 24: 217–242. 10.1146/annurev.es.24.110193.001245 [DOI] [Google Scholar]
- EPA, 1997 Ecological regions of North America: toward a common perspective. Commission for environmental Cooperation. Retrieved 2015–08–27: http://www.cec.org/files/PDF/BIODIVERSITY/eco-eng_EN.pdf
- Excoffier L., Smouse P. E., Quattro J. M., 1992. Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics 131: 479–491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Falush D., Stephens M., Pritchard J. K., 2003. Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164: 1567–1587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fields P. M., White N. D. G., 2002. Alternative to methyl bromide treatments for stored-product and quarantine research. Annu. Rev. Entomol. 47: 331–359. 10.1146/annurev.ento.47.091201.145217 [DOI] [PubMed] [Google Scholar]
- Garcia-Elfring A., Barrett R. D. H., Combs M., Davies T. J., Munshi-South J., et al. , 2017. Admixture on the northern front: population genomics of range expansion in the white-footed mouse (Peromyscus leucopus) and secondary contact with the deer mouse (Peromyscus maniculatus). Heredity 119: 447–458. 10.1038/hdy.2017.57 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goslee S. C., Urban D. L., 2007. The ecodist package for dissimilarity-based analysis of ecological data. J. Stat. Softw. 22: 1–19. 10.18637/jss.v022.i07 [DOI] [Google Scholar]
- Hijmans R. J., Cameron S. E., Parra J. L., Jones P. G., Jarvis A., 2005. Very high-resolution interpolated climate surfaces for global land areas. Int. J. Climatol. 25: 1965–1978. 10.1002/joc.1276 [DOI] [Google Scholar]
- Hohenlohe P. A., Bassham S., Etter P. D., Stiffler N., Johnson E. A., et al. , 2010. Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genet. 6: e1000862 10.1371/journal.pgen.1000862 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hohenlohe P. A., Amish S. J., Catchen J. M., Allendorf F. W., Luikart G., 2011. Next-generation RAD sequencing identifies thousands of SNPs for assessing hybridization between rainbow and westslope cutthroat trout. Mol. Ecol. Resour. 11: 117–122. 10.1111/j.1755-0998.2010.02967.x [DOI] [PubMed] [Google Scholar]
- Holsinger K. E., Weir B. S., 2009. Genetics in geographically structured populations: defining, estimating and interpreting FST. Nat. Rev. Genet. 10: 639–650. 10.1038/nrg2611 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Holt R. D., 1997. On the evolutionary stability of sink populations. Evol. Ecol. 11: 723–731. 10.1023/A:1018438403047 [DOI] [Google Scholar]
- Ilut D. C., Nydam M. L., Hare M. P., 2014. Defining loci in restriction-based reduced representation genomic data from non-model species: sources of bias and diagnostics for optimal clustering. BioMed Res. Int. 2014: 675158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jakobsson M., Rosenberg N. A., 2007. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformations 23: 1801–1806. 10.1093/bioinformatics/btm233 [DOI] [PubMed] [Google Scholar]
- Jia F., Toews M. F., Campbell J. F., Ramaswamy S. B., 2008. Survival and reproduction of lesser grain borer, Rhyzopertha dominica (F.) (Coleoptera: Bostrichidae) on flora associated with native habitats in Kansas. J. Stored Prod. Res. 44: 366–372. 10.1016/j.jspr.2008.06.001 [DOI] [Google Scholar]
- Johnson J. B., Omland K. S., 2004. Model selection in ecology and evolution. Trends Ecol. Evol. 19: 101–108. 10.1016/j.tree.2003.10.013 [DOI] [PubMed] [Google Scholar]
- Leos-Martinez J., Granovsky T. A., Williams H. J., Vinson S. B., Burkholder W. E., 1986. Estimation of aerial density of the lesser grain borer (Coleoptera: Bostrichidae) in a warehouse using dominicalure traps. J. Econ. Entomol. 79: 1134–1138. 10.1093/jee/79.4.1134 [DOI] [Google Scholar]
- Linck E. B., Battey C. J., 2017. Minor allele frequency thresholds strongly affect population structure inference with genomic datasets. bioRxiv. 10.1101/188623 [DOI] [PubMed] [Google Scholar]
- Lischer H. E. L., Excoffier L., 2012. PGDSpider: an automated data conversion tool for connecting population genetics and genomics program. Bioinformatics 28: 298–299. 10.1093/bioinformatics/btr642 [DOI] [PubMed] [Google Scholar]
- Lowe W. H., Allendorf F. W., 2010. What can genetics tell us about population connectivity? Mol. Evol. 19: 5320 10.1111/j.1365-294X.2010.04688.x [DOI] [PubMed] [Google Scholar]
- Mahroof R., Phillips T. W., 2007. Stable isotopes as markers to investigate host use by Rhyzopertha dominica. Entomol. Exp. Appl. 125: 205–213. 10.1111/j.1570-7458.2007.00618.x [DOI] [Google Scholar]
- NASS - USDA National Agricultural Statistic Service, 2014 NASS 2012 Census of Agriculture Atlas: Crops and Plants. http://gis.ers.usda.gov/arcgis
- Hernandez Nopsa J. F., Daglish G. J., Hagstrum D. W., Leslie J. F., Phillips T. W., et al. , 2015. Ecological Networks in stored grain: key postharvest nodes for emerging pests, pathogens, and mycotoxins. Bioscience 65: 985–1002. 10.1093/biosci/biv122 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paetkau D., Slade R., Burdens M., Estoup A., 2004. Genetic assignment methods for the direct, real-time estimation of migration rate: a simulation-based exploration of accuracy and power. Mol. Ecol. 13: 55–65. 10.1046/j.1365-294X.2004.02008.x [DOI] [PubMed] [Google Scholar]
- Paetkau D., Calvert W., Stirling I., Strobeck C., 1995. Microsatellite analysis of population structure in Canadian polar bears. Mol. Ecol. 4: 347–354. 10.1111/j.1365-294X.1995.tb00227.x [DOI] [PubMed] [Google Scholar]
- Prater M. E., O’Neil D., Sparger A., 2013. State Grain Rail Statistical Summary, U.S. Department of Agriculture, Agricultural Marketing Service, Washington, DC: 10.9752/TS066.06-2013 [DOI] [Google Scholar]
- Pritchard J. K., Stephens M., Donnely P., 2000. Inference of population structure using multilocus genotype date. Genetics 155: 945–959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Potter C., 1935. The biology and distribution of Rhyzopertha dominica (FAB.). T. Roy. Ent. Soc. London 83: 449–482. 10.1111/j.1365-2311.1935.tb02995.x [DOI] [Google Scholar]
- Ridley A. W., Hereward J. P., Daglish G. J., Raghu S., McCulloch G. A., et al. , 2016. Flight of Rhyzopertha dominica (Coleopera: Bostrichidae) – a Spatial – Temporal Analysis with Pheromone Trapping and Population Genetics. J. Econ. Entomol. 109: 2561–2571. 10.1093/jee/tow226 [DOI] [PubMed] [Google Scholar]
- Rochette N. C., Catchen J. M., 2017. Deriving genotypes from RAD-seq short-read data using Stacks. Nat. Protoc. 12: 2640–2659. 10.1038/nprot.2017.123 [DOI] [PubMed] [Google Scholar]
- Roesti M., Salzburger W., Berner D., 2012. Uniformative polymorphisms bias genome scans for signatures of selection. BMC Evol. Biol. 12: 94 10.1186/1471-2148-12-94 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rosenberg N. A., 2004. Disrupt: a program for the graphical display of population structure. Mol. Ecol. Notes 4: 137–138. 10.1046/j.1471-8286.2003.00566.x [DOI] [Google Scholar]
- Ryne C., Bench S., 2008. Do anthropogenic transport facilitate stored-product pest moth dispersal? A molecular approach. Naturwissenschaffen 95: 155–159. 10.1007/s00114-007-0299-y [DOI] [PubMed] [Google Scholar]
- Saintenac C., Jiang D., Wang S., Akhunov E., 2013. Sequence-based mapping of the polyploidy wheat genome. G3 (Bethesda) 3: 1105–1114. 10.1534/g3.113.005819 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schrey N. M., Schrey A. W., Heist E. J., Reeve J. D., 2008. Fine-scale population structure of southern pine beetle (Coleoptera: Curculionidae) in Mississippi forest. Environ. Entomol. 37: 271–276. 10.1093/ee/37.1.271 [DOI] [PubMed] [Google Scholar]
- Semeao A. A., Campbell J. F., Beeman R. W., Lorenzen M. D., Whitworth R. J., et al. , 2012. Genetic structure of Tribolium castaneum (Coleoptera: Tenebrionidae) populations in mills. Environ. Entomol. 41: 188–199. 10.1603/EN11207 [DOI] [PubMed] [Google Scholar]
- Sinclair E. R., Haddrell R. L., 1985. Flight of stored product beetles over a grain farming area in southern Queensland. J. Aust. Entomol. Soc. 24: 9–15. 10.1111/j.1440-6055.1985.tb00177.x [DOI] [Google Scholar]
- Szpiech Z. A., Jakobsson M., Rosenberg N. A., 2008. ADZE: a rarefaction approach for counting alleles private to combinations of populations. Bioinformatics 24: 2498–2504. 10.1093/bioinformatics/btn478 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Toews M. D., Campbell J. F., Arthur F. H., Ramaswamy S. B., 2006. Outdoor flight activity and immigration of Rhyzopertha dominica into seed wheat warehouses. Entomol. Exp. Appl. 121: 73–85. 10.1111/j.1570-8703.2006.00462.x [DOI] [Google Scholar]
- Thangaraj S. R., McCulloch G. A., Subbarayalu M., Subramaniam C., Walter G. H., 2016. Development of microsatellite markers and a preliminary assessment of population structuring in the rice weevil, Sitophilus oryzae (L.). J. Stored Prod. Res. 66: 12–17. 10.1016/j.jspr.2015.12.005 [DOI] [Google Scholar]
- Wade M. J., McCauley D. E., 1988. Extinction and recolonization: Their effects on the genetic differentiation of local populations. Evolution 43: 995–1005. 10.1111/j.1558-5646.1988.tb02518.x [DOI] [PubMed] [Google Scholar]
- Wade M. J., McKnight M. L., Shaffer H. B., 1994. The effects of kin-structured colonization on nuclear and cytoplasmic genetic diversity. Evolution 48: 1114–1120. 10.1111/j.1558-5646.1994.tb05298.x [DOI] [PubMed] [Google Scholar]
- Wagenmakers E.-J., Farrell S., 2004. AIC model selection using Akaike weights. Psychon. Bull. Rev. 11: 192–196. 10.3758/BF03206482 [DOI] [PubMed] [Google Scholar]
- Whitlock M. C., McCauley D. E., 1990. Some population genetic consequences of colony formation and extinction: genetic correlation within founding groups. Evolution 44: 1717–1724. 10.1111/j.1558-5646.1990.tb05243.x [DOI] [PubMed] [Google Scholar]
- Whitlock M. C., McCauley D. E., 1999. Indirect measures of gene flow and migration, FST≠ 1/(4Nm+1). Heredity 82: 117–125. 10.1038/sj.hdy.6884960 [DOI] [PubMed] [Google Scholar]
- Wright V. F., Fleming E. E., Post D., 1990. Survival of Rhyzopertha dominica (Coleoptera: Bostrichidae) on fruits and seeds collected from woodrat nests in Kansas. J. Kans. Entomol. Soc. 63: 344–347. [Google Scholar]
- Zheng X., Levine D., Shen J., Gogarten S. M., Laurie C., et al. , 2012. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28: 3326–3328. 10.1093/bioinformatics/bts606 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhivotovsky L. A., 2015. Relationship between wright’s FST and FIS statistics in a context of Wahlund effect. J. Hered. 106: 306–309. 10.1093/jhered/esv019 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
A detailed supplementary document was prepared and made available at FigShare and GitHub at github.com/cordeiroemg. Tables and figures contain all the necessary information to reproduce the results presented here. Also, a fasta, genepop and VCF file containing all SNP information was made available. Supplemental material available at Figshare: https://doi.org/10.25387/g3.7242893.






