Skip to main content
Plant Biotechnology Journal logoLink to Plant Biotechnology Journal
. 2016 Sep 30;15(3):357–366. doi: 10.1111/pbi.12630

Population genomics identifies the origin and signatures of selection of Korean weedy rice

Qiang He 1, Kyu‐Won Kim 1, Yong‐Jin Park 1,2,
PMCID: PMC5316921  PMID: 27589078

Summary

Weedy rice is the same biological species as cultivated rice (Oryza sativa); it is also a noxious weed infesting rice fields worldwide. Its formation and population‐selective or ‐adaptive signatures are poorly understood. In this study, we investigated the phylogenetics, population structure and signatures of selection of Korean weedy rice by determining the whole genomes of 30 weedy rice, 30 landrace rice and ten wild rice samples. The phylogenetic tree and results of ancestry inference study clearly showed that the genetic distance of Korean weedy rice was far from the wild rice and near with cultivated rice. Furthermore, 537 genes showed evidence of recent positive or divergent selection, consistent with some adaptive traits. This study indicates that Korean weedy rice originated from hybridization of modern indica/indica or japonica/japonica rather than wild rice. Moreover, weedy rice is not only a notorious weed in rice fields, but also contains many untapped valuable traits or haplotypes that may be a useful genetic resource for improving cultivated rice.

Keywords: genetic resources, natural selection, origin, weedy rice, whole‐genome resequencing

Introduction

Weedy rice (Oryza sativa f. spontanea Rosh.) is one of the most notorious weeds in rice fields worldwide, causing both crop yield losses and degrading the quality of rice (Qiu et al., 2014). It can be defined as any spontaneously and strongly shattering rice that occurs in cultivated rice fields, and it harbours phenotypes of both wild and domesticated rice. Weedy rice only occurs in or nearby rice fields and is characterized by its easy seed shattering, deep dormancy and red pericarp, which serve as keys to distinguish weedy rice from wild species and cultivated rice (Xia et al., 2011). Weedy rice produces fewer grains per plant and competes aggressively with the cultivated rice. It is present worldwide, including South and North America, southern Europe and southern and southeast Asia (Noldin et al., 1999; Sun et al., 2013). In the last 20 years, weedy rice has become a serious agricultural issue because of direct seeding. In southern parts of the United States, weedy rice causes an annual loss of more than $50 million (Gealy et al., 2002). In China, weedy rice affects more than three million ha and reduces the crop yield by about 3.4 billion kg (Liang and Qiang, 2011).

In Korea, a lot of weedy rice strains (O. sativa L., locally called ‘Aengmi’ and ‘Share’) have been collected from farmers’ fields. The regional distribution and genetics of weedy rice have been extensively characterized (Hak‐Soo and Mun‐Hue, 1992; Heu, 1988). ‘Share rice’ in Korea occurs mainly on Kanghwa Island and is geographically isolated from other weedy subpopulations. ‘Share rice’ has almost all of the characteristics of ‘Red rice’, while its spikelet within one panicle always has a different ripening time and the culm length is always shorter than those of ‘Red rice’ and landraces (Chung and Park, 2010). Although many morphological traits have been studied in detail, the large variability in complex quantitative traits in weedy rice remains unexploited (Chung and Park, 2010). Weedy rice plants are disease‐resistant and herbicide‐resistant (Chen et al., 2004; Olofsdotter et al., 2000). The seeds of weedy rice have high ability to persist in the soil (Delouche and Labrada, 2007). Therefore, it is difficult to obliterate weedy rice from rice fields. On the other hand, cultivated rice is susceptible to rice diseases such as rice blast disease, which leads to a 20%–30% loss of annual rice harvest (Kou and Wang, 2012). Weedy rice has attracted attention in the area of rice research because of its tolerance against biostress and abiostress (Chen et al., 2004; Liu et al., 2014; Lu et al., 2014; Qiu et al., 2014; Sun et al., 2013). Because weedy rice in Korea has acclimatized and adapted to different growing environments, it is also thought to be tolerant to a wide range of adverse conditions. In addition, we postulated that weedy rice may have additional useful characteristics, either in stress tolerance or rice eating quality, that could be used to improve cultivated rice in the future.

In addition, there is an active area of research toward determining the origin of weedy rice as a means to keep it out of rice fields. The origin of weedy rice has been debated intensively over the past 40 years, and there are three main hypotheses: (i) weedy rice descended directly from cultivated rice, with some populations from indica‐type cultivars (Londo and Schaal, 2007), some from japonica‐type cultivars (Cao et al., 2006, 2009; Vaughan et al., 2008), and others from indica × japonica hybrids (Ishikawa et al., 2005; Qiu et al., 2014); (ii) weedy rice may have generated from the process of ongoing selection and adaptation of wild rice (Oryza rufipogon and Oryza nivara which is considered as annual O. rufipogon) (De Wet and Harlan, 1975; Kelly Vaughan et al., 2001); and (iii) weedy rice may have originated from the hybridization between cultivated rice and its wild ancestor O. rufipogon (Londo and Schaal, 2007). A number of other hypotheses have also been proposed, including weedy rice originating from reversion of cultivated rice when domesticated rice was abandoned (Bres‐Patry et al., 2001) or forming by hybridization of its cultivated relatives (Ishikawa et al., 2005; Reagon et al., 2010; Xiong et al., 2012). This is supported by observations that some weedy‐type offspring could occur after inter‐subspecies and inter‐varietal hybridization in rice. Despite increasing focus on the evolutionary study of weedy rice, its origin is still unclear. A number of molecular markers have been applied to infer the population structure, genetic diversity and adaptive loci of weedy rice. However these molecular markers, either simple sequence repeats or restriction fragment length polymorphisms, which were frequently used in studies of weedy rice, only allowed scanning of the genome at very low density (Chung and Park, 2010; Gross et al., 2010; Kelly Vaughan et al., 2001; Londo and Schaal, 2007; Reagon et al., 2010; Sun et al., 2013; Thurber et al., 2010). It is easy to obtain biased conclusions with low‐density molecular markers because the accuracy of population structure, genetic diversity and adaptive loci identities is dependent on the representativeness of the markers, and it is difficult to evaluate the representative efficiency of the markers for unknown populations.

The rapid increase in the density of molecular markers through next‐generation sequencing (NGS) technology has facilitated building more solid hypotheses on genomics‐based study of rice (He et al., 2015; Huang et al., 2010, 2012a,b; Xu et al., 2012). Qiu et al. (2014) used a whole‐genome resequencing strategy for three weedy rice samples and suggested the origin of weedy rice from domesticated indica/japonica hybridization. However, the case study of these three weedy rice samples is not representative of all weedy rice populations, and the limited population size was not suitable to identify the adaptation loci for weedy rice.

Recently, a number of analytical methods have been performed to identify signals of recent positive selection or adaptation loci on a genome‐wide scale using huge NGS data, such as the fixation index (F ST) (Akey et al., 2002), nucleotide diversity (π) (Nei and Li, 1979), integrated haplotype score (iHS) (Voight et al., 2006), extended haplotype homozygosity (EHH) (Sabeti et al., 2002), cross‐population EHH (XP‐EHH) (Sabeti et al., 2007; Tang et al., 2007), composite likelihood ratio (CLR) (Nielsen et al., 2005; Williamson et al., 2007) and cross‐population CLR (XP‐CLR) (Chen et al., 2010). Most of these methods were first used in human populations and were later extended to animals and plants. All of the techniques significantly stimulate population genetic studies. In rice, Xu et al. (2012) detected many domesticated or selective sweeps by comparing the reduction in nucleotide diversity and F ST between wild rice and cultivated rice. They found two well‐known domesticated rice genes, prog1 (Jin et al., 2008) and sh4 (Li et al., 2006), among their candidate selective sweep regions. However, this approach has not been used for weedy rice with NGS data.

In this study, we resequenced the whole genomes of 30 Korean weedy rice individuals and 30 Korean landrace rice individuals. Through high‐density single‐nucleotide polymorphism (SNP) markers and accredited population size, we performed population studies of Korean weedy rice to obtain reasonable hypotheses for the origin of Korean weedy rice. Furthermore, we detected many adaptive loci in the weedy rice population that may be valuable resources for control and utilization of weedy rice in the future.

Results

Polymorphisms across the rice genome

From high‐coverage whole‐genome sequencing of 30 weedy rice and 30 landrace unrelated individuals (Table S1) we collected more than two billion clean reads aligned to 374 Mb of the O. sativa genome (http://rapdb.dna.affrc.go.jp/download/archive/irgsp1/IRGSP-1.0_genome.fasta.gz). Ninety‐eight percent of reads were mapped to the reference with an average depth of 10× (Table S2). From these data, we detected more than eight million SNPs and one million InDels. For the ten wild rice plants we finally obtained 10.6 million SNPs and 1.3 million InDels (Table 1). It is reasonable that wild rice has much more variation than cultivated rice or weedy rice. In addition to SNP and Indel analysis, we studied the structure variations among different groups. We found 3957, 3754, and 3006 copy number variations among weedy rice, landrace rice and wild rice, respectively. A total of 19 391 inversions and 22 812 translocations were detected in weedy rice. While only 12 381 inversions and 19 767 translocations were detected in landrace rice (Table 1). It is suggested that weedy rice have much more structure variations than landrace rice.

Table 1.

Summary of sequencing variations for 70 samples

Populations Indels (m) SNPs (m) High‐quality Indels (m) High‐quality SNPs (m) CNV Inversion Translocation
Wild rice 1.33 10.63 0.914 6.96 3006 3666 24 357
Landrace rice 0.96 6.98 0.374 2.54 3754 12 381 19 767
Weedy rice 0.86 6.42 0.342 2.44 3957 19 391 22 812
Landrace_weedy 1.09 8.18 0.313 2.23
Total 1.65 13.25 0.338 2.25

CNV, copy number variation; SNP, single‐nucleotide polymorphism.

Most of the variations were rare (minor allele frequency [MAF] <0.05). To obtain the SNPs used for population studies, we excluded those missing in any of 70 accessions and those with an MAF of <0.05, because these variations may biased the population study. This yielded a final total of 2.2 million high‐quality SNPs (Tables 1 and S8). To our knowledge, this represents the largest high‐quality SNP data set for weedy rice reported to date. These data will be used to identify important adaptation loci of weedy or landrace rice, as well as for breeding. Indeed, using this data set we identified one novel allele on the badh2 gene, which can be used to improve cultivated rice (He and Park, 2015).

Population structure of weedy, landrace and wild rice

To understand the genetic population structure and relationships among the major groups of Korean landrace and weedy rice, we constructed a maximum likelihood (ML) tree and conducted population structure analysis based on the 2.25 million high‐quality SNPs. The ML tree contained four major groups corresponding to O. rufipogon, O. nivara, O. japonica and O. indica. This was consistent with the results of Xu et al. (2012). However, the O. nivara in our study was separated from cultivated rice compared with the results reported by Xu et al. (2012). This may have been because Korean indica rice has a different genetic background from those of other regions. The 30 weedy rice can be clearly divided into two major groups, ind_weedy (9) and jap_weedy (21). Similar to weedy, landrace can also be separated into two groups, ind_landrace (5) and jap_landrace (25). Moreover, we found that weedy rice could be divided into three major subgroups and some admixed individuals (Figure 1a). Because most people in Korea prefer japonica rice in their daily diet, it is reasonable that more than 76% of local rice samples belonged to the japonica group. Furthermore, we investigated the population structure of the 70 samples. We analysed the data by increasing K (number of populations) from 2 to 7 (Figure 1b). For = 2, we found a division between japonica and others. It is suggested the Korea indica rice was more closely related to wild rice than japonica. For = 4, ind_weedy (G1) was separated from ind_landrace. For = 5, one of the jap_weedy (G3) subgroups was clearly separated from jap_landrace, while another jap_weedy subgroup, G2, was always mixed with other jap_landrace. From = 2 to 7, there were no patterns indicating that weedy rice formed directly from wild rice. Because the phylogenetic tree showed that ind_weedy was consistently farther from wild rice than ind_landrace (Figure 1a), and it did not contain any wild patterns from the japonica rice group in the structural study (Figure 1b), we assumed that Korean ind_weedy rice was formed from indica rice without admixture of patterns from other populations. For = 5, the japonica group was separated into three subgroups, with many admixture patterns among the different subgroups. In comparison to ind_weedy, jap_weedy had more admixture patterns from among jap_weedy or between jap_weedy and jap_landrace, but not from indica or wild species. Therefore, we postulated that jap_weedy was derived from japonica rice and that gene exchange occurred frequently between or among japonica rice. Meanwhile, the PCA plot showed clearly that weedy rice was clustered with landrace rice in both the indica and japonica groups (Figure 1c).

Figure 1.

Figure 1

Population structure analysis of 30 weedy rice samples with 40 other Oryza species. (a) Maximum likelihood phylogenetic tree. Blue branches are weedy rice, black branches are landrace rice. (b) Maximum likelihood clustering with K ranging from 2 to 7. For each K, the different colour represents different populations. Each accession is represented by a vertical bar, and the length of each coloured segment in each vertical bar represents the proportion contributed by ancestral populations. (c) PCA plots. Blue dots are weedy rice, black dots are landrace rice. (d) Illustration of genetic diversity and population differentiation in wild rice and japonica_landrace, japonica_weedy, indica_landrace and indica_weedy. The sizes of the circles represent the levels of genetic diversity of groups, and the distances are F ST values between different groups.

For all subgroups, we calculated the π values with a 100‐kb sliding window, the common summary statistics for measuring genetics diversity in a population. The average diversity levels of O. rufipogon and O. nivara are markedly higher than those of jap_landrace, jap_weedy, and ind_weedy, but not ind_landrace (Figure 1d). It is suggested that Korean japonica rice may have undergone a stronger reduction in effective population size than indica or weedy rice, while the indica landrace may not have encountered a strong bottleneck during domestication. The F ST by the 100‐kb sliding window was used to measure the level of population differentiation. The average F ST between O. rufipogon and O. nivara (0.19) was much lower than that between wild rice and O. sativa [ruf‐jap_weedy (0.41), ruf‐jap_landrace (0.43), niv‐ind_weedy (0.31). However, the F ST between O. nivara and ind_landrace was only 0.07, which was much lower than that with ind_landrace‐ind_weedy (0.16). This suggests that indica landrace is more closely related to O. nivara than the local weedy rice, possibly due to the materials of O. nivara. In the study by Xu et al. (2012), these five individuals were mixed with the indica varieties in the phylogenetic analysis, like in our study. The average F ST between jap_landrace and jap_weedy was only 0.02 because of the complex and frequent gene exchange between these two subgroups (Figure 1d). This was confirmed by phylogenetic and population structure pattern analysis.

To estimate the linkage disequilibrium (LD) patterns in different rice groups, we calculated r 2 between pairs of SNPs using PLINK. We quantified the average extent of the genome‐wide LD decay distance in wild rice, Korean landrace rice, and Korean weedy rice. These estimates were approximately 11, 72 and 80 kb in these three groups, respectively, where the r 2 dropped to half from the highest value (0.50, 0.53 and 0.66 respectively) (Figure S1). The LD decay of wild rice here was the same as reported previously. Because our weedy and landrace groups included indica and japonica subgroups, the LD decay was lower than japonica and larger than indica, as expected.

Recent positive and divergent selection

Population studies suggested that weedy rice formed from modern rice and not wild rice. LD patterns showed large differences in genome patterns between landrace and weedy rice, although the population study suggested that landrace rice and weedy rice undergo frequent gene exchange. To find the adaptive genome patterns of weedy rice, we employed five different methods to detect selective signatures between Korean landrace rice and weedy rice. Five distinct metrics of natural selection using a 100‐kb sliding window or per site across the whole genome were used; that is, differentiation (F ST), reduction in diversity (ROD), XP‐CLR, iHS and XP‐EHH. From these data, we classified the empirical top 2.5% of the windows or regions as ‘selection outliers’. After annotation, we found 1068, 821, 552, 536 and 1329 genes over the outliers using ROD, F ST, XP‐EHH, iHS and XP‐CLR respectively (Table S3). Most of the significantly selected genes (top 2.5%) occurred only one time among different selection approaches, suggesting that each statistic method will give a different view of selection. It is similar with previous study on Populus trichocarpa, that different selective forces are shaping different genomic regions (Evans et al., 2014). We also termed the regions in the top 2.5% for at least two of the selection scan metrics as candidate selection genes. Finally, 537 genes were detected as candidate selection genes (Table S4 and Figure 2a).

Figure 2.

Figure 2

Unique and shared genomic regions (genes) among five selection scans. (a) Venn diagram of the number of genes throughout the genome in the top 2.5% for each selection scan. (b) GO enrichment of candidate selection genes. The blue bar indicates candidate selection genes, and the green bar shows the rice reference genome genes (c) KEGG pathway enrichment of candidate selection genes.

To explore the biological functions of 537 candidate selective genes, we performed a functional enrichment analysis to identify gene ontology terms. In total, 27 significantly enriched GO terms (< 0.05) were characterized in three major categories: biological process, cellular component and molecular function (Figure 2b and Table S5). By using the rice GO terms as background, the candidate selected genes were enriched in response to stress, metabolic process and cellular process in biological process; extracellular region, membrane, cell and cell part in cellular component; organic acid transmembrane transporter activity, carboxylic acid transmembrane transporter activity and other 18 categories in molecular function (Figure 2b). Meanwhile, we performed KEGG pathway analysis for these 537 genes, which were enriched into 59 pathways, especially ascorbate and aldarate metabolism, RNA transport, histidine metabolism and Porphyrin and chlorophyII metabolism pathways (Figure 2c and Table S6).

Furthermore, we clustered those candidate selective genes against published database by using the PantGSEA. All genes were clustered into 90 independent experiment based data sets. We clustered these experiment‐based data sets into three categories: plant development or growth‐related categories, which are concentrated in tissue‐specific expression genes; abiotic stress‐related categories, including genes regulated by abiotic stress, such as cold stress, drought stress, salinity stress, Pi starvation stress, Cr stress, Fe stress, auxin stress etc.; and biotic stress‐related categories, which were regulated by agrobacterium infection, blast fungus, bacterium infection in rice (Table S7). Among the 537 genes, 36% were related to plant development, 18% were related to abiotic stress, 17% were related to biotic stress and the remaining were still unknown in current rice database (Figure 3a). Some genes played different roles in different biological processes, in which 21 genes were clustered into three categories, and 66 genes shared in two different categories (Figure 3b).

Figure 3.

Figure 3

Candidate selected genes clustered based on public databases. (a) Functional category of candidate selected genes. (b) Venn diagram of three functional category. (c) The ‘Biotic Stress overiew’ MapMan pathway was used to visualize candidate genes. Red square are genes. (d) Genes involved in receptor‐like kinases. Red square are genes.

To check the RNA expression pattern in specific tissues, we performed k‐means co‐expression clustering for these candidate selection genes. We downloaded the expression patterns of 11 different rice tissues from RiceXPro database (http://ricexpro.dna.affrc.go.jp). Three hundred and ninety‐seven candidate selection genes were detected among this database. We clustered ten co‐expression clusters (Figure S2 and Table S9) across all 11 tissues. Genes were defined as tissue‐specific expressed genes if their tissue expression pattern deviation over twofold of standard deviation among the cluster. Cluster 10 was removed from the tissue‐specific expression genes due to very low expression level. Finally, we found six tissue‐specific expression clusters (cluster 1, 2, 3, 4, 8, 9) among the ten clusters (Figure S2). Cluster 1, 4, 8, 9 have high expression patterns in rice leaf, endosperm, embryo and root respectively. We found two pleiotropic drug resistance (PDR) family genes in cluster 9, PDR8 and PDR9 (Table S9). The PDR9 in rice were significantly induced by jasmonates, which are involved in plant defence (Moons, 2008). We also found two receptor‐like cytoplasmic kinase (RLCK) family gene cluster 1, which play roles in development and stress response in plants (Vij et al., 2008). It is suggested that genes in cluster 1, 4, 8, 9 may play important roles in weedy rice development and stress response. Cluster 2 and 3 show low expression levels in endosperm and leaf respectively. These genes in cluster 2 and 3 may not play important roles for rice endosperm and leaf development, but important for other rice tissue development.

In order to obtain a global view of gene functions in biotic stress, we used MapMan analysis, which groups candidate gene sets into hierarchical functional categories on the basis of putative involvement in biotic stress (Figure 3c). Four hundred and forty‐eight genes out of 537 candidate genes were mapped to MapMan rice database. Among that, 107 genes were mapped to biotic stress pathway. Including five genes related to respiratory burst, 46 genes related to signalling, seven are transcription factors, eight secondary metabolites, two heat shock proteins, ten PR‐proteins, five related to hormone signalling, nine cell wall genes, 19 proteolysis related genes and one beta glucanase. These annotations provided a valuable source of data for investigation of processes, function and pathways involved in the reaction to biotic stress in weedy rice. When we preformed the same strategy to overview the receptor‐like kinases, 29 genes were enriched. Most of them (18) have LRR motifs, which were usually involved in protein–protein interactions and played an important role in regulating plant development and defence (Diévart and Clark, 2004). Six genes are receptor‐like cytoplasmatic kinases, which may related to rice development or stress (Vij et al., 2008). Five genes are S‐locus genes, which related to self‐incompatibility responses of crucifers (Xing et al., 2013) (Figure 3d).

Discussion

Weedy rice is a noxious agriculture pest that has significantly reduced the grain yield of cultivated rice worldwide (He et al., 2014; Qiu et al., 2014). However, it has recently been realized that weedy rice possesses a number of adaptive traits lacking in cultivated rice, such as high reproduction and both biostress and abiostress tolerance (Green et al., 2001), so it may become an important resource for improving current cultivated rice. Knowledge regarding the origin, genetic diversity, structure and adaptive genomic loci of weedy rice populations will facilitate the design of effective methods to control and use this weed.

Origin of Korean weedy rice

Most previous population genetics studies of weedy rice used limited polymorphism markers. Here, we used the whole‐genome resequencing method and detected more than two million high‐quality SNPs from 30 weedy rice, 30 landrace rice and ten wild rice samples. Using these high‐density SNP makers (approximately 5.2 SNP/kb) will provide a more confident phylogenetic relationship and admixture pattern among these three rice populations.

The results of this study suggest that the weedy rice in Korea originated directly from japonica/japonica or indica/indica rice hybridization rather than from wild rice. Three major results support this speculation. (i) The phylogenetic tree based on 2.2 million SNPs indicated that 30 weedy rice samples clearly clustered into two major subgroups with japonica landrace and indica landrace rice, respectively. Weedy rice was more closely related to landrace than wild rice in either the japonica group or indica group, based on the evolutionary tree topology (Figure 1a). (ii) The results of the ancestry study of 70 samples showed that weedy rice divided into ind_weedy and jap_weedy subgroups and that landrace rice divided into ind_landrace and jap_landrace subgroups (Figure 1b). From = 3 to = 6, two major weedy rice subgroups (G1, G3) were separated from indica landrace and japonica landrace rice. Another main weedy subgroup, G2, had many admixture patterns from jap_landrace and other jap_weedy. There were no wild rice patterns in weedy rice. Therefore, we propose that ind_weedy has only two genomic components: ind_landrace and ind_weedy. It was clearly demonstrated that jap_weedy had only two genomic components: jap_landrace and jap_weedy. (iii) The PCA study showed that weedy rice clearly clustered with landrace rice and not wild rice (Figure 1c).

Genomic adaptive of weedy rice

Determining the influences of positive and purifying selection, as well as neutral forces in shaping genetic variation is the primary goal of evolutionary biology (Evans et al., 2014). Genetic diversity and phenotypic plasticity allow weeds to exploit novel and diverse opportunities as they occur in and infest agroecosystems (Dekker, 1997), there has long been an interest in understanding the genetic basis of weedy rice adaptation (Delouche and Labrada, 2007; He et al., 2014; Qiu et al., 2014). Understanding the adaptive genomic loci is a prerequisite for designing relevant strategies for effective control and management of different types of weeds in agroecosystems, as well as using these adaptive loci to improve cultivated rice. With the large genome‐wide data sets for 70 rice variations discussed here, there is now an unprecedented opportunity for investigating issues such as the extent to which phenotypic differences among weedy rice and cultivated rice (O. sativa) populations are driven by natural selection.

In this study, we concluded that weedy rice formed from cultivated rice. We performed positive selection or adaptive genomic loci identification using five different approaches between weedy and landrace rice. The ROD between two populations was used to detect reduced diversity at putatively neutral sites. F ST was used to detect excess SNPs (or regions) with extreme population differentiation. XP‐CLR, which is not sensitive to SNP ascertainment bias, was used to detect the selective sweeps based on multilocus allele frequency differentiation between two populations. XP‐EHH and iHS were used to detect ongoing or nearly fixed selective sweeps by comparing haplotypes from two populations. A total of 537 genes were detected by at least two approaches, and this included many important genes; that is, the abiotic stress‐related genes SNAC3 on chromosome 1 (Fang et al., 2015), OsAsr1 on chromosome 2 (Vaidyanathan et al., 1999), and OsGIRL1 on chromosome 2 (Park et al., 2014); the panicle morphology‐related gene OsRCN2 on chromosome 2 (Nakagawa et al., 2002); biotic stress‐related genes PDR8 on chromosome 1 (Stein et al., 2006) and Snl6 on chromosome 1 (Bart et al., 2010); the ion channel gene related to biotic and abiotic stresses OsCNGC on chromosome 12 (Nawaz et al., 2014); the chlorophyll IIb biosynthesis‐related gene CAO on chromosome 11 (Oster et al., 2000); tiller development‐related gene SAD1 on chromosome 8 (Li et al., 2015); and others (Table S4). It is well known that weedy rice always accompanied by red pericarp, while most cultivated rice have white pericarp, which mainly because of the 14 bp deletion of Rc gene on chromosome 7. Rc was already proved as the domesticated gene in rice domestication. But we did not find this gene among all the candidate selective genes. This is because we used all SNPs without Indels for selective signature detection. But selective sweeps always clustered or had selective blocks on chromosomes due to the linkage disequilibrium or other reasons. We mapped all 537 genes to the whole rice chromosomes. It is clearly shown that these candidate genes clustered on certain regions of rice chromosomes (Figure S3). When we checked the Rc region, two genes (Os07g0200700 and Os07g0222300) were found in 600 kb frank region of Rc (Figure S3). Around these two genes may have selective block and genetic linked with Rc. There's a big selective block from 8.7 to 13.6 m on chromosome 5 (Figure S3). However, we are did not find the popular domesticated genes in this region. We presume that this region may became hot research region for weedy rice study in the future. Although most of the candidate selection genes have not been cloned and their functions are not yet clear, most of these genes appear to be related to very important functions for weedy rice or cultivated rice. Os01g0191300 (SNAC3) (Fang et al., 2015), which was detected using F ST and XP‐CLR, was recently confirmed to be related to drought and heat tolerance in rice, providing us a stronger reason to suggest that most of the candidate selected genes played important roles in rice or weedy rice evolution.

We noticed that the genes enriched in response to stress gene ontology terms, with rich factor 0.02 in biological process by using GO analysis (Figure 2b and Table S5). KEGG pathway enrichment suggested roles of these genes in histidine metabolism, which drives the reproduction and plant development‐related pathway (Figure 2c and Table S6). These supported the idea that weedy rice has high reproduction (Song et al., 2009) and high stress‐resistant capacity (Chen et al., 2004; Liu et al., 2014; Lu et al., 2014; Qiu et al., 2014; Sun et al., 2013). Meanwhile there are additional 26 GO terms (< 0.05) and 58 pathways were enriched among these 537 genes. It is suggested that the adaptive regions in weedy rice may not only just relate to stress or plant development but also relate to many other biological progress. The tissue‐specific expression pattern study suggested that these candidate selection genes plays different roles at different tissues (Table S9 and Figure S2). The further study based on published databases suggested that most of the genes were related to the plant development and stress resistance. Except 29% unknown genes, others are all related to plant development, abiotic stress and biotic stress among 537 candidate genes (Figure 3a). Among those, 107 genes were enriched in biotic stress pathway, based on MapMan database (Figure 3c). These genes are distributed mostly in the sub‐biological process response to biotic stresses, especially in signalling and proteolysis processes. Moreover, many of the candidate genes are LRR motif enriched gene, receptor‐like cytoplasmatic kinases genes and S‐locus genes (Figure 3d), which are related to plant development, defence and self‐incompatibility based on previous studies (Diévart and Clark, 2004; Vij et al., 2008; Xing et al., 2013). In summary, these results suggested that weedy rice population significantly accumulated plant development and defence‐related gene variations during the evolution. This may be the main reason why weedy rice exploit novel and diverse opportunities as they occur in and infest agroecosystems.

The results of this study using high‐density polymorphism information clearly showed that Korean weedy rice formed by hybridization of O. sativa rather than wild rice. The weedy rice populations contain many genomic adaptive loci, which are different from other populations. These valuable candidate‐selective genes or regions will make weedy rice a good resource to improve cultivated rice in the future, and the total 2.2 million high‐quality SNPs generated here can be used for multiple objectives in the further rice genetics, genomics or evolution studies.

Experimental procedures

Plant materials

In 2010, we developed one core set for 4406 worldwide varieties, which collected from the National Genebank of the Rural Development Administration (RDA‐Genebank, Republic of Korea) using the program PowerCore (Kim et al., 2007; Zhao et al., 2010). Among those, there are 30 Korean landrace rice and 30 Korean weedy rice accessions. These 60 available accessions were maintained by selfing at Kongju National University Experimental Farm. Each accession was transplanted in two rows, with 15 cm between plants and 30 cm between rows. Field management essentially followed normal agricultural practice. Total genomic DNA from a single plant of each accession was extracted from the leaf tissues by using the DNeasy Plant Mini Kit (Qiagen).

Whole‐genome resequencing, SNP calling and structure variation calling

In previous study, we resequenced 137 core rice accessions (Kim et al., 2016). Here, we use 60 of 137 sequence data and ten wild rice sequence data for this study. HiSeq 2500 were used for whole‐genome resequencing of the 60 rice accessions (Kim et al., 2016). Raw sequences were first processed to remove residual adapter sequences from the reads using Trimmomatic ver 0.36 (http://www.usadellab.org/cms/index.php?page=trimmomatic). Next, high‐quality reads were aligned to the rice reference genome IRGSP‐1.0 (http://rapdb.dna.affrc.go.jp/download/irgsp1.html) using the Burrows–Wheeler Aligner (BWA) (version 0.7.5a) with the default parameters (Li and Durbin, 2009). The reads will be removed if it did not meet BWA quality criteria or did not align to the reference genome. Duplicate reads were removed by using PICARD (version 1.88) (http://broadinstitute.github.io/picard/). Regional realignment and quality score recalibration were carried out using the Genome Analysis Toolkit (version 2.3.9 Lite) (McKenna et al., 2010), and then variations were identified with ≥3× read depth coverage. Overall, the mapping depth was about 10× on average (Table S2). Sequence data for ten wild rice accessions (O. rufipogon,= 5; O. nivara,= 5) were downloaded from the National Centre for Biotechnology Information under accession number of SRA023116, and the same method, used for the 60 accessions, was used for SNP calling. The copy number variation study was performed by CNVnator (Abyzov et al., 2011). Inversion and translocation were detected by using DELLY program (Rausch et al., 2012).

Population structure and positive selection analysis

The population phylogenetic tree was constructed by the maximum likelihood method using RaxML (Stamatakis, 2014), and FigTree (Rambaut, 2007) was used to display the tree. We used a maximum likelihood method‐based program, Frappe (Tang et al., 2005), to generate the population structure. For positive selection, we used VCFtools (Danecek et al., 2011) for nucleotide diversity and F ST analysis. We performed XP‐CLR using the XP‐CLR program (http://genepath.med.harvard.edu/~reich). XP‐EHH and iHS were performed by using selscan software (Szpiech and Hernandez, 2014).

Gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway

All candidate selective genes were used to GO and KEGG analysis. GO enrichment analysis was performed with PlantGSEA (Yi et al., 2013) with the ‘Oryza sativa’ set as species background. KEGG pathway was performed by KOBAS (Xie et al., 2011).

Author contributions

K.K.W and P.Y.J supervised the projects. K.K.W and P.Y.J contributed materials and analysis tools. H.Q, K.K.W and P.Y.J designed the research and wrote the manuscript.

Supporting information

Figure S1 Different linkage disequilibrium(LD) decay patterns of weedy rice, landrace rice and wild rice.

Figure S2 Co‐expression patterns of candidate selective genes.

Figure S3 The distribution of all candidate‐selective genes across rice chromosomes.

PBI-15-357-s006.docx (1.3MB, docx)

Table S1 Materials used in this study.

Table S2 Summary of mapping information.

Table S3 Genes over selection outliers by five different selection metrics.

Table S4 Candidate selection genes with annotation.

PBI-15-357-s005.xlsx (124.7KB, xlsx)

Table S5 Gene ontology of candidate‐selective genes.

PBI-15-357-s007.xlsx (103.7KB, xlsx)

Table S6 KEGG pathway of candiate‐selective genes.

PBI-15-357-s001.xlsx (54.4KB, xlsx)

Table S7 Biological function of candidate‐selective genes.

PBI-15-357-s002.xlsx (64KB, xlsx)

Table S8 High‐quality SNPs used in this study.

PBI-15-357-s003.zip (23.2MB, zip)

Table S9 Tissue‐specific expression patterns of candidate‐selective genes.

PBI-15-357-s004.xlsx (209.8KB, xlsx)

Acknowledgements

This work was carried out with the support of “Cooperative Research Program for Agriculture Science & Technology Development (Project No. PJ01116101)” Rural Development Administration, Republic of Korea. And also supported by Basic Science Research Program through the National Research Foundation of Korea(NRF) funded by the Ministry of Education (NRF‐2014R1A1A2059399).

Reference

  1. Abyzov, A. , Urban, A.E. , Snyder, M. and Gerstein, M. (2011) CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 21, 974–984. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Akey, J.M. , Zhang, G. , Zhang, K. , Jin, L. and Shriver, M.D. (2002) Interrogating a high‐density SNP map for signatures of natural selection. Genome Res. 12, 1805–1814. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bart, R.S. , Chern, M. , Vega‐Sánchez, M.E. , Canlas, P. and Ronald, P.C. (2010) Rice Snl6, a cinnamoyl‐CoA reductase‐like gene family member, is required for NH1‐mediated immunity to Xanthomonas oryzae pv. oryzae. PLoS Genet. 6, e1001123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bres‐Patry, C. , Bangratz, M. and Ghesquiere, A. (2001) Genetic diversity and population dynamics of weedy rice in Camargue area [France]. Genet. Sel. Evol. 33, 425–440. [Google Scholar]
  5. Cao, Q. , Lu, B.‐R. , Xia, H. , Rong, J. , Sala, F. , Spada, A. and Grassi, F. (2006) Genetic diversity and origin of weedy rice (Oryza sativa f. spontanea) populations found in north‐eastern China revealed by simple sequence repeat (SSR) markers. Ann. Bot. 98, 1241–1252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Cao, Q.J. , Xia, H. , Yang, X. and Lu, B.R. (2009) Performance of hybrids between weedy rice and insect‐resistant transgenic rice under field experiments: implication for environmental biosafety assessment. J. Integr. Plant Biol. 51, 1138–1148. [DOI] [PubMed] [Google Scholar]
  7. Chen, L.J. , Lee, D.S. , Song, Z.P. , Suh, H.S. and LU, B.R. (2004) Gene flow from cultivated rice (Oryza sativa) to its weedy and wild relatives. Ann. Bot. 93, 67–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chen, H. , Patterson, N. and Reich, D. (2010) Population differentiation as a test for selective sweeps. Genome Res. 20, 393–402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chung, J.W. and Park, Y.J. (2010) Population structure analysis reveals the maintenance of isolated sub‐populations of weedy rice. Weed Res. 50, 606–620. [Google Scholar]
  10. Danecek, P. , Auton, A. , Abecasis, G. , Albers, C.A. , Banks, E. , DePristo, M.A. , Handsaker, R.E. et al. (2011) The variant call format and VCFtools. Bioinformatics, 27, 2156–2158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. De Wet, J. and Harlan, J.R. (1975) Weeds and domesticates: evolution in the man‐made habitat. Econ. Bot. 29, 99–108. [Google Scholar]
  12. Dekker, J. (1997) Weed diversity and weed management. Weed Sci. 45, 357–363. [Google Scholar]
  13. Delouche, J.C. and Labrada, R. (2007) Weedy rices: origin, biology, ecology and control. Food & Agriculture Org. 188, 45–93. [Google Scholar]
  14. Diévart, A. and Clark, S.E. (2004) LRR‐containing receptors regulating plant development and defense. Development, 131, 251–261. [DOI] [PubMed] [Google Scholar]
  15. Evans, L.M. , Slavov, G.T. , Rodgers‐Melnick, E. , Martin, J. , Ranjan, P. , Muchero, W. , Brunner, A.M. et al. (2014) Population genomics of Populus trichocarpa identifies signatures of selection and adaptive trait associations. Nat. Genet. 46, 1089–1096. [DOI] [PubMed] [Google Scholar]
  16. Fang, Y. , Liao, K. , Du, H. , Xu, Y. , Song, H. , Li, X. and Xiong, L. (2015) A stress‐responsive NAC transcription factor SNAC3 confers heat and drought tolerance through modulation of reactive oxygen species in rice. J. Exp. Bot. doi: 10.1093/jxb/erv386. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Gealy, D.R. , Tai, T.H. and Sneller, C.H. (2002) Identification of red rice, rice, and hybrid populations using microsatellite markers. Weed Sci. 50, 333–339. [Google Scholar]
  18. Green, J. , Barker, J. , Marshall, E. , Froud‐Williams, R. , Peters, N. , Arnold, G. , Dawson, K. et al. (2001) Microsatellite analysis of the inbreeding grass weed Barren Brome (Anisantha sterilis) reveals genetic diversity at the within‐and between‐farm scales. Mol. Ecol. 10, 1035–1045. [DOI] [PubMed] [Google Scholar]
  19. Gross, B.L. , Reagon, M. , Hsu, S.C. , Caicedo, A.L. , Jia, Y. and Olsen, K.M. (2010) Seeing red: the origin of grain pigmentation in US weedy rice. Mol. Ecol. 19, 3380–3393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Hak‐Soo, S. and Mun‐Hue, H. (1992) Collection and evaluation of Korean red rices I. Regional distribution and seed characteristics. Korean J. Crop Sci. 37, 425–430. [Google Scholar]
  21. He, Q. and Park, Y.‐J. (2015) Discovery of a novel fragrant allele and development of functional markers for fragrance in rice. Mol. Breed. 35, 1–10. [Google Scholar]
  22. He, Z. , Jiang, X. , Ratnasekera, D. , Grassi, F. , Perera, U. and Lu, B.‐R. (2014) Seed‐mediated gene flow promotes genetic diversity of weedy rice within populations: implications for weed management. PLoS ONE, 9, e112778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. He, Q. , Yu, J. , Kim, T.‐S. , Cho, Y.‐H. , Lee, Y.‐S. and Park, Y.‐J. (2015) Resequencing reveals different domestication rate for BADH1 and BADH2 in rice (Oryza sativa). PLoS ONE, 10, e0134801. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Heu, M. (1988) Weed rice” Sharei” showing closer cross‐affinity to Japonica type. Rice Genet. Newsl. 5, 72–74. [Google Scholar]
  25. Huang, X. , Wei, X. , Sang, T. , Zhao, Q. , Feng, Q. , Zhao, Y. , Li, C. et al. (2010) Genome‐wide association studies of 14 agronomic traits in rice landraces. Nat. Genet. 42, 961–967. [DOI] [PubMed] [Google Scholar]
  26. Huang, X. , Kurata, N. , Wei, X. , Wang, Z.‐X. , Wang, A. , Zhao, Q. , Zhao, Y. et al. (2012a) A map of rice genome variation reveals the origin of cultivated rice. Nature, 490, 497–501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Huang, X. , Zhao, Y. , Wei, X. , Li, C. , Wang, A. , Zhao, Q. , Li, W. et al. (2012b) Genome‐wide association study of flowering time and grain yield traits in a worldwide collection of rice germplasm. Nat. Genet. 44, 32–39. [DOI] [PubMed] [Google Scholar]
  28. Ishikawa, R. , Toki, N. , Imai, K. , Sato, Y. , Yamagishi, H. , Shimamoto, Y. , Ueno, K. et al. (2005) Origin of weedy rice grown in Bhutan and the force of genetic diversity. Genet. Resour. Crop Evol. 52, 395–403. [Google Scholar]
  29. Jin, J. , Huang, W. , Gao, J.‐P. , Yang, J. , Shi, M. , Zhu, M.‐Z. , Luo, D. et al. (2008) Genetic control of rice plant architecture under domestication. Nat. Genet. 40, 1365–1369. [DOI] [PubMed] [Google Scholar]
  30. Kelly Vaughan, L. , Ottis, B.V. , Prazak‐Havey, A.M. , Bormans, C.A. , Sneller, C. , Chandler, J.M. and Park, W.D. (2001) Is all red rice found in commercial rice really Oryza sativa? Weed Sci. 49, 468–476. [Google Scholar]
  31. Kim, K.‐W. , Chung, H.‐K. , Cho, G.‐T. , Ma, K.‐H. , Chandrabalan, D. , Gwag, J.‐G. , Kim, T.‐S. et al. (2007) PowerCore: a program applying the advanced M strategy with a heuristic search for establishing core sets. Bioinformatics, 23, 2155–2162. [DOI] [PubMed] [Google Scholar]
  32. Kim, T.‐S. , He, Q. , Kim, K.‐W. , Yoon, M.‐Y. , Ra, W.‐H. , Li, F.P. , Tong, W. et al. (2016) Genome‐wide resequencing of KRICE_CORE reveals their potential for future breeding, as well as functional and evolutionary studies in the post‐genomic era. BMC Genom., 17, 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Kou, Y. and Wang, S. (2012) Toward an understanding of the molecular basis of quantitative disease resistance in rice. J. Biotechnol. 159, 283–290. [DOI] [PubMed] [Google Scholar]
  34. Li, H. and Durbin, R. (2009) Fast and accurate short read alignment with Burrows‐Wheeler transform. Bioinformatics, 25, 1754–1760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Li, C. , Zhou, A. and Sang, T. (2006) Rice domestication by reducing shattering. Science, 311, 1936–1939. [DOI] [PubMed] [Google Scholar]
  36. Li, W. , Yoshida, A. , Takahashi, M. , Maekawa, M. , Kojima, M. , Sakakibara, H. and Kyozuka, J. (2015) SAD1, an RNA polymerase I subunit A34. 5 of rice, interacts with Mediator and controls various aspects of plant development. Plant J. 81, 282–291. [DOI] [PubMed] [Google Scholar]
  37. Liang, D. and Qiang, S. (2011) Current situation and control strategy of weedy rice in China. China Plant Prot. 31, 21–24. [Google Scholar]
  38. Liu, Y. , Jia, Y. , Qi, X. , Olsen, K. , Caicedo, A. and Gealy, D. (2014) Insights into molecular mechanism of blast resistance in weedy rice. In APS Annual Meeting. pp. 67–70. [Google Scholar]
  39. Londo, J. and Schaal, B. (2007) Origins and population genetics of weedy red rice in the USA. Mol. Ecol. 16, 4523–4535. [DOI] [PubMed] [Google Scholar]
  40. Lu, Y.‐L. , Burgos, N.R. , Wang, W.‐X. and Yu, L.‐Q. (2014) Transgene flow from Glufosinate‐resistant rice to improved and weedy rice in China. Rice Sci. 21, 271–281. [Google Scholar]
  41. McKenna, A. , Hanna, M. , Banks, E. , Sivachenko, A. , Cibulskis, K. , Kernytsky, A. , Garimella, K. et al. (2010) The genome analysis toolkit: a MapReduce framework for analyzing next‐generation DNA sequencing data. Genome Res. 20, 1297–1303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Moons, A. (2008) Transcriptional profiling of the PDR gene family in rice roots in response to plant growth regulators, redox perturbations and weak organic acid stresses. Planta, 229, 53–71. [DOI] [PubMed] [Google Scholar]
  43. Nakagawa, M. , Shimamoto, K. and Kyozuka, J. (2002) Overexpression of RCN1 and RCN2, rice TERMINAL FLOWER 1/CENTRORADIALIS homologs, confers delay of phase transition and altered panicle morphology in rice. Plant J. 29, 743–750. [DOI] [PubMed] [Google Scholar]
  44. Nawaz, Z. , Kakar, K.U. , Saand, M.A. and Shu, Q.‐Y. (2014) Cyclic nucleotide‐gated ion channel gene family in rice, identification, characterization and experimental analysis of expression response to plant hormones, biotic and abiotic stresses. BMC Genom. 15, 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Nei, M. and Li, W.‐H. (1979) Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc. Natl Acad. Sci. 76, 5269–5273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Nielsen, R. , Williamson, S. , Kim, Y. , Hubisz, M.J. , Clark, A.G. and Bustamante, C. (2005) Genomic scans for selective sweeps using SNP data. Genome Res. 15, 1566–1575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Noldin, J.A. , Chandler, J.M. and McCauley, G.N. (1999) Red rice (Oryza sativa) biology. I. Characterization of red rice ecotypes. Weed Technol. 13, 12–18. [Google Scholar]
  48. Olofsdotter, M. , Valverde, B. and Madsen, K.H. (2000) Herbicide resistant rice (Oryza sativa L.): global implications for weedy rice and weed management*. Ann. Appl. Biol. 137, 279–295. [Google Scholar]
  49. Oster, U. , Tanaka, R. , Tanaka, A. and Rüdiger, W. (2000) Cloning and functional expression of the gene encoding the key enzyme for chlorophyll b biosynthesis (CAO) from Arabidopsis thaliana. Plant J. 21, 305–310. [DOI] [PubMed] [Google Scholar]
  50. Park, S. , Moon, J.‐C. , Park, Y.C. , Kim, J.‐H. , Kim, D.S. and Jang, C.S. (2014) Molecular dissection of the response of a rice leucine‐rich repeat receptor‐like kinase (LRR‐RLK) gene to abiotic stresses. J. Plant Physiol. 171, 1645–1653. [DOI] [PubMed] [Google Scholar]
  51. Qiu, J. , Zhu, J. , Fu, F. , Ye, C.‐Y. , Wang, W. , Mao, L. , Lin, Z. et al. (2014) Genome re‐sequencing suggested a weedy rice origin from domesticated indica‐japonica hybridization: a case study from southern China. Planta, 240, 1353–1363. [DOI] [PubMed] [Google Scholar]
  52. Rambaut, A. (2007) FigTree, a graphical viewer of phylogenetic trees.
  53. Rausch, T. , Zichner, T. , Schlattl, A. , Stütz, A.M. , Benes, V. and Korbel, J.O. (2012) DELLY: structural variant discovery by integrated paired‐end and split‐read analysis. Bioinformatics, 28, i333–i339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Reagon, M. , Thurber, C.S. , Gross, B.L. , Olsen, K.M. , Jia, Y. and Caicedo, A.L. (2010) Genomic patterns of nucleotide diversity in divergent populations of US weedy rice. BMC Evol. Biol. 10, 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Sabeti, P.C. , Reich, D.E. , Higgins, J.M. , Levine, H.Z. , Richter, D.J. , Schaffner, S.F. , Gabriel, S.B. et al. (2002) Detecting recent positive selection in the human genome from haplotype structure. Nature, 419, 832–837. [DOI] [PubMed] [Google Scholar]
  56. Sabeti, P.C. , Varilly, P. , Fry, B. , Lohmueller, J. , Hostetter, E. , Cotsapas, C. , Xie, X. et al. (2007) Genome‐wide detection and characterization of positive selection in human populations. Nature, 449, 913–918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Song, X. , Liu, L. , Wang, Z. and Qiang, S. (2009) Potential gene flow from transgenic rice (Oryza sativa L.) to different weedy rice (Oryza sativa f. spontanea) accessions based on reproductive compatibility. Pest Manag. Sci. 65, 862–869. [DOI] [PubMed] [Google Scholar]
  58. Stamatakis, A. (2014) RAxML version 8: a tool for phylogenetic analysis and post‐analysis of large phylogenies. Bioinformatics, 30, 1312–1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Stein, M. , Dittgen, J. , Sánchez‐Rodríguez, C. , Hou, B.‐H. , Molina, A. , Schulze‐Lefert, P. , Lipka, V. et al. (2006) Arabidopsis PEN3/PDR8, an ATP binding cassette transporter, contributes to nonhost resistance to inappropriate pathogens that enter by direct penetration. Plant Cell, 18, 731–746. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Sun, J. , Qian, Q. , Ma, D.R. , Xu, Z.J. , Liu, D. , Du, H.B. and Chen, W.F. (2013) Introgression and selection shaping the genome and adaptive loci of weedy rice in northern China. New Phytol. 197, 290–299. [DOI] [PubMed] [Google Scholar]
  61. Szpiech, Z.A. and Hernandez, R.D. (2014) Selscan: an efficient multithreaded program to perform EHH‐based scans for positive selection. Mol. Biol. Evol. 31, 2824–2827. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Tang, H. , Peng, J. , Wang, P. and Risch, N.J. (2005) Estimation of individual admixture: analytical and study design considerations. Genet. Epidemiol. 28, 289–301. [DOI] [PubMed] [Google Scholar]
  63. Tang, K. , Thornton, K.R. and Stoneking, M. (2007) A new approach for using genome scans to detect recent positive selection in the human genome. PLoS Biol. 5, e171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Thurber, C.S. , Reagon, M. , Gross, B.L. , Olsen, K.M. , Jia, Y. and Caicedo, A.L. (2010) Molecular evolution of shattering loci in US weedy rice. Mol. Ecol. 19, 3271–3284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Vaidyanathan, R. , Kuruvilla, S. and Thomas, G. (1999) Characterization and expression pattern of an abscisic acid and osmotic stress responsive gene from rice. Plant Sci. 140, 21–30. [Google Scholar]
  66. Vaughan, D.A. , Lu, B.‐R. and Tomooka, N. (2008) The evolving story of rice evolution. Plant Sci. 174, 394–408. [Google Scholar]
  67. Vij, S. , Giri, J. , Dansana, P.K. , Kapoor, S. and Tyagi, A.K. (2008) The receptor‐like cytoplasmic kinase (OsRLCK) gene family in rice: organization, phylogenetic relationship, and expression during development and stress. Mol. Plant, 1, 732–750. [DOI] [PubMed] [Google Scholar]
  68. Voight, B.F. , Kudaravalli, S. , Wen, X. and Pritchard, J.K. (2006) A map of recent positive selection in the human genome. PLoS Biol. 4, e72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Williamson, S.H. , Hubisz, M.J. , Clark, A.G. , Payseur, B.A. , Bustamante, C.D. and Nielsen, R. (2007) Localizing recent adaptive evolution in the human genome. PLoS Genet. 3, e90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Xia, H.‐B. , Wang, W. , Xia, H. , Zhao, W. and Lu, B.‐R. (2011) Conspecific crop‐weed introgression influences evolution of weedy rice (Oryza sativa f. spontanea) across a geographical range. PLoS ONE, 6, e16189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Xie, C. , Mao, X. , Huang, J. , Ding, Y. , Wu, J. , Dong, S. , Kong, L. et al. (2011) KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Res. 39, W316–W322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Xing, S. , Li, M. and Liu, P. (2013) Evolution of S‐domain receptor‐like kinases in land plants and origination of S‐locus receptor kinases in Brassicaceae. BMC Evol. Biol. 13, 69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Xiong, H. , Xu, H. , Xu, Q. , Zhu, Q. , Gan, S. , Feng, D. , Zhang, X. et al. (2012) Origin and evolution of weedy rice revealed by inter‐subspecific and inter‐varietal hybridizations in rice. Mol. Plant Breed. 10, 131–139. [Google Scholar]
  74. Xu, X. , Liu, X. , Ge, S. , Jensen, J.D. , Hu, F. , Li, X. , Dong, Y. et al. (2012) Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes. Nat. Biotechnol. 30, 105–111. [DOI] [PubMed] [Google Scholar]
  75. Yi, X. , Du, Z. and Su, Z. (2013) PlantGSEA: a gene set enrichment analysis toolkit for plant community. Nucleic Acids Res. 41, W98–W103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Zhao, W. , Cho, G.‐T. , Ma, K.‐H. , Chung, J.‐W. , Gwag, J.‐G. and Park, Y.‐J. (2010) Development of an allele‐mining set in rice using a heuristic algorithm and SSR genotype data with least redundancy for the post‐genomic era. Mol. Breed. 26, 639–651. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1 Different linkage disequilibrium(LD) decay patterns of weedy rice, landrace rice and wild rice.

Figure S2 Co‐expression patterns of candidate selective genes.

Figure S3 The distribution of all candidate‐selective genes across rice chromosomes.

PBI-15-357-s006.docx (1.3MB, docx)

Table S1 Materials used in this study.

Table S2 Summary of mapping information.

Table S3 Genes over selection outliers by five different selection metrics.

Table S4 Candidate selection genes with annotation.

PBI-15-357-s005.xlsx (124.7KB, xlsx)

Table S5 Gene ontology of candidate‐selective genes.

PBI-15-357-s007.xlsx (103.7KB, xlsx)

Table S6 KEGG pathway of candiate‐selective genes.

PBI-15-357-s001.xlsx (54.4KB, xlsx)

Table S7 Biological function of candidate‐selective genes.

PBI-15-357-s002.xlsx (64KB, xlsx)

Table S8 High‐quality SNPs used in this study.

PBI-15-357-s003.zip (23.2MB, zip)

Table S9 Tissue‐specific expression patterns of candidate‐selective genes.

PBI-15-357-s004.xlsx (209.8KB, xlsx)

Articles from Plant Biotechnology Journal are provided here courtesy of Society for Experimental Biology (SEB) and the Association of Applied Biologists (AAB) and John Wiley and Sons, Ltd

RESOURCES