Abstract
The yield heterosis of rice is sought by farmers and strong contributes to food safety, but the quality of hybrid rice may be reduced. Therefore, developing new varieties with both high yield and good quality is a heavily researched topic in hybrid rice breeding. However, the molecular mechanism governing yield heterosis and high rice quality has not been elucidated to date. In this study, a comparative transcriptomics and genomic analysis was performed on a hybrid rice variety, Chuanyou6203 (CY6203), and its parents to investigate the molecular mechanism and gene regulation network governing the formation of yield and quality stages. A total of 66,319 SNPs and InDels between CH3203 and C106B were detected in the 5′-UTR, exon, intronic, and 3′-UTR regions according to the reference genome annotation, which involved 7473 genes. A total of 436, 70, 551, 993, and 1216 common DEGs between CY6203 and both of its parents were identified at the same stage in panicles and flag leaves. Of the common DEGs, the numbers of upregulated DEGs between CY6203 and CH3203 were all greater than those of upregulated DEGs between CY6203 and C106B in panicles and flag leaves at the booting, flowering, and middle filling stages. Approximately 40.61% of mRNA editing ratios were between 0.4 and 0.6, and 1.68% of mRNA editing events (editing ratio ≥ 0.8) in CY6203 favored one of its parents at three stages or a particular stage, suggesting that the hypothetical heterosis mechanism of CY6203 might involve dominance or epistasis. Also 15,934 DEGs were classified into 19 distinct modules that were classified into three groups by the weighted gene coexpression network analysis. Through transcriptome analysis of panicles and flag leaves in the yield and quality stages, the DEGs in the green-yellow module primarily contributed to the increase in the source of CY6203 due to an in increase in photosynthetic efficiency and nitrogen utilization efficiency, and a small number of DEGs related to the grain number added spikelet number per panicle amplified its sink. The balanced expression of the major high-quality alleles of C106B and CH3203 in CY6203 contributed to the outstanding quality of CY6203. Our transcriptome and genome analyses offer a new data set that may help to elucidate the molecular mechanism governing the yield heterosis and high quality of a hybrid rice variety.
Subject terms: Genetics, Plant sciences
Introduction
Rice is one of the world's major crops and is the staple food of more than half of the world's population. Successful application of rice heterosis significantly increased yield. In the past hundred years, the heterosis of crops with distinct yield advantages has been widely used in corn, sorghum, rape, and other crops. Researchers have frequently attempted to reveal the genetic mechanism of heterosis, and have proposed several classical genetic models, such as dominance, over-dominance, epistasis, and non-additive gene expression1. However, the molecular mechanism governing heterosis has not been elucidated.
At the DNA and mRNA levels, genomic and transcriptome analysis represent a useful method to study the genetic mechanism of heterosis. Bao et al.2 identified 595 upregulated and 25 downregulated tags in LYP9 using the serial analysis of gene expression technique and found that most of the genes were related to enhancing carbon and nitrogen assimilation. Wei et al.3 detected 3926 differentially expressed genes (DEGs) between hybrid F1, LYP9, and its parents (DEGHP) and between the parents (DEGPP) and found that the DEGs in the categories of energy metabolism and transport were enriched in DEGHP, rather than in DEGPP. Huang et al.4 believed that numerous superior alleles, such as Hd3a, OsSPL14, Gn1a, Waxy, ALK, and qSW5, exhibiting positive dominance, contributed to heterosis. Chen et al.5 concluded that the overdominant effect probably contributed to the grain number heterosis of WFYT025. The genome-wide gene expression profiles were compared to identify the genetic basis of yield heterosis of such species maize6,7, Bombyx mori8, rubber tree9, and oil palm10. These studies all showed that the DEGs and major genes contributed to heterosis.
The yield heterosis of rice is sought by farmers, but the quality of hybrid rice is reduced. The poor eating and cooking quality of most hybrid rice, especially indica hybrid rice, is mainly attributable to a high amylose content (AC), hard gel consistency (GC), and high gelatinization temperature (GT)11–13. Recently, we released a new hybrid rice variety, Chuanyou6203 (CY6203), which has the same eating and cooking qualities as the Thai rice variety. The quality of CY6203 conforms to the second grade of the State Standard of the People's Republic of China (Rice)(GB1354-2009). After the variety Chuanyou6203 (CY6203) was released, it was welcomed by farmers. And its planted area had quickly reached 200,000 hectares in rice area of southwest China. In this study, a transcriptome analysis of the panicles and flag leaves of the hybrid rice CY6203 and its parents in the formation of yield and quality stages was performed to identify the molecular modules related to yield heterosis and high quality. The data presented in this report may be help to elucidate the molecular mechanism governing yield heterosis and the formation of high quality.
Results
Performance and heterosis of Chuanyou6203
Figure 1 shows the plant type, panicle type, and grain shape of hybrid rice variety CY6203 and its parents. We investigated a total of ten traits, including five agronomic traits (SPW/D, PH, PL, ANSPP, and TGW) and five quality traits (HRR, L/W, AC, ASV, and GC) (Table 1). The results showed that there was a high significant difference (p < 0.01) between C106B and CH3203, except for ASV and GC, and a significant difference (p < 0.05) in ASV. Between CY6203 and its parents, there was a high significant difference (p < 0.01) except for PL, TGW, and ASV. There was a high significant difference (p < 0.01) between CY6203 and C106B in TGW and ASV, respectively. In PL, a high significant difference (p < 0.01) was found between CY6203 and CH3203, and there was no significant difference between CY6203 and C106B. The values of over-female-parent heterosis (OFPH) ranged from − 25.53% (L/W) to 45% (TGW) (Fig. 2). Of these values, OFPHs of agronomic traits were all positive. However, OFPHs were only detected positively in two quality traits, AC (6.71%) and ASV (15%), while strongly negative OFPHs were identified in the other three quality traits HRR (− 6.45%), L/W (− 25.53%), and GC (− 9.64%). The values of over-male-parent heterosis (OMPH) ranged from − 9.64% (GC) to 44.63% (AC). Positive OMPHs were observed in SPW (8.85%), PL (11.3%), ANSPP (27.77%), HRR (11.54%), AC (44.63%), and ASV (6.15%). In contrast, the intensities of OMPHs and OFPHs were remarkably different in PH, PL, TGW, HRR, AC, and L/W, but similar in SPW/D, ANSPP, ASV, and GC. The results suggest that the high yield of CY6203 should be primarily determined by heterosis in SPW/D and ANSPP, while its high quality maybe attributable to the balance among traits AC, ASV, and GC.
Table 1.
Traits | SPW/D (g/d) | PH (cm) | PL (cm) | ANSPP | TGW (g) | HRR (%) | L (W) | AC (%) | ASV (Grade) | GC (mm) |
---|---|---|---|---|---|---|---|---|---|---|
C106B | 0.215 ± 0.002 | 95 ± 2.15 | 25.5 ± 0.54** | 140 ± 5.31** | 20 ± 1.12 | 62 ± 1.35** | 4.7 ± 0.15** | 16.4 ± 0.64** | 6 ± 0.1 | 83 ± 2.5 |
CH3203 | 0.226 ± 0.002** | 110 ± 3.71** | 23 ± 1.5 | 130 ± 2.58 | 30 ± 2.45** | 52 ± 1.37 | 3.1 ± 0.15 | 12.1 ± 0.75 | 6.5 ± 0.1* | 83 ± 1.5 |
CY6203 | 0.246 ± 0.001**ab | 111.6 ± 1.9**ab | 25.6 ± 0.63**b | 166.1 ± 6.12**ab | 29 ± 2.5**a | 58 ± 1.33**ab | 3.5 ± 0.05**ab | 17.5 ± 0.48**ab | 6.9 ± 0.15**a | 75 ± 1.8**ab |
SPW/D: single plant weight per day; PH: plant height; PL: panicle length; ANSPP: average number of spikelets per panicle; TGW: 1000-gains weight; HRR: head rice ratio (ratio of head milled rice); L/W: grain length/grain width; AC: amylose content; ASV: alkali spreading value; GC: gel consistency. *,**Significant difference with p < 0.05, 0.01, respectively. a,bSignificance between CY6203 and C106B or CH3203.
Genomic variation between two parents of Chuanyou6203
Using the Illumina-platform, a total of ~ 175 million and ~ 65 million paired-end clean reads measuring 150 bp in length were generated with average coverage of ~ 61 × and ~ 22 × for the genomes of CH3203 and C106B, respectively (Supplementary Table S1). The GC contents of CH3203 and C106B were 43.29% and 40.73%, respectively. The high-quality reads of each parent were aligned to the Nipponbare reference genome MSU RICE GENOME ANNOTATION PROJECT RELEASE 7 (MSU 7), and their mapped ratios were 94.98% and 92.63%. A total of 66,319 SNPs and InDels between CH3203 (34,102) and C106B (32,217) were detected in the 5′-UTR, exon, intronic, and 3′-UTR regions according to the reference genome annotation (Fig. 3 and Supplementary Table S2), which involved 7473 genes. The average number of variations per gene was 8.87, and the average number of variations per 200 Kb was 35.47. Although most of the variations were randomly distributed in 12 chromosomes, there were several hot-spot regions with high-density variations (mean value = 385.4) on chromosomes 1, 2, 4, 8, 10 and 11. On 3,000,001 bp-3200000 bp of chromosomes 2, the number of variations per 200 Kb was up to 578.
Gene expression profiling of Chuanyou6203 and its parents
We performed RNA sequencing and obtained an average of 67,645,949 high-quality clean reads for each of the 45 samples after the removal of rRNA and low-quality reads (Supplementary Table S3).The mean ratio of the high-quality reads mapped to the reference genome was 82.80%, ranging from 77.59 to 86.61%, and an average of 26,344.67 mapped unique genes per sample was identified simultaneously. The significantly positive correlation (R2 = 0.625, R2 = 0.631, P < 0.0001) between the RNA-seq and qRT-PCR using the value of Log2FCCY6203/CH3203 or C106B of 18 randomly selected genes indicated the accuracy and reliability of the RNA-seq results (Fig. 4 and Supplementary Table S7). Then, a total of 39,503 differentially expressed genes (DEGs) were detected among all samples (Supplementary Table S4). Based on the hierarchical clustering of these samples using the FPKM value (Supplementary Figs. S1 and S2), two discrete samples E6203-2 and Y6203-1 were excluded.
There were 1164 (42 + 591 + 95 + 436), 642 (23 + 530 + 19 + 70), 4515 (497 + 3264 + 203 + 551), 2180 (246 + 586 + 355 + 993), and 2615 (135 + 1103 + 161 + 1216) DEGs between CY6203 and C106B, and similarly 7617, 915, 1738, 7742 and 8049 between CY6203 and CH3203 in panicles and flag leaves at the booting, flowering, and middle filling stages, respectively (Fig. 5A). Among these DEGs between CY6203 and C106B, there were 829 (27 + 475 + 73 + 228) (829/1164 × 100% = 71.22%), 459 (71.5%), 1,658 (36.72%), 1,488 (68.26%), and 1,453 (55.56%) upregulated DEGs, respectively. Among these DEGs between CY6203 and CH3203, there were similarly 4,582 (60.15%), 673 (73.55%), 868 (49.94%), 5,662 (73.13%), and 5927 (73.64%) upregulated DEGs at the booting, flowering, and middle filling stages, respectively. At the booting stage, the number of DEGs between CY6203 and CH3203 was greater than that of DEGs between CY6203 and CY106B, but the ratio of upregulated common DEGs between CY6203 and C106B was almost the same (58.26% vs. 59.17% and 72% vs. 73.72%) as that of upregulated common DEGs between CY6203 and CH3203. At the middle filling stage, the phenomenon was opposite and the ratio of upregulated common DEGs between CY6203 and CH3203 was higher than (60.8% vs. 42.11% and 67.85% vs. 47.04%) that of upregulated common DEGs between CY6203 and C106B. The number of common DEGs between C106B and CH3203 in panicles and flag leaves at the booting, flowering, and middle filling stages was greater than that of common DEGs between CY6203 and its parents (Fig. 5B). In addition, a total of 436, 70, 551, 993, and 1,216 common DEGs between CY6203 and both of its parents were identified at the same stage in panicles and flag leaves. Of the common DEGs, the numbers of upregulated DEGs between CY6203 and CH3203 were all greater than those of upregulated DEGs between CY6203 and C106B in panicles and flag leaves at the booting, flowering, and middle filling stages.
The frequency of mRNA editing ratios of each sample of CY6203 followed normal distribution (Fig. 6 and Supplementary Table S5). The average frequency of mRNA editing loci of 15 samples of C106B was 8.13 per Mb, and that of CH3203 was 7.94 per Mb, and that of CY6203 was 63.05 per Mb. The results showed that a great quantity of mRNA editing phenomena occurred in CY6203 due to its heterozygosity, and most mRNA editing ratios were approximately equal to 0.5. Though CH3203 and C106B were pure lines, a small amount of mRNA editing phenomena occurred. For example, the mRNA editing ratio of the Wx gene was an average of approximately 0.5119 in an SNP site (T1768724 to C1768724) of exon 9 in CY6203, while it was an average of approximately 0.8144 in C106B.
Identification of coexpressed gene network modules
We remove no significant DEGs between F1 (CY6203) and its parents (C106B and CH3203) in panicles and flag leaves at booting, flowering, and middle filling stages. The 17,105 DEGs were used to construct coexpression modules by WGCNA14. A total of 15,934 DEGs were classified into 19 distinct modules (R2 = 0.85, β = 18) (Supplementary Fig. S3), which were generated by a hierarchical clustering tree. Among these DEGs, discrete DEGs were classified into the grey module. The number of DEGs in the 18 functional modules ranged from 39 (light green module) to 2996 (turquoise module). When the Euclidean distance was 1.0, 19 modules were divided into three groups (Supplementary Fig. S3D). Grey 60 (88 genes), yellow (1574 genes), green-yellow (298 genes), and midnight blue (186 genes) modules were merged into one group, and tan (292 genes), red (1198 genes), black (706 genes), pink (673 genes), blue (2905 genes), and purple (337 genes) modules were combined into another group, and the remaining modules were classified into the other group.
The expression heatmap of each module and expression histogram of corresponding eigengenes showed that the eigengenes were regularly expressed at different levels at the different stages or materials (Supplementary Fig. S4). For example, blue and turquoise modules showed upregulation at the booting and heading stages in panicles. The brown module showed upregulation in CY6203 compared with CH3203, except for the middle filling stage. The green-yellow module showed that eigengenes were higher in CY6203 than C106B. Among the 18 modules, few eigengenes showed higher expression in CY6203 than its parents at the three stages.
GO and KEGG pathway enrichment in coexpression network modules
A total of 219 DEGs of 9 modules were significantly enriched in 20 KEGG pathways (Fig. 7A and Supplementary Tables S8, S9). Out of these DEGs, KEGG pathways of photosynthesis-antenna proteins, photosynthesis, nitrogen metabolism, and carbon fixation in photosynthetic organisms were significantly enriched in the green-yellow module (Fig. 7B, Supplementary Table S10), suggesting that the green-yellow module should be associated with yield heterosis. The GO terms of the green-yellow module were significantly enriched in GO:0015979 (photosynthesis; FDR = 3.6E−07), GO:0019684 (photosynthesis, light reaction; FDR = 5.8E−03), and GO:0009765 (photosynthesis, light-harvesting; FDR = 1.4E−04) (Supplementary Fig. S5). The DEG expression levels of CY6203 were inhibited first and then increased in pathways of photosynthesis-antenna proteins and photosynthesis compared with its parents (Supplementary Figs. S6 and S7). In contrast, the DEG expression levels of CY6203 in pathways of nitrogen metabolism and carbon fixation were continuously higher than those of its parents in photosynthetic organisms of all three stages, while their increase gradually decreased, especially in charge of carbonic an hydrase [EC:4.2.1.1] (LOC_Os01g45274, LOC_Os04g33660, and LOC_Os08g36680) and ribulose-bisphosphate carboxylase small chain [EC:4.1.1.39] (LOC_Os12g17600, LOC_Os12g19381, and LOC_Os12g19470) (Supplementary Fig. S8).
Interestingly, the nitrogen metabolism pathway was also identified in the yellow module (FDR = 0.02), including 9 enriched DEGs. Of these DEGs, 5 known genes, PSR115, OsNRT2.316,17, OsNRT2.418, OsGS1;219–21 and OsGS222,23, carried variants between CH3203 and C106B (Supplementary Table S9). The mRNA editing ratios of these genes were an average of approximately 0.5 in CY6203 (Supplementary Table S5). Their expression levels were higher in CY6203 than C106B and were approximately equivalent to that of CH3203 at the middle filling stage (Supplementary Table S6).
Comparison of cloned gene expression between Chuanyou6203 and its parents related to yield, quality and KEGG pathway
According to the annotation of the cloned genes (Fig. 3), we listed the Log2FCCY6203/CH3203 or C106B values of cloned genes related to the detected KEGG pathways, yield, and quality in Supplementary Table S11. Of these genes, the expression levels of some genes of CY6203 were higher than that of C106B and lower than that of CH3203 at three stages, such as HGW24 (Fig. 8), while some genes exhibited the opposite phenotype, such as Gn1a25. Some were synchronously higher than that of its parents at the booting stage, and then synchronously lower than that of its parents at the middle filling stage, such as Gnp426. The expression level of most genes related to chlorophyll was higher in CY6203 than that of C106B, but lower than that of CH3203 at the middle filling stage, for example, Fd1, LHCB, and CAB2R. While the genes related to grain quality were opposite, such as Wx27–29 and GL730. The expression level of ALK31,32 was higher in CY6203 than in its parents at the middle filling stage (Fig. 8), which might be the main reason why CY6203 showed over-parent heterosis in ASV (Fig. 2).
Discussion
The use of crop heterosis has notably helped to ensure world food security. For over a hundred years, researchers have been exploring the molecular mechanism of heterosis, starting from the previously proposed hypotheses of dominance, over-dominance, and epistasis to later transcriptome analysis at gene expression levels1. It is generally believed that the DEGs between CY6203 and its parents can control the heterosis of the hybrid2,3,6–8. Wei et al. identified 3,926 DEGs between LYP9 and its parents and between the parents, 93-11, and PA64s3. Zhang et al. indicated that polymorphic promoter cis-regulatory elements were closely correlated with DEGs with an additive, as well as over- and under-dominance, gene action33. Huang et al. believed that numerous superior alleles contributed to heterosis, but only a small number of loci with strong overdominance affected heterosis in hybrids; for example, Ehd1, Hd1, Ghd7, and OsSoc1 strongly affected heading date, and Gn1, Ghd7, and OsSPL14 affected grain number, and qSW5 and Wx affected chalky grain rate4. Chen et al. also believed that the overdominant effect was a major contributor to the grain number heterosis of WFYT0255.
Therefore, heterozygosity is generally a prerequisite for gene expression and phenotypic variation in hybrids1. In this study, CY6203 with excellent quality and high yield showed visibly overparent heterosis in SPW/D (OMPH:8.85%; OFPH:14.42%), ANSPP (OMPH:27.77%; OFPH:18.64%), AC (OMPH:44.63%; OFPH:6.71%) and ASV (OMPH:6.15%; OFPH:15%). A total of 7473 heterozygous genes in CY6203 were identified due to variant sites between CH3203 and C106B (Supplementary Table S5). In the results of mRNA editing (Supplementary Table S5), transcripts of 14,979 genes from both parents were simultaneously expressed in CY6203, and approximately 40.61% of mRNA editing ratios were between 0.4 and 0.6 (Fig. 5). Interestingly, 1.68% of mRNA editing events (editing ratio ≥ 0.8) in CY6203 favoured one of its parents at three stages or a particular stage, suggesting that the hypothetical heterosis mechanism of CY6203 might involve dominance or epistasis. For example, the average 0.8376 of GL730 type belonged to CH3203 type at three stages, and 0.8576 of the Edh2 type belonged to CH3203 type at only the booting stage. We also noticed that the DEG number of CY3203 vs. CY6203 (7617 DEGs, 4582 upregulated) substantially exceeded that of C106B vs. CY6203 (1164 DEGs, 829 upregulated) in panicles at the booting stage, but at the middle filling stage, the number of DEGs was quite the opposite (4515 and 1658 vs. 1738 and 868). These results indicated that CH3203 and CY106B together contributed to the high yield and good quality of CY6203.
To explain the genetic mechanism of high yield and quality of CY6203, we summarized the following two points. First, the DEGs in the green-yellow module mainly contributed to the increase in the source of CY6203 due to an increase in photosynthetic efficiency and nitrogen utilization efficiency, and a small number of DEGs related to the grain number added spikelet number per panicle amplified its sink (Fig. 9). OMPHs for SPW/D and ANSPP were 8.85% and 27.77%, respectively. Similarly, OFPHs for SPW/D and ANSPP were 14.42% and 18.64%, respectively (Figs. 2, 9). Epigenetic regulation of circadian-mediated changes in chlorophyll biosynthesis and starch metabolism is a direct reason for growth vigour in plant hybrids1. Further studies showed that an appropriate reduction in the size of the chlorophyll antenna or chlorophyll content can improve the photosynthetic CO2 uptake rates and nitrogen utilization efficiency34–37. Of the 298 DEGs in the green-yellow module, 31 DEGs were significantly enriched in pathways of photosynthesis-antenna proteins, photosynthesis, nitrogen metabolism, carbon fixation in photosynthetic organisms, glyoxylate, and dicarboxylate metabolism and linoleic acid metabolism (Supplementary Table S8). In the biological process, photosynthesis negatively regulated light-harvesting (Supplementary Fig. S5). The pathway of nitrogen metabolism was also identified from the yellow module. Of the five cloned genes, PSR115 encoded a nitrite reductase with nitric acid assimilation, OsNRT2.316,17 and OsNRT2.418 were inferred to control regulation of rice high-affinity nitrate transport, and OsGS1;219–21 and OsGS222,23 showed coexpression affecting nitrogen metabolism and plant growth. In summary, we speculate that the photosynthetic CO2 uptake rates and nitrogen utilization efficiency are improved in CY6203 due to its sustained better chlorophyll accumulation and suitable size of the photosynthesis antenna from the booting to the middle filling stage (Fig. 9). Therefore, these genes were coexpressed, which increased nitrogen utilization efficiency and carbon dioxide absorption from the booting to the middle filling stage in CY6203. Thus, the coordinated expression of DEGs in the green-yellow module can improve the photosynthetic efficiency and nitrogen utilization efficiency of CY6203.
Among 1079 cloned genes with different alleles between CH3203 and C106B, 15 genes were related to yield trait-based previous reports at https://www.ricedata.cn/ (Fig. 3). Only SPP1 was inferred to control the number of spikelets per panicle38. The expression level of CY6203 was higher than that of its parents at the booting stage (Supplementary Table S11). However, the gene (new name: OsJAR2) was determined to be involved in the wound- and pathogen-induced jasmonic acid signalling39. Although two well-known genes controlling the grain number per panicle, Gn1a25 and Gnp426,40, without variants between CH3203 and C106B, showed middle-parent and over-parent expression levels in CY6203 at the booting stage, respectively (Supplementary Table S11). Ashikari et al.25 reported that the decrease in the activity of Gn1a increased the spikelet number. Similarly, increasing the expression level of Gnp4 also increased the spikelet number26. Therefore, we infer that the coregulation of Gn1a and Gnp4 jointly increases the average number of spikelets per panicle, resulting in the amplified sink of CY6203.
Second, the balanced expression of the major high-quality alleles of C106B and CH3203 in CY6203 contributed to the outstanding quality of CY6203. At the middle filling stage, the number of DEGs between CY6203 and C106B was greater than that of DEGs between CY6203 and CH3203. However among coexpressed DEGs, the ratio of upregulated DEGs between CY6203 and CH3203 was greater than that of DEGs between CY6203 and C106B (Fig. 5A), which might indicate the balanced expression of some related quality genes of C106B and CH3203 in CY6203. Complex genetic networks that affect AC, GC, and gelatinization temperature (GT) determine the cooking and eating quality (ECQs) of rice41 (Fig. 10). Among these genes, Wx and SSII-3 are the two essential genes that affect AC, GC, and GT. Different alleles of Wx lead to regional variation in rice AC and have affected consumer preferences27–29,41–43. The substitution of bases in the coding region of the ALK gene may change the crystal structure of amylopectin, which may lead to a change in ASV in rice31. By changing the expression level of OsBEIIb44,45 in rice endosperm, GC can be manipulated due to the change in starch structure. AC is significantly negatively correlated with GC and ASV/GT, and GC is significantly positively correlated with GT41. However, in the quality traits of CY6203, the increase in the ASV value did not lead to an increase in the GC value, and the increase in the AC value did not lead to a decrease in the ASV value (Fig. 2 and Table 1).
Genome comparison results showed that there were SNP/InDel variants in the four major quality-related genes between CH3203 and C106B, including SSIIB, Wx46,47, OsSSI48, and OsSSIIIa49. The Wx alleles of CH3203 and C106B both belonged to the Wxb type of japonica rice, except for a nucleotide deletion in the intronic sequence of the Wx gene between CH3203 and C106B (Supplementary Table S2). Compared with CH3203, there were three single-nucleotide deletions, one single-nucleotide insertion, and one SNP in the OsSSI sequence of C106B. There were six InDels and 12 SNPs in the OsSSIIIa sequence between CH3203 and C106B. We noticed that the average mRNA editing ratios of Wx, OsSSI, and OsSSIIIa were 0.4412, 0.4679 and 0.5255 in CY6203 at the booting, flowering, and middle filling stages, respectively, which indicated that these genes expressed additive effects. However, the expression levels of Wx and OsSSIIIa in CY6203 were upregulated compared with CH3203, and was downregulated compared with C106B at the middle filling stage, which may lead to increased AC content in CY6203. The expression level of OsSSI in CY6203 was downregulated, and ALK was upregulated compared with its parents at the middle filling stage (Fig. 10), which may be the reason for the increase in the ASV value in CY6203. In addition, the decrease in OsSSI gene expression resulting in the decrease insoluble starch content may be an important reason for the decrease in GC value in CY6023. Therefore, we believed that superior starch synthesis alleles from C106B and CH3203 balanced expression contributed to the outstanding quality of CY6203.
Conclusions
In summary, we systematically analysed the transcriptome and genome profiles from hybrid rice, Chuanyou6203 (CY6203), and its parents by RNA-sequencing and resequencing. A total of 7,473 different genes were found between CH3203 and C106B. A total of 436, 70, 551, 993, and 1,216 common DEGs between CY6203 and both of its parents were identified at the same stage in panicles and flag leaves. Of the common DEGs, the numbers of upregulated DEGs between CY6203 and CH3203 were all greater than those of upregulated DEGs between CY6203 and C106B in panicles and flag leaves at the booting, flowering, and middle filling stages. In addition, approximately 40.61% of mRNA editing ratios were between 0.4 and 0.6, and 1.68% of mRNA editing events (editing ratio ≥ 0.8) in CY6203 favoured one of its parents at three stages or a particular stage, suggesting that the hypothetical heterosis mechanism of CY6203 might involve dominance or epistasis. The results of WGCNA, KEGG pathway and GO enrichment analysis showed that the green-yellow module was inferred to be associated with yield heterosis including KEGG pathways of photosynthesis-antenna proteins, photosynthesis, nitrogen metabolism, and carbon fixation in photosynthetic organisms. The balanced expression of the major high-quality alleles of C106B and CH3203 in CY6203 contributed to the high quality of CY6203. Our study offers a new data set that may help to elucidate the molecular mechanism governing yield heterosis and the formation of a high quality hybrid rice variety.
Materials and methods
Plant materials
Chuanyou6203 (CY6203) is an outstanding hybrid rice variety with excellent quality and high yield, stained from a cross between the Wild Abortive Cytoplasmic Male Sterility (WA-CMS) line Chuan106A (C106A) and the restorer line Chenghui3203 (CH3203). Chuan106B is the isonuclear fertile line of Chuan106A. CY6203 was released in 2011 and 2014 by the Variety Certification Committee of Sichuan province and the Country Variety Certification Committee in the rice zone of the upper Yangtze River, respectively.
Phenotyping and sampling
CY6203, CH3203, and Chuan106B (C106B) (the isonuclear fertile line of Chuan106A) were grown in a rice paddy at the experimental field of the Sichuan Academy of Agricultural Sciences. To ensure sampling at the same time, CY6203, CH3203, and C106B were sown on April 5, 10, and 25, 2019, respectively. The 30-day seedlings were transplanted with a spacing of 17 cm × 25 cm, with three replications. Each material was planted in five rows with 10 plants per row. Field management followed normal procedures for rice. These three lines were selected in this study for the measurement of five agronomic traits. The five agronomic traits were single plant weight per day (SPW/D, in grams/plant/day) = single plant weight/days of maturity, plant height (PH, in centimetres), panicle length (PL, in centimetres), the average number of spikelets per panicle (ANSPP), and 1000-gain weight (TGW, in grams). For each material, 500 g grains were sent to the Rice Product Quality Inspection & Supervision Testing Center of Ministry of Agriculture and Rural Affairs to measure quality traits with three replications. They were head rice ratio (HRR, the ratio of head milled rice), grain length/grain width (L/W), amylose content (AC), alkali spreading value (ASV), and gel consistency (GC). The means of replications for each trait were used to calculate the over-male-parent heterosis (OMPH) and over-female-parent heterosis (OFPH) of the F1 hybrid CY6203. The formulas were as follows:
The was the mean of replications, for each trait for CY6203. The was the mean of replications for each trait, for CH3203. The was the mean of replications, for each trait for C106B.
The first samples were collected seven days before heading, including flag leaves and panicles. The flag leaf samples were named Y6203, Y3203, and Y106B, and the panicle samples were named S6203, S3203, and S106B, respectively. The second samples, including flag leaves and panicles, were taken 3 days after heading. The flag leaf samples were named H6203, H3203, and H106B, and the panicle samples were named E6203, E3203, and E106B, respectively. The third panicle sampling was conducted 15 days after heading. The panicle samples were named W6203, W3203, and W106B. Each sampling time was 9:00–10:00 with three replications. All samples were stored at -80℃ for genomic resequencing and RNA-Seq analysis.
DNA, total RNA isolation and sequencing
The total DNA of the two parents was extracted with the Plant Genome DNA Extraction kit. The total RNA of each sample tissue was extracted with the RNA prep Pure Plant Kit with RNase-Free DNase. The quality and integrity of DNA and RNA were tested using the Agilent Bioanalyzer 2100 system (Agilent, Santa Clara, CA, USA). All DNA and RNA samples were sent to Gene Denovo Biotechnology Co., Ltd. (Guangzhou, China) for sequencing. The DNA/cDNA libraries were sequenced using Illumina HiSeq 4000 by Gene Denovo Biotechnology Co., Ltd.
Identification of variations between CH3203 and C106B
Based on the Nipponbare reference genome MSU 7, the SNP/InDel variants between CH3203 and C106B were identified and filtered (MAPQ ≥ 25) by SAMtools50. ANNOVAR51 was used for the SNP/InDel annotation. Then, by comparing the variant types of CH3203 and C106B at the same position, different variant type sites were selected, and the number of different SNP/InDel per 200 kb was calculated. RIdeogram52 was used to visualize and localize the density distribution of different SNP/InDel densities and cloned genes related to yield, quality, and some KEGG pathways in the whole genome.
Read mapping and analysis
The Nipponbare reference genome MSU7 and its annotation file were downloaded from the Rice Genome Annotation Project (https://rice.plantbiology.msu.edu/)53. First, we used FastQC software to evaluate the quality of the raw reads. To obtain high-quality clean reads, the raw reads were further filtered according to the following rules: 1) Delete the reads containing the adapters. 2) Remove the reads containing more than 10% of unknown nucleotides (N). 3) Remove low-quality reads containing more than 50% of low-quality (Q-value ≤ 20) bases. Second, the RNA-seq clean reads were aligned to the ribosomal RNA (rRNA) database using Bowtie254, and then the rRNA-mapped reads were removed. Third, the remaining reads of CH3203 and C106B were mapped to the Nipponbare reference genome by Bowtie254. The alignment parameters were as follows: (1) maximum read mismatch was 2; (2) the error of distance between mate-pair reads was ± 80 bp; and (3) the distance between mate-pair reads was 50 bp. Fourth, the transcripts were detected by the Reference Annotation Based Transcript method using Cufflinks software55. Fifth, gene abundances were quantified by RSEM software56. The gene expression level was normalized using the fragments per kilobase of transcript per million mapped reads (FPKM) method. The edgeR package57 was applied to identify differentially expressed genes (DEGs) among all samples. Outlier samples were examined using hierarchical clustering and removed.
To characterize common DEGs between CY6203 and its parents, Venn diagrams were drawn by https://bioinformatics.psb.ugent.be/webtools/Venn/.
Finally, RNA editing was also analysed. RNA editing referred to variants on the mRNA level through InDel or base substitutions58,59. The reliable RNA editing sites from SNP sites were screened according to the following steps60,61: (1) SNPcalling was carried out by SAMtools (https://www.htslib.org/doc/samtools.html)50; (2) correction of the SNPs around the InDel region; (3) selection of non-overlapping SNPs in UTR and exon region; (4) selection of SNPs with reference reads ≥ 2 and variant reads ≥ 3; (5) selection of SNPs with the mRNA editing ratios between 0.1000 and 0.9000; and (6) counting the frequency of SNPs per interval ratio of 0.1000 in each material, and calculating the mean of mRNA editing loci.
Raw sequence data reported in this paper have been deposited (PRJCA003209) in the Genome Sequence Archive62 in the BIG Data Center63, Chinese Academy of Sciences under accession code ‘CRA003262’ for C106B and CH3203 genome sequencing data and CY6203 transciptome sequencing data that are publicly accessible at https://bigd.big.ac.cn/gsa.
Quantitative real-time PCR (qRT-PCR) analysis
Eighteen DEGs detected by RNA-seq were selected for validation by qRT-PCR. Total RNA was extracted from forty-five samples using an RNA Prep Pure Plant Kit containing RNase-Free DNase. Two micrograms of total RNA was reverse transcribed using the HiScript First Strand cDNA Synthesis Kit (Vazymebiotech Co., Ltd., Nanjing, China). SYBR-based qRT-PCR analyses were performed on ABI Step One Plus according to the protocol (ChamQ SYBR qPCR Master Mix) provided by the manufacturer (Vazymebiotech Co., Ltd., Nanjing, China). All qRT-PCR reactions were biologically repeated 3 times, and the results were analysed based on the 2−△△Ct method using the system’s relative quantification software (version 1.5). The cycle threshold values for each gene were normalized against the expression level of the rice Actin gene with the primer sequences 5′-CCACTATGTTCCCTGGCATT-3′ and 5′-GTACTCAGCCTTGGCAATCC-3′. All primers used for qRT-PCR in this study are listed in Supplementary Table S12.
Weighted gene coexpression network analysis
To identify DEGs between CY6203 and its parents, we identified genes with a Log2fold change > 1 or < − 1, and a false discovery rate (FDR) < 0.05 in comparison as significant DEGs64. The weighted gene coexpression network analysis (WGCNA) package14 was used to infer the DEG modules of the co-expression network. DEGs were filtered before coexpression network inference to eliminate genes not expressed in more than 10% of samples or that showed variance = 0 across samples. The automatic one-step network construction and module detection method with default settings were carried out, including an unsigned type of topological overlap matrix (TOM), a power β of 18 (R Squared Cut = 0.85), a minimum module size of 30, and a branch merge cut height of 0.25. The module eigengene value was calculated and used to test the association of each module with each of 43 samples. Heatmaps for the gene expression and characteristic bar graphs of each module were drawn. Moreover, the eigengene adjacency cluster and corresponding heatmap were visualized based on eigengene-based intramodular connectivity measures (kME).
KEGG pathway and GO enrichment analysis of WGCNA modules
To further clarify the biological functions of each module between F1 (CY6203) and its parents (C106B and CH3203), the genes of each WGCNA module were previously subjected to KEGG pathway enrichment analysis by ExPath 2.0 (https://expath.itps.ncku.edu.tw/)65 and GO enrichment analysis by agriGO v2 (https://systemsbiology.cau.edu.cn/agriGOv2/)66. The significance level was FDR ≤ 0.05. The enrichment results were visualized by ggPlot267. The expressed pattern of DEGs involved in each enriched pathway of the green-yellow module was visualized using the Log2(fold change) value by Pathview Web (https://pathview.uncc.edu/)68,69.
Fold change comparison of yield, quality, and KEGG related genes
Yield-, quality-, and KEGG pathway-related genes were selected based on previous reports at https://www.ricedata.cn/ and their annotated information in the funRiceGenes database (https://funricegenes.github.io/). We calculated Log2FCCY6203/CH3203 and Log2FCCY6203/C106B of these genes for each sample tissue of CY6203. We made a definition for a calculating formula:
where the meant is the mean FPKM value of a gene in the CY6203 transcript. The is the mean FPKM value of a gene in the CH3203 or C106B transcript.
Supplementary information
Acknowledgements
We thank Guangzhou Genedenovo Biotechnology Co., Ltd for assisting in sequencing.
Abbreviations
- AC
Amylose content
- ANSPP
Average number of spikelets per panicle
- ASV
Alkali spreading value
- C106B
Chuan 106B
- CH3203
Chuanhui3203
- CY6203
Chuanyou6203
- DEGs
Different expression genes
- ECQ
The cooking and eating quality
- FC
Fold change
- FDR
False discovery rate
- FPKMs
Fragments per kb for a million reads
- GC
Gel consistency
- GO
Gene ontology
- HRR
Head rice ratio
- KEGG
Kyoto encyclopedia of genes and genomes
- kME
Eigengene-based intramodular connectivity measures
- L/W
Grain length/grain width
- MSU 7
MSU RICEGENOMEANNOTATIONPROJECT RELEASE 7
- OFPH
Over-female-parent heterosis
- OMPH
Over-male-parent heterosis
- PH
Plant height
- PL
Panicle length
- qRT-PCR
Quantitative real-time polymerase chain reaction
- TGW
1000-gains weight
- SPW/D
Single plant weight per day
- WA-CMS
Wild abortive cytoplasmic male sterility
- WGCNA
The weighted gene coexpression network analysis
Author contributions
J.S.R. performed the project design, data analysis and wrote the manuscript; F.Z. performed data analysis and wrote the manuscript; F.Y.G., L.H.Z., X.J.L., X.Q.Z., J.Q.L. X.W.S., L.P.L., and M.L.D. performed the field management, DNA and RNA extraction, and phenotypic analysis; J.L.X. directed the project; G.J.R. directed the project and wrote the manuscript.
Funding
This research was supported by Grant (2017YFD0100200) from The National Key Research and Development Program of China, Grant (CARS-01-08) of National System of Modern Agriculture Industrial Technology of China, and Grant (2016NYZ0028) Sichuan Rice Breeding Project for Tackling Key Problems of China.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Juansheng Ren and Fan Zhang.
Supplementary information
is available for this paper at 10.1038/s41598-020-76762-3.
References
- 1.Chen ZJ. Molecular mechanisms of polyploidy and hybrid vigor. Trends Plant Sci. 2010;15:57–71. doi: 10.1016/j.tplants.2009.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Bao J, et al. Serial analysis of gene expression study of a hybrid rice strain (LYP9) and its parental cultivars. Plant Physiol. 2005;138:1216–1231. doi: 10.1104/pp.105.060988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wei G, et al. A transcriptomic analysis of superhybrid rice LYP9 and its parents. Proc. Natl. Acad. Sci. 2009;106:7695–7701. doi: 10.1073/pnas.0902340106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Huang X, et al. Genomic analysis of hybrid rice varieties reveals numerous superior alleles that contribute to heterosis. Nat. Commun. 2015;6:6258–6258. doi: 10.1038/ncomms7258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Chen L, et al. Genetic analysis for the grain number heterosis of a super-hybrid rice WFYT025 combination using RNA-Seq. Rice. 2018;11:37. doi: 10.1186/s12284-018-0229-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Guo M, et al. Genome-wide transcript analysis of maize hybrids: allelic additive gene expression and yield heterosis. Theor. Appl. Genet. 2006;113:831–845. doi: 10.1007/s00122-006-0335-x. [DOI] [PubMed] [Google Scholar]
- 7.Li B, Zhang D-F, Jia G-Q, Dai J-R, Wang S-C. Genome-wide comparisons of gene expression for yield heterosis in maize. Plant Mol. Biol. Rep. 2009;27:162–176. doi: 10.1007/s11105-008-0068-x. [DOI] [Google Scholar]
- 8.Wang H, et al. Heterosis and differential gene expression in hybrids and parents in Bombyx mori by digital gene expression profiling. Sci. Rep. 2015;5:8750. doi: 10.1038/srep08750. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Li D, et al. Gene expression analysis and SNP/InDel discovery to investigate yield heterosis of two rubber tree F1 hybrids. Sci. Rep. 2016;6:24984. doi: 10.1038/srep24984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Jin J, et al. Transcriptome and functional analysis reveals hybrid vigor for oil biosynthesis in oil palm. Sci. Rep. 2017;7:439. doi: 10.1038/s41598-017-00438-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ni D, et al. Improving cooking and eating quality of Xieyou57, an elite indica hybrid rice, by marker-assisted selection of the Wx locus. Euphytica. 2011;179:355–362. doi: 10.1007/s10681-011-0377-2. [DOI] [Google Scholar]
- 12.Tan Y, et al. The three important traits for cooking and eating quality of rice grains are controlled by a single locus in an elite rice hybrid, Shanyou 63. Theor. Appl. Genet. 1999;99:642–648. doi: 10.1007/s001220051279. [DOI] [PubMed] [Google Scholar]
- 13.Zhou P, Tan Y, He Y, Xu C, Zhang Q. Simultaneous improvement for four quality traits of Zhenshan 97, an elite parent of hybrid rice, by molecular marker-assisted selection. Theor. Appl. Genet. 2003;106:326–331. doi: 10.1007/s00122-002-1023-0. [DOI] [PubMed] [Google Scholar]
- 14.Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinform. 2008;9:559. doi: 10.1186/1471-2105-9-559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Nishimura A, et al. Isolation of a rice regeneration quantitative trait loci gene and its application to transformation systems. Proc. Natl. Acad. Sci. 2005;102:11940–11944. doi: 10.1073/pnas.0504220102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Fan X, et al. Overexpression of a pH-sensitive nitrate transporter in rice increases crop yields. Proc. Natl. Acad. Sci. 2016;113:7118–7123. doi: 10.1073/pnas.1525184113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Yan M, et al. Rice OsNAR2. 1 interacts with OsNRT2. 1, OsNRT2. 2 and OsNRT2. 3a nitrate transporters to provide uptake over high and low concentration ranges. Plant Cell Environ. 2011;34:1360–1372. doi: 10.1111/j.1365-3040.2011.02335.x. [DOI] [PubMed] [Google Scholar]
- 18.Feng H, Yan M, Fan X, Li B, Shen Q, Miller AJ, Xu G. Spatial expression and regulation of rice high-affinity nitrate transporters by nitrogen and carbon status. J. Exp. Bot. 2011;62:2319–2332. doi: 10.1093/jxb/erq403. [DOI] [PubMed] [Google Scholar]
- 19.Tabuchi M, et al. Severe reduction in growth rate and grain filling of rice mutants lacking OsGS1; 1, a cytosolic glutamine synthetase1; 1. Plant J. 2005;42:641–651. doi: 10.1111/j.1365-313X.2005.02406.x. [DOI] [PubMed] [Google Scholar]
- 20.El-Kereamy A, et al. The rice R2R3-MYB transcription factor OsMYB55 is involved in the tolerance to high temperature and modulates amino acid metabolism. PLoS ONE. 2012;7:e52030. doi: 10.1371/journal.pone.0052030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Bao A, et al. Accumulated expression level of cytosolic glutamine synthetase 1 gene (OsGS1; 1 or OsGS1; 2) alter plant development and the carbon-nitrogen metabolic status in rice. PLoS ONE. 2014;9:e95581. doi: 10.1371/journal.pone.0095581. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cai H, Xiao J, Zhang Q, Lian X. Co-suppressed glutamine synthetase2 gene modifies nitrogen metabolism and plant growth in rice. Chin. Sci. Bull. 2010;55:823–833. doi: 10.1007/s11434-010-0075-9. [DOI] [Google Scholar]
- 23.Singh KK, Ghosh S. Regulation of glutamine synthetase isoforms in two differentially drought-tolerant rice (Oryza sativa L.) cultivars under water deficit conditions. Plant Cell Rep. 2013;32:183–193. doi: 10.1007/s00299-012-1353-6. [DOI] [PubMed] [Google Scholar]
- 24.Li J, et al. The rice HGW gene encodes a ubiquitin-associated (UBA) domain protein that regulates heading date and grain weight. PLoS ONE. 2012;7:e34231. doi: 10.1371/journal.pone.0034231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Ashikari M, et al. Cytokinin oxidase regulates rice grain production. Science. 2005;309:741–745. doi: 10.1126/science.1113373. [DOI] [PubMed] [Google Scholar]
- 26.Tabuchi H, et al. LAX PANICLE2 of rice encodes a novel nuclear protein and regulates the formation of axillary meristems. Plant Cell. 2011;23:3276–3287. doi: 10.1105/tpc.111.088765. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Rui-zhen Z, et al. Identification of multiple alleles at the Wx locus and development of single segment substitution lines for the alleles in rice. Rice Sci. 2006;13:9–14. [Google Scholar]
- 28.Hirano HY, Eiguchi M, Sano Y. A single base change altered the regulation of the Waxy gene at the posttranscriptional level during the domestication of rice. Mol. Biol. Evol. 1998;15:978–987. doi: 10.1093/oxfordjournals.molbev.a026013. [DOI] [PubMed] [Google Scholar]
- 29.Hirano HY, Sano Y. Comparison of Waxy gene regulation in the endosperm and pollen in Oryza sativa L. Genes Genet. Syst. 2000;75:245–249. doi: 10.1266/ggs.75.245. [DOI] [PubMed] [Google Scholar]
- 30.Wang S, et al. The OsSPL16—GW7 regulatory module determines grain shape and simultaneously improves rice yield and grain quality. Nat. Genet. 2015;47:949–954. doi: 10.1038/ng.3352. [DOI] [PubMed] [Google Scholar]
- 31.Gao Z, et al. Map-based cloning of the ALK gene, which controls the gelatinization temperature of rice. Sci. China Ser. C Life Sci. 2003;46:661–668. doi: 10.1360/03yc0099. [DOI] [PubMed] [Google Scholar]
- 32.Zhang G, et al. Double repression of soluble starch synthase genes SSIIa and SSIIIa in rice (Oryza sativa L.) uncovers interactive effects on the physicochemical properties of starch. Genome. 2011;54:448–459. doi: 10.1139/g11-010. [DOI] [PubMed] [Google Scholar]
- 33.Zhang H-Y, et al. A genome-wide transcription analysis reveals a close correlation of promoter INDEL polymorphism and heterotic gene expression in rice hybrids. Mol. Plant. 2008;1:720–731. doi: 10.1093/mp/ssn022. [DOI] [PubMed] [Google Scholar]
- 34.Gu J, et al. Rice (Oryza sativa L.) with reduced chlorophyll content exhibit higher photosynthetic rate and efficiency, improved canopy light distribution, and greater yields than normally pigmented plants. Field Crops Res. 2017;200:58–70. doi: 10.1016/j.fcr.2016.10.008. [DOI] [Google Scholar]
- 35.Song Q, Wang Y, Qu M, Ort DR, Zhu XG. The impact of modifying photosystem antenna size on canopy photosynthetic efficiency—development of a new canopy photosynthesis model scaling from metabolism to canopy level processes. Plant Cell Environ. 2017;40:2946–2957. doi: 10.1111/pce.13041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Zhu X-G, Long SP, Ort DR. Improving photosynthetic efficiency for greater yield. Annu. Rev. Plant Biol. 2010;61:235–261. doi: 10.1146/annurev-arplant-042809-112206. [DOI] [PubMed] [Google Scholar]
- 37.Slattery RA, VanLoocke A, Bernacchi CJ, Zhu X-G, Ort DR. Photosynthesis, light use efficiency, and yield of reduced-chlorophyll soybean mutants in field conditions. Front. Plant Sci. 2017;8:549. doi: 10.3389/fpls.2017.00549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Liu T, Mao D, Zhang S, Xu C, Xing Y. Fine mapping SPP1, a QTL controlling the number of spikelets per panicle, to a BAC clone in rice (Oryza sativa) Theor. Appl. Genet. 2009;118:1509–1517. doi: 10.1007/s00122-009-0999-0. [DOI] [PubMed] [Google Scholar]
- 39.Wakuta S, et al. OsJAR1 and OsJAR2 are jasmonyl-l-isoleucine synthases involved in wound- and pathogen-induced jasmonic acid signalling. Biochem. Biophys. Res. Commun. 2011;409:634–639. doi: 10.1016/j.bbrc.2011.05.055. [DOI] [PubMed] [Google Scholar]
- 40.Zhang Z, et al. Fine mapping and cloning of the grain number per-panicle gene (Gnp4) on chromosome 4 in rice (Oryza sativa L.) Agric. Sci. China. 2011;10:1825–1833. doi: 10.1016/S1671-2927(11)60182-X. [DOI] [Google Scholar]
- 41.Tian Z, et al. Allelic diversities in rice starch biosynthesis lead to a diverse array of rice eating and cooking qualities. Proc. Natl. Acad. Sci. 2009;106:21760–21765. doi: 10.1073/pnas.0912396106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Ayres NM, et al. Microsatellites and a single-nucleotide polymorphism differentiate apparentamylose classes in an extended pedigree of US rice germ plasm. Theor. Appl. Genet. 1997;94:773–781. doi: 10.1007/s001220050477. [DOI] [Google Scholar]
- 43.Yamanaka S, Nakamura I, Watanabe KN, Sato Y. Identification of SNPs in the waxy gene among glutinous rice cultivars and their evolutionary significance during the domestication process of rice. TAG Theor. Appl. Genet. 2004;108:1200–1204. doi: 10.1007/s00122-003-1564-x. [DOI] [PubMed] [Google Scholar]
- 44.Tanaka N, et al. The structure of starch can be manipulated by changing the expression levels of starch branching enzyme IIb in rice endosperm. Plant Biotechnol. J. 2004;2:507–516. doi: 10.1111/j.1467-7652.2004.00097.x. [DOI] [PubMed] [Google Scholar]
- 45.Lu F-H, Park Y-J. An SNP downstream of the OsBEIIb gene is significantly associated with amylose content and viscosity properties in rice (Oryza sativa L.) J. Cereal Sci. 2012;56:706–712. doi: 10.1016/j.jcs.2012.08.007. [DOI] [PubMed] [Google Scholar]
- 46.Sano Y. Differential regulation of waxy gene expression in rice endosperm. Theor. Appl. Genet. 1984;68:467–473. doi: 10.1007/BF00254822. [DOI] [PubMed] [Google Scholar]
- 47.Wang Z-Y, et al. Nucleotide sequence of rice waxy gene. Nucleic Acids Res. 1990;18:5898. doi: 10.1093/nar/18.19.5898. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Fujita N, et al. Function and characterization of starch synthase I using mutants in rice. Plant Physiol. 2006;140:1070–1084. doi: 10.1104/pp.105.071845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Fujita N, et al. Characterization of SSIIIa-deficient mutants of rice: the function of SSIIIa and pleiotropic effects by SSIIIa deficiency in the rice endosperm. Plant Physiol. 2007;144:2009–2023. doi: 10.1104/pp.107.102533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Li H, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:e164–e164. doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Hao Z, et al. RIdeogram: drawing SVG graphics to visualize and map genome-wide data on the idiograms. PeerJ Comput. Sci. 2020;6:e251. doi: 10.7717/peerj-cs.251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Kawahara Y, et al. Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data. Rice. 2013;6:4. doi: 10.1186/1939-8433-6-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Trapnell C, et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 2012;7:562–578. doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Giudice CL, Tangaro MA, Pesole G, Picardi E. Investigating RNA editing in deep transcriptome datasets with REDItools and REDIportal. Nat. Protoc. 2020;15:1098–1131. doi: 10.1038/s41596-019-0279-7. [DOI] [PubMed] [Google Scholar]
- 59.Picardi E, Derchia AM, Montalvo A, Pesole G. Using REDItools to detect RNA editing events in NGS datasets. Curr. Protoc. Hum. Genet. 2015;49(1):12. doi: 10.1002/0471250953.bi1212s49. [DOI] [PubMed] [Google Scholar]
- 60.Ramaswami G, et al. Identifying RNA editing sites using RNA sequencing data alone. Nat. Methods. 2013;10:128. doi: 10.1038/nmeth.2330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Ramaswami G, et al. Accurate identification of human Alu and non-Alu RNA editing sites. Nat. Methods. 2012;9:579. doi: 10.1038/nmeth.1982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.GSA: Genome sequence archive. Genom. Proteom. Bioinform. (2017).
- 63.Database resources of the National Genomics Data Center in 2020. Nucleic Acids Res.48, D24–D33 (2020). [DOI] [PMC free article] [PubMed]
- 64.Dalman MR, Deeter A, Nimishakavi G, Duan Z-H. Fold change and p-value cutoffs significantly alter microarray interpretations. BMC Bioinform. 2012;13:S11. doi: 10.1186/1471-2105-13-S2-S11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Chien, C.-H. et al. in BMC genomics. S6 (Springer).
- 66.Tian T, et al. agriGO v2.0: a GO analysis toolkit for the agricultural community, 2017 update. Nucleic Acids Res. 2017;45:W122–W129. doi: 10.1093/nar/gkx382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Wickham H. ggplot2: Elegant Graphics for Data Analysis. Berlin: Springer; 2016. [Google Scholar]
- 68.Luo W, Brouwer C. Pathview: an R/Bioconductor package for pathway-based data integration and visualization. Bioinformatics. 2013;29:1830–1831. doi: 10.1093/bioinformatics/btt285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Luo W, Pant G, Bhavnasi YK, Blanchard SG, Jr, Brouwer C. Pathview Web: user friendly pathway visualization and data integration. Nucleic Acids Res. 2017;45:W501–W508. doi: 10.1093/nar/gkx372. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.