Skip to main content
Annals of Botany logoLink to Annals of Botany
. 2014 Jul 1;114(3):499–511. doi: 10.1093/aob/mcu127

Linking ecophysiological modelling with quantitative genetics to support marker-assisted crop design for improved yields of rice (Oryza sativa) under drought stress

Junfei Gu 1,, Xinyou Yin 1,*, Chengwei Zhang 2, Huaqi Wang 2, Paul C Struik 1
PMCID: PMC4204662  PMID: 24984712

Abstract

Background and Aims

Genetic markers can be used in combination with ecophysiological crop models to predict the performance of genotypes. Crop models can estimate the contribution of individual markers to crop performance in given environments. The objectives of this study were to explore the use of crop models to design markers and virtual ideotypes for improving yields of rice (Oryza sativa) under drought stress.

Methods

Using the model GECROS, crop yield was dissected into seven easily measured parameters. Loci for these parameters were identified for a rice population of 94 introgression lines (ILs) derived from two parents differing in drought tolerance. Marker-based values of ILs for each of these parameters were estimated from additive allele effects of the loci, and were fed to the model in order to simulate yields of the ILs grown under well-watered and drought conditions and in order to design virtual ideotypes for those conditions.

Key Results

To account for genotypic yield differences, it was necessary to parameterize the model for differences in an additional trait ‘total crop nitrogen uptake’ (Nmax) among the ILs. Genetic variation in Nmax had the most significant effect on yield; five other parameters also significantly influenced yield, but seed weight and leaf photosynthesis did not. Using the marker-based parameter values, GECROS also simulated yield variation among 251 recombinant inbred lines of the same parents. The model-based dissection approach detected more markers than the analysis using only yield per se. Model-based sensitivity analysis ranked all markers for their importance in determining yield differences among the ILs. Virtual ideotypes based on markers identified by modelling had 10–36 % more yield than those based on markers for yield per se.

Conclusions

This study outlines a genotype-to-phenotype approach that exploits the potential value of marker-based crop modelling in developing new plant types with high yields. The approach can provide more markers for selection programmes for specific environments whilst also allowing for prioritization. Crop modelling is thus a powerful tool for marker design for improved rice yields and for ideotyping under contrasting conditions.

Keywords: Crop yield modelling, drought stress, ecophysiological modelling, genotype–phenotype relationships, introgression lines, model-based ideotyping, QTL, recombinant inbred lines, rice, Oryza sativa

INTRODUCTION

Rice (Oryza sativa) yield has been successfully improved during the last 60 years for both favourable and stressful environments (Peng et al., 2008), through extensive, largely empirical, selection. Developments in genomics provided useful tools and information for dissecting complex traits into single quantitative trait loci (QTLs). QTLs related to important agronomic traits, such as yield and stress tolerance, have been mapped, cloned and characterized (e.g. Xing et al., 2008; for reviews, see Miura et al., 2011; Xing and Zhang, 2011). These developments have provided a firm basis for further improving yield through marker-assisted selection or genetic transformation of crops. However, selection for, or transformation of, only a few or even a complex of genes may not result in a major yield increase (Sinclair et al., 2004; Yin and Struik, 2008). While proven for disease and quality traits, the marker based approach has rarely been proven successful for complex traits such as yield, which typically have low heritabilities and exhibit strong genotype × environment (G × E) interactions (Collard and Mackill, 2008).

The complexity of the yield trait stems from its many underlying processes, which are often environment dependent and show strong feedback and feedforward mechanisms during crop growth. Crop yield can be evaluated using ecophysiological crop simulation models that integrate information about processes at lower levels (Yin and Struik, 2008; Hammer et al., 2010; Zhu et al., 2011). Such models quantify causality between relevant physiological processes and responses of these processes to environmental variables (e.g. irradiance, temperature and availability of water and nutrients). By feeding crop models with weather data from other locations, these models predict yield beyond the environments in which the model parameters were derived and could explain variation in yield of a specific genotype among contrasting environments (Yin et al., 2000a; Sinclair, 2011).

The other type of model inputs are model parameters, often referred to as ‘genetic coefficients’ (Messina et al., 2006; White et al., 2008; Boote et al., 2013). Compared with yield itself, these parameters are thought to be less affected by variation in environment (Yin et al., 2000a). Modelling could thus assist in quantifying the G × E interactions and genotype–phenotype relationships (Yin et al., 2004; Reymond et al., 2004; Hammer et al., 2005; Yin et al., 2005a; Hammer et al., 2006; Chenu et al., 2008; Bertin et al., 2010; Tardieu and Tuberosa, 2010; Messina et al., 2011). This is especially true given advances in the use of molecular markers to map QTLs for model parameters and to integrate this QTL information into the crop models (Yin et al., 2000b; Reymond et al., 2003, 2004; Nakagawa et al., 2005; Quilot et al., 2005; Uptmoor et al., 2008; Xu et al., 2011). This ‘QTL-based modelling’ approach can dissect complex traits into physiologically relevant component traits, integrate effects of QTLs on the component traits over time and space at the whole-crop level and predict a complex trait of various allele combinations under different environmental conditions (Chenu et al., 2009; Yin and Struik, 2010). Such a QTL-based modelling approach was proven to be robust in predicting genetic differences in traits, such as leaf elongation rate in maize (Reymond et al., 2003), flowering time (Nakagawa et al., 2005; Yin et al., 2005a; Uptmoor et al., 2008, 2012) and fruit quality (Quilot et al., 2004, 2005; Bertin et al., 2010; Prudent et al., 2011), under different environmental conditions. A major challenge is to predict phenotypic differences in crop yield between relatively similar lines from a genetic population on the basis of QTL-based model parameters (Yin et al., 2000a, b), especially under stressful conditions.

In the present study, we followed the same approach and used the crop model GECROS (Yin and van Laar, 2005) to simulate variation in grain yield and biomass of biparental crosses of rice under well-watered and drought stress environments. Based on the experience of using an older crop model (Yin et al., 2000a, b), the GECROS model was designed in such a way that its input parameters include those close to the traits breeders score for selection. Given the plethora of genetic factors underlying yield, we aimed to analyse the relative importance of individual markers in accounting for variation in yield. To that end, we calibrated the GECROS model to account for yield differences among introgression lines (ILs) of rice. We identified markers for each model input parameter, and incorporated the effects of these markers to develop a marker-based model. In addition, the marker-based model was extrapolated to account for yield variation among recombinant inbred lines (RILs) derived from the same parents. Finally, we examined whether our modelling approach could enhance marker-assisted crop design for improving yields.

MATERIALS AND METHODS

To achieve the above objectives, we followed a pair-wise methodology (Fig. 1). Its individual steps are described in the following sections.

Fig. 1.

Fig. 1.

Diagram of the methodology used in this study, which combines a simple genetic model, eqn (1), and an ecophysiological crop model, GECROS, into the marker-based crop modelling, in order to rank the relative importance of the identified markers and to design ideotypes for improving rice yield. For acronyms of parameters of GECROS, see Table 1. IL(s), introgression line(s); RIL(s), recombinant inbred line(s).

Plant material and field experiments

The genetic population consisted of 94 advanced backcross ILs and two parents, as described by Gu et al. (2012a). The parents were the lowland cultivar ‘Shennong265’ (japonica, recurrent parent) and the upland rice ‘Haogelao’ (indica–japonica intermediate, donor parent). ‘Haogelao’ is drought tolerant, but low-yielding; ‘Shennong265’ is drought susceptible, but high-yielding under irrigated conditions. After a cross between the two parents, the resultant F1 plants were backcrossed with ‘Shennong265’ three times, and these BC3F1 plants were consecutively self-pollinated five times to construct the genetic population by the single-seed descent method.

Field experiments were conducted to assess model parameters and to measure grain yield and shoot biomass (two major model output traits). The ILs and the two parents were sown on 10 May 2009 by direct seeding at the experimental station of China Agricultural University, Beijing (39°54′N, 116°24′E; elevation 50 m a.s.l.), China (Gu et al., 2012a). The mean temperature was 24·8 °C, the total precipitation was 384 mm and the mean global radiation was 16·7 MJ m−2 d−1 during the growing season. During grain set, maximum temperatures reached values up to 39·6 °C. The soil was classified as a calciaquoll, containing 23·5 % sand, 57·1 % silt and 19·4 % clay. The field experiments followed a randomized complete block design, with two replications, four rows of 2·5 m per plot, 0·30 m between rows, in both rainfed upland and fully irrigated lowland conditions. Seed was hand sown at a depth of 0·03–0·04 m. At the seedling stage, plants were thinned to a 0·075 m distance between plants within each row, resulting in a plant density of 44·4 plants m−2. Weeds in both conditions were controlled by a combination of chemical and manual methods. Insects and diseases were controlled chemically. Basal fertilizer application included 48 kg N ha−1 (as urea), 120 kg P2O5 ha−1 and 100 kg K2O ha−1; in addition, 86 kg N ha−1 was applied at the tillering stage and 28 kg N ha−1 at the booting stage. For fully irrigated lowland conditions, rice was grown under continual standing water until harvest. For rainfed upland conditions, besides rainfall, irrigation was only applied when necessary at critical stages (i.e. at sowing, 120 mm; at tillering, 150 mm; and at booting, 130 mm).

An independent population of 251 RILs derived from the same parents (La, 2004; Zhang, 2006) was sown on 7 May 2005 by direct seeding at the experimental station in Zhuozhou (39·29′N, 115·59′E; elevation 45 m a.s.l.), China (Zhang, 2006). The mean temperature was 25·3 °C, the total precipitation was 331 mm and the mean global radiation was 16·7 MJ m−2 d−1 during the growing season. The experiment design and management for both irrigated lowland and rainfed upland conditions were the same as in the field experiments in Beijing in 2009.

Measurements conducted in the experiment for ILs were: grain yield and yield components (grains m−2 and grain weight), above-ground biomass, flowering date, maturity date, plant height, leaf area and weight, and grain nitrogen concentration which was determined by means of micro-Kjeldahl digestion and distillation. ‘Total crop nitrogen uptake at maturity’ (Nmax) was determined based on dry weight and the nitrogen concentration in plant organs, assuming that nitrogen concentration in straw was conservative at 0·463 % (see data of Singh et al., 1998), and nitrogen accumulation in the roots was assumed to be 5 % of Nmax (Yin and van Laar, 2005). Phenology, grain yield and biomass were also measured from the experiment for RILs.

The crop growth model

The model used in this study was the crop growth model GECROS, first described by Yin and van Laar (2005). GECROS is a generic model that simulates the growth and development of the crop on a daily basis. It generates phenotypes for a multitude of traits, based on concepts of the interaction and feedback mechanisms among various contrasting components of crop growth, carbon–nitrogen interaction in particular (Yin and Struik, 2010). Crop phenology, canopy photosynthesis, canopy transpiration, crop respiration, nitrogen uptake, partitioning of carbon and nitrogen assimilates among growing organs, green surface area index and senescence are among the physiological processes or traits that the model simulates. The summary information about the latest GECROS model (v3.0) is given in Supplementary Data, Model Description (see also Yin, 2013). For a given set of model parameters and environmental conditions, the model produces simulated grain yield and biomass as output variables.

Model inputs, parameterization, calibration and test

The weather inputs for the GECROS model are daily radiation, vapour pressure, maximum and minimum temperature, rainfall and wind speed. These required weather data were collected from a nearby weather station in 2005 (experiment with RILs) and 2009 (experiments with ILs) at Zhuozhou and Beijing, respectively. Atmospheric CO2 concentration and the amount of irrigated water were also used as model input.

A complete set of model parameters (Table 1) was determined for each IL from data collected in the experiment under well-watered conditions in 2009. These parameters, covering various morphological, phenological and physiological characteristics of the crop, are: individual seed dry weight (SW), seed nitrogen concentration (nSO), maximum plant height (Hmax), the minimum number of days for the vegetative growth phase (mV) or for the reproductive (seed-fill) phase (mR) provided both photoperiod and temperature are optimal, and specific leaf area constant for newly appeared leaves (Sla). Table 1 also lists measured Nmax as a model parameter. Nmax per se, as an accumulative quantity in the crop life cycle, is not considered as a model parameter of the original GECROS. However, there was not sufficient information about the soil at the experimental sites, and modelling of nitrogen availability for transition between flooded and non-flooded soil environments is complex and usually full of uncertainties (Gaydon et al., 2012). The model without using observed Nmax did not simulate yield differences among the ILs sufficiently (see the Results). To reduce an influence of uncertainties in predicting edaphic variables for nitrogen supply on our model-based sensitivity analysis to identify important markers (see below), we treated Nmax as if it were a model parameter.

Table 1.

List of genotype-specific parameters of the GECROS model (see the Materials and Methods)

Trait Description Units
SW Seed dry weight g d. wt seed−1
nSO Seed (storage organ) N concentration g N g−1 d. wt
Hmax Maximum plant height m
mV Minimum days for vegetative growth phase d
mR Minimum days for reproductive (seed fill) phase d
Sla Specific leaf area constant of newly appeared leaves m2 leaf g−1 d. wt
Nmax Total crop N uptake at crop maturity* g N m−2 ground

d. wt, dry weight; N, nitrogen.

*Not an input parameter in the original GECROS (see the text).

All model parameters were estimated for each genotype from the well-watered treatment of the 2009 experiment in Beijing. Parameters mV and mR are calculated based on a flexible bell-shaped non-linear function of phenological response to temperature (Yin et al., 2005b), flowering time and harvest time for each IL. For other non-genotype-specific model parameters and setting initial conditions of simulation, default values described by Yin and van Laar (2005) for rice were used for all lines.

To test the model, simulated dry grain yield and dry shoot biomass were compared with measured data for both ILs and RILs. To evaluate the model outputs, we used the relative root mean square error (rRMSE; Wallach et al., 2006), calculated as the root mean square error divided by the mean of the observed value. In addition, the R2 coefficient of linear regression between simulated and observed values was used to indicate the percentage of phenotypic variation accounted for by the model.

Statistical identification of markers for model parameters and yield

A total of 130 simple sequence repeat (SSR) markers and their position for the IL population were reported previously (Gu et al., 2012a; see also Supplementary Data Fig. S1). In order to select markers which could be potentially used for breeding, the effects of markers were analysed using a two-stage approach as described by Gu et al. (2012a). First, using the general linear model (GLM) procedure in the statistical package SAS 9·2 (SAS Instute Inc.), one-way analysis of variance (ANOVA) was used to test the significance (P < 0·05) of markers across the whole genome. Secondly, all significant markers were put into a multiple regression model in a GLM procedure, using eqn (1) where a model parameter value Y (listed in Table 1) of introgression line k, as affected by N markers, was presented as:

Yk=μ+Σn=1NanMk,n (1)

where μ is the intercept and an is the additive effect of the nth marker; Mk, n is the genetic score of the nth marker of the individual introgression line k that takes either the value –1 if the allele comes from recurrent parent ‘Shennong265’ or the value 1 if it comes from donor parent ‘Haogelao’. We used this simple additive model, as additive genetic effects are predictably transmitted to progeny and more complex effects on model parameters are presumbly taken into account by ecophysiological crop models (Chapman et al., 2003). There may be non-significant markers in the multiple regression because of the collinearity of markers (Martens and Næs, 1992; Næs and Mevik, 2001). The non-significant marker with the highest P-value was excluded in the next round of multiple regression. This approach left out one marker at a time, until all markers in the multiple regression became significant (P < 0·05) (Ott and Longnecker, 2010).

Incorporating the effects of identified markers into a crop model

Using the principle as described previously (Yin et al., 2000b, 2005a), the effects of the markers identified in the preceding step were fed to GECROS for simulating differences in yield and biomass among the ILs (Fig. 1). This was achieved by replacing the measured parameter values with the marker-based parameter values of GECROS. The marker-based parameter values were calculated using eqn (1) for each genotype, based on the estimated additive effects, i.e. an in eqn (1), for each parameter and the marker allelic information of the ILs. The same approach was used to derive parameter values, based on addititive effects of markers and the marker alleles of RILs, when using GECROS to simulate yield differences among individuals of the RIL population.

Identifying important yield-determining markers and ideotype design

Linear regression analyses were performed to identify which model parameter in Table 1 influenced yield most. A sensitivity analysis using the GECROS model was performed to identify the contribution of single markers to yield production, by following the approach of Yin et al. (2000a), i.e. examining yield variation accounted for by the GECROS model when the tested marker was excluded in estimating the marker-based model parameters. First, the baseline simulation was conducted, where IL-specific allelic values for all markers were used as input for simulation. Then, allelic values were fixed, one marker at a time, at zero (Fig. 1). The extent to which the percentage of yield variation accounted for (R2) was decreased relative to the percentage accounted for in the baseline simulation was used to rank the relative importance of the markers in determining grain yields: the more R2 decreases, the more important is the marker. The ranking of markers identified by GECROS was compared with that from the conventional analysis of yield data per se. Furthermore, simulated yields of ideotypes designed by pyramiding yield-increasing alleles of model-identified markers were compared with the simulated yields of ideotypes designed by pyramiding those markers identified by analysing yield per se.

RESULTS

Phenotypic variation in yield and physiological model parameters

There was no yield difference between replicates in either well-watered or drought-stressed conditions (P > 0·05). The IL population exhibited considerable phenotypic variation in model parameters Nmax and grain yield (Fig. 2), showing transgressive segregation. Model parameters SW, nSO, Hmax and Sla, Nmax in a drought-stressed environment and yield presented a unimodal distribution. For parameters mV and mR, and Nmax in a well-watered environment, a bimodal distribution was observed. Parent ‘Shennong 265’ yielded more than ‘Haogelao’, even under drought.

Fig. 2.

Fig. 2.

Frequency distribution of seven model parameters and yield in the population of introgression lines (ILs). Arrows show values for the two parents (full arrow for ‘Haogelao’ and dotted arrow for ‘Shennong265’).

Contribution of individual model parameters to yield

Effects of individual model parameters on yield were analysed, for both well-watered and drought-stressed environments as assessed in the 2009 experiments. Simple regression and correlation based on data of all genotypes revealed that yield correlated with most model parameters (Supplementary Data Table S1). Among them, Nmax was correlated with yield most. Nmax alone accounted for 57·6 and 59·2 % of the variation in yield under well-watered and drought-stressed environments, respectively (Table 2). Nmax was also associated with other model parameters (Supplementary Data Table S1). For example, Nmax under well-watered conditions correlated with Sw, Hmax, mV and mR (r = 0·29, 0·44, –0·29 and 0·32, respectively; P < 0·01). Therefore, Nmax was used as covariate when multiple regression was conducted relating yield to each model parameter (Table 2). The results showed that yield correlated significantly with all model parameters, except for SW. Besides Nmax, nSO was best correlated to yield under well-watered conditions, and mV was best correlated to yield under drought-stressed conditions. Except for the correlation of mR with yield, all other significant correlations with yield were negative (Table 2). These results agreed well with the simple correlation analysis for yield and model input parameters (Supplementary Data Table S1), and suggested that lower grain N concentrations, shorter vegetative and longer grain-filling phases, lower stem lengths and thicker leaves were associated with higher grain yields.

Table 2.

Linear regression of rice yield (Y) against total crop N uptake (Nmax) and one other parameter trait of both well-watered and drought-stressed input parameters (n = 96; for definition of these traits, see Table 1)

Equation b0 b1 b2 R2
Y = b0 + b1Nmax −15·43/–62·61 44·49***/37·13*** 0·576/0·592
Y = b0 + b1Nmax + b2SW −39·73/–44·54 43·86***/37·37*** 1356·6/–881·1 0·577/0·593
Y = b0 + b1Nmax + b2nSO 466·60/86·83 47·11***/35·94*** −35698·7***/–10033·9* 0·749/0·614
Y = b0 + b1Nmax + b2Hmax 77·43/48·84 49·70***/40·49*** −128·5**/–120·7*** 0·609/0·646
Y = b0 + b1Nmax + b2mV 630·66/553·40 39·03***/29·32*** −6·73***/–6·41*** 0·668/0·714
Y = b0 + b1Nmax + b2mR −310·43/–339·14 38·54***/29·04*** 15·69***/14·63*** 0·669/0·710
Y = b0 + b1Nmax + b2Sla 161·52/92·18 42·74***/36·78*** −8334·2*/–7903·0** 0·598/0·625

Values are presented as ‘well-watered/drought-stressed’.

*, **, ***Significant at the 0·05, 0·01 and 0·001 probability levels, respectively.

Phenotypic calibration of the GECROS model

The GECROS model was evaluated for both well-watered and drought-stressed experiments in 2009. The model did not simulate well differences among the ILs in Nmax, and, therefore, in grain yield. Similarly when using the across-IL mean value of observed Nmax, the model did not simulate well the differences in grain yield (for well-watered, R2 = 0·24; rRMSE = 0·17; for drought-stressed, R2 = 0·12, rRMSE = 0·34).

After using the genotype-specific values for Nmax, the model accounted for 72 % of the variation in yield (Fig. 3A) and for 78 % of the variation in biomass (Fig. 3B), with rRMSE values of 0·10 and 0·09, respectively, for the well-watered conditions.

Fig. 3.

Fig. 3.

Comparison between observed and simulated values of grain yield (A, C) and biomass (B, D) of rice introgression lines for both well-watered (A, B) and drought-stressed (C, D) environments. The simulation was based on the phenotypic values of seven parameters (Table 1) estimated from the 2009 experiment. The diagonal line is the 1:1 line.

For simulating yield in the drought-stressed environment, first, all parameter values as used for the well-watered environment were applied. This procedure resulted in systematic overestimations, as the actual nitrogen uptake was much less, resulting in reduced growth under drought. Therefore, observed Nmax from the drought-stressed environment was used. The model accurately simulated biomass, but still overestimated grain yield by overestimating the number of grains per m2 (results not shown). A calibration was applied by reducing the seed number (i.e. approx. 6171 m−2) for all ILs, based on the difference between simulated average population mean and experimental data. After such a calibration, the model accounted for 57 % of the variation in grain yield (Fig. 3C) and 73 % of the variation in biomass (Fig. 3D), with rRMSE values of 0·23 and 0·12, respectively.

Both estimations of yield for well-watered and drought-stressed environments were slightly poorer than the best fit of linear regression in Table 2. This suggests that the input parameters required for GECROS (Sw, nSO, Hmax, mV, mR and Sla) were not all important for defining yield for this IL population, as confirmed by later analysis.

Coupling the effects of identified markers to the crop model

First, a multiple regression analysis was conducted to identify markers for each model parameter. In total, 20 markers were detected for all seven model parameters, with 3–6 markers per parameter (Table 3; Supplementary Data Fig. S1). The total fraction of the phenotypic variation accounted for by the markers ranged from 27·3 to 51·7 %. Marker RM410 showed pleiotropic effects on nSO, Hmax, mV, mR and Sla; marker RM8030 had pleiotropic effects on SW, Hmax, and Nmax under well-watered conditions; marker RM11 was related to phenology influencing both mV and mR; marker RM338 influenced both mV and Nmax in the drought-stressed environment; marker RM475 was related to Nmax in both well-watered and drought-stressed environments. Also based on eqn (1), we detected five or four markers when analysing yield per se (Table 3). These markers were generally among those markers identified for one or more model parameters; however, RM4085 for yield in the well-watered environment, and RM538 for yield in the drought-stressed environment (Table 3; Supplementary Data Fig. S1) were not detected for any model parameter.

Table 3.

Coefficients of eqn (1) used to identify markers conferring seven physiological model input parameters and for grain yield, using data from the well-watered conditions in 2009 and also from drought-stressed conditions for total nitrogen uptake

Trait μ Chromosome Location (cM) Markers Additive effect (an) P-value R2 (%)
SW 0·0222 1 124·8 RM1152 0·0010 0·0002 45·2
2 110·9 RM1367 0·0008 0·0007
2 139·3 RM8030 −0·0009 0·0035
4 123·8 RM2799 −0·0006 0·0039
nSO 0·0148 3 79·1 RM251 0·0009 <0·0001 36·8
9 64·4 RM410 0·0003 0·0002
12 61·6 RM1261 −0·0004 0·0039
Hmax 1·174 1 9·5 RM8068 0·037 0·0213 51·7
2 139·3 RM8030 −0·057 0·0004
4 25·5 RM518 −0·035 0·0042
7 43·5 RM432 0·081 <0·0001
9 64·4 RM410 0·042 <0·0001
10 87·1 RM294A 0·058 0·0021
mV 90·78 1 124·8 RM1152 −1·38 0·0041 33·6
3 108·4 RM338 1·31 0·0487
7 47 RM11 1·61 0·0191
9 64·4 RM410 1·23 0·0002
mR 21·71 1 124·8 RM1152 0·52 0·0127 27·3
7 47 RM11 −0·81 0·0086
9 64·4 RM410 −0·59 0·0001
Sla 0·0203 1 25·4 RM8145 0·0007 0·0006 31·0
7 81·05 RM3753 0·0006 0·0020
9 64·4 RM410 0·0003 0·0098
Nmax well-watered 7·83 2 92·5 RM475 −0·44 0·0082 37·0
2 139·3 RM8030 −0·50 0·0052
8 83·7 RM284 −0·53 0·0004
9 0·8 RM5799 −0·35 0·0460
Nmax drought-stressed 5·13 1 98·1 RM306 0·62 0·0011 36·2
2 92·5 RM475 −0·46 0·0066
3 108·4 RM338 −0·76 0·0042
5 20·6 RM7302 −0·47 0·0277
Yield well-watered 325·3 2 92·5 RM475 −21·7 0·0288 56·5
2 139·3 RM8030 −34·9 0·0009
8 35·7 RM4085 −14·5 0·0431
8 83·7 RM284 −22·4 0·0113
9 64·4 RM410 −19·5 0·0048
Yield drought-stressed 137·2 3 108·4 RM338 −42·2 0·0002 45·4
5 20·6 RM7302 −22·2 0·0067
5 132·7 RM538 25·7 0·0328
9 64·4 RM410 −18·8 0·0004

For definitions and units of these parameters, see Table 1. Marker positions were based on the SSR marker linkage map established for the rice introgression lines population (Gu et al., 2012a; see Supplementary Data Fig. S1).

Positive values of the additive effect indicate that increasing alleles of the trait score came from ‘Haogelao’.

R2 denotes the percentage of phenotypic variation accounted for by all markers identified for a given parameter or trait.

Secondly, based on the additive effects estimated by the regression analysis and allele information at each detected locus, marker-based values for each of the model parameters were calculated using eqn (1) for each IL. The performance of GECROS with marker-based estimates of model parameters was examined (Fig. 4). The marker-based GECROS model accounted for 52 % of the across-IL phenotypic variation of yield in the well-watered environment and for 43 % of the across-IL phenotypic variation in the drought-stressed environment, with rRMSE of 0·13 and 0·27, respectively. These percentages were almost comparable with those percentages accounted for by the markers identified for yield per se (Table 3).

Fig. 4.

Fig. 4.

Comparison between observed values and those simulated by the marker-based trait inputs for 96 rice genotypes of the IL population, (A) for grain yield and (B) for biomass, for well-watered and drought-stressed environments (see key in A). The diagonal line is the 1:1 line.

The GECROS model using marker-based parameters gave less accurate simulations than using measured model parameters (Fig. 4 vs. Fig. 3). In both well-watered and drought-stressed cases, the marker-based model seemed to overestimate the lower end of observed yield and biomass, and to underestimate the higher end of observed yield and biomass, and, as a result, the range of simulated values was narrower than that of the observed data. This narrower range could be caused by the fact that the detected markers only explained part of the variation of model parameters (Table 3). The marker-based simulations correlated with the original simulations in grain yield for both the well-watered (r = 0·73) and the drought-stressed (r = 0·70) environment (Supplementary Data Fig. S2). Similar correlations were obtained in biomass (r = 0·75 and 0·71 for the two environments, respectively; Supplementary Data Fig. S2).

For predicting yield differences within an independent, larger population of RILs derived from the same parents, marker-based estimates of model parameters were used according to the same approach as in the IL population using eqn (1) and values of the additive effects shown in Table 3. The model explained 21 % of the phenotypic variation among 251 RILs under well-watered conditions and 20 % of the phenotypic variation under drought-stressed conditions (Fig. 5), with rRMSE = 0·31 and 0·45, respectively.

Fig. 5.

Fig. 5.

Comparison between observed values of grain yield and those simulated by additive effects of marker-based parameters for a population of 251 recombinant inbred lines grown under well-watered and drought-stressed environments. The diagonal line is the 1:1 line.

Model-based sensitivity analysis to identify important yield-defining markers

The sensitivity of crop yield in response to selection for an individual marker was analysed by excluding the effect of the marker in estimating marker-based model parameters. Marker RM8030 on chromosome 2 contributed most to the yield of the IL population under well-watered conditions. When the additive effect of RM8030 was excluded, the phenotypic variation accounted for by the GECROS model decreased most: from 51·6 to 34·2 % (Table 4). A different marker, RM338 on chromosome 3, contributed most under drought-stressed conditions. Excluding the additive effect of RM338 decreased the explained phenotypic variation in yield most: from 42·6 to 29·8 %. The extents to which the model accounted for the phenotypic variation decreased from that of the baseline simulation were used to rank the markers in importance for determining crop yield (Table 4). This ranking agreed well with the linear regression analysis of rice yield against model parameters (Table 2). As shown by the linear regression, Nmax contributed most to the variation in yield. In accordance with that observation, the markers identified for Nmax (Table 3) proved to be the most important yield-influencing markers identified by the sensitivity analysis (Table 4).

Table 4.

Percentage of the phenotypic variation in yield among rice ILs (R2) accounted for by different sets of simulations using the marker-based version of the GECROS model, when markers were fixed one at a time to calculate different sets of marker-based parameter inputs for GECROS

Fixed marker
Well-watered
Drought-stressed
Chr_cM Name R2 (%) Rank R2 (%) Rank
1_9.5 RM8068 51·6 11 42·6 13
1_25.4 RM8145 53·9 18 41·4 10
1_98.1 RM306 51·6 11 39·5 6
1_124.8 RM1152 50·9 9 44·6 18
2_92.5 RM475* 46·2 3 37·7 5
2_110.9 RM1367 51·7 14 45·5 19
2_139.3 RM8030* 34·2 1 40·9 9
3_79.1 RM251 47·9 5 46·2 20
3_108.4 RM338 52·6 17 29·8 1
4_25.5 RM518 59·4 20 44·1 17
4_123.8 RM2799 51·8 16 40·2 7
5_20.6 RM7302 51·6 11 33·2 2
7_43.5 RM432 50·7 8 36·9 4
7_47 RM11 51·7 14 43·8 16
7_81.05 RM3753 49·4 7 41·9 11
8_83.7 RM284* 45·7 2 42·6 13
9_0.8 RM5799 48·3 6 42·6 13
9_64.4 RM410*,† 47·3 4 35·9 3
10_87.1 RM294A 51·1 10 40·5 8
12_61.6 RM1261 53·9 19 42·1 12
Baseline simulation 51·6 42·6

Marker positions are denoted as ‘Chr_cM’, i.e. chromosome_centiMorgan, as identified in Table 3.

The baseline simulation gives the R2 values for the simulation, in which no marker was fixed, i.e. IL-specific allelic values (–1 or 1) were used for all markers in calculating marker-based inputs; for other sets of simulations, markers were fixed one at a time, in which all ILs were assumed to carry an identical allele (i.e. 0) at the locus of the considered marker in calculating marker-based inputs.

*These markers were also identified for yield per se under well-watered conditions (see Table 3);

These markers were also identified for yield per se under drought-stressed conditions (see Table 3).

Table 4 also shows that under well-watered conditions, markers RM410 and RM251 related to nSO had a higher ranking than RM5799 related to Nmax; in drought-stressed conditions, marker RM410 and RM432 influencing Hmax had a higher ranking than RM306 and RM475 influencing Nmax. These results were in line with an earlier regression analysis, which showed that parameters nSO and Hmax also had significant effects on yield for well-watered and drought-stressed conditions, respectively (Table 2).

Most high-ranking markers found in this approach were consistent with the markers identified for yield per se in Table 3; for example, the four highest-ranking markers in the well-watered environment (i.e. RM8030, RM284, RM475 and RM410) and the three highest-ranking markers in the drought-stressed environment (i.e. RM338, RM7302 and RM410). Our approach detected a total of 20 markers for each of well-watered and drought-stressed conditions (Table 4), more than the markers identified from multiple regression analysis for yield (Table 3). However, only ten markers mattered for the model to account for yield variation for each of the environments. The remaining markers listed in Table 4 were not important in accounting for yield as removing these markers had no effect on, or even increased the value of, the explained percentage (Table 4). For example, by removing the additive effect of marker RM251, the simulation for drought-stressed conditions could improve from 42·6 to 46·2 % variation accounted for.

Designing ideotypes based on identified markers

Given the markers identified above, it is possible to simulate yields of ideotypes designed by pyramiding yield-increasing alleles of relevant identified markers. We hypothetically designed three ideotypes: one based on the markers in Table 4 that mattered for the GECROS to account for yield variation (Ideotype I), one based on the complete set of the markers in Table 4 (Ideotype II) and one based on only those markers identified by analysing yield per se (Ideotype III). Figure 6 shows GECROS-simulated yields for these three ideotypes as well as the population mean of observed yields. Compared with Ideotype III, Ideotype I was simulated to have 10 and 23 % higher yields, and Ideotype II had 19 and 36 % higher yields, under the well-watered and drought-stressed environments, respectively. On across-environment average, there was only 3 % yield advantage for Ideotype III over the population mean, whereas average yield advantages for Ideotypes I and II were 18 and 29 %, respectively.

Fig. 6.

Fig. 6.

The population mean in the observed yield (with standard error of the mean) of 94 ILs and two parents, and simulated yields using the GECROS model for three hypothetical ideotypes, under well-watered and drought-stressed environments. Ideotype I is the hypothetical genotype designed by pyramiding positive alleles of the ten markers in Table 4 that mattered in improving the percentage of yield variation explained by the GECROS model. Ideotype II is the hypothetical genotype designed by pyramiding positive alleles of all 20 markers in Table 4. Ideotype III is the hypothetical genotype designed by pyramiding positive alleles of 4–5 markers identified by analysing yield per se as shown in Table 3. The ‘pyramiding’ was done by summing up the additive effect of all relevant markers affecting GECROS model parameters that are expected to have positive impacts for improving yield. It is worth noting that in simulating the yield of Ideotype III, the effect of the marker identified by analysing yield per se that had not been detected by analysing model parameters (i.e. RM4085 for well-watered and RM538 for drought-stressed environments, see the text) needs to be accounted for. This was done by multiplying the simulated yield based on the remaining three or four markers found by analysing yield per se with the factor derived as the ratio of the best yield expected from additive effects of 4–5 markers to the best yield expected from additive effects of 3–4 markers.

DISCUSSION

Simulating genotypic differences using the (marker-based) GECROS model

The GECROS model uses the concept of carbon–nitrogen interactions for simulating crop growth (Yin and van Laar, 2005; Yin and Struik, 2010; Yin, 2013). Unlike those of earlier Wageningen models, the input parameters of GECROS are few and most of them are related to the traits that breeders usually measure (Table 1), which may facilitate the use of crop modelling in support of breeding (Yin et al., 2004). Using as few as seven parameters (Table 1), the calibrated GECROS model accounted for observed differences in yield among the 96 ILs including the parents (Fig. 3). Marker-based GECROS also simulated yield differences among the 96 genotypes (Fig. 4).

Since the marker-based model parameters were based on the estimated genetic effects, the marker-based crop model should be able to simulate the variation within any progeny from the same parents. This was shown to be the case, using independent lines of the same cross that were not included in the QTL mapping step (Reymond et al., 2003). Here we tested this possibility using a different population, i.e. RILs derived from the same parents. The comparatively low percentage of yield variation accounted for by the RIL population (Fig. 5) could have been caused by the comparatively larger number of RILs (n = 251), which might have involved segregations that were not revealed by markers found in the smaller IL population. The limited number of markers with small additive effects only accounting for from 27·3 to 51·7 % of the phenotypic variation of model input parameters in the IL population (Table 3) could be another reason.

There were also problems to overcome when applying this approach. This is reflected by the fact that GECROS, like many models, tended to overestimate low yield and underestimate high yield. Specifically, we first showed that the model performance was sensitive to nitrogen uptake, as plant nitrogen content affects not only canopy dynamics but also leaf photosynthesis, and, therefore, biomass and yield, in line with the long recognized role of nitrogen in determining crop yield (Sinclair and de Wit, 1975). However, due to the complexity in modelling the transition between flooded and non-flooded soil environments (Gaydon et al., 2012) and the lack of accurate information on soil-related parameters needed to simulate nitrogen uptake, Nmax was used in this study as if it were an input parameter of GECROS (Table 1).

We secondly showed that the drought treatment changed the sink–source relationships. In our simulation, model parameters were estimated from the 2009 well-watered experiment in Beijing. However, a further calibration was found necessary for simulating spikelet numbers when the model was applied to the drought-stressed environment (see the Results). Drought environments reduce transpirational cooling, leading to high tissue organ temperature and high spikelet sterility in rice; this effect can be highly genotype dependent (Jagadish et al., 2007). The generic model GECROS relates potential seed number to carbon and nitrogen accumulation in the vegetative phase, and does not have algorithms to account for the direct effect of panicle temperature on spikelet fertility (Yin and van Laar, 2005).

Boote et al. (2013) emphasized the needs for more mechansims in crop models when used for characterizing genotype–phenotype relationships. Our study suggests that there is an obvious need for robust algorithms to dissect Nmax involving rooting density and depth for resource capture from the soil (King et al., 2003). Similarly, better estimation of the final spikelet number of rice when applying the model for stress conditions is needed by considering floret opening time in the flowering day and transpirational control of panicle tissue temperature (Jagadish et al., 2007).

Role of crop models in dissecting complex traits

Crop modelling can dissect complex traits into physiological components. Using the crop model GECROS, yield was connected to, and dissected into, seven model parameters in this study. By dissecting complex traits into physiologically meaningful components – model parameters – it is possible to assess genetic variation for each component and evaluate its relative importance by sensitive analyses or regression analyses. Regression analyses showed that Nmax had the most significant effect on yield (Table 2). This is in line with the result showing that the important yield-determining markers identified by GECROS-based sensitivity analysis (Table 4) were mainly those for Nmax (Table 3). Similarly, Prudent et al. (2011), combining a fruit sugar model and QTL analysis, identified key elementary processes and genetic factors underlying tomato fruit sugar concentration. These results show that the dissection approach based on physiological models can indicate where the QTLs for complex traits come about (Yin et al., 2002), thereby revealing biological insights into complex traits. Given that the number of QTLs identified for any single trait is always limited (Kearsey and Farquhar, 1998), model-based dissection can detect more QTLs/markers than analysing yield per se (Table 3).

The power of model-based dissection may depend on whether model parameters represent ultimate physiological components. For example, Hmax is chosen as a GECROS parameter because it is among the traits that breeders measure; but it can be further dissected into parameters for stem extension rate and duration. Nevertheless, the current GECROS can indicate traits and markers that have the least impact on grain yield (Table 4). Such an analysis may suggest whether or not the model has incorporated the correct parameters for explaining yield differences among genotypes in a population. One of our initial thoughts was to explore the role of leaf photosynthesis in improving rice yield given extensive reports in the literature on this subject (e.g. Richards, 2000; Fischer and Edmeades, 2010); so, our previous major effort was to map QTLs for leaf photosynthesis (Gu et al., 2012a, b). Based on marker effects on individual biochemical parameters of leaf photosynthesis identified by Gu et al. (2012b) using representative ILs, we attempted to introduce these marker effects into the biochemical photosynthesis sub-model of GECROS. The variation accounted for by the model decreased significantly for both well-watered and drought-stressed conditions (results not shown). This suggests that biochemical parameters of leaf photosynthesis are not important for defining yield in our population of ILs.

The lack of importance of some other parameters for our IL population was suggested by the result showing that the marker-based GECROS accounted for 43–52 % of yield variation (Fig. 4) while identified markers typically accounted for only 25–30 % of variation in individual parameters (Table 3). Removing some markers even increased the explained percentage, as was the case for marker RM251 under drought-stressed conditions (Table 4). This could have been caused by the fact that marker RM251 only influenced nSO (Table 3), which only had a marginally significant effect on yield under drought-stressed conditions (Table 2). The irrelevance of parameters in another crop model in defining yield differences has been noted previously (Yin et al., 2000a) for a barley RIL population of similar size. Whether or not this type of result is general requires further research using a larger population with more recombinant events that probably show more diverse relationships between yield and model parameters.

The model-based dissection approach, despite many advantages (see also the next section), may never replace the approach of the yield per se, because the latter not only is simple but also identified markers (e.g. RM4085 for well-watered, and RM538 for drought-stressed environment, Table 3; Supplementary Data Fig. S1) that were not detected by the model-based approach. This arises from the possibility that markers under the statistical threshold of detection for component traits can be detected when the aggregated complex trait itself was analysed (Yin et al., 2002). The other possibility is that the present GECROS model might miss some yield-influencing traits as input parameters, especially those important for determining the spikelet number.

Marker-based modelling to improve the efficiency of designing crop ideotypes

Combined with conventional breeding, marker-based approaches have been used to integrate major genes or QTLs with large effect into widely grown varieties (Jena and Mackill, 2008). However, so far, the marker-based approach has only had a moderate impact on breeding for complex traits such as yield and drought tolerance for which many genes with only small effect are involved, and which are highly environment dependent (Collard and Mackill, 2008). A crop model-based dissection approach can identify the most determinant yield-defining traits, and can suggest how to create the combination of component traits for an ideal plant type that will perform best under given conditions (Peng et al., 2008). QTL-based models can be used to evaluate the contribution of a single QTL to yield (Chenu et al., 2009), and, therefore, could assist in finding the most important markers.

We showed that the GECROS model, if well parameterized and calibrated, can be a useful tool to enhance the efficiency of selection for improved yields. The markers were first identified for various yield-determining physiological traits that are input parameters of GECROS (Table 3). The relative importance of these markers was then ranked by sensitivity analysis using the marker-based model (Table 4). Such an analysis detected markers that breeders can prioritize in their programmes for specific environments. Our analysis also confirms the assertion that rather than looking only for QTLs for a complex trait (yield) itself, determining QTLs for underlying component traits will provide more genetic information (Yin et al., 2002; Tardieu and Tuberosa, 2010; Prudent et al., 2011). Notably, the GECROS model-based approach identified some markers that were otherwise unidentified by analysis of yield per se, e.g. marker RM432 for the drought-stressed environment (Tables 3 and 4). This approach provides breeders with more choice of markers for selection. Alleles of these markers can be pyramided to create ideotypes (Gu et al., 2012b). Simulations showed that ideotypes based on more markers identified by the model (Ideotypes I and II) had a higher yield potential than the ideotype based on the markers identified for yield per se (Ideotype III) (Fig. 6). It remains to be tested through actual breeding whether this additional information does indeed result in better genotypes.

Quantitative trait locus/marker-based modelling combined with sensitivity analysis (Table 4) can also directly evaluate the effect of a a single QTL/marker on yield level, which could be used to evaluate a specific genotype in silico, thus potentially reducing labour-intensive selection in the field. Crop modelling quantifies causality between relevant physiological processes and the responses of these processes to environmental variables, and might therefore help to resolve the QTL × environment interaction (Hemamalini et al., 2000; Asins, 2002). Using the CROPGRO-soybean model, Messina et al. (2006) estimated the effects of QTL markers from a set of near-isogenic lines and satisfactorily predicted the variation of yield across 5 years and eight sites among an independent set of soybean cultivars. Chenu et al. (2009), using the crop model APSIM-Maize, simulated that a QTL accelerating leaf elongation will increase yield in an environment with water deficit before flowering, but reduced yield under terminal drought stress. Our model analysis showed that the marker RM338 contributed the greatest to yield under a stressed environment, but had no effect at all under a well-watered environment (Table 4). This modelling analysis will greatly improve the selection efficiency for traits which are greatly influenced by environment factors.

Concluding remarks

This study outlines a genotype-to-phenotype approach that exploits potential values of marker-based crop modelling in developing new plant types with high yields. The approach can provide more, yet prioritize, markers in the selection programmes for specific environments. Although the modelling approach may never replace the analysis of yield per se approach, our simulation analysis showed that ideotypes based on the modelling approach had 10–36 % yield advantages. Further work could be achieved by upgrading crop models for rice, based on the identified weakness of the model, especially for drought stress conditions. As pointed out recently by Boote et al. (2013), ‘model linkage to genetics is a new area with good potential’, and incorporating new mechanisms and algorithms into crop models to identify more relevant traits is needed to better account for genotype–phenotype relationships.

SUPPLEMENTARY DATA

Supplementary data are available online at www.aob.oxfordjournals.org and consist of the following. Model Description: description of the crop growth model GECROS. Figure S1: chromosome locations of markers associated with the seven model input parameters and with yield per se. Figure S2: correlation between simulated values from marker-based model parameters and those from measured model parameter values for 96 rice genotypes of the IL population for grain yield and biomass, for well-watered and drought-stressed environments. Table S1: simple correlation coefficients for rice yield and model input parameters for well-watered and drought-stressed environments.

Supplementary Data

ACKNOWLEDGEMENTS

This research was partially funded by the INCO project CEDROME (015468) of the European Commission FP6 and partly funded by the China Scholarship Council. We thank Dr T. J. Stomph for his comments on earlier versions of the manuscript.

LITERATURE CITED

  1. Asins M. Present and future of quantitative trait locus analysis in plant breeding. Plant Breeding. 2002;121:281–291. [Google Scholar]
  2. Bertin N, Martre P, Génard M, Quilot B, Salon C. Under what circumstances can process-based simulation models link genotype to phenotype for complex traits? Case-study of fruit and grain quality traits. Journal of Experimental Botany. 2010;61:955–967. doi: 10.1093/jxb/erp377. [DOI] [PubMed] [Google Scholar]
  3. Boote KJ, Jones JW, White JW, Asseng S, Lisaso JI. Putting mechanisms into crop production models. Plant, Cell and Environment. 2013;36:1658–1672. doi: 10.1111/pce.12119. [DOI] [PubMed] [Google Scholar]
  4. Chapman SC, Cooper M, Podlich D, Hammer GL. Evaluating plant breeding strategies by simulating gene action and dryland environment effects. Agronomy Journal. 2003;95:99–113. [Google Scholar]
  5. Chenu K, Chapman SC, Hammer GL, McLean G, Salah HBH, Tardieu F. Short-term responses of leaf growth rate to water deficit scale up to whole-plant and crop levels: an integrated modelling approach in maize. Plant, Cell and Environment. 2008;31:378–391. doi: 10.1111/j.1365-3040.2007.01772.x. [DOI] [PubMed] [Google Scholar]
  6. Chenu K, Chapman SC, Tardieu F, McLean G, Welcker C, Hammer GL. Simulating the yield impacts of organ-level quantitative trait loci associated with drought response in maize: a ‘gene-to-phenotype’ modeling approach. Genetics. 2009;183:1507–1523. doi: 10.1534/genetics.109.105429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Collard BCY, Mackill DJ. Marker-assisted selection: an approach for precision plant breeding in the twenty-first century. Philosophical Transactions of the Royal Society B: Biological Sciences. 2008;363:557–572. doi: 10.1098/rstb.2007.2170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Fischer RA, Edmeades GO. Breeding and cereal yield progress. Crop Science. 2010;50:85–98. [Google Scholar]
  9. Gaydon DS, Probert ME, Buresh RJ, et al. Rice in a cropping systems – modelling transitions between flooded and non-flooded soil environments. European Journal of Agronomy. 2012;30:9–24. [Google Scholar]
  10. Gu J, Yin X, Struik PC, Stomph TJ, Wang H. Using chromosome introgression lines to map quantitative trait loci for photosynthesis parameters in rice (Oryza sativa L.) leaves under drought and well watered field conditions. Journal of Experimental Botany. 2012a;63:455–469. doi: 10.1093/jxb/err292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Gu J, Yin X, Stomph TJ, Wang H, Struik PC. Physiological basis of genetic variation in leaf photosynthesis among rice (Oryza sativa L.) introgression lines under drought and well-watered conditions. Journal of Experimental Botany. 2012b;63:5137–5153. doi: 10.1093/jxb/ers170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Hammer GL, Chapman S, Van Oosterom E, Podlich DW. Trait physiology and crop modelling as a framework to link phenotypic complexity to underlying genetic systems. Crop and Pasture Science. 2005;56:947–960. [Google Scholar]
  13. Hammer G, Cooper M, Tardieu F, et al. Models for navigating biological complexity in breeding improved crop plants. Trends in Plant Science. 2006;11:587–593. doi: 10.1016/j.tplants.2006.10.006. [DOI] [PubMed] [Google Scholar]
  14. Hammer GL, van Oosterom E, McLean G, et al. Adapting APSIM to model the physiology and genetics of complex adaptive traits in field crops. Journal of Experimental Botany. 2010;61:2185–2202. doi: 10.1093/jxb/erq095. [DOI] [PubMed] [Google Scholar]
  15. Hemamalini G, Shashidhar H, Hittalmani S. Molecular marker assisted tagging of morphological and physiological traits under two contrasting moisture regimes at peak vegetative stage in rice (Oryza sativa L.) Euphytica. 2000;112:69–78. [Google Scholar]
  16. Jagadish S, Craufurd P, Wheeler T. High temperature stress and spikelet fertility in rice (Oryza sativa L.) Journal of Experimental Botany. 2007;58:1627–1635. doi: 10.1093/jxb/erm003. [DOI] [PubMed] [Google Scholar]
  17. Jena K, Mackill D. Molecular markers and their use in marker-assisted selection in rice. Crop Science. 2008;48:1266–1276. [Google Scholar]
  18. Kearsey M, Farquhar AGL. QTL analysis in plants; where are we now? Heredity. 1998;80:137–142. doi: 10.1046/j.1365-2540.1998.00500.x. [DOI] [PubMed] [Google Scholar]
  19. King J, Gay A, Sylvester-Bradley R, et al. Modelling cereal root systems for water and nitrogen capture: towards an economic optimum. Annals of Botany. 2003;91:383–390. doi: 10.1093/aob/mcg033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. La H. Beijing, China: China Agricultural University; 2004. The QTL mapping of traits related to drought tolerance and genetic transformation of herbicide tolerance in rice (Oryza sativa L.) Dissertation. [Google Scholar]
  21. Martens H, Næs T. Multivariate calibration. Chichester, UK: John Wiley and Sons; 1992. [Google Scholar]
  22. Messina CD, Jones JW, Boote KJ, Vallejos CE. A gene-based model to simulate soybean development and yield responses to environment. Crop Science. 2006;46:456–466. [Google Scholar]
  23. Messina CD, Podlich D, Dong Z, Samples M, Cooper M. Yield-trait performance landscapes: from theory to application in breeding maize for drought tolerance. Journal of Experimental Botany. 2011;62:855–868. doi: 10.1093/jxb/erq329. [DOI] [PubMed] [Google Scholar]
  24. Miura K, Ashikari M, Matsuoka M. The role of QTLs in the breeding of high-yielding rice. Trends in Plant Science. 2011;16:319–326. doi: 10.1016/j.tplants.2011.02.009. [DOI] [PubMed] [Google Scholar]
  25. Næs T, Mevik BH. Understanding the collinearity problem in regression and discriminant analysis. Journal of Chemometrics. 2001;15:413–426. [Google Scholar]
  26. Nakagawa H, Yamagishi J, Miyamoto N, Motoyamo M, Yano M, Nemoto K. Flowering response of rice to photoperiod and temperature: a QTL analysis using a phenological model. Theoretical and Applied Genetics. 2005;110:778–786. doi: 10.1007/s00122-004-1905-4. [DOI] [PubMed] [Google Scholar]
  27. Ott RL, Longnecker M. An introduction to statistical methods and data analysis. California: Duxbury Press; 2010. [Google Scholar]
  28. Peng S, Khush GS, Virk P, Tang Q, Zou Y. Progress in ideotype breeding to increase rice yield potential. Field Crops Research. 2008;108:32–38. [Google Scholar]
  29. Prudent M, Lecomte A, Bouchet JP, Bertin N, Causse M, Genard M. Combining ecophysiological modelling and quantitative trait locus analysis to identify key elementary processes underlying tomato fruit sugar concentration. Journal of Experimental Botany. 2011;62:907–919. doi: 10.1093/jxb/erq318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Quilot B, Wu BH, Kervella J, Génard M, Foulongne M, Moreau K. QTL analysis of quality traits in an advanced backcross between Prunus persica cultivars and the wild relative species P. davidiana. Theoretical and Applied Genetics. 2004;109:884–897. doi: 10.1007/s00122-004-1703-z. [DOI] [PubMed] [Google Scholar]
  31. Quilot B, Kervella J, Génard M, Lescourret F. Analysing the genetic control of peach fruit quality through an ecophysiological model combined with a QTL approach. Journal of Experimental Botany. 2005;56:3083–3092. doi: 10.1093/jxb/eri305. [DOI] [PubMed] [Google Scholar]
  32. Reymond M, Muller B, Leonardi A, Charcosset A, Tardieu F. Combining quantitative trait loci analysis and an ecophysiological model to analyze the genetic variability of the responses of maize leaf growth to temperature and water deficit. Plant Physiology. 2003;131:664–675. doi: 10.1104/pp.013839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Reymond M, Muller B, Tardieu F. Dealing with the genotype × environment interaction via a modelling approach: a comparison of QTLs of maize leaf length or width with QTLs of model parameters. Journal of Experimental Botany. 2004;55:2461–2472. doi: 10.1093/jxb/erh200. [DOI] [PubMed] [Google Scholar]
  34. Richards RA. Selectable traits to increase crop photosynthesis and yield of grain crops. Journal of Experimental Botany. 2000;51:447–458. doi: 10.1093/jexbot/51.suppl_1.447. [DOI] [PubMed] [Google Scholar]
  35. Sinclair TR. Challenges in breeding for yield increase for drought. Trends in Plant Science. 2011;16:289–293. doi: 10.1016/j.tplants.2011.02.008. [DOI] [PubMed] [Google Scholar]
  36. Sinclair TR, de Wit CT. Photosynthate and nitrogen requirements for seed production by various crops. Science. 1975;189:565–567. doi: 10.1126/science.189.4202.565. [DOI] [PubMed] [Google Scholar]
  37. Sinclair TR, Purcell LC, Sneller CH. Crop transformation and the challenge to increase yield potential. Trends in Plant Science. 2004;9:70–75. doi: 10.1016/j.tplants.2003.12.008. [DOI] [PubMed] [Google Scholar]
  38. Singh U, Ladha JK, Castillo EG, Punzalan G, Tirol-Padre A, Duqueza M. Genotypic variation in nitrogen use efficiency in medium- and long-duration rice. Field Crops Research. 1998;58:35–53. [Google Scholar]
  39. Tardieu F, Tuberosa R. Dissection and modelling of abiotic stress tolerance in plants. Current Opinion in Plant Biology. 2010;13:206–212. doi: 10.1016/j.pbi.2009.12.012. [DOI] [PubMed] [Google Scholar]
  40. Uptmoor R, Schrag T, Stützel H, Esch E. Crop model based QTL analysis across environments and QTL based estimation of time to floral induction and flowering in Brassica oleracea. Molecular Breeding. 2008;21:205–216. [Google Scholar]
  41. Uptmoor R, Li J, Schrag T, Stützel H. Prediction of flowering time in Brassica oleracea using a quantitative trait loci-based phenology model. Plant Biology. 2012;14:179–189. doi: 10.1111/j.1438-8677.2011.00478.x. [DOI] [PubMed] [Google Scholar]
  42. Wallach D, Makowski D, Jones JW. Working with dynamic crop models: evaluation, analysis, parameterization, and applications. Amsterdam, The Netherlands: Elsevier; 2006. [Google Scholar]
  43. White JW, Herndl M, Hunt LA, Payne TS, Hoogenboom G. Simulation-based analysis of effects of Vrn and Ppd loci on flowering in wheat. Crop Science. 2008;48:678–687. [Google Scholar]
  44. Xing Y, Zhang Q. Genetic and molecular bases of rice yield. Annual Review of Plant Biology. 2011;61:421–442. doi: 10.1146/annurev-arplant-042809-112209. [DOI] [PubMed] [Google Scholar]
  45. Xing Y, Tang W, Xue W, Xu C, Zhang Q. Fine mapping of a major quantitative trait loci, qSSP7, controlling the number of spikelets per panicle as a single Mendelian factor in rice. Theoretical and Applied Genetics. 2008;116:789–796. doi: 10.1007/s00122-008-0711-9. [DOI] [PubMed] [Google Scholar]
  46. Xu L, Henke M, Zhu J, Kurth W, Buck-Sorlin G. A functional–structural model of rice linking quantitative genetic information with morphological development and physiological processes. Annals of Botany. 2011;107:817–828. doi: 10.1093/aob/mcq264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Yin X. Improving ecophysiological simulation models to predict the impact of elevated atmospheric CO2 concentration on crop productivity. Annals of Botany. 2013;112:465–475. doi: 10.1093/aob/mct016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Yin X, Struik PC. Applying modelling experiences from the past to shape crop systems biology: the need to converge crop physiology and functional genomics. New Phytologist. 2008;179:629–642. doi: 10.1111/j.1469-8137.2008.02424.x. [DOI] [PubMed] [Google Scholar]
  49. Yin X, Struik PC. Modelling the crop: from system dynamics to systems biology. Journal of Experimental Botany. 2010;61:2171–2183. doi: 10.1093/jxb/erp375. [DOI] [PubMed] [Google Scholar]
  50. Yin X, van Laar HH. Crop systems dynamics: an ecophysiological simulation model for genotype-by-environment interactions. Wageningen, The Netherlands: Wageningen Academic Publishers; 2005. [Google Scholar]
  51. Yin X, Kropff MJ, Goudriaan J, Stam P. A model analysis of yield differences among recombinant inbred lines in barley. Agronomy Journal. 2000a;92:114–120. [Google Scholar]
  52. Yin X, Chasalow SD, Dourleijn CJ, Stam P, Kropff MJ. Coupling estimated effects of QTLs for physiological traits to a crop growth model: predicting yield variation among recombinant inbred lines in barley. Heredity. 2000b;85:539–549. doi: 10.1046/j.1365-2540.2000.00790.x. [DOI] [PubMed] [Google Scholar]
  53. Yin X, Chasalow S, Stam P, et al. Use of component analysis in QTL mapping of complex crop traits: a case study on yield in barley. Plant Breeding. 2002;121:314–319. [Google Scholar]
  54. Yin X, Struik PC, Kropff MJ. Role of crop physiology in predicting gene-to-phenotype relationships. Trends in Plant Science. 2004;9:426–432. doi: 10.1016/j.tplants.2004.07.007. [DOI] [PubMed] [Google Scholar]
  55. Yin X, Struik PC, van Eeuwijk FA, Stam P, Tang J. QTL analysis and QTL-based prediction of flowering phenology in recombinant inbred lines of barley. Journal of Experimental Botany. 2005a;56:967–976. doi: 10.1093/jxb/eri090. [DOI] [PubMed] [Google Scholar]
  56. Yin X, Struik PC, Tang J, Qi C, Liu T. Model analysis of flowering phenology in recombinant inbred lines of barley. Journal of Experimental Botany. 2005b;56:959–965. doi: 10.1093/jxb/eri089. [DOI] [PubMed] [Google Scholar]
  57. Zhang C. Beijing, China: China Agricultural University; 2006. The genetic analysis and the QTL mapping of the major agronomic traits in RIL between lowland and upland ric (Oryza sativa L.) Dissertation. [Google Scholar]
  58. Zhu X-G, Zhang G, Tholen D, Wang Y, Xin C, Song Q. The next generation models for crops and agro-ecosystems. Science China Information Sciences. 2011;54:589–597. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Annals of Botany are provided here courtesy of Oxford University Press

RESOURCES