Skip to main content
Journal of Experimental Botany logoLink to Journal of Experimental Botany
. 2020 Feb 25;71(6):1885–1898. doi: 10.1093/jxb/erz545

Functional QTL mapping and genomic prediction of canopy height in wheat measured using a robotic field phenotyping platform

Danilo H Lyra 1,✉,#, Nicolas Virlet 2,#, Pouria Sadeghi-Tehran 2, Kirsty L Hassall 1, Luzie U Wingen 3, Simon Orford 3, Simon Griffiths 3, Malcolm J Hawkesford 2, Gancho T Slavov 1,4
Editor: Greg Rebetzke5
PMCID: PMC7094083  PMID: 32097472

Functional analysis of longitudinal phenotypic data for five and 10 time points saturates QTL detection power and genomic predictive ability for canopy height in wheat.

Keywords: Data smoothing, dimensionality reduction, dynamic QTLs, factor-analytic model, function-valued traits, genomic selection, phenomics

Abstract

Genetic studies increasingly rely on high-throughput phenotyping, but the resulting longitudinal data pose analytical challenges. We used canopy height data from an automated field phenotyping platform to compare several approaches to scanning for quantitative trait loci (QTLs) and performing genomic prediction in a wheat recombinant inbred line mapping population based on up to 26 sampled time points (TPs). We detected four persistent QTLs (i.e. expressed for most of the growing season), with both empirical and simulation analyses demonstrating superior statistical power of detecting such QTLs through functional mapping approaches compared with conventional individual TP analyses. In contrast, even very simple individual TP approaches (e.g. interval mapping) had superior detection power for transient QTLs (i.e. expressed during very short periods). Using spline-smoothed phenotypic data resulted in improved genomic predictive abilities (5–8% higher than individual TP prediction), while the effect of including significant QTLs in prediction models was relatively minor (<1–4% improvement). Finally, although QTL detection power and predictive ability generally increased with the number of TPs analysed, gains beyond five or 10 TPs chosen based on phenological information had little practical significance. These results will inform the development of an integrated, semi-automated analytical pipeline, which will be more broadly applicable to similar data sets in wheat and other crops.

Introduction

In recent years, field-based high-throughput plant phenotyping (HTPP) tools have been used extensively in crop trials, aiming to reduce or eliminate manual measurements, increase the amount and quality of data for temporally dynamic phenotypes, and, ultimately, translate ‘big data’ collected using various sensors into knowledge (Rebetzke et al., 2016; Sadeghi-Tehran et al., 2017; Virlet et al., 2017; Araus et al., 2018). However, even with rapidly increasing computational power and efficiency, image post-processing (i.e. alignment, calibration, and segmentation) is demanding, and the collection of data throughout the crop growth cycle of large trials remains onerous (Araus and Cairns, 2014; Haghighattalab et al., 2016; Pauli et al., 2016). Thus, collecting measurements at excessively frequent intervals may not be cost-efficient (Araus and Kefauver, 2018; Reynolds et al., 2019) and can even result in poorer overall quality of the data set, as certain phenological stages (e.g. reproductive) are particularly prone to data collection errors (Dreccer et al., 2019; Rebetzke et al., 2019). Because of these limitations and the critical importance of manual or automated data cleaning (Tardieu et al., 2017), the number of time points (TPs) for which HTPP data are to be extracted and analysed is an important consideration, as is the distribution of these data TPs throughout the growing period of the crop (Rutkoski et al., 2016; Dreccer et al., 2019).

The opportunities created by the availability of HTPP data also pose significant analytical challenges in genetic studies aimed at identifying quantitative trait loci (QTLs) in mapping families (Li and Sillanpaa, 2015; Sun and Wu, 2015) and genomic prediction of complex phenotypic traits in breeding populations (Cabrera-Bosquet et al., 2012; van Eeuwijk et al., 2019). Functional mapping based on maximum likelihood (Ma et al., 2002) was originally proposed to enhance conventional QTL mapping approaches by capturing the additional dynamic information in longitudinal phenotypic data using mathematical functions [i.e. treating time series as ‘function-valued traits’ (Stinchcombe et al., 2012)]. Over the last two decades, this methodology has been applied across a range of experimental systems (Wu and Lin, 2006; Li et al., 2010; Jiang et al., 2015; Xu et al., 2016). However, when implemented in its original version, functional mapping is computationally inefficient, particularly when large numbers of TPs and/or markers are analysed (Wang et al., 2017). In an attempt to overcome these limitations, while retaining or further increasing statistical power, Moore et al. (2013) and Kwak et al. (2014) proposed more computationally efficient regression-based methods. These approaches were refined further by Kwak et al. (2016), who replaced the original observed trait data with a smoothed approximation (i.e. reducing phenotype noise) and then applied a dimensionality reduction technique to reduce the number of tests.

Previous studies of wheat plant height, an important agronomic trait, have identified several dynamic QTLs (Wang et al., 2010; Wu et al., 2010; Zhang et al., 2017). In addition, functional mapping approaches have generated empirical evidence that QTLs can be expressed (i) during very short periods (hereafter referred to as ‘transient’ QTLs); (ii) during specific phenological stages; or (iii) throughout the growing season (hereafter referred to as ‘persistent’ QTLs) (Bac-Molenaar et al., 2015; Al-Tamimi et al., 2016; Campbell et al., 2017; Feldman et al., 2017; Muraya et al., 2017; Ward et al., 2019). Recently, Camargo et al. (2018) used longitudinal data from a glasshouse HTPP facility to characterize several physiological and morphometric wheat traits in much greater detail. However, similar mapping studies of wheat grown under field conditions are currently lacking.

Several decades of linkage mapping studies have resulted in the identification of thousands of QTLs across a wide variety of traits and crops, but the rate of translation of these QTLs into marker-assisted selection in breeding programmes has generally been very poor (Bernardo, 2016; Wallace et al., 2018). With the gradual realization that genomic selection is the only viable option of using marker–trait associations in breeding programmes (Crossa et al., 2017), the interest of geneticists working with HTPP function-valued data is increasingly shifting towards genomic prediction (Rutkoski et al., 2016; Aguate et al., 2017; Campbell et al., 2017, 2018; Montesinos-Lopez et al., 2017a, b, 2018; Sun et al., 2017, 2019; Watanabe et al., 2017; Crain et al., 2018; Juliana et al., 2019; Krause et al., 2019; van Eeuwijk et al., 2019). However, despite several methodological developments in this field, many questions remain. For example, it is not currently clear if increasing the number of TPs consistently helps to increase the predictive ability for temporally dynamic traits, or how to most efficiently extract simple, genetically meaningful parameters from highly dimensional data (van Eeuwijk et al., 2019). Most studies so far have focused on multitrait and random regression models (Sun et al., 2017; Campbell et al., 2018; Crain et al., 2018), but a number of potentially more effective approaches (e.g. functional regression analysis) have not yet been fully explored (Montesinos-Lopez et al., 2017a, 2018).

In this study, we used a high-throughput Field Scanalyzer (Virlet et al., 2017) to collect a dense time series of height measurements in a wheat mapping population. Our goals were to (i) combine individual TP and functional mapping approaches for the detection of transient, stage-specific, and persistent QTLs; (ii) explore various approaches for using longitudinal data for genomic prediction; and (iii) compare results between different sampling intensities (i.e. number of TPs) and sampling strategies (i.e. systematically interspersed versus informed by phenology).

Materials and methods

Mapping population and field experiment

A recombinant inbred line (RIL) population (F7 generation) comprising 197 lines derived from crossing the Chinese Spring (CS) and Paragon (PAR) genotypes was used in this study (Allen et al., 2017; Wingen et al., 2017). The field trials were conducted on the Field Scanalyzer phenotyping platform (LemnaTec GmbH) (Virlet et al., 2017) during the 2015–2016 (hereafter referred to as the 2016 data set) and 2016–2017 (hereafter referred to as the 2017 data set) growing seasons (from November to June) at Rothamsted Research, Harpenden, UK (51°48'34.56''N, 0°21'22.68''W). The experimental scheme was laid out in augmented blocks consisting of RILs, as well as one (2016; Crusoe) and five (2017; CS, PAR, Crusoe, and two RILs) check genotypes. Plots of 1.0×1.0 m were used in 2016 and 0.8×1.0 m in 2017. They were spaced 0.2 m apart, and the within-plot inter-row distance was 0.15 m. In 2016, RILs were grown in individual rows (six different RILs per plot) and, in 2017, RILs were grown in three consecutive rows (two RILs per plot). Both experiments were grown under conventional fertilization, weed, and pest control.

High-throughput data collection and image processing

A fully automated, high-throughput, and fixed-site phenotyping platform was used to acquire all data (Virlet et al., 2017). The twin lasers within the camera bay were used to collect data on multiple days throughout the growing season, from tillering through flowering (growth stages 20–69) (Zadoks, 1974; Tottman, 1987), and canopy height was extracted from the point clouds. Briefly, the main steps of the method consisted of merging the two point clouds, cropping the RIL of interest, and then splitting it into sub-regions (Supplementary Fig. S1 at JXB online, step 2 to 3). Subsequently, an average canopy height was estimated for each sub-region of the RIL and then averaged together to obtain the plot canopy height (Supplementary Fig. S1, step 4 and 5). A detailed description of the steps is given in the text preceding Supplementary Fig. S1. In total, data for 22 TPs [between 90 and 251 days after sowing (DAS)] for 2016 and 26 TPs (between 23 and 228 DAS) for 2017 were automatically extracted for each plot. In addition, canopy height of each RIL was measured manually on five representative plants at four TPs in 2016 and 2017 (Supplementary Fig. S2). For the latter data set, the coefficient of determination (R2) between manual and automatic measurements was 0.59, 0.78, 0.85, and 0.81 for 70, 155, 208, and 233 DAS, respectively (Supplementary Fig. S2B). However, we observed reduced data quality in the 2016 data set after 196 DAS due to a strong overlap between RILs as each row was planted with a different RIL (i.e. R2 of 0.66, 0.66, 0.35, and 0.55 for 115, 196, 216, and 239 DAS, respectively) (Supplementary Fig. S2C). Thus, we used the 2016 data set only for validation of the 2017 QTL mapping analyses.

Phenotypic analysis

Factor-analytic (time series) analysis

We used the ASReml package (Butler et al., 2009) within R (R Core Team, 2018) to obtain the best linear unbiased predictions (BLUPs) for the RIL genotypes across all TPs by fitting the following mixed linear model:

y=Xβ +Cw+Zg+ε  (1)

where y was a matrix of phenotypic values of genotypes across TPs; β was a vector of the fixed effects of TPs, w was a vector of the fixed effects of check genotypes, g was a vector of the random-effect genotypic values of individuals across TPs, and ε was a vector of random residuals. The incidence matrices for β, w, and g were X, C, and Z, respectively. A factor-analytic (FA, order 1) variance–covariance structure (VCOV) was used for the genotype effects (Meyer, 2009). Thus, we assumed g~MVN(0,σ 2gI), with covariance matrix σ 2g=Var(gA, where Σ=ΛΛ T + Ψ (Λ was a t TP × k loadings matrix, and Ψ was a t×t diagonal matrix with a specific variance for each TP), A was the numerator relationship matrix (we used the I identity matrix), was the Kronecker product, and MVN was the multivariate normal distribution. For the residual term, we assumed heterogeneous variances between TPs (block-diagonal structure) jointly with a two-dimensional separable autoregressive (AR1) matrix to fit the row and column effects (i.e. spatial correction). Thus, the vector of errors was partitioned into ε=ξ+η, where ξ and η refer to the spatially correlated and independent errors, respectively. The variance of residuals was assumed to be Var( ε )=var(ξ+η )=R=Σξ2[c(Φc)r(Φr)]+(IDη ), where σ 2ξ was the variance due to local tendency and Dη was a t×t diagonal VCOV matrix, with each TP having a specific spatially independent variance component. Matrices c(Φc) and r(Φr) formed the AR1 matrix with auto-correlation parameters Φ c and Φ r and order given by the number of columns (c) and rows (r), respectively (Gilmour et al., 1997). The significance of random and fixed effects was estimated for each TP using likelihood ratio and Wald tests at α=0.01 using the reml.lrt.asreml and wald functions from the asremlPlus R (Brien, 2019) and ASReml-R packages, respectively (Supplementary Table S1).

In order to visualize genetic parameters, we generated a surface quadratic plot of covariance matrix of TPs as follows:

Var(g)=ΣA=[ ​​λ​​ 112+Ψ1 ​​λ​​ 11 ​​λ​​ 1t ​​λ​​ 11 ​​λ​​ 1t ​​λ​​ 112+Ψt]A, where Λ=[ ​​λ​​ 11 ​​λ​​ 1t] and Ψ=[Ψ1000000Ψt] (Meyer, 2009). In addition, we calculated the genetic correlations (rg) among traits (i.e. heights at different TPs) (Supplementary Fig. S3), following the equation: rg=COV12/Σg12Σg22, where COV12 was the genetic covariance between two traits; σ 2g1 and σ 2g2 were the genetic variances associated with each trait. The VCOV and genetic correlations were extracted from the ASReml-R output using the function met.corr2 from the AAfun set of additional functions for ASReml-R (https://github.com/yzhlinscau/AAfun/). Finally, broad-sense heritability (h2) was estimated based on an entry-mean for each TP, following the equation: h2=σ 2g/(σ 2g+σ 2ε), where σ 2g=Vg and σ 2ε= were the genetic and residual variances (V), respectively. The SEs of heritability estimates were estimated using the ‘delta’ method (Venables and Ripley, 2000) using the pin R function (http://www.homepages.ed.ac.uk/iwhite/asreml/uop).

Smoothing and dimensionality reduction

We applied data smoothing followed by dimensionality reduction of genotype BLUPs across all TPs following the approach described by Kwak et al. (2016). First, we smoothed the data for each individual genotype using B-splines, choosing the number of basis splines to be used for all genotypes by minimizing the sum of squared errors from 10-fold cross-validation. Secondly, we applied functional principal component analysis (PCA) of the smoothed data to reduce their dimensionality. We used the funqtl (Kwak et al., 2014) and fda (Ramsay et al., 2009, 2018) R packages for these analyses. The resulting coefficients of B-splines and functional principal components (PCs) were used for functional QTL mapping and genomic prediction (see below).

Time point selection

We compared two approaches for selecting five or 10 TPs from the 26 TPs available for the 2017 data by sampling TPs in two different ways (Table 1). In the systematic (SY) approach, TPs were selected as equally spaced as possible (scenarios R1, R2, R5, and R6 in Table 1). This corresponds to a situation in which the image data are collected and processed throughout the growing season, without using previous knowledge about variability in different phenological stages. In contrast, in the growth stage (GS) approach, TPs were allocated preferentially to later stages (scenarios R3, R4, R7, and R8 in Table 1) as phenotypic variation for height is known to increase over the course of the growing season (Holman et al., 2016). This corresponds to a situation in which data collection and processing efforts are allocated based on previous knowledge and periodic inspection of the predominant phenological stage in the population.

Table 1.

Systematic (SY) and phenological growth stage (GS) time point (TP) selection approaches for functional QTL mapping and genomic prediction in 197 wheat recombinant inbred lines (see the Materials and methods)

No. of TPs Approach Scenario Days after sowing selected Growth stagea Multiple QTLsb Genomic heritabilityc
TL SE HF HK ML SL PC1 BS (3, 4, 7) BS (4, 5, 8)
5 SY R1 23/70/128/181/228 3 1 1 5A 7A 2B 2B 0.23±0.02 0.20±0.02 0.20±0.02
SY R2 23/78/155/193/228 2 2 1 2B 2B 2B 0.23±0.02 0.22±0.02 0.20±0.02
GS R3 70/172/219/223/228 1 1 3 5A 7A 5A 7A 5A 7A 0.29±0.04 0.36±0.03 0.21±0.02
GS R4 181/214/219/223/228 0 1 4 2B 5A 7A 7A 7A 0.30±0.04 0.34±0.03 0.20±0.02
10 SY R5 23/44/64/84/119/155/ 172/187/214/228 5 3 2 2B 5A 5B 7A 5A 7A 5A 0.26±0.03 0.30±0.04 0.21±0.02
SY R6 37/57/70/98/119/155/ 172/193/219/228 5 3 2 2B 5A 5B 7A 5A 7A 5A 0.26±0.04 0.29±0.04 0.24±0.03
GS R7 84/155/166/181/193/ 209/214/219/223/228 1 5 4 2B 5A 7A 5A 5A 0.28±0.04 0.32±0.03 0.24±0.03
GS R8 166/172/181/187/193/209/214/219/223/228 0 6 4 2B 5A 7A 5A 5A 0.28±0.04 0.34±0.04 0.21±0.02
26 R9 14 8 4 2B 5A 5B 7A 2B 5A 7A 0.28±0.04 0.32±0.02 0.21±0.04

a Number of TPs selected in the tillering (TL), stem elongation (SE), and heading/flowering (HF) growth stages.

b Chromosomes on which QTLs were detected at α=0.05 from the multiple functional QTL model based on multitrait (HKLOD), maximum (MLOD), and average (SLOD) score profiles.

c Genomic heritability from the GBLUP model using dimension-reduced (first principal component, PC1) and the last two B-spline coefficients (BS) for five (BS 3–4), 10 (BS 4–5), and 26 (BS 7–8) TPs. Values are means ±SE estimated from 50 random cross-validations.

Molecular markers

The 197 RILs were genotyped using the 35K Affymetrix Axiom® HD wheat single nucleotide polymorphism (SNP) array (Allen et al., 2017). Markers with significant segregation distortion and >20% missing data were removed, and a genetic linkage map was constructed as described previously by Allen et al. (2017). Briefly, the map included 9434 SNPs, covering a total of ~6632.3 cM of the wheat genome (21 linkage groups).

Markers with a low call rate (<90%) were removed, and the remaining missing SNP data were imputed using Beagle 4.1 (Browning and Browning, 2016) within the codeGeno function from the Synbreed R package (Wimmer et al., 2012). Furthermore, markers with minor allele frequency (MAF) <0.01 and with redundant genotypes across all individuals were removed. The final molecular marker data consisted of 2330 SNPs, with the number of markers per chromosome ranging from 14 (chromosome 4D) to 223 (chromosome 5A; Supplementary Fig. S4).

QTL mapping

QTL scanning was performed using both individual TP and functional mapping approaches because we were aiming to detect both transient and persistent QTLs. Thus, individual TP analysis was performed for each TP using the genotype BLUPs from the FA model utilizing the R/qtl package (Broman et al., 2003). We performed both interval mapping (IM) and composite interval mapping (CIM) via Haley–Knott regression (Haley and Knott, 1992). For CIM, the number of marker covariates was set at five, with a window size of 10 cM. Logarithm of odds (LOD) thresholds for type I error rate (α=0.05) were obtained for each TP based on 1000 permutations.

Functional mapping was performed using the non-parametric approach suggested by Kwak et al. (2014, 2016) on the functional PCs (three, four, and five PCs for five, 10, and 26 TPs, respectively) using the funqtl R package. Single and multiple QTL analyses were carried out using the multitrait (HKLOD), maximum (MLOD), and average (SLOD) score approaches.

For the HKLOD approach, we used the following model:

y=Xβ +ε  (2)

where y is an n×p matrix of derived phenotypes (n genotypes×p functional PCs), X is an n×2 matrix of QTL genotype probabilities, β is a 2×p matrix of QTL effects, and ε is an n×p matrix of random residuals following a multivariate normal distribution. At each position λ (putative QTL location), a multivariate regression model with a single QTL was fitted. Thus, the log10 likelihood ratio of the HK model was HKLOD=n2log10{|RSS0||RSS(λ)|} where RSS was the matrix of sums of squares and cross-products of residuals, and |RSS| was its determinant; that is, |RSS0| was the matrix determinant for the null model with no QTL, and |RSS(λ)| was the matrix determinant for the model with a single QTL at position λ.

In contrast, the MLOD and SLOD approaches used the maximum and average LOD scores across TPs, respectively: MLOD(λ)=maxtLOD(t,λ), and SLOD(λ)=1Tt=1TLOD(t,λ), where T is the number of TPs. LOD thresholds corresponding to α=0.05 were obtained for all three methods based on 1000 permutations using the scanoneM function of the funqtl package.

Finally, for multiple functional mapping (i.e. modelling several marker intervals simultaneously), we followed the methodology proposed by Kwak et al. (2014, 2016) in which the penalized LOD scores of Broman and Speed (2002) and Manichaikul et al. (2009) were used to estimate the HKLOD, MLOD, and SLOD scores. Initially, we selected the main effect, heavy chain, and light chain penalties obtained from 1000 permutations (α=0.05) using the scantwoF function. Then, we used the stepwise model search algorithm (Broman and Speed, 2002) to scan the genome using forward selection of a model with up to six fixed QTLs, followed by backward elimination to the null model using the stepwiseqtlM function.

Power simulations

We performed data perturbation simulations (Yu et al., 2006) to assess the statistical power of detecting persistent and transient QTLs using individual TP and functional mapping analyses. For each of 1000 iterations of the simulations, we randomly chose an SNP from the genotypic data set and assigned constant phenotypic effects –α and α (Falconer and Mackay, 1995) to alternative homozygous genotypes for that SNP (i.e. there were no heterozygotes in the data). These effects were then added to BLUPs for all TPs (persistent QTLs) or for a single, randomly selected TP (transient QTLs), and the resulting values were used as phenotypes for QTL mapping (i.e. as described above). We varied the additive effects to achieve different proportions of variance explained (PVE) using the relationship PVE=11+1k2p(1p) (Yu et al., 2006; Slavov et al., 2014), where p is the estimated frequency of an arbitrarily chosen allele and k is the simulated additive effect divided by the SD of the BLUPs for the respective TP. Finally, statistical power was estimated as the proportion of iterations in which the simulated additive effects were detected as QTLs at α=0.05 based on the permutation thresholds used in our analyses of empirical data (see above).

Genomic prediction

The phenotypic traits used for prediction were the (i) individual TP BLUPs; (ii) the last two B-spline coefficients; and (iii) the first functional PCs from analyses of five, 10, and 26 TPs, including different TP selection scenarios (R1–R8, Table 1).

We used the additive genomic best linear unbiased prediction (GBLUP) model:

y^=Xβ +Za+ε  (3)

where y^ was a vector of phenotypic values, β was the vector of fixed effects, a was the vector of random additive genetic effects of the SNP markers, and ε was a vector of random residuals. The incidence matrices for β and a were X and Z, respectively. The distributions of random effects were assumed to be a~N(0,σ 2aGa) and ε~N(0,σ 2εI), where I was the identity matrix and Ga was the additive genomic relationship matrix calculated using the first formula proposed by VanRaden (2008).

We also integrated QTL mapping information into the GBLUP model (Equation 3). This model is expected to be advantageous assuming that height is controlled by relatively few genes of large effect and many genes of smaller effect (Sarinelli et al., 2019). Initially, we performed a single HKLOD functional mapping scan to identify significant cofactor SNPs at α=0.05 based on the 1000 permutations in each training set (148 individuals). Then, these cofactor SNPs, if any were detected, were included as fixed-effect covariates into the GBLUP model. For this analysis, we only used the systematic R1 and R5 scenarios, as well as the full set of 26 TPs (R9; Table 1).

Predictive ability (r) was calculated as the Pearson correlation between adjusted values and genomic estimated breeding values in 50 replications from independent validation scenarios (Albrecht et al., 2014), randomly sampling 75% of the genotypes (n=148) to form a training set, while the remaining 25% (n=49) were used as a validation set. We applied Fisher’s Z transformation of the predictive abilities and compared them among scenarios using Scott–Knott’s test (Scott and Knott, 1974) at α=0.05. All prediction analyses were performed using the Bayesian Generalized Linear Regression (BGLR) R package (Perez and de los Campos, 2014), using 60 000 Markov Chain Monte Carlo (MCMC) iterations, with 15 000 iterations for burn-in, and keeping only one from every five consecutive iterations to minimize auto-correlation. In addition to predicted phenotypes, we extracted genomic estimates of the additive genetic (σ 2a=Vga) and residual (σ 2ε=Vgε) variances, enabling the calculation of genomic heritability as h2g=σ 2a/(σ 2a+σ 2ε).

Results

Genetic variances and temporal covariances in height increased over time

We generated 3D height images (Fig. 1A) for each plot under the robotic Field Scanalyzer over the entire crop growth cycle. The mean broad-sense heritability across all TPs was 0.68 for the 2017 data. As expected, both variances and covariances among TPs increased over the course of the growing season (Fig. 1B–D). In the 2016 data set, the mean broad-sense heritability was 0.57 (Supplementary Fig. S5A, B). However, due to the low quality of the 2016 data set, particularly during the late stages of the growing season (see the Materials and methods), most downstream analyses were limited to data from 2017.

Fig. 1.

Fig. 1.

Temporal pattern of canopy height growth in a wheat recombinant mapping population measured from a robotic Field Scanalyzer in 2017. (A) 3D canopy images at 23, 70, 181, and 228 days after sowing (DAS) from one field plot. (B) Density distributions of the adjusted phenotypic values (best linear unbiased predictions, BLUPs) within time points (TPs) over the course of the growing season. (C) Adjusted height values of 197 recombinant inbred line genotypes, and both parents Chinese Spring (CS) and Paragon (PAR) from 26 TPs. The dotted line shows the estimated cubic spline curve across all genotypes. The mean broad-sense heritability across all TPs was 0.68. The growth stages delineated by dashed lines are tillering (23–128 TP), stem elongation (155–209 TP), and heading/flowering (214–228 TP). (D) Temporal genetic variance–covariance structure (VCOV) surface plot. (E) B-spline smoothed and (F) dimension-reduced [first (PC1) and second (PC2) functional principal components] height phenotypes based on 26 TPs. Percentages in parentheses represent the variance explained by each of the two PCs. (This figure is available in colour at JXB online.)

To enable comparisons among different TP selection strategies (Table 1), we used B-spline smoothing followed by functional PCA of the adjusted values for each scenario. The resulting temporal pattern was consistent among scenarios, initially following an accelerating and then a linear trend, with distinct, genotype-specific trajectories, including for the two RIL parents (i.e. significant genotype×TP interaction effects, P<0.01) (Fig. 1C, E; Supplementary Fig. S6A, B). The proportion of variance explained by the first two PCs was high (93–96%) and increased slightly with the number of TPs analysed (Fig. 1F; Supplementary Fig. S6C, D).

Individual time point analyses detected multiple QTLs across all developmental stages

From the IM and CIM scans of the 2017 data (i.e. analysing data from each TP separately), we detected a total of six QTLs (Fig. 2; Supplementary Figs S7, S8A). Three of these (i.e. on chromosomes 2A, 3A, and 6A) were associated with height only during the tillering growth stage, whereas one (on chromosome 7A) was detected exclusively during late stages. In contrast, the QTL on chromosome 2B was detected in both the tillering and stem elongation (mid) stages, whereas the QTL on chromosome 5A was, surprisingly, detected in the tillering and heading stages, but not the intervening stem elongation stage.

Fig. 2.

Fig. 2.

Summary of individual time point (TP) and functional QTL mapping results in 2017. Growth stages are defined as in Fig. 1. Chr, chromosome on which a QTL was detected; FFE, flag leaf fully emerged; FSO, flag leaf sheath opening; SV, first spikelet visible; SE, spike 100% emerged; AC, anthesis completed. (This figure is available in colour at JXB online.)

Based on the 2016 data, we detected one QTL on chromosome 2B across multiple TPs, whereas another QTL on chromosome 6A was detected in a single TP, in a different growth stage from 2017 (Supplementary Figs S8B, S9A). Furthermore, analyses of the manual plant height data from 2016 resulted in the detection of two additional late-stage QTLs on chromosomes 1A and 2D (Supplementary Fig. S9B–E).

Additional QTLs could be detected using functional mapping

Using single functional mapping at five, 10, and 26 TPs based on the multivariate LOD test statistic (HKLOD score), we identified two, three, and four QTLs, respectively (Fig. 3A–C). Results from the MLOD and SLOD approaches were largely consistent with this (Fig. 3D–I) as were results from multiple functional mapping analyses (Fig. 3J–R). Most importantly, we detected an additional QTL on chromosome 5B that was consistently identified based on single and multiple functional QTL mapping (Figs 2; 3C, F, I, K, L), but not in scans of individual TP data (Fig. 2; Supplementary Fig. S7). As with individual TP approaches, analyses of the 2016 data using single QTL mapping only resulted in the detection of the QTL on chromosome 2B (Supplementary Fig. S10A–I).

Fig. 3.

Fig. 3.

Functional QTL mapping analysis for five, 10, and 26 time points (TPs) in 2017. (A–I) Single and (J–R) multiple QTL approach based on multitrait (HKLOD), maximum (MLOD), and average (SLOD) profiles. The 5 and 10 TP results correspond to the systematic R1 and R5 scenarios (see Table 1). Dashed lines indicate the α=0.05 thresholds. Numbers above peaks in the multiple QTL mapping plots represent the positions of the loci on the respective chromosomes (Chr). (This figure is available in colour at JXB online.)

As expected under the presumed genetic model of highly complex temporal dynamics of height growth, both absolute effect sizes and proportions of PVEs varied substantially across the growing season, with clear examples of early (5A, 5B), late (7A), or persistent (2B) QTLs (Fig. 4A–F). Furthermore, despite some variability in the relative performances of HKLOD, MLOD, and SLOD models, power simulations demonstrated a slight advantage of functional mapping approaches (i.e. versus conventional individual TP analyses) for the detection of persistent QTLs (Fig. 5A–C). In contrast, individual TP scans had substantially greater power for detecting transient QTLs (Fig. 5D–F).

Fig. 4.

Fig. 4.

Estimated QTL effect sizes for analyses of five, 10, and 26 time points (TPs) in 2017. (A–C) Estimated absolute QTL effects and (D–F) percentage of phenotypic variance explained (PVE%), by the QTLs in the multiple functional mapping based on the multivariate LOD test statistic (HKLOD score). The black dotted line in (D–F) indicates α=0.01. The 5 and 10 TP results correspond to the systematic R1 and R5 scenarios (see Table 1). DAS, days after sowing. Signs of QTL effects indicate allele substitution effects: Chinese Spring was the donor of height-increasing alleles for the QTLs on chromosomes 2B and 5B, whereas Paragon contributed height-increasing alleles for the QTLs on chromosomes 5A and 7A. (This figure is available in colour at JXB online.)

Fig. 5.

Fig. 5.

Statistical power for analyses of five, 10, and 26 time points (TPs) in 2017. Power for persistent (A–C) and transient (D–F) QTLs was calculated at α=0.05 based on data perturbation simulations, and functional mapping multitrait (HKLOD), maximum (MLOD), and average (SLOD) profiles compared with interval mapping (IM) analyses. The 5 and 10 TP results correspond to the systematic R1 and R5 scenarios (see Table 1). (This figure is available in colour at JXB online.)

Individual time point genomic heritability and predictive ability were stable across the growing season

Genetic and residual variance components from the GBLUP model generally followed the same temporal patterns as those from the FA model, with increasing values after the tillering growth stage (Fig. 6A). Consequently, the time trend of genomic heritability resembled that of broad-sense heritability, stabilizing at ~0.25 after the first two TPs (Fig. 6B). As expected, genomic predictive ability followed a similar pattern, fluctuating at ~0.30 and decreasing slightly at the last TP, presumably because of lodging, which affected data quality for some genotypes (Figs 1C, 6C). As expected from the lower quality of the 2016 data set (see the Materials and methods), we observed very low individual TP predictive abilities (ranging from 0 to 0.1; results not shown).

Fig. 6.

Fig. 6.

Genomic prediction of canopy height across 26 individual time points (TPs). (A) Variance estimates for each TP from the phenotypic (factor analytic, FA) and genomic (genomic best linear unbiased predictor, GBLUP) model. Growth stages are defined as in Fig. 1. (B) Broad-sense (H2) and genomic (h2g) heritability based on the phenotypic FA and GBLUP models, respectively. Error bars represent SEs. (C) Predictive ability (r) based on the GBLUP model. DAS, days after sowing. (This figure is available in colour at JXB online.)

Smoothing height phenotypes reduces noise and increases predictive ability

The predictive ability for B-spline coefficients of phenotypes that had been smoothed was generally higher than that for individual TP or PC data (Fig. 7). Based on our evaluation of different TP selection strategies, there was a clear advantage (i.e. from ~10% to >3-fold) of using phenological information compared with systematically interspersed sampling, particularly when a small number of TPs were evaluated (Fig. 7B; Table 1). Interestingly, increasing the number of TPs analysed beyond five did not improve predictive abilities (Fig. 7B–D). Finally, including functional QTLs detected in the training population as fixed-effect covariates in the GBLUP model tended to increase predictive ability slightly, particularly for PC phenotypes (Fig. 8).

Fig. 7.

Fig. 7.

Functional genomic prediction and assessment of time point (TP) selection strategies. (A) Predictive abilities for the ‘best’ (i.e. having the highest predictive ability, DAS=223) and last (DAS=228) individual TPs. (B–D) Predictive abilities for dimension-reduced phenotypes (first principal component, PC1), and the last two B-spline basis coefficients (BS) for five, 10, and 26 TPs. TP selection strategies (B, C) followed a systematic approach (SY) or were informed by phenological growth stages (GS) as shown in Table 1. Numbers inside plots and error bars represent the mean and SE from 50 random cross-validations. Letters above bars indicate significant differences at α=0.05 across all methods. DAS, days after sowing. (This figure is available in colour at JXB online.)

Fig. 8.

Fig. 8.

Genomic prediction including functional QTLs as fixed-effect covariates. (A–D) Predictive ability with and without QTLs detected in the training population as fixed-effect covariates in the GBLUP model. (A) Results for the ‘best’ (i.e. having the highest predictive ability, DAS=223) and last (DAS=228) individual time points (TPs). (B–D) Results for dimension-reduced (first principal component, PC1), and the last two B-spline coefficients (BS) for five, 10, and 26 TPs. The 5 and 10 TP results are for the systematic R1 and R5 scenarios (see Table 1). Numbers inside plots and error bars represent the mean and SE from 50 random cross-validations. Letters above bars indicate significant differences at α=0.05 across all methods. DAS, days after sowing. (This figure is available in colour at JXB online.)

Discussion

Genetic variances and temporal covariances in height increased over time

Previous longitudinal studies of plant height suggested that both genetic and environmental variation tend to increase during the growing season, presumably because of the cumulative effects of large numbers of interacting genetic loci and environmental cues (Wang et al., 2010; Wu et al., 2010; Holman et al., 2016; Davey et al., 2017). Consistent with this expectation, we observed higher genetic variability in later developmental stages and increasing temporal genetic covariance, but also substantially higher residual (i.e. presumably, to a large extent, environmental) variation during the second half of the growing season (Figs 1C–E; 6A). Consequently, both broad-sense heritabilities increased steeply during the early part of the tillering phase and then fluctuated considerably during the stem elongation and flowering phases (Fig. 6B). Similar patterns of highly variable heritability during crop growth have been reported across a range of complex traits by high-throughput phenotyping studies in wheat (Madec et al., 2017), rice (Campbell et al., 2017), barley (Chen et al., 2014), and Setaria (Feldman et al., 2017). Thus, the distribution of sampled TPs across developmental and/or phenological stages is an important practical consideration.

Individual time point analyses detected multiple QTLs across all developmental stages

All of the QTLs detected in our study (Figs 2–4) are on chromosomes for which QTLs have been reported previously (Wu et al., 2010; Griffiths et al., 2012; Bentley et al., 2014; Zanke et al., 2014; Gao et al., 2015; Wurschum et al., 2015; Tian et al., 2017; Guan et al., 2018). For example, Griffiths et al. (2012) detected meta-QTLs on chromosomes 2A, 2B, 3A, 5A, and 6A based on analyses of four mapping populations. Similarly, Gao et al. (2015) identified QTLs on chromosomes 2B, 5A, and 7A in a set of RILs from a cross between Zhou8425B and CS (i.e. one of the parents used in our study). Thus, it seems plausible that some of the QTLs detected in our analyses are caused by variation in well-known Rht dwarfing genes (Ellis et al., 2005; Wu et al., 2010). However, we did not detect any strong signals on homoeologous chromosome group 4, presumably because the well-characterized Rht-B1 and Rht-D1 loci (Wurschum et al., 2017, 2018) do not segregate in the cross we studied (Gao et al., 2015; Kowalski et al., 2016).

Interestingly, our individual TP analyses resulted in the detection of putatively transient QTLs, some of which were not detectable using functional mapping approaches (Fig. 2). This finding is consistent with a number of recent studies that have reported temporally dynamic QTL expression in wheat (Zhang et al., 2017; Camargo et al., 2018) and other crops (Busemeyer et al., 2013; Al-Tamimi et al., 2016; Campbell et al., 2017; Feldman et al., 2017; Muraya et al., 2017; Ward et al., 2019). However, because transient QTLs are inherently more difficult to detect (i.e. due to their typically smaller effect sizes and narrow windows of expression) and are more likely to be false positives, their overall contribution to the genetic architecture of complex traits is currently unclear. Addressing this question would require using much larger mapping populations and possibly analyses of more TPs. Technological advances should make considerable upscaling possible in the near future (Roitsch et al., 2019).

Functional analyses increased QTL detection power and genomic predictive ability

Our functional mapping analyses resulted in the detection of an additional QTL on chromosome 5B (Figs 2–4), which appeared to affect plant height growth primarily during the tillering stage (Fig. 4E, F). A QTL has previously been reported on the same chromosome, also with effects predominantly in early phenological stages (Wang et al., 2010; Wu et al., 2010).

One of the motivating arguments for the development of functional mapping approaches was the premise that these approaches would have greater statistical power than conventional QTL mapping of data from individual TPs (Ma et al., 2002; Wu et al., 2004; Wu and Lin, 2006). Both our empirical results (Fig. 2) and power simulations (Fig. 5A–C) were consistent with this expectation, but only for persistent QTLs. In contrast, individual TP analyses resulted in the detection of higher numbers of transient or stage-specific QTLs (Fig. 2), which was also consistent with our power simulations for transient QTLs (Fig. 5D–F). Therefore, combining both individual TP and functional mapping approaches may be preferable as these approaches are able to capture different types of QTLs (Bac-Molenaar et al., 2015; Campbell et al., 2017; Muraya et al., 2017; Camargo et al., 2018).

The HKLOD method often outperformed the MLOD and SLOD approaches in analyses of both empirical (Fig. 3) and simulation data (Fig. 5). However, this pattern was far from consistent, in agreement with previous findings about the contrasting characteristics of these approaches Kwak et al. (2014, 2016). Given their relatively low computational cost, we therefore recommend using all three functional mapping methods.

Unlike the highly variable conventional estimates of heritability (i.e. based exclusively on phenotypic data, Fig. 6A), genomic heritability and predictive ability were relatively stable throughout the growing season, particularly after the early part of the tillering stage (Fig. 6B, C). Thus, genomic prediction based on data from a single, appropriately chosen TP would be reasonably accurate. However, we identified three factors that can further increase predictive ability. First, predictions of smoothed height phenotypes (i.e. multiple TPs) were >20% more accurate than those of phenotypes from individual TPs (Fig. 7), presumably as a result of the reduction of measurement error (Kwak et al., 2016). Consistent with this explanation, genomic heritabilities of the smoothed data were on average 7% higher than those of individual TPs (Table 1 versus Fig. 6B). Second, we observed a slight increase in predictive ability when including significant QTLs as fixed-effect covariates in the GBLUP model (Fig. 8). This is consistent with several previous empirical and simulation studies (Bernardo, 2014; Arruda et al., 2016; Spindel et al., 2016; Bian and Holland, 2017; McElroy et al., 2018; Sarinelli et al., 2019). Finally, as expected based on general knowledge about wheat growth and development (Fischer, 2011; Fischer and Rebetzke, 2018; Dreccer et al., 2019), predictive abilities were up to three times higher when TPs were chosen based on phenological growth stages compared with a systematically interspersed selection of TPs throughout the growing season (Fig. 7). Furthermore, as in previous studies (Camargo et al., 2016; Rutkoski et al., 2016), our findings emphasized the importance of sampling during the reproductive crop stage (i.e. heading and flowering), presumably because of the higher genetic variance captured during that stage (Table 1; Figs 6, 7).

Recommendations

The crux of our findings is the existence of a trade-off between the extraction of data for large numbers of TPs (i.e. to detect as many transient or stage-specific QTLs as possible) and the additional computational cost (mostly image processing but also downstream analyses) that may not necessarily translate into improved genomic prediction or more complete understanding of trait architecture. Based on our results, we formulated the following two practical recommendations, which are naturally only applicable in situations similar to our study. First, when the emphasis is on dissecting trait architecture, we recommend using both individual TP and functional mapping approaches to maximize the overall number and types of QTLs detected. Secondly, for more applied purposes (e.g. in breeding), analyses of spline-smoothed phenotypes selected based on phenological information may be sufficient to maximize genomic predictive ability. Further slight improvements may be achieved through the inclusion of QTLs as fixed-effect covariates, though it is not currently clear if the additional complexity (and consequently delay) associated with QTL detection is justified.

Supplementary data

Supplementary data are available at JXB online.

Fig. S1. Illustration of steps for processing of point clouds.

Fig. S2. Manual ground height versus height from the 3D point cloud.

Fig. S3. Heatmap of genetic correlations of height between 26 TPs in 2017.

Fig. S4. Genetic map consisting of 2330 SNPs.

Fig. S5. Temporal pattern of canopy height growth in 2016.

Fig. S6. Smoothed and dimension-reduced height in 2017.

Fig. S7. Interval and composite QTL mapping at 26 TPs in 2017.

Fig. S8. Signed LOD scores for each time point in 2017 and 2016.

Fig. S9. Summary of individual TP mapping results in 2016.

Fig. S10. Functional QTL mapping analysis for five, 10, and 22 TPs in 2016.

Table S1. Likelihood ratio and Wald tests at 26 TPs in 2017.

erz545_suppl_Supplementary_Figures_S1-S10_and_Table_S1

Acknowledgements

Rothamsted Research receives support from the Biotechnology and Biological Sciences Research Council (BBSRC) of the UK as part of the Designing Future Wheat Institute Strategic Programme (BB/P016855/1). The recombinant inbred line population was developed under the Wheat Genetic Improvement Network (WGIN; http://www.wgin.org.uk/). Seeds are available on request at Germplasm Resources Unit SeedStor (https://www.seedstor.ac.uk/search-browsecollections.php). The genotypic information is available through the web site ‘CerealsDB’ (Wilkinson et al., 2016). We thank the Editor and two anonymous reviewers for their comments on an earlier version of this manuscript. The authors declare no conflict of interest.

Glossary

Abbreviations

BLUP

best linear unbiased prediction

DAS

days after sowing

GBLUP

genomic best linear unbiased prediction

HKLOD

multitrait method score

HTPP

high-throughput plant phenotyping

MLOD

maximum score

PC

principal component

QTL

quantitative trait locus

SLOD

average score

SNP

single nucleotide polymorphism

TP

time point.

Data availability

The phenotypic and genotypic data, as well as the R scripts used for all analyses in this study, can be found at Mendeley (Lyra, 2019; https://data.mendeley.com/datasets/pkxpkw6j43/2).

References

  1. Aguate FM, Trachsel S, Perez LG, Burgueno J, Crossa J, Balzarini M, Gouache D, Bogard M, de los Campos G. 2017. Use of hyperspectral image data outperforms vegetation indices in prediction of maize yield. Crop Science 57, 2517–2524. [Google Scholar]
  2. Albrecht T, Auinger HJ, Wimmer V, Ogutu JO, Knaak C, Ouzunova M, Piepho HP, Schon CC. 2014. Genome-based prediction of maize hybrid performance across genetic groups, testers, locations, and years. Theoretical and Applied Genetics 127, 1375–1386. [DOI] [PubMed] [Google Scholar]
  3. Allen AM, Winfield MO, Burridge AJ, et al. 2017. Characterization of a wheat breeders’ array suitable for high-throughput SNP genotyping of global accessions of hexaploid bread wheat (Triticum aestivum). Plant Biotechnology Journal 15, 390–401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Al-Tamimi N, Brien C, Oakey H, Berger B, Saade S, Ho YS, Schmockel SM, Tester M, Negrao S. 2016. Salinity tolerance loci revealed in rice using high-throughput non-invasive phenotyping. Nature Communications 7, 13342. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Araus JL, Cairns JE. 2014. Field high-throughput phenotyping: the new crop breeding frontier. Trends in Plant Science 19, 52–61. [DOI] [PubMed] [Google Scholar]
  6. Araus JL, Kefauver SC. 2018. Breeding to adapt agriculture to climate change: affordable phenotyping solutions. Current Opinion in Plant Biology 45, 237–247. [DOI] [PubMed] [Google Scholar]
  7. Araus JL, Kefauver SC, Zaman-Allah M, Olsen MS, Cairns JE. 2018. Translating high-throughput phenotyping into genetic gain. Trends in Plant Science 23, 451–466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Arruda MP, Lipka AE, Brown PJ, Krill AM, Thurber C, Brown-Guedira G, Dong Y, Foresman BJ, Kolb FL. 2016. Comparing genomic selection and marker-assisted selection for Fusarium head blight resistance in wheat (Triticum aestivum L.). Molecular Breeding 36, 84. [Google Scholar]
  9. Bac-Molenaar JA, Vreugdenhil D, Granier C, Keurentjes JJB. 2015. Genome-wide association mapping of growth dynamics detects time-specific and general quantitative trait loci. Journal of Experimental Botany 66, 5567–5580. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Bentley AR, Scutari M, Gosman N, et al. 2014. Applying association mapping and genomic selection to the dissection of key traits in elite European wheat. Theoretical and Applied Genetics 127, 2619–2633. [DOI] [PubMed] [Google Scholar]
  11. Bernardo R. 2014. Genomewide selection when major genes are known. Crop Science 54, 68–75. [Google Scholar]
  12. Bernardo R. 2016. Bandwagons I, too, have known. Theoretical and Applied Genetics 129, 2323–2332. [DOI] [PubMed] [Google Scholar]
  13. Bian Y, Holland JB. 2017. Enhancing genomic prediction with genome-wide association studies in multiparental maize populations. Heredity 118, 585–593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Brien C. 2019. asremlPlus: augments the use of ‘ASReml-R’ in fitting mixed models. R package version 4.1-22.https://cran.r-project.org/web/packages/asremlPlus/index.html.
  15. Broman KW, Speed TP. 2002. A model selection approach for the identification of quantitative trait loci in experimental crosses. Journal of the Royal Statistical Society Series B-Statistical Methodology 64, 641–656. [Google Scholar]
  16. Broman KW, Wu H, Sen S, Churchill GA. 2003. R/qtl: QTL mapping in experimental crosses. Bioinformatics 19, 889–890. [DOI] [PubMed] [Google Scholar]
  17. Browning BL, Browning SR. 2016. Genotype imputation with millions of reference samples. American Journal of Human Genetics 98, 116–126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Busemeyer L, Ruckelshausen A, Moller K, Melchinger AE, Alheit KV, Maurer HP, Hahn V, Weissmann EA, Reif JC, Wurschum T. 2013. Precision phenotyping of biomass accumulation in triticale reveals temporal genetic patterns of regulation. Scientific Reports 3, 2442. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Butler DG, Cullis BR, Gilmour AR, Gogel BJ. 2009. ASReml-R reference manual. Queensland, Australia: Department of Primary Industries. [Google Scholar]
  20. Cabrera-Bosquet L, Crossa J, von Zitzewitz J, Serret MD, Araus JL. 2012. High-throughput phenotyping and genomic selection: the frontiers of crop breeding converge. Journal of Integrative Plant Biology 54, 312–320. [DOI] [PubMed] [Google Scholar]
  21. Camargo AV, Mackay I, Mott R, Han J, Doonan JH, Askew K, Corke F, Williams K, Bentley AR. 2018. Functional mapping of quantitative trait loci (QTLs) associated with plant performance in a wheat MAGIC mapping population. Frontiers in Plant Science 9, 887. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Camargo AV, Mott R, Gardner KA, Mackay IJ, Corke F, Doonan JH, Kim JT, Bentley AR. 2016. Determining phenological patterns associated with the onset of senescence in a wheat MAGIC mapping population. Frontiers in Plant Science 7, 1540. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Campbell MT, Du Q, Liu K, Brien CJ, Berger B, Zhang C, Walia H. 2017. A comprehensive image-based phenomic analysis reveals the complex genetic architecture of shoot growth dynamics in rice (Oryza sativa). Plant Genome 10, 1–14. [DOI] [PubMed] [Google Scholar]
  24. Campbell M, Walia H, Morota M. 2018. Utilizing random regression models for genomic prediction of a longitudinal trait derived from high-throughput phenotyping. The Plant Journal 2, 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Chen DJ, Neumann K, Friedel S, Kilian B, Chen M, Altmann T, Klukas C. 2014. Dissecting the phenotypic components of crop plant growth and drought responses based on high-throughput image analysis. The Plant Cell 26, 4636–4655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Crain J, Mondal S, Rutkoski J, Singh RP, Poland J. 2018. Combining high-throughput phenotyping and genomic information to increase prediction and selection accuracy in wheat breeding. Plant Genome 11, 1–14. [DOI] [PubMed] [Google Scholar]
  27. Crossa J, Perez-Rodriguez P, Cuevas J, et al. 2017. Genomic selection in plant breeding: methods, models, and perspectives. Trends in Plant Science 22, 961–975. [DOI] [PubMed] [Google Scholar]
  28. Davey CL, Robson P, Hawkins S, Farrar K, Clifton-Brown JC, Donnison IS, Slavov GT. 2017. Genetic relationships between spring emergence, canopy phenology, and biomass yield increase the accuracy of genomic prediction in Miscanthus. Journal of Experimental Botany 68, 5093–5102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Dreccer MF, Gemma >Molero G, Rivera-Amado C, John-Bejai C, Wilson Z. 2019. Yielding to the image: how phenotyping reproductive growth can assist crop improvement and production. Plant Science 282, 73–82. [DOI] [PubMed] [Google Scholar]
  30. Ellis MH, Rebetzke GJ, Azanza F, Richards RA, Spielmeyer W. 2005. Molecular mapping of gibberellin-responsive dwarfing genes in bread wheat. Theoretical and Applied Genetics 111, 423–430. [DOI] [PubMed] [Google Scholar]
  31. Falconer DS, Mackay TFC. 1995. Introduction to quantitative genetics, 4th edn London: Pearson. [Google Scholar]
  32. Feldman MJ, Paul RE, Banan D, et al. 2017. Time dependent genetic analysis links field and controlled environment phenotypes in the model C4 grass Setaria. PLoS Genetics 13, e1006841. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Fischer RA. 2011. Wheat physiology: a review of recent developments. Crop & Pasture Science 62, 95–114. [Google Scholar]
  34. Fischer RA, Rebetzke GJ. 2018. Indirect selection for potential yield in early-generation, spaced plantings of wheat and other small-grain cereals: a review. Crop & Pasture Science 69, 439–459. [Google Scholar]
  35. Gao F, Wen W, Liu J, Rasheed A, Yin G, Xia X, Wu X, He Z. 2015. Genome-wide linkage mapping of QTL for yield components, plant height and yield-related physiological traits in the Chinese wheat cross Zhou 8425B/Chinese Spring. Frontiers in Plant Science 6, 1099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Gilmour AR, Cullis BR, Verbyla AP. 1997. Accounting for natural and extraneous variation in the analysis of field experiments. Journal of Agricultural, Biological, and Environmental Statistics 2, 269–293. [Google Scholar]
  37. Griffiths S, Simmonds J, Leverington M, Wang YK, Fish L, Sayers L, Alibert L, Orford S, Wingen L, Snape J. 2012. Meta-QTL analysis of the genetic control of crop height in elite European winter wheat germplasm. Molecular Breeding 29, 159–171. [DOI] [PubMed] [Google Scholar]
  38. Guan P, Lu L, Jia L, et al. 2018. Global QTL analysis identifies genomic regions on chromosomes 4A and 4B harboring stable loci for yield-related traits across different environments in wheat (Triticum aestivum L.). Frontiers in Plant Science 9, 529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Haghighattalab A, Perez LG, Mondal S, Singh D, Schinstock D, Rutkoski J, Ortiz-Monasterio I, Singh RP, Goodin D, Poland J. 2016. Application of unmanned aerial systems for high throughput phenotyping of large wheat breeding nurseries. Plant Methods 12, 35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Haley CS, Knott SA. 1992. A simple regression method for mapping quantitative trait loci in line crosses using flanking markers. Heredity 69, 315–324. [DOI] [PubMed] [Google Scholar]
  41. Holman FH, Riche AB, Michalski A, Castle M, Wooster MJ, Hawkesford MJ. 2016. High throughput field phenotyping of wheat plant height and growth rate in field plot trials using UAV based remote sensing. Remote Sensing 8, 1031. [Google Scholar]
  42. Jiang LB, Clavijo JA, Sun LD, Zhu XL, Bhakta MS, Gezan SA, Carvalho M, Vallejos CE, Wu RL. 2015. Plastic expression of heterochrony quantitative trait loci (hQTLs) for leaf growth in the common bean (Phaseolus vulgaris). New Phytologist 207, 872–882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Juliana P, Montesinos-Lopez OA, Crossa J, et al. 2019. Integrating genomic-enabled prediction and high-throughput phenotyping in breeding for climate-resilient bread wheat. Theoretical and Applied Genetics 132, 177–194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Kowalski AM, Gooding M, Ferrante A, Slafer GA, Orford S, Gasperini D, Griffiths S. 2016. Agronomic assessment of the wheat semi-dwarfing gene Rht8 in contrasting nitrogen treatments and water regimes. Field Crops Research 191, 150–160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Krause MR, Gonzalez-Perez L, Crossa J, et al. 2019. Hyperspectral reflectance-derived relationship matrices for genomic prediction of grain yield in wheat. G3-Genes Genomes Genetics 9, 1231–1247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Kwak IY, Moore CR, Spalding EP, Broman KW. 2014. A simple regression-based method to map quantitative trait loci underlying function-valued phenotypes. Genetics 197, 1409–U1562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Kwak IY, Moore CR, Spalding EP, Broman KW. 2016. Mapping quantitative trait loci underlying function-valued traits using functional principal component analysis and multi-trait mapping. G3-Genes Genomes Genetics 6, 79–86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Li Q, Huang ZW, Xu M, Wang CG, Gai JY, Huang YJ, Pang XM, Wu RL. 2010. Functional mapping of genotype–environment interactions for soybean growth by a semiparametric approach. Plant Methods 6, 13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Li ZT, Sillanpaa MJ. 2015. Dynamic quantitative trait locus analysis of plant phenomic data. Trends in Plant Science 20, 822–833. [DOI] [PubMed] [Google Scholar]
  50. Lyra DH. 2019. Functional QTL mapping and genomic prediction of 3D height in wheat measured using a robotic field phenotyping platform. Mendeley Data 2, doi: 10.17632/pkxpkw6j43.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Ma CX, Casella G, Wu R. 2002. Functional mapping of quantitative trait loci underlying the character process: a theoretical framework. Genetics 161, 1751–1762. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Madec S, Baret F, de Solan B, Thomas S, Dutartre D, Jezequel S, Hemmerle M, Colombeau G, Comar A. 2017. High-throughput phenotyping of plant height: comparing unmanned aerial vehicles and ground LiDAR estimates. Frontiers in Plant Science 8, 2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Manichaikul A, Moon JY, Sen S, Yandell BS, Broman KW. 2009. A model selection approach for the identification of quantitative trait loci in experimental crosses, allowing epistasis. Genetics 181, 1077–1086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. McElroy MS, Navarro AJR, Mustiga G, et al. 2018. Prediction of cacao (Theobroma cacao) resistance to Moniliophthora spp. diseases via genome-wide association analysis and genomic selection. Frontiers in Plant Science 9, 343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Meyer K. 2009. Factor-analytic models for genotype × environment type problems and structured covariance matrices. Genetics Selection Evolution 41, 21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Montesinos-Lopez A, Montesinos-Lopez OA, Cuevas J, et al. 2017a. Genomic Bayesian functional regression models with interactions for predicting wheat grain yield using hyper-spectral image data. Plant Methods 13, 62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Montesinos-Lopez A, Montesinos-Lopez OA, de Los Campos G, Crossa J, Burgueno J, Luna-Vazquez FJ. 2018. Bayesian functional regression as an alternative statistical analysis of high-throughput phenotyping data of modern agriculture. Plant Methods 14, 46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Montesinos-Lopez OA, Montesinos-Lopez A, Crossa J, de Los Campos G, Alvarado G, Suchismita M, Rutkoski J, Gonzalez-Perez L, Burgueno J. 2017b. Predicting grain yield using canopy hyperspectral reflectance in wheat breeding data. Plant Methods 13, 4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Moore CR, Johnson LS, Kwak IY, Livny M, Broman KW, Spalding EP. 2013. High-throughput computer vision introduces the time axis to a quantitative trait map of a plant growth response. Genetics 195, 1077–1086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Muraya MM, Chu J, Zhao Y, Junker A, Klukas C, Reif JC, Altmann T. 2017. Genetic variation of growth dynamics in maize (Zea mays L.) revealed through automated non-invasive phenotyping. The Plant Journal 89, 366–380. [DOI] [PubMed] [Google Scholar]
  61. Pauli D, Chapman SC, Bart R, Topp CN, Lawrence-Dill CJ, Poland J, Gore MA. 2016. The quest for understanding phenotypic variation via integrated approaches in the field environment. Plant Physiology 172, 622–634. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Perez P, de los Campos G. 2014. Genome-wide regression and prediction with the BGLR statistical package. Genetics 198, 483–495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. R Core Team 2018. R: a language and environment for statistical computing.Vienna, Austria: R Foundation for Statistical Computing. [Google Scholar]
  64. Ramsay JO, Hooker G, Graves S. 2009. Functional data analysis with R and MATLAB. New York: Springer Science & Business Media. [Google Scholar]
  65. Ramsay JO, Wickham H, Graves S, Hooker G. 2018. fda: functional data analysis. R package version 2.4.8.https://cran.r-project.org/web/packages/fda/index.html.
  66. Rebetzke GJ, Jimenez-Berni JA, Bovill WD, Deery DM, James RA. 2016. High-throughput phenotyping technologies allow accurate selection of stay-green. Journal of Experimental Botany 67, 4919–4924. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Rebetzke GJ, Jimenez-Berni J, Fischer RA, Deery DM, Smith DJ. 2019. High-throughput phenotyping to enhance the use of crop genetic resources. Plant Science 282, 40–48. [DOI] [PubMed] [Google Scholar]
  68. Reynolds D, Baret F, Welcker C, et al. 2019. What is cost-efficient phenotyping? Optimizing costs for different scenarios. Plant Science 282, 14–22. [DOI] [PubMed] [Google Scholar]
  69. Roitsch T, Cabrera-Bosquet L, Fournier A, Ghamkhar K, Jiménez-Berni J, Pinto F, Ober ES. 2019. New sensors and data-driven approaches—a path to next generation phenomics. Plant Science 282, 2–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Rutkoski J, Poland J, Mondal S, Autrique E, Perez LG, Crossa J, Reynolds M, Singh R. 2016. Canopy temperature and vegetation indices from high-throughput phenotyping improve accuracy of pedigree and genomic selection for grain yield in wheat. G3-Genes Genomes Genetics 6, 2799–2808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Sadeghi-Tehran P, Sabermanesh K, Virlet N, Hawkesford MJ. 2017. Automated method to determine two critical growth stages of wheat: heading and flowering. Frontiers in Plant Science 8, 252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Sarinelli JM, Murphy JP, Tyagi P, et al. 2019. Training population selection and use of fixed effects to optimize genomic predictions in a historical USA winter wheat panel. Theoretical and Applied Genetics 132, 1247–1261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Scott AJ, Knott M. 1974. Cluster-analysis method for grouping means in analysis of variance. Biometrics 30, 507–512. [Google Scholar]
  74. Slavov GT, Nipper R, Robson P, Farrar K, Allison GG, Bosch M, Clifton-Brown JC, Donnison IS, Jensen E. 2014. Genome-wide association studies and prediction of 17 traits related to phenology, biomass and cell wall composition in the energy grass Miscanthus sinensis. New Phytologist 201, 1227–1239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Spindel JE, Begum H, Akdemir D, Collard B, Redona E, Jannink JL, McCouch S. 2016. Genome-wide prediction models that incorporate de novo GWAS are a powerful new tool for tropical rice improvement. Heredity 116, 395–408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Stinchcombe JR, Function-valued Traits Working Group, Kirkpatrick M. 2012. Genetics and evolution of function-valued traits: understanding environmentally responsive phenotypes. Trends in Ecology & Evolution 27, 637–647. [DOI] [PubMed] [Google Scholar]
  77. Sun J, Poland JA, Mondal S, et al. 2019. High-throughput phenotyping platforms enhance genomic selection for wheat grain yield across populations and cycles in early stage. Theoretical and Applied Genetics 132, 1705–1720. [DOI] [PubMed] [Google Scholar]
  78. Sun J, Rutkoski JE, Poland JA, Crossa J, Jannink JL, Sorrells ME. 2017. Multitrait, random regression, or simple repeatability model in high-throughput phenotyping data improve genomic prediction for wheat grain yield. Plant Genome 10, 1–12. [DOI] [PubMed] [Google Scholar]
  79. Sun LD, Wu RL. 2015. Mapping complex traits as a dynamic system. Physics of Life Reviews 13, 155–185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Tardieu F, Cabrera-Bosquet L, Pridmore T, Bennett M. 2017. Plant phenomics, from sensors to knowledge. Current Biology 27, R770–R783. [DOI] [PubMed] [Google Scholar]
  81. Tian X, Wen W, Xie L, et al. 2017. Molecular mapping of reduced plant height gene Rht24 in bread wheat. Frontiers in Plant Science 8, 1379. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Tottman DR. 1987. The decimal code for the growth-stages of cereals, with illustrations. Annals of Applied Biology 110, 441–454. [Google Scholar]
  83. van Eeuwijk FA, Bustos-Korts D, Millet EJ, et al. 2019. Modelling strategies for assessing and increasing the effectiveness of new phenotyping techniques in plant breeding. Plant Science 282, 23–39. [DOI] [PubMed] [Google Scholar]
  84. VanRaden PM. 2008. Efficient methods to compute genomic predictions. Journal of Dairy Science 91, 4414–4423. [DOI] [PubMed] [Google Scholar]
  85. Venables WN, Ripley BD. 2000. S programming. New York: Springer-Verlag. [Google Scholar]
  86. Virlet N, Sabermanesh K, Sadeghi-Tehran P, Hawkesford MJ. 2017. Field Scanalyzer: an automated robotic field phenotyping platform for detailed crop monitoring. Functional Plant Biology 44, 143–153. [DOI] [PubMed] [Google Scholar]
  87. Wallace JG, Rodgers-Melnick E, Buckler ES. 2018. On the road to breeding 4.0: unraveling the good, the bad, and the boring of crop quantitative genomics. Annual Review of Genetics 52, 421–444. [DOI] [PubMed] [Google Scholar]
  88. Wang N, Tian H, Li Y, Wu R, Luo J, Wang Z. 2017. Fast computation of significance threshold in QTL mapping of dynamic quantitative traits. Journal of Biometrics & Biostatistics 8, 329. [Google Scholar]
  89. Wang Z, Wu X, Ren Q, Chang X, Li R, Jing R. 2010. QTL mapping for developmental behavior of plant height in wheat (Triticum aestivum L.). Euphytica 174, 447–458. [Google Scholar]
  90. Ward B, Brien C, Oakey H, et al. 2019. High-throughput 3D modelling to dissect the genetic control of leaf elongation in barley (Hordeum vulgare). The Plant Journal 98, 555–570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Watanabe K, Guo W, Arai K, et al. 2017. High-throughput phenotyping of sorghum plant height using an unmanned aerial vehicle and its application to genomic prediction modeling. Frontiers in Plant Science 8, 421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Wilkinson PA, Winfield MO, Barker GL, et al. 2016. CerealsDB 3.0: expansion of resources and data integration. BMC Bioinformatics 17, 256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Wimmer V, Albrecht T, Auinger HJ, Schon CC. 2012. synbreed: a framework for the analysis of genomic prediction data using R. Bioinformatics 28, 2086–2087. [DOI] [PubMed] [Google Scholar]
  94. Wingen LU, West C, Leverington-Waite M, et al. 2017. Wheat landrace genome diversity. Genetics 205, 1657–1676. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Wu RL, Lin M. 2006. Functional mapping—how to map and study the genetic architecture of dynamic complex traits. Nature Reviews. Genetics 7, 229–237. [DOI] [PubMed] [Google Scholar]
  96. Wu RL, Ma CX, Lin M, Casella G. 2004. A general framework for analyzing the genetic architecture of developmental characteristics. Genetics 166, 1541–1551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Wu X, Wang Z, Chang X, Jing R. 2010. Genetic dissection of the developmental behaviours of plant height in wheat under diverse water regimes. Journal of Experimental Botany 61, 2923–2937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Wurschum T, Langer SM, Longin CF. 2015. Genetic control of plant height in European winter wheat cultivars. Theoretical and Applied Genetics 128, 865–874. [DOI] [PubMed] [Google Scholar]
  99. Wurschum T, Langer SM, Longin CFH, Tucker MR, Leiser WL. 2017. A modern Green Revolution gene for reduced height in wheat. The Plant Journal 92, 892–903. [DOI] [PubMed] [Google Scholar]
  100. Wurschum T, Liu G, Boeven PHG, Longin CFH, Mirdita V, Kazman E, Zhao Y, Reif JC. 2018. Exploiting the Rht portfolio for hybrid wheat breeding. Theoretical and Applied Genetics 131, 1433–1442. [DOI] [PubMed] [Google Scholar]
  101. Xu M, Jiang L, Zhu S, et al. 2016. A computational framework for mapping the timing of vegetative phase change. New Phytologist 211, 750–760. [DOI] [PubMed] [Google Scholar]
  102. Yu J, Pressoir G, Briggs WH, et al. 2006. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nature Genetics 38, 203–208. [DOI] [PubMed] [Google Scholar]
  103. Zadoks JC. 1974. A decimal code for the growth stages of cereals. Weed Research 14, 415–421. [Google Scholar]
  104. Zanke CD, Ling J, Plieske J, et al. 2014. Whole genome association mapping of plant height in winter wheat (Triticum aestivum L.). PLoS One 9, e113287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Zhang N, Fan X, Cui F, et al. 2017. Characterization of the temporal and spatial expression of wheat (Triticum aestivum L.) plant height at the QTL level and their influence on yield-related traits. Theoretical and Applied Genetics 130, 1235–1252. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

erz545_suppl_Supplementary_Figures_S1-S10_and_Table_S1

Data Availability Statement

The phenotypic and genotypic data, as well as the R scripts used for all analyses in this study, can be found at Mendeley (Lyra, 2019; https://data.mendeley.com/datasets/pkxpkw6j43/2).


Articles from Journal of Experimental Botany are provided here courtesy of Oxford University Press

RESOURCES