Crop yields must increase in order to meet food security goals. Estimating crop yield-related traits accurately and rapidly is therefore of utmost importance, with applications in breeding, precision farming, and yield monitoring. Hight-throughput phenotyping (HTPP) refers to the use of non-invasive techniques to measure physiological and agronomical traits of crop plants. Optical remote-sensing techniques are widely used in HTPP. They encompass data captured from areas of the visible (400–700 nm, captured using red green blue sensors), near infrared (700–1350 nm), red edge (680–730 nm), and shortwave infrared (1350–2500 nm) regions of the spectrum (Gamon et al. 2019). This spectral information has numerous applications within crop monitoring direct prediction of a trait of interest such as pigment content or biomass area, or through using spectral indices, otherwise known as vegetative indices, which correlate spectral reflectance with physiological traits such as green area, portrayed by the normalized difference vegetation index (Fu et al. 2020; Tattaris et al. 2016).
One of the most important components determining crop yield is the harvest index (HI), the ratio between harvestable yield and above ground biomass. However, relative to other yield-related traits, HI is poorly understood. This is partly a result of the fact that HI is traditionally measured via destructive field sampling, which is both time consuming and labor intensive. Within many of the staple crops, such as wheat and rice, HI is approaching the theoretical maximum value. However, rapid screening of HI is valuable within breeding programs and for the temporal evaluation of growth status.
As the name suggests, remote sensing via optical or other techniques permits data to be collected from a distance, without the need for physical interaction with a plant. These sensors may be fixed or in motion. For example, remote sensing from satellite data has many uses within agricultural research; however, in some instances, application is limited by low resolution and fixed measurement times. Therefore, an alternative approach for field data collection is through the use of unmanned aerial vehicles (UAVs). UAVs are able to provide high-resolution spatial and temporal data for a relatively low cost. So far, UAV-based data have been used to estimate crop traits, including seedling emergence, plant height, leaf area index, above-ground biomass, and yield (Tsouros et al. 2019).
Following collection, data analysis can be supported by a variety of techniques, including machine learning approaches. Ensemble learning (EL) refers to a subset of machine learning whereby multiple learning algorithms are combined within a single framework to obtain higher predictive performance. EL methods are able to overcome problems associated with small training sets, such as overfitting, because outputs of independent base models are integrated through secondary learning methods (Zhang et al. 2019). This integration can be facilitated by Bayesian model averaging, in which the posterior probability of each of the basic models are taken as weights for the secondary learning step. As such, Bayesian model averaging is able to overcome uncertainty in the modeling process, leading to a higher estimation accuracy and so has been widely applied for many fields (Shu et al. 2022).
Within this issue of Plant Physiology, Ji et al. (2023) used multi-spectral data captured from a UAV combined with machine learning to calculate the HI of pea (Pisum sativum L.) and faba bean (Vicia faba L.) at 4 different growth stages. Ji et al. (2023) combined data collection from red green blue sensors, multispectral and thermal infrared sensors, as well as 4 different machine learning models and an integrated EL model based on MBA (Fig. 1). Combining multi-modal data provided a better overall estimate of HI compared to any single sensor with improvements in model fitting (i.e. R2) of up to 18% and 31% for bean and pea, respectively. Similarly to spectral information, the combined EL model provided the best and most stable estimation of HI, and combining information from all 4 growth stages provided a better estimate compared to any single growth stage.
Figure 1.
Overview of the remote sensing approach designed by Ji et al. (2023) for the estimation of the harvest index (HI) of field grown fava bean and pea. Optical data were collected using a UAV from red green blue (RGB), multispectral (MS), and thermal infrared (TIR) sensors. Data were analyzed using a variety of machine learning approaches singularly and in combination using ensemble learning (EL) and Bayesian model averaging. The most accurate estimation of HI was obtained using multi-modal data combined with the EL model.
Together, the results of Ji et al. (2023) indicate the power of combining multi-sensor and multi-model information for the prediction of plant physiological traits. The improved HI prediction power reflects the results of previous studies that demonstrate how multi-sensor information can improve crop trait estimation (Feng et al. 2020; Li et al. 2022). However, species-specific differences were seen in the estimation accuracy of HI, with pea achieving more accurate predictions relative to bean. This improved HI estimation in pea was predicted to be a result of reduced canopy cover and thus reduced saturation of the optical sensors. In some instances, combining information from 2 sensors yielded greater predictive power than 3, indicating possible data redundancy. Furthermore, in some cases, the base machine learning models performed better than the EL approach. Thus the optimal combination of sensors and models is likely to be species, growth stage, and trait specific.
Despite the recent rise in machine learning approaches for plant science disciplines, its application for crop yield prediction is not yet viable for wide-scale use. Nevertheless, although improvements can still be made for the estimation of HI, the study of Ji et al. (2023) presents a further step in the generation of HTPP for monitoring crop performance. Combining low-cost sensor information and machine learning permits robust and accurate measurements of physiological traits, ultimately providing an acceleration in the breeding process. Future improvements will require large datasets encompassing multiple genotypes across a variety of sites and assessment of the redundancy between multi-sensor data.
Acknowledgments
AB wishes to thank Plant Physiology for the opportunity to act as an Assistant Features Editor.
References
- Feng A, Zhou J, Vories E, Sudduth K, Zhang M. Yield estimation in cotton using UAV-based multi-sensor imagery. Biosyst Eng. 2020:193:101–114. 10.1016/j.biosystemseng.2020.02.014 [DOI] [Google Scholar]
- Fu P, Meacham-Hensold K, Guan K, Wu J, Bernacchi C. Estimating photosynthetic traits from reflectance spectra: a synthesis of spectral indices, numerical inversion, and partial least square regression. Plant Cell Environ. 2020:43(5):1241–1258. 10.1111/pce.13718 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gamon J, Somers B, Malenovský Z, Middleton E, Rascher U, Schaepman M. Assessing vegetation function with imaging spectroscopy. Surv Geophys. 2019:40(3):489–513. 10.1007/s10712-019-09511-5 [DOI] [Google Scholar]
- Ji Y, Liu Z, Cui Y, Liu R, Chen Z, Zong X, Yang T. Faba bean and pea harvest index estimations using aerial-based multimodal data and machine learning algorithms. Plant Physiol. 2024:194(3):1512–1526. 10.1093/plphys/kiad577 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li J, Schachtman D, Creech C, Wang L, Ge Y, Shi Y. Evaluation of UAV-derived multimodal remote sensing data for biomass prediction and drought tolerance assessment in bioenergy sorghum. Crop J. 2022:10(5):1363–1375. 10.1016/j.cj.2022.04.005 [DOI] [Google Scholar]
- Shu M, Fei S, Zhang B, Yang X, Guo Y, Li B, Ma Y. Application of UAV multisensor data and ensemble approach for high-throughput estimation of maize phenotyping traits. Plant Phenomics. 2022:2022:9802585. 10.34133/2022/9802585 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tattaris M, Reynolds M, Chapman S. A direct comparison of remote sensing approaches for high-throughput phenotyping in plant breeding. Front Plant Sci. 2016:7:1131. 10.3389/fpls.2016.01131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tsouros D, Bibi S, Sarigiannidis P. A review on UAV-based applications for precision agriculture. Information. 2019:10(11):349. 10.3390/info10110349 [DOI] [Google Scholar]
- Zhang Z, Pasolli E, Crawford M. An adaptive multiview active learning approach for spectral–spatial classification of hyperspectral images. IEEE Trans Geosci Remote Sens. 2019:58(4):2557–2570. [Google Scholar]

