Skip to main content
Plant Phenomics logoLink to Plant Phenomics
. 2025 Jul 15;7(3):100089. doi: 10.1016/j.plaphe.2025.100089

Predictive modeling, pattern recognition, and spatiotemporal representations of plant growth in simulated and controlled environments: A comprehensive review

Mohamed Debbagh 1,, Shangpeng Sun 1, Mark Lefsrud 1
PMCID: PMC12710002  PMID: 41416188

Abstract

Accurate predictions and representations of plant growth patterns in simulated and controlled environments are important for addressing various challenges in plant phenomics research. This review explores various works on state-of-the-art predictive pattern recognition techniques, focusing on the spatiotemporal modeling of plant traits and the integration of dynamic environmental interactions. We provide a comprehensive examination of deterministic, probabilistic, and generative modeling approaches, emphasizing their applications in high-throughput phenotyping and simulation-based plant growth forecasting. Key topics include regressions and neural network-based representation models for the task of forecasting, limitations of existing experiment-based deterministic approaches, and the need for dynamic frameworks that incorporate uncertainty and evolving environmental feedback. This review surveys advances in 2D and 3D structured data representations through functional-structural plant models and conditional generative models. We offer a perspective on opportunities for future works, emphasizing the integration of domain-specific knowledge to data-driven methods, improvements to available datasets, and the implementation of these techniques toward real-world applications.

Keywords: Pattern recognition, Spatiotemporal plant growth modelling, High-throughput plant phenotyping, Generative model, Plant growth forecasting

1. Introduction

With the growing prevalence of challenges in global food systems, including restricted access to fresh produce in remote areas and issues faced within traditional cultivation practices, outlooks on controlled environment agriculture (CEA) offer prospective solutions through precision methods of localized food production [[1], [2], [3]]. Controlled environments within crop production systems provide opportunities to observe and cultivate plants with diverse characteristics, or phenotypes, making it suitable for specialized applications from trait selection for cultivar development and biopharming [[4], [5], [6], [7], [8]], to food production in extreme and resource scarce environments [[9], [10], [11]]. Subsequently, the ability to accurately predict and represent the phenotypic expression of a plant and its trajectory at various points in its growth cycle demonstrates a fundamental step towards advancing simulation and modeling research within CEA.

Recent developments in computational and algorithmic theories, coupled with innovations in data collection and fusion techniques, have opened new avenues for exploring plant phenotypes and growth patterns within simulated and controlled environments [12]. These environments allow for the precise manipulation of variables such as light, temperature, humidity, and nutrient availability, enabling researchers to study the dynamic interactions that influence plant development. By leveraging techniques in predictive pattern recognition, models can be developed to accurately capture and represent plant growth behaviors.

1.1. Comparing controlled and open field environments

To contextualize the focus of this review, it is necessary to compare the differences in observational parameters between controlled environment and open field systems. Open field environments, as with traditional agricultural cropping systems, are characterized by the exposure to highly variable and often uncontrollable factors including precipitation, solar conditions, climate fluctuations, and biotic stressors such as pest, pathogens, and soil microbe root interactions. These fluctuations introduce noise and confounding variables into experimental data, making it difficult to isolate causal relationships in plant development. Moreover, instrumentation methods for capturing these factors of variation (FOV) are often developed for constrained experiments, where samples are collected manually, and remain difficult to scale for application in traditional agricultural systems. Thus, outlooks within open-field research often highlight the bottlenecks in sensing and data handling which in turn limit the assessment of physiological development and crop forecasting to sparse spatiotemporal samples of FOV [13,14]. However, due to the desire for open environment resilient plant varieties for outdoor cropping systems, trends in field-based phenomics research have primarily focused on phenotype-genotype mapping for applications in breeding programs within open-field environments [[15], [16], [17]].

Controlled environments are typically found within CEA systems such as greenhouses, vertical farms, and growth chambers and are characterized by a confined growing space in which the environmental conditions are known, regulated, or fully controlled. These controlled environment conditions give rise to regularized observations made on plant behavior by constraining some FOVs within known parameterized boundaries. While not all environmental conditions can be fully isolated, key factors such as light (intensity, duration, spectrum), air and root-zone temperature, humidity, CO2 levels, and nutrient delivery are routinely monitored and controlled [18]. Controlled environments lend themselves well to modeling systems as instrumentation and control systems are easily integrated and necessary to maintain optimal conditions for plant growth. These measurements can then be parameterized to develop advanced representation models for predictive forecasting and simulation.

1.2. Defining simulated environments and plant growth

The phenotypic outcomes of a plant are influenced by high-dimensional interactions between genotype, environment, and management (G ​× ​E ​× ​M), often resulting in complex non-linear responses [[19], [20], [21]]. Subsequently, the objective of controlled environment systems is to constrain and reduce the influence of the environmental dimension by maintaining known optimal conditions of various crop varieties. However, a second useful objective of these systems becomes apparent as advances in automated sensing and controls are implemented within CEA; The development of robust simulations and plant growth prediction models.

The term simulation can take on two meanings within this context. The first, is an empirical simulation where CEA is applied to mimic open-field conditions such that, the plant’s behavior can be explored under controlled conditions, and the insights to be employed under real world field conditions. The second, is a computational simulation, or in silico representation and quantification of plant behaviors given simulated conditions through models. In this review, the use of the term simulation borrows from both meanings and describes in silico models that predict plant growth under real world conditions, which are developed by observing plant behaviors under controlled environments.

With this definition, it is also important to define plant growth within these models. Plant growth refers to the manifestation of plant traits, or phenotypes, over time. More specifically, the size, mass, and complexity of a plant over time, resulting from cell division, cell expansion, and differentiation processes that are regulated by internal genetic programs and modulated by external environmental factors [22,23]. Quantitatively, plant growth can be represented by morphological traits such as leaf area, height, and biomass, physiological measurements such as photosynthetic rate and transpiration, or biochemical markers, all of which vary across developmental stages and are sensitive to both spatial and temporal heterogeneity [24]. Moreover, the outputs of these simulations can have various representations from low dimensional quantitative traits to high dimensional 2D and 3D renderings and are explored further in this review.

1.3. Background: predictive plant growth modeling

Predictive models within plant research are often dominated by a frequentist approach to statistical inference, where empirical observations and static conditions are tested against hypotheses to determine the probable outcome through confidence intervals or point estimations. This statistical approach is favored within the field for its simplicity in analysis and interpretation. Frequentist methods, including various forms of regression modeling, provide robust tools for analyzing experimental data and have become the standard in plant research due to their well-established procedures and ease of use [25,26]. However, this experiment-centric approach is often limited when applied toward plant observation tasks that involve complex data structures, as with analyzing sequential and spatiotemporal images collected within the domain of high-throughput plant phenotyping (HTPP). Moreover, plant growth modeling and forecasting is an ill-posed problem where the trajectory of a plant’s development does not have a unique solution. Dynamic growing conditions and stochastic elements within plant development are prevalent due to the inherent variability and uncertainty that exists in biological systems. For example, plant growth is influenced by numerous interacting factors, including G ​× ​E ​× ​M variables. Thus, methods for regularization of the problem should be explored.

Within the context of simulations, computational models aim to predict plant traits, such as biomass accumulation, crop yield, presence of diseases, and overall growth dynamics, during various development stages under varying environmental conditions [27]. Some classical methods have approached this task through a combination of process models that simulate physiological principles and data-driven empirical models, typically obtained through a frequentist framework, to form various subsystems of an overarching hybrid model, offering a detailed understanding of how plants respond to different environmental stimuli at a field scale. Models such as the Crop Environment Resource Synthesis (CERES) [28] and the Decision Support System for Agrotechnology Transfer (DSSAT) have been extensively used to forecast plant growth by integrating weather data, soil conditions, and crop management practices [29,30]. These models have proven instrumental in evaluating the potential impacts of climate change on crop productivity, guiding decision-making in traditional agriculture. However, these methods often demonstrate the various limitations of a experiment-centric approach.

The first limitation of this classical framework is that the underlying inference models are typically discriminative. The parameters of these models are treated as fixed variables that characterize boundaries used to determine the most probable outcome based on the frequencies of occurrence in sample data. This lack of flexibility in describing data distributions makes these models less effective in representing systems that involve stochastic processes, as with the various biological processes that constitute plant growth. As a result, the deterministic process models derived from these empirical outcomes are often sensitive to initial conditions and are constrained to rigid scopes of inference, making it unsuitable for distant future predictions and the synthesis of realistic data. The second limitation is that the outputs of such models are typically expressed through low-dimensional or scalar quantitative traits, such as yield, biomass, and vegetative index. While this allows for simple comparative analysis and assessments of seasonality of data points, insights toward spatial information and links between input conditions and plant structural developments are lost. The third limitation is that frequentist-based inference models do not intrinsically incorporate mechanisms to update parameters and integrate new information dynamically. This paradigm makes these models less effective for sequence analysis, as they provide no insights on prior information and treat parameters and datasets as static. In contrast, probabilistic approaches, such as those based on Bayesian inference (BI), explicitly quantify uncertainties and dynamically update with new data [31]. The static nature of the classical approach to modeling plant growth can lead to the underestimation of uncertainty, as the underlying assumptions about the ”true” distribution of data may change, compounded by the cumulative effect of decisions made during earlier stages of inference in sequence analysis [32]. An overview of these dynamics between modeling paradigms are illustrated in Fig. 1.

Fig. 1.

Fig. 1

Overview of two modeling paradigms in plant growth research. The Frequentist approach emphasizes fixed data collection, hypothesis testing, and multivariate inference within an experimental-centric framework. In contrast, the probabilistic approach employs dynamic data integration, iterative updates to prior distributions, and posterior evaluation for decision-making, enabling data-driven adaptability over time.

1.4. Scope of this review

We explore these paradigms and assess the body of works available for various modeling tasks that require spatiotemporal analysis and pattern recognition of plant traits during its growth and development. We further propose insights to advance future works within this field and provide a perspective towards a model-centric probabilistic approach to plant growth pattern recognition and modeling through structured 2D and 3D data representations. This review serves as a comprehensive examination of the field of pattern recognition in plant growth. The review is organized as follows:

  • We present our rational and approach for presenting the works and state our limitations (Section 2).

  • We define plant growth modeling through various temporal paradigms and examine the prominent methods explored for scalar quantitative traits (Section 3).

  • We examine the application of these modeling paradigms towards 2D and 3D representations (Section 4).

  • We explore progress made within the developing field of probabilistic and generative computer-vision that utilize image-based forecasting (Section 4.2).

  • We provide a prospective towards the current state of research and propose a path for future works within the field (Section 5).

2. Rationale and approach

Recent state-of-the-art (SOTA) developments made within the field of HTPP have led to the establishment of data collection platforms and protocols for multimodal datasets, which include image sensor and 3D point cloud data [[33], [34], [35], [36], [37]]. With the introduction of these complex datasets and multi-modal data fusion techniques, plant phenomic research has shifted towards computer vision (CV) and sensor-based methods for efficient assessment of plant traits when compared to traditional, labor-intensive approaches [38,39]. Advances in CV, particularly through the use of deep learning (DL) models such as convolutional neural networks (CNN), have enabled the extraction of phenotypic traits such as leaf area, plant height, and disease symptoms from high-resolution images with good accuracy [40]. 3D imaging techniques, including structure-from-motion (SfM), stereo vision, and light detection and ranging (LiDAR) scanning, provide methods for processing spatial information to produce reconstructed 3D representation of plant architectures [41].

While these datasets are applied towards methods for assessing crop quality and quantifying biophysical properties throughout various stages of a plant’s growth cycle, the scope of phenomics research is often limited to recognition tasks such as detection and segmentation, with limited work done on CV-based predictive forecasting. In contrast, this review focuses on work that explore the spatiotemporal components of plant phenomics research and the representation of plant traits through time.

2.1. Literature overview

A rigorous screening process was applied to articles indexed within the following publication databases: SCOPUS, Web of Science (WOS), PubMed, and Google Scholar. A total of 51 method papers on plant growth modeling and 22 temporal dataset papers were selected and reviewed (Fig. 2). These papers include articles published between the years, 2015–2025, with a particular focus on assessing plant traits over time, where temporal analysis is a key component to the method or dataset. Foundational works predating 2015, such as standard numerical crop models, are referenced for context, but are not explored in depth as they are beyond the scope of this review. Articles in which traits are observed sparsely over time, for example traits observed once at the beginning of an experiment and once after harvest, are excluded from this review, opting for consistent, relatively high temporal resolution tracking of traits over time.

Fig. 2.

Fig. 2

Breakdown of publications reviewed, categorized by modeling approaches (left) and dataset types (right). The figure highlights the distribution of works across regression models, latent representation models, functional-structural plant models, and computer vision-based techniques, as well as their focus on temporal scalar, 2D image-based, and 3D point cloud datasets.

2.2. Plant growth representations and datasets

A repository of recently published datasets on various temporal plant growth dynamics was compiled for this review and is shown in Table 1. This repository establishes several representations of observable plant traits which can be categorized into broader segments and provides insights towards the current focus of plant phenomics research. These data representations include: 3D representations of plant structures, typically expressed through 3D point clouds; 2D representations of plant structures expressed through images; and finally, the most common amongst a broader spectrum of classical methods, is 1D representation of various traits expressed through quantitative scalar data.

Table 1.

Comprehensive list of temporal plant research datasets, published between 2015 and 2025, capturing various growth dynamics and traits.

Dataset Crop Traits Data Type
Sun et al. (Soybean-MVS) [42] Soybean (Glycine max var. DN251, DN252, DN253, HN48, HN51) Segmented plant structure (leaf shape, stem, branches) Point cloud (3D)
Schunck et al. (Pheno4D) [43] Maize (Zea mays), Tomato (Solanum lycopersicum) Segmented plant structure (leaf shape, stem, branches) Point cloud (3D)
Sultan et al. (Shoots and Roots) [44] Maize (Zea mays), Tomato (Solanum lycopersicum), rice (Azucena, Bala), Arabidopsis (Arabidopsis thaliana) Root system traits and structure Point cloud (3D)
Uchiyama et al. (KOMATSUNA) [45] Komatsuna (Brassica rapa var. perviridis) Plant structure Multi-view RGBD images and annotated masks (2D/3D)
Jayasuriya et al. [46] Capsicum (Capsicum annuum L.) Plant structure RGBD images (2D/3D)
Kierdorf et al. (GrowliFlower) [45] Cauliflower (Brassica oleracea var. botrytis) Plant structure RGB and multi-spectral images (2D)
Shadrin et al. [47] Lettuce (Lactuca sativa) Leaf growth and leaf segmentation Top-down RGB images and annotated masks (2D), and environmental sensor readings (tabular)
Genze et al. [48] Maize (Zea mays L.), Sorghum (Sorghum bicolor) Plant structure Top-down RGB images and annotated masks (2D)
Thoday-Kennedy et al. [49] Safflower (Carthamus tinctorius L.) Plant morphology and salt response Multi-view RGB images (2D)
Lärm et al. [50] Maize (Zea mays), Wheat (Triticum aestivum) Root system traits and structure RGB images and ground-penetrating radar map (2D), soil sensor data (tabular)
Parmentier et al. [51] General vegetation in Arctic ecosystems Vegetative phenology and climate interactions RGB images (2D) and environmental vegetative index data (tabular)
Craig et al. [52] Maize (Zea mays L.) Various agronomic traits and vegetative index Scalar (tabular)
Dépigny et al. (Plantain-Optim) [53] Plantain (Musa acuminata ​× ​Musa balbisiana) Various agronomic traits and biomass Scalar (tabular)
Ngaba et al. [54] Sugarcane (Saccharum officinarum) Various agronomic traits and biomass Scalar (tabular)
Newman et al. [55] Multiple field-crop species Long-term agronomic traits and biomass Scalar (tabular)
Campbell et al. [56] Maize (Zea mays) Photosynthetic function and vegetative index Scalar (tabular)
Manjarrez-Sanchez et al. [57] Greenhouse Tomato (Lycopersicum esculentum) Photosynthetic function and agronomic traits Scalar (tabular)
Niu et al. [58] Maize (Zea mays) Long-term phenological and climate interactions Scalar (tabular)
Zhu et al. [59] Multiple field-crop species Long-term phenological and climate interactions Scalar (tabular)
Mergner et al. [60] Arabidopsis (Arabidopsis thaliana) Gene expression Scalar (tabular)
Castro et al. [61] Camu-camu (Myrciaria dubia) Gene expression Scalar (tabular)
Peng et al. [62] Peanut (Arachis hypogaea) Gene expression Scalar (tabular)

2.3. Limitations

This review highlight works that have contributed significantly towards advanced techniques in pattern recognition by providing deeper insights into the mathematical frameworks or, in the case of more traditional forecasting methods, are recent and representative of the fundamental principles with some novel insights discussed. Due to the exploratory nature of this review of novel and SOTA approaches within this developing field, the availability of works and datasets that cover longitudinal predictive forecasting and simulated growth representations and meet this criterion is limited. We have omitted some works that simply use models already established by other works, and thus the number of works available for a full systemic review is not possible. Moreover, this review is constrained to published research and datasets, which may lead to an inherent bias towards studies and datasets that have achieved publication rather than those that may be equally relevant but remain in preprint stages, unpublished, or private.

3. Temporal modeling paradigms of quantitative traits

The abstraction of various plant biophysical properties into 1D quantitative traits provides a way to produce model parameters with functional meaning for a wide application of plant processes. Within plant phenomics, these applications can range from the molecular level with gene-protein expressions and phenotype mapping; the plant level in which we observe various plant-environment interactions; and the phenological level, where cyclical patterns of vegetative traits over a population within an ecosystem are mapped. In typical HTPP methods, scalar quantitative traits are inferred from high dimensional data such as RGB and multi-spectral images collected over time [[63], [64], [65], [66]]. These low dimensional quantitative traits can be assessed through time and represented through various temporal modeling paradigms.

3.1. Regression models

A common approach to simulate and predict the trajectory of these traits through time is through some form of regression-based inference model. Much of the works covered in this review describe methods for pattern recognition of plant growth that adhere to this framework. Phenotypic traits, denoted as y, can be expressed by explicitly fitting one of various forms of regressions (Equations (2), (3), (4))). We can define the configuration of independent parameters that contribute to the expression of these traits, as X ​= ​{xii ​∈ ​{1, 2, …, N}} for N parameters. For instance, y can represent a phenotypic trait such as plant height or leaf area, and xi represents the expression level of gene i. In order to model these effects continuously through time, t, a generic regression model is given by Equation (1). This regression approach provides an empirical basis for fitting discrete observation data and determining the fixed components of models such as regression and spline coefficients, β and γ, respectively. Discrepancies between observed data and modeled outputs can be described as noise through error term, ϵ.

y(t)=f(t)+ϵ(t) (1)
f(t)=i=0nβiti(Polynomial) (2)
f(t)=β0+β1t+j=1KγjSj(t)(Spline) (3)
f(t)=11+e(β0+β1t)(Logistic) (4)

The most common of these include non-linear regressions models that capture the typical patterns seen within plant growth traits. Following the conventions of a frequentist approach, sources of variation amongst growth patterns are hypothesized and made explicit before the collection of data. Non-experimental parameters, such as environmental and spatial variations, are standardized or corrected. Two works, Perez et al. [67] and Kar et al. [68], have proposed the implementation of spatial analysis of trials with splines (SPATS) for correcting spatial variations amongst experimental groups as a data pre-processing step before performing temporal modeling. The fixed but unknown statistics that describe the data are obtained empirically to fit the regression function for longitudinal trait predictions. The most prevalent implementations for modeling non-linear time-series patterns of scalar growth traits are spline-based forms, with common application of the model demonstrated in works such as Momen et al. [69] and Pauli et al. [70]. Polynomial forms, demonstrated in works such as Baba et al. [71] and Morales et al. [72], account for the majority of regressions used for longitudinal trait prediction.

3.1.1. Separating effects using hierarchical models

To disentangle the individual and compounding effects of various factors on plant growth trajectories, a hierarchical regression structure can be employed to represent plant traits for different levels of variation. This can be particularly useful for complex systems representing plant growth that is determined by multiple cumulative factors. For instance, plant traits such as height or leaf area can be modeled as responses to a combination of fixed and random effects. Fixed effects capture the consistent factors, such as the overall population-wide trends in growth, while random effects model the variation at each hierarchical level including genotype-specific effects and individual plant-specific variations. The random effects can vary across time, capturing the longitudinal nature of plant growth, while still maintaining a structure that allows for partial pooling of information across the hierarchy.

One such application of this is demonstrated by Equation (5) and shown in Fig. 3. A hierarchical logistic regression is employed to represent plant growth at the population level, genotype level, and individual plant level. A generic form logistic transformation (Equation (4)) can be applied towards the compounding fixed population effects, fp, the random effects at the genotype level, fpg(t), and the random effects of each individual plants, fpgi(t), to describe the development of plant traits over time, y(t). In this example β, L, and t0 in Equations (6), (7), (8)) are model parameters to be determined in order to fit our regressions to corresponding categorized data.

logyijt(t)Lijyijt(t)=fp(t)+fpg(t)+fpgi(t)+ϵijt,ϵijtN(0,σ2) (5)
fp(t)=βp,0+βp,1(tt0)(Fixed ​Population ​Effects) (6)
fpg(t)=βpg,0i+βpg,1i(tt0)(Random ​Genotype ​Effects) (7)
fpgi(t)=βpgi,0ij+βpgi,1ij(tt0)(Random ​Indvidual ​Plant ​Effects) (8)
Fig. 3.

Fig. 3

Hierarchical logistic regression modeling of typical plant growth with fixed and random effects. This hierarchical approach highlights variability at multiple levels, with the logistic model capturing both population-wide trends and genotype-specific and plant-specific variations in plant traits such as height, biomass, or leaf area.

When modeling plant traits over time, fixed effects are commonly defined by a logistic regression to capture typical growth curve patterns and to distinguish temporal trends between categorical groups, such as genotypes. A typical implementation of such models is demonstrated in works such as Baker et al. [73], in which the effects of genotypes are separated using Bayesian routines to estimate logistic parameters. Certain instances may require the disentanglement of complex effects between groups. In such cases, the mapping of these relationships require that regression models capture higher degrees of non-linearity. In Perez et al. [74], a P-spline-based hierarchical structure (Equation (3)) is employed to model the effects not just at the population and genotype level but also the effects between individual plants, experimental groups and spatio-temporal variations. This approach allows for the simultaneous representation of growth curves at various levels of variability and the incorporation of shared parameters amongst each group effects on plant growth.

3.1.2. How controlled environments affect fixed variables in regression

Temporal regression models for assessing longitudinal quantitative traits are typically developed under open-field conditions, which results in higher variability in observed outcomes and limit predictions made under dynamic conditions. One might want to separate these environmental effects as fixed variables to limit this variability for better performance on inference. While, controlled environment systems can regularize these conditions, the strict control of these factors can also introduce certain modeling constraints and affect how fixed effects are interpreted in temporal regression models. Specifically, the fixed variables in a regression model may become conflated with the uniform environmental conditions of the controlled environment. For example, plant height trajectories measured under identical light and nutrient regimes may exhibit reduced variability, masking G ​× ​E interactions that would otherwise emerge in field conditions [75]. As a result, the fitted regression parameters may overrepresent the influence of genotype or treatment effects, while underestimating environmental contributions.

Moreover, fixed effect parameters often capture the average trait progression under a narrow set of environmental conditions. This introduces a risk of overfitting to the specific experimental regime and limits the generalizability of the model to more variable or open-field conditions [76]. The deterministic nature of environmental control may also induce collinearity amongst time-dependent variables, making it difficult to separate the temporal effect of a fixed factor from its environmental context [77].

3.1.3. Pitfalls of regression-based temporal inference

Modeling plant growth through regressions provides meaningful interpretations of trends on labeled data points. However, temporal inferences made from these models suffer from the pitfalls of a post-hoc analysis. Following the typical experiment-centric modeling pipeline (Fig. 1), inference models are selected after data has been collected and are typically implemented to explain trends once experimental groups are found to be statistically significant. As such, various regression forms and their hyper-parameters are selected once this data is revealed in order to achieve best fit. This post-hoc selection provides an inherent risk of overfitting due to the lack of a priori model constraints, leading to interpretations that may capture transient or idiosyncratic patterns rather than robust, underlying temporal trends [78,79]. As a result, post-hoc regression models lack the generalizability necessary to predict future data accurately, particularly in time-series with complex dependencies or irregular periodicity [80,81].

Another pitfall involves the interpretability of the regression parameters. Temporal inferences often rely on the assumption that parameters estimated from the data capture the true characteristics of the underlying temporal process. However, in the absence of predefined model assumptions, these parameters can be sensitive to random variations or transient anomalies in the dataset. This sensitivity can result in misleading estimates of trend strength, direction, or duration, especially in nonlinear regression models, which are more prone to parameter instability when applied post hoc [82]. Consequently, regression-based temporal models may yield unreliable estimates that lack external validity, thereby compromising the inference drawn from these analyses.

Furthermore, temporal data often contain complex dependencies due to autocorrelation, seasonal effects, or multi-modal factors. When regression models are employed in a post-hoc framework, these dependencies are frequently under accounted for, leading to temporal confounding. Such confounding poses a significant challenge as it obscures the causative factors underlying observed trends and further limits the model’s applicability beyond the studied period [83]. Works such as Nesteruk et al. [84] demonstrate the use of autoregressive model improved predictions when compared to other non-correlated regressions.

3.2. Latent representation models

Quantitative traits can also be assessed through time by representing their complex temporal dependencies through some reduced latent representations,1 or hidden state. These representations can be applied as a transformation on sequential inputs to derive predictive outputs. In this section we explore various methods for extracting latent representation from the temporal dependencies between scalar quantitative traits and the models used to perform sequence-based inference from these hidden states.

3.2.1. Dimensionality reduction through principal component analysis

Methods like those employed by Adak et al. [85], aim to extract key features from time-series data that correspond to a reduced representation of growth rates using principal component analysis (PCA). In this context, PCA is a pattern recognition technique that captures the temporal components of the evolution of traits, such as key moments found within vegetative indices (VI) of plants of differing genotypes. These reduced representations effectively act as growth-rate phenotypes, embodying the entirety of a plant’s growth cycle and can be incorporated as an independent variable to improve accuracy of genomic and phenomic prediction models [86]. However, meaningful interpretations of these phenotypes, especially for simulation purposes, are lost as these representations are abstracted into a reduced latent form. Interpretations of these representations back to the time domain for inference requires a decoding process. We explore such a process for decoding a latent representation in Section 3.2.2.

3.2.2. Neural network-based models

More recently, with advancements in computational systems and optimization algorithms, NNs have been employed in models to perform powerful opaque mappings between inputs and outputs. As such, they have expanded the range of tools available for dimensionality reduction and feature extraction. While some early implementations of simple feed forward DL systems are employed for pattern recognition, they often perform inference on fixed time windows which result in the same pitfalls of regression systems described in Section 3.1.3 [87]. For instance, Matsumura et al. [88] employs a multi-layer perceptron (MLP) with inputs of early season parameters in the form of a fixed window to forecast end of season outputs. While this approach can capture relationships within the window, it fails to incorporate longer-term temporal dependencies beyond the window itself. This results in models that are limited in their ability to adapt to changes outside the fixed period, potentially leading to overfitting to short-term patterns and reduced generalizability to future time steps or different seasons.

3.2.2.1. Recurrent neural networks

To resolve this issue of fixed windows, sequence-based NNs, such as recurrent neural networks (RNN) and transformers, have been the focus of recent works to assess traits through time. Two main implementations of these networks include direct sequence predictions and latent sequence encoding (Fig. 4). The former utilizes NNs for sequence generation tasks such as time-series forecasting. In the context of RNNs, prior sequential information is defined by the weights of an NN and represented as a hidden state, h, which embodies all cumulative prior sequential information. This h is used to transform the next given input, Xt, to predict the next element in the sequence, Yt+1 (Fig. 4(a)). Various recent works within plant yield forecasting implement a subset of RNNs known as long short term memory (LSTM) networks to predict the overall real-time trajectory of plant growth traits [89,90].

Fig. 4.

Fig. 4

Directed graph models (folded) of the typical recurrent neural network (RNN) pipeline for sequence prediction and encoding. (a) Shows the sequence prediction task, where inputs X are processed through recurrent connections to generate hidden states h, leading to intermediate outputs O and a final output Y. This demonstrates how RNNs can predict each subsequent element in the sequence based on prior context.(b) Shows a sequence encoding process, where X is processed into a final encoded latent variable Z, capturing the entire sequence’s context in a single vector. Z can be decoded to obtain Y.

The second implantation of these NNs aims to capture and represent temporal relationships through a fixed latent variable, Z (Fig. 4(b)). Z is obtained by processing sequential information through an RNN and is subsequently derived from h, also typically defined in the form of the weights of a MLP between the two states. This representation captures the entirety of the sequence as a context driven conditional and can be decoded to perform inference on various tasks. One such application is demonstrated by Gong et al. [91], in which end of harvest yield is forecasted from a latent embedding of sequential environmental parameters through a temporal CNN decoder. Moreover, the decoding process can also be applied to task in which it is useful to predict future sequences of the same trait within a sequence-to-sequence approach. Works such as Bashar et al. [92], show the utility of such approach using a LSTM-based networks as both a sequence encoder to capture significant temporal relationships within plant growth development into a low dimensional embedding and as a decoder to obtain the next sequence of predicted outcomes from these embeddings.

3.2.2.2. Transformers and attention mechanisms

It is worth noting the applicability of transformers and attention mechanisms for analyzing sequential data similar to RNNs. However, unlike RNNs, transformers leverage a self-attention mechanism to process entire sequences in parallel, enabling them to capture long-range dependencies effectively as each element directly attends to all others [93]. Although few studies have applied these architectures towards temporal plant trait modeling, their ability to weigh the importance of different time steps dynamically demonstrates their potential for analyzing plant growth patterns influenced by both recent and distant environmental factors. This adaptability suggests significant opportunity for advancing plant growth forecasting, particularly for complex, multi-modal time-series data.

3.2.3. State-space models

State-space models (SSM) define a broader categorization of sequence processing approaches similar to RNNs. However, rather than learning an implicit hidden state, which captures prior information as memory, SSMs are defined by an explicit transition state model and observation model. These models can be expressed flexibly between deterministic functions for dynamical systems or as probabilistic methods for describing complex stochastic processes (Table 2). The transition state model defines how latent states, zt+1, evolve over time, and their dependence with the prior input, xt, and latent state, zt. The observation model relates the latent state to observed variables, yt. SSMs assume a Markov property, in which the future state of a system depends only on its current state. As such, the process has no memory beyond the current state as with RNNs. This Markov property can be relaxed with the inclusion of memory states of prior sequences in the transition state model (Section 4.2.2).

Table 2.

Comparison between the components of deterministic and probabilistic state-space models.

Deterministic SSM Probabilistic SSM
State Transition Model zt+1 ​= ​fθ(zt, xt) ​+ ​ηt zt+1pθ(zt+1|zt, xt)
Process Noise ηt ​≈ ​0 ηtN(0,Ση)

Observation Modela yt ​= ​gθ(zt) ​+ ​ξt yt ​∼ ​pθ(yt|zt)
Observation Noisea ξt ​≈ ​0 ξtN(0,Σξ)
a

Models may introduce stochasticity to a core deterministic functional. In such cases, process and observation noise do not equal 0.

Within the context of plant growth modeling and phenological pattern recognition, SSMs present an approach that captures the gradual changes in traits by modeling transitions within latent growth states [94,95]. This approach has the advantage of continuously updating estimates of latent variables based on context driven observations such as environmental factors. For instance, Viaud et al. [96] proposes that complex plant growth dynamics, including the variability from noisy environmental conditions can be better represented through a classical SSM framework, when compared to traditional models. More specifically, BI can be applied to estimate the parameters, θ, of a hidden Markov model (HMM), through a particle MCMC method. As such the stochastic elements of plant growth can be captured through a probabilistic latent transition state. In recent works, NNs have been employed as the transition state models to capture complex non-linear relationships between environmental and plant trait development. Shibata et al. [97] introduces a deep SSM designed to predict sugar content in greenhouse tomatoes from inputs representing environmental conditions. In this work it is suggested that a semi-surpervised approach can be taken and paired with reinforcement learning for dynamic control tasks in CEA, where optimal decisions must be made in response to changing conditions.

3.2.4. Pitfalls of representation models

Assessing plant growth patterns through latent representations provides a way to capture the complex long-term temporal dynamics of trait development. The hidden and latent states, particularly of RNNs and SSMs, determine the transformations on prior states, which can be used to interpret how the system will change through time. As such, these methods improve upon classic regression systems by providing a context driven inference paradigm that resolves various issues of post-hoc analysis and allows for real-time predictions. However, the abstraction of these temporal representations into latent forms results in challenges of interpretability [98,99]. Methods that inspect inner network structures reveal that even mechanistic dissection of latent variables often yields abstract or semantically opaque units [100].

As the prior states are transformed into latent or hidden states, the connection between the model inputs and the output space can become unclear. For instance, PCA reduces temporal data to principal components, but these components often lack a direct explainable meaning, making it difficult to infer how individual traits evolve over time directly and require some additional mapping to outputs [101]. This challenge is further exacerbated if mappings are performed through NN-based models, where latent and output states are learned through complex nonlinear weight transformations, leading to opaque predictions that prevent practical interpretations. Moreover, determining the inverse mapping of the model solution is not directly possible. For example, given the outcome of a plant’s trait at a particular time, we cannot determine the input conditions and prior states that gave rise to the outcome directly, especially if latent states are determined through NNs or other highly abstracted models.

3.2.4.1. Integrating domain-specific knowledge

Future work may focus on the integration of domain-specific knowledge within these pattern recognition paradigms, especially in cases where meaningful interpretation of processes is important. These knowledge-driven models typically define various plant processes based on biological principles such as photosynthesis, respiration, and growth dynamics through mechanistic models [102]. Xing-Rong et al. [103] explores a combined data-driven and knowledge-based approach to resolve issues of strict calibration constraints, and incomplete representations of complex plant dynamics that result from purely mechanistic models. Much of these hybrid models are early in their conception, utilizing simple data-driven models, such as regressions or simple MLPs, to account for variability of outputs or as direct inputs, as an estimator, to a mechanistic model. Consequently, multiple models are often employed as separate components and consolidated through a two-step process; one step to obtain the data driven term, and the second step for the mechanistic output. This approach has the disadvantage of higher computational cost and potential loss of capturing hierarchical or interdependent effects, especially if training processes are separated. Thus, there are significant potential for further contributions of advanced data-driven techniques to improve methods within a hybrid paradigm [104].

3.3. Comparisons of temporal models

Table 3 provides a comparative summary of the temporal modeling paradigms explored in this section for assessing longitudinal quantitative traits. Each method varies in its data requirements, ability to model uncertainty, interpretability of its parameters, and its typical use cases in plant phenomics. Regression-based models remain the most interpretable and widely applied due to their simplicity and empirical grounding, particularly in structured experimental setups. However, they often lack the flexibility to model complex temporal dependencies and are prone to post-hoc inference limitations.

Table 3.

Comparison of temporal models for assessing longitudinal quantitative traits.

Model Data Requirements Uncertainty Handling Interpretability Application
Regressiona Low; structured, independent data Not modeled during inference, but handled through confidence intervals High (coefficients are interpretable) Growth trend analysis; treatment comparisons
Hierarchical Regression Moderate; nested or multi-level data Explicit modeling via random effects Moderate to high Multi-environment trials; genotype ​× ​environment modeling
Feedforward NN (MLP)a High; labeled training data Not explicitly handled Low; black-box weights Trait prediction; nonlinear phenotype modeling
Sequence RNN High; time-ordered sequential data Implicit via temporal encoding Low; hidden states are abstract Dynamic trait prediction; sequential pattern learning
Encoder RNN High; time-series data Stochastic encoding model uncertainty Low; latent space hard to interpret Latent growth pattern discovery; generative modeling
PCAa Moderate; continuous multivariate data Not explicitly handled Low; PCs are linear combinations without direct meaning Dimensionality reduction; exploratory trait analysis
State-Space Model (SSM) Moderate; time-series with latent dynamics Explicit modeling of process and observation noise Moderate; states interpretable if well-defined Latent process modeling; sensor fusion; phenotype control
a

Not probabilistic by default, but uncertainty can be modeled using Bayesian or variational extensions.

Latent representation methods, including PCA and NN-based models, offer powerful tools for capturing abstract patterns and nonlinear dynamics, but at the cost of interpretability. Probabilistic models such as SSMs provide a middle ground by explicitly modeling noise and latent transitions while allowing for context-aware updates. Hybrid approaches that combine data-driven inference with mechanistic plant models may offer a promising direction for advancing interpretability and generalization across varied environmental and biological conditions.

4. 2D/3D structured representations through time

From low-dimensional scalar traits, we move toward representing the structural elements of plant growth through higher order data structures, such as 2D images and 3D point clouds. This is particularly useful for describing the development of plant structures through time, providing a graphical representation and visual insights to plant dynamics that are normally abstracted away with vectorized quantitative traits. The spatiotemporal simulation of these attributes offers a holistic assessment of strategies exhibited by the various plant behaviors based on contextual stimuli or signals. In this section, we take the foundation of pattern recognition techniques explored in Section 3 and apply them toward tasks that represent plant morphologies as structured outputs. This typically involves the addition of some rendering process which defines our plant representation model and is driven by a contextual process (Fig. 5). As such, we explore the various works on the 2D or 3D structured plant representations, as well as works that attempt to model the entire context driven spatiotemporal pipeline for simulating plant growth.

Fig. 5.

Fig. 5

High-level conceptual overview of a context driven simulation framework for structured output representation through time.

4.1. Functional-structural plant models

Functional-structural plant models (FSPM) represent a class of plant representation models that construct the structural and geometric elements of a plant from its mechanistic processes [105,106]. Particularly, these models capture the interaction between the plant’s physiological processes, such as photosynthesis, water transport, and nutrient allocation, and its development of plant organs such as leaves, branches, and roots over time [[107], [108], [109], [110]]. By coupling these two aspects, FSPMs provide a comprehensive view of plant behavior in response to various dependent parameters, allowing for the visualization of resource allocation and its effect on growth patterns. More recently, empirical methods have been integrated to the development process of FSPMs to establish the relationships between features extracted from HTPP data [111]. As such, an emergence of hybrid data-driven, mechanistic FSPMs have been applied towards plant growth simulations.

4.1.1. Process-driven differential growth models

One application of these FSPMs is to capture the morphological developments within plant organs through differential growth models. The development and emergence of complex 3D morphologies within plant structures through time can be represented through numerical models, such as finite element methods (FEM), to describe differential growth across localized regions. The basis of the mechanistic models that determine these differentials are typically derived through experimental observations, utilizing image-processing techniques such as optical flow to obtain growth parameters from HTPP data [112]. Changes are mapped spatially and reconstructed from the distribution of conditions which influence the final shape of plant organs. One example of such modeling is demonstrated in Huang et al. [113], where biomechanical conditions are observed to determine the shape formations of leaves and petals. In this work, the modeling of growth differentials can be used to describe and reconstruct twisting, helical twisting, and edge-waving configurations within leaf structures through FEM simulations.

More often, this approach for plant structural modeling is applied toward developing fate and growth maps of cells within individual plant organs to simulate localized development [114]. For instance, in the work done by Kierzkowski et al. [115], the dynamics of genomic-proteomics pathways of two homeobox genes and their effects on promoting, inhibiting, and differentiating plant cells are modeled through spatial growth differentials using FEM. This phenomena is modeled through time and driven by the localized stimuli of plant hormone, namely auxin, concentrations. In this case, this model is applied toward developing a deep understanding of the causal effects of these specific signal pathways through empirical methods by mapping signals through time.

4.1.2. Rule-based Lindenmayer systems

Another common implementation of FSPMs involves the application of a rule-based modeling paradigm known as the Lindenmayer systems (L-systems). L-systems define a formal set of recursive rules that specify the configuration of plant structures based on a string of symbolic characters. These rules guide the growth and branching patterns of simulated plants through iterative processes that aim to generalize and represent the realistic biological development and propagation of plant structures [[116], [117], [118]]. By repeatedly applying these recursive rules, L-systems can simulate natural variations and intricate details found in plant morphology, making them a powerful tool in computational botany and generative modeling. A basic example is given by Algorithm 1, which establishes a generalized execution of a simple L-system, where Fig. 6 shows the visual rendering of a rule set commonly used for describing a simple branching pattern in 2D.

Algorithm 1

Generate L-System String

Require Axiom (initial string) S, number of iterations N, set of production rules P
Ensure Final string after applying rules
1: Initialize currentS
2: Fori ​= ​1 to
3: N next ⇐ empty string
4: For each character c in current do
5: if there is a rule creplacement in P then
6: Append replacement to next
7: else
8: Append c to next
9: end if
10: end for
11: currentnext
12: end for
13: Return current
Fig. 6.

Fig. 6

Demonstration of plant-like structures generated using L-Systems with varying parameters. The axiom S ​= ​”F” and production rule P are iterated N times with specified angles. Examples show branching patterns for N ​= ​2, 3, 4 and angles 15°, 30°, 40°, demonstrating the influence of parameterized rules on the rendering process. The symbols represent drawing actions.

This algorithm can be expanded to include parameterized production rules that encapsulate complex relationships at each iteration of the simulated plant branching. For example, various works have developed production rules that dictate tree organ formation based on a source-sink dynamic between existing plant organs at previous iterations, which is then subsequently rendered for the 3D simulation of various tree cultivars [119,120]. More recent work has expanded on production rules using sink-source dynamics by incorporating context-driven systems such as environmental and management conditions, soil nutrient availability, and distribution to improve the dynamic tree simulation process [[121], [122], [123]].

Furthermore, works such as Hitti et al. [124] and Mußmann et al. [125] introduce an adaptation of the L-systems paradigm that captures the dynamic development of plant structures using agent-based interactions. Plant structures, represented as agents, can independently interact with various stimuli. Within such systems, agent-based feedback mechanisms, such as reinforcement learning, can be used to train model growth patterns based on external stimuli.

4.2. Computer vision approach to plant modeling

FSPMs typically rely on procedural methods for the visual rendering process of the plant simulation pipeline (Fig. 5). Although the outcomes of such models are interpretable and result in visually consistent structures, the outputs of such methods are often derived from deterministic solutions that lack stochastic variability. Such solutions tend to drift from biological realism, particularly when simulating complex interactions or environmental responses, requiring the integration of experiment data to drive realistic outputs [106]. Alternatively, some initial efforts have been made towards a novel approach using data-driven CV for rendering the structural elements of the simulation task.

For instance, the task of plant growth modeling can be structured to fall within one of two classic problems found within the domain of predictive CV and video analysis; frame-to-frame translation and frame synthesis. Within this perspective, the structural elements of plant growth can be represented as images of a temporal scene, or frames, where the goal of our methods is to predict the pixel-wise configuration of these frames, such that the corresponding scene describes the realistic spatiotemporal development of plant growth. These methods do not rely on procedural rendering routines using pre-rendered graphics. Rather, the visual rendering process becomes a CV-based image translation task driven by empirical methods for predicting frames through features learned from high dimensional 2D or 3D HTPP data. We explore some early works which utilize such methods for predictive pattern recognition.

4.2.1. Next frame prediction

Some early works on CV-based plant growth forecasting include Sakurai et al. [126] and Wang et al. [127], which have proposed the adoption of convolutional recurrent neural networks (CRNN) for the task of image-to-image translation to predict the next frame of a plant growth scene given a set of prior frames from the same scene. This approach expands on the sequence-based prediction techniques described in Section 3.2.2 to assess spatial features from consecutive images. This is done by incorporating convolutional layers within a RNN framework to extract and encode features from a set of input images, X, to a transitional hidden state h. The rendering of this hidden state is described by a decoding step which consists of an up-sampling process of h through deconvolutional layers. The typical generic implementation of the CRNN is shown in Fig. 7. This approach combines pattern recognition techniques for forecasting plant parameters using RNNs with the visual rendering of plant structures through image translation methods. While CRNNs allow for spatial features to be captured through time, the deterministic mapping through convolutional and deconvolutional layers often result in artifacts and ambiguities within structures of the output images [128].

Fig. 7.

Fig. 7

Directed graph model (unfolded) of a Convolutional Recurrent Neural Network (CRNN) implementation for predicting the next sequence in a plant scene. The model includes convolutional layers (ConvL1, ConvL2, …, ConvLn) for feature extraction and encoding, followed by a recurrent layer for temporal processing of sequential frame inputs,X, at t0, t1, …, tn. A decoder module with deconvolution layers (DeConvL) reconstructs the output frames, Y, at tn+1 based on learned spatiotemporal features.

To mitigate problems in image quality, recent work has focused on improving the image generation component of model architectures, which correspond to the decoder of a CRNN. Yasrab et al. [129] proposes the use of a Generative Adversarial Network (GAN)-based network architectures for improved realistic image synthesis. Other works such as Kim et al. [130] propose multi-step networks that separate various components of the image generation task, such as shape estimation and image reconstruction, utilizing a transformer-based network for spatial pattern recognition. Although these approaches improve image generation quality, they do not address the challenges of ambiguous structures caused by a deterministic mappings. CV-based plant forecasting is an ill-posed problem, in which, there exists non-unique outcomes of a plants growth trajectory given the same initial set of frames. This is particularly evident under diverging environmental conditions. Moreover, predictions from the current models are made solely on the previous frames of the scene and further contribute to the prevalence of ambiguities during image translation.

4.2.2. Frame synthesis through conditional generative models

A probabilistic approach can be taken to regularize the plant growth modeling problem and to guide the prediction process by incorporating stochastic elements caused by dynamic growing conditions. In such methods, a generative model is employed for the task of frame synthesis to predict the expected frames of the plant growth scene. While generative models are typically used as data augmentation techniques for enhancing sparse agricultural datasets [131,132], few works have explored the use of conditional generative models (CGM) for guided image synthesis. A probabilistic approach using a CGM is illustrated by a generic conditional variational autoencoder (CVAE) shown in Fig. 8. Instead of performing direct deterministic mappings between inputs and output frames, the objective of the CVAE is to learn a prior distribution, pθ(z|x), where z ​∼ ​pθ(z|x) is the latent representation of scene at a given point in time, t, directed by some conditional vector x. The prior distribution can be sampled such that it incorporates stochasticity through auxiliary variable, ϵ. This process is known as latent sampling and it guides the generation process by randomly sampling from an underlying latent distribution that captures the hidden structure of the data. The network structures explored within the domain of plant growth modeling typically consists of a probabilistic latent variable encoder and a deterministic decoder to render output frames, yˆpθ(y|z,x).

Fig. 8.

Fig. 8

Conditional generative model architecture for guided plant scene simulation. The framework includes a latent encoder FΦ(x, yz) that maps input data (X, Y) to a latent variable Z, and a decoder GΘ(z, xy) that reconstructs ground truth (GT) outputs, Y. The latent space is defined by μ and σ2 and incorporates auxiliary stochastic noise ϵ. Conditional vector, X, is directed by environmental parameters, c(τ), such as temperature, humidity, pH, and EC, over a time period τ ​= ​[0, t]. The architecture allows for generating sampled outputs, yˆ, guided by X.

Early works, including Miranda et al. [133] and Drees et al. [134], propose the adoption of a conditional GAN (cGAN), which introduces a discriminator with adversarial routines to the generic CVAE structure, for synthesizing field images and forecasting the state of the crop canopy. The conditional vector consists of fixed soil treatment embeddings set at the start of the growing season. Forecasts are made at discrete points in time driven by the initial states of the treatment conditions. These works have demonstrated SOTA guided image generation for realistic image synthesis with variability in outputs defined by stochastic elements, or “factors of variation” (FOV). The frames synthesized from such models can be used in conjunction with image-based feature extraction models to provide quantitative estimation of biomass for a complete simulation process [135].

In these works, predictions are guided by a conditional vector comprised of treatment conditions, a prior frame state, and a time factor. The treatment embeddings are fixed at a particular point in time, typically set at the start of experimentation, and do not account for dynamic changes in conditions. For spatiotemporal simulation and trajectory estimation, the guidance of predictions without feedback can lead to larger variability when diverging from initial conditions due to the accumulated effects of compounding factors, including environmental conditions [136]. As such, this approach may be applicable for simulating images of end of harvest canopies under varying treatment conditions at a discrete point in time in the future, but lacks the dynamics for real-time continuous plant growth predictions useful for real world applications such as anomaly detection and intervention practices.

Debbagh et al. [137] proposes a sequence-informed approach which introduces dynamic and compounding environmental conditions as feedback to a CGM, guiding a continuous simulation process. In this work, the conditional vector, x, which guides the image generation of the CGM, is directed by a configuration of random variables, c, that describe dynamic factors that contribute to the development of the plants within the scene (i.e. environmental parameters). x is determined by encoding a sequence of these environmental parameters from the start of the simulation up to the point of inference, c(τ), where τ ​= ​[0, t]. The encoding process consists of an RNN encoder, which allows the model to learn the compounding and sequential relationships between environmental conditions as a vector representation. The dependencies between each model component is illustrated by the directed graph model shown in Fig. 9. This approach provides a framework that further regularizes the image generation task through dynamic context, allowing for the realistic synthesis of consecutive frames.

Fig. 9.

Fig. 9

Directed graph model (folded) of a sequence-informed method for incorporating compounding environmental feedback to the conditional generative model (CGM). Environmental parameters, c(τ), are sequentially processed through an RNN encoder to obtain conditional vector, X. X guides the generation and latent encoding process of the CGM to produce output frames, Y. The generation edges correspond to the synthesis of outputs frames, Y, while the recognition edges correspond the process of learning the probability distributions of latent variable, Z. The recurrent connections allow the system to integrate sequential information for dynamic plant growth predictions.

4.3. Comparisons of structured output representations through time

Table 4 provides a comparative summary of the structured modeling paradigms discussed in this section for simulating plant morphologies through time. Each method represents a trade-off between data needs, realism, and interpretability.

Table 4.

Comparison of structured models for simulating plant morphologies through time.

Model Data Requirements Uncertainty Handling Interpretability Application
Differential Growth (FEM) High; cell-level growth data, imaging-based flow analysis Implicit in parameter variability; deterministic model High; spatial and physical dynamics interpretable Leaf morphogenesis, biomechanical growth simulations
L-Systema Low to moderate; rule sets and symbolic encodings Not explicitly handled High; rules directly map to morphology Branching patterns, 2D/3D growth simulations, generative modeling
Agent-based L-System High; interaction data with agents/environment Stochasticity and feedback via agent logic; tunable Moderate to high; emergent from explicit interactions Context-aware growth modeling; reinforcement learning driven growth
CRNN (Next Frame Prediction) High; sequential image data of plant growth Implicit via temporal encoding Low to moderate; convolution and hidden state opacity Visual trait prediction; spatiotemporal structure forecasting
Conditional Generative Models (CVAE, cGAN) High; image pairs with metadata or environmental labels Explicit; latent distribution sampling (KL divergence, adversarial) Moderate; constrained latent space enables partial interpretation Visual scene generation; growth under static treatment conditions
a

Not probabilistic by default, but uncertainty can be modeled using Bayesian or variational extensions.

Mechanistic FSPMs such as differential growth simulations based on FEM rely on high-resolution imaging and cell-level observations to reconstruct localized growth processes. These models are highly interpretable, driven by physical and biological insights, but are typically deterministic and do not formally account for uncertainty. L-systems offer interpretable and symbolic modeling of branching structures using production rules. While L-systems are not probabilistic by default, stochastic extensions and agent-based variants allow for the incorporation of environmental interaction and feedback, enabling more dynamic simulations.

CV-based approaches introduce data-driven alternatives for visualizing plant growth. CRNNs are used to capture sequential spatial features from image sequences but suffer from low interpretability and deterministic output mappings, while CGMs, including the sequence-informed CVAE and cGANs, address these challenges by introducing latent variable modeling and sampling to represent uncertainty and generate more biologically plausible outputs.

5. Perspectives

5.1. Data-driven vs experiment-driven approaches

Plant growth modeling has traditionally relied on experiment-driven approaches, where predefined hypotheses are tested using carefully controlled experiments. These methods provide valuable insights into specific biological processes but are often time-consuming, resource-intensive, and limited in scalability [[138], [139], [140]]. By contrast, data-driven and model-centric approaches leverage large datasets and machine learning techniques to capture patterns and relationships that might not be immediately apparent within an experiment-driven framework [141]. The works presented in this review demonstrate a shift towards the latter, as challenges with dynamic updating and long-term predictions are prevalent in simple regression methods as described in Section 3.1.3.

The shift towards 2D image and 3D point cloud data for describing plant phenemena, as with HTPP datasets, requires model frameworks designed for interpreting and processing higher order data-structures [142,143]. This inherently moves prospective research towards the CV-based models described in Sections 4.2, 4.2.1.2. However, the advantages of these data-driven approaches come with the limitation of reduced interpretability and a lack of the biological grounding inherent in experiment-driven methods [144]. Model frameworks such as FSPMs present potential opportunities for incorporating knowledge-based approaches for structured representation that is highly controllable, but is completely reliant on experiment-driven approaches for developing knowledge-based rule sets.

Future research in this domain should prioritize the development of hybrid approaches that integrate the strengths of both paradigms. Two key directions are proposed to achieve this goal. The first involves designing model architectures that incorporate knowledge-informed mechanisms, enabling the integration of biological principles and domain expertise into data-driven frameworks. This is demonstrated by works that aim to integrate deep learning frameworks into large scale crop models such as CERES and DSSAT [104]. The second focuses on advancing methods for decoding and interpreting opaque models, ensuring that complex representation models provide outputs that align with biological processes and facilitate deeper understanding [144].

5.2. Dataset gap

The shift toward data-driven approaches has placed datasets at the center of plant growth modeling and predictive frameworks. Large-scale, high-quality datasets are necessary for training DL models and validating their predictions [145,146]. However, the effectiveness of these models are dependent on the availability, diversity, and representativeness of the data [147]. Despite advancements in HTPP platforms, significant gaps remain in the scope, quality, and accessibility of existing datasets.

First, many datasets focus on single time points or static snapshots, which fail to capture the dynamic growth trajectories of plants over time. In such cases, works such as Zhu et al. [148] aim to simulate and analyze the physiological development of soybean plants using interpolated 3D reconstructions of discrete snapshots. Existing publicly available datasets that capture longitudinal data, compiled together and shown in Table 1, are scarce and limited in sample frequency. This is particularly evident for datasets that capture multiple modalities of data including 2D images and 3D point clouds along with environmental parameters. Second, the diversity of crop species represented by current datasets in Table 1 and HTTP platforms is limited, with a heavy emphasis on staple crops such as maize (Zea mays) and soybean (Glycine max) [149]. This narrow scope restricts the ability of models to generalize across plant species with differing morphologies, such as root vegetables or perennials. Finally, the lack of consolidation of datasets across different research groups and platforms creates significant barriers to broader adoption and cross-comparative studies. Inconsistent formats, annotation practices, and metadata standards hinder the ability to combine datasets, thereby limiting the diversity and scale necessary for developing generalizable models [150].

To address these gaps, several considerations for developing future datasets are proposed. First, works should aim to apply standardization and open-access practices during dataset development to enable reproducibility and comparability across studies through benchmarking. High-frequency sampling should also be prioritized to produce longitudinal datasets that span entire growth cycles, improving the ability to model and predict temporal dynamics accurately. Furthermore, the development of multi-modal datasets, combining 2D images, 3D point cloud data, and environmental sensor measurements is suggested for capturing a holistic view of plant-environment interactions. Finally, integrating domain knowledge into dataset design to ensure that datasets not only capture observable traits but also reflect underlying biological processes. Such biologically informed datasets will enable models to generate predictions that are both empirically and biologically meaningful [151].

5.3. Practical applications

The recent advancements in pattern recognition techniques explored in this review have significant implications for practical applications. While traditional applications include crop yield forecasting and climate simulations, these novel approaches expand the scope to include controlled environment systems, field crop systems, and plant genomics and phenomics research. These advancements allow for precise detection, simulation, and representation of plant growth processes in complex environments.

5.3.1. Anomaly detection

Anomaly detection and early interventions practices during crop development benefit from predictive pattern recognitions techniques that incorporate dynamic feedback from environmental growing conditions. The focus on the development of models that capture long-term temporal dependencies through sequence-based methods, such as transformers and RNNs, provides an effective approach to identifying deviations in plant growth trajectories. For instance, temporal anomalies such as stunted growth or disease occurrence can be detected in advance by analyzing sequential data from real-time HTPP platform measurements [152]. Moreover, the incorporation of multi-modal approaches to combine imaging data with contextual environmental parameters presents a more comprehensive assessment of plant health. Models such as multi-variate transformers have been employed to improve the precision of anomaly detection in time-series data [153,154]. Such techniques are particularly useful for early intervention, mitigating potential yield losses and optimizing resource use.

5.3.2. Plant-environment simulation

Simulation frameworks informed by generative and probabilistic models improve the ability to predict plant growth under varying environmental conditions. Models like the CGMs presented in Section 4.2.2 enable the synthesis of realistic growth trajectories influenced by the spatial and dynamic growing condition, including the distribution of light, nutrient conditions, temperature, and CO2 [134,137]. This provides in silico methods to inform the structural development process of a controlled plant system and to develop strategies prior to conducting in-field operations. These frameworks allow for resource optimization in controlled environments and improve the forecasting of long-term plant responses to changing environments.

5.3.3. Digital twinning and representation

Digital twinning is the virtual replication of a physical system, maintained through continuous data exchange, simulation, and feedback. Within plant phenomics, digital twinning is limited and typically consists of methods for 3D reconstruction of plant structures taken at sparse static snapshots within its growth cycle. This is often due to limitations in imaging techniques for capturing frequent 3D spatial data during its growth and challenges in continuous collection of auxiliary data [142,155]. The structured representation methods explored within this review have implications on expanding this static framework to capture a continuous virtual replica of a plant system during its entire growth cycle. The continuous frame synthesis techniques explored in Debbagh et al. [137], allow for the interpolation of structured outputs between points of time. Moreover, FSPMs such as L-systems also demonstrate a potential for continuous state estimation by basing structural development on biological rules as a method that does not rely on continuous data collection [156]. These methods present an opportunity to generate digital plant structures with the guidance of few snapshots of plants. Moreover, the divergence of these plant structures from the snapshots can be explored through simulated growing conditions.

6. Conclusion

In this review, we provide a comprehensive examination of recent works and explore prospects towards advancing predictive pattern recognition techniques within plant phenomics research. These works cover various approaches that address challenges in processing and interpreting various representations of spatiotemporal plant data, which include longitudinal scalar growth traits and structured 2D images and 3D point cloud data. Within these works, novel modeling paradigms, which take advantage of recent advancements in computational systems, HTPP data collection and data fusion techniques, have presented significant improvements to the precision and accuracy of models for long-term forecasting and other simulation tasks. Notably, NN-based representation models and CGMs present a shift toward a data-driven approach to predictive pattern recognition that not only allows for dynamic updating for guiding predictions, but also allows for the synthesis of structured representation for spatial understanding of outputs.

Additionally, this review offers a perspective for opportunities to advance works within in this field. This includes perspectives on the integration of domain knowledge with data-driven approaches, and the emphasis on developing comprehensive, standardized datasets. Furthermore, we suggest practical applications within various domains to be explored in future works. The collection of these works and our perspective aims to consolidate and expand the understanding of spatiotemporal modeling frameworks and their role in advancing plant phenomics research and applications.

General

Special thanks to Anita Parmar, Ollivier Dyens, and the Building 21 community for their valuable contributions through engaging and thought-provoking discussions.

Author contributions

M. Debbagh drafted the manuscript, prepared the figures and tables, and incorporated feedback from co-authors. S. Sun and M. Lefsrud reviewed and edited the manuscript, offering feedback to improve clarity and impact.

Tools

Fig. 1 (https://BioRender.com/y80s089) and Fig. 5 (https://BioRender.com/y69z168) were created in BioRender.com.

Funding

This work was supported by the Natural Sciences and Engineering Research Council of Canada (RGPIN-2021-03266, RGPIN-2021-03410); and Gardyn through the Mitacs Accelerate program (IT16220).

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Footnotes

1

Latent representations refer to reduced-dimensional, abstract encodings of quantitative trait data that capture temporal dynamics (time dimension) and underlying structure.

References

  • 1.Mirzabaev Alisher, Olsson Lennart, Kerr Rachel Bezner, Pradhan Prajal, Ferre Marta Guadalupe Rivera, Lotze-Campen Hermann. Springer International Publishing; Cham: 2023. Climate Change and Food Systems; pp. 511–529. ISBN 978-3-031-15703-5. [DOI] [PubMed] [Google Scholar]
  • 2.O’Sullivan C.A., Bonnett G.D., McIntyre C.L., Hochman Z., Wasson A.P. Strategies to improve the productivity, product diversity and profitability of urban agriculture. Agric. Syst. 2019;174:133–144. doi: 10.1016/j.agsy.2019.05.007. ISSN 0308-521X. [DOI] [Google Scholar]
  • 3.Benke Kurt, Tomkins Bruce. Future food-production systems: vertical farming and controlled-environment agriculture. Sustain. Sci. Pract. Pol. 2017;13(1):13–26. doi: 10.1080/15487733.2017.1394054. [DOI] [Google Scholar]
  • 4.Halperin Ofer, Gebremedhin Alem, Wallach Rony, Moshelion Menachem. High-throughput physiological phenotyping and screening system for the characterization of plant–environment interactions. Plant J. 2017;89(4):839–850. doi: 10.1111/tpj.13425. [DOI] [PubMed] [Google Scholar]
  • 5.Henry Robert J. Innovations in plant genetics adapting agriculture to climate change. Curr. Opin. Plant Biol. 2020;56:168–173. doi: 10.1016/j.pbi.2019.11.004. ISSN 1369-5266. [DOI] [PubMed] [Google Scholar]
  • 6.Langstroff Anna, Heuermann Marc C., Stahl Andreas, Junker Astrid. Opportunities and limits of controlled-environment plant phenotyping for climate response traits. Theor. Appl. Genet. 2022;135(1):1–16. doi: 10.1007/s00122-021-03892-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Anant Bapat Vishwas, Kavi Kishor P.B., Jalaja Naravula, Jain Shri Mohan, Penna Suprasanna. Plant cell cultures: biofactories for the production of bioactive compounds. Agronomy. 2023;13(3) doi: 10.3390/agronomy13030858. ISSN 2073-4395. [DOI] [Google Scholar]
  • 8.Yang Yang, Hassan Sedky H.A., Kumar Awasthi Mukesh, Gajendran Babu, Sharma Monika, Ji Min-Kyu, Salama El-Sayed. The recent progress on the bioactive compounds from algal biomass for human health applications. Food Biosci. 2023;51 doi: 10.1016/j.fbio.2022.102267. ISSN 2212-4292. [DOI] [Google Scholar]
  • 9.Nguyen Mai Trinh Phuong, Knowling Matthew, Tran Nam N., Burgess Alexandra, Fisk Ian, Watt Michelle, Escribá-Gelonch Marc, This Herve, Culton John, Hessel Volker. Space farming: horticulture systems on spacecraft and outlook to planetary space exploration. Plant Physiol. Biochem. 2023;194:708–721. doi: 10.1016/j.plaphy.2022.12.017. ISSN 0981-9428. [DOI] [PubMed] [Google Scholar]
  • 10.Lefers Ryan M., Tester Mark, Lauersen Kyle J. Emerging technologies to enable sustainable controlled environment agriculture in the extreme environments of middle east-north Africa coastal regions. Front. Plant Sci. 2020;11 doi: 10.3389/fpls.2020.00801. ISSN 1664-462X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Paul Zabel, Conrad Zeidler, Vincent Vrakking, Dorn Markus, Schubert Daniel. Biomass production of the eden iss space greenhouse in Antarctica during the 2018 experiment phase. Front. Plant Sci. 2020;11 doi: 10.3389/fpls.2020.00656. ISSN 1664-462X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Benes Bedrich, Guan Kaiyu, Lang Meagan, Long Stephen P., Lynch Jonathan P., Marshall-Colón Amy, Peng Bin, Schnable James, Sweetlove Lee J., Turk Matthew J. Multiscale computational models can guide experimentation and targeted measurements for crop improvement. Plant J. 2020;103(1):21–31. doi: 10.1111/tpj.14722. [DOI] [PubMed] [Google Scholar]
  • 13.Großkinsky Dominik K., Svensgaard Jesper, Christensen Svend, Roitsch Thomas. Plant phenomics and the need for physiological phenotyping across scales to narrow the genotype-to-phenotype knowledge gap. J. Exp. Bot. 2015;66(18):5429–5440. doi: 10.1093/jxb/erv345. ISSN 0022-0957. [DOI] [PubMed] [Google Scholar]
  • 14.White Jeffrey W., Andrade-Sanchez Pedro, Gore Michael A., Bronson Kevin F., Coffelt Terry A., Conley Matthew M., Feldmann Kenneth A., French Andrew N., Heun John T., Hunsaker Douglas J., Jenks Matthew A., Kimball Bruce A., Roth Robert L., Strand Robert J., Thorp Kelly R., Wall Gerard W., Wang Guangyao. Field-based phenomics for plant genetics research. Field Crops Res. 2012;133:101–112. doi: 10.1016/j.fcr.2012.04.003. ISSN 0378-4290. [DOI] [Google Scholar]
  • 15.Deery David M., Jones Hamlyn G. Field phenomics: will it enable crop improvement? Plant Phenomics. 2021:9871989–9872021. doi: 10.34133/2021/9871989. ISSN 2643-6515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Furbank Robert T., Jimenez-Berni Jose A., George-Jaeggli Barbara, Potgieter Andries B., Deery David M. Field crop phenomics: enabling breeding for radiation use efficiency and biomass in cereal crops. New Phytol. 2019;223(4):1714–1727. doi: 10.1111/nph.15817. [DOI] [PubMed] [Google Scholar]
  • 17.Luis Araus José, Cairns Jill E. Field high-throughput phenotyping: the new crop breeding frontier. Trends Plant Sci. 2014;19(1):52–61. doi: 10.1016/j.tplants.2013.09.008. ISSN 1360-1385. [DOI] [PubMed] [Google Scholar]
  • 18.Srivani P., Manjula S.H., et al. 2019 Fifteenth International Conference on Information Processing (ICINPRO) IEEE; 2019. A controlled environment agriculture with hydroponics: variants, parameters, methodologies and challenges for smart farming; pp. 1–8. [DOI] [Google Scholar]
  • 19.Cooper Mark, Messina Carlos D., Tang Tom, Gho Carla, Powell Owen M., Podlich Dean W., Technow Frank, Hammer Graeme L. Plant Breeding Reviews. 2022. Predicting genotype× environment × management (g × e × m) interactions for the design of crop improvement strategies: integrating breeder, agronomist, and farmer perspectives; pp. 467–585. [DOI] [Google Scholar]
  • 20.Messina C.D., Technow F., Tang T., Totir R., Gho C., Cooper M. Leveraging biological insight and environmental variation to improve phenotypic prediction: integrating crop growth models (cgm) with whole genome prediction (wgp) Eur. J. Agron. 2018;100:151–162. doi: 10.1016/j.eja.2018.01.007. ISSN 1161-0301. [DOI] [Google Scholar]
  • 21.Mahmood Tariq, Ahmed Talaat, Trethowan Richard. Genotype x environment x management (gem) reciprocity and crop productivity. Front. Agron. 2022;4 doi: 10.3389/fagro.2022.800365. [DOI] [Google Scholar]
  • 22.Brukhin V., Morozova N. Plant growth and development - basic knowledge and current views. Math. Model Nat. Phenom. 2011;6(2):1–53. doi: 10.1051/mmnp/20116201. [DOI] [Google Scholar]
  • 23.Alberts Bruce, Johnson Alexander, Lewis Julian, Raff Martin, Roberts Keith, Walter Peter. Molecular Biology of the Cell. fourth ed. Garland Science; 2002. Plant development.https://www.ncbi.nlm.nih.gov/books/NBK26922/ URL. [Google Scholar]
  • 24.Fiorani Fabio, Ulrich Schurr. Future scenarios for plant phenotyping. Annu. Rev. Plant Biol. 2013;64:267–291. doi: 10.1146/annurev-arplant-050312-120137. ISSN 1545-2123. [DOI] [PubMed] [Google Scholar]
  • 25.Hoshmand Reza. Chapman and Hall/CRC; 2006. Design of Experiments for Agriculture and the Natural Sciences. [DOI] [Google Scholar]
  • 26.Bender Filmore E. CRC Press; 1989. Statistical Methods for Food and Agriculture. [DOI] [Google Scholar]
  • 27.Fenu Gianni, Malloci Francesca Maridina. Forecasting plant and crop disease: an explorative study on current algorithms. Big Data Cognit. Comput. 2021;5(1) doi: 10.3390/bdcc5010002. ISSN 2504-2289. [DOI] [Google Scholar]
  • 28.Basso Bruno, Lin Liu, Ritchie Joe T. In: Advances in Agronomy, Volume 136 of Advances in Agronomy. Sparks Donald L., editor. Academic Press; 2016. A comprehensive review of the ceres-wheat, -maize and -rice models' performances; pp. 27–132. [DOI] [Google Scholar]
  • 29.Jones J.W., Hoogenboom G., Porter C.H., Boote K.J., Batchelor W.D., Hunt L.A., Wilkens P.W., Singh U., Gijsman A.J., Ritchie J.T. The dssat cropping system model. Eur. J. Agron. 2003;18(3):235–265. doi: 10.1016/S1161-0301(02)00107-7. ISSN 1161-0301. [DOI] [Google Scholar]
  • 30.Jean-Yves N’guessan Konan, Adahi Botou, Konan-Waidhet Arthur-Brice, Masayoshi Satoh, Assidjo Nogbou Emmanuel. Assessment of climate change impact on water requirement and rice productivity. Rice Sci. 2023;30(4):276–293. doi: 10.1016/j.rsci.2023.03.010. ISSN 1672-6308. [DOI] [Google Scholar]
  • 31.MacKay David JC. Cambridge university press; 2003. Information Theory, Inference and Learning Algorithms. [DOI] [Google Scholar]
  • 32.McElreath Richard. Chapman and Hall/CRC; 2018. Statistical Rethinking: a Bayesian Course with Examples in R and Stan. [DOI] [Google Scholar]
  • 33.Walter Achim, Liebisch Frank, Hund Andreas. Plant phenotyping: from bean weighing to image analysis. Plant Methods. 2015;11:1–11. doi: 10.1186/s13007-015-0056-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Li Daoliang, Quan Chaoqun, Song Zhaoyang, Xiang Li, Yu Guanghui, Cheng Li, Muhammad Akhter. High-throughput plant phenotyping platform (ht3p) as a novel tool for estimating agronomic traits from the lab to the field. Front. Bioeng. Biotechnol. 2021;8 doi: 10.3389/fbioe.2020.623705. ISSN 2296-4185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Sarić Rijad, Nguyen Viet D., Burge Timothy, Berkowitz Oliver, Trtílek Martin, Whelan James, Lewsey Mathew G., Čustović Edhem. Applications of hyperspectral imaging in plant phenotyping. Trends Plant Sci. 2022;27(3):301–315. doi: 10.1016/j.tplants.2021.12.003. [DOI] [PubMed] [Google Scholar]
  • 36.Forero Manuel G., Murcia Harold F., Méndez Dehyro, Betancourt-Lozano Juan. Lidar platform for acquisition of 3d plant phenotyping database. Plants. 2022;11(17) doi: 10.3390/plants11172199. ISSN 2223-7747. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wen Ting, Li Jian-Hong, Qi Wang, Gao Yang-Yang, Ge-Fei Hao, Song Bao-An. Thermal imaging: the digital eye facilitates high-throughput phenotyping traits of plant growth and stress responses. Sci. Total Environ. 2023;899 doi: 10.1016/j.scitotenv.2023.165626. ISSN 0048-9697. [DOI] [PubMed] [Google Scholar]
  • 38.Choudhury Sruti Das, Samal Ashok, Awada Tala. Leveraging image analysis for high-throughput plant phenotyping. Front. Plant Sci. 2019;10 doi: 10.3389/fpls.2019.00508. ISSN 1664-462X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Jadwiga Olas Justyna, Fichtner Franziska, Apelt Federico. All roads lead to growth: imaging-based and biochemical methods to measure plant growth. J. Exp. Bot. 2019;71(1):11–21. doi: 10.1093/jxb/erz406. ISSN 0022-0957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Singh Arti, Ganapathysubramanian Baskar, Singh Asheesh Kumar, Sarkar Soumik. Machine learning for high-throughput stress phenotyping in plants. Trends Plant Sci. 2016;21(2):110–124. doi: 10.1016/j.tplants.2015.10.015. [DOI] [PubMed] [Google Scholar]
  • 41.Sun Shangpeng, Li Changying, Chee Peng W., Paterson Andrew H., Jiang Yu, Xu Rui, Robertson Jon S., Adhikari Jeevan, Shehzad Tariq. Three-dimensional photogrammetric mapping of cotton bolls in situ based on point cloud segmentation and clustering. ISPRS J. Photogrammetry Remote Sens. 2020;160:195–207. doi: 10.1016/j.isprsjprs.2019.12.011. ISSN 0924-2716. [DOI] [Google Scholar]
  • 42.Sun Yongzhe, Zhang Zhixin, Sun Kai, Li Shuai, Yu Jianglin, Miao Linxiao, Zhang Zhanguo, Li Yang, Zhao Hongjie, Hu Zhenbang, Xin Dawei, Chen Qingshan, Zhu Rongsheng. Soybean-mvs: Annotated three-dimensional model dataset of whole growth period soybeans for 3d plant organ segmentation. Agriculture. 2023;13(7) doi: 10.3390/agriculture13071321. ISSN 2077-0472. [DOI] [Google Scholar]
  • 43.Schunck David, Magistri Federico, Rosu Radu Alexandru, Cornelißen André, Chebrolu Nived, Paulus Stefan, Léon Jens, Behnke Sven, Stachniss Cyrill, Kuhlmann Heiner, Klingbeil Lasse. Pheno4d: a spatio-temporal dataset of maize and tomato plant point clouds for phenotyping and advanced plant analysis. PLoS One. 2021;16(8):1–18. doi: 10.1371/journal.pone.0256340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Sultan Sam, Snider Joseph, Conn Adam, Mao Li, Topp Christopher N., Navlakha Saket. A statistical growth property of plant root architectures. Plant Phenomics. 2020:2020. doi: 10.34133/2020/2073723. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Uchiyama Hideaki, Sakurai Shunsuke, Mishima Masashi, Arita Daisaku, Okayasu Takashi, Shimada Atsushi, Taniguchi Rin-ichiro. 2017 IEEE International Conference on Computer Vision Workshops (ICCVW) 2017. An easy-to-setup 3d phenotyping platform for komatsuna dataset; pp. 2038–2045. [DOI] [Google Scholar]
  • 46.Jayasuriya Namal, Ghannoum Oula, Wen Hu, Klause Norbert, Liang Weiguang, Guo Yi. Pc4c_capsi: image data of capsicum plant growth in protected horticulture. Data Brief. 2024;55 doi: 10.1016/j.dib.2024.110735. ISSN 2352-3409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Shadrin Dmitrii G., Kulikov Victor, Fedorov Maxim V. British Machine Vision Conference. 2018. Instance segmentation for assessment of plant growth dynamics in artificial soilless conditions.http://bmvc2018.org/contents/workshops/cvppp2018/0027.pdf URL. [Google Scholar]
  • 48.Genze Nikita, Vahl Wouter K., Groth Jennifer, Wirth Maximilian, Grieb Michael, Grimm Dominik G. Manually annotated and curated dataset of diverse weed species in maize and sorghum for computer vision. Sci. Data. 2024;11(1):109. doi: 10.1038/s41597-024-02945-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Thoday-Kennedy Emily, Dimech Adam M., Joshi Sameer, Daetwyler Hans D., Hudson David, Spangenberg German, Hayden Matthew, Kant Surya. An image dataset of diverse safflower (carthamus tinctorius l.) genotypes for salt response phenotyping. Data Brief. 2023;46 doi: 10.1016/j.dib.2022.108787. ISSN 2352-3409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Lärm Lena, Bauer Felix Maximilian, Hermes Normen, van der Kruk Jan, Vereecken Harry, Vanderborght Jan, Nguyen Thuy Huu, Lopez Gina, Seidel Sabine Julia, Ewert Frank, et al. Multi-year belowground data of minirhizotron facilities in selhausen. Sci. Data. 2023;10(1):672. doi: 10.1038/s41597-023-02570-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Parmentier F.-J.W., Nilsen L., Tømmervik H., Cooper E.J. A distributed time-lapse camera network to track vegetation phenology with high temporal detail and at varying scales. Earth Syst. Sci. Data. 2021;13(7):3593–3606. doi: 10.5194/essd-13-3593-2021. [DOI] [Google Scholar]
  • 52.Craig Valerie, Earl Hugh, Sulik John, Lee Elizabeth A. Hyperspectral time series datasets of maize during the grain filling period. BMC Res. Notes. 2022;15(1):152. doi: 10.1186/s13104-022-06029-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Dépigny Sylvain, Tchotang Frédéric, Talla Médard, Fofack Désirée, Essomé David, Ebongué Jean-Pierre, Kengni Bernard, Lescot Thierry. The ’plantain-optim’ dataset: agronomic traits of 405 plantains every 15 days from planting to harvest. Data Brief. 2018;17:671–680. doi: 10.1016/j.dib.2018.01.065. ISSN 2352-3409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Ngaba Billy, Christina Mathias, Mansuy Alizé, Chetty Julien, Soulé Mathilde, Schwartz Marion, Heuclin Benjamin, Auzoux Sandrine. Experimental dataset of sugarcane-cover crop intercropping trials to control weeds in Reunion island. Data Brief. 2023;48 doi: 10.1016/j.dib.2023.109244. ISSN 2352-3409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Justin Newman Saul, Furbank Robert T. A multiple species, continent-wide, million-phenotype agronomic plant dataset. Sci. Data. 2021;8(1):116. doi: 10.1038/s41597-021-00898-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Campbell Petya, Middleton Elizabeth, Huemmrich Karl, Ward Lauren, Julitta Tommaso, Yang Peiqi, van der Tol Christiaan, Craig Daughtry, Russ Andrew, Alfieri Joseph, Kustas William. Scaling photosynthetic function and co2 dynamics from leaf to canopy level for maize – dataset combining diurnal and seasonal measurements of vegetation fluorescence, reflectance and vegetation indices with canopy gross ecosystem productivity. Data Brief. 2021;39 doi: 10.1016/j.dib.2021.107600. ISSN 2352-3409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Manjarrez-Sanchez Jorge, Martinez-Carrillo Gerardo. Dataset of photosynthesis and photosynthetic factors measurements of greenhouse tomato. Data Brief. 2020;32 doi: 10.1016/j.dib.2020.106274. ISSN 2352-3409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Niu Q., Li X., Huang J., Huang H., Huang X., Su W., Yuan W. A 30 m annual maize phenology dataset from 1985 to 2020 in China. Earth Syst. Sci. Data. 2022;14(6):2851–2864. doi: 10.5194/essd-14-2851-2022. [DOI] [Google Scholar]
  • 59.Zhu M., Dai J., Wang H., Alatalo J.M., Liu W., Hao Y., Ge Q. Mapping 24 woody plant species phenology and ground forest phenology over China from 1951 to 2020. Earth Syst. Sci. Data. 2024;16(1):277–293. doi: 10.5194/essd-16-277-2024. [DOI] [Google Scholar]
  • 60.Mergner Julia, Frejno Martin, Messerer Maxim, Lang Daniel, Samaras Patroklos, Wilhelm Mathias, Mayer Klaus FX., Schwechheimer Claus, Kuster Bernhard. Proteomic and transcriptomic profiling of aerial organ development in arabidopsis. Sci. Data. 2020;7(1):334. doi: 10.1038/s41597-020-00678-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Castro Juan C., Dylan Maddox J., Rodríguez Hicler N., Castro Carlos G., Imán-Correa Sixto A., Cobos Marianela, Paredes Jae D., Marapara Jorge L., Braga Janeth, Adrianzén Pedro M. Dataset of de novo assembly and functional annotation of the transcriptome during germination and initial growth of seedlings of myrciaria dubia ”camu-camu”. Data Brief. 2020;31 doi: 10.1016/j.dib.2020.105834. ISSN 2352-3409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Peng Zhenying, Jia Kai-Hua, Meng Jingjing, Wang Jianguo, Zhang Jialei, Li Xinguo, Wan Shubo. Transcriptome profiling of aerial and subterranean peanut pod development. Sci. Data. 2024;11(1):364. doi: 10.1038/s41597-024-03205-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Ge Yufeng, Bai Geng, Vincent Stoerger, Schnable James C. Temporal dynamics of maize plant growth, water use, and leaf water content using automated high throughput rgb and hyperspectral imaging. Comput. Electron. Agric. 2016;127:625–632. doi: 10.1016/j.compag.2016.07.028. ISSN 0168-1699. [DOI] [Google Scholar]
  • 64.Anche Mahlet T., Morales Nicholas, Kaczmar Nicholas S., Santantonio Nicholas, Gore Michael A., Robbins Kelly R. Scalable growth models for time-series multispectral images. Plant Phenom. J. 2023;6(1) doi: 10.1002/ppj2.20064. [DOI] [Google Scholar]
  • 65.Anche Mahlet T., Kaczmar Nicholas S., Morales Nicolas, Clohessy James W., Ilut Daniel C., Gore Michael A., Robbins Kelly R. Temporal covariance structure of multi-spectral phenotypes and their predictive ability for end-of-season traits in maize. Theor. Appl. Genet. 2020;133:2853–2868. doi: 10.1007/s00122-020-03637-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Sun Shangpeng, Li Changying, Paterson Andrew H., Jiang Yu, Xu Rui, Robertson Jon S., Snider John L., Peng W Chee. In-field high throughput phenotyping and cotton plant growth analysis using lidar. Front. Plant Sci. 2018;9:16. doi: 10.3389/fpls.2018.00016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Pérez-Valencia Diana M., Rodríguez-Álvarez María Xosé, Boer Martin P., Kronenberg Lukas, Hund Andreas, Cabrera-Bosquet Llorenç, Millet Emilie J., van Eeuwijk Fred A. A two-stage approach for the spatio-temporal analysis of high-throughput phenotyping data. Sci. Rep. 2022;12(1):3177. doi: 10.1038/s41598-022-06935-9. ISSN 2045-2322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Kar Soumyashree, Vincent Garin, Kholová Jana, Vincent Vadez, Durbha Surya S., Tanaka Ryokei, Iwata Hiroyoshi, Urban Milan O., Spatemhtp J. Adinarayana. A data analysis pipeline for efficient processing and utilization of temporal high-throughput phenotyping data. Front. Plant Sci. 2020;11 doi: 10.3389/fpls.2020.552509. ISSN 1664-462X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Momen Mehdi, Campbell Malachy T., Walia Harkamal, Morota Gota. Predicting longitudinal traits derived from high-throughput phenomics in contrasting environments using genomic legendre polynomials and B-Splines. G3 Genes Genom. Genet. 2019;9(10):3369–3380. doi: 10.1534/g3.119.400346. ISSN 2160-1836. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Pauli Duke, Andrade-Sanchez Pedro, Carmo-Silva A Elizabete, Gazave Elodie, French Andrew N., Heun John, Hunsaker Douglas J., Lipka Alexander E., Setter Tim L., Strand Robert J., Thorp Kelly R., Wang Sam, White Jeffrey W., Gore Michael A. Field-based high-throughput plant phenotyping reveals the temporal patterns of quantitative trait loci associated with stress-responsive traits in cotton. G3 Genes Genom. Genet. 2016;6(4):865–879. doi: 10.1534/g3.115.023515. ISSN 2160-1836. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Baba Toshimi, Momen Mehdi, Campbell Malachy T., Walia Harkamal, Morota Gota. Multi-trait random regression models increase genomic prediction accuracy for a temporal physiological trait derived from high-throughput phenotyping. PLoS One. 2020;15(2):1–17. doi: 10.1371/journal.pone.0228118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Nicolas Morales, Anche Mahlet T., Kaczmar Nicholas S., Lepak Nicholas, Ni Pengzun, Romay Maria Cinta, Santantonio Nicholas, Buckler Edward S., Gore Michael A., Mueller Lukas A., Robbins Kelly R. Spatio-temporal modeling of high-throughput multispectral aerial images improves agronomic trait genomic prediction in hybrid maize. Genetics. 2024;227(1) doi: 10.1093/genetics/iyae037. ISSN 1943-2631. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Baker R.L., Leong W.F., Welch S., Weinig C. Mapping and predicting non-linear Brassica rapa growth phenotypes based on Bayesian and frequentist complex trait estimation. G3 Genes Genom. Genet. 2018;8(4):1247–1258. doi: 10.1534/g3.117.300350. ISSN 2160-1836. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Perez-Valencia Diana Marcela, Rodriguez-Alvarez Maria Xose, Boer Martin P., Eeuwijk Fred A van. A one-stage approach for the spatio-temporal analysis of high-throughput phenotyping data. J. Agric. Biol. Environ. Stat. 2024:1–23. doi: 10.1007/s13253-024-00642-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Teressa Temesgen, Semahegn Zigale, Bejiga Tamirat. Multi environments and genetic-environmental interaction (gxe) in plant breeding and its challenges: a review article. Int. J. Res. Stud. Agric. Sci. 2021;7(4):11–18. doi: 10.20431/2454-6224.0704002. [DOI] [Google Scholar]
  • 76.Hawinkel Stijn, Meyer Sam De, Maere Steven. Spatial regression models for field trials: a comparative study and new ideas. Front. Plant Sci. 2022;13 doi: 10.3389/fpls.2022.858711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Dormann Carsten F., Elith Jane, Bacher Sven, Buchmann Carsten, Carl Gudrun, Carré Gabriel, Marquéz Jaime R García, Gruber Bernd, Lafourcade Bruno, Leitão Pedro J., et al. Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography. 2013;36(1):27–46. doi: 10.1111/j.1600-0587.2012.07348.x. [DOI] [Google Scholar]
  • 78.Babyak Michael A. What you see May not be what you get: a brief, nontechnical introduction to overfitting in regression-type models. Biopsychosoc. Sci. Med. 2004;66(3):411–421. doi: 10.1097/01.psy.0000127692.23278.a9. [DOI] [PubMed] [Google Scholar]
  • 79.Prakash Abhinav, Tuo Rui, Ding Yu. The temporal overfitting problem with applications in wind power curve modeling. Technometrics. 2023;65(1):70–82. doi: 10.1080/00401706.2022.2069158. [DOI] [Google Scholar]
  • 80.Hong Liang, Kuffner Todd A., Martin Ryan. On overfitting and post-selection uncertainty assessments. Biometrika. 2018;105(1):221–224. doi: 10.1093/biomet/asx083. [DOI] [Google Scholar]
  • 81.Fithian William, Sun Dennis, Taylor Jonathan. Optimal inference after model selection. arXiv preprint arXiv:1410.2597. 2014 doi: 10.48550/arXiv.1410.2597. [DOI] [Google Scholar]
  • 82.Gutenkunst Ryan N., Waterfall Joshua J., Casey Fergal P., Brown Kevin S., Myers Christopher R., Sethna James P. Universally sloppy parameter sensitivities in systems biology models. PLoS Comput. Biol. 2007;3(10) doi: 10.1371/journal.pcbi.0030189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Shumway Robert H., Stoffer David S., Stoffer David S. ume 3. Springer; 2000. (Time Series Analysis and its Applications). [DOI] [Google Scholar]
  • 84.Nesteruk Sergey, Shadrin Dmitrii, Kovalenko Vladislav, Rodríguez-Sánchez Antonio, Somov Andrey. 2020 IEEE 29th International Symposium on Industrial Electronics (ISIE) 2020. Plant growth prediction through intelligent embedded sensing; pp. 411–416. [DOI] [Google Scholar]
  • 85.Adak Alper, DeSalvio Aaron J., Arik Mustafa A., Murray Seth C. Field-based high-throughput phenotyping enhances phenomic and genomic predictions for grain yield and plant height across years in maize. G3 Genes Genom. Genet. 2024;14(7) doi: 10.1093/g3journal/jkae092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Xu Yuhang, Li Yehua, Qiu Yumou. Growth dynamics and heritability for plant high-throughput phenotyping studies using hierarchical functional data analysis. Biom. J. 2021;63(6):1325–1341. doi: 10.1002/bimj.202000315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Hewamalage Hansika, Ackermann Klaus, Bergmeir Christoph. Forecast evaluation for data scientists: common pitfalls and best practices. Data Min. Knowl. Discov. 2023;37(2):788–832. doi: 10.1007/s10618-022-00894-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Matsumura K., Gaitan C.F., Sugimoto K., Cannon A.J., Hsieh W.W. Maize yield forecasting by linear regression and artificial neural networks in jilin, China. J. Agric. Sci. 2015;153(3):399–410. doi: 10.1017/S0021859614000392. [DOI] [Google Scholar]
  • 89.Lee Joon-Woo, Moon Taewon, Son Jung-Eek. Development of growth estimation algorithms for hydroponic bell peppers using recurrent neural networks. Horticulturae. 2021;7(9):284. doi: 10.3390/horticulturae7090284. [DOI] [Google Scholar]
  • 90.Alhnaity Bashar, Pearson Simon, Leontidis Georgios, Kollias Stefanos. International Symposium on Advanced Technologies and Management for Innovative Greenhouses: Greensys2019 1296. 2019. Using deep learning to predict plant growth and yield in greenhouse environments; pp. 425–432. [DOI] [Google Scholar]
  • 91.Gong Liyun, Yu Miao, Jiang Shouyong, Cutsuridis Vassilis, Pearson Simon. Deep learning based prediction on greenhouse crop yield combined tcn and rnn. Sensors. 2021;21(13) doi: 10.3390/s21134537. ISSN 1424-8220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Alhnaity Bashar, Kollias Stefanos, Leontidis Georgios, Jiang Shouyong, Schamp Bert, Pearson Simon. An autoencoder wavelet based deep neural network with attention mechanism for multi-step prediction of plant growth. Inf. Sci. 2021;560:35–50. doi: 10.1016/j.ins.2021.01.037. ISSN 0020-0255. [DOI] [Google Scholar]
  • 93.Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N., Kaiser Łukasz, Polosukhin Illia. Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17. Curran Associates Inc; Red Hook, NY, USA: 2017. Attention is all you need; pp. 6000–6010. ISBN 9781510860964. [DOI] [Google Scholar]
  • 94.Auger-Méthé Marie, Newman Ken, Cole Diana, Empacher Fanny, Gryba Rowenna, King Aaron A., Leos-Barajas Vianey, Mills Flemming Joanna, Nielsen Anders, Petris Giovanni, Thomas Len. A guide to state–space modeling of ecological time series. Ecol. Monogr. 2021;91(4) doi: 10.1002/ecm.1470. [DOI] [Google Scholar]
  • 95.Shadrin Dmitrii, Podladchikova Tatiana, Ovchinnikov George, Pavlov Artem, Pukalchik Mariia, Somov Andrey. 2020 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) 2020. Kalman filtering for accurate and fast plant growth dynamics assessment; pp. 1–6. [DOI] [Google Scholar]
  • 96.Viaud Gautier, Chen Yuting, Cournède Paul-Henry. Full bayesian inference in hidden markov models of plant growth. Ann. Appl. Stat. 2022;16(4):2352–2368. doi: 10.1214/21-AOAS1594. [DOI] [Google Scholar]
  • 97.Shibata S., Mizuno R., Mineno H. Semisupervised deep state-space model for plant growth modeling. Plant Phenomics. 2020:2020. doi: 10.34133/2020/4261965. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Lipton Zachary C. The mythos of model interpretability: in machine learning, the concept of interpretability is both important and slippery. ACM Queue. 2018;16(3):31–57. doi: 10.1145/3236386.3241340. [DOI] [Google Scholar]
  • 99.O’Mahony Laura. University of Limerick; 2025. Interpretability and the Nature of Latent Representations in Deep Neural Networks. PhD thesis. [Google Scholar]
  • 100.Räuker Tilman, Ho Anson, Casper Stephen, Hadfield-Menell Dylan. 2023 Ieee Conference on Secure and Trustworthy Machine Learning (Satml) IEEE; 2023. Toward transparent ai: a survey on interpreting the inner structures of deep neural networks; pp. 464–483. [DOI] [Google Scholar]
  • 101.Ian T Jolliffe and Jorge Cadima. Principal component analysis: a review and recent developments. Phil. Trans. Math. Phys. Eng. Sci., 374(2065):20150202, 20152016. doi:10.1098/rsta.2015.0202. [DOI] [PMC free article] [PubMed]
  • 102.Schiestl-Aalto Pauliina, Kulmala Liisa, Mäkinen Harri, Nikinmaa Eero, Mäkelä Annikki. Cassia – a dynamic model for predicting intra-annual sink demand and interannual growth variation in scots pine. New Phytol. 2015;206(2):647–659. doi: 10.1111/nph.13275. [DOI] [PubMed] [Google Scholar]
  • 103.Fan Xing-Rong, Kang Meng-Zhen, Heuvelink Ep, de Reffye Philippe, Hu Bao-Gang. A knowledge-and-data-driven modeling approach for simulating plant growth: a case study on tomato growth. Ecol. Model. 2015;312:363–373. doi: 10.1016/j.ecolmodel.2015.06.006. ISSN 0304-3800. [DOI] [Google Scholar]
  • 104.Yue Shi, Han Liangxiu, Zhang Xin, Sobeih Tam, Gaiser Thomas, Thuy Nguyen Huu, Behrend Dominik, Srivastava Amit Kumar, Halder Krishnagopal, Ewert Frank. Deep learning meets process-based models: a hybrid approach to agricultural challenges. arXiv preprint arXiv:2504.16141. 2025 doi: 10.48550/arXiv.2504.16141. [DOI] [Google Scholar]
  • 105.Godin Christophe, Sinoquet Hervé. Functional–structural plant modelling. New Phytol. 2005;166(3):705–708. doi: 10.1111/j.1469-8137.2005.01445.x. [DOI] [PubMed] [Google Scholar]
  • 106.Vos J., Evers J.B., Buck-Sorlin G.H., Andrieu B., Chelle M., de Visser P.H.B. Functional–structural plant modelling: a new versatile tool in crop science. J. Exp. Bot. 2009;61(8):2101–2115. doi: 10.1093/jxb/erp345. ISSN 0022-0957. [DOI] [PubMed] [Google Scholar]
  • 107.Coussement Jonas R., De Swaef Tom, Lootens Peter, Roldán-Ruiz Isabel, Steppe Kathy. Introducing turgor-driven growth dynamics into functional–structural plant models. Ann. Bot. 2018;121(5):849–861. doi: 10.1093/aob/mcx144. ISSN 0305-7364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Evers Jochem B. Springer; Netherlands, Dordrecht: 2016. Simulating Crop Growth and Development Using Functional-Structural Plant Modeling; pp. 219–236. ISBN 978-94-017-7291-4. [DOI] [Google Scholar]
  • 109.Wang Xiujuan, Hua Jing, Kang Mengzhen, Wang Haoyu, Reffye Philippe. Functional–structural plant model ”greenlab”: a state-of-the-art review. Plant Phenomics. 2024;6:118. doi: 10.34133/plantphenomics.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Yin Hoon Chew, Wenden Bénédicte, Flis Anna, Mengin Virginie, Taylor Jasper, Davey Christopher L., Tindal Christopher, Thomas Howard, Ougham Helen J., de Reffye Philippe, Stitt Mark, Williams Mathew, Muetzelfeldt Robert, Halliday Karen J., Millar Andrew J. Multiscale digital Arabidopsis predicts individual organ and whole-organism growth. Proc. Natl. Acad. Sci. 2014;111(39):E4127–E4136. doi: 10.1073/pnas.1410238111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Parasurama Sriram, Banan Darshi, Yun Kyungdahm, Doty Sharon, Kim Soo-Hyung. Bridging time-series image phenotyping and functional–structural plant modeling to predict adventitious root system architecture. Plant Phenomics. 2023;5:127. doi: 10.34133/plantphenomics.0127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Nagano Shogo, Moriyuki Shogo, Wakamori Kazumasa, Mineno Hiroshi, Fukuda Hirokazu. Leaf-movement-based growth prediction model using optical flow analysis and machine learning in plant factory. Front. Plant Sci. 2019;10 doi: 10.3389/fpls.2019.00227. ISSN 1664-462X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Huang Changjin, Wang Zilu, Quinn David, Suresh Subra, Jimmy Hsia K. Differential growth and shape formation in plant organs. Proc. Natl. Acad. Sci. 2018;115(49):12359–12364. doi: 10.1073/pnas.1811296115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Mosca Gabriella, Adibi Milad, Strauss Soeren, Runions Adam, Sapala Aleksandra, Smith Richard S. Springer International Publishing; Cham: 2018. Modeling Plant Tissue Growth and Cell Division; pp. 107–138. ISBN 978-3-319-99070-5. [DOI] [Google Scholar]
  • 115.Kierzkowski Daniel, Runions Adam, Vuolo Francesco, Strauss Sören, Lymbouridou Rena, Routier-Kierzkowska Anne-Lise, Wilson-Sánchez David, Jenke Hannah, Galinha Carla, Mosca Gabriella, Zhang Zhongjuan, Canales Claudia, Ioio Raffaele Dello, Huijser Peter, Smith Richard S., Tsiantis Miltos. A growth-based framework for leaf shape development and diversity. Cell. 2019;177(6):1405–1418.e17. doi: 10.1016/j.cell.2019.05.011. ISSN 0092-8674. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Lindenmayer Aristid. Mathematical models for cellular interactions in development i. filaments with one-sided inputs. J. Theor. Biol. 1968;18(3):280–299. doi: 10.1016/0022-5193(68)90079-9. ISSN 0022-5193. [DOI] [PubMed] [Google Scholar]
  • 117.Prusinkiewicz Przemyslaw, Lindenmayer Aristid. Springer Science & Business Media; 2012. The Algorithmic Beauty of Plants. [DOI] [Google Scholar]
  • 118.Prusinkiewicz Przemyslaw, Cieslak Mikolaj, Ferraro Pascal, Hanan Jim. Springer International Publishing; Cham: 2018. Modeling Plant Development with L-Systems; pp. 139–169. ISBN 978-3-319-99070-5. [DOI] [Google Scholar]
  • 119.Allen M.T., Prusinkiewicz P., DeJong T.M. Using l-systems for modeling source–sink interactions, architecture and physiology of growing trees: the l-peach model. New Phytol. 2005;166(3):869–880. doi: 10.1111/j.1469-8137.2005.01348.x. [DOI] [PubMed] [Google Scholar]
  • 120.DeJong T.M., Da Silva D., Negron C., Cieslak M., Prusinkiewicz P. The l-almond model: a functional-structural virtual tree model of almond tree architectural growth, carbohydrate dynamics over multiple years. Int. Sympos. Modell. Fruit Res. Orchard Manag. 2015;1160:43–50. doi: 10.17660/ActaHortic.2017.1160.7. [DOI] [Google Scholar]
  • 121.DeJong Theodore. Simulating fruit tree growth, structure, and physiology using l-systems. Crop Sci. 2022;62(6):2091–2106. doi: 10.1002/csc2.20847. [DOI] [Google Scholar]
  • 122.Rokhana Rika, Herulambang Wiwiet, Indraswari Rarasmaya. 2020 International Electronics Symposium (IES) 2020. Machine learning and polynomial – l system algorithm for modeling and simulation of glycine max (l) merrill growth; pp. 463–467. [DOI] [Google Scholar]
  • 123.Yang Le, Wu Panpan, Zuo Zhengkang, Long Lan, Shi Junlin, Liu Yutang. Eroots: a three-dimensional dynamic growth model of rice roots coupled with soil. Biosyst. Eng. 2024;244:122–133. doi: 10.1016/j.biosystemseng.2024.06.002. ISSN 1537-5110. [DOI] [Google Scholar]
  • 124.Hitti Yasmeen, Buzatu Ionelia, Del Verme Manuel, Lefsrud Mark, Golemo Florian, Durand Audrey. Growspace: a reinforcement learning environment for plant architecture. Comput. Electron. Agric. 2024;217 doi: 10.1016/j.compag.2024.108613. ISSN 0168-1699. [DOI] [Google Scholar]
  • 125.Mußmann Marc, Hofstadler Daniel Nicolas, von Mammen Sebastian. Volume ALIFE 2024: Proceedings of the 2024 Artificial Life Conference of Artificial Life Conference Proceedings. 2024. An agent-based, interactive simulation model of root growth; p. 7. [DOI] [Google Scholar]
  • 126.Shunsuke Sakurai, Hideaki Uchiyama, Atshushi Shimada, Taniguchi Rin-Ichiro. INSTICC; 2019. Plant growth prediction using convolutional lstm; pp. 105–113. (Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (5: VISAPP)). ISBN 978-989-758-354-4. [DOI] [Google Scholar]
  • 127.Wang Chunying, Pan Weiting, Song Xubin, Yu Haixia, Zhu Junke, Liu Ping, Xiang Li. Predicting plant growth and development using time-series images. Agronomy. 2022;12(9) doi: 10.3390/agronomy12092213. ISSN 2073-4395. [DOI] [Google Scholar]
  • 128.Kingma Diederik P., Welling Max. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114. 2013 doi: 10.48550/arXiv.1312.6114. [DOI] [Google Scholar]
  • 129.Yasrab Robail, Zhang Jincheng, Smyth Polina, Pound Michael P. Predicting plant growth from time-series data using deep learning. Remote Sens. 2021;13(3) doi: 10.3390/rs13030331. ISSN 2072-4292. [DOI] [Google Scholar]
  • 130.Kim Taehyeon, Lee Sang-Ho, Kim Jong-Ok. A novel shape based plant growth prediction algorithm using deep learning and spatial transformation. IEEE Access. 2022;10:37731–37742. doi: 10.1109/ACCESS.2022.3165211. [DOI] [Google Scholar]
  • 131.Li Yinglun, Zhan Xiaohai, Liu Shouyang, Lu Hao, Jiang Ruibo, Guo Wei, Chapman Scott, Ge Yufeng, Solan Benoit, Ding Yanfeng, Baret Frédéric. Self-supervised plant phenotyping by combining domain adaptation with 3d plant model simulations: application to wheat leaf counting at seedling stage. Plant Phenomics. 2023;5:41. doi: 10.34133/plantphenomics.0041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Lu Yuzhen, Chen Dong, Olaniyi Ebenezer, Huang Yanbo. Generative adversarial networks (gans) for image augmentation in agriculture: a systematic review. Comput. Electron. Agric. 2022;200 doi: 10.1016/j.compag.2022.107208. ISSN 0168-1699. [DOI] [Google Scholar]
  • 133.Miranda Miro, Drees Lukas, Roscher Ribana. 2022 26th International Conference on Pattern Recognition (ICPR) 2022. Controlled multi-modal image generation for plant growth modeling; pp. 5118–5124. [DOI] [Google Scholar]
  • 134.Drees Lukas, Junker-Frohn Laura Verena, Kierdorf Jana, Roscher Ribana. Temporal prediction and evaluation of brassica growth in the field using conditional generative adversarial networks. Comput. Electron. Agric. 2021;190 doi: 10.1016/j.compag.2021.106415. ISSN 0168-1699. [DOI] [Google Scholar]
  • 135.Leonhardt Johannes, Drees Lukas, Jung Peter, Roscher Ribana. In: Pattern Recognition. Andres Björn, Bernard Florian, Cremers Daniel, Frintrop Simone, Goldlücke Bastian, Ihrke Ivo., editors. Springer International Publishing; Cham: 2022. Probabilistic biomass estimation with conditional generative adversarial networks; pp. 479–494. ISBN 978-3-031-16788-1. [Google Scholar]
  • 136.Drees Lukas, Demie Dereje T., Paul Madhuri R., Leonhardt Johannes, Seidel Sabine J., Döring Thomas F., Roscher Ribana. Data-driven crop growth simulation on time-varying generated images using multi-conditional generative adversarial networks. Plant Methods. 2024;20(1) doi: 10.1186/s13007-024-01205-3. ISSN 1746-4811. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 137.Debbagh Mohamed, Liu Yixue, Zheng Zhouzhou, Jiang Xintong, Sun Shangpeng, Lefsrud Mark. Artificial Neural Networks in Pattern Recognition. Springer Nature Switzerland; Cham: 2024. Generative plant growth simulation from sequence-informed environmental conditions; pp. 308–319. ISBN 978-3-031-71602-7. [DOI] [Google Scholar]
  • 138.Akhavizadegan Faezeh, Ansarifar Javad, Wang Lizhi, Huber Isaiah, Archontoulis Sotirios V. A time-dependent parameter estimation framework for crop modeling. Sci. Rep. 2021;11(1) doi: 10.1038/s41598-021-90835-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Suliansyah Irfan, Arlius Feri, Chaniago Irawati, Xuan Nguyen Thi Thanh, Quang Pham Van, et al. Literature review crop modeling and introduction a simple crop model. J. Appl. Agric. Sci. Technol. 2023;7(3):197–216. doi: 10.55043/jaast.v7i3.123. [DOI] [Google Scholar]
  • 140.Ntuchu Kephe Priscilla, Ayisi Kingsley Kwabena, Petja Brilliant Mareme. Challenges and opportunities in crop simulation modelling under seasonal and projected climate change scenarios for crop production in South Africa. Agric. Food Secur. 2021;10(1):1–24. doi: 10.1186/s40066-020-00283-5. [DOI] [Google Scholar]
  • 141.Roscher Ribana, Roth Lukas, Stachniss Cyrill, Walter Achim. Data-centric digital agriculture: a perspective. arXiv preprint arXiv:2312.03437. 2023 doi: 10.48550/arXiv.2312.03437. [DOI] [Google Scholar]
  • 142.Harandi Negin, Vandenberghe Breght, Vankerschaver Joris, Depuydt Stephen, Van Messem Arnout. How to make sense of 3d representations for plant phenotyping: a compendium of processing and analysis techniques. Plant Methods. 2023;19(1):60. doi: 10.1186/s13007-023-01031-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 143.Hodač Ladislav, Karbstein Kevin, Kösters Lara, Rzanny Michael, Christian Wittich Hans, Boho David, Šubrt David, Mäder Patrick, Wäldchen Jana. Deep learning to capture leaf shape in plant images: validation by geometric morphometrics. Plant J. 2024;120(4):1343–1357. doi: 10.1111/tpj.17053. [DOI] [PubMed] [Google Scholar]
  • 144.Mostafa Sakib, Mondal Debajyoti, Karim Panjvani, Kochian Leon, Stavness Ian. Explainable deep learning in plant phenotyping. Front. Artific. Intellig. 2023;6 doi: 10.3389/frai.2023.1203546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 145.Xu Mingle, Kim Hyongsuk, Yang Jucheng, Fuentes Alvaro, Meng Yao, Yoon Sook, Kim Taehyun, Park Dong Sun. Embracing limited and imperfect training datasets: opportunities and challenges in plant disease recognition using deep learning. Front. Plant Sci. 2023;14 doi: 10.3389/fpls.2023.1225409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 146.Danilevicz Monica F., Bayer Philipp E., Nestor Benjamin J., Bennamoun Mohammed, Edwards David. Resources for image-based high-throughput phenotyping in crops and data sharing challenges. Plant Physiol. 2021;187(2):699–715. doi: 10.1093/plphys/kiab301. ISSN 0032-0889. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 147.Li Jiajia, Chen Dong, Qi Xinda, Li Zhaojian, Huang Yanbo, Morris Daniel, Tan Xiaobo. Label-efficient learning in agriculture: a comprehensive review. Comput. Electron. Agric. 2023;215 doi: 10.1016/j.compag.2023.108412. ISSN 0168-1699. [DOI] [Google Scholar]
  • 148.Zhu Rongsheng, Sun Kai, Yan Zhuangzhuang, Yan Xuehui, Yu Jianglin, Shi Jia, Hu Zhenbang, Jiang Hongwei, Xin Dawei, Zhang Zhanguo, et al. Analysing the phenotype development of soybean plants using low-cost 3d reconstruction. Sci. Rep. 2020;10(1):7055. doi: 10.1038/s41598-020-63720-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 149.Lei Li, Zhang Qin, Huang Danfeng. A review of imaging techniques for plant phenotyping. Sensors. 2014;14(11):20078–20111. doi: 10.3390/s141120078. ISSN 1424-8220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 150.Andrés-Hernández Liliana, Halimi Razlin Azman, Mauleon Ramil, Mayes Sean, Baten Abdul, King Graham J. Challenges for fair-compliant description and comparison of crop phenotype data with standardized controlled vocabularies. Database. 2021 doi: 10.1093/database/baab028. baab028, 05 2021. ISSN 1758-0463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 151.Schramowski Patrick, Stammer Wolfgang, Teso Stefano, Brugger Anna, Herbert Franziska, Shao Xiaoting, Luigs Hans-Georg, Mahlein Anne-Katrin, Kersting Kristian. Making deep neural networks right for the right scientific reasons by interacting with their explanations. Nat. Mach. Intell. 2020;2(8):476–486. doi: 10.1038/s42256-020-0212-3. [DOI] [Google Scholar]
  • 152.Cheng Weijun, Ma Tengfei, Wang Xiaoting, Wang Gang. Anomaly detection for internet of things time series data using generative adversarial networks with attention mechanism in smart agriculture. Front. Plant Sci. 2022;13 doi: 10.3389/fpls.2022.890563. 2022. ISSN 1664-462X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 153.Tuli Shreshth, Casale Giuliano, Jennings Nicholas R. Tranad: deep transformer networks for anomaly detection in multivariate time series data. arXiv preprint arXiv:2201.07284. 2022 doi: 10.48550/arXiv.2201.07284. [DOI] [Google Scholar]
  • 154.Xu Jiehui, Wu Haixu, Wang Jianmin, Long Mingsheng. Anomaly transformer: time series anomaly detection with association discrepancy. arXiv preprint arXiv:2110.02642. 2021 doi: 10.48550/arXiv.2110.02642. [DOI] [Google Scholar]
  • 155.Li Wei, Zhu Deli, Wang Qing. A single view leaf reconstruction method based on the fusion of resnet and differentiable render in plant growth digital twin system. Comput. Electron. Agric. 2022;193 doi: 10.1016/j.compag.2022.106712. ISSN 0168-1699. [DOI] [Google Scholar]
  • 156.Mitsanis Christos, Hurst William, Tekinerdogan Bedir. A 3d functional plant modelling framework for agricultural digital twins. Comput. Electron. Agric. 2024;218 doi: 10.1016/j.compag.2024.108733. ISSN 0168-1699. [DOI] [Google Scholar]

Articles from Plant Phenomics are provided here courtesy of Nanjing Agricultural University

RESOURCES