Skip to main content
PLOS One logoLink to PLOS One
. 2026 Feb 17;21(2):e0341904. doi: 10.1371/journal.pone.0341904

Image-based machine learning models for customized soil moisture management

Yooan Kim 1, Taehyeong Kim 2,3, Sungyong Lee 4, Suhyun Lee 5, Kyo Suh 2,5,6,7,*
Editor: Gobinath Ravindran8
PMCID: PMC12912613  PMID: 41701764

Abstract

Crop growth can vary even under the same cultivation conditions, highlighting the limitations of conventional smart farming systems that apply uniform treatments to all crops. These average-based approaches often overlook individual plant needs shaped by microenvironments and physiological differences, resulting in inefficient resource use and reduced yields. While crop-specific management is important for improving productivity, there is a lack of non-invasive methods to monitor soil conditions at the individual plant level. This study presents an AI-based system that combines soil sensors and image analysis to support customized moisture management. Transplanted wild-simulated ginseng was used as a model crop. RGB images of the soil surface were collected with sensor data from different depths (3 cm, 10 cm, and 15 cm) to capture vertical moisture distribution. Several deep learning models were evaluated for predicting surface moisture, with DenseNet121 showing the highest accuracy (R² = 97.3%, RMSE = 4.14). For deeper soil layers, the random forest regression model achieved the best performance (R² = 90.6%, RMSE = 4.97), effectively capturing nonlinear moisture dynamics. These results demonstrate that surface image data can be used to estimate soil moisture non-invasively and enable data-driven, plant-specific crop management systems. This research provides a foundation for data-driven, customized, soil moisture management in smart farming. Future studies should focus on validating the model across diverse crops and soil types, and integrate additional spectral data to enhance its robustness and scalability.

1. Introduction

Climate-smart agriculture is a technological innovation that transforms the paradigm of crop cultivation by minimizing the impact of natural environments on agriculture [1]. It simultaneously addresses climate change while enhancing agricultural productivity and minimizing environmental impacts through efficient resource utilization [25]. Smart agriculture, including precision agriculture and smart farms, plays a pivotal role in achieving the goals of climate-smart agriculture [610]. Smart farms focusing on controlled environment agriculture initially emerged as first-generation remote management systems. These systems provided monitoring and control functions for agricultural environments [1113]. However, they were limited to simple environmental control. Subsequently, smart farms evolved into second-generation intelligent systems that autonomously controlled environments, used accumulated data, and adapt flexibly to climate change [14,15]. The next generation of smart farms is projected to evolve beyond simple environmental control by incorporating sophisticated systems capable of customized management tailored to the growth status of individual crops [16,17]. Smart farming technology is a critical tool for minimizing resource waste and maximizing productivity in complex agricultural environments affected by climate change [18,19]. It is also essential foundational technology for realizing climate-smart agriculture [1,20].

In the evolution of smart farming, water resource management technology has gained significant attention [21,22]. As climate change intensifies water scarcity, efficient water resource management in agriculture has become an increasingly critical challenge [23]. Moisture management in crop cultivation is a key factor in determining productivity and quality [24]. For example, inadequate moisture management can cause direct economic losses by reducing plant growth, increasing pest and disease occurrence, and decreased yields [23]. Precise moisture management requires accurately understanding crop water requirements based on growth stages and environmental conditions, and then providing the appropriate amount of water for each plant at the right time [24,25]. Since each soil layer plays a distinct role in crop growth, understanding soil moisture variation by depth is also essential [26]. The upper layer reflects rapid irrigation and evaporation changes [27], the middle layer is the primary zone for root activity, and the bottom layer functions as a reservoir that buffers excess moisture [26,28].

Despite the critical importance of precise moisture management, most current smart farm systems employ uniform management approaches that fail to consider individual crop characteristics. These systems have primarily focused on optimizing environmental factors such as temperature, humidity, and lighting [29,30]. However, plants grown under identical conditions vary in their physiological traits and growth rates, making standardized approaches insufficient [31]. With climate change and resource scarcity intensifying, maximizing productivity while using resources efficiently has become a necessity [23]. Customized management optimizes each plant’s growth while minimizing resource waste [32,33]. Thus, it serves as a key strategy for agricultural productivity and sustainability [1,20]. Smart farm systems incorporating AI models can analyze complex agricultural data to determine optimized conditions for individual plants. Such models can potentially overcome conventional limitations [34]. Implementing such customized management requires innovative technological solutions that accurately monitor individual plant conditions and translate the data into actionable management strategies.

Most artificial intelligence research related to smart farms incorporating various digital technologies has primarily focused on uniform management at the field or greenhouse level [33,35,36]. Limited studies exist on customized cultivation management systems that consider the growth status of individual crops [3739]. Previous studies have mainly emphasized developing models that manage overall soil conditions or facility environments based on information from large-scale monitoring systems using satellite data or drones [4042]. However, these models have limitations in reflecting the detailed growth differences of individual crops. A remote-management, IoT-based smart irrigation system has been developed to support soil-moisture management at both field and greenhouse scales, demonstrating meaningful gains in water-resource conservation and crop productivity [43]. Beyond systems for managing moisture that affects crop growing environments, various studies have conducted moisture monitoring using remote sensing. Satellite-based remote sensing has also been applied to improve watering management at both regional and farm scales. In addition, small-sized satellite systems offering high spatial and temporal resolution have been utilized to assess crop water use in relatively narrow areas [44] Their approach mitigated limitations of wide coverage and low resolution inherent in conventional satellite data. However, despite the reduction in target scale, limitations remain. Detailed management is necessary at the individual plant level to implement customized cultivation management based on the growth status of individual crops. For customized cultivation to become more widespread, simpler methods are required beyond installing sensors on each plant. The purpose of this study is to evaluate and compare the performance of various image-based machine learning algorithms for customized management based on the growth status of individual crops using image data associated with sensor data. Transplanted wild-simulated ginseng, a moisture-sensitive and high-value crop, was selected as the target plant. RGB images of the soil surface were collected during the early germination stage, along with sensor-based soil moisture data from surface, middle, and deep layers. Multiple deep learning models, including ResNet and EfficientNet, were evaluated for surface-level prediction. Additionally, this study applied regression models by soil depth for moisture estimation, including polynomial regression, support vector regression, and random forest regression. This approach enables non-destructive, image-based estimation of soil moisture, facilitating individualized crop management strategies tailored to the specific environmental and physiological conditions of each plant.

2. Materials and methods

2.1. Customized growth management setup

2.1.1. Target crop: Wild-simulated Ginseng.

High value-added agricultural and forest products command premium prices relative to their yield, highlighting the need for customized management to maximize productivity. The market price per gram of wild-simulated ginseng (WSG) exceeds that of cultivated ginseng by more than 20-fold, highlighting its prominence as a high-value forest product [45,46]. Most WSG is marketed after more than seven years of cultivation. The quality and price are determined by various factors such as region, shape, weight, and other attributes that affect market value [4749]. Since the survival rate of WSG decreases dramatically from the fourth year onward, effective management strategies are needed to improve survival rates and produce high-quality ginseng [50,51]. To establish legal standards for WSG and clearly distinguish it from cultivated ginseng, we investigated changes in WSG characteristics after a short post-treatment period following pre-germination transplantation. Accordingly, five-year-old WSG cultivated in Pyeongchang, Gangwon Province, was selected and transplanted into experimental soil as the target forest product for this study.

2.1.2. Experiment setup.

Changes in major saponin components were tracked to develop a standardized cultivation technique. This post-treatment technology confirmed that the composition and weight of WSG (three years or older) could increase to levels comparable to six-year-old ginseng within a short period of 90 days (maximum 150 days) (SNU R&DB Foundation, 2023 [52]). Image data collection associated with sensor data for soil moisture management was conducted while performing WSG cultivation using methods that are identical to the patented post-treatment technology. The WSG specimens used in the experiment had an average weight of 0.954g, with diverse weights ranging from 0.401g to 2.488g to ensure experimental representativeness. The transplantation of WSG was performed in indoor facilities, with each specimen placed in an individual pot. The transplanted WSG underwent post-treatment in the Genesis Farmbot system (Fig 1).

Fig 1. Experimental setup for wild-simulated ginseng cultivation.

Fig 1

The left image shows the pot specifications used for cultivation, measuring 25 cm in height and 9 cm in diameter, which provides an optimal environment for root development. The right image illustrates the cultivation system of using a Genesis FarmBot (bed size: 1.5 m × 3 m, height: 0.5 m), equipped with an X-Y-Z gantry.

WSG seedlings were grown under controlled greenhouse conditions following transplantation. The image-based analysis in this study focused on the early growth period after transplantation, a stage during which wild-simulated ginseng is highly sensitive to moisture conditions. Moisture stress during this period has a direct influence on seedling survival and early physiological responses, including changes in saponin composition [53]. To ensure consistent rooting conditions across plants, we used a standardized peat moss and perlite substrate in a 6:4 ratio, which is commonly adopted for ginseng seedling cultivation [54,55]. This substrate composition is known to provide high porosity, low bulk density, and a favorable balance between aeration and water-holding capacity. Such physical characteristics create a uniform and supportive rooting environment, which is particularly important during the moisture-sensitive early growth stage of WSG.

Following transplantation, the soil moisture content was adjusted to 50 ± 10% of saturation, based on two-point calibration (oven-dry and fully saturated conditions), to provide a suitable starting environment. In this study, soil moisture content refers to the relative saturation of the soil, expressed as a percentage with 0% for oven-dry soil and 100% for saturation. Hereafter, this is referred to simply as soil moisture content. Fig 2 presents representative images of the soil surface used in the experiment. The presence of mixed materials such as peat moss and perlite made it difficult to visually distinguish differences in soil moisture levels. All other environmental conditions, except for moisture, remained consistent across groups. This experiment aimed to quantitatively assess the effects of early-stage moisture stress on WSG growth after transplantation and to identify optimal moisture management strategies for this critical stage. All experiments were conducted within institutional facilities, and no specific permits were required for this research.

Fig 2. Sample soil surface images at different moisture levels.

Fig 2

These images are provided solely for qualitative illustration, and no spatial or physical scale is implied (not to scale).

2.2. Soil moisture sensor system

The core hardware components of the soil moisture sensing system included an Arduino Mega 2560, an ESP8266 WiFi module, and capacitive soil moisture sensors (Table 1, Fig 3). We managed all sensor data for customized soil moisture management of WSG through an integrated data acquisition system. We developed data collection and transmission code using the Arduino IDE, enabling each sensor to transmit data to a cloud server (ThingSpeak) via WiFi. This system allows for real-time or interval-based of soil moisture conditions. By continuously monitoring moisture levels, the system supports optimized moisture management tailored to the specific needs of the crop.

Table 1. Hardware and Software Development Environment.

Type Hardware/Software Function and role
Hardware Arduino 2560 Arduino Board
Capacitive Soil
Moisture Sensor
Moisture sensor for measuring water content in the soil
ESP8266 & adapter Microcontroller with built-in WiFi capabilities
Software Arduino IDE Software platform used for writing, compiling, and uploading code to Arduino boards
ThingSpeak Cloud server enabling data collection and storage for IoT applications

Fig 3. Soil moisture sensing system and sensor placement.

Fig 3

Soil moisture status was measured using capacitive moisture sensors integrated into the Arduino-based moisture management system. Each sensor was installed individually for each WSG plant to enable customized monitoring and control of growth conditions. To avoid signal distortion, which is commonly caused by corrosion in resistive sensors, we used HW-390 capacitive soil moisture sensors. The capacitive soil moisture sensors used in this study detect changes in dielectric permittivity associated with varying soil moisture levels based on frequency domain reflectometry (FDR). These readings are converted into electrical signals ranging from 0 to 1023. Higher values correspond to lower moisture conditions. To convert the raw sensor outputs into normalized moisture (%) values, we applied a two-point calibration procedure using oven-dry (0%) and fully saturated (100%) reference states. The average sensor readings were 542 ± 54 at 0% and 344 ± 50 at 100%. In this study, the moisture (%) values represent a normalized index derived from the calibrated sensor signals rather than direct volumetric or gravimetric water content.

For the soil-depth analysis experiment, we installed three moisture sensors per plant at distinct heights above the pot base: 15 cm (top layer), 10 cm (middle layer), and 3 cm (deep layer). These layer designations were adopted to align with conventional soil profile terminology and to provide clearer interpretation of the soil structure within the pot.

2.3. Image-based prediction system

2.3.1. Image collection.

We collected image data associated with the sensor system using a readily available RGB camera (iPhone 12 Pro), simulating practical, real-world conditions. To minimize variations in image brightness caused by sunlight we captured all images between 1:00 PM and 3:00 PM. The image acquisition was conducted under natural lighting conditions within the greenhouse, rather than in a controlled lighting setup, to evaluate the feasibility of predicting soil moisture levels using AI in actual cultivation environments. As outlined in Table 2 and Fig 4, images were continuously captured from immediately after transplantation until the soil surface became partially obscured by plant foliage during early growth.

Table 2. Camera Setting for in situ image acquisition.
Focal Length 26 mm
Aperture F1.6
ISO Sensitivity Auto
Camera Orientation Portrait
Shooting Equipment Camera bracket utilized
Focusing Distance 200mm
Digital Image Dimensions 4032 x 3024 pixels
Camera Resolution 12MP
Fig 4. Camera Setting for in situ image acquisition.

Fig 4

2.3.2. Image preprocessing.

A hybrid sampling strategy combining data augmentation and oversampling was applied to mitigate class imbalance and improve the generalization performance of the model. The target number of samples was set based on the moisture level with the highest sample count. For moisture levels with fewer samples, we performed oversampling with replacement until the target was reached. We then applied data augmentation techniques, including rotation, translation, scaling, and flipping, to further diversify the training data. This preprocessing ensured that all moisture levels were represented with an equal number of samples, preventing the model from overfitting to any specific range. As a result, the model was able to learn more diverse visual patterns across different soil moisture conditions.

We processed images using OpenCV’s IMREAD function to convert raw pixel data into RGB matrices and resized each image to 224 × 224 pixels (Fig 5). To maintain model accuracy, we normalized pixel values to a range between 0 and 1, ensuring consistent input values across the dataset [56,57]. The RGB color space was preserved throughout preprocessing without conversion to grayscale or HSV. We chose this approach because the deep learning models used in this study (i.e., DenseNet121 and InceptionV3) were pre-trained on RGB images and optimized to extract informative features from color channels. Therefore, preserving the RGB format allowed the model to leverage its full capacity for feature extraction and contributed to stable performance without additional color space transformation.

Fig 5. Input and processing of soil images.

Fig 5

The original soil image on the left is converted into a three-channel RGB matrix. This process uses IMRED to decompose pixel values into red, green, and blue channels. The resulting matrix is input data for feature learning in deep learning models.

2.3.3. Customized soil moisture prediction model.

To develop a deep neural network (DNN) regression model for customized soil moisture management based on soil images, we used Python with the TensorFlow Keras framework. We adopted six pre-trained deep learning architectures—ResNet50 (He et al., 2016), EfficientNetB0 (Tan & Le, 2019), MobileNetV2 (Sandler et al., 2018), InceptionV3 (Szegedy et al., 2016), DenseNet121 (Huang et al., 2017), and NASNetMobile (Zoph et al., 2018). All were originally trained on the ImageNet dataset. Although these architectures are primarily designed for image classification, we adapted them for regression by modifying their final layers to output continuous values. We employed transfer learning, removing the original classification layers and adding fully connected layers to support regression, enabling prediction of continuous soil moisture values from images.

The model architecture consisted of four main blocks, as summarized in Table 3. A pre-trained convolutional neural network (Base Model) received input images with a shape of 224 × 224 × 3 and used weights trained on the ImageNet dataset. The output was a 7 × 7 × C feature map, where C denotes the number of feature channels, varying by architecture. To reduce parameters and prevent overfitting, we applied global average pooling to compress the 3D feature map into a 1D feature vector of length C. This process was followed by a dense layer with 1,024 neurons and a ReLU activation function to capture high-level representations. The final output layer consisted of a single neuron with a linear activation function for continuous soil moisture prediction. The number of trainable parameters varied by architecture, which affected both the training time and performance. Hyperparameters were selected through a trial-and-error process [5860]. The final configurations were optimized for regression performance. We selected these architectures for their suitability in soil moisture prediction. They are categorized into three types based on their design characteristics: (1) standard deep CNNs, (2) complex, high-capacity extractors, and (3) lightweight, efficiency-focused models.

Table 3. (Top) Comparison of pre-trained CNN architectures, including input/output shapes, number of features, and total parameters. All models take 224 × 224 × 3 input images and generate architecture-specific feature maps. (Bottom) Architectural overview of the soil moisture prediction model, consisting of a pre-trained CNN base followed by modified top layers.
Input Shape Output of Last Convolution Layer Number of Features Total Parameters
DenseNet121 224 × 224 × 3 7 x 7 x 1024 1024 8.0M
EfficientNetB0 7 x 7 x 1280 1280 5.3M
InceptionV3 7 x 7 x 2048 2048 23.9M
MobileNet 7 x 7 x 1024 1024 4.2M
NASNetMobile 7 x 7 x 1056 1056 5.3M
ResNet50 7 x 7 x 2048 2048 25.6M

ResNet50, a deep and complex architecture, employs skip connections that enable stable training of deep networks and facilitate the extraction of complex patterns. Despite its effectiveness, it involves a relatively large number of parameters. DenseNet121 incorporates dense connectivity, which encourages feature reuse, alleviates the vanishing gradient problem, and improves memory efficiency through parameter sharing [61]. EfficientNetB0 achieves strong performance with fewer parameters by applying compound scaling, making it suitable for extracting soil image features efficiently. MobileNetV2, designed for lightweight applications, offers fast inference speed. However, its performance may slightly degrade when dealing with highly complex patterns [61,62]. InceptionV3 captures multi-scale features simultaneously, which is advantageous for identifying patterns of varying sizes. This benefit, however, comes at the cost of increased structural complexity. NASNetMobile, optimized through a neural architecture search, provides automatically designed models. Nevertheless, it may exhibit training instability, particularly on small datasets [63,64].

2.4. Depth-based soil moisture monitoring system

2.4.1. Data collection and preprocessing.

This study constructed an additional estimation model to predict subsurface soil moisture using top-view images. We added this model because measuring only the surface moisture is insufficient to capture the actual content in deeper layers. A depth-specific soil moisture prediction approach was also adopted to estimate the lower-layer moisture conditions. To eliminate the influence of plant uptake, we conducted the experiment using pots without planted crops. We applied water from the top using a standard irrigation method, which ensured consistent control of the watering process and accurate tracking of moisture changes at each depth. We installed soil moisture sensors at three heights above the pot base. The sensors were positioned 15 cm, 10 cm, and 3 cm above the base, corresponding to the top, middle, and deep soil layers, respectively. This configuration allowed for accurate collection of soil moisture data at multiple depths.

We performed data preprocessing to improve the model performance and enhance the accuracy of the predictions. We removed data points where all sensors recorded maximum values (100%) immediately after irrigation, because these oversaturated conditions were not informative. Next, we filtered out error-prone data showing unrealistic distributions, such as when the upper or middle sensor reported higher moisture levels than the bottom sensor. We also excluded cases where the upper layer recorded higher moisture than the middle layer. Finally, we eliminated outliers using the interquartile range (IQR) method. Values outside the Q1–Q3 range were removed to increase reliability and integrity. The initial dataset consisted of 888 samples. After preprocessing, 519 valid samples remained and were used for model training and evaluation.

2.4.2. Soil moisture prediction model by depth.

We designed the model to predict deep-layer moisture based on readings from the top and middle layers. The input variables were the soil moisture measurements obtained from the top and middle layers (15 cm and 10 cm above the pot base), and the target variable was defined as the soil moisture measured in the deep layer at 3 cm above the base. This configuration enabled the estimation of lower-layer moisture levels from surface and mid-layer conditions.

Several nonlinear regression models were evaluated to predict soil moisture variations across different soil depths. The soil-water characteristic curve (SWCC), which describes the relationship between soil matric potential and moisture content, typically follows a nonlinear pattern. This characteristic served as the basis for applying nonlinear regression methods to improve the accuracy of predictions. To identify the most effective model, we compared six nonlinear regression approaches: polynomial regression, support vector regression (SVR), bagging decision trees, random forest regression, k-nearest neighbors (KNN) regression, and gradient boosting. These models were selected for their ability to capture complex, nonlinear patterns. By applying and comparing these models on depth-based soil moisture data, the goal was to select the optimal model that best reflects the physical properties and hydrodynamic behavior of the soil.

Polynomial regression incorporates polynomial terms of the independent variable to model higher-order nonlinear relationships. This approach was used to capture the complex moisture distribution patterns observed across different soil depths. SVR models nonlinear relationships between soil moisture and depth in a high-dimensional space using kernel functions. Because of its robustness against outliers, SVR is particularly suitable for handling uncertainties in field-measured data [65,66]. Bagging decision trees and random forest regression, which are both tree-based ensemble models, offer the advantage of capturing complex interactions among the various factors influencing soil moisture dynamics. In particular, random forest models estimate nonlinear relationships by aggregating multiple decision trees, enhancing prediction stability [67,68]. KNN regression was applied based on the assumption that nearby measurement points share similar soil physical properties, such as porosity and particle size distribution [69,70]. Moisture values recorded at spatially close locations are likely to influence predictions. Thus, gradient boosting was selected for its ability to incrementally learn complex nonlinear variations in soil moisture. This model also allows for the evaluation of the relative importance of different predictive factors.

We optimized hyperparameters for the nonlinear regression models by balancing model complexity and generalization performance, following Occam’s razor. This principle suggests that among competing hypotheses that explain the same phenomenon, the simplest one is preferable. In the context of machine learning, Occam’s razor serves as a guideline for selecting models that are neither overly complex nor prone to overfitting. Evaluating predictive performance requires not only accuracy on training data but also the ability to generalize to unseen data. To assess generalization, the cross-validated R² score was used as the primary evaluation metric. A model was considered well-generalized when the difference between its training R² and cross-validation R² remained within 10% [71,72]. Each nonlinear regression model was tuned using its optimal set of hyperparameters and then applied to the soil moisture prediction task. The final model was selected based on comparative performance, with the goal of identifying the approach that best captured the physical characteristics and hydrodynamic behavior of the soil.

3. Experiments

3.1. Model training and experimental setup

We implemented the deep learning models using TensorFlow (Keras) on a Windows platform equipped with an Intel i9-11900K CPU, NVIDIA RTX 3050 GPU (Santa Clara, CA, USA), and 32GB of RAM.

We divided the image dataset into training, validation, and test sets in a 6:2:2 ratio. This allocation was based on theoretical justifications from prior studies. To ensure reliable evaluation, at least 20% of the total data was reserved for testing, which meets the minimum requirement for assessing statistical significance at the 95% confidence level [73]. When applying the hold-out validation approach, allocating 20–30% of the data for validation has been shown to be effective for assessing generalization performance and detecting overfitting [74]. According to Vapnik-Chervonenkis (VC) theory, assigning 60% of the data to the training set provides a sufficient sample size for stable learning [75]. This split ratio addressed the bias-variance trade-off by ensuring adequate sizes for validation and test sets, reducing variance in performance estimates, and providing stable evaluation metrics [76].

During the training, we tested three learning rates (0.00001, 0.0001, and 0.001) and up to 100 epochs to compare model performance. Model optimization was guided by standard regression metrics, which were monitored during training to evaluate model convergence and stability. We treated the number of epochs as a critical parameter for balancing performance and efficiency. When no meaningful improvement appeared, we reduced the number of epochs to avoid unnecessary computation. To prevent overfitting, the optimal number of epochs for each model was determined based on the point at which performance stabilized during early training. All deep learning models were trained using the mean squared error (MSE) loss function and the Adam optimizer.

We also split the depth-based dataset into training, validation, and test sets in a 6:2:2 ratio. To improve model stability and convergence speed, all soil moisture values were normalized to a range between 0 and 1 using a MinMax scaler. A batch size of 32 was used in the data loader configuration. Nonlinear regression models for predicting moisture content at each soil depth were evaluated under various hyperparameter settings. For random forest regression, the number of trees varied from 20 to 100. SVR was tested with different regularization values (C) ranging from 0.1 to 1000. For KNN regression, we adjusted the number of neighbors from 3 to 11 to identify the optimal configuration.

3.2. Performance metrics

Evaluating regression model performance was essential to assess how well the models fit the data. In this study, MSE was selected as the loss function for training deep neural network (DNN) regression models. MSE penalizes large errors more heavily, encouraging the model to focus on reducing them (Goodfellow et al., 2016). We used standard regression metrics to evaluate model performance, including MSE, MAE, RMSE, and R². These metrics assess the average error, sensitivity to outliers, and explained variance, respectively. The formulas are shown in Equation (1) through (4) MAE calculates the average of absolute differences between predicted and actual values, providing a direct measure of typical error size. RMSE, computed as the square root of MSE, estimates the average prediction error with greater sensitivity to large deviations. R² measures the proportion of variance in the observed data explained by the model and is widely used for predictive accuracy. We applied these evaluation metrics to assess both accuracy and suitability of the models, and to select the optimal predictive approach for soil moisture estimation.

MAE=1n\nolimitsi=1n|yiyi^| (1)
MSE=1n\nolimitsi=1n(yiyi^)2 (2)
RMSE=1n\nolimitsi=1n(yiyi^)2 (3)
R2=1i=1n(yiyi^)2i=1n(yiyi)2 (4)

where y represents the measured value, y^ is the predicted value by the models, y is the average value of the measured value, and n is the number of samples.

4. Results

4.1. Performance of image-based prediction system

4.1.1. Comparison of deep learning models.

We compared six deep learning architectures (ResNet50, EfficientNetB0, MobileNetV2, InceptionV3, DenseNet121, and NASNetMobile) based on their performance metrics and model complexity. As shown in Table 4, the upper part summarizes each model’s R², RMSE, learning rate and the number of training epochs. Fig 6 provides a graphical comparison illustrating the number of parameters, MAE, and R² for each architecture. Models positioned in the lower-left quadrant of the graph demonstrate superior performance, characterized by fewer parameters, lower computational cost, and reduced MAE. The size of each bubble indicates the difference between the validation MAE and test MAE, with smaller bubbles reflecting stronger generalization performance.

Table 4. Performance comparison of deep learning architectures for soil moisture prediction. The upper section summarizes R², RMSE, learning rate, and epoch for six models. DenseNet121 showed the highest prediction accuracy, while NASNetMobile performed the poorest.
RMSE LR Epoch
DenseNet121 97.3 4.14 0.00001 68
EfficientNetB0 96.9 4.99 0.001 69
InceptionV3 96.9 4.20 0.001 69
ResNet50 96.6 8.81 0.0001 52
MobileNetV2 95.8 5.20 0.001 79
NASNetMobile 89.7 8.39 0.00001 100
Fig 6. Comparison of model complexity, MAE, and generalization gap.

Fig 6

Bubble size denotes generalization gap and labels indicate R². DenseNet121 achieved the best trade-off between accuracy and efficiency, while NASNetMobile showed poor generalization despite low complexity.

Among the evaluated models, DenseNet121 demonstrated the best overall performance for soil moisture prediction, achieving the lowest MAE (2.07) and highest validation R² (97.3%), despite using only 8.08 million parameters. Its dense connectivity structure, which enables direct feature reuse across layers, enhanced information flow and preserved spatial resolution, allowing the model to effectively capture fine-grained texture variations in soil images. This architectural efficiency led to minimal prediction errors, making DenseNet121 the optimal model for practical image-based soil moisture management. EfficientNetB0 also performed strongly, with an MAE of 2.14 and validation R² of 96.9%, using only 5.36 million parameters. Its configuration (learning rate: 0.001, 69 epochs) achieved a strong balance between model complexity and predictive power, highlighting its suitability for lightweight applications.

ResNet50 and InceptionV3 required significantly more computational resources, approximately 25.6 million and 23.9 million parameters, respectively, which is about 4.5 to 4.8 times greater than EfficientNetB0. Despite the increased complexity, both models demonstrated comparable performance in soil moisture prediction. ResNet50 was optimized with a learning rate of 0.0001 and 52 training epochs. Its skip connection structure helped mitigate the vanishing gradient problem in deep layers, allowing the model to effectively distinguish distinct texture regions within the soil images. InceptionV3, trained with a learning rate of 0.001 over 69 epochs, employed convolutional filters of varying sizes to capture the soil structure at multiple spatial scales. Both models achieved high predictive performance, with R² values of 96.6% for ResNet50 and 96.9% for InceptionV3. The small differences between validation MAE and test MAE indicated strong generalization capabilities. Although InceptionV3 incurred higher computational cost, its accuracy and reliability suggest that it remains a promising candidate for image-based soil moisture prediction.

NASNetMobile had a parameter count similar to EfficientNetB0 (5.35 million), but it yielded the poorest performance among the six architectures, with an MAE exceeding 4.5 and an R² of 89.7%. Even though it used a low learning rate of 0.00001 and trained for 100 epochs, the model underperformed. This limitation likely stemmed from the nature of neural architecture search (NAS), which can cause overfitting to specific patterns while overlooking broader feature generalization. MobileNetV2, with 3.57 million parameters, features a lightweight architecture and achieved an R² of 95.8% and an MAE of 2.98 when trained with a learning rate of 0.001 for 79 epochs. The model effectively captured textures and patterns across global regions of the soil images. Notably, MobileNetV2 demonstrated the smallest gap between validation and test MAE, indicating superior generalization compared to the other architectures. However, although NASNetMobile and MobileNetV2 are designed for resource-constrained environments and they exhibit low computational demand, their performance in soil moisture prediction was relatively limited.

4.1.2. Actual vs. predicted in deep learning models.

The predictive performance of each deep learning architecture was further evaluated by comparing the actual and predicted soil moisture values. DenseNet121 and InceptionV3 demonstrated the highest levels of agreement, showing minimal prediction error across most data points. DenseNet121 achieved the most accurate results by leveraging its dense connectivity structure for efficient feature reuse, while InceptionV3 benefited from multi-scale convolutional filters that captured diverse spatial features. ResNet50 and EfficientNetB0 also achieved strong predictive accuracy. Their architectural strengths (e.g., skip connections in ResNet50 and compound scaling in EfficientNetB0) contributed to improved texture recognition and consistent moisture estimation. These trends are visually illustrated in Fig 7, which presents scatter plots of actual versus predicted values for each model.

Fig 7. Comparison of actual and predicted values across the six DNN regression models: (a) EfficientNetB0, (b) MobileNetV2, (c) NASNetMobile, (d) ResNet50, (e) DenseNet121, (f) InceptionV3. Each plot demonstrates the alignment between actual and predicted values, with the red dashed line representing perfect prediction.

Fig 7

Models with tighter clustering around the perfect prediction line, such as DenseNet121 and InceptionV3, exhibit higher prediction accuracy, whereas models like NASNetMobile show greater deviations.

4.2. Depth-wise analysis

4.2.1. Soil moisture characteristics by depth.

This study analyzed the variation in soil moisture content at different soil depths. As expected, deeper soil layers exhibited higher moisture levels, primarily due to reduced exposure to evaporation. Table 5 presents the descriptive statistics for soil moisture measured at three depths: deep, middle, and top. A total of 519 samples were collected for each depth, including the mean, standard deviation, minimum, maximum, and quartiles. The deep layer showed the highest mean soil moisture content, measured as relative saturation, with a mean value of 65.33% and a standard deviation of 15.01%. In contrast, the middle layer recorded a significantly lower mean of 41.00%, with a standard deviation of 11.74%. The top layer exhibited the lowest mean moisture content at 21.26% and the lowest variability (SD = 8.29%), indicating more stable moisture conditions near the surface. These results reveal a clear stratification of moisture levels by depth, with a notable decrease from deep to top. This pattern is attributed to water movement following irrigation, where moisture gradually percolates downward, accumulating more in the lower layers. The observed distribution confirms that water supplied from the top infiltrates the soil and contributes significantly to moisture retention in the deep layer.

Table 5. Descriptive Statistics of Soil Moisture by Depth.
Deep Middle Top
count 519 519 519
mean 65.33 ± 15.01 41 ± 11.74 21.26 ± 8.29
min 23 5 0
25% 55 29 16
50% 70 44 22
75% 77 51 27
max 93 64 42

4.2.2. Regression models for depth-wise prediction.

Simple linear regression using soil moisture at the upper or middle depths exhibited only limited explanatory power for the deep layer, thereby underscoring the need for more advanced nonlinear approaches. Various nonlinear regression models were evaluated to predict soil moisture at different depths (Table 6, Fig 8, left). The two tree-based models (i.e., random forest regression and bagging decision tree regression) showed the best performance, with test R² of 0.906 and 0.903, respectively. The random forest model maintained a consistent gap between the training R² (0.942) and cross-validation R² (0.842), remaining within 10%, indicating strong generalization capability. Additionally, the standard deviation of the prediction error remained below 0.005 across different depths and moisture conditions, demonstrating stable predictive performance. The bagging decision tree model showed a 7% difference between training and cross-validation R², confirming its robustness and low risk of overfitting. These results suggest that ensemble learning approaches based on decision trees, such as random forest, effectively capture the complex nonlinear relationships inherent in soil moisture dynamics. In contrast, polynomial regression and KNN regression yielded lower cross-validation R² (0.762 and 0.816, respectively), indicating limited accuracy. This finding suggests that these models were less effective in modeling the nonlinear variability of soil moisture compared to tree-based methods.

Table 6. Performance comparison of six machine learning regression models for soil moisture prediction. Reported R² and RMSE correspond to the best cross-validated hyperparameters. Random forest and bagging decision tree regression achieved the best predictive performance, followed by gradient boosting. Polynomial regression and KNN regression showed moderate performance, while SVR with an RBF kernel yielded the lowest. All models were evaluated on the same dataset containing multi-depth soil moisture measurements.
RMSE Optimal Hyperparameters
Polynomial Regression 80.0 7.24 Degree = 2
Support Vector Regression 76.7 9.75 C = 10, RBF kernel
Bagging Decision Tree 90.3 5.05 n_estimators = 20
Random Forest Regression 90.6 4.97 n_estimators = 50
KNN Regression 86.8 5.89 n_neighbors = 3
Gradient Boosting 88.6 5.45 n_estimators = 50
Fig 8. Performance evaluation and comparison of six machine learning models for soil moisture prediction.

Fig 8

Left panels display R² scores across different hyperparameter values. Red shaded areas represent the standard deviation from k-fold cross-validation on the test set, and blue lines indicate training performance. Right panel is a scatter plot of actual vs. predicted soil moisture on the testing data using the optimized hyperparameter. The black dashed line indicates the 1:1 reference, and tighter clustering of points along this line reflects better predictive performance.

Model hyperparameters were selected based on Occam’s razor principle and the criterion that the difference between training and cross-validation R² remains within 10%. For polynomial regression, the polynomial degree varied from 2 to 5 to evaluate performance. Across all degrees, the test, training, and cross-validation R² values remained consistent at approximately 80.0%, 77.6%, and 76.2%, respectively. The performance gap between training and cross-validation was maintained at 1.8%, which is well below the 10% threshold. Therefore, based on Occam’s razor, the second-degree polynomial was selected as the optimal hyperparameter due to its minimal computational cost. For SVR, regularization parameter C varied from 0.1 to 1000. When using the radial basis function (RBF) kernel, the model achieved the highest R² of 76.7% at C = 10. The RBF kernel outperformed both the linear and polynomial kernels, which yielded a lower R² of 71.6% and 59.4%, respectively. However, as C increased beyond 10, performance declined, with the R² dropping to 73.9% at C = 1000.

To optimize the hyperparameters for the tree-based models (i.e., bagging decision tree, random forest, and gradient boosting), the number of trees was adjusted and evaluated. For the bagging decision tree model, the number of estimators varied from 10 to 100. While the highest cross-validation R² of 84.2% was observed at 100 trees, the performance at 20 trees was only marginally lower at 83.9%, a difference of just 0.3%. Although the training–validation gap was slightly smaller at 100 trees, we selected 20 trees as the optimal hyperparameter considering the minimal performance difference, computational cost, and model complexity. For random forest, the number of trees was adjusted between 50 and 300. The highest cross-validation R² of 84.6% was achieved with 200 trees; however, the performance at 50 trees was nearly equivalent at 84.1%, with only a 0.5% difference. For the gradient boosting model, while the training R² increased from 90.0% to 95.1% as the number of trees increased, the cross-validation R² showed a declining trend (from 83.2% to 81.0%) indicating signs of overfitting. All three models demonstrated comparable or even better performance with a smaller number of trees. Therefore, based on Occam’s razor and computational efficiency, the final number of estimators was set to 20 for the bagging decision tree model, and 50 for both random forest and gradient boosting.

For the KNN regression model, the number of neighbors varied from 3 to 11 for performance evaluation. The highest predictive performance was observed when the number of neighbors was set to 3, yielding a training R² of 90.5%, a cross-validation R² of 81.6%, and a test R² of 86.8 As the number of neighbors increased, all performance metrics gradually declined. Based on these results, 3 neighbors were selected as the optimal hyperparameter setting.

Fig 8 (right) illustrates the predictive performance of six nonlinear regression models (i.e., polynomial regression, SVR, decision tree regression, random forest regression, KNN regression, and gradient boosting) by comparing the actual and predicted soil moisture values. random forest regression exhibited the highest level of agreement between the predicted and actual values, indicating the strongest predictive performance. A majority of the data points were tightly clustered around the ideal prediction line, suggesting that the model could predict soil moisture with high accuracy and consistency across various soil conditions, while minimizing prediction errors. Similarly, the decision tree regression model demonstrated strong predictive accuracy, with most data points located near the prediction line. This finding indicates that the model effectively captured the nonlinear characteristics of the data. In contrast, the predicted values from the Polynomial Regression model were more widely scattered, reflecting a moderate level of agreement with the actual values. Several data points exhibited large prediction errors, suggesting that the model struggled to fully capture the nonlinear variability in the dataset. SVR showed the weakest predictive performance, with many data points deviating significantly from the ideal line. This result indicates that the model was unable to effectively learn the underlying data patterns and exhibited poor generalization in soil moisture prediction.

5. Discussion

5.1. Customized soil moisture management at plant level

To better understand the unique growth environments and moisture requirements of individual non-timber forest products, this study developed a non-invasive, customized soil moisture management model by integrating image-based machine learning with sensor data. The approach was designed to overcome limitations of conventional smart farming systems that rely on uniform control based on average environmental conditions. These systems often overlook the unique growth needs of individual plants. Although sensor-based systems have advanced environmental monitoring capabilities, they are typically applied in a generalized manner across entire cultivation areas, making it difficult to reflect plant-level variability. To address these limitations, this study proposes an alternative approach that enables customized management by predicting soil moisture using only surface images, thereby minimizing the need for dense sensor deployment. The study was conducted at the seedling stage of wild-simulated ginseng, where soil remains visually accessible before being covered by foliage. While this stage-specific focus limits generalization across the entire growth cycle, it nonetheless underscores a practical contribution, as irrigation management is particularly critical during early development.

5.2. Comparison with previous approaches

The model was trained using RGB images of the soil surface paired with sensors at multiple depths that collected soil moisture values. This pairing allowed the model to learn the visual features associated with moisture distribution, enabling it to estimate soil moisture levels even in locations where sensors were not installed. This cross-referenced learning strategy bridges the gap between detailed sensing and scalable application, offering a practical solution for precise soil monitoring in resource-limited environments.

Previous sensor-based studies, such as Lloret et al. (2021), required dense sensor networks to capture subsurface moisture variability. In contrast, our image–sensor paired approach reduces sensor dependency by leveraging visual cues. Similarly, while recent remote-sensing efforts have mapped soil moisture at field or regional scales using machine learning with satellite or UAV imagery (Peng et al., 2024; Lamichhane et al., 2025) and UAV-based RGB sensing at local scales (Hernandez et al., 2025), our results demonstrate that RGB imagery alone can yield robust predictions at the plant scale. Deep learning models effectively extracted moisture-related features from soil textures, while ensemble-based regression models such as Random Forest captured nonlinear patterns in deeper soil layers. Notably, upper-layer moisture information emerged as a reliable predictor of subsurface moisture conditions. This finding suggests that surface imagery can, under certain conditions, serve as a proxy for deeper soil insights, offering a practical alternative in cases where intrusive sensing is infeasible. While deep-layer estimation is constrained by physical limitations, its potential value in sensor-limited environments warrants further exploration.

5.3. Practical contributions and agricultural implications

Beyond moisture estimation, the core contribution of this research lies in enabling plant-level management strategies based on non-destructive image data. Traditional cultivation practices often apply uniform treatments across crops, overlooking microenvironmental variation and individual physiological differences. Previous plant-based irrigation studies have emphasized the importance of individualized management (Jones, 2004), but practical methods have remained limited due to sensing costs and scalability issues. Our approach addresses this gap by allowing image-based assessment of plant-specific water status, supporting adaptive decisions tailored to each plant’s developmental stage and soil environment.

Similar to recent imaging studies that evaluated leaf or canopy water status using hyperspectral and phenotyping platforms (Furbank & Tester, 2011; Ge et al., 2018), our framework demonstrates that even simple RGB imagery can provide actionable insights at the plant scale. Rather than merely improving irrigation efficiency, our approach promotes customized crop care, contributing to sustainable and resource-efficient cultivation. This aligns with the broader objectives of climate-smart agriculture (Lipper et al., 2014) by offering an accessible, sensor-free alternative for precision monitoring, particularly valuable in areas with limited infrastructure or water availability. Moreover, this system is especially suitable for managing high-value or sensitive crops, where minimal physical disturbance is essential. In future applications, the image-based approach shows potential for broader implementation. In this study, we primarily employed smartphone-based imaging to ensure accessibility for farmers, as smartphones are the most readily available and easy-to-use cameras in practice. This choice highlights the feasibility of applying our method for plant-level precision management in smallholder or high-value crop settings. However, direct scale-up to large-scale farming systems presents challenges, as smartphone-based imaging is not practical for monitoring extensive areas. To address this limitation, future implementations may require integration with UAV platforms, distributed camera networks, or hybrid approaches combining imaging with sensor-based monitoring. Thus, while our current framework demonstrates clear value at the individual plant level, particularly for sensitive or high-value crops, its large-scale application will necessitate complementary technologies.

5.4. Limitations and future directions

Nevertheless, the study has certain limitations. We trained the model on data from a single crop, wild-simulated ginseng, under controlled laboratory conditions, which may affect generalizability. In addition, the current system relies exclusively on RGB images, which could limit sensitivity to subtle soil features. Future research should explore how to apply this method to diverse crop types and field environments and incorporate additional spectral data such as thermal or multispectral imagery to enhance prediction robustness. In addition, developing and testing indices that directly reflect changes in soil moisture could provide more sensitive predictors beyond raw spectral inputs. Furthermore, integrating this system with plant growth models could enhance its ability to support adaptive, plant-level crop management strategies throughout the entire growth cycle, enabling more context-aware decisions beyond irrigation alone.

At the same time, the reliance on smartphone-based imaging, while intentionally chosen to ensure accessibility for farmers, presents challenges when considering scalability. Smartphones are the most available and user-friendly tools for smallholders or high-value crop settings, but their use is not practical for monitoring large agricultural areas. Thus, while our framework demonstrates clear value at the individual plant level, its implementation in large-scale systems will likely require complementary technologies such as UAV platforms, distributed camera networks, or hybrid imaging–sensor solutions. Another constraint is the seedling-stage focus of our dataset. Although early-stage monitoring provides valuable insights when soil surfaces are visible, canopy closure at later growth phases limits direct soil observation. This necessitates further research of canopy-penetrating modalities, including multispectral or thermal imaging, and coupling with plant growth models to extend applicability throughout the crop cycle.

Although this study successfully developed depth-specific soil moisture models using sensor measurements collected at 15 cm, 10 cm, and 3 cm above the pot base, the surface-image–based component was applied only to the shallow layer. As a result, the present framework does not yet integrate surface imaging into an end-to-end system for predicting deeper soil moisture. Developing such an integrated approach remains a future goal, but would require corresponding ground-truth data for deeper layers as well as methods capable of addressing the substantial spatial heterogeneity of soil moisture conditions. Previous studies have similarly reported that machine learning based soil moisture models often exhibit reduced performance when applied across different locations or soil conditions unless multi-site calibration or domain adaptation techniques are employed [7780]. Future extensions of the framework will therefore require multi-environment datasets and modeling strategies that explicitly account for spatial heterogeneity in soil moisture dynamics.

Soil moisture at transplantation was adjusted to approximately 40–60% of saturation to provide a uniform starting condition, and subsequent irrigation and natural drying generated substantial variation throughout the experiment. The dataset therefore included a wide spectrum of soil moisture levels, allowing the model to learn from heterogeneous conditions rather than being confined to a narrow range. Nevertheless, systematic evaluation under more extreme or fluctuating regimes, as encountered in field environments, represents an important avenue for future research to strengthen robustness and generalizability.

Resolution also constitutes a limitation, as training images were standardized to 224 × 224 pixels. While this choice ensured compatibility with convolutional neural network architectures and facilitated efficient model training, the trade-offs among image resolution, computational cost, and predictive robustness remain insufficiently explored. Future studies should systematically examine these trade-offs, as different resolutions may prove more effective depending on crop type, imaging conditions, and computational resources. Estimating soil moisture in deeper layers remains an open challenge. Future research may explore whether surface-based approaches can provide meaningful insights into subsurface dynamics. These limitations highlight promising directions for further research and emphasize that while the present framework is suited for plant-level monitoring in controlled or high-value crop settings, its extension to large-scale agriculture will depend on methodological innovations and integration with complementary sensing technologies.

6. Conclusion

This study proposes a non-invasive soil moisture management model that integrates image-based machine learning with sensor data to support plant-specific management decisions. Among the evaluated deep learning models, DenseNet121 achieved the most accurate surface moisture prediction, while Random Forest Regression performed best for depth-wise estimation by capturing nonlinear moisture dynamics. The results demonstrate that surface image data can serve as a reliable proxy for subsurface soil conditions, reducing the need for extensive sensor deployment. This approach enables customized, plant-level crop management tailored to heterogeneous growing conditions, while minimizing physical disruption and operational costs. The system offers a scalable, cost-effective alternative to traditional uniform management strategies, particularly in the case of high-value or sensitive crops.

While the model showed strong performance, it was developed and tested under controlled conditions using a single crop. Future research should validate its applicability across diverse crop species, soil types, and field environments. Incorporating additional spectral data such as thermal or multispectral imagery may further improve model robustness and accuracy. Ultimately, this system lays the foundation for intelligent, adaptive crop management platforms, where AI not only predicts environmental conditions but also guides real-time, data-driven cultivation strategies across the entire crop lifecycle.

Data Availability

All relevant data are within the manuscript.

Funding Statement

This research was supported by the Artificial Intelligence Institute, Seoul National University, and the Youlchon Foundation (Nongshim Corporation and its affiliated companies) through the Youlchon AI Scholarship Grant in 2023 (to Y.K.). This research was supported by the Industry–Academia–Research Collaboration Activation Support (R&D) Program (Project No. 2025-25422608 to K.S.) funded by the Ministry of Science and ICT (MSIT), Republic of Korea.

References

  • 1.Tilahun G, Bantider A, Yayeh D. Empirical and methodological foundations on the impact of climate-smart agriculture on food security studies: Review. Heliyon. 2024;11(1):e41242. doi: 10.1016/j.heliyon.2024.e41242 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Ahmed MM, Sharif U, Raza A, Safdar M, Ali W, Asim M, et al. Climate-smart agriculture for resilience and profitability. Sustainable practices for agriculture and marketing convergence. IGI Global; 2024. p. 27–50. [Google Scholar]
  • 3.Mehraj J, Ahmed L, Qayoom K, Rasool F, Bhat R, Fayaz U, et al. Climate smart agriculture: A roadmap to sustainable food security. Int J Environ Clim Chang. 2024;14(7):234–53. [Google Scholar]
  • 4.Panda SS, Banerjee S, Alok S. Application of Industry 4.0 Technologies in Climate-Smart Agricultural Practices. Fostering Sustainable Development in the Age of Technologies. Emerald Publishing Limited; 2023. p. 289–302. doi: 10.1108/978-1-83753-060-120231020 [DOI] [Google Scholar]
  • 5.Srivastav AL, Dhyani R, Ranjan M, Madhav S, Sillanpää M. Climate-resilient strategies for sustainable management of water resources and agriculture. Environ Sci Pollut Res Int. 2021;28(31):41576–95. doi: 10.1007/s11356-021-14332-4 [DOI] [PubMed] [Google Scholar]
  • 6.Assimakopoulos F, Vassilakis C, Margaris D, Kotis K, Spiliotopoulos D. The Implementation of “Smart” Technologies in the Agricultural Sector: A Review. Information. 2024;15(8):466. doi: 10.3390/info15080466 [DOI] [Google Scholar]
  • 7.Singh AK, Balabaygloo BJ, Bekee B, Blair SW, Fey S, Fotouhi F. Smart connected farms and networked farmers to improve crop production, sustainability and profitability. Frontiers in Agronomy. 2024;6. [Google Scholar]
  • 8.Ma W, Rahut DB. Climate-smart agriculture: adoption, impacts, and implications for sustainable development. Mitig Adapt Strateg Glob Change. 2024;29(5). doi: 10.1007/s11027-024-10139-z [DOI] [Google Scholar]
  • 9.Fuentes-Peñailillo F, Gutter K, Vega R, Silva GC. Transformative Technologies in Digital Agriculture: Leveraging Internet of Things, Remote Sensing, and Artificial Intelligence for Smart Crop Management. Journal of Sensor and Actuator Networks. 2024;13(4):39. doi: 10.3390/jsan13040039 [DOI] [Google Scholar]
  • 10.Konfo TRC, Chabi ABP, Amoussouga Gero A, Lagnika C, Avlessi F, Biaou G, et al. Recent climate-smart innovations in agrifood to enhance producer incomes through sustainable solutions. Journal of Agriculture and Food Research. 2024;15:100985. doi: 10.1016/j.jafr.2024.100985 [DOI] [Google Scholar]
  • 11.Mitchell CA. History of Controlled Environment Horticulture: Indoor Farming and Its Key Technologies. HortScience. 2022;57(2):247–56. doi: 10.21273/hortsci16159-21 [DOI] [Google Scholar]
  • 12.Boyd AP, Zankowski P, Wheeler R, Stokes-Draut JR, Chudnovsky Y, Ingram D, et al. Controlled environment agriculture: An opportunity to strengthen interagency research collaboration in the US government. PNAS Nexus. 2025;4(6):pgaf155. doi: 10.1093/pnasnexus/pgaf155 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Dohlman E, Maguire K, Davis WV, Husby M, Bovay J, Weber C. Trends, insights, and future prospects for production in controlled environment agriculture and agrivoltaics systems. 2024.
  • 14.Farhan M, Khujamshukurov NA, Fatima Z, Ahmad T, Zaid M, Anton K. The status of smart agriculture: A method to enhancing management. Smart Systems. CRC Press; 2025. p. 53–68. [Google Scholar]
  • 15.Sodhi GK, Jamwal P, editors. Smart Farming: Harnessing the Power of IoT for Agricultural Transformation. 2024 5th International Conference on Recent Trends in Computer Science and Technology (ICRTCST). IEEE; 2024. [Google Scholar]
  • 16.Dobre AE, Drăghici BG, Ciobanu B, Stan OP, Miclea LC, editors. Smart Agriculture: Farm Management through IoT with Predictive and Precision Monitoring. 2024 IEEE International Conference on Automation, Quality and Testing, Robotics (AQTR). IEEE; 2024. [Google Scholar]
  • 17.Haapala H, Kataja J, Pirttiniemi J, Sarvela K, Ludwig G, Appelgrén I, et al. How and why we built our Smart Farm. Suomen Maataloustieteellisen Seuran Tiedote. 2024;(42). doi: 10.33354/smst.143672 [DOI] [Google Scholar]
  • 18.Getahun S, Kefale H, Gelaye Y. Application of Precision Agriculture Technologies for Sustainable Crop Production and Environmental Sustainability: A Systematic Review. ScientificWorldJournal. 2024;2024:2126734. doi: 10.1155/2024/2126734 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Duguma AL, Bai X. How the internet of things technology improves agricultural efficiency. Artif Intell Rev. 2024;58(2). doi: 10.1007/s10462-024-11046-0 [DOI] [Google Scholar]
  • 20.Bank W. Climate-smart agriculture: World Bank; 2024 [updated February 26, 2024. Available from: https://www.worldbank.org/en/topic/climate-smart-agriculture
  • 21.Parra-López C, Ben Abdallah S, Garcia-Garcia G, Hassoun A, Trollman H, Jagtap S, et al. Digital technologies for water use and management in agriculture: Recent applications and future outlook. Agricultural Water Management. 2025;309:109347. doi: 10.1016/j.agwat.2025.109347 [DOI] [Google Scholar]
  • 22.Neophytides SP, Eliades M, Mavrovouniotis M, Papoutsa C, Papadavid G, Hadjimitsis DG. Improved water resources management for smart farming: a case study for Cyprus. Sci Rep. 2024;14(1):31751. doi: 10.1038/s41598-024-82797-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Ingrao C, Strippoli R, Lagioia G, Huisingh D. Water scarcity in agriculture: An overview of causes, impacts and approaches for reducing the risks. Heliyon. 2023;9(8):e18507. doi: 10.1016/j.heliyon.2023.e18507 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Al-Kaisi MM, Broner I. Crop water use and growth stages. Colorado State University Extension; 2014. [Google Scholar]
  • 25.Bounajra A, Guemmat KE, Mansouri K, Akef F. Towards efficient irrigation management at field scale using new technologies: A systematic literature review. Agricultural Water Management. 2024;295:108758. doi: 10.1016/j.agwat.2024.108758 [DOI] [Google Scholar]
  • 26.He J, Hantush MM, Kalin L, Rezaeianzadeh M, Isik S. A two-layer numerical model of soil moisture dynamics: Model development. J Hydrol (Amst). 2021;602:1–12. doi: 10.1016/j.jhydrol.2021.126797 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.United States Department of Agriculture SCS. National engineering handbook: Section 15, irrigation. Chapter 1: Soil-plant-water relationships. 210-VI, NEH 15-1, Edition ed: U.S. Department of Agriculture; 1991. [Google Scholar]
  • 28.Bai X, Fan S, Li R, Dai T, Li W, Ye S, et al. Estimating root zone soil moisture in farmland by integrating multi-source remote sensing data based on the water balance equation. Agricultural Water Management. 2025;314:109544. doi: 10.1016/j.agwat.2025.109544 [DOI] [Google Scholar]
  • 29.John M, Reddy T, Raddi PN, Dutta D, Sangeetha A, editors. Smart Farming Implementation using Arduino and LoRa. 2024 3rd International Conference on Artificial Intelligence For Internet of Things (AIIoT). IEEE: 2024. [Google Scholar]
  • 30.Tan YS, Chew LW, Tan YX, Tan SZ. IoT-based Smart Farming System. IJEMD-CSAI. 2024;3(1). doi: 10.54938/ijemdcsai.2024.03.1.270 [DOI] [Google Scholar]
  • 31.van Voorn GAK, Boer MP, Truong SH, Friedenberg NA, Gugushvili S, McCormick R, et al. A conceptual framework for the dynamic modeling of time-resolved phenotypes for sets of genotype-environment-management combinations: a model library. Front Plant Sci. 2023;14:1172359. doi: 10.3389/fpls.2023.1172359 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Ramin Shamshiri R, Kalantari F, C. Ting K, R. Thorp K, A. Hameed I, Weltzien C, et al. Advances in greenhouse automation and controlled environment agriculture: A transition to plant factories and urban agriculture. International Journal of Agricultural and Biological Engineering. 2018;11(1):1–22. doi: 10.25165/j.ijabe.20181101.3210 [DOI] [Google Scholar]
  • 33.Nautiyal M, Joshi S, Hussain I, Rawat H, Joshi A, Saini A, et al. Revolutionizing agriculture: A comprehensive review on artificial intelligence applications in enhancing properties of agricultural produce. Food Chem X. 2025;29:102748. doi: 10.1016/j.fochx.2025.102748 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Parashar A, Mabrouki J, Sharma J. AI and Smart Technologies for Smart Agriculture Environment. Advanced Systems for Environmental Monitoring, IoT and the application of Artificial Intelligence: Springer; 2024. p. 95–107. [Google Scholar]
  • 35.Hoseinzadeh S, Astiaso Garcia D. Ai-driven innovations in greenhouse agriculture: Reanalysis of sustainability and energy efficiency impacts. Energy Conversion and Management: X. 2024;24:100701. doi: 10.1016/j.ecmx.2024.100701 [DOI] [Google Scholar]
  • 36.Muhammed D, Ahvar E, Ahvar S, Trocan M, Montpetit M-J, Ehsani R. Artificial Intelligence of Things (AIoT) for smart agriculture: A review of architectures, technologies and solutions. Journal of Network and Computer Applications. 2024;228:103905. doi: 10.1016/j.jnca.2024.103905 [DOI] [Google Scholar]
  • 37.Service EPR. Artificial intelligence in the agri-food sector: Applications, risks and impacts. Scientific Foresight Unit (STOA); 2023. March. Contract No.: PE 734.711. [Google Scholar]
  • 38.Akkem Y, Biswas SK, Varanasi A. Smart farming using artificial intelligence: A review. Engineering Applications of Artificial Intelligence. 2023;120:105899. doi: 10.1016/j.engappai.2023.105899 [DOI] [Google Scholar]
  • 39.Soussi A, Zero E, Ouammi A, Zejli D, Zahmoun S, Sacile R. Smart greenhouse farming: a review towards near zero energy consumption. Discov Cities. 2025;2(1). doi: 10.1007/s44327-025-00096-w [DOI] [Google Scholar]
  • 40.Roy A, Murtugudde R, Narvekar P, Sahai AK, Ghosh S. Remote sensing and climate services improve irrigation water management at farm scale in Western-Central India. Sci Total Environ. 2023;879:163003. doi: 10.1016/j.scitotenv.2023.163003 [DOI] [PubMed] [Google Scholar]
  • 41.Terlizzi I, Toson F, Chiodini S, Bettanini C, Colombatti G, Morbidini F, et al., editors. Remote Sensing Measurements for Efficient Crop Irrigation Management. 2023 IEEE International Workshop on Metrology for Agriculture and Forestry (MetroAgriFor); 2023. 6–8 Nov. 2023. [Google Scholar]
  • 42.Elsayed H, Ibrahim H, Farag H, Sobeih MF. Remote sensing-based techniques for water management in small-scale farms in arid climate. Water Supply. 2022;22(8):6692–714. doi: 10.2166/ws.2022.288 [DOI] [Google Scholar]
  • 43.Kadu R, Tiwar A, Jaurkar N, editors. Remotely Managed IoT Based Smart Irrigation System. 2024 International Conference on Emerging Systems and Intelligent Computing (ESIC). Bhubaneswar, India: 2024. 2024. [Google Scholar]
  • 44.Aragon B, Houborg R, Tu K, Fisher J, McCabe M. CubeSats Enable High Spatiotemporal Retrievals of Crop-Water Use for Precision Agriculture. Remote Sensing. 2018;10(12):1867. doi: 10.3390/rs10121867 [DOI] [Google Scholar]
  • 45.Han H, Jang K, Chung J. Selecting suitable sites for mountain ginseng (Panax ginseng) cultivation by using geographically weighted logistic regression. J Mt Sci. 2017;14(3):492–500. doi: 10.1007/s11629-016-4118-9 [DOI] [Google Scholar]
  • 46.Jeong B-G, Jung G-R, Kim MS, Moon H-G, Park S-J, Chun J. Ginsenoside contents and antioxidant activities of cultivated mountain ginseng (Panax ginseng C.A. Meyer) with different ages. Korean Journal of Food Preservation. 2019;26(1):90–100. doi: 10.11002/kjfp.2019.26.1.90 [DOI] [Google Scholar]
  • 47.Park HE, Lee S, Hyun S, Kim D, Marriott P, Choi H. Gas chromatography/mass spectrometry-based metabolic profiling and differentiation of ginseng roots according to cultivation age using variable selection. 2013. [DOI] [PubMed]
  • 48.Moon JW, Jang IB, Yu J, Jang IB, Seo SJ, Lee SW. Changes in Growth Characteristics, Biological Activity and Active Compound Contents in Ginseng of Different Ages. Korean Journal of Medicinal Crop Science. 2019;27(6):383–9. doi: 10.7783/kjmcs.2019.27.6.383 [DOI] [Google Scholar]
  • 49.Shaoqing C, Lin C, Jun W, editors. Discrimination among ginsengs of different growing years based on electronic nose. St. Joseph, Michigan: American Society of Agricultural and Biological Engineers. 2013 ASABE; 2013. [Google Scholar]
  • 50.Korea Forest Service K. 2019 Statistical Data Book on Wild-Simulated Ginseng. Daejeon, Korea: Korea Forest Service; 2019. [Google Scholar]
  • 51.Woo I. inventorCultivation method of wild-simulated ginseng. Korea. 2016. [Google Scholar]
  • 52.Foundation SRD, inventor; Suh, Kyo Kim, Solhee, assignee. A method of post-treating wild ginseng through transplanting to climate-smart environment, and ginseng produced by the method. Korea. 2023. [Google Scholar]
  • 53.Li TS, Berard R. Effects of soil moisture on the growth of American ginseng (Panax quinquefolium L.). Journal of Ginseng Research. 1998;22(2):122–5. [Google Scholar]
  • 54.Park HW, Jang IB, Kim YC, Mo HS, Park KC, Yu J, et al. Growth Characteristics of Ginseng Seedlings as Affected by Mixed Nursery Soil under Polyethylene Film Covered Greenhouse. Korean Journal of Medicinal Crop Science. 2014;22(5):363–8. doi: 10.7783/kjmcs.2014.22.5.363 [DOI] [Google Scholar]
  • 55.Kormanek M, Małek S, Banach J, Durło G, Jagiełło-Leńczuk K, Dudek K. Seasonal changes of perlite–peat substrate properties in seedlings grown in different sized container trays. New Forests. 2020;52(2):271–83. doi: 10.1007/s11056-020-09793-3 [DOI] [Google Scholar]
  • 56.Bengio Y, Courville A, Vincent P. Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell. 2013;35(8):1798–828. doi: 10.1109/TPAMI.2013.50 [DOI] [PubMed] [Google Scholar]
  • 57.Géron A. Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O’Reilly Media, Inc.; 2022. [Google Scholar]
  • 58.Bergstra J, Bengio Y. Random search for hyper-parameter optimization. The Journal of Machine Learning Research. 2012;13(1):281–305. [Google Scholar]
  • 59.Smith LN. A disciplined approach to neural network hyper-parameters: Part 1 -- learning rate, batch size, momentum, and weight decay. 2018.
  • 60.Xiao X, Yan M, Basodi S, Ji C, Pan Y. Efficient hyperparameter optimization in deep learning using a variable length genetic algorithm. arXiv preprint arXiv:200612703. 2020. [Google Scholar]
  • 61.Luo C, He X, Zhan J, Wang L, Gao W, Dai J. Comparison and benchmarking of ai models and frameworks on mobile devices. arXiv preprint arXiv:200505085. 2020. [Google Scholar]
  • 62.Sandler M, Howard A, Zhu M, Zhmoginov A, Chen L-C, editors. MobileNetV2: Inverted Residuals and Linear Bottlenecks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE; 2018. [Google Scholar]
  • 63.Zoph B, Le QV. Neural Architecture Search with Reinforcement Learning. 2016.
  • 64.Zoph B, Vasudevan V, Shlens J, Le QV, editors. Learning Transferable Architectures for Scalable Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR): IEEE; 2018. 2018/06. [Google Scholar]
  • 65.Agarwal D, Bachan P. Machine learning approach for the classification of wheat grains. Smart Agricultural Technology. 2023;3(100136). [Google Scholar]
  • 66.Nakano T, Nukala BT, Tsay J, Zupancic S, Rodriguez A, Lie DYC, et al. Gaits Classification of Normal vs. Patients by Wireless Gait Sensor and Support Vector Machine (SVM) Classifier. International Journal of Software Innovation. 2017;5(1):17–29. doi: 10.4018/ijsi.2017010102 [DOI] [Google Scholar]
  • 67.Breiman L. Bagging Predictors. Machine Learning. 1996;24(2):123–40. doi: 10.1023/a:1018054314350 [DOI] [Google Scholar]
  • 68.Breiman L. Random Forests. Machine Learning. 2001;45(1):5–32. doi: 10.1023/a:1010933404324 [DOI] [Google Scholar]
  • 69.Zhang M, Shi W, Xu Z. Systematic comparison of five machine-learning models in classification and interpolation of soil particle size fractions using different transformed data. Hydrol Earth Syst Sci. 2020;24(5):2505–26. doi: 10.5194/hess-24-2505-2020 [DOI] [Google Scholar]
  • 70.Bashir O, Bangroo SA, Shafai SS, Shah TI, Kader S, Jaufer L, et al. Mathematical vs. machine learning models for particle size distribution in fragile soils of North-Western Himalayas. J Soils Sediments. 2024;24(6):2294–308. doi: 10.1007/s11368-024-03820-y [DOI] [Google Scholar]
  • 71.Hastie T, Tibshirani R, Friedman JH, Friedman JH. The elements of statistical learning: data mining, inference, and prediction: Springer; 2009. [Google Scholar]
  • 72.Goodfellow I, Bengio Y, Courville A, Bengio Y. Deep learning: MIT press Cambridge; 2016. [Google Scholar]
  • 73.Guyon I, Eliseeff A. An introduction to variable and feature selection. Journal of Machine Learning Research. 2003;3(Mar):1157–82. [Google Scholar]
  • 74.Kohavi R, editor A study of cross-validation and bootstrap for accuracy estimation and model selection. Morgan Kaufman Publishing. International Joint Conference on Articial Intelligence (IJCAI); 1995. [Google Scholar]
  • 75.Vapnik VN, Chervonenkis AY. On the Uniform Convergence of Relative Frequencies of Events to Their Probabilities. In: Vovk V, Papadopoulos H, Gammerman A, editors. Measures of Complexity: Festschrift for Alexey Chervonenkis. Cham: Springer International Publishing; 2015. p. 11–30. [Google Scholar]
  • 76.Chang CC, Lin CJ. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology. 2011;2(3):1–27. [Google Scholar]
  • 77.Lamichhane M, Mehan S, Mankin KR. Soil Moisture Prediction Using Remote Sensing and Machine Learning Algorithms: A Review on Progress, Challenges, and Opportunities. Remote Sensing. 2025;17(14):2397. doi: 10.3390/rs17142397 [DOI] [Google Scholar]
  • 78.Taheri M, Bigdeli M, Imanian H, Mohammadian A. An overview of machine-learning methods for soil moisture estimation. Water. 2025;17(11). [Google Scholar]
  • 79.Koné BAT, Grati R, Bouaziz B, Boukadi K, Mecella M. Optimized and Explainable Feature Selection for Soil Moisture Prediction Across Sites. 2025.
  • 80.Houben T, Ebeling P, Khurana S, Schmid JS, Boog J. Machine‐learning based spatiotemporal prediction of soil moisture in a grassland hillslope. Vadose Zone Journal. 2025;24(2). doi: 10.1002/vzj2.70011 [DOI] [Google Scholar]

Decision Letter 0

Babak Mohammadi

24 Jun 2025

Dear Dr. Suh,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Aug 08 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols .

We look forward to receiving your revised manuscript.

Kind regards,

Babak Mohammadi

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. In your Methods section, please provide additional information regarding the permits you obtained for the work. Please ensure you have included the full name of the authority that approved the field site access and, if no permits were required, a brief statement explaining why.

3. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, we expect all author-generated code to be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

4. When completing the data availability statement of the submission form, you indicated that you will make your data available on acceptance. We strongly recommend all authors decide on a data sharing plan before acceptance, as the process can be lengthy and hold up publication timelines. Please note that, though access restrictions are acceptable now, your entire data will need to be made freely accessible if your manuscript is accepted for publication. This policy applies to all data except where public deposition would breach compliance with the protocol approved by your research ethics board. If you are unable to adhere to our open data policy, please kindly revise your statement to explain your reasoning and we will seek the editor's input on an exemption. Please be assured that, once you have provided your new statement, the assessment of your exemption will not hold up the peer review process.

Additional Editor Comments:

Dear Authors,

The manuscript has been evaluated by three experts. Please provide point-to-point feedback, revisions, and/or responses to each comment of the reviewers. Besides, please consider the following comments during the revision and modify the manuscript accordingly, and provide feedback and modifications for each comment:

- Abstract, Results, Discussion, and Conclusion sections need more interpretation, discussion, and key findings in terms of Soil Moisture.

- Please investigate more literature review in the Introduction and then summarize their findings and make linkages between them and the research gaps you want to address in this study.

- Table 4: please provide statistical metrics for training and testing phases separately.

- Table 4: please also include Relative Root Mean Squared Error (RRMSE) as a new metric.

- Figure 2: please provide such figures for the training and testing phases separately.

- Figure 5 needs more explanation and discussion in the text.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

Reviewer #1: This manuscript presents a very interesting approach involving image recognition and the use of AI to estimate soil moisture. I have a few minor comments to help improve the quality of the manuscript:

- Please add relevant references between lines 19 and 26.

- In lines 40–41, to which studies are the authors referring? Please clarify.

- Similarly, which studies are referenced in line 43? Please review the Introduction section to ensure all citations are properly included.

- The content between lines 72 and 82 appears to be background information. I suggest synthesizing this content and incorporating it into the Introduction.

- In lines 104–105, is the soil moisture reported on a gravimetric or volumetric basis? Please clarify this point.

- In Section 2.2, the authors describe the soil moisture monitoring system. Arduino-based soil moisture sensors are known for their limited accuracy (see for example https://doi.org/10.3390/s23052451), which can harm actual soil water conditions. Additionally, the manuscript mentions that only two points were used for sensor calibration. How can the authors ensure the reliability of the sensors using only a two-point calibration approach for Arduino-based sensors?

- In line 130, the authors mention a “100%” volumetric water content. Would this not refer to the degree of saturation of the soil sample rather than its volumetric water content? Please verify and ensure that the correct concepts and terminology are used consistently throughout the manuscript.

- The content between lines 239 and 246 also appears to be background information. Please remove it from Section 2.4.2 and integrate a synthesized version into the Introduction.

- In lines 424–426, could the reduced reliability of the soil moisture data have influenced the findings? Please consider and address this possibility.

- In lines 435–436, the authors seem to be referring again to the degree of saturation. If so, please correct the terminology.

- Currently, the Discussion section summarizes the main findings and highlights the study's limitations and contributions. However, a typical Discussion should also compare the results with findings from other studies. I strongly recommend revising this section to include comparisons with relevant recent literature.

Reviewer #2: • Maintain consistent use of the Oxford comma throughout lists.

• Replace passive voice were unnecessary, especially in methodology sections, with active constructions for clarity.

• Avoid stacking multiple clauses in one sentence; break them into two where necessary.

• Use consistent units of measurement (e.g., always clarify if % is gravimetric or volumetric moisture content).

• Use either American or British English consistently (e.g., “analyze” vs. “analyse”).

Reviewer #3: In this manuscript, the authors present a non-destructive image-based approach for estimating soil moisture. Top-view RGB images from a mobile camera were used as predictors for an adopted version of pre-trained deep learning architectures to predict the soil moisture content at the top layer. DenseNet121 was reported as the optimal model for soil moisture estimation in the top layer with high accuracy and computational efficiency. Moreover, non-linear regression models were optimized to allow for soil-moisture estimation at the bottom layer. The random forest and bagging decision tree performed the best in estimating the soil moisture at the deep layer from the measured soil moisture at the top and middle layers. Generally, the manuscript is well-written and structured in a logical order with well-justified choices. However, a few points need to be addressed before this work can be published.

**Major comments**

1) I recommend adding a timeseries figure including both actual and predicted soil moisture (of Figure 4 and Figure 5). This allows for further understanding of the capabilities and limitations of the image-based estimation. If this is not possible, please explain why not?!

2) What is the coefficient of determination between soil moisture at the upper and middle depths against the bottom layer? This check is important to understand the added value of the regression models used to predict the bottom layer.

3) The discussion of the manuscript can be further strengthened. Here are some points that are worth discussing:

- The study focused on the seedling stage, which raises the question of robustness and generalizability of your approach to the different growth stages. For example, a shadow will be introduced as the growth progresses, suggesting an updated model training, thus reducing its practicality.

- The approach applied looks infeasible for the intended purpose of crop management on a large scale. A key question would be how such an approach can be feasibly implemented for large-scale areas.

- Since the initial soil moisture was controlled (40-60%), how would changes in the initial soil moisture impact the performance of the AI model? Try to discuss this point and explain what is needed to quantitatively analyze this sensitivity.

- What was the rationale behind selecting 224x224 pixels? What is the implication of such a decision?

- The practicality does not fully convince me of your second part (i.e., soil moisture estimation at deep layers from the upper and middle measurements). In which cases can such a model be useful? The rationale behind this analysis requires strong arguments to accept it.

**Minor comments**

1) Lines 43-45: an example of references is missing.

2) Figure 1: Add dimensions to the cultivation system with more elaboration on its different components.

3) Figure 2: Add scale per subpanel.

4) Lines 171-172: add references to the pre-trained models.

5) Lines 167, 407: Typo in figure numbering

6) Table 3: caption is not complete. “is used to reduce…”

7) Lines 336-338: give different numbers for each question. Moreover, the equation of MSE is missing, and the second equation is for NRMSE, not RMSE. Make the necessary changes!

8) I found the heading and subheading titles are long. I suggest making shorter titles.

9) Table 4 should be split into Table 4 and Figure 4. Moreover, I suggest adding a legend that links the difference in the MAE values and the bubble size.

10) Table 6: The ranking of the models is not correct in the caption. Change it to “random forest and bagging decision tree regression … followed by gradient boosting”.

11) Figure 5: Add more information about the right panels in the caption. For example, they are based on the testing data and the optimized hyperparameter indicated by the green vertical dashed line, or so!

12) Type in line 550, change “Furesearch” to “Future research”.

13) Line 552: future work should explore other predictors beyond the measured spectrum (i.e., indices that can indicate changes in soil moisture).

Awad M. Ali

Hydrology and Environmental Hydraulics Group,

Wageningen University

**********

what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy

Reviewer #1: Yes: Luis Eduardo Bertotto

Reviewer #2: Yes: Dr. Iftikhar Ahmed

Reviewer #3: Yes: Awad Mohammed Ali

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/ . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org . Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: Review Report for PONE.docx

pone.0341904.s001.docx (16.7KB, docx)
PLoS One. 2026 Feb 17;21(2):e0341904. doi: 10.1371/journal.pone.0341904.r002

Author response to Decision Letter 1


1 Sep 2025

Manuscript Reference: PONE-D-25-27146

Title: Image-Based Machine Learning Models for Customized Soil Moisture Management

Journal: PLOS One

Reviewer #1

We would like to say thank you for your valuable comments to improve our paper. We have addressed the comments as explained below.

[1] Please add relevant references between lines 19 and 26.

Thank you for your comment. As suggested, we have added relevant references between lines 19 and 26 to strengthen the background and support the statements. The revised text now includes recent studies on smart farming and water resource management (Parra-López et al., 2025; Neophytides et al., 2024; Ingrao et al., 2023; Al-Kaisi & Broner, 2014; Kelly et al., 2025; Bounajra et al., 2024).

In manuscript:

(Line 19-26)

In the evolution of smart farming, water resource management technology has gained significant attention (Parra-López et al., 2025; Neophytides et al., 2024). As climate change intensifies water scarcity, efficient water resource management in agriculture has become an increasingly critical challenge (Ingrao et al., 2023). Moisture management in crop cultivation is a key factor in determining productivity and quality (Al-Kaisi & Broner, 2014). For example, inadequate moisture management can cause direct economic losses through reduced plant growth, increased pest and disease occurrence, and decreased yields (Ingrao et al., 2023). Precise moisture management requires accurate understanding crop water requirements based on growth stages and environmental conditions, and then providing the appropriate amount of water for each plant at the right time (Kelly et al., 2025; Al-Kaisi &Broner, 2014; Bounajra et al., 2024).

[2] Similarly, which studies are referenced in line 43? Please review the Introduction section to ensure all citations are properly included.

We have revised the citation for line 43 to ensure accuracy, and the sentence now cites European Parliamentary Research Service (2023), Akkem et al. (2023), and Soussi et al. (2023) as the relevant studies. In addition, as suggested, we reviewed the entire Introduction section to ensure the section is supported by relevant literature, and we have added citations where necessary.

In manuscript:

(Line 43)

Most artificial intelligence research related to smart farms incorporating various digital technologies has primarily focused on uniform management at the field or greenhouse level. Limited studies exist on customized cultivation management systems that consider the growth status of individual crops (European Parliamentary Research Service, 2023; Akkem et al., 2023; Soussi et al., 2023).

[3] The content between lines 72 and 82 appears to be background information. I suggest synthesizing this content and incorporating it into the Introduction.

Thank you for your comment. We carefully reviewed the content between lines 72 and 82 and agree that it provides important background information on the target crop. While your suggestion was to incorporate this content into the Introduction, we determined that it is more closely related to the description of experimental materials and design. Therefore, in revising the manuscript, we reorganized Section 2.1 Experimental Setup for Customized Growth Management to improve clarity and logical flow, creating two subsections: 2.1.1 Target Crop: Wild-simulated Ginseng and 2.1.2 Experiment Setup. The content originally found between lines 72 and 82 has been moved to the Target Crop subsection, where it directly supports the experimental design.

Below is the revised text for 2.1.1 Target Crop: Wild-simulated Ginseng, extracted from the reorganized Section 2.1 Experimental Setup for Customized Growth Management. Only the relevant subsection is presented here for clarity.

In manuscript:

(Line 79-92)

2.1.1. Target Crop: Wild-simulated Ginseng

High value-added agricultural and forest products command premium prices relative to their yield, highlighting the need for customized management to maximize productivity. The market price per gram of wild-simulated ginseng (WSG) exceeds that of cultivated ginseng by more than 20-fold, highlighting its prominence as a high-value forest product (Han et al., 2017; Jeong et al., 2019). Most WSG is marketed after more than seven years of cultivation. The quality and price are determined by various factors such as region, shape, weight, and other attributes that affect market value (Moon et al., 2019; Park et al., 2013; Shaoqing et al., 2013). Since the survival rate of WSG decreases dramatically from the fourth year onward, effective management strategies are required to improve survival rates and produce high-quality ginseng (Korea Forest Service, 2019; Woo, 2016). To establish legal standards for WSG and clearly distinguish it from cultivated ginseng, we investigated changes in WSG characteristics after a short post-treatment period following pre-germination transplantation. Accordingly, five-year-old WSG cultivated in Pyeongchang, Gangwon Province, was selected and transplanted into experimental soil as the target forest product for this study.

[4] In lines 104–105, is the soil moisture reported on a gravimetric or volumetric basis? Please clarify this point.

Thank you for your comment. The soil moisture content (50 ± 10%) reported in lines 104–105 was expressed as a percentage of the fully saturated condition of the soil, based on two-point calibration (0% for oven-dry soil and 100% for saturation). While the sensor output was calibrated using volumetric water content readings from a pre-calibrated device, the reported value represents the relative saturation level rather than an absolute volumetric water content. We have clarified this distinction in the revised manuscript.

In manuscript:

(Line 112-116)

Following transplantation, the soil moisture content was adjusted to 50 ± 10% of saturation, based on two-point calibration (oven-dry and fully saturated conditions), to provide a suitable starting environment. In this study, soil moisture content refers to the relative saturation of the soil, expressed as a percentage with 0% for oven-dry soil and 100% for saturation. Hereafter, this is referred to simply as soil moisture content.

[5] In Section 2.2, the authors describe the soil moisture monitoring system. Arduino-based soil moisture sensors are known for their limited accuracy (see for example https://doi.org/10.3390/s23052451), which can harm actual soil water conditions. Additionally, the manuscript mentions that only two points were used for sensor calibration. How can the authors ensure the reliability of the sensors using only a two-point calibration approach for Arduino-based sensors?

Thank you for your comment and for highlighting the potential limitations of Arduino-based soil moisture sensors. We acknowledge that such sensors, in general, have lower absolute accuracy compared to high-grade instruments. In our study, to minimize this limitation, we individually calibrated each Arduino-based sensor against a pre-calibrated professional soil moisture meter prior to the experiment. The professional device had undergone factory calibration, ensuring a reliable reference for the calibration process.

Although a high-grade instrument was available, we opted to use Arduino-based sensors because our experimental design required simultaneous real-time monitoring of soil moisture across multiple pots. This was not feasible with a single high-grade device. Among the various types of Arduino-compatible sensors, we selected capacitive sensors, which are known to be more resistant to corrosion than resistive types. Furthermore, the experiment was conducted over a relatively short period, reducing the risk of sensor degradation. These measures, combined with controlled environmental conditions, allowed us to maintain sufficient reliability for our experimental objectives while addressing the known limitations of Arduino based sensors.

[6] In line 130, the authors mention a “100%” volumetric water content. Would this not refer to the degree of saturation of the soil sample rather than its volumetric water content? Please verify and ensure that the correct concepts and terminology are used consistently throughout the manuscript.

Thank you for your comment. As clarified in our response to Comment [4] and reflected in the revised Section 2.1, we defined “soil moisture content” based on a two-point calibration (0% for oven-dry soil and 100% for saturation). Therefore, the value referred to as “100%” in the original text indicates full saturation rather than volumetric water content in the strict sense. To maintain consistency, we have revised the manuscript to use the term “soil moisture content” throughout, which hereafter denotes relative saturation as defined in Section 2.1.

In manuscript:

(Line 140-141)

Based on two-point calibration, the average sensor output was 542 ± 54 at 0% (oven-dry) and 344 ± 50 at 100% (saturation).

[7] The content between lines 239 and 246 also appears to be background information. Please remove it from Section 2.4.2 and integrate a synthesized version into the Introduction.

Thank you for your helpful comment. In accordance with your suggestion, we have removed the background description from Section lines 239–246 in the original manuscript and integrated the relevant content into the Introduction to improve the manuscript.

In manuscript:

(Line 30-33)

Since each soil layer plays a distinct role in crop growth, understanding soil moisture variation by depth is also essential (He et al., 2021). The upper layer reflects rapid irrigation and evaporation changes (USDA, 1991), the middle layer is the primary zone for root activity, and the bottom layer functions as a reservoir that buffers excess moisture (Bai et al., 2025; He et al., 2025).

[8] In lines 424–426, could the reduced reliability of the soil moisture data have influenced the findings? Please consider and address this possibility.

Thank you for the comment. In the section referenced (lines 424–426), the increased prediction uncertainty at high-moisture conditions arises primarily from image-domain factors rather than degraded label reliability. Near saturation, the soil surface becomes darker and more homogeneous, pools of water introduce specular reflections, and the peatmoss/perlite mixture reduces micro-texture contrast, collectively weakening the visual cues available to the model. By design, ground-truth moisture labels were obtained from short-term, individually calibrated sensors and data were collected before canopy occlusion, which limited label noise. We have added a note to clarify that the observed uncertainty is mainly due to diminished discriminability in surface imagery under high moisture, while label reliability was managed through calibration and controlled acquisition.

In manuscript:

(Line 431-436)

Across all models, there was a general trend of increased prediction uncertainty under high-moisture conditions, which is likely due to the subtle changes in the soil texture that complicate accurate prediction. The findings highlight that an effective feature extraction strategy plays a more critical role in predictive performance than model complexity alone. Notably, the higher uncertainty under near-saturation is attributable to reduced visual discriminability of the soil surface (darkening, specular reflections, and homogenization of the peatmoss/perlite mixture), which limits image-based cues.

[9] In lines 435–436, the authors seem to be referring again to the degree of saturation. If so, please correct the terminology.

We appreciate the reviewer’s comment regarding the terminology. To address this, we clarified in the Methods section that soil moisture content in this study is defined as relative saturation (%) based on two-point calibration (oven-dry and fully saturated). We also revised Section 4.2.1 to explicitly state that the reported values represent relative saturation.

In manuscript:

(Line 444-445)

The bottom layer showed the highest mean soil moisture content, measured as relative saturation, with a mean value of 65.33% and a standard deviation of 15.01%.

[10] Currently, the Discussion section summarizes the main findings and highlights the study's limitations and contributions. However, a typical Discussion should also compare the results with findings from other studies. I strongly recommend revising this section to include comparisons with relevant recent literature.

We thank you for this important suggestion. In the revised manuscript, we have substantially expanded the Discussion to include explicit comparisons with recent literature. Specifically, we now contrast our image–sensor paired approach with prior sensor-based methods that required dense sensor networks for subsurface monitoring (Lloret et al., 2021). We also compare our plant-scale results with recent remote sensing studies that used satellite or UAV imagery for soil moisture estimation at larger scales (Peng et al., 2024; Lamichhane et al., 2025; Hernandez et al., 2025). Furthermore, we discuss similarities with imaging studies that employed hyperspectral and phenotyping platforms for assessing canopy water status (Furbank & Tester, 2011; Ge et al., 2018). By situating our findings within these recent advances, we highlight the novelty of our approach namely, demonstrating that simple RGB imagery can enable robust, plant-level moisture estimation with reduced sensor dependency. These revisions strengthen the Discussion by clarifying how our study complements and extends existing research.

Reviewer #2

We would like to say thank you for your valuable comments to improve our paper. We have addressed the comments as explained below.

[1] Maintain consistent use of the Oxford comma throughout lists.

Thank you for your observation. In the revised manuscript, we carefully reviewed all lists and ensured the consistent application of the Oxford comma. For example, the list of models in the Results section was revised to read “ResNet50, EfficientNetB0, MobileNetV2, InceptionV3, DenseNet121, and NASNetMobile.” This consistent usage enhances readability and maintains stylistic uniformity throughout the manuscript.

[2] Replace passive voice were unnecessary, especially in methodology sections, with active constructions for clarity.

We appreciate your suggestion. We revised the manuscript to reduce unnecessary passive voice, particularly in the Methodology and Results sections. Whenever possible, sentences were rewritten in active voice to improve clarity and readability. For example, the previous phrasing “The number of trees was adjusted and evaluated” was revised to “We adjusted and evaluated the number of trees.” These revisions highlight the authors’ role in the research process and improve narrative flow.

[3] Avoid stacking multiple clauses in one sentence; break them into two where necessary.

We agree with your comment. Throughout the manuscript, sentences that contained multiple clauses were revised and divided into two or more concise statements to enhance readability. For instance, in the Results section, the sentence “As C increased beyond 10, performance declined, with the R² dropping to 73.9% at C = 1000” was revised into “As C increased beyond 10, performance declined. The R² dropped to 73.9% at C = 1000.” These changes improve clarity and ensure that the manuscript is easier to follow for readers.

[4] Use consistent units of measurement (e.g., always clarify if % is gravimetric or volumetric moisture content).

Thank you for raising this important point. We have revised the manuscript to clarify that soil moisture content is expressed as relative saturation, where 0 percent corresponds to oven-dry soil and 100 percent corresponds to fully saturated soil. To avoid ambiguity, we now consistently use this definition throughout the text and explicitly state that the values refer to relative saturation rather than gravimetric or volumetric content.

[5] Use either American or British English consistently (e.g., “analyze” vs. “analyse”).

Thank you for pointing this out. The manuscript has been carefully revised to ensure consistent use of American English spelling and style throughout the text.

Reviewer #3

We would like to say thank you for your valuable comments to improv

Attachment

Submitted filename: Soil_Image_1stRound_20250818.pdf

pone.0341904.s003.pdf (317.7KB, pdf)

Decision Letter 1

Julfikar Haider

29 Sep 2025

Dear Dr. Suh,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Nov 13 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols .

We look forward to receiving your revised manuscript.

Kind regards,

Julfikar Haider

Academic Editor

PLOS ONE

Journal Requirements:

If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise.

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Additional Editor Comments:

Please find few comments to address

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #1: All comments have been addressed

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions??>

Reviewer #1: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #1: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy

Reviewer #1: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #1: Yes

Reviewer #3: Yes

**********

Reviewer #1: (No Response)

Reviewer #3: I would like to thank the authors for their genuine consideration of my first review. However, few points remain to finalize the reviewing from my side.

- I noticed that my minor comment about Figure 2 was missed. Please add scale per subpanel.

- For point [2], I suggest adding the R2 values of the linear models for better clarification.

- For point [3], I believe my final suggestion was not fully addressed in the revised manuscript. Training a ML model to predict the deeper layer from the surface layer requires training hence information about the deeper soil moisture (from sensors?). Given that soil moisture can be largely heterogeneous, to what extent one can transfer the trained model to other locations. Please add some insights from literature to properly address my concern.

**********

what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy

Reviewer #1: Yes: Luis Eduardo Bertotto

Reviewer #3: Yes: Awad Mohammed Ali

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/ . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org

PLoS One. 2026 Feb 17;21(2):e0341904. doi: 10.1371/journal.pone.0341904.r004

Author response to Decision Letter 2


23 Nov 2025

We would like to say thank you for your valuable comments to improve our paper. We have addressed the comments as explained below.

[1] The manuscript does not provide sufficient detail on soil physical properties (e.g., texture, bulk density, field capacity), which are critical to understanding water retention behavior and image-based interpretation.

We appreciate your comment regarding the lack of detail on soil. In this study, all plants were grown in a commercially standardized peat moss and perlite mixture (6:4 ratio), which was intentionally kept constant across experimental units because the objective was to examine moisture stress responses and image-based predictions within a uniform substrate rather than to compare different soil types. For this reason, soil physical properties were not treated as independent experimental variables. However, we fully agree that the general physical characteristics of the substrate are important for understanding water retention behavior and sensor interpretation. To address this concern, we have revised the manuscript to include a description of the typical physical properties of peat–perlite mixtures based on manufacturer information and relevant literature (high porosity, low bulk density, and moderate-to-high water-holding capacity). We appreciate your helpful suggestion, which has strengthened the clarity and contextualization of the experimental setup.

In manuscript:

(Line 97-106) WSG seedlings were grown under controlled greenhouse conditions following transplantation. The image-based analysis in this study focused on the early growth period after transplantation, a stage during which wild-simulated ginseng is highly sensitive to moisture conditions. Moisture stress during this period has a direct influence on seedling survival and early physiological responses, including changes in saponin composition[53]. To ensure consistent rooting conditions across plants, we used a standardized peat moss and perlite substrate in a 6:4 ratio, which is commonly adopted for ginseng seedling cultivation[54, 55]. This substrate composition is known to provide high porosity, low bulk density, and a favorable balance between aeration and water-holding capacity. Such physical characteristics create a uniform and supportive rooting environment, which is particularly important during the moisture-sensitive early growth stage of WSG

[2] The sensor data appears to be in arbitrary units (0–1023) converted via calibration, yet final reported moisture predictions are in percentages.

Thank you for raising this point regarding the conversion of raw sensor outputs into soil moisture values. The capacitive sensors used in this study generate electrical signals in the 01023 range, which must be converted before meaningful interpretation is possible. To address this concern, we have clarified the calibration procedure in the revised manuscript. Specifically, we now explain that soil moisture (%) values were derived using a standard two-point calibration approach based on oven-dry (0%) and fully saturated (100%) reference states. This method converts the raw electrical signals into normalized moisture estimates rather than direct volumetric water content. We have updated the Methods section to explicitly describe this conversion process and to provide the corresponding calibration values. We believe these revisions improve transparency and strengthen the methodological clarity of the study. We sincerely appreciate your suggestion.

In manuscript:

(Line 130-142) Soil moisture status was measured using capacitive moisture sensors integrated into the Arduino-based moisture management system. Each sensor was installed individually for each WSG plant to enable customized monitoring and control of growth conditions. To avoid signal distortion, which is commonly caused by corrosion in resistive sensors, we used HW-390 capacitive soil moisture sensors. The capacitive soil moisture sensors used in this study detect changes in dielectric permittivity associated with varying soil moisture levels based on frequency domain reflectometry (FDR). These readings are converted into electrical signals ranging from 0 to 1023. Higher values correspond to lower moisture conditions. To convert the raw sensor outputs into normalized moisture (%) values, we applied a two-point calibration procedure using oven-dry (0%) and fully saturated (100%) reference states. The average sensor readings were 542 ± 54 at 0% and 344 ± 50 at 100%. In this study, the moisture (%) values represent a normalized index derived from the calibrated sensor signals rather than direct volumetric or gravimetric water content.

For the soil-depth analysis experiment, we installed three moisture sensors per plant at distinct heights above the pot base: 15 cm (top layer), 10 cm (middle layer), and 3 cm (deep layer). These layer designations were adopted to align with conventional soil profile terminology and to provide clearer interpretation of the soil structure within the pot.

[3] The reported moisture levels (e.g., >60%) seem high for peat-perlite mixes, unless the value is gravimetric or saturated conditions were used. Confirm the unit (volumetric vs gravimetric) and measurement technique used for "moisture %" throughout the manuscript.

We appreciate insightful comment regarding the interpretation of the moisture (%) values. We would like to clarify that the moisture percentages reported in this study do not represent direct volumetric or gravimetric water content. Instead, these values correspond to a normalized index derived from the calibrated electrical outputs of the capacitive sensors. The raw sensor signals (0–1023) were converted into normalized moisture (%) values using a two-point calibration procedure based on oven-dry (0%) and fully saturated (100%) reference conditions. As a result, values such as >60% indicate relative moisture levels with respect to saturation rather than absolute volumetric water content. To avoid any ambiguity, we have revised the Methods section to explicitly state that moisture (%) is presented as a normalized index rather than a physical water content measure. We thank you for this helpful suggestion, which has improved the clarity of the manuscript.

In manuscript:

(Line 135-137) In this study, the moisture (%) values represent a normalized index derived from the calibrated sensor signals rather than direct volumetric or gravimetric water content.

[4] Terms like “bottom layer = 3 cm from base” may confuse readers, as this is typically referred to as "deep layer." Consider using conventional soil profile terminology (surface, subsoil, etc.).

We sincerely appreciate your comment regarding the terminology used to describe the soil layers. As noted, the expression “bottom layer = 3 cm from base” may lead to confusion and does not align well with conventional soil profile terminology. In response, we have revised the manuscript to use clearer and more standard terms. Specifically, the sensors positioned 15 cm, 10 cm, and 3 cm above the pot base are now described as the top, middle, and deep soil layers, respectively. We believe this terminology provides a more intuitive understanding of the soil structure and improves the clarity of the experimental description. In addition, we have carefully reviewed the entire manuscript and updated all related terminology to ensure consistency throughout the text. We sincerely thank you for this helpful suggestion.

[5] Include soil moisture distribution profiles over time (e.g., after irrigation) to visualize how moisture moves through layers. A confusion matrix or regression scatterplots with residuals may further support the model’s predictive capability.

We sincerely appreciate your suggestions regarding the visualization of soil moisture dynamics and model performance. We would like to clarify that the present study did not track soil moisture redistribution over time after irrigation. Our experiment matched the soil moisture measured by sensors with the images captured at the same moment to train the models for estimating individual soil moisture conditions. Because continuous time-series data were not collected, we are unable to generate soil moisture distribution profiles over time. Nonetheless, we find your suggestion highly valuable and agree that incorporating temporal moisture redistribution analysis would be a meaningful direction for future advanced research.

Regarding your recommendation to include a confusion matrix or regression visualizations, we interpreted the suggestion for a confusion matrix as an interest in improving clarity in model evaluation. However, since the present study does not use a classification model, tools that are specifically designed for classification such as confusion matrices cannot be applied within our modeling framework. As for regression-based visualizations, scatterplots and residual distributions have already been provided in Figure 5 and Figure 6, which illustrate the predictive performance and residual patterns of the models. We appreciate your feedback and believe that the suggestions have helped strengthen the clarity of the manuscript.

[6] Some references are cited as "2024/2025" which may still be in press or unavailable. Ensure all references are accessible and published.

We appreciate your comment regarding the references listed as 2024/2025. We have thoroughly reviewed the reference list and corrected all instances of 2024/2025, ensuring that only accessible and fully published sources are cited in the revised manuscript. Thank you for pointing this out.

Reviewer [3]

We would like to say thank you for your valuable comments to improve our paper. We have addressed the comments as explained below.

[1] I noticed that my minor comment about Figure 2 was missed. Please add scale per subpanel.

We appreciate helpful suggestion regarding the inclusion of scale bars in Figure 2. Although the images were captured at a fixed camera distance, no physical reference object or measured scale was included during image acquisition. Under these conditions, it is not possible to derive an accurate physical scale, and adding an arbitrary scale bar could mislead readers by implying a level of spatial or dimensional accuracy that the images do not provide. Figure 2 is intended solely to offer a qualitative illustration of visual differences across moisture levels, rather than to support spatial or size-dependent interpretation. To avoid potential misunderstanding while maintaining clarity, we have added a note to the figure caption indicating that the images are not to scale.

In manuscript:

Figure 2 Sample soil surface images at different moisture levels. These images are provided solely for qualitative illustration, and no spatial or physical scale is implied (not to scale).

[2] I suggest adding the R2 values of the linear models for better clarification.

We sincerely appreciate your thoughtful comment regarding the inclusion of R² values. We would like to clarify that all R² values associated with the models used in this study are already fully reported in Table 4 and Table 6 of the revised manuscript. These values reflect the best-performing models obtained. However, in the event that we have misunderstood your intended meaning or if you were referring to a different form or context of R² beyond what has already been provided, we would be more than willing to incorporate any additional R² values or clarifications as necessary. We appreciate your careful evaluation and remain grateful for the feedback.

[3] Training a ML model to predict the deeper layer from the surface layer requires training hence information about the deeper soil moisture (from sensors?).

We sincerely appreciate the comment regarding the relationship between surface imaging and deeper-layer soil moisture estimation. To clarify the scope of the current study, the deeper-layer analysis was conducted using actual sensor measurements collected at 15 cm, 10 cm, and 3 cm above the pot base, and these data were used to train and evaluate the depth-specific prediction models. Therefore, the deeper-layer models were developed with the necessary information. At the same time, the present work does not implement an end-to-end system that predicts deeper soil moisture directly from surface images. The image-based component was designed only for surface-layer estimation, and integrating surface imagery with deeper-layer prediction remains a future research goal. We agree with the reviewer that achieving such integration would require careful consideration of soil heterogeneity and the challenges of transferring models across different conditions. We appreciate your insight, and we have revised the manuscript to better clarify the scope of the current framework and the implications for future development.

In manuscript:

(Line 570-579) Although this study successfully developed depth-specific soil moisture models using sensor measurements collected at 15 cm, 10 cm, and 3 cm above the pot base, the surface-image–based component was applied only to the shallow layer. As a result, the present framework does not yet integrate surface imaging into an end-to-end system for predicting deeper soil moisture. Developing such an integrated approach remains a future goal but would require corresponding ground-truth data for deeper layers as well as methods capable of addressing the substantial spatial heterogeneity of soil moisture conditions. Previous studies have similarly reported that machine learning based soil moisture models often exhibit reduced performance when applied across different locations or soil conditions unless multi-site calibration or domain adaptation techniques are employed[77-80]. Future extensions of the framework will therefore require multi-environment datasets and modeling strategies that explicitly account for spatial heterogeneity in soil moisture dynamics.

Attachment

Submitted filename: Revision_2ndRound.docx

pone.0341904.s004.docx (36.5KB, docx)

Decision Letter 2

Gobinath Ravindran

14 Jan 2026

Image-Based Machine Learning Models for Customized Soil Moisture Management

PONE-D-25-27146R2

Dear Dr. Suh,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager®  and clicking the ‘Update My Information' link at the top of the page. For questions related to billing, please contact billing support .

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Gobinath Ravindran

Academic Editor

PLOS One

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #1: All comments have been addressed

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions??>

Reviewer #1: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #1: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy

Reviewer #1: Yes

Reviewer #3: No

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #1: Yes

Reviewer #3: Yes

**********

Reviewer #1: I have no further comments to the manuscript, therefore I recommend acceptance of the article in the current version.

Reviewer #3: I am satisfied with your response to my previous comments. I believe the manuscript is ready to be published. Well done!

**********

what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy

Reviewer #1: No

Reviewer #3: Yes: Awad M. Ali

**********

Acceptance letter

Gobinath Ravindran

PONE-D-25-27146R2

PLOS One

Dear Dr. Suh,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS One. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

You will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Gobinath Ravindran

Academic Editor

PLOS One

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Attachment

    Submitted filename: Review Report for PONE.docx

    pone.0341904.s001.docx (16.7KB, docx)
    Attachment

    Submitted filename: Soil_Image_1stRound_20250818.pdf

    pone.0341904.s003.pdf (317.7KB, pdf)
    Attachment

    Submitted filename: Revision_2ndRound.docx

    pone.0341904.s004.docx (36.5KB, docx)

    Data Availability Statement

    All relevant data are within the manuscript.


    Articles from PLOS One are provided here courtesy of PLOS

    RESOURCES