Comparative assessment of Pest damage identification of coconut plant using damage texture and color analysis

Utpal Barman; Chhandanee Pathak; Nirmal Kumar Mazumder

doi:10.1007/s11042-023-14369-2

2023 Jan 25:1–23. Online ahead of print. doi: 10.1007/s11042-023-14369-2

PMCID: PMC9874181 PMID: 36712953

Abstract

Coconut cultivation is a promising agricultural activity, but keeping coconut plants pest-free requires cultivators to detect the various kinds of pest damage. Cultivators currently rely on conventional methods, expert opinion, or laboratory techniques, and these procedures are not adequate for identifying coconut pest damage. In this study, 16 color and texture features are reported for 1265 coconut pest damage images. The features are extracted in the color and grey domains after the damage is segmented using a thresholding technique. The Gray Level Co-occurrence Matrix (GLCM) and Gray Level Run Length Matrix (GLRLM) techniques are applied to extract the texture features, and two Artificial Neural Network (ANN) architectures classify the extracted features into five classes (Eriophyid_Mite, Rhinoceros_Beetle, Red_Palm_Weevil, Rugose_Spiraling_White_fly, and Rugose_in_Mature) with an average testing accuracy of almost 100%. For comparison with other machine learning techniques, Support Vector Machine (SVM), Decision Tree (DT), and Naïve Bayes (NB) classifiers are also applied to damage identification; the SVM also reports almost 100% accuracy on the fused GLCM and GLRLM features. The ANN and SVM results are compared with those of the DT and NB classifiers using the confusion matrix, precision, recall, and F1-score. The ANN and SVM outperform the other classifiers on all metrics and can serve as base models for further study of coconut pest damage identification using deep learning techniques.

Keywords: Coconut Pest, Image segmentation, GLCM, ANN, SVM, Decision tree, Naïve Bayes

Introduction

Coconut is highly versatile: it is used as a fruit, as a source of milk and oil, as a regular part of the diet of many people in the tropics and subtropics, as seed nuts, as a fuel, and more [5]. India is the third-largest coconut producer in the world [16]. Like other palm trees, coconut plants are vulnerable to damage caused by pest infection. Early detection of coconut pests and diseases is possible in laboratories, and cultivators can also get help from plant pathologists. However, these methods are not convenient for rural cultivators because laboratories and experts are unavailable in remote areas. Various computerized methods have been developed in recent years to identify and classify plant diseases using machine learning techniques [2]. Researchers have used computer vision, machine learning, and deep learning techniques to detect pests and diseases in plants such as tomato [12], potato [3], maize [4], and citrus [2], but the use of machine and deep learning for coconut damage identification is very limited.

Previous studies of coconut pest and disease detection with machine and deep learning used back propagation, feed-forward, and probabilistic neural networks [6]. Along with these methods, image processing techniques such as morphological feature extraction, wavelet-based image processing, zooming, and clipping with Otsu segmentation were applied to identify coconut plant pests and diseases [9, 11]. In [6], channel-spatial attention (CSA) and a region proposal network (RPN) were used to distinguish between pest-affected and normal regions. An SoC with an ARM Cortex-based CPU and GPU was embedded in a drone to process the images, so no pre-processing was required; the SoC and the fast interpreting algorithm made this processing possible. The authors of [16] focused on automatically detecting stem bleeding, leaf blight, and red palm weevil infection in coconut trees by applying k-means clustering segmentation along with MobileNet and a customized 2D-CNN model. Nesarajan et al. (2020) reported the application of SVM for color- and shape-based coconut disease detection; the CNN model they reported was EfficientNetB0. Beyond plant disease detection, CNNs and hand-crafted features have also been reported for human disease identification, such as skin cancer identification [14] and Covid-19 identification [18]. Apart from disease identification, deep learning is widely used in other areas, including stereo matching [19] and person identification [13].

The drone-based approach mentioned above raises concerns about operational safety, privacy, and insurance protection. For CNNs, the need for large image datasets is another hurdle. Identifying coconut pest infection from a small image dataset is therefore a challenging task. In this paper, machine learning techniques such as ANN, SVM, DT, and NB were used to identify the different pest infections of coconut. The objective of this work is to apply efficient machine learning techniques to identify and classify the infected area in the images, and to develop an optimized machine learning model that locates the infection from the segmented images, making the model more robust. The paper makes the following contributions towards coconut pest identification.

i) The coconut pest infection images are captured using a DSLR camera in a natural environment.
ii) The color and texture features are extracted from the coconut images to handle the small dataset.
iii) The ANN, SVM, DT, and NB models are explored to identify pest damage infection.
iv) The ANN and SVM models outperform the other models and are considered robust methods for coconut pest infection identification.

Materials and methods

Overview of the proposed work

The proposed work for identifying and classifying coconut pest damage was carried out in the seven steps listed below. The pictorial representation of the proposed method is presented in Fig. 1.

Step 1: Creation of coconut pest infection image dataset.
Step 2: Data augmentation and preprocessing.
Step 3: Color and texture feature extraction from the pest infection image dataset.
Step 4: Pest identification using ANN.
Step 5: Pest identification using SVM, DT, and NB.
Step 6: Choosing the model with the best performance.
Step 7: Assessment and evaluation of the model.

Fig. 1. Sample of coconut pest infection

Creation of pest infection image dataset

Image acquisition is the process of obtaining images of the samples using digital image sensing devices such as digital cameras, smartphones, and DSLRs. For this study, coconut plants growing in the Assam region were chosen, and the datasets were collected from different districts of Assam. A comparison study was conducted to distinguish diseased coconut plants from healthy ones. The images were captured with a digital camera (Canon PowerShot A590 IS) with the image quality set to 3.8 MP and a resolution of 6000*4000. The file size of images taken with these settings varied between 1.4 MB and 12.3 MB. Because the image size varied, the average image dimensions were 5568*3712; the images were saved in JPEG format in 8-bit RGB mode.

Image augmentation and preprocessing of coconut plant images

The images of various parts of a coconut plant present in the datasets were augmented and pre-processed using the following steps.

A. Augmentation techniques are required to increase the number of pest infection images in the dataset. The five image dataset classes and their image counts are presented in Table 1. To balance the dataset and reduce overfitting during training and testing, the following augmentation techniques were applied to the dataset.
1. The images were rescaled by a factor of 1/255.
2. The images were randomly flipped horizontally and sheared in the counter-clockwise direction by up to 0.2 degrees.
3. The brightness level was set in the range (0.5, 1.5).
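The augmentation steps listed above can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the function names (`rescale`, `horizontal_flip`, `adjust_brightness`, `augment`) are hypothetical, the image is assumed to be a nested list of 8-bit RGB pixels, the 50% flip probability is an assumption, and the shear step is omitted for brevity.

```python
import random

def rescale(image, factor=1 / 255):
    """Scale every channel value, e.g. from 0..255 to roughly 0..1."""
    return [[[v * factor for v in px] for px in row] for row in image]

def horizontal_flip(image):
    """Mirror the image left to right by reversing each row of pixels."""
    return [row[::-1] for row in image]

def adjust_brightness(image, factor):
    """Multiply 8-bit channel values by `factor` and clip to 0..255."""
    return [[[min(255, max(0, round(v * factor))) for v in px]
             for px in row] for row in image]

def augment(image, rng):
    """Apply a random flip and random brightness, then rescale.

    Assumptions (not from the paper): flip probability 0.5, and the
    brightness factor drawn uniformly from the stated range (0.5, 1.5).
    """
    out = image
    if rng.random() < 0.5:
        out = horizontal_flip(out)
    out = adjust_brightness(out, rng.uniform(0.5, 1.5))
    return rescale(out)

# A tiny 1x2 RGB "image" used only for illustration.
sample = [[[100, 200, 250], [0, 10, 20]]]
augmented = augment(sample, random.Random(0))
```

In practice a library routine would normally handle these transforms, including shear; the sketch only makes the arithmetic of each step explicit.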

Table 1.

Total number of images in the coconut pest infection image dataset

Sl no	Image dataset class	Number of images
1	Eriophyid Mite	58
2	Rhinoceros Beetle	93
3	Red Palm Weevil	72
4	Rugose Spiraling White fly	87
5	Rugose in Mature plants	138

Sl. no.	Name of the Class	No. of images	New Augmented images
1	Eriophyid Mite damage	58	186
2	Red Palm Weevil damage	72	264
3	Rugose Spiraling White fly damage	87	244
4	Rhinoceros Beetle leaf symptoms	93	253
5	Rugose in Mature plants damage	138	318

Eq. no	Parameter	Equation
1	Energy	$\sum_{i, j = 0}^{N - 1} m(i, j)^{2}$
2	Correlation	$\sum_{i, j = 0}^{N - 1} m(i, j) \frac{(i - \mu)(j - \mu)}{\sigma^{2}}$
3	Dissimilarity	$\sum_{i, j = 0}^{N - 1} |i - j| \, m(i, j)$
4	Homogeneity	$\sum_{i, j = 0}^{N - 1} \frac{m(i, j)}{1 + (i - j)^{2}}$
5	Contrast	$\sum_{i, j = 0}^{N - 1} (i - j)^{2} \, m(i, j)$
6	Mean	$\sum_{i, j = 0}^{N - 1} i \, m(i, j)$
7	Standard Deviation	$\left(\sum_{i, j = 0}^{N - 1} (i - \mathrm{Mean})^{2} \, m(i, j)\right)^{\frac{1}{2}}$
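To make the GLCM parameters in the table above concrete, the following is a minimal sketch in plain Python. It assumes a small grey-level image stored as a nested list of integers, a single horizontal pixel offset, and a symmetric, normalized co-occurrence matrix m(i, j); the function names `glcm` and `glcm_features` are hypothetical and none of this is taken from the authors' code. Contrast is computed with the standard (i − j)² weighting.

```python
import math

def glcm(image, levels, dx=1, dy=0):
    """Symmetric, normalized co-occurrence matrix for one pixel offset.

    Assumes the image contains at least one valid pixel pair.
    """
    counts = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                a, b = image[r][c], image[r2][c2]
                counts[a][b] += 1  # pair counted in both directions,
                counts[b][a] += 1  # which makes the matrix symmetric
    total = sum(map(sum, counts))
    return [[v / total for v in row] for row in counts]

def glcm_features(m):
    """Texture features following the definitions in the table above."""
    n = len(m)
    cells = [(i, j, m[i][j]) for i in range(n) for j in range(n)]
    mean = sum(i * p for i, j, p in cells)
    var = sum((i - mean) ** 2 * p for i, j, p in cells)
    corr = (sum((i - mean) * (j - mean) * p for i, j, p in cells) / var
            if var > 0 else 0.0)  # guard for a constant image
    return {
        "energy": sum(p * p for i, j, p in cells),
        "correlation": corr,
        "dissimilarity": sum(abs(i - j) * p for i, j, p in cells),
        "homogeneity": sum(p / (1 + (i - j) ** 2) for i, j, p in cells),
        "contrast": sum((i - j) ** 2 * p for i, j, p in cells),
        "mean": mean,
        "std": math.sqrt(var),
    }

# Illustrative 4x4 image with 4 grey levels (not from the dataset).
example = [[0, 0, 1, 1], [0, 0, 1, 1], [0, 2, 2, 2], [2, 2, 3, 3]]
features = glcm_features(glcm(example, levels=4))
```

Because the matrix is symmetric, its row and column marginals coincide, which is why a single μ and σ² suffice in the correlation term.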

Eq. no	Parameter	Equation
1	Grey level Uniformity (GRU)	$\sum_{i = 1}^{N_g} \left(\sum_{j = 1}^{N_r} P_{ij}\right)^{2} / \sum_{i = 1}^{N_g} \sum_{j = 1}^{N_r} P_{ij}$
2	Long Run Emphasis (LRE)	$\sum_{i = 1}^{N_g} \sum_{j = 1}^{N_r} j^{2} P_{ij} / \sum_{i = 1}^{N_g} \sum_{j = 1}^{N_r} P_{ij}$
3	Short Run Emphasis (SRE)	$\sum_{i = 1}^{N_g} \sum_{j = 1}^{N_r} \frac{P_{ij}}{j^{2}} / \sum_{i = 1}^{N_g} \sum_{j = 1}^{N_r} P_{ij}$
4	Run Length Uniformity (RLU)	$\sum_{j = 1}^{N_r} \left(\sum_{i = 1}^{N_g} P_{ij}\right)^{2} / \sum_{i = 1}^{N_g} \sum_{j = 1}^{N_r} P_{ij}$
5	Run percentage (RP)	$\sum_{i = 1}^{N_g} \sum_{j = 1}^{N_r} P_{ij} / N$
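The run-length parameters in the table above can be sketched in the same way. The example below assumes horizontal runs only, an image given as a nested list of integer grey levels, and N taken as the number of pixels in the image; `glrlm` and `glrlm_features` are hypothetical names, not the authors' code. Here `P[i][j - 1]` plays the role of P_ij, the number of runs of grey level i with length j.

```python
def glrlm(image, levels):
    """Run-length matrix for horizontal runs: P[i][j-1] counts runs of
    grey level i that are exactly j pixels long."""
    max_run = len(image[0])
    P = [[0] * max_run for _ in range(levels)]
    for row in image:
        run_val, run_len = row[0], 1
        for v in row[1:]:
            if v == run_val:
                run_len += 1
            else:
                P[run_val][run_len - 1] += 1
                run_val, run_len = v, 1
        P[run_val][run_len - 1] += 1  # close the last run of the row
    return P

def glrlm_features(P, n_pixels):
    """Run-length features following the definitions in the table above."""
    n_runs = sum(map(sum, P))
    lengths = range(1, len(P[0]) + 1)
    return {
        "gray_level_uniformity": sum(sum(row) ** 2 for row in P) / n_runs,
        "long_run_emphasis":
            sum(p * j ** 2 for row in P for j, p in zip(lengths, row)) / n_runs,
        "short_run_emphasis":
            sum(p / j ** 2 for row in P for j, p in zip(lengths, row)) / n_runs,
        "run_length_uniformity":
            sum(sum(col) ** 2 for col in zip(*P)) / n_runs,
        "run_percentage": n_runs / n_pixels,  # assumes N = pixel count
    }

# Illustrative 4x4 image with 4 grey levels (not from the dataset).
example = [[0, 0, 1, 1], [0, 0, 1, 1], [0, 2, 2, 2], [2, 2, 3, 3]]
run_features = glrlm_features(glrlm(example, levels=4), n_pixels=16)
```

A fuller treatment would typically repeat this for several run directions (for example 0°, 45°, 90°, and 135°) and combine the results; the single direction used here is a simplification.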

Structure	Epoch	Training Accuracy	Training Loss	Validation Accuracy	Validation Loss	Testing Accuracy	Testing Loss
ANN 1	1	0.6047	1.0048	0.977	0.0928	0.9368	0.1743
	5	0.9042	0.2557	0.973	0.0799	0.9960	0.0269
	10	0.9466	0.1346	0.977	0.0548	0.9960	0.0170
	15	0.9575	0.1242	0.977	0.0592	0.9960	0.0211
	20	0.9723	0.0970	0.973	0.0682	0.9921	0.0127
ANN 2	1	0.6384	0.9456	0.973	0.0726	0.9368	0.2191
	5	0.8983	0.2845	0.981	0.0483	0.9684	0.0931
	10	0.9424	0.1720	0.985	0.0448	0.9553	0.1803
	15	0.9571	0.1395	0.977	0.0678	0.9789	0.0654
	20	0.9751	0.0848	0.985	0.0505	0.9895	0.0334

Model	Class	Precision	Recall	F1-score	Support	Confusion Matrix	Accuracy
1st ANN	0	1	1	1	35	[[35 0 0 0 0]	99%
	1	1	1	1	48	[0 48 0 0 0]
	2	0.98	1	0.99	59	[0 0 59 0 0]
	3	0.98	0.98	0.98	61	[0 0 1 60 0]
	4	1	0.98	0.99	50	[0 0 0 1 49]]
2nd ANN	0	0.94	0.96	0.95	52	[[50 0 1 1 0]	98%
	1	1	1	1	76	[0 76 0 0 0]
	2	0.98	0.99	0.98	86	[1 0 85 0 0]
	3	0.99	0.97	0.98	91	[2 0 1 88 0]
	4	1	1	1	75	[0 0 0 0 75]]

Structure	Epoch	Training Accuracy	Training Loss	Validation Accuracy	Validation Loss	Testing Accuracy	Testing Loss
ANN 1	1	0.7352	0.6433	0.9972	0.0162	0.9980	0.0080
	5	0.9822	0.0661	0.9986	0.0012	0.9960	0.0026
	10	0.9921	0.0260	1.000	0.0005	0.9921	0.0178
	15	0.9970	0.0111	1.000	0.0008	0.9723	0.1075
	20	0.9980	0.0076	0.9986	0.0032	0.9992	0.0029

Structure	Epoch	Training Accuracy	Training Loss	Validation Accuracy	Validation Loss	Testing Accuracy	Testing Loss
ANN 1	1	0.8073	0.0531	0.9944	0.0195	0.9960	0.0100
	5	0.9852	0.0452	0.9986	0.0031	0.9960	0.0070
	10	0.9970	0.0077	0.9944	0.0144	1.0000	0.0001
	15	0.9941	0.0163	0.9972	0.0062	1.0000	0.0024
	20	0.9993	0.0008	1.0000	0.0001	0.9960	0.0079

Data	Class	Precision	Recall	F1 score	Support	Confusion Matrix	Accuracy
GLRLM	0	1	1	1	34	[[34 0 0 0 0]	100%
	1	1	1	1	53	[0 53 0 0 0]
	2	1	1	1	66	[0 0 66 0 0]
	3	1	1	1	53	[0 0 0 53 0]
	4	1	1	1	47	[0 0 0 0 47]]
GLRLM + GLCM	0	1	1	1	34	[[34 0 0 0 0]	100%
	1	1	1	1	53	[0 53 0 0 0]
	2	1	1	1	66	[0 0 66 0 0]
	3	1	1	1	53	[0 0 0 53 0]
	4	1	1	1	47	[0 0 0 0 47]]

Model	Class	Precision	Recall	F1 score	Support	Confusion Matrix	Accuracy
SVM (RBF)	0	0.80	0.91	0.85	35	[[32 0 0 3 0]	90%
	1	1	0.98	0.99	48	[1 47 0 0 0]
	2	0.91	0.90	0.91	59	[3 0 53 1 2]
	3	0.87	0.87	0.87	61	[4 0 2 53 2]
	4	0.91	0.86	0.89	50	[0 0 3 4 43]]
SVM (POLY)	0	0.75	0.26	0.38	35	[[9 0 1 25 0]	75%
	1	1	0.96	0.98	48	[2 46 0 0 0]
	2	0.95	0.59	0.73	59	[1 0 35 21 2]
	3	0.52	0.97	0.67	61	[0 0 1 59 1]
	4	0.93	0.82	0.87	50	[0 0 0 9 41]]

Model	Class	Precision	Recall	F1 score	Support	Confusion Matrix	Accuracy
Decision Tree	0	0.84	0.89	0.86	35	[[31 1 0 3 0]	90%
	1	0.94	1	0.96	48	[0 48 0 0 0]
	2	0.92	0.92	0.92	59	[2 0 54 2 1]
	3	0.85	0.85	0.85	61	[3 2 2 52 2]
	4	0.93	0.84	0.88	50	[1 0 3 4 42]]
Naïve Bayes	0	0.41	0.69	0.52	35	[[24 2 1 8 0]	58%
	1	0.88	0.90	0.89	48	[5 43 0 0 0]
	2	0.69	0.58	0.63	59	[10 0 34 8 7]
	3	0.32	0.21	0.25	61	[16 4 12 13 16]
	4	0.59	0.69	0.62	50	[3 0 2 12 33]]
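The precision, recall, F1-score, and accuracy columns in these tables follow directly from the confusion matrix. The sketch below shows the arithmetic using the Decision Tree matrix from the table above; it assumes that rows are true classes and columns are predicted classes (the convention is not stated in the text), and `per_class_metrics` is a hypothetical helper name.

```python
def per_class_metrics(cm):
    """Return (list of per-class dicts, overall accuracy) for a square
    confusion matrix whose rows are assumed to be the true classes."""
    n = len(cm)
    total = sum(map(sum, cm))
    results = []
    for k in range(n):
        tp = cm[k][k]
        support = sum(cm[k])                       # true samples of class k
        predicted = sum(cm[r][k] for r in range(n))  # samples predicted as k
        precision = tp / predicted if predicted else 0.0
        recall = tp / support if support else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        results.append({"precision": precision, "recall": recall,
                        "f1": f1, "support": support})
    accuracy = sum(cm[k][k] for k in range(n)) / total
    return results, accuracy

# Decision Tree confusion matrix copied from the table above.
dt_cm = [
    [31, 1, 0, 3, 0],
    [0, 48, 0, 0, 0],
    [2, 0, 54, 2, 1],
    [3, 2, 2, 52, 2],
    [1, 0, 3, 4, 42],
]
dt_metrics, dt_accuracy = per_class_metrics(dt_cm)
```

For class 0 this gives a precision of 31/37 and a recall of 31/35, which round to the 0.84 and 0.89 reported in the table, and the overall accuracy is 227/253, or about 90%.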

Sl. no.	Author	Plant	Features	Algorithm	Testing Accuracy
1	Singh et al., (2021)	Coconut	Deep Feature	(MobileNet)	82.10%
2	Manjula, (2021)	Coconut	RGB	Image Processing	90%
3	Nesarajan et al., (2020)	Coconut	Deep Feature	CNN and SVM	93.72% and 93.72%
4	Chandy, (2019)	Coconut	Deep feature	DL Algorithm	not known
5	Proposed Method	Coconut	GLRLM	ANN1	Almost 100%
6	Proposed Method	Coconut	GLCM + GLRLM	ANN1	Almost 100%
7	Proposed Method	Coconut	GLCM + GLRLM	SVM(RBF)	Almost 100%
8	Proposed Method	Coconut	GLRLM	SVM(RBF)	99%
9	Proposed Method	Coconut	GLCM + GLRLM	DT	99%

Model	Class	Precision	Recall	F1 score	Support	Confusion Matrix	Accuracy
SVM (RBF)	0	1	0.97	0.99	34	[[33 1 0 0 0]	100%
	1	0.98	1	0.99	53	[0 53 0 0 0]
	2	1	1	1	66	[0 0 66 0 0]
	3	1	1	1	53	[0 0 0 53 0]
	4	1	1	1	47	[0 0 0 0 47]]
SVM (POLY)	0	1	0.88	0.94	34	[[30 1 0 3 0]	95%
	1	0.98	0.92	0.95	53	[0 49 0 4 0]
	2	1	0.95	0.98	66	[0 0 63 0 3]
	3	0.88	0.98	0.93	53	[0 0 0 52 1]
	4	0.92	1	0.96	47	[0 0 0 0 47]]

sl no	Model	Data Features	Accuracy
1	ANN (1st)	GLCM	99%
2	ANN(2nd)	GLCM	98%
3	SVM(RBF)	GLCM	90%
4	SVM(POLY)	GLCM	75%
5	DT	GLCM	90%
6	NB	GLCM	58%
7	ANN (1st)	GLRLM	100%
8	ANN (1st)	GLCM + GLRLM	100%
9	SVM(RBF)	GLRLM	99%
10	SVM(POLY)	GLRLM	96%
11	DT	GLRLM	98%
12	NB	GLRLM	83%
13	SVM(RBF)	GLCM + GLRLM	100%
14	SVM(POLY)	GLCM + GLRLM	95%
15	DT	GLCM + GLRLM	99%
16	NB	GLCM +GLRLM	88%
