Gray level co-occurrence matrix (GLCM) texture based crop classification using low altitude remote sensing platforms

Naveed Iqbal; Rafia Mumtaz; Uferah Shafi; Syed Mohammad Hassan Zaidi

doi:10.7717/peerj-cs.536

. 2021 May 19;7:e536. doi: 10.7717/peerj-cs.536

Gray level co-occurrence matrix (GLCM) texture based crop classification using low altitude remote sensing platforms

Naveed Iqbal ^1,^✉,^#, Rafia Mumtaz ^1,^#, Uferah Shafi ¹, Syed Mohammad Hassan Zaidi ¹

Editor: Tariq Masood

PMCID: PMC8176538 PMID: 34141878

Abstract

Crop classification in early phenological stages has been a difficult task due to spectrum similarity of different crops. For this purpose, low altitude platforms such as drones have great potential to provide high resolution optical imagery where Machine Learning (ML) applied to classify different types of crops. In this research work, crop classification is performed at different phenological stages using optical images which are obtained from drone. For this purpose, gray level co-occurrence matrix (GLCM) based features are extracted from underlying gray scale images collected by the drone. To classify the different types of crops, different ML algorithms including Random Forest (RF), Naive Bayes (NB), Neural Network (NN) and Support Vector Machine (SVM) are applied. The results showed that the ML algorithms performed much better on GLCM features as compared to gray scale images with a margin of 13.65% in overall accuracy.

Keywords: Machine learning, Remote sensing, Texture analysis, Classification, Unmanned aerial vehicles, Feature extraction, GLCM

Introduction

Remote sensing is a valuable tool in evaluating, monitoring, and management of land, water, and crop resources. Satellite imagery and aerial imagery have wide applications in the field of agriculture, monitoring snow cover trends, wildfire trends, water level rising, and forestry. In agriculture, crop health monitoring, yield estimation, classification of crops based on land cover, and monitoring of droughts are some common applications of remote sensing (Navalgund, Jayaraman & Roy, 2007; Seelan et al., 2003; Hufkens et al., 2019; Sivakumar et al., 2004). Among these applications, crop classification is quite challenging due to the texture and color similarity of crops in the initial stages. For this purpose, satellite data is commonly used which provides free access to the data. The data obtained from such platforms is coarse in a resolution which makes it difficult to classify the different types of crops. Apart from a coarse resolution of satellite data, the effect of atmospheric particles and cloud cover percentage in the image, if greater than 90% will result in discarding the images as no valuable information can be extracted from the satellite for these images.

The low-cost Un-manned Aerial Vehicles (UAV) are the substitute of the satellite platforms which provide high-resolution data with flexibility in data collection. After high-resolution images acquisition, several Machine/Deep Learning (ML/DL) algorithms are applied to classify the different types of crops. A lot of applications use texture information as features that are further given as input to the ML classification algorithms. The texture features provide useful insights into the color, its spatial arrangement, and intensities.

In Kwak & Park (2019), the crop classification based on texture features is performed on the data collected by a drone mounted with a multi-spectral camera. The acquired images are up-scaled to 25 cm resolution and mosaiced later to obtain a complete field of view. To extract texture features, Gray Level Co-occurrence Matrix (GLCM) at different kernel sizes is used including 3 × 3, 15 × 15, and 31 × 31. The mosaiced images act as an input to classification algorithms, such as Random Forest and Support Vector Machine (SVM). It is seen that using textural features obtained from larger kernel size showed improvement in classification results by 7.72% in overall accuracy rather than only using spectral information for classification.

Similarly in Böhler, Schaepman & Kneubühler (2018), texture base classification of crops is performed at pixel and parcel-based level where the crops in the study are maize, bare soil, sugar beat, winter wheat, and grassland. The images are acquired by eBee UAV in four flights of 30 min each on 26 June 2015. Textural features are extracted from the obtained UAV images. Random forest algorithm is applied after extracting the texture features which obtained an overall accuracy of 86.3%.

In this study, we performed a classification of four different types of crops including wheat, soybean, rice, maize. The main objective of this research is to investigate the texture feature-based crop classification of different crops having the same spatial texture and colors. The high-resolution optical images are acquired by the drone. The multiple texture features are extracted including contrast, homogeneity, dimensionality, angular second moment, energy, and correlation. To perform classification, Support Vector Machine, Naive Bayes, Random Forest, and Neural Network are applied on the grayscale images and the texture features.

Related Work

Crop classification traditional techniques

Over the two decades, a lot of research has been done in the agriculture domain to perform different agricultural activities, particularly in crop disease detection, crop health monitoring, crop yield estimation, crop classification, and others (Latif, 2018). To perform these activities; machine learning or deep learning techniques are applied to the data collected from satellite, drone or IoT sensors which are discussed in the sections below.

Crop classification using satellite data

An analysis on crop classification and land cover is presented in Khaliq, Peroni & Chiaberge (2018), in which Sentinel-2 is used to capture the multi-spectral imagery. The phonological cycle of crops is analyzed by computing the NDVI of time series spectral imagery data. The ‘Random Forest’ classifier is used to classify the land cover where NDVI values are used as feature vectors. The Random Forest shows 91.1% classification accuracy i.e. predicted land cover match with the actual ground cover. In Deng et al. (2019), land cover classification is performed using Random Forest as a classifier. The images are acquired from two satellites including Landsat 8 and MODIS. These images are fused based on Enhanced Spatial-Temporal and Fusion Model to generate time series-based Landsat-8 images. The data from the GF-1 satellite and Google Earth is used as supporting data for training and validation. In this research work, object base classification is used instead of pixel-based classification. The classification results show an accuracy of 94.38% on the fused data.

In Luciani et al. (2017), an analysis on crop classification is presented in which Landsat-8 OLI is used to capture the multispectral imagery at a coarse spatial resolution of 30 m. The acquired images are resampled to 15 m spatial resolution using the pan-sharpening technique. The phenological profile of crops is extracted by processing NDVI of time series spectral imagery data. The phenological profile is extracted based on pixel-level and interpolation is used for the reconstruction of missing NDVI value at a particular pixel. The univariate decision tree is applied to the data where the feature vector consists of NDVI values. Results show that the univariate decision tree achieved an accuracy of 92.66%.

There are a lot of datasets that are publicly available for land classification. In

Helber et al. (2018), land classification is performed using the publicly available dataset ‘EuroSAT’ which is comprised of 27,000 labeled examples covering 10 distinctive classes. Each image patch is 64 × 64 pixels which is collected from 30 cities in European Urban Atlas. For classification, the data set is divided in the ratio of 80 to 20 which is used for training and testing respectively. Two deep learning architectures such as ‘GoogLeNet’ and ResNet-50 are trained on the dataset which achieved an accuracy of 98.18% and 98.57% respectively.

In Hufkens et al. (2019), the health of the wheat crop is monitored using near-surface imagery captured by a smartphone. Images are collected from 50 fields by smartphone during the complete life cycle of the wheat crop. Each day, farmers captured images three times and captured images are transmitted to the cloud where the green level is assessed by green chromatic coordinates. The crop is classified as healthy or unhealthy based on the green level. Subsequently, the classification result is compared with Landsat 8 imagery in which classification of healthy and unhealthy crops is performed based on Normalized Difference Vegetation Index (NDVI) and Enhanced Vegetation Index (EVI) values. Results show that there is a small deviation between the classification results based on smartphone imagery and satellite imagery.

Crop classification using drone data

Textural features from an image help to extract useful information from the images. In Liu et al. (2018), the experimental area is selected in Minzhu Township, Daowai District, Harbin, where the variety of crops are planted. The 12 types of cropland cover in the study include rice, unripe wheat, ripe wheat, harvested wheat, soybean, corn, trees, grassland, bare land, houses, greenhouses, and roads. The measurement and marking of Ground Control Points (GCP) are conducted on 3 August 2017 and data is collected on 4 August 2017 using a fixed-wing UAV with a Sony Digital Camera. Digital Surface Model(DSM) and Digital Orthophoto Map (DOM) are produced with the help of POS data and GCP. Texture features mean, variance, homogeneity, contrast, dissimilarity, entropy, second moment, and correlation are extracted using ENVI software for RGB and DSM bands. SVM is used to perform the classification of crops with RBF kernel. The combination of different features is performed to see the impact of each feature. By using RGB resulted in a classification accuracy of 72.94% and a combination of RGB, DSMs, Second Moment of green band, DSMs variance (27 * 27), DSMs contrast (27 * 27) achieved an accuracy of 94.5%. The results show that the hard to differentiate classes in color space became separable by adding altitude as a spatial feature where height for each tree, crop, and grass differs.

In Hu et al. (2018), a hyper-spectral imaging sensor is mounted on a UAV to offer images at a higher spatial and higher spectral resolution at the same interval. The study area chosen is a field in the southern farm in Honghu city, located in China. The images are taken from the altitude of 100 m at a spatial resolution of 4 cm with 274 spectral bands. To fully utilize the potential of the spatial and spectral resolution of the image, a combination of the CNN-CRF model is proposed, to classify crops accurately. For this to work, in preprocessing phase, the Principal Component Analysis (PCA) is performed for dimensionality reduction of the data while in meantime preserving spectral information. Each patch on the image will be passed to CNN as input, to get the rule image from the PCA. The rule image, which is the output of CNN will be passed to the CRF model to generate a classification map of the output. The CNN-CRF model achieved an accuracy of 91.79% in classifying different crop types.

Image fusion between satellite and UAV can help in the classification of crops at a detailed level. In Zhao et al. (2019), a fusion between Sentinel-2A satellite and images acquired from fixed-wing Agristrong UAV drone is performed to get the image at high spatial, high spectral, and high temporal resolution. For this purpose, an experimental area covering around 750 ha is selected in Harbin city, Heilongjiang province, China. The crop types in the current study include rice, soybean, corn, buckwheat, other vegetation, bareland, greenhouses, waters, houses, and roads. The images are acquired using a UAV drone for 14 September 2017 at 0.03 m resolution and Sentinel-2A images for 16 September 2017 are downloaded. The high-resolution 0.03 m images are sub-sampled at lower resolution (0.10 m, 0.50 m, 1.00 m, and 3.00 m). The fusion between UAV images at different resolutions and Sentinel-2A images is performed using Gram-Schmidt transformation (Laben & Brower, 2000). Random forest algorithm performed better crop classification for a fused image at 0.10 m with accuracy at 88.32%, whereas without fusion the accuracy is at 76.77% and 71.93% for UAV and Sentinel-2A images respectively.

In Böhler, Schaepman & Kneubühler (2018), classification of crops is done at pixel and parcel-based level. The study area covering 170 hectares is selected in the Canton of Zurich, Switzerland. The crops in the study are maize, bare soil, sugar beat, winter wheat, and grassland. The images are acquired by eBee UAV in four flights of 30 min each on 26 June 2015. Subsequently, the textural features are extracted from the obtained UAV images. The random forest algorithm is applied to the extracted features and crop maps are generated where the object-based classification resulted in the overall accuracy of 86.3% for the overall set of crop classes.

Deep learning for crop classification

In Trujillano et al. (2018), a deep learning network is used to classify the corn crop in the region of Peru, Brazil. The images are acquired for two locations where the first location contained corn plots, trees, river, and other crops situated in a mountainous region, where flight is conducted at 100 and 180 m respectively. The second location is a coastal area where images are acquired at an altitude of 100 m, area consists of a corn crop and some nearby houses. The multi-spectral camera mounted on the UAV acquired images in 5 different bands, at a spatial resolution of 8 cm. Photoscan tool is used to generate the mosaic of the image. The image is divided into a patch size of 28 × 28, covering two rows of the cornfields. The patch is labeled as corn or no corn field. Four datasets are generated from the acquired images where dataset #1 and dataset #2 covered classes with images acquired at an altitude of 100 m and 180 m. The dataset #3 merged the corn classes from different altitude flight images whereas, in dataset #4, the dataset #1 is augmented which included rotation and flipping of images. Each dataset containing 28 × 28 patches of images is trained using the LeNet model, in which dataset number two achieved an accuracy of 86.8% on the test set.

In Zhou, Li & Shao (2018), the various types of crop classification methods are proposed using CNN and SVM algorithms. For this purpose, Yuanyang Country, in the province of Henan, China is selected as a study area where the main crops in the region are rice, peanut, and corn. The Sentinel-2A images are acquired for two dates, where all the bands data has been resampled to 10 m resolution and the resultant stack of the 26-dimensional image is generated. A ground survey is conducted in August 2017, for the labeling of different types of crops. Around 1,200 pixels are selected for training and the rests of the pixels are used for validation. The labeled pixel in the final stack image is converted to grayscale which is given as an input to the model. The CNN outperformed the SVM, where it clearly shows the deep learning-based model is better at learning the features while achieving an accuracy of 95.61% in the classification of crops. In Sun et al. (2020), an application for smart home is presented. The application monitors the moisture of the soil and the value of nitrogen, phosphorous, and potassium for an indoor plant with the help of IoT sensors. The value is classified based on various levels and provides feedback to the user with help of the dashboard. The system designed is a prototype, which helps the farmers when to irrigate the crop and what ratio of the value of nutrients is suitable for the specific plant. Water content estimation in plant leaves can help in the productivity of the crops. In Zahid et al. (2019), a novel approach based on machine learning is presented to estimate the health status of the plant leaves terahertz waves by measuring transmission response for four days. Each frequency point recorded is used as a feature in the study. Feature selection was carried out to discard any irrelevant feature that could result in the wrong prediction of water content in the leaves. The support vector machine (SVM) algorithm clearly performed better at predicting the accurate water content in the leaves for 4 days.

The work proposed in this paper will process the optical images acquired by UAV by data augmentation for the crop class with very few images. The processed images will be converted to grayscale downscaled to a low resolution. The textural features will be extracted from the grayscale images. Crop classification will be performed by using machine learning and deep learning algorithms for grayscale and textural-based images. With the evaluation measure, we will compare and evaluate the performance of how GLCM based textural features will outperform the ones with grayscale images. In this work, the main focus is how textural features will be helpful to distinguish between different types of crops compared to grayscale images. The paper is organized as follows, where a literature review is conducted in “Related Work”, data set used in the study along with methodology is discussed in “Methodology”, results and discussion in “Results and Discussion” and conclusion and future work in “Conclusion and Future Work”.

Methodology

Normalized Difference Vegetation Index (NDVI) is known as a standard index used in remote sensing for identifying chlorophyll content in an image based on Near Infra band (NIR) and Red (R) bands. It is quite challenging to differentiate the crop based on NDVI values, because various crops have similar profile. For instance, it is hard to discriminate between wheat and maize crop based on NDVI profile acquired from satellite imagery. In order to address the problem of differentiating different crops based on NDVI profile, UAV optical imagery is collected and GLCM features are extracted from the images. In this study, machine learning and deep learning algorithms are applied for classification of crops and GLCM-based texture features are used as an additional features to help in the classification. A comparison is also performed between classification with gray scale based images and GLCM features based classification. This section provide the details of study area for experiment and then discusses the methodology based on modules of our experiment.

Study area & data set

To perform crop classification, an experimental area in the capital of Pakistan, Islamabad located at the National Agriculture Research Center (NARC) is selected. In the NARC region, various types of crops are grown throughout the year and experiments are performed. For our research, we selected four crops wheat, maize, rice, and soybean as shown in Fig. 1. The crop calendar for Pakistan can be viewed at Crop Calendar (2020), where the particular locations of the crops in the study along with their growth cycle is enlisted in Table 1. The climate of Islamabad is a humid subtropical climate with an average rainfall of 790.8 mm.

Table 1. List of crops selected in study area.

Crop	Crop-cycle	Location
Wheat-I	Dec-18 to Jun-19	30°40′ 22.25″ N, 73°07′ 18.28″ E
Rice	Jun-19 to Oct-19	30°40′ 25.19″ N, 73°07′ 27.93″ E
Soybean	Jul-19 to Dec-19	33°40′ 34.46″ N, 73°08′ 10.20″ E
Wheat-II	Nov-19 to May-20	33°40′ 17.29″ N, 73°07′ 48.98″ E
Maize	Mar-19 to Jul-19	33°40′ 18.69″ N, 73°07′ 37.84″ E

Characteristics	Technical specifications
Type	Four-rotor electric UAV
Weight	1,368 g
Manufacturer	DJI
Model	FC6310
Operating Temperature	0° to 40°
Camera Sensor	1″ CMOS
Image Size	4,864 × 3,648
Flight Duration	30 min
Battery	5,870 mAH LIPo 4S

Crop	Stage	Acquisition date	Acquisition time	Altitude	Images count
Wheat-I	Max Maturity	16-May-2019	12:20 PM	70 foot	41
Rice	Max-Tiller	03-Sept-2019	12:15 PM	120 foot	3
Soybean	V2 Stage	03-Sept-2019	12:40 PM	70 foot	20
Wheat-II	Tiller Stage	02-March-2020	01:30 PM	70 foot	20
Maize	Max Maturity	24-July-2019	01:15 PM	70 foot	39

Class	Soybean	Rice	Wheat-T	Wheat	Maize	PA (%)
Soybean	0	0	0	9	0	0
Rice	0	5	0	0	0	100
Wheat-T	0	0	5	0	0	100
Wheat	0	0	0	13	1	92.9
Maize	0	0	3	0	8	72.7
UA (%)	0	100	62.5	59.1	88.9
OAA (%)	70.45%

Class	Soybean	Rice	Wheat-T	Wheat	Maize	PA (%)
Soybean	6	0	0	3	0	66.7
Rice	0	5	0	0	0	100
Wheat-T	0	0	5	0	0	100
Wheat	1	1	0	12	0	85.7
Maize	0	1	1	0	9	81.8
UA (%)	85.7	71.4	83.3	80	100
OAA (%)	84.1%

Class	Soybean	Rice	Wheat-T	Wheat	Maize	PA (%)
Soybean	1	0	0	8	0	12.5
Rice	0	5	0	0	0	100
Wheat-T	0	0	5	0	0	100
Wheat	0	0	0	14	0	100
Maize	0	0	0	0	11	100
UA (%)	100	100	100	63.6	100
OAA (%)	81.82%

Class	Soybean	Rice	Wheat-T	Wheat	Maize	PA (%)
Soybean	5	0	0	4	0	55.6
Rice	0	5	0	0	0	100
Wheat-T	0	0	5	0	0	100
Wheat	0	0	0	14	0	100
Maize	0	0	0	0	11	100
UA (%)	100	100	100	77.8	100
OAA (%)	90.91%

Class	Accuracy (%)		Precision		Recall		F-1 Score
Class	Gray scale	GLCM	Gray scale	GLCM	Gray scale	GLCM	Gray scale	GLCM
Soybean	79.55	91.11	0.0	0.67	0.0	0.86	0.0	0.75
Rice	100	95.56	1.0	1.0	1.0	0.71	1.0	0.83
Wheat-T	93.18	97.78	1.0	1.0	0.63	0.83	0.77	0.91
Wheat	77.27	86.67	0.93	0.80	0.59	0.80	0.72	0.80
Maize	90.91	93.33	0.73	0.82	0.89	0.90	0.80	0.86

PERMALINK

Gray level co-occurrence matrix (GLCM) texture based crop classification using low altitude remote sensing platforms

Naveed Iqbal

Rafia Mumtaz

Uferah Shafi

Syed Mohammad Hassan Zaidi

Abstract

Introduction

Related Work

Crop classification traditional techniques

Crop classification using satellite data

Crop classification using drone data

Deep learning for crop classification

Methodology

Study area & data set

Figure 1. Crops marked in © Google Earth (NARC Region).

Table 1. List of crops selected in study area.

Figure 2. System architecture.

Table 2. Specifications of UAV drone used in the study.

Table 3. Crop fields images acquired at various stage of crop cycle.

Figure 3. Crops optical images captured by using DJI Phantom.

Data pre-processing

Feature extraction

Crop classification

Naive Bayes classifier

Neural network

Support vector machines

Random forest classifier

Convolutional Neural Network (CNN)

Long Short Term Memory (LSTM) netwrok

Evaluation metrics

Producer accuracy

User accuracy

Overall accuracy

Precision

Recall

F1-score

Accuracy

Figure 4. Confusion matrix.

Results and Discussion

Table 4. Confusion matrix for classification performed on grayscale images using SVM.

Table 5. Confusion matrix for classification performed on generated textures features images using SVM.

Table 6. Confusion matrix for classification performed on grayscale images using Random Forest Classifier.

Table 7. Confusion matrix for classification performed on GLCM features using Random Forest Classifier.

Table 8. Confusion matrix for classification performed on gray scale images using Naive Bayes Classifier.

Table 9. Confusion matrix for classification performed on generated textures features images using Naive Bayes Classifier.

Table 10. Confusion matrix for classification performed on gray scale images using Neural Networks.

Table 11. Confusion matrix for classification performed on generated textures features images using Neural Networks based classifier.

Table 12. Confusion matrix for classification performed on gray scale images using LSTM.

Table 13. Confusion matrix for classification performed on generated textures features images using LSTM.

Table 14. Confusion matrix for classification performed on gray scale images using CNN.

Table 15. Confusion matrix for classification performed on generated textures features images using CNN.

Table 16. Precision, Recall & F1-Score on gray scale images and texture images using SVM.

Table 17. Precision, Recall & F1-Score on gray scale images and texture images using Random Forest Classifier.

Table 18. Precision, Recall & F1-Score on gray scale images and texture images using Naive Bayes Classifier.

Table 19. Precision, Recall & F1-Score on gray scale images and texture images using Neural Networks.

Table 20. Precision, Recall & F1-Score on gray scale images and texture images using LSTM.

Table 21. Precision, Recall & F1-Score on gray scale images and texture images using CNN.

Conclusion and Future Work

Acknowledgments

Funding Statement

Additional Information and Declarations

Competing Interests

Author Contributions

Data Availability

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases