Sensors (Basel, Switzerland). 2023 Aug 1;23(15):6844. doi: 10.3390/s23156844

GACN: Generative Adversarial Classified Network for Balancing Plant Disease Dataset and Plant Disease Recognition

Xiaotian Wang 1,2, Weiqun Cao 1,2,*
Editor: Marcin Woźniak
PMCID: PMC10422207  PMID: 37571626

Abstract

Plant diseases are a critical threat to the agricultural sector. Therefore, accurate plant disease classification is important. In recent years, some researchers have used synthetic images generated by GANs to enhance plant disease recognition accuracy. In this paper, we propose a generative adversarial classified network (GACN) to further improve plant disease recognition accuracy. The GACN comprises a generator, discriminator, and classifier. The proposed model not only enhances convolutional neural network performance by generating synthetic images to balance plant disease datasets, but its classifier can also be applied directly to plant disease recognition tasks. Experimental results on the PlantVillage and AI Challenger 2018 datasets show that the contribution of the proposed method to improving the discriminability of the convolutional neural network is greater than that of the label-conditional methods CGAN, ACGAN, BAGAN, and MFC-GAN. The accuracy of the trained classifier for plant disease recognition is also better than that of plant disease recognition models studied on public plant disease datasets. In addition, we conducted several experiments to observe the effects of different numbers and resolutions of synthetic images on the discriminability of the convolutional neural network.

Keywords: deep learning, generative adversarial network, data augmentation, plant disease recognition

1. Introduction

Agriculture is one of the most important food sources for human beings. With the rapid growth of the global population, agriculture has become increasingly important. Agricultural security has an important impact on people worldwide, especially in areas where agricultural technology is underdeveloped. Plant diseases seriously hinder agricultural production and affect food quality. Accurate plant disease recognition is crucial to ensure food security, especially in less developed countries where agricultural experts are scarce. With the spread of the internet and smartphones, agricultural practitioners can take photos of plant diseases and use plant disease recognition software to correctly classify disease types. This can reduce reliance on agricultural experts and increase productivity in the agricultural sector.

With the rapid development of convolutional neural networks (CNNs), remarkable progress has also been made in plant disease recognition tasks. Convolutional neural networks make full use of the end-to-end learning mode and surpass traditional machine learning methods in plant disease recognition accuracy. Brahimi et al. [1] proposed a CNN model for tomato disease classification on a tomato disease dataset containing nine diseases and 14,828 images. Fuentes et al. [2] adopted ResNet as the network backbone and proposed a local and global class annotation method to improve recognition accuracy. Ma et al. [3] designed a deep CNN model for cucumber disease image recognition; in this study, the accuracy of identifying four cucumber diseases was 93.4%. Bhattacharya et al. [4] used a CNN model to classify bacterial blight, blast, and brown mark diseases of rice with an accuracy of 78.44%. Huang et al. [5] proposed first separating leaves from the background and then using a pretrained universal classification model to classify diseases, with an accuracy of 87.45% on the AI Challenger dataset. Mishra et al. [6] proposed a real-time recognition method for corn leaf disease based on a deep convolutional neural network, which reached 88.46% accuracy. Wang et al. [7] proposed a trilinear convolutional neural network model, which reached 84.1% accuracy. Chen et al. [8] added an SE attention module to YOLOv5 to enhance recognition accuracy, and the model achieved detection rates of 86.5% and 86.8% for powdery mildew and anthracnose, respectively.

However, the accuracy of a deep learning model depends on the quality and quantity of the dataset. Because some datasets are too small or have unbalanced classes, the accuracy of deep learning models suffers, so some researchers use transfer learning techniques to classify plant diseases. Fang et al. [9] proposed an instance-based transfer learning method to solve the problem of insufficient training samples of agricultural disease images. Wang et al. [10] pretrained a CNN on the PlantVillage dataset and fine-tuned it on their own plant disease dataset; the experimental results show that combining a CNN with transfer learning can improve classification accuracy on small datasets. Zhang et al. [11] proposed pretraining GoogLeNet on the ImageNet dataset and then fine-tuning it on a dataset of 1200 cherry leaf disease images, achieving 99.6% accuracy. Verma et al. [12] fine-tuned a pretrained ResNet18 network on a grape leaf disease dataset to accurately identify grape disease severity. Chen et al. [13] proposed a MobileNet with added SE attention modules and increased plant disease recognition accuracy through twice-transfer learning. Vallabhajosyula et al. [14] proposed a deep ensemble neural network method to detect plant diseases and fine-tuned pretrained models using transfer learning techniques.

The above work has made remarkable progress in plant disease recognition. However, building a plant disease image dataset requires the participation of many agricultural experts and is laborious and time-consuming. A class imbalance problem often arises during data collection; that is, the number of samples in some classes is significantly smaller than that in others. Training a recognition model on a class-imbalanced dataset biases it toward the majority classes. Many plant disease datasets therefore suffer from both insufficient quantity and imbalance among classes.

Generative adversarial networks (GANs) [15] have been used to synthesize images with high visual fidelity. The high-quality samples synthesized by GAN models can now be used as additional training data for tasks such as classification [16,17] and data augmentation. Data augmentation is a common technique used to synthesize more training data, which can enhance the generalization ability of the model. In image processing, data augmentation techniques usually include image flipping [18], random cropping [19], and color enhancement [20]. In image classification tasks, models trained on class-imbalanced datasets are often biased toward the majority class. This problem can be ameliorated by applying augmentation techniques to the minority classes. Some works [21,22,23,24,25,26] use a GAN to expand the dataset or to address class-imbalanced datasets. Refs. [21,26] use CGAN to synthesize images to augment and balance datasets, but experiments in this paper show that synthetic images from CGAN have a low level of accuracy. The methods in [22,23,24,25] use nonlabel-conditional GANs to augment or balance plant disease datasets, but the disadvantage of these methods is that a GAN must be trained separately for each class. In addition, ref. [27] used transfer learning on samples synthesized by a GAN to enhance the classification accuracy of convolutional neural networks.

However, the above methods suffer from low accuracy of synthetic images and complex training processes. To solve the problem of unbalanced plant disease datasets, we propose a generative adversarial classified network (GACN) to enhance the classification accuracy of convolutional neural networks. The GACN aims to further improve the contribution of synthetic images to the discriminability of a specific classification convolutional neural network. The synthetic images of the proposed method have higher accuracy, and the proposed method can generate synthetic images of any class after a single training run. The GACN consists of a generator, discriminator, and classifier. The generator is used to synthesize images. The discriminator distinguishes between real images and synthetic images as well as possible. The classifier is designed to correctly classify real images and synthetic images. The GACN can be directly applied to plant disease recognition tasks, or the generated synthetic images can be used to balance the dataset to improve CNN accuracy. We evaluate our method using the PlantVillage [28] and AI Challenger 2018 datasets. We compare the conditional generative adversarial network (CGAN) [29], auxiliary classifier GAN (ACGAN) [30], multiple fake class GAN (MFC-GAN) [31], balancing GAN (BAGAN) [32], and ControlGAN [33] methods on the task of balancing plant disease datasets. These five methods are existing label-conditional methods, which can output images according to the label. The difference between the GACN and the above label-conditional GANs is the addition of a classifier. The GACN is designed to enhance the discriminability of a specific classification CNN, so the classifier structure needs to be the same as that of the specific classification CNN. The classifier of the GACN is trained on both synthetic and real images, and a loss function for predicting the real image class is added to the generator, which encourages the generator to produce more accurate synthetic images. The GACN addresses the problem that existing label-conditional GANs do not encourage the generator to generate synthetic images with higher accuracy for a specific classification CNN. The experimental results show that the GACN performs better than these label-conditional GANs in balancing plant disease datasets. In addition, compared with other plant disease recognition models studied on public plant disease datasets, the classifier of the proposed method achieves higher classification accuracy.

The contributions of this paper are summarized as follows:

  • A dual-purpose model, GACN, is proposed in this paper to improve the accuracy of plant disease recognition tasks. It can classify plant diseases directly or generate synthetic images that can be used to balance plant disease datasets to improve CNN accuracy.

  • The proposed GACN classifier is applied to the plant disease recognition task, and its accuracy exceeds that of current methods studied on public plant disease recognition datasets.

  • The accuracy of the synthetic images generated by the proposed GACN model and its performance in balancing datasets are better than those of existing label-conditional GANs.

The remainder of this paper is organized as follows: Section 2 introduces label-conditional GANs and plant disease recognition methods based on GANs. The proposed GACN method is described in Section 3. Section 4 discusses the performance of the proposed classifier on the plant disease recognition task and the performance of the GACN in balancing plant disease datasets. Section 5 concludes the paper and outlines future work.

2. Related Work

2.1. Label-Conditional GANs

The GAN comprises a generator and a discriminator. The purpose of the generator is to synthesize samples realistic enough to fool the discriminator, and the purpose of the discriminator is to distinguish as well as possible between real samples and synthetic samples. The generator and discriminator are trained against each other to reach a state of equilibrium. Addressing the problem that a GAN cannot synthesize samples with labels, Mirza proposed CGAN in 2014. In a standard GAN, there are no restrictions on the synthetic sample, so a sample of a given class cannot be accurately synthesized. To address this issue, CGAN conditions the generator on additional information to direct the sample generation process. The CGAN can synthesize samples of any specified class. Therefore, CGAN can be instructed to synthesize samples with specific labels to balance different classes of samples in the dataset. As a variant of CGAN, ACGAN adds a loss function for correct sample classification to the discriminator, so it can synthesize higher-quality conditional samples. ControlGAN adds an additional classifier but does not add a loss function to encourage the generator to synthesize more accurate images. MFC-GAN uses fake classes to ensure the accuracy of minority class generation. BAGAN combines processes such as autoencoder training to synthesize samples. However, these methods do not consider the classification network structure when generating images, and the generators are not encouraged to produce more accurate synthetic samples, so the synthetic samples cannot further enhance the performance of the classification models.

2.2. Plant Disease Recognition Model Based on GAN

Bird et al. [21] proposed a method to enhance fruit quality classification accuracy using CGAN. The experiment showed that an accuracy of 88.75% was obtained by training with synthetic image augmentation. Zhou et al. [22] proposed a GAN-based method for grape leaf spot recognition. The study generated 1000 local spot images per class and achieved 96.27% accuracy on ResNet50 by mixing the synthetic images with the real images. Lamba et al. [23] augmented a rice disease dataset with a GAN and then used a CNN for classification, achieving 98.23% accuracy. Haruna et al. [24] proposed balancing a rice leaf disease dataset with StyleGAN and achieved 93% accuracy using the fast-RCNN model. Zhao et al. [25] used DoubleGAN to generate images of unhealthy plant leaves to balance the dataset and improve plant disease recognition accuracy. Abbas et al. [27] proposed synthesizing images of tomato plant leaves using CGAN; subsequently, they used transfer learning to train the DenseNet121 model on both synthetic and real images, further improving the accuracy of the DenseNet121 model.

3. Method

3.1. Network Structure

Nonlabel-conditional GANs are not suitable for generating images to improve CNN accuracy because their synthetic images carry no label information. Moreover, some plant disease datasets have too few samples in the minority classes to support GAN training. Even models based on label-conditional GANs, such as CGAN, ACGAN, MFC-GAN, and BAGAN, do not set a corresponding loss function to improve the accuracy of the synthetic images while generating them. In addition, they do not consider the problem of training different CNN structures on synthetic images. In view of the above problems, a classifier is added to the GAN in this paper to enhance the accuracy of the synthetic images according to the classification results of the classifier. The classifier is trained together with the generator and discriminator. The GACN can thus output synthetic images with higher accuracy for a specific CNN structure. The trained classifier can also be directly applied to image classification tasks.

The proposed GACN is shown in Figure 1. It comprises a generator, discriminator, and classifier. The discriminator uses convolutions with a stride of 2 and finally uses nonlinear mapping layers to output a source S and a class label C. The discriminator's structure is shown in Table 1 and is similar to that of other GAN discriminators. Most discriminators use convolution layers with stride 2 to downsample the feature map and gradually increase the number of convolution channels. The dropout operation prevents the generator's synthetic images from becoming too similar due to overtraining of the discriminator. The generator structure is shown in Table 2. Each synthetic image $X_{fake} = \mathrm{Generator}(c, z)$ has a corresponding class label $c$, which is entered into the generator as additional information in the form of one-hot encoding. The mapped latent code $z$ and the additional information are combined and fed into the generator through a concatenation operation. The nonlinear mapping layer maps the latent code to increase its size and reshapes the feature map to $\mathbb{R}^{C \times 4 \times 4}$ before it enters the generator. The generator uses a bicubic interpolation operation to upsample the feature map before each convolution layer. Early GANs used deconvolution as the upsampling operation, which produces grid-like artifacts; interpolation-based upsampling avoids this problem. Notably, the network structure of the generator and discriminator has a limited influence on the results, and the loss function is the key to generating highly accurate images. The proposed model aims to further enhance the contribution of synthetic images to the discriminability of a specific classification network. Therefore, the classifier structure is the same as the specific classification network structure. For example, for synthetic images to enhance ResNet performance, the classifier must have the same structure as ResNet. In this paper, we set the classifier structure to ResNet18 to compare performance with other models.

Figure 1.

Figure 1

Structure of the generative adversarial classified network. (a) Structure of the generator. (b) Structure of the discriminator. (c) The classifier structure is the same as the specific classification network. “z” represents noise. “c” represents the label. “N” represents the number of classes.

Table 1.

Structure of the discriminator. Each convolution is followed by BN (except the first layer), LeakyReLU (slope 0.2), and dropout.

Type Kernel Strides Feature Maps Setting Dropout Nonlinearity
Convolution 4 × 4 2 64 - 0.5 LeakyReLU
Convolution 4 × 4 2 128 BN 0.5 LeakyReLU
Convolution 4 × 4 2 256 BN 0.5 LeakyReLU
Convolution 4 × 4 2 512 BN 0.5 LeakyReLU
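For concreteness, the following is a minimal PyTorch sketch of a discriminator following Table 1. The 128 × 128 input size, the padding of 1, and the two linear heads that output the source S and the class logits C are assumptions not specified in the table.

```python
import torch
import torch.nn as nn

class Discriminator(nn.Module):
    """Discriminator sketch following Table 1 (assumes 128x128 RGB input)."""
    def __init__(self, num_classes: int):
        super().__init__()
        def block(c_in, c_out, bn=True):
            layers = [nn.Conv2d(c_in, c_out, kernel_size=4, stride=2, padding=1)]
            if bn:
                layers.append(nn.BatchNorm2d(c_out))
            layers += [nn.LeakyReLU(0.2, inplace=True), nn.Dropout(0.5)]
            return layers

        self.features = nn.Sequential(
            *block(3, 64, bn=False),   # 128 -> 64
            *block(64, 128),           # 64  -> 32
            *block(128, 256),          # 32  -> 16
            *block(256, 512),          # 16  -> 8
        )
        flat = 512 * 8 * 8
        self.source_head = nn.Linear(flat, 1)           # real/fake logit S
        self.class_head = nn.Linear(flat, num_classes)  # class logits C

    def forward(self, x):
        h = self.features(x).flatten(1)
        return self.source_head(h), self.class_head(h)
```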

Table 2.

Structure of the generator. Feature maps are output feature map numbers. Bilinear is an upsampling mode with a scaling factor of 2 placed before the convolution. LeakyReLU (slope 0.2) is the activation function, placed after the convolution layer.

Type Kernel Feature Maps Setting Nonlinearity
Convolution 3 × 3 512 Bilinear ×2 LeakyReLU
Convolution 3 × 3 256 Bilinear ×2 LeakyReLU
Convolution 3 × 3 128 Bilinear ×2 LeakyReLU
Convolution 3 × 3 64 Bilinear ×2 LeakyReLU
Convolution 3 × 3 32 Bilinear ×2 LeakyReLU
Convolution 3 × 3 3 - Tanh
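Similarly, a minimal PyTorch sketch of the generator described by Table 2 is given below. The latent dimension of 100, the 512 × 4 × 4 shape of the reshaped code, the order of concatenation and mapping, and the use of nn.Upsample in bilinear mode are assumptions based on the text and the table caption.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Generator(nn.Module):
    """Generator sketch following Table 2; outputs 128x128 RGB images."""
    def __init__(self, num_classes: int, latent_dim: int = 100):
        super().__init__()
        self.num_classes = num_classes
        # Nonlinear mapping of [z ; one-hot(c)] to a 4x4 feature map.
        self.mapping = nn.Sequential(
            nn.Linear(latent_dim + num_classes, 512 * 4 * 4),
            nn.LeakyReLU(0.2, inplace=True),
        )
        def up_block(c_in, c_out):
            return nn.Sequential(
                nn.Upsample(scale_factor=2, mode="bilinear", align_corners=False),
                nn.Conv2d(c_in, c_out, kernel_size=3, padding=1),
                nn.LeakyReLU(0.2, inplace=True),
            )
        self.blocks = nn.Sequential(
            up_block(512, 512),   # 4 -> 8
            up_block(512, 256),   # 8 -> 16
            up_block(256, 128),   # 16 -> 32
            up_block(128, 64),    # 32 -> 64
            up_block(64, 32),     # 64 -> 128
            nn.Conv2d(32, 3, kernel_size=3, padding=1),
            nn.Tanh(),
        )

    def forward(self, z, labels):
        # Label as one-hot "additional information", concatenated with the latent code.
        c = F.one_hot(labels, self.num_classes).float()
        h = self.mapping(torch.cat([z, c], dim=1)).view(-1, 512, 4, 4)
        return self.blocks(h)
```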

3.2. Objective Function

We alternately train the discriminator, generator, and classifier. The objective function of the discriminator has two parts: the log-likelihood of the correct class $L_c$ and the log-likelihood of the correct source $L_s$:

$L_c = \mathbb{E}[\log P(C = c \mid X_{real})] + \mathbb{E}[\log P(C = c \mid X_{fake})]$ (1)
$L_s = \mathbb{E}[\log P(S = \mathrm{real} \mid X_{real})] + \mathbb{E}[\log P(S = \mathrm{fake} \mid X_{fake})]$ (2)

The discriminator is trained to minimize $L_s + L_c$, where $X_{real}$ is a real image and $X_{fake}$ is a fake image. Minimizing $L_c$ means that the discriminator must correctly classify the classes of both real and synthetic images. Minimizing $L_s$ means that the discriminator must correctly distinguish real images from synthetic images. The objective function of the generator is to minimize $L_{sf} + L_{cf} + L_{tcr}$:

$L_{sf} = \mathbb{E}[\log P(S = \mathrm{real} \mid X_{fake})]$ (3)
$L_{cf} = \mathbb{E}[\log P(C = c \mid X_{fake})]$ (4)
$L_{tcr} = \mathrm{crossentropy}[T_c(X_{real}), C = c]$ (5)

where $L_{sf}$ corresponds to the discriminator's source prediction for $X_{fake}$, and $L_{cf}$ corresponds to the discriminator's class prediction for $X_{fake}$. $T_c(\cdot)$ is the classifier, and crossentropy is the cross-entropy loss. $L_{tcr}$ measures, via cross entropy, the distance between the classifier's prediction for $X_{real}$ and the real label $c$. Minimizing $L_{sf}$ means that the generator should make the synthetic images more realistic to cheat the discriminator. Minimizing $L_{cf}$ makes the synthetic images more consistent with their corresponding classes. The objective function of the classifier is as follows:

$L_{tcf} = \mathrm{crossentropy}[T_c(X_{fake}), C = c] \times r + \mathrm{crossentropy}[T_c(X_{real}), C = c]$ (6)

The classifier is trained to minimize $L_{tcf}$, where $r$ controls the weight of the fake images in the classifier loss. In this paper, the value of $r$ is 0.2. Minimizing $L_{tcf}$ means that the classifier must correctly classify both real and fake images.
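The objectives in Formulas (1)–(6) can be realized with standard cross-entropy losses; the sketch below is one possible PyTorch mapping (minimizing these cross-entropy terms corresponds to the log-likelihood objectives above). It assumes a discriminator that returns a source logit and class logits, as in the sketches of Section 3.1.

```python
import torch

bce = torch.nn.BCEWithLogitsLoss()   # source (real/fake) term
ce = torch.nn.CrossEntropyLoss()     # class term

def discriminator_loss(d, x_real, y_real, x_fake, y_fake):
    # Eqs. (1)+(2): correct class and correct source for both real and fake images.
    s_real, c_real = d(x_real)
    s_fake, c_fake = d(x_fake.detach())
    l_c = ce(c_real, y_real) + ce(c_fake, y_fake)
    l_s = bce(s_real, torch.ones_like(s_real)) + bce(s_fake, torch.zeros_like(s_fake))
    return l_s + l_c

def generator_loss(d, classifier, x_fake, y_fake, x_real, y_real):
    # Eqs. (3)+(4): fool the discriminator on source and match the target class.
    s_fake, c_fake = d(x_fake)
    l_sf = bce(s_fake, torch.ones_like(s_fake))
    l_cf = ce(c_fake, y_fake)
    # Eq. (5): classifier cross-entropy on *real* images, added to the generator objective.
    l_tcr = ce(classifier(x_real), y_real)
    return l_sf + l_cf + l_tcr

def classifier_loss(classifier, x_fake, y_fake, x_real, y_real, r=0.2):
    # Eq. (6): fake-image term weighted by r plus the real-image term.
    return r * ce(classifier(x_fake.detach()), y_fake) + ce(classifier(x_real), y_real)
```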

In each iteration of the training process, the discriminator is trained first, then the generator, and finally the classifier. In the discriminator training stage, the discriminator is trained to classify the classes of fake and real images and to determine whether each image is real or fake. In the classifier training stage, the classifier is trained to correctly classify fake and real images. Additionally, we set the coefficient $r$, which ensures that the classifier is not overly affected by poor-quality fake images. In the process of training the classifier, the classes of the synthetic images output by the generator are evenly distributed; for example, if the dataset contains 61 classes and the batch size is set to 128, at least two synthetic images are generated for each class. This mitigates the effect of the unbalanced class distribution of the real images on the classifier. Therefore, the trained classifier can be directly applied to the plant disease recognition task. In the generator training stage, the generator synthesizes fake images to deceive the discriminator. In addition, during generator training we also add the loss of the classifier predicting the real image class to the total loss of the generator, so that training on fake images helps the classifier correctly classify real images. In other words, the classifier improves its classification accuracy on real images by training on fake images, which encourages the generator to generate fake images that are highly accurate for the classifier. Using the trained generator to produce synthetic images can then further improve dataset balancing.
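Putting the pieces together, one training iteration could look like the following sketch, which reuses the loss functions above. The balanced label sampler and the update order follow the description in this section, while the function and variable names are illustrative.

```python
import torch

def sample_balanced_labels(num_classes: int, batch_size: int, device):
    # Evenly distributed synthetic-image classes (e.g., 61 classes with batch
    # size 128: every class appears at least twice).
    reps = -(-batch_size // num_classes)              # ceiling division
    labels = torch.arange(num_classes).repeat(reps)[:batch_size]
    return labels[torch.randperm(batch_size)].to(device)

def train_step(g, d, clf, opt_d, opt_g, opt_c, x_real, y_real,
               latent_dim=100, num_classes=61, r=0.2):
    device = x_real.device
    z = torch.randn(x_real.size(0), latent_dim, device=device)
    y_fake = sample_balanced_labels(num_classes, x_real.size(0), device)

    # 1) Discriminator step.
    x_fake = g(z, y_fake)
    opt_d.zero_grad()
    discriminator_loss(d, x_real, y_real, x_fake, y_fake).backward()
    opt_d.step()

    # 2) Generator step (fresh forward pass for a fresh graph).
    x_fake = g(z, y_fake)
    opt_g.zero_grad()
    generator_loss(d, clf, x_fake, y_fake, x_real, y_real).backward()
    opt_g.step()

    # 3) Classifier step, with fake images down-weighted by r.
    opt_c.zero_grad()
    classifier_loss(clf, x_fake, y_fake, x_real, y_real, r).backward()
    opt_c.step()
```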

4. Experiment

4.1. Datasets and Implementation Details

We train the proposed model on the PlantVillage and AI Challenger 2018 datasets. The PlantVillage dataset is a plant disease dataset with 54,306 images in 38 classes, covering 24 types of diseases and 14 types of crops. The types of plants include grape, soybean, blueberry, cherry, orange, peach, bell pepper, potato, raspberry, squash, apple, strawberry, and tomato. The AI Challenger 2018 dataset contains 10 crops and 27 diseases, with a total of 36,379 images divided into 61 classes. Plant diseases include bacterial, mold, viral, and mite diseases. Unlike those in the PlantVillage dataset, the early crop disease images in the AI Challenger 2018 dataset are very similar to the healthy images, so they are more difficult to classify. The images were split into training and test sets at a ratio of 8:2.
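As an illustration, an 8:2 split can be produced as follows; the folder path and the fixed seed are assumptions, and torchvision's ImageFolder is used only as an example of loading a class-per-folder dataset.

```python
import torch
from torchvision import datasets, transforms

# Assumed folder layout: one subdirectory per class (hypothetical path).
tfms = transforms.Compose([transforms.Resize((128, 128)), transforms.ToTensor()])
full = datasets.ImageFolder("data/ai_challenger_2018", transform=tfms)

n_train = int(0.8 * len(full))                       # 8:2 split
train_set, test_set = torch.utils.data.random_split(
    full, [n_train, len(full) - n_train],
    generator=torch.Generator().manual_seed(0))      # fixed seed for reproducibility
```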

The Adam optimizer was used for the generator and the discriminator, with beta1 and beta2 set to 0.5 and 0.999, respectively. The classifier also uses Adam as the optimizer, with a weight decay of 1 × 10−4. The batch size is 128, and the number of epochs is set to 300. The learning rates of the generator, discriminator, and classifier networks were 0.0001, 0.0004, and 0.0004, respectively. All experiments were performed with Python 3.8.8, PyTorch 1.10.2, and CUDA 10.2. Ubuntu 18.04 with an NVIDIA RTX 2080 Ti GPU was used to train and test the proposed model.
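These settings map directly onto PyTorch optimizers; the sketch below assumes that the generator g, discriminator d, and classifier clf have already been constructed.

```python
import torch

# Hyperparameters from Section 4.1 (g, d, clf are assumed to be defined models).
opt_g = torch.optim.Adam(g.parameters(), lr=1e-4, betas=(0.5, 0.999))
opt_d = torch.optim.Adam(d.parameters(), lr=4e-4, betas=(0.5, 0.999))
opt_c = torch.optim.Adam(clf.parameters(), lr=4e-4, weight_decay=1e-4)

batch_size, epochs = 128, 300
```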

4.2. Evaluation Metrics

We evaluate the proposed model based on the following metrics: accuracy, PIQE [34], NIQE [35], and Inception Score [36].

$\mathrm{Accuracy} = \dfrac{TN + TP}{TN + TP + FN + FP}$ (7)

A true negative (TN) is a sample predicted to be negative whose actual class is negative. A true positive (TP) is a sample predicted to be positive whose actual class is positive. A false negative (FN) is a sample predicted to be negative whose actual class is positive. A false positive (FP) is a sample predicted to be positive whose actual class is negative. Accuracy is the proportion of predictions that match the actual results. By imitating human visual behavior, PIQE and NIQE evaluate the perceptually important areas of an image and are used to assess image quality; high PIQE and NIQE scores indicate low image quality. The Inception Score is used to evaluate image quality and is commonly used to measure GAN performance; the higher the Inception Score, the better the image quality generated by the GAN. The implementation details of PIQE, NIQE, and the Inception Score can be found in the cited papers.
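For the multiclass evaluation used here, Formula (7) reduces to the fraction of correctly predicted samples, for example:

```python
import torch

def accuracy(logits: torch.Tensor, targets: torch.Tensor) -> float:
    # Fraction of samples whose predicted class matches the actual class
    # (this is Formula (7) in the two-class case).
    preds = logits.argmax(dim=1)
    return (preds == targets).float().mean().item()
```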

4.3. Plant Disease Recognition Performance

4.3.1. Comparison of Plant Disease Recognition Accuracy

We compared the trained classifier of the proposed method with several other plant disease recognition models. These plant disease recognition models are all methods studied on public datasets. The accuracy of the above models was obtained on the PlantVillage and AI Challenger 2018 test sets. As shown in Table 3, early plant disease recognition models [37,38,39] used universal recognition networks to classify plant diseases and did not design network structures according to plant disease characteristics. Refs. [40,41] improved the VGG model and reduced the number of parameters, improving performance compared with the standard VGG network; the model of [41] has only 6 M parameters and is therefore suitable for running on mobile devices. Refs. [42,43] used attention mechanisms to enhance plant disease recognition accuracy, with the model of [42] requiring only 0.7 M parameters. Owing to the effective image recognition performance of ResNet18, ref. [44] achieved satisfactory accuracy after adding an attention module. Ref. [45], with only 4 M parameters, was specifically designed for plant disease recognition tasks. The classifier of the proposed model has the same structure as ResNet18, and its performance is better than that of the ResNet18 variant proposed by [42]. The classifier of the proposed model is trained on both real and synthetic images, and its accuracy on the plant disease recognition task is better than that of these existing works.

Table 3.

Comparison with existing plant disease recognition models on the AI Challenger 2018 and PlantVillage test sets. The classifier structure of the proposed method is the same as that of ResNet18.

Study Year Network Param AI Challenger 2018 PlantVillage
Ferentinos [37] 2018 VGG 138 M 82.71% 97.67%
Too et al. [38] 2019 DenseNet 8 M 84.56% 98.02%
Kamal et al. [39] 2019 MobileNet variant 0.5 M 83.78% 97.45%
Chen et al. [40] 2020 VGG19 variant 41 M 83.12% 98.74%
Ramamurthy et al. [41] 2020 CNN+attention 0.7 M 76.53% 96.21%
Ronghua Gao et al. [42] 2021 ResNet18 variant +attention 51 M 85.73% 99.25%
Zhao et al. [43] 2022 CNN+attention 59 M 84.23% 98.54%
Li et al. [44] 2023 CNN 4 M 85.56% 98.61%
Singh Thakur et al. [45] 2023 VGG variant 6 M 85.89% 98.75%
Our 2023 GACN (Classifier ResNet18) 11 M 86.52% 99.78%

To prove that the GACN can improve the accuracy of the plant disease recognition task, we set up several experiments to compare the accuracy of the original ResNet with that of the GACN classifier. We compare the classifier performance with ResNet18, ResNet34, ResNet50, and ResNet101. As shown in Table 4, ResNet18 achieved 84.75% accuracy on the AI Challenger 2018 dataset. After the classifier structure in the GACN was set to ResNet18, the accuracy of ResNet18 improved by 1.77% after training with real and synthetic images. When the GACN classifier is ResNet50, the accuracy is 0.49% higher than that of ResNet50 on the PlantVillage dataset. The experimental results for ResNet18, ResNet34, ResNet50, and ResNet101 show that ResNet accuracy can be improved by training on real images together with synthetic images produced by the proposed method. The trained GACN classifier can be directly applied to plant disease recognition tasks, and the number of parameters does not increase compared with the original ResNet network.

Table 4.

Performance comparison between the classifiers of GACN and ResNet on the AI Challenger 2018 and PlantVillage test sets.

Method Setting Param AI Challenger 2018 PlantVillage
ResNet18 - 11 M 84.75% 98.74%
GACN Classifier ResNet18 11 M 86.52% 99.78%
ResNet34 - 21 M 84.87% 98.81%
GACN Classifier ResNet34 21 M 86.12% 99.26%
ResNet50 - 23 M 84.92% 98.92%
GACN Classifier ResNet50 23 M 86.64% 99.41%
ResNet101 - 42 M 85.04% 99.02%
GACN Classifier ResNet101 42 M 86.36% 99.31%

We carried out a five-fold cross-validation experiment to verify the accuracy of the classifier for plant disease recognition, as shown in Table 5. The five-fold cross-validation experiment divides the dataset into five blocks, one of which is used for testing and the rest for training. When k is 1, the first block is used for testing. The maximum block accuracies on AI Challenger 2018 and PlantVillage are 86.52% and 99.78%, respectively. The average accuracies on AI Challenger 2018 and PlantVillage are 86.37% and 99.63%, respectively.

Table 5.

Accuracy of the five-fold cross-validation experiment on the AI Challenger 2018 and PlantVillage datasets.

Datasets k = 1 k = 2 k = 3 k = 4 k = 5 Average
AI Challenger 2018 86.52% 86.25% 86.46% 86.44% 86.16% 86.37%
PlantVillage 99.78% 99.72% 99.65% 99.43% 99.56% 99.63%
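The five-fold protocol of Table 5 can be sketched as follows; all_paths, all_labels, train_gacn_classifier, and evaluate are placeholder names for the dataset arrays and the training and evaluation routines, which are not specified here.

```python
import numpy as np
from sklearn.model_selection import KFold

paths = np.array(all_paths)    # placeholder: file paths of the whole dataset
labels = np.array(all_labels)  # placeholder: integer class labels

accs = []
for k, (train_idx, test_idx) in enumerate(
        KFold(n_splits=5, shuffle=True, random_state=0).split(paths), start=1):
    model = train_gacn_classifier(paths[train_idx], labels[train_idx])  # placeholder routine
    accs.append(evaluate(model, paths[test_idx], labels[test_idx]))     # placeholder routine
    print(f"k = {k}: accuracy = {accs[-1]:.2%}")
print(f"average accuracy = {np.mean(accs):.2%}")
```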

4.3.2. Accuracy Curve during Training

Figure 2 shows the accuracy curves of the GACN classifier and the original ResNet18 on the AI Challenger 2018 and PlantVillage test sets. We can observe that the original ResNet18 is more accurate than the GACN classifier before epoch 100 on the PlantVillage dataset; however, the GACN classifier becomes more accurate than the original ResNet18 after epoch 150. On the AI Challenger 2018 test set, the GACN classifier becomes more accurate than the original ResNet18 after epoch 180. This shows that as the accuracy of the synthetic images improved during GACN training, the performance of ResNet18 benefitted from the synthetic images. Therefore, the trained classifier in the GACN can be directly applied to plant disease recognition tasks.

Figure 2.

Figure 2

The accuracy curve of the classifier during training is compared with the original ResNet18. The classifier structure in GACN is the same as that of ResNet18. (a) Accuracy curve on the AI Challenger 2018 test set. (b) Accuracy curve on the PlantVillage test set.

4.3.3. Influence of False Image Weights on the Results during Training

Figure 3 shows the effect of different values of r in the classifier on the results. This set of experimental results comes from a classifier trained on the AI Challenger 2018 and PlantVillage datasets; the classifier structure is the same as ResNet18. When r is 1, the synthetic and real images contribute equally to classifier training. However, the synthetic images do not yet have enough accuracy to replace the real images, so the accuracy is lower than that of ResNet18 trained with only real images. As the value of r decreased, the synthetic images contributed less to classifier training, and the accuracy on the AI Challenger 2018 test set improved. When the value of r decreased to 0.4, the classifier accuracy became better than that of the original ResNet18 network. The experimental results show that the classifier achieves the best accuracy when r is set to 0.2. This proves that training with synthetic and real images can enhance the accuracy of convolutional neural networks.

Figure 3.

Figure 3

The effect of r values on results in the classifier loss function. (a) On the AI Challenger 2018 test set. (b) On the PlantVillage test set.

4.4. Performance of Synthetic Images

4.4.1. Performance of the Synthetic Image among Different GANs

We designed experiments to compare the accuracy of synthetic images from different GANs. Our comparison includes only label-conditional GANs because plant disease datasets are typically small, and the number of images per class is insufficient to support training nonlabel-conditional GAN methods. In Table 6, we present information on the two plant disease datasets. In the PlantVillage training set, the smallest class contains 152 images and the largest contains 5507; the median is 1403, and the mean is 1150. In the AI Challenger 2018 training set, the smallest class contains only 1 image and the largest contains 2221; the median is 343, and the mean is 754.

Table 6.

The information in the dataset includes the number of classes and the distribution statistics for each class of images.

Dataset Classes Min Mean Median Max
PlantVillage 38 152 1150 1403 5507
AI Challenger 2018 61 1 754 343 2221
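The statistics in Table 6 can be computed directly from the training labels; a small sketch, where labels is a placeholder list of the class labels of all training images:

```python
import numpy as np
from collections import Counter

# `labels` is a placeholder list of the class labels of all training images.
counts = np.array(list(Counter(labels).values()))
print(f"classes: {len(counts)}  min: {counts.min()}  mean: {counts.mean():.0f}  "
      f"median: {np.median(counts):.0f}  max: {counts.max()}")
```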

In the best case, synthetic images can be used as a substitute for real images. If a GAN model can synthesize images with sufficient accuracy, it can significantly enhance classification accuracy on class-imbalanced datasets. Therefore, the goal of label-conditional GANs is not only to synthesize conditional images but also to synthesize conditional images that are as accurate as possible. The synthetic images of the different GANs are shown in Figure 4.

Figure 4.

Figure 4

Synthetic images of different GANs. (a) On the PlantVillage dataset. (b) On the AI Challenger 2018 dataset.

This set of experiments was designed to verify the accuracy of synthetic images from different GANs. We use the synthetic images produced by different GANs as training images to train ResNet18 and evaluate the trained ResNet18 on the real-image test set. In this set of experiments, we studied the effect of different numbers of synthetic images on the accuracy on the PlantVillage and AI Challenger 2018 test sets. As shown in Table 7, regardless of how many images are synthesized by CGAN, the accuracy of its synthetic images is always approximately 5% on the PlantVillage dataset. The ACGAN synthetic images reached their maximum accuracy of 34.9% with 38,000 images. The accuracy of the MFC-GAN synthetic images is approximately 5% higher than that of ACGAN. The proposed model achieves the highest synthetic image accuracy. The highest accuracy from training on synthetic images was achieved with 38,000 images, and more synthetic images did not further improve the classification accuracy.

Table 7.

Accuracy comparison for different numbers of synthetic images among different GANs. Each class has the same number of synthetic images. ResNet18 was trained on the synthetic images, and the classification accuracy was calculated on the PlantVillage test set. The synthetic image size is 128 × 128. The classifier structure of the GACN is the same as that of ResNet18.

Amount CGAN ControlGAN ACGAN BAGAN MFC-GAN Our
3800 5.2% 7.1% 11.1% 12.6% 15.4% 23.5%
19,000 5.8% 6.8% 33.2% 32.5% 38.2% 43.3%
38,000 5.4% 7.6% 34.9% 35.2% 39.6% 44.2%
76,000 5.5% 7.2% 32.1% 32.5% 39.8% 42.2%

Because the AI Challenger 2018 dataset has more classes, the class imbalance problem is more serious, which makes it more challenging for GANs to generate images for the minority classes. As shown in Table 8, the accuracy of the CGAN synthetic images is only approximately 2%. ACGAN reached an accuracy of 23.5% on the test set when generating 1000 images per class. The accuracy of BAGAN's synthetic images is similar to that of ACGAN. The accuracy of the MFC-GAN synthetic images is approximately 3% higher than that of ACGAN. The proposed model achieved an accuracy of 32.4% when synthesizing 61,000 images, which is the highest among all the compared GAN models. This set of experiments shows that synthetic images have the potential to replace real images as the accuracy of synthetic images continues to improve.

Table 8.

Accuracy comparison for different numbers of synthetic images among different GANs. Each class has the same number of synthetic images. ResNet18 was trained on the synthetic images, and the classification accuracy was calculated on the AI Challenger 2018 test set. The synthetic image size is 128 × 128. The classifier structure of the GACN is the same as that of ResNet18.

Amount CGAN ControlGAN ACGAN BAGAN MFC-GAN Our
15,250 2.3% 3.2% 9.4% 10.2% 12.5% 16.6%
30,500 2.1% 3.6% 19.2% 19.6% 20.2% 28.5%
61,000 1.9% 3.3% 23.5% 22.5% 26.1% 32.4%
91,500 2.2% 2.9% 22.1% 21.6% 24.7% 31.2%

As shown in Table 9, we evaluated the different GANs with the PIQE, NIQE, and Inception Score metrics. The table gives the average PIQE, NIQE, and Inception Score of the images on the PlantVillage dataset. All GAN PIQE scores are lower than those of the real images, although from a human visual perspective the synthetic images clearly cannot be as realistic as real images. CGAN has the highest NIQE score, indicating the worst realism from a human visual perspective. The PIQE and NIQE scores of the proposed model are the closest to those of the real images, indicating that synthetic images from the GACN are more realistic than those from the other GANs. The Inception Scores of the GANs are higher than that of the real images, indicating that the Inception Score is independent of the accuracy of the synthetic images. This set of experiments shows that the closer the PIQE and NIQE scores of the synthetic images are to those of the real images, the higher the accuracy of the synthetic images.

Table 9.

Average PIQE, NIQE, and Inception Score comparison of synthetic images among different GANs. There are 1000 synthetic images for each class on the PlantVillage dataset. The synthetic image size is 128 × 128. The classifier structure of the GACN is the same as that of ResNet18.

Metrics Real Images CGAN ControlGAN ACGAN BAGAN MFC-GAN Our
PIQE 16.42 3.47 4.67 5.51 5.34 6.87 7.62
NIQE 17.84 88.57 56.89 42.27 45.64 38.56 31.15
Inception Score 2.78 3.46 3.78 3.26 3.51 3.56 3.05

4.4.2. Effect of Images Synthesized by Different GANs on Dataset Balancing

This experiment is set up so that when a class has fewer images than a specified number, the class is supplemented with GAN synthetic images up to that number. The experiment uses ResNet18 trained on the balanced training set and tested on the real-image test set to verify the performance of the datasets balanced by the different GANs. As shown in Table 10, experiments on the PlantVillage dataset show that the CNN achieves the best performance when each class contains at least 1000 images. Replenishing each class to 2000 images negatively affected the CNN performance. This is because the GANs are not yet able to synthesize images that are accurate enough to replace the real images. MFC-GAN can improve the CNN performance by 0.4%. Balancing the training set with synthetic images from the proposed model improved the accuracy by 0.6%. However, CGAN, ControlGAN, ACGAN, and BAGAN do not improve CNN performance because their synthetic samples do not provide sufficient discriminative information. The experimental results show that using a GAN to supplement the minority classes with synthetic images can further enhance CNN performance.

Table 10.

Comparison results of different GANs on balancing the PlantVillage dataset. The number of images for each class in the training set is supplemented with synthetic images up to the specified number. ResNet18 was trained on the balanced training set, and the classification accuracy was calculated on the PlantVillage test set. The synthetic image size is 128 × 128. The classifier structure of the proposed model is the same as that of ResNet18.

Amount CGAN ControlGAN ACGAN BAGAN MFC-GAN Our
500 96.8% 97.2% 97.3% 97.1% 97.2% 97.1%
1000 97.1% 96.7% 98.1% 98.2% 99.1% 99.3%
1500 96.9% 97.1% 97.2% 97.8% 98.1% 98.5%
2000 96.7% 96.6% 96.2% 97.3% 96.8% 96.9%
Real images 98.7%
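As a concrete illustration of the balancing procedure used in these experiments, the sketch below generates GACN synthetic images for every class that falls short of a target count. The generator interface follows the sketch in Section 3.1, and the function and argument names are illustrative.

```python
import torch
from collections import Counter

@torch.no_grad()
def supplement_minority_classes(generator, labels, target_count,
                                latent_dim=100, device="cpu"):
    """Generate synthetic images for every class with fewer than `target_count`
    real images so that each class reaches the target (sketch; for large gaps
    the generation should be done in smaller batches)."""
    counts = Counter(labels)
    extra_images, extra_labels = [], []
    for cls, n in counts.items():
        missing = target_count - n
        if missing <= 0:
            continue
        z = torch.randn(missing, latent_dim, device=device)
        y = torch.full((missing,), cls, dtype=torch.long, device=device)
        extra_images.append(generator(z, y).cpu())
        extra_labels.append(y.cpu())
    if not extra_images:
        return None, None
    return torch.cat(extra_images), torch.cat(extra_labels)
```

The returned tensors would then be concatenated with the real training images before ResNet18 is trained, as in the experiments of Tables 10 and 11.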

The class imbalance problem in the AI Challenger 2018 dataset is more serious. When CGAN synthetic images are added to the training set, the accuracy of the ResNet18 network decreases significantly. As shown in Table 11, the synthetic images of CGAN and ControlGAN cannot yet contribute sufficient accuracy to the training set. The accuracy of the ACGAN synthetic images is higher than that of CGAN and ControlGAN, so ACGAN performs better than CGAN and ControlGAN in the dataset balancing task. The synthetic image accuracy of MFC-GAN is better than that of BAGAN, ACGAN, CGAN, and ControlGAN, allowing ResNet18 to achieve its maximum performance when the minority classes are supplemented up to 750 images. The accuracy of the proposed model reaches 86.3% when the classes are supplemented to 750 images. After balancing the training set, the accuracy of ACGAN, MFC-GAN, and the proposed model exceeds that obtained with the original training set. We can observe that the proposed model performs better than the other label-conditional GANs in balancing the dataset. The reason for this result is that the accuracy of the synthetic images produced by the proposed model is better than that of the other GANs, as demonstrated in the previous experiments. This shows that when synthetic images have high accuracy, they can be used as additional training images to supplement the real-image dataset.

Table 11.

Comparison results of different GANs on balancing the AI Challenger 2018 dataset. The number of images for each class in the training set is supplemented with synthetic images up to the specified number. ResNet18 was trained on the balanced training set, and the classification accuracy was calculated on the AI Challenger 2018 test set. The synthetic image size is 128 × 128. The classifier structure of the proposed model is the same as that of ResNet18.

Amount CGAN ControlGAN ACGAN BAGAN MFC-GAN Our
500 83.2% 84.4% 84.6% 84.1% 85.5% 85.8%
750 83.9% 83.2% 85.4% 84.2% 85.8% 86.3%
1000 83.5% 83.6% 85.1% 84.5% 85.2% 85.1%
1500 81.8% 82.5% 83.7% 83.6% 83.3% 83.8%
Real images 84.7%

We also conducted a five-fold cross-validation experiment on the task of balancing datasets, as shown in Table 12. The experiment divides the dataset into five blocks, one of which is used for testing and the rest for training. When k = 1, the test set is the first block. Once the dataset was divided, we used GACN synthetic images to supplement the number of images for each class in the AI Challenger 2018 and PlantVillage training sets to 750 and 1000, respectively. We trained ResNet18 on the balanced training sets and verified its accuracy on the real-image test set. The best results for the dataset balancing task were 86.3% and 99.3% on the two datasets, respectively.

Table 12.

Accuracy of the five-fold cross-validation experiment on the balanced AI Challenger 2018 and PlantVillage datasets. ResNet18 was trained on the balanced training sets. The synthetic image size is 128 × 128. The classifier structure of the proposed model is the same as that of ResNet18.

Dataset k = 1 k = 2 k = 3 k = 4 k = 5 Average
AI Challenger 2018 86.3% 85.9% 86.1% 85.8% 86.2% 86.1%
PlantVillage 99.3% 99.1% 98.9% 98.7% 99.2% 99%

4.4.3. Influence of Synthetic Image Resolution on the Accuracy

As shown in Table 13 and Table 14, we study the effect of the resolution of the synthetic images on discriminability. ResNet18 is trained on synthetic images generated by the different GANs, and its accuracy is verified on the real-image test set. The experimental results show little difference in the accuracy of CGAN at 64 × 64 and 128 × 128. The accuracy of the 128 × 128 ACGAN samples is slightly higher than that of the 64 × 64 samples. The 64 × 64 synthetic images of BAGAN and MFC-GAN are approximately 2% less accurate than their 128 × 128 synthetic images. The proposed model also achieves its best accuracy at 128 × 128. The accuracy of 256 × 256 synthetic images could not be verified because all models failed to generate meaningful 256 × 256 synthetic images with the current structures and loss functions; this is caused by insufficient generator parameters. Therefore, the objective of conditional image synthesis is to find the sample resolution that best improves the classification ability of the CNN.

Table 13.

Accuracy comparison for different resolutions of synthetic images among different GANs. Each class has 1000 synthetic images. ResNet18 was trained on the synthetic images, and the accuracy was calculated on the PlantVillage test set.

Resolution CGAN ControlGAN ACGAN BAGAN MFC-GAN Our
64 × 64 5.7% 6.5% 29.7% 30.1% 38.2% 43.5%
128 × 128 5.4% 7.6% 34.9% 35.2% 39.6% 44.2%
Table 14.

Accuracy comparison for different resolutions of synthetic images among different GANs. Each class has 1000 synthetic images. ResNet18 was trained on the synthetic images, and the accuracy was calculated on the AI Challenger 2018 test set.

Resolution CGAN ControlGAN ACGAN BAGAN MFC-GAN Our
64 × 64 2.3% 2.9% 21.4% 21.2% 24.9% 30.3%
128 × 128 1.9% 3.3% 23.5% 22.5% 26.1% 32.4%

4.4.4. Ablation Experiment

This set of experiments validates the effect of the different components of the GACN on the results. We trained ResNet18 on the synthetic images generated by the different variants and tested it on the real-image test set. The number of synthetic images for each class is 1000. As shown in Table 15, when we applied the generator and discriminator structure of ACGAN in the proposed method, the accuracy decreased by approximately 1%. When we set the classifier structure to the VGG16 network structure, the accuracy of the trained ResNet18 on the real-image test set decreased by approximately 9%. When the loss function $L_c$ of the discriminator is removed, the accuracy decreases significantly; this variant is similar to ControlGAN, which explains the low accuracy of the ControlGAN synthetic images. The variant without the classifier is similar to ACGAN and BAGAN, and accordingly the accuracy of its synthetic images is consistent with those of ACGAN and BAGAN. When the GACN removes both the classifier and the loss function $L_c$ of the discriminator, the variant is similar to CGAN, and its accuracy also decreases greatly. This shows that the important part of the GACN is the loss function rather than the generator and discriminator structure. Adding the loss function $L_{tcr}$ (Formula (5)) to the generator encourages it to generate more accurate images. In addition, the classifier structure should be the same as the specific classification network structure to improve the recognition accuracy of that network.

Table 15.

The effect of different components of the proposed model on the results. Each class has 1000 synthetic images. ResNet18 was trained on the synthetic images, and the accuracy was calculated on the PlantVillage and AI Challenger 2018 test sets. Lc is Formula (1).

Setting PlantVillage AI Challenger 2018
GACN 44.2% 32.4%
ACGAN structure 43.5% 31.3%
classifier structure VGG16 35.1% 24.7%
w/o Lc 19.1% 11.5%
w/o classifier 35.7% 23.2%
w/o classifier and Lc 5.6% 2.7%

5. Conclusions

In this paper, we propose a generative adversarial classified network to synthesize images. The proposed GACN model consists of a generator, discriminator, and classifier. The proposed model can either recognize plant diseases directly or generate plant disease images to balance the dataset. The trained classifier can be directly applied to plant disease recognition tasks, and its accuracy is better than that of existing plant disease recognition models studied on public datasets. The recognition accuracy of the trained classifier on the PlantVillage and AI Challenger 2018 datasets is 99.78% and 86.52%, respectively. To prove that the proposed method can further improve the discriminability of the classification network, we compare it with existing label-conditional GANs. The comparison results show that the proposed model is significantly superior to the other label-conditional GANs in the accuracy of synthetic images. On the PlantVillage and AI Challenger 2018 datasets, the synthetic image accuracies of the proposed method are 44.2% and 32.4%, respectively. The proposed model also outperforms the other compared GANs in the dataset balancing task. Experiments have shown that the higher the accuracy of the synthetic images, the better their performance in dataset balancing tasks. In addition, the effects of the number and resolution of the synthetic images on the discriminability of the classification network were verified through several sets of experiments. Unfortunately, the proposed method cannot effectively generate 256 × 256 synthetic images, so it is impossible to determine whether 256 × 256 synthetic images could further improve the discriminability of CNNs. In future work, researchers can refer to BigGAN [46] and increase the number of convolution channels of the generator to generate high-resolution synthetic images, which may further improve the accuracy of synthetic images. As the accuracy of synthetic images continues to improve, synthetic images have the potential to replace the real-image training set.

Author Contributions

Conceptualization, X.W.; methodology, X.W.; project administration, W.C.; software, X.W.; validation, X.W. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The AI Challenger 2018 and PlantVillage datasets are open-source datasets.

Conflicts of Interest

The authors declare no conflict of interest.

Funding Statement

The research was supported by the Natural Science Foundation of China under Grant No. 61703046.

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Brahimi M., Boukhalfa K., Moussaoui A. Deep learning for tomato diseases: Classification and symptoms visualization. Appl. Artif. Intell. 2017;31:299–315. doi: 10.1080/08839514.2017.1315516. [DOI] [Google Scholar]
  • 2.Fuentes A., Yoon S., Kim S.C., Park D.S. A robust deep-learningbased detector for real-time tomato plant diseases and pests recognition. Sensors. 2017;17:2022. doi: 10.3390/s17092022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ma J., Du K., Zheng F., Zhang L., Gong Z., Sun Z. A recognition method for cucumber diseases using leaf symptom images based on deep convolutional neural network. Comput. Electron. Agric. 2018;154:18–24. doi: 10.1016/j.compag.2018.08.048. [DOI] [Google Scholar]
  • 4.Bhattacharya S., Mukherjee A., Phadikar S. The First Doctoral Symposium on Intelligence Enabled Research. Springer; Singapore: 2019. A deep learning approach for the classification of rice leaf diseases; pp. 61–69. [Google Scholar]
  • 5.Huang S., Liu W., Qi F., Yang K. Development and validation of a deep learning algorithm for the recognition of plant disease; Proceedings of the 2019 IEEE 21st International Conference on High Performance Computing and Communications; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems (HPCC/SmartCity/DSS); Zhangjiajie, China. 10–12 August 2019; pp. 1951–1957. [Google Scholar]
  • 6.Mishra S., Sachan R., Rajpal D. Deep convolutional neural network based detection system for real-time corn plant disease recognition. Procedia Comput. Sci. 2020;167:2003–2010. doi: 10.1016/j.procs.2020.03.236. [DOI] [Google Scholar]
  • 7.Wang D., Wang J., Li W., Guan P. T-CNN: Trilinear convolutional neural networks model for visual detection of plant diseases. Comput. Electron. Agric. 2021;190:106468. [Google Scholar]
  • 8.Chen Z., Wu R., Lin Y., Li C., Chen S., Yuan Z., Chen S., Zou X. Plant disease recognition model based on improved YOLOv5. Agronomy. 2022;12:365. doi: 10.3390/agronomy12020365. [DOI] [Google Scholar]
  • 9.Fang S., Yuan Y., Chen L., Zhang J., Li M., Song S. ICIG 2017: Image and Graphics. Springer; Cham, Switzerland: 2017. Crop disease image recognition based on transfer learning; pp. 545–554. [Google Scholar]
  • 10.Wang J.L., Zhang J., Yuan Y., Li M., Zeng W. Image and Graphics Technologies and Applications, Proceedings of the 13th Conference on Image and Graphics Technologies and Applications, IGTA 2018, Beijing, China, 8–10 April 2018. Springer; Singapore: 2018. CNN transfer learning for automatic image-based classification of crop disease. Revised Selected Papers 13. [Google Scholar]
  • 11.Zhang K., Zhang L., Wu Q. Recognition of cherry leaf disease infected by Podosphaera pannosa via convolutional neural network. Int. J. Agric. Environ. Inf. Syst. (IJAEIS) 2019;10:98–110. doi: 10.4018/IJAEIS.2019040105. [DOI] [Google Scholar]
  • 12.Shradha V., Chug A., Singh A.P. Recent Advances on Soft Computing and Data Mining, Proceedings of the Fourth International Conference on Soft Computing and Data Mining (SCDM 2020), Melaka, Malaysia, 22–23 January 2020. Springer International Publishing; Berlin/Heidelberg, Germany: 2020. Impact of hyperparameter tuning on deep learning based estimation of disease severity in grape plant. [Google Scholar]
  • 13.Chen J., Zhang D., Suzauddola M., Nanehkaran Y.A., Sun Y. Recognition of plant disease images via a squeeze-and-excitation MobileNet model and twice transfer learning. IET Image Process. 2021;15:1115–1127. doi: 10.1049/ipr2.12090. [DOI] [Google Scholar]
  • 14.Vallabhajosyula S., Sistla V., Kolli V.K.K. Transfer learning-based deep ensemble neural network for plant disease detection. J. Plant Dis. Prot. 2022;129:545–558. doi: 10.1007/s41348-021-00465-8. [DOI] [Google Scholar]
  • 15.Goodfellow I., Pouget-Abadie J., Mirza M., Xu B., Warde-Farley D., Ozair S., Courville A., Bengio Y. Advances in Neural Information Processing Systems. MIT Press; Cambridge, MI, USA: 2014. Generative adversarial nets; pp. 2672–2680. [Google Scholar]
  • 16.Frid-Adar M., Klang E., Amitai M., Goldberger J., Greenspan H. Synthetic data augmentation using GAN for improved liver lesion classification. arXiv. 2018; arXiv:1801.02385. [Google Scholar]
  • 17.Zhu X., Liu Y., Qin Z. Data augmentation in classification using GAN. arXiv. 2017; arXiv:1711.00648. [Google Scholar]
  • 18.Ali-Gombe A., Elyan E., Jayne C. Fish classification in context of noisy images; Proceedings of the International Conference on Engineering Applications of Neural Networks; Athens, Greece. 25–27 August 2017. [Google Scholar]
  • 19.Krizhevsky A., Sutskever I., Hinton G.E. Advances in Neural Information Processing Systems, Pereira, F., Burges, C.J.C., Bottou, L., Weinberger, K.Q., Eds. Curran Associates, Inc.; Red Hook, NY, USA: 2012. Imagenet classification with deep convolutional neural networks; pp. 1097–1105. [Google Scholar]
  • 20.Inoue H. Data augmentation by pairing samples for images classification. arXiv. 2018; arXiv:1801.02929. [Google Scholar]
  • 21.Bird J.J., Barnes C.M., Manso L.J., Ekárt A., Faria D.R. Fruit quality and defect image classification with conditional GAN data augmentation. Sci. Hortic. 2022;293:110684. doi: 10.1016/j.scienta.2021.110684. [DOI] [Google Scholar]
  • 22.Zhou C., Zhang Z., Zhou S., Xing J., Wu Q., Song J. Grape leaf spot recognition under limited samples by fine grained-GAN. IEEE Access. 2021;9:100480–100489. doi: 10.1109/ACCESS.2021.3097050. [DOI] [Google Scholar]
  • 23.Lamba S., Baliyan A., Kukreja V. GAN based image augmentation for increased CNN performance in Paddy leaf disease classification; Proceedings of the 2022 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE); Greater Noida, India. 28–29 April 2022. [Google Scholar]
  • 24.Haruna Y., Qin S., Kiki M.J.M. An improved approach to detection of rice leaf disease with gan-based data augmentation pipeline. Appl. Sci. 2023;13:1346. doi: 10.3390/app13031346. [DOI] [Google Scholar]
  • 25.Zhao Y., Chen Z., Gao X., Song W., Xiong Q., Hu J., Zhang Z. Plant disease detection using generated leaves based on DoubleGAN. IEEE/ACM Trans. Comput. Biol. Bioinform. 2021;19:1817–1826. doi: 10.1109/TCBB.2021.3056683. [DOI] [PubMed] [Google Scholar]
  • 26.Bi L., Hu G. Improving image-based plant disease classification with generative adversarial network under limited training set. Front. Plant Sci. 2020;11:583438. doi: 10.3389/fpls.2020.583438. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Abbas A., Jain S., Gour M., Vankudothu S. Tomato plant disease detection using transfer learning with C-GAN synthetic images. Comput. Electron. Agric. 2021;187:106279. doi: 10.1016/j.compag.2021.106279. [DOI] [Google Scholar]
  • 28.Hughes D.P., Salathe M. An open access repository of images on plant health to enable the development of mobile disease diagnostics. arXiv. 2015; arXiv:1511.08060. [Google Scholar]
  • 29.Mirza M., Osindero S. Conditional generative adversarial nets. arXiv. 2014; arXiv:1411.1784. [Google Scholar]
  • 30.Odena A., Olah C., Shlens J. Conditional image synthesis with auxiliary classifier gans; Proceedings of the International Conference on Machine Learning, PMLR; Sydney, Australia. 6–11 August 2017. [Google Scholar]
  • 31.Ali-Gombe A., Elyan E. MFC-GAN: Class-imbalanced dataset classification using multiple fake class generative adversarial network. Neurocomputing. 2019;361:212–221. doi: 10.1016/j.neucom.2019.06.043. [DOI] [Google Scholar]
  • 32.Mariani G., Scheidegger F., Istrate R., Bekas C., Malossi C. Bagan: Data augmentation with balancing gan. arXiv. 2018; arXiv:1803.09655. [Google Scholar]
  • 33.Lee M., Seok J. Controllable generative adversarial network. IEEE Access. 2019;7:28158–28169. doi: 10.1109/ACCESS.2019.2899108. [DOI] [Google Scholar]
  • 34.Venkatanath N., Praneeth D., Bh M.C., Channappayya S.S., Medasani S.S. Blind image quality evaluation using perception based features; Proceedings of the 2015 Twenty First National Conference on Communications (NCC); Mumbai, India. 27 February–1 March 2015; pp. 1–6. [Google Scholar]
  • 35.Mittal A., Soundararajan R., Bovik A.C. Making a “completely blind” image quality analyzer. IEEE Signal Process. Lett. 2012;20:209–212. doi: 10.1109/LSP.2012.2227726. [DOI] [Google Scholar]
  • 36.Barratt S., Sharma R. A note on the inception score. arXiv. 2018; arXiv:1801.01973. [Google Scholar]
  • 37.Ferentinos K.P. Deep learning models for plant disease detection and diagnosis. Comput. Electron. Agric. 2018;145:311–318. doi: 10.1016/j.compag.2018.01.009. [DOI] [Google Scholar]
  • 38.Too E.C., Yujian L., Njuki S., Yingchun L. A comparative study of fine-tuning deep learning models for plant disease recognition. Comput. Electron. Agric. 2019;161:272–279. doi: 10.1016/j.compag.2018.03.032. [DOI] [Google Scholar]
  • 39.Kamal K., Yin Z., Wu M., Wu Z. Depthwise separable convolution architectures for plant disease classification. Comput. Electron. Agric. 2019;165:104948. [Google Scholar]
  • 40.Chen J., Chen J., Zhang D., Sun Y., Nanehkaran Y.A. Using deep transfer learning for image-based plant disease recognition. Comput. Electron. Agric. 2020;173:105393. doi: 10.1016/j.compag.2020.105393. [DOI] [Google Scholar]
  • 41.Thakur P.S., Sheorey T., Ojha A. VGG-ICNN: A Lightweight CNN model for crop disease recognition. Multimed. Tools Appl. 2023;82:497–520. doi: 10.1007/s11042-022-13144-z. [DOI] [Google Scholar]
  • 42.Karthik R., Hariharan M., Anand S., Mathikshara P., Johnson A., Menaka R. Attention embedded residual cnn for disease detection in tomato leaves. Appl. Soft Comput. 2020;86:105933. [Google Scholar]
  • 43.Zhao Y., Sun C., Xu X., Chen J. RIC-Net: A plant disease classification model based on the fusion of Inception and residual structure and embedded attention mechanism. Comput. Electron. Agric. 2022;193:106644. doi: 10.1016/j.compag.2021.106644. [DOI] [Google Scholar]
  • 44.Gao R., Wang R., Feng L., Li Q., Wu H. Dual-branch, efficient, channel attention-based crop disease classification. Comput. Electron. Agric. 2021;190:106410. doi: 10.1016/j.compag.2021.106410. [DOI] [Google Scholar]
  • 45.Li M., Zhou G., Chen A., Li L., Hu Y. Recognition of tomato leaf diseases based on LMBRNet. Eng. Appl. Artif. Intell. 2023;123:106195. doi: 10.1016/j.engappai.2023.106195. [DOI] [Google Scholar]
  • 46.Brock A., Donahue J., Simonyan K. Large scale GAN training for high fidelity natural image synthesis. arXiv. 2018; arXiv:1809.11096. [Google Scholar]


