Skip to main content
PLOS One logoLink to PLOS One
. 2020 Aug 24;15(8):e0238149. doi: 10.1371/journal.pone.0238149

Classification of multicategory edible fungi based on the infrared spectra of caps and stalks

Rui Gao 1,#, Cheng Chen 1,*, Hang Wang 1, Chen Chen 1,#, Ziwei Yan 1, Huijie Han 2, Fangfang Chen 1, Yan Wu 3, Zhiao Wang 1, Yuxiu Zhou 1, Rumeng Si 1, Xiaoyi Lv 1,*
Editor: Jie Zhang4
PMCID: PMC7444812  PMID: 32833991

Abstract

As a characteristic edible fungus with a high nutritional value and medicinal effect, the Bachu mushroom has a broad market. To distinguish among Bachu mushrooms with high value and other fungi effectively and accurately, as well as to explore a universal identification method, this study proposed a method to identify Bachu mushrooms by Fourier Transform Infrared Spectroscopy (FT-IR) combined with machine learning. In this experiment, two kinds of common edible mushrooms, Lentinus edodes and club fungi, were selected and classified with Bachu mushrooms. Due to the different distribution of nutrients in the caps and stalks, the caps and stalks were studied in this experiment. By comparing the average normalized infrared spectra of the caps and stalks of the three types of fungi, we found differences in their infrared spectra, indicating that the latter can be used to classify and identify the three types of fungi. We also used machine learning to process the spectral data. The overall steps of data processing are as follows: use partial least squares (PLS) to extract spectral features, select the appropriate characteristic number, use different classification algorithms for classification, and finally determine the best algorithm according to the classification results. Among them, the basis of selecting the characteristic number was the cumulative variance interpretation rate. To improve the reliability of the experimental results, this study also used the classification results to verify the feasibility. The classification algorithms used in this study were the support vector machine (SVM), backpropagation neural network (BPNN) and k-nearest neighbors (KNN) algorithm. The results showed that the three algorithms achieved good results in the multivariate classification of the caps and stalks data. Moreover, the cumulative variance explanation rate could be used to select the characteristic number. Finally, by comparing the classification results of the three algorithms, the classification effect of KNN was found to be the best. Additionally, the classification results were as follows: according to the caps data classification, the accuracy was 99.06%; according to the stalks data classification, the accuracy was 99.82%. This study showed that infrared spectroscopy combined with a machine learning algorithm has the potential to be applied to identify Bachu mushrooms and the cumulative variance explanation rate can be used to select the characteristic number. This method can also be used to identify other types of edible fungi and has a broad application prospect.

1. Introduction

The Bachu mushroom is a characteristic edible fungus in Xinjiang, China. It belongs to the Saddle fungus genus and is produced in the natural Populus euphratica forest region of the Yeerqiang River Basin in Xinjiang [1]. The Bachu mushroom not only has high nutritional value, rich in various amino acids and proteins but also has high medicinal value [2]. Studies have shown that the Bachu mushroom has anti-tumor, anti-oxidation and cholesterol-lowering effects and is used to treat gastric cancer, cerebral arteriosclerosis, cardiovascular disease, hypertension and other diseases [3]. Its nutritional value is much higher than that of general edible fungi; thus, it is of great research value. However, because the Bachu mushroom cannot be cultivated artificially, the market has been in short supply, increasing its price. Presently, the processing technology of the Bachu mushroom is developing and relatively mature, such as polysaccharide extraction of the Bachu mushroom and preparation of compound beverages, and these processing technologies have a wide application prospect [46]. In the future, after the derivatives of the Bachu mushroom are mass produced and the treatment process is widely used, controlling the quality of raw materials will be very significant to ensure product quality. Therefore, to prevent businesses from choosing other low-priced mushrooms as raw materials for high profits, it is necessary to identify a simple and rapid way to distinguish Bachu mushrooms from other types of mushrooms. However, current methods to identify Bachu mushrooms and other edible fungi depend on appearance. This method can distinguish mushroom species to a large extent, but it also has great limitations—that is, it is limited to individual intact edible bacteria—but liquid extract and powdered mushroom powder cannot be distinguished. Thus, to overcome the limitations of conventional methods and explore a more universal mushroom classification method, the spectral data of Bachu mushroom powder and two other types of mushroom powder were measured, and the classification of Bachu mushroom powder and other types of mushroom powder was identified by a machine learning algorithm in this study.

Due to the differences in the types and contents of nutrients in the stalks and caps of edible fungi, the caps and stalks were analyzed in this study [7]. Previous studies have shown some differences in the contents of proteins and amino acids between caps and stalks in edible fungi [8]. Additionally, the difference in the distribution of nutrients is related to the species of edible fungi [9]. Therefore, to make the experimental results more accurate and persuasive, as well as to avoid the uneven distribution of the substance content in the samples, this study adopted the classification analysis method of grouping according to the attribute index to study the caps and stalks [10].

The research method used in this experiment classified the Bachu mushroom and two other types of edible fungi by infrared spectra combined with a machine learning algorithm. Infrared spectra have the characteristics of wide applicability, high efficiency, convenience, repeatability and high sensitivity; thus, it has been widely used in physics, remote sensing, biology, food, medical and other research fields [11, 12]. Infrared spectroscopy has great application value in food research [13, 14]. Additionally, infrared spectroscopy combined with a machine learning algorithm has been applied to the classification of mushroom as food, the effect of the producing area on the nutritional value of fungi and the fine classification of rare edible fungi [1517]. The purpose of this study was to explore a universal identification method to identify Bachu mushroom using infrared spectra combined with a machine learning algorithm and verify the feasibility of the application of infrared spectra combined with an algorithm in the identification of edible mushroom species.

2. Experimental methods

2.1. Sample preparation

In this study, Lentinus edodes and club fungi were selected and classified with Bachu mushrooms. Among them, Lentinus edodes are produced in Fujian Province of China, and purchased from Fuchang Food Limited Company, Fujian Province of China; club fungi are produced in Yunnan Province of China, purchased from Wuweijin Store, and Bachu mushrooms are produced in Bachu County, Xinjiang Province, and purchased from the most famous wholesale market in Urumqi—Six Markets. The three kinds of mushrooms purchased are all dried. The caps of the three kinds of mushrooms are umbrella-shaped and dark brown. The stalks of club fungi are longer and those of Lentinus edodes and Bachu mushrooms are shorter. The three kinds of mushrooms are similar in appearance. The samples of the three types of edible fungi were purchased from the market. After incubating the prepared sample in an 80°C electric steam oven for dehydration treatment for 1 hour, the stalks were separated from the caps. Next, every three complete caps were crushed together, and the powder was passed through a 200-mesh sieve as a sample and named according to the mushroom species. The stalks were treated in the same way. Finally, 39 samples of Lentinus edodes powder, 47 samples of club fungi powder and 35 samples of Bachu mushroom powder were obtained.

2.2. Measurement of NIF spectra

The sample powder was placed into a 4-ml sample tube, and its infrared spectrum was measured. The spectrum acquisition instrument was a VERTEX 70 infrared spectrometer from BRUKER, Germany. Before each measurement of the FT-IR spectrum, the atmospheric background data were measured using OPUS65 software. The selected resolution was 8 cm-1, the number of scans was 32, the scan range was 4000–11000 cm-1, and the atmospheric compensation parameter was CO2. To reduce the influence of human error and other factors, each sample was scanned 3 times. Finally, the data obtained for the caps were 117 for Lentinus edodes, 141 for club fungi, 105 for Bachu mushrooms, and the same number of stalks data.

2.3. Statistical algorithm analysis

Statistical algorithms have been widely used to manage infrared spectral data [18]. In this study, PLS, SVM, KNN, and BPNN were used to process and analyze the spectral data. The caps data were reduced by PLS to extract features, and then the appropriate characteristic number was selected as the input of the three classification algorithms, namely, SVM, KNN and BPNN, and then the accuracy was obtained. Additionally, the stalk data were processed in the same way. All algorithms in this study are implemented on MATLAB 2018a.

The partial least squares (PLS) method is a mathematical optimization supervised learning method that can identify the best matching function for a set of data by minimizing the sum of the squares of errors. Based on the advantages of the PLS model, which is easy to identify noise and allows regression modeling with a small sample number, the PLS algorithm is widely used in various research fields [7, 19]. In food research, PLS has been used in food nutrition testing, food quality research and food industry research [2022]. PLS is often used in combination with spectra for feature extraction and further spectral data analysis [23]. In this study, to improve the classification efficiency and filter out the worthless spectral information, PLS was used to reduce the dimensionality of the original spectral data.

After dimensionality reduction of the original data, an appropriate characteristic number should be selected as the basis for classification. In this study, the characteristic number was selected based on the cumulative variance interpretation rate of the characteristic number. The PLS program used in this study is the plsregress function. The variance explanation rates of the factors extracted from the first and second columns of the PCTVAR matrix correspond to the variances of x and y, respectively; this study chose the variance explanation rate of y [24]. The variance interpretation rate is the degree of interpretation of the data characteristics of the dependent variables by a single factor, and the cumulative interpretation rate of n factors is the degree of interpretation of the data characteristics of the dependent variables by n factors—that is, the influence of n factors on the dependent variables. Therefore, in theory, we can select the appropriate number of factors according to the cumulative variance interpretation rate and select as few factors as possible to improve the classification efficiency to ensure the integrity of the extracted features. To explore the applicability of the theory, this study will further use the classification results to verify the theory. After selecting an appropriate characteristic number, it can be used as the input of the classification algorithms SVM, KNN and BPNN.

Support vector machine (SVM) is a commonly used generalized linear classifier, in which the core idea is to apply the principle of risk minimization to the field of classification. Regarding pattern classification, it has good generalization performance, robustness, versatility and simple calculation [25]. Therefore, in food science, SVM is widely applied to food classification and food quality testing [26, 27]. Based on the advantages of SVM and characteristics that can be used for multiple classifications, in this study, SVM was used to classify the spectral data of mushroom powder directly after three features were extracted.

The k-nearest neighbors (KNN) classification algorithm is one of the most practical algorithms in data mining classification technology. It is easy to understand and powerful at the same time [28]. Different from other classification algorithms, KNN does not need training. It directly finds the k samples nearest to the sample and divides them into categories with the largest number of samples among the k samples; thus, KNN is suitable for multivariate classification and has high classification accuracy when the category boundary is obvious [29]. Additionally, KNN has been widely used in food classification and quality inspection [3032]. Therefore, we chose the KNN algorithm as the second algorithm of multivariate classification.

The backpropagation neural network (BPNN) is a multilayer feedforward neural network trained according to the error backpropagation algorithm. The BP neural network has strong nonlinear mapping ability, parallel information processing ability and excellent self-learning ability; thus, it has been widely used in food research, biomedical fields and other research fields [3335]. Additionally, the BP neural network can achieve good classification results when it is used in multivariate classification [36]. In summary, we chose the BPNN as the third classification algorithm.

3. Results and discussion

3.1. Spectral analysis

After the obtained spectral data were averaged, normalized and smoothed, the obtained spectrogram is shown in Fig 1. Fig 1 shows the FT-IR spectra of the Bachu mushroom and Lentinus edodes stalks were similar, both with characteristic peaks at 5099 cm-1 and 8744 cm-1, and the spectral intensity of Lentinus edodes was higher than that of the Bachu mushroom. Additionally, the spectrum of the Lentinus edodes stalk has a characteristic peak at 5778 cm-1, and the spectral intensity in the range of 8500~11000 cm-1 was significantly lower than that of Lentinus edodes and the Bachu mushroom. Comparing the spectra of the three caps, the spectra of club fungi are quite different from those of the other two types of fungi. Fig 2 shows that the average normalized spectra of Lentinus edodes, club fungi and Bachu mushroom stalks all had peaks at 5153 cm-1 and 8755 cm-1, and the spectral intensity of Lentinus edodes was the highest in these two places. At 5153 cm-1, the peak intensity of club fungi was higher than that of the Bachu mushroom; however, at 8755 cm-1, the peak intensity of club fungi was slightly lower than that of the Bachu mushroom. Although the three spectral lines changed roughly the same; however, in the range of 4300~6800 cm-1, the spectral intensity of the Bachu mushroom was significantly lower than that of the other two kinds of fungi, and the spectral intensity of the Bachu mushroom was significantly lower than that of the other two kinds of fungi.

Fig 1. The average normalized spectrum of the caps of the three types of edible fungi.

Fig 1

Fig 2. The average normalized spectrum of the stalks of the three types of edible fungi.

Fig 2

Through comparative analysis of the infrared spectra of the caps and stalks of the three types of fungi, their infrared spectral images showed the same trend, but many peak intensities were different. Therefore, we can classify them according to the spectral data based on the infrared spectral differences among the three types. However, it is difficult to distinguish among the three types of edible fungi directly and accurately only by spectroscopy. Thus, to classify them efficiently and accurately, we used the combined infrared spectrum analysis with machine learning.

3.2. Data analysis

3.2.1. Dimensionality reduction with PLS

In the PLS algorithm, 50 features were selected to obtain the cumulative variance explanation rate curve (Figs 3 and 4). The cumulative variance explanation rate of the first five features of the caps reached 90%, while the variance explanation rate of the first five features of the stalks also reached more than 80%, and the cumulative variance explanation rate of the first 30 features of both was close to 100% (Figs 3 and 4). Thus, the extracted features can fully express the features of the original data [37]. The stalks data and caps data were classified by the SVM, BPNN, and KNN algorithms.

Fig 3. The principal component cumulative variance explanation rate curve of the caps.

Fig 3

Fig 4. The principal component cumulative variance explanation rate curve of the stalks.

Fig 4

3.2.2. Classification by algorithm

In this experiment, three classification algorithms were used—SVM, BPNN, and KNN—to classify the caps data according to different characteristic numbers. Additionally, the stalk data were processed in the same way. The parameter setting and classification results of the algorithm were as follows:

The main ideas of the SVM model in this experiment were as follows: Select the test set and training set; preprocess the data; select the best C and g parameters, and then use the best parameters for network training and prediction; and obtain the accuracy. Among them, the training set and test set are randomly selected according to the proportion 7:3. Pre-processing was used to normalize all the sample data [0,1]. In the SVM algorithm, the selection of C and g parameters will directly affect the classification results; thus, it is necessary to select the best C and g parameters to achieve the best classification results. In this study, the variation range of parameter C was [2–2,24], the range of parameter g was [2–4,24], and the method of parameter optimization was grid optimization [38]. To classify the caps data, 5, 10, 15, 20, 25, 30, 35, 40, 45 and 50 features were selected. The stalks data were processed in the same way. The multiple classification results of the stalks and caps were then obtained, as shown in Table 1.

Table 1. Classification results of the SVM algorithm.
Number of PLS components Classification accuracy of Caps Classification accuracy of Stalks
5 100.00% 79.63%
10 100.00% 81.48%
15 100.00% 80.56%
20 98.15% 82.41%
25 99.07% 82.41%
30 98.15% 83.33%
35 98.15% 83.33%
40 98.15% 83.33%
45 98.15% 81.48%
50 98.15% 81.48%

In the KNN algorithm, the k value was 5, the proportion of the random selection of the test set was 30%, and the method to calculate the distance between data was the cosine distance (Cosine KNN) [39]. To classify the caps data, 5, 10, 15, 20, 25, 30, 35, 40, 45 and 50 features were selected. The stalks data were processed in the same way. Each result was expressed as the average of five computational results. The multiple classification results of the stalks and caps were then obtained, as shown in Table 2.

Table 2. Classification results of the KNN algorithm.
Number of PLS components Classification accuracy of Caps Classification accuracy of Stalks
5 98.14% 95.36%
10 98.89% 100.00%
15 99.06% 99.82%
20 98.12% 99.82%
25 98.89% 99.82%
30 98.90% 100.00%
35 98.14% 100.00%
40 98.32% 99.82%
45 97.40% 99.46%
50 97.40% 99.44%

In the BPNN algorithm of this experiment, the transfer function of the hidden layer was tamsig, the output layer was purelin, the learning training function was trainlm, and the weight learning function was learngdm. The network parameters were set to 300 training times, the network performance goal was 0.1, and the learning rate was 0.1 [40]. Thirty percent of all samples were selected randomly as the test set. To classify the caps data, 5, 10, 15, 20, 25, 30, 35, 40, 45 and 50 features were selected. The stalks data were processed in the same way. Each result was expressed as the average of five computational results. The multiple classification results of the stalks and caps were then obtained, as shown in Table 3.

Table 3. Classification results of the BPNN algorithm.
Number of PLS components Classification accuracy of Caps Classification accuracy of Stalks
5 94.26% 93.33%
10 92.24% 94.81%
15 92.96% 94.08%
20 95.56% 98.70%
25 99.07% 94.07%
30 98.15% 95.56%
35 97.96% 93.15%
40 97.78% 93.70%
45 97.59% 93.70%
5 0 97.59% 93.52%

3.2.3. Verification of the feasibility of selecting the characteristic number with the cumulative variance explanation rate

Figs 5 and 6 shows the line chart of the classification results of the three algorithms when selecting different characteristic numbers. The accuracy of SVM in the classification of caps was 100% when selecting 5, 10 and 15 features, decreased slightly with the increase in the characteristic number, and then stabilized at 98%. According to the fungal stalks data, the accuracy of SVM classification increased gradually when selecting 5, 10, 15, 20 features, fluctuated slightly with the increase in the characteristic number, and then stabilized at 82%. Using KNN to classify the caps data, the accuracy was stable between 97.40% and 99.07% but fluctuated slightly. When the characteristic number tended to 50, the accuracy was stable at 97.40%. According to the classification of stalks data by KNN, the accuracy varied greatly when selecting 5–10 features, fluctuated slightly with the increase in the characteristic number, and then stabilized at 99.50%. For the classification of caps by BPNN, the accuracy varied greatly when the characteristic number was 5~30, and the classification effect was unstable. When the characteristic number was more than 30, the accuracy decreased slightly and finally stabilized at 97.59%. When BPNN classified stalks, its accuracy varied greatly when the characteristic number was less than 35; when the characteristic number was more than 35, the accuracy was stabilized at 93.6%.

Fig 5. Accuracy of the three algorithms to select different characteristic numbers according to the data of the caps.

Fig 5

Fig 6. Accuracy of the three algorithms to select different characteristic numbers according to the data of the stalks.

Fig 6

Combined with the cumulative variance explanation rate, with the increase in the characteristic number, the extraction degree of the extracted features to the original data information gradually increased. Thus, the classification results become more reliable, and the accuracy will gradually tend to a certain value—that is, in the ideal state, the accuracy of all features extracted for classification. However, in this process, not all information is conducive to improving accuracy and some information will interfere with the classification results; thus, the accuracy will fluctuate slightly [41]. Therefore, when we select the characteristic number according to the variance interpretation rate, we should consider the degree of feature extraction, accuracy and classification efficiency comprehensively. In summary, when the cumulative variance interpretation rate of features reaches 90%, the classification results are sufficiently reliable and the accuracy is high, confirming that it is feasible to select the characteristic number according to the cumulative variance interpretation rate.

3.2.4. Selection of the best algorithm

According to the caps data classification, the PLS-SVM algorithm has the best classification effect when selecting 5~15 features, with an accuracy of 100%. The PLS-KNN algorithm has the best classification effect when selecting 15 features, with an accuracy of 99.06%. When selecting 25 features, the PLS-BPNN algorithm has the best classification effect, with an accuracy of 99.07% (Fig 5). According to the stalks data classification, the classification effect of the PLS-SVM algorithm was the best when selecting 30~40 features, with an accuracy of 83.33%. The classification effect of the PLS-KNN algorithm was the best when selecting 35 features, with an accuracy of 100%. When 20 features were selected, the classification effect of the PLS-BPNN algorithm was the best, reaching 98.70% (Fig 6).

Combined with the selection of the characteristic number to analyze the three algorithms, the PLS-KNN algorithm has a better classification effect on stalks and caps, and the accuracy is more stable when selecting different characteristic numbers. Thus, the PLS-KNN algorithm was chosen as the optimal algorithm. Using comprehensive analysis of the accuracy of PLS-KNN classification when selecting different characteristic numbers, a characteristic number of 15 reveals a higher accuracy for both caps and stalks and a high classification efficiency. Therefore, in this experiment, the PLS-KNN algorithm was finally selected, and the characteristic number was 15. The final classification accuracy was 99.06% for the caps and 99.82% for the stalks.

4. Conclusions

This study verified the feasibility of infrared spectroscopy combined with the PLS-SVM, PLS-KNN and PLS-BPNN algorithms in the classification of the Bachu mushroom and other edible mushrooms. We compared the classification results and selected the optimal algorithm and best feature number to reveal an efficient, rapid and universal method to identify the Bachu mushroom, overcoming the limitation that the current identification of Bachu mushroom only depends on its appearance. Moreover, the method is universal and can be applied to the classification and identification of other types of food. Additionally, this study proposed to select the characteristic number according to the cumulative variance interpretation rate and used the classification results of the three algorithms to verify its feasibility. This method of selecting the characteristic number can also be extended to other research fields of factor analysis, and the appropriate characteristic number can be selected intuitively and quickly.

Supporting information

S1 Data

(ZIP)

Data Availability

All relevant data are within the manuscript and its Supporting Information files.

Funding Statement

This work has been supported by the Science and Technology Project on aid to Xinjiang Uygur Autonomous Region (No.2018E02058), the National Science Foundation of China (No.61765014), Xinjiang Uygur Autonomous Region Graduate Innovation Project of China, National Innovation Program for College students (No.201910755039), the Urumqi Science and Technology Project (No. P161310002), the Reserve Talents Project of the National Highlevel Personnel of Special Support Program (QN2016YX0324).

References

  • 1.Zhao Q., et al. , Species clarification of the culinary Bachu mushroom in western China. Mycologia, 2016. 108(4): p. 828–36. 10.3852/16-002 [DOI] [PubMed] [Google Scholar]
  • 2.Chen X., Wang J., and Li H., Basic Ingredientsand Nutritional Assessment Of Baehu Mushroom. Farm Products Processing, 2015(09): p. 46–48. [Google Scholar]
  • 3.Zeng D. and Zhu S., Purification, characterization, antioxidant and anticancer activities of novel polysaccharides extracted from Bachu mushroom. International journal of biological macromolecules, 2018. 107: p. 1086–1092. 10.1016/j.ijbiomac.2017.09.088 [DOI] [PubMed] [Google Scholar]
  • 4.XuJie H., et al. , Extraction of BaChu mushroom polysaccharides and preparation of a compound beverage. Carbohydrate Polymers, 2008. 73(2): p. 289–294. [Google Scholar]
  • 5.XuJie H. and Wei C., Optimization of extraction process of crude polysaccharides from wild edible BaChu mushroom by response surface methodology. Carbohydrate Polymers, 2008. 72(1): p. 67–74. [Google Scholar]
  • 6.Wang C.-Y., A Review on the Potential Reuse of Functional Polysaccharides Extracted from the By-Products of Mushroom Processing. Food and Bioprocess Technology, 2020. 13(2): p. 217–228. [Google Scholar]
  • 7.Akindahunsi A.A. and Oyetayo F.L., Nutrient and antinutrient distribution of edible mushroom, Pleurotus tuber-regium (fries) singer. LWT—Food Science and Technology, 2006. 39(5): p. 548–553. [Google Scholar]
  • 8.Zhu Y. and Tan A.T.L., Chemometric Feature Selection and Classification of <i>Ganoderma lucidum</i> Spores and Fruiting Body Using ATR-FTIR Spectroscopy. American Journal of Analytical Chemistry, 2015. 06(10): p. 830–840. [Google Scholar]
  • 9.Oboh G. and Shodehinde S., Distribution of nutrients, polyphenols and antioxidant activities in the pilei and stipes of some commonly consumed edible mushrooms in Nigeria. Bulletin of the Chemical Society of Ethiopia, 2009. 23(3). [Google Scholar]
  • 10.Ball G.H., Classification Analysis. 1970. [Google Scholar]
  • 11.Ferrari M. and Quaresima V., A brief review on the history of human functional near-infrared spectroscopy (fNIRS) development and fields of application. Neuroimage, 2012. 63(2): p. 921–35. 10.1016/j.neuroimage.2012.03.049 [DOI] [PubMed] [Google Scholar]
  • 12.Chen C., et al. , Exploration research on the fusion of multimodal spectrum technology to improve performance of rapid diagnosis scheme for Thyroid Dysfunction. Journal of biophotonics, 2020. 13(2): p. e201900099 10.1002/jbio.201900099 [DOI] [PubMed] [Google Scholar]
  • 13.Van de Voort F., Fourier transform infrared spectroscopy applied to food analysis. Food Research International, 1992. 25(5): p. 397–403. [Google Scholar]
  • 14.Chen C., et al. , Application of near infrared spectroscopy combined with SVR algorithm in rapid detection of cAMP content in red jujube. Optik, 2019. 194: p. 163063. [Google Scholar]
  • 15.Meenu M. and Xu B., Application of vibrational spectroscopy for classification, authentication and quality analysis of mushroom: A concise review. Food chemistry, 2019. [DOI] [PubMed] [Google Scholar]
  • 16.Li Y., et al. , Geographical traceability of wild Boletus edulis based on data fusion of FT-MIR and ICP-AES coupled with data mining methods (SVM). Spectrochim Acta A Mol Biomol Spectrosc, 2017. 177: p. 20–27. 10.1016/j.saa.2017.01.029 [DOI] [PubMed] [Google Scholar]
  • 17.Fu H., et al. , A comprehensive quality evaluation method by FT-NIR spectroscopy and chemometric: Fine classification and untargeted authentication against multiple frauds for Chinese Ganoderma lucidum. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2017. 182: p. 17–25. [DOI] [PubMed] [Google Scholar]
  • 18.Geladi P. and Dåbakk E., An overview of chemometrics applications in near infrared spectrometry. Journal of Near Infrared Spectroscopy, 1995. 3(3): p. 119–132. [Google Scholar]
  • 19.Rosipal R. and Krämer N.. Overview and recent advances in partial least squares in International Statistical and Optimization Perspectives Workshop" Subspace, Latent Structure and Feature Selection". 2005. Springer. [Google Scholar]
  • 20.Xie L., et al. , Quantification of glucose, fructose and sucrose in bayberry juice by NIR and PLS. Food Chemistry, 2009. 114(3): p. 1135–1140. [Google Scholar]
  • 21.Ivorra E., et al. , Detection of expired vacuum-packed smoked salmon based on PLS-DA method using hyperspectral images. Journal of food engineering, 2013. 117(3): p. 342–349. [Google Scholar]
  • 22.Heyder M., Theuvsen L., and Hollmann-Hespos T., Investments in tracking and tracing systems in the food industry: a PLS analysis. Food Policy, 2012. 37(1): p. 102–113. [Google Scholar]
  • 23.Mehmood T., et al. , A review of variable selection methods in Partial Least Squares Regression. Chemometrics and Intelligent Laboratory Systems, 2012. 118: p. 62–69. [Google Scholar]
  • 24.Lorber A., Wangen L.E., and Kowalski B.R., A theoretical foundation for the PLS algorithm. Journal of Chemometrics, 1987. 1(1): p. 19–31. [Google Scholar]
  • 25.Karamizadeh, S., et al. Advantage and drawback of support vector machine functionality. in 2014 international conference on computer, communications, and control technology (I4CT). 2014. IEEE.
  • 26.Caetano S., et al. , Geographical classification of olive oils by the application of CART and SVM to their FT‐IR. Journal of Chemometrics: A Journal of the Chemometrics Society, 2007. 21(7‐9): p. 324–334. [Google Scholar]
  • 27.Zheng H. and Lu H., A least-squares support vector machine (LS-SVM) based on fractal analysis and CIELab parameters for the detection of browning degree on mango (Mangifera indica L.). Computers and Electronics in Agriculture, 2012. 83: p. 47–51. [Google Scholar]
  • 28.Zhang M.-L. and Zhou Z.-H., ML-KNN: A lazy learning approach to multi-label learning. Pattern recognition, 2007. 40(7): p. 2038–2048. [Google Scholar]
  • 29.Dhanabal S. and Chandramathi S., A review of various k-nearest neighbor query processing techniques. International Journal of Computer Applications, 2011. 31(7): p. 14–22. [Google Scholar]
  • 30.Ouyang Q., Zhao J., and Chen Q., Classification of rice wine according to different marked ages using a portable multi-electrode electronic tongue coupled with multivariate analysis. Food research international, 2013. 51(2): p. 633–640. [Google Scholar]
  • 31.Ivorra E., et al. , Shelf life prediction of expired vacuum-packed chilled smoked salmon based on a KNN tissue segmentation method using hyperspectral images. Journal of food engineering, 2016. 178: p. 110–116. [Google Scholar]
  • 32.Makelele L., et al. , Microbiological quality of food sold by street vendors in Kisangani, Democratic Republic of Congo. African Journal of Food Science, 2015. 9(5): p. 285–290. [Google Scholar]
  • 33.Suliman A. and Zhang Y., A review on back-propagation neural networks in the application of remote sensing image classification. Journal of Earth Science and Engineering, 2015. 5: p. 52–65. [Google Scholar]
  • 34.Magwaza L.S., et al. , NIR spectroscopy applications for internal and external quality analysis of citrus fruit—a review. Food and Bioprocess Technology, 2012. 5(2): p. 425–444. [Google Scholar]
  • 35.Zhou, Y., Y. Wang, and Q. Yao. Segmentation of rice disease spots based on improved BPNN. in 2010 International Conference on Image Analysis and Signal Processing. 2010. IEEE.
  • 36.Liu D., et al. , Lychee variety discrimination by hyperspectral imaging coupled with multivariate classification. Food analytical methods, 2014. 7(9): p. 1848–1857. [Google Scholar]
  • 37.Lorenzo-Seva U., How to report the percentage of explained common variance in exploratory factor analysis. Tarragona, Italy: Department of Psychology, 2013. [Google Scholar]
  • 38.Lin S.-W., et al. , Particle swarm optimization for parameter determination and feature selection of support vector machines. Expert Systems with Applications, 2008. 35(4): p. 1817–1824. [Google Scholar]
  • 39.Kozma L., k Nearest Neighbors algorithm (kNN). Helsinki University of Technology, 2008. [Google Scholar]
  • 40.Dai H. and MacBeth C., Effects of learning parameters on learning procedure and performance of a BPNN. Neural networks, 1997. 10(8): p. 1505–1521. 10.1016/s0893-6080(97)00014-2 [DOI] [PubMed] [Google Scholar]
  • 41.Rosenthal R. and Rubin D.B., A note on percent variance explained as a measure of the importance of effects. Journal of Applied Social Psychology, 1979. 9(5): p. 395–396. [Google Scholar]

Decision Letter 0

Jie Zhang

8 Jul 2020

PONE-D-20-12067

Classification of Multicategory Edible Fungi based on the Infrared Spectra of Caps and Stalks

PLOS ONE

Dear Dr. Lv,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Specifically, please specify which version of Matlab was used and to italicise the biological genus and species names of the fungi.

Please submit your revised manuscript by Aug 21 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Jie Zhang

Academic Editor

PLOS ONE

Journal Requirements:

1. We understand that you purchased mushrooms from local markets for this study. In your Methods section, please provide additional details regarding the source of this material.

Please provide the geographic coordinates and names of the purchase locations (e.g., stores, markets), if available, as well as any further details about the purchased items (e.g., lot number, source origin, description of appearance) to ensure reproducibility of the analyses.

Additional Editor Comments (if provided):

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript is very well written all the statistical analysis have been done with great rigour and very nicely presented. The manuscript is very easy to follow and targeted for chemometricians. My only comments is to specify which version of Matlab was used and to italicise the biological genus and species names of the fungi. Good job once again to all the authors involved in this publication.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Frederick Lia

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Aug 24;15(8):e0238149. doi: 10.1371/journal.pone.0238149.r002

Author response to Decision Letter 0


10 Aug 2020

Subject: Revised version of the manuscript

No.: PONE-D-20-12067

Title: Classification of Multicategory Edible Fungi based on the Infrared Spectra of Caps and Stalks

Dear Editor and Reviewer:

Thanks to the editor and reviewer for your patient review and important suggestion for our article. We have further revised your valuable suggestions. We have highlighted our manuscript changes through the use of red font in the revised manuscript.

The revision was carried out as a result of the comments of the editor and the reviewer.

The main corrections in the paper and the responds to the editor’s comments and the responds to the reviewer’s comments are as following:

Editor :

Response to comment (1):

( We understand that you purchased mushrooms from local markets for this study. In your Methods section, please provide additional details regarding the source of this material. Please provide the geographic coordinates and names of the purchase locations (e.g., stores, markets), if available, as well as any further details about the purchased items (e.g., lot number, source origin, description of appearance) to ensure reproducibility of the analyses.)

Response-1: Many thanks to the editor for the suggestions. Based on your suggestions, we have supplemented the description of the source origin, names of the purchase locations and appearance of mushrooms in 2. Experimental methods to ensure reproducibility of the analyses. Thanks again to the editor for your valuable suggestions.

The contents are as follows:

Among them, Lentinus edodes are produced in Fujian Province of China, and purchased from Fuchang Food Limited Company, Fujian Province of China; club fungi are produced in Yunnan Province of China, purchased from Wuweijin Store, and Bachu mushrooms are produced in Bachu County, Xinjiang Province, and purchased from the most famous wholesale market in Urumqi -- Six Markets. The three kinds of mushrooms purchased are all dried. The caps of the three kinds of mushrooms are umbrella-shaped and dark brown. The stalks of club fungi are longer and those of Lentinus edodes and Bachu mushrooms are shorter. The three kinds of mushrooms are similar in appearance. The samples of the three types of edible fungi were purchased from the market.

Reviewer #1:

1.Response to comment (1-4):

Response: Thank you very much for the reviewer's agreement on our work, which will encourage us to do better work in the future.

2.Response to comment (5):

( The manuscript is very well written all the statistical analysis have been done with great rigour and very nicely presented. The manuscript is very easy to follow and targeted for chemometricians. My only comments is to specify which version of Matlab was used and to italicise the biological genus and species names of the fungi. Good job once again to all the authors involved in this publication.)

Response-5: Thanks to the reviewer for the suggestions. We have supplemented the MATLAB version and italicised the biological genus and species names of the fungi in the article.

Thank the reviewer and editor for your patient and careful review. I hope you have good health and success in your work.

Sincerely yours,

Dr. Xiaoyi Lv, Dr. Cheng Chen

Attachment

Submitted filename: Response to Reviewers.doc

Decision Letter 1

Jie Zhang

11 Aug 2020

Classification of Multicategory Edible Fungi based on the Infrared Spectra of Caps and Stalks

PONE-D-20-12067R1

Dear Dr. Lv,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Jie Zhang

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Jie Zhang

13 Aug 2020

PONE-D-20-12067R1

Classification of Multicategory Edible Fungi based on the Infrared Spectra of Caps and Stalks

Dear Dr. Lv:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Jie Zhang

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Data

    (ZIP)

    Attachment

    Submitted filename: Response to Reviewers.doc

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting Information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES