PLOS ONE. 2020 Jun 23;15(6):e0229751. doi: 10.1371/journal.pone.0229751

Precise automatic classification of 46 different pollen types with convolutional neural networks

Víctor Sevillano 1, Katherine Holt 2, José L Aznarte 1,*
Editor: Mudassar Raza3
PMCID: PMC7310700  PMID: 32574174

Abstract

In palynology, the visual classification of pollen grains from different species is a hard task which is usually tackled by human operators using microscopes. Many industries, including medical and pharmaceutical, rely on the accuracy of this manual classification process, which is reported to be around 67%. In this paper, we propose a new method to automatically classify pollen grains using deep learning techniques that improve the correct classification rates on images not previously seen by the models. Our proposal correctly classifies up to 98% of the examples from a dataset with 46 different classes of pollen grains, produced by the Classifynder classification system. This is an unprecedented result which surpasses all previous attempts both in accuracy and in the number and difficulty of the taxa under consideration, which include types previously considered indistinguishable.

Introduction

Pollen is widely recognized as a nuisance, but also as a valuable tool in several scientific fields. An estimated 40% of the world’s population experience seasonal allergic rhinitis (SAR) driven by exposure to pollen [1]. Pollen forecasting, informed by examination of airborne pollen, has become a key tool for management of SAR [2]. Pollen is also particularly important for quality verification of honey [3], for reconstructing past vegetation to understand past changes in climate [4], biodiversity [5] and human impacts [6], and as a forensic tool [7]. Common to all these areas is the need for experienced analysts to spend considerable amounts of time identifying and counting pollen on slides. While other branches of science have been transformed by the technological advances of recent decades, palynology is languishing, with the practical methodology of pollen counting having hardly advanced beyond that of the 1950s.

But this is not for want of trying. Flenley [8] was the first to call attention to the need and potential of automation of pollen counting. A handful of early attempts were published in the later decades of the 20th century, but the rapid increase in capability in computational intelligence over the early part of the 21st century resulted in considerable acceleration in the field during this time, with numerous attempts at partial or complete automation of palynology appearing in the literature, summarized in [9] and [10].

While the results of the existing studies can be regarded as promising, they are rather limited in that they typically deal with a relatively small number of taxa (max 30, mean 8), and success/accuracy rates vary. While some palynological applications may require a lower level of taxonomic diversity than others, it is arguable that many ‘real world’ pollen problems will require higher diversity than that of most of the existing studies. For example, Stillman and Flenley [11] suggested that the minimum number of taxa for paleoecological applications would be around 40 types.

Recently, Sevillano and Aznarte [10] presented an example of pollen classification which applied deep learning convolutional neural networks to the POLEN23E image dataset, a freely available set of 805 bright-field microscope images of 23 different pollen types from the Brazilian savannah [12]. Their model achieved accuracies of over 97%, which, on a 23-class problem and weighted for the number of taxa, represents one of the best success rates of any attempt at automated pollen analysis currently documented in the literature.

In this paper, we apply the same approach to a wider set of pollen images produced by an automated pollen classification system, the Classifynder, marketed by Veritaxa Ltd. This image set includes twice as many pollen types as the POLEN23E set and over 19,000 individual images.

State of the art

Previous approaches to automated palynology are comprehensively summarized in [9] and [10]. They can be divided into image-based and non-image-based methods. Non-image-based methods sense discriminating characteristics through alternative techniques such as fluorescence [13], Fourier-transform infrared spectroscopy [14] and Raman spectroscopy [15]. They are not discussed further here.

Image-based methods traditionally involve defining and extracting discriminating features from pollen images, followed by sorting via statistical or machine-learning classifiers. An example using the same type of images as this research is [16]. As per [10], these image-based methods fall into three categories depending on the type of features used: (1) discriminant features that are largely visual/geometrical (e.g. shape, symmetry, diameter); (2) discriminant features that are texture-based, i.e. they capture information about the pollen grain ornamentation (e.g. gray-level co-occurrence matrices, entropy features); and (3) a combination of the two approaches.

Recently, a new approach to image-based pollen classification has emerged which harnesses the power of deep learning neural networks. While classification is still based on key image features, in this approach the model determines and extracts the features itself, rather than having them predefined by human analysts. Transfer learning can then be used to maximize classification ability by leveraging the representativeness of these learned features, for instance in combination with a linear discriminant classifier. In [17], a model learns not only the features but also the classifier itself from a deep learning neural network; the method makes use of transfer learning to leverage knowledge from pre-trained networks.

Existing studies applying this new approach outperform the more traditional, supervised methods described in (1)-(3) above. For example, [17] achieved 94% training accuracy on a dataset of 30 pollen types. Their technique is similar to ours, but their results are based on the training set and no information is given about how the model behaves with unseen images. The same can be said of [18], which reports 100% accuracy on 10 very different pollen grains using transfer learning over a convolutional neural network based on the VGG16 architecture, and of [19], which reports 99.8% training accuracy for 5 species. [10] experimented with three deep learning models for classification of pollen images from the POLEN23E dataset [12], and the same dataset has recently been used by [20]. These models use different combinations of automatic feature extraction, transfer learning and a pre-trained convolutional neural network.

Table 1 provides a summary of previous studies, including the number of taxa and the reported accuracy/success rates.

Table 1. Previous attempts at automated classification of 10 or more pollen types, with number of taxa, classifier type and highest reported correct classification rate (CCR).

Abbreviations: NN = neural network, SVM = support vector machine, LDA = linear discriminant analysis, KNN = k-nearest neighbour, CNN = convolutional neural network, RNN = recurrent neural network.

Study  # taxa  Classifier  Reported CCR
Daood et al. (2016) [21] 30 CNN 94%
Sevillano & Aznarte (2018) [10] 23 CNN 97%
Goncalves et al. (2016) [12] 23 SVM 64%
Kaya et al. (2013) [22] 19 Rough Set 91%
Ticay-Rivas et al. (2011) [23] 17 NN 96%
Marcos et al. (2015) [24] 15 KNN 95%
Lagerstrom et al. (2015) [25] 15 LDA 81%
Li et al. (2004) [26] 13 LDA, NN 100%
Treloar et al. (2004) [27] 12 LDA 100%
Khanzhina et al. (2018) [19] 11 CNN 96%
Dhawale et al. (2013) [28] 10 NN 85%
Daood et al. (2018) [18] 10 CNN+RNN 100%

Materials and methods

Pollen image set

The images used in our experiment are dark-field microscope images captured on the Classifynder system (for an example, see Fig 1; for the full dataset see the Supporting Information section). The Classifynder (formerly known as AutoStage) is documented in [29]. It was designed as a complete ‘standalone’ system for automated pollen analysis. The system uses basic shape features to identify the locations of objects of interest (i.e. pollen grains) on conventional microscope slides under a low-power objective. It then switches to a higher-power objective and visits the location of each pollen object to capture an image of it for classification. Objects are imaged at different focus levels, producing a ‘Z-stack’, a set of images with different focus depths which show the different vertical details of a microscopic object. The system selects the best-focused portions of each object’s Z-stack images and combines them to produce a single composite image, which is then segmented from the background. This image is then falsely colored to show depth (Fig 1).

Fig 1. Samples of seven pollen types found in the dataset.


Each row shows pollen types usually considered indistinguishable.

In its routine operation, the Classifynder extracts values for 43 different image features (both geometrical and textural; see [16] for a list) and attaches them to each image as metadata. Images are classified using a simple neural network (feed-forward, with a single hidden layer) which compares the feature values of the unknown pollen types with those of known ‘library’ images.

In this paper, we have used the Classifynder to generate our image set only, and have not used any of the Classifynder-generated image features or labeling. Our images were automatically collected from pollen reference slides. Images gathered from each reference slide were manually sorted and were chosen with the intention of creating a set that was fully representative of the pollen type in question, i.e. accounting for variability in appearance resulting from different viewing angles. Images with irregularities such as debris, or images containing multiple grains, were excluded. While the Classifynder produced satisfactory classification in previous experiments with six taxa (Betula pendula, Dactylis glomerata, Cupressus macrocarpa, Ligustrum lucidum, Acacia dealbata and Pinus radiata) in [29], performance of its neural network classifier declines with 15 taxa (Acacia ramoissima, Atriplex paludosa, Asteraceae, Casuarina littoralis, Disphyma, Dracophyllum, Euphorbia hirta, Eucalyptus fasciculosa, Isoetes pusilla, Myrsine, Nothofagus cunninghamii, Nothofagus discoidea, Olearia algida, Phyllocladus aspleniifolius) in [16].

The original image dataset comprises a total of 19,500 images of 46 different pollen types, representing 37 different families. This was the total number of taxa for which suitable datasets were available at the time our research commenced. Data augmentation is applied to this original dataset.

Unlike the POLEN23E dataset, which is representative of the Brazilian savannah, our dataset is a mix of taxa found in New Zealand and the Pacific, including both native and introduced taxa. Many are types encountered in honeys from these regions.

The number of images per taxon varies from 40 to 1,700 and is given for each type in Fig 8, in the Results and discussion section. The majority of the images were captured from reference pollen slides, with the exception of 15 taxa (Coprosma sp., Echium vulgare, Geniostoma sp., Griselinia sp., Ixerba brexiodes, Lotus sp., Lycopodium clavatum, Knightia excelsa, Metrosideros sp., Quintinnia sp., Ranunuculus sp., Salix sp., Taraxacum sp., Trifolium sp. and Weinmannia sp.), whose images were captured from slides of pollen extracted from honey samples. All pollen samples, regardless of origin, were acetolysed following the method of Erdtman [30], suspended in either silicone oil or glycerine jelly, and mounted under cover-slips on glass slides.

Fig 8. Distribution of F1 score per type in the 10 folds of the cross-validation process.


Taxa are ordered with respect to the number of images for each type, which is included in brackets next to the taxon name.

Slides were scanned using the Classifynder system, which automatically locates and images the pollen grains, with only limited human input needed at the start of the process. It is this feature which has allowed for such a large image dataset to be generated, as we were not reliant on a human analyst to manually locate and image each grain.

The resultant raw image sets for each taxon or honey sample were manually examined. Image sets from honey samples were manually sorted into individual taxa, while image sets from reference slides were manually filtered to remove images of ‘outliers’, i.e. grains not representative of that pollen type due to deformation or malformation, as well as any non-pollen debris or pollen of other taxa that may have made their way onto the slide.

Overall, the number of images per taxon was dictated by what was available or could be gathered. Many of the individual sets were originally generated for other projects or experiments, which partially accounts for the considerable differences in the number of images per taxon. For example, the images for the taxa with the highest numbers of images (Leptospermum scoparium and Kunzea ericoides) were gathered as part of work on differentiating these two taxa using the Classifynder [31]; even more images were available for these two types, and a cap of 1,700 was arbitrarily selected.

The 46 taxa comprise a wide range of pollen morphologies, with some more similar than others. We have deliberately included two pairs and one trio of morphologically similar taxa to test the system’s ability to discriminate closely related types, as well as across a broader range of morphologies. These are shown in Fig 1 and include 1) Fuscospora fusca and F. truncata, 2) Leptospermum scoparium and Kunzea ericoides, and 3) Olearia semidentata, Brachyglottis huntii and B. repanda. The last three are members of the Asteraceae family, many members of which are notoriously difficult to distinguish. L. scoparium and K. ericoides represent two important nectar sources in New Zealand honey, with L. scoparium being the source of the high-value New Zealand Mānuka honey. Traditionally, these two taxa have been regarded as virtually indistinguishable, limiting the use of pollen analysis in Mānuka honey quality assessment (although see [31] for more information). F. fusca and F. truncata are two species of southern beech (Fagaceae), whose pollen (along with that of the two other species of Fuscospora) is regarded as virtually identical [32].

Deep learning convolutional neural network

Neural networks are inspired by the function and structure of the human brain. In these models, multiple layers of artificial neurons are trained to process and identify particular features of the input space, with each layer extracting different valuable information. Deep learning is the common name for techniques that train very complex neural networks, applicable to many types of data (signal processing, image processing, speech recognition or natural language processing, among many others), often producing results similar to those a human being would produce.

In the field of image recognition, deep learning has reached levels of accuracy not previously achieved. While traditional neural networks contain a few hidden layers of neurons, deep networks can contain tens or hundreds. These models are trained with large data sets and can learn features without the need for manual intervention. This ability to extract features from large data sets makes them especially suited to the classification of pollen grains, where the correct identification of such features is especially complex. Eliminating the manual selection of features significantly simplifies the classification process while increasing its reliability.

There are different techniques to create and train these models. The three most common are 1) training from scratch, where the network is built from the beginning; 2) transfer learning, where the structure of a pre-trained model is adapted; and 3) feature extraction, a more specialized approach in which the learned features are used as input for another machine learning model, such as a support vector machine or a linear discriminant classifier. In this work, we present a hybrid of the last two configurations.

First, we use the pre-trained network AlexNet [33] to automatically extract significant features from the images. AlexNet is one of the most popular convolutional neural networks and was originally designed to classify images from 1,000 different classes. It is much larger than previous convolutional neural networks, consisting of 5 convolutional layers and 3 fully connected layers, with around 60 million parameters and 650,000 neurons.

Through the different layers we can observe what features the network learns by comparing areas of activation with the original image. The first layers learn basic features like color and edges, while in the deeper layers the network learns more complex and abstract features. Identifying features in different layers allows us to understand what the network has learned. In Fig 2 we show an example of nine of the trained filters for each of the five convolutional layers of the network. These filters allow the deeper layers to detect more complex patterns: each deep layer can learn higher-level combinations of the features learned by the previous layers. After these layers, AlexNet contains three fully connected layers, the last of which has 1,000 neurons, one for each class; the ith neuron of this layer is interpreted as the probability that the input image belongs to the ith class. In our model, we extract the automatically created features from the second fully connected layer, because it is the deepest layer before the output and is thus expected to contain the highest-level representation of the differences among the classes.
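The paper does not provide code or name an implementation framework. Purely as an illustration, the sketch below shows how such second-fully-connected-layer features could be extracted with Python and torchvision; the framework, file name and preprocessing details are our assumptions, not the authors' setup, and torchvision's AlexNet port expects 224x224 inputs rather than the 227x227 framing described later in this paper.

```python
# Illustrative sketch, not the authors' code: extract the 4096-dimensional
# activations of AlexNet's second fully connected layer with torchvision.
import torch
from torchvision import models, transforms
from PIL import Image

model = models.alexnet(pretrained=True)
model.eval()
# Drop the final 1000-way output layer so the forward pass returns the
# activations of the second fully connected layer (4096 values per image).
model.classifier = torch.nn.Sequential(*list(model.classifier.children())[:-1])

preprocess = transforms.Compose([
    transforms.Resize((224, 224)),  # torchvision's AlexNet port expects 224x224
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

img = Image.open("pollen_grain.png").convert("RGB")  # hypothetical file name
with torch.no_grad():
    features = model(preprocess(img).unsqueeze(0))   # shape: (1, 4096)
```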

Fig 2. Example of 9 filters from the five convolutional layers of the trained network.


The features extracted by these layers do not, by themselves, offer a very clear view of how the network operates. We can, however, analyze the activations that a particular image produces when processed through the network to obtain its classification. In this case, the different filters and activations in the different layers are clearer to the naked eye. To illustrate this, an image belonging to the Brachyglottis huntii class has been selected (see Fig 3).

Fig 3. A sample Brachyglottis huntii image (left) and the activation it produced in filters 29 (center) and 85 (right) of the first layer of the convolutional neural network.


The image is first preprocessed to meet the network’s input requirements, as described below. We then feed the image to the trained convolutional neural network and display the activations of different layers, examining them to analyze what features the network learns by comparison with the original image. Fig 3 shows the activations that the image produces in two of the filters of the first convolutional layer. White pixels represent strong positive activations and black pixels represent strong negative activations; when a filter is mostly gray, it has not responded strongly to the original image. The position of the pixels in the activation maps corresponds to the same position in the original image, so we can analyze which part of the image is identified by each of the activated filters.

For filter 29 (center of Fig 3) we can see positive and negative activations, in white and black respectively, corresponding to the features detected by this filter. The positive activations of this filter recognize the edges, identifying the external peaks of the image and the internal purple points. Filter 85, on the right-hand side of Fig 3, is the one showing the largest activations of the first convolutional layer for the selected sample image.
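Activation maps like those in Fig 3 can be obtained with a forward hook on the first convolutional layer. The sketch below (again an illustration with torchvision, reusing the `model`, `preprocess` and `img` objects from the previous sketch) shows the idea; the filter indices in Fig 3 refer to the authors' own trained network, so the channel chosen here is arbitrary.

```python
# Illustrative sketch: capture and display first-layer activations via a hook.
import matplotlib.pyplot as plt

activations = {}

def save_activation(module, inputs, output):
    activations["conv1"] = output.detach()  # shape: (1, n_filters, H, W)

hook = model.features[0].register_forward_hook(save_activation)
with torch.no_grad():
    model(preprocess(img).unsqueeze(0))
hook.remove()

channel = 29  # arbitrary channel; Fig 3's filter numbers refer to the authors' network
plt.imshow(activations["conv1"][0, channel].numpy(), cmap="gray")
plt.title(f"First convolutional layer, filter {channel}")
plt.show()
```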

The deeper layers contain filters that have learned to identify more complex features building on the previous layers. A closer look at all the activations produced by this sample image in convolutional layer 5, shown in Fig 4, brings us closer to the complexity of the features learned by this deep layer. Taking the whiter areas as positive activations, we can observe, for example, how the layer learns about the external shape of the grain.

Fig 4. Fifth convolutional layer activations for the Brachyglottis huntii class sample image of Fig 3.


As mentioned above, after this layer another three fully connected layers follow, the last of which is the output layer. Once the network is trained, we extract the relevant features from the second fully connected layer and train a linear discriminant classifier to perform the classification on these features.
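A minimal sketch of this final step, assuming the 4096-dimensional feature vectors have already been extracted into arrays (the variable names and the toy data below are purely illustrative):

```python
# Illustrative sketch: linear discriminant classification on CNN features.
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)
X_train = rng.normal(size=(200, 4096))   # toy stand-ins for extracted features
y_train = rng.integers(0, 46, size=200)  # 46 pollen classes
X_val = rng.normal(size=(50, 4096))
y_val = rng.integers(0, 46, size=50)

lda = LinearDiscriminantAnalysis()
lda.fit(X_train, y_train)
print("validation CCR:", accuracy_score(y_val, lda.predict(X_val)))
```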

Experimental design

To properly estimate the error and to assess the robustness of the models against overfitting, the accuracy of each experiment was computed using a 10-fold cross-validation process, summarized in Fig 5. As the figure shows, the images are divided into two subsets, training and validation, comprising 90% and 10% of the images respectively. This division is repeated ten times (folds), guaranteeing that every image appears exactly once in a validation set. The data augmentation process is performed on the training set only, after the validation images have been separated, to avoid bias in the model.

Fig 5. Diagram for the cross-validation procedure.


The red circles represent the validation sets for each step, containing 10% of the images for each class.
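As an illustration of this protocol, the sketch below builds stratified folds with scikit-learn and applies augmentation only after the split; `images`, `labels`, `augment` and `evaluate_fold` are hypothetical placeholders for the dataset and for the training/evaluation pipeline described in the previous sections.

```python
# Illustrative sketch of the 10-fold protocol in Fig 5 (hypothetical helpers).
import numpy as np
from sklearn.model_selection import StratifiedKFold

skf = StratifiedKFold(n_splits=10, shuffle=True, random_state=42)
fold_ccr = []
for train_idx, val_idx in skf.split(images, labels):
    train_imgs = [images[i] for i in train_idx]
    train_lbls = [labels[i] for i in train_idx]
    val_imgs = [images[i] for i in val_idx]
    val_lbls = [labels[i] for i in val_idx]

    # Augment only the training split, after the validation images have been
    # set aside, so no synthetic copy of a validation image leaks into training.
    train_imgs, train_lbls = augment(train_imgs, train_lbls)

    fold_ccr.append(evaluate_fold(train_imgs, train_lbls, val_imgs, val_lbls))

print("mean CCR:", np.mean(fold_ccr), "+/-", np.std(fold_ccr))
```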

We compute the following common performance measures to evaluate model performance:

\mathrm{CCR} = \frac{TP + TN}{TP + TN + FP + FN} \qquad (1)

\mathrm{precision} = \frac{TP}{TP + FP} \qquad (2)

\mathrm{recall} = \frac{TP}{TP + FN} \qquad (3)

F_1\ \mathrm{score} = \frac{2 \cdot \mathrm{precision} \cdot \mathrm{recall}}{\mathrm{precision} + \mathrm{recall}} \qquad (4)

where CCR indicates the correct classification rate, TP refers to true positives, TN to true negatives, FP to false positives, and FN to false negatives. Precision, recall and F1 score were computed as an average weighted by the number of images in each class.
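For reference, these four measures can be computed per fold as in the sketch below. The use of scikit-learn is our assumption; the weighted averaging corresponds to the class-size weighting described above.

```python
# Illustrative sketch: Eqs. (1)-(4) computed with scikit-learn, with precision,
# recall and F1 averaged over classes and weighted by class size.
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

def fold_metrics(y_true, y_pred):
    ccr = accuracy_score(y_true, y_pred)                       # Eq. (1)
    precision, recall, f1, _ = precision_recall_fscore_support(
        y_true, y_pred, average="weighted", zero_division=0)   # Eqs. (2)-(4)
    return ccr, precision, recall, f1

print(fold_metrics([0, 1, 2, 2, 1], [0, 1, 2, 1, 1]))  # toy example
```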

Image preprocessing and augmentation

In our experiment, in addition to a custom image preprocessing procedure, we implemented data augmentation. This considerably increases the training time of both the convolutional neural network and the linear discriminant classifier, but it also increases the accuracy accordingly. The process is summarized in Fig 6.

Fig 6. Summary of the data augmentation process applied.


One of the most relevant features of the dataset is image size: pollen grains of different classes have different sizes, so this feature is essential for classification. As the convolutional neural network requires a 227x227x3 input, all images must be fitted into that format. Our preprocessing algorithm crops the grains from the original images, keeping a minimum padding around them and maintaining their original size so that the network can exploit it effectively. This reduction of the images to their effective content facilitates data augmentation, since it allows new images to be generated from the originals without resizing them in the process. The technique makes it possible to balance the number of images among classes and also provides a wider variety of spatial features: by rotating the image, relevant features can be found in different locations, helping to provide spatial invariance. The result is equivalent to obtaining samples in different orientations under the microscope. Finally, we insert the cropped and rotated image into a 227x227 black frame.
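The following sketch illustrates the idea of that cropping-and-framing step. The use of PIL, the padding value and the file name are our illustrative choices, and the code assumes a segmented grain on a black background that fits within the 227x227 frame.

```python
# Illustrative sketch: crop the grain from its black background without
# rescaling and paste it, centred, onto a 227x227 black canvas.
from PIL import Image

def crop_and_frame(path, size=227, pad=4):
    img = Image.open(path).convert("RGB")
    left, top, right, bottom = img.getbbox()   # bounding box of non-black pixels
    box = (max(left - pad, 0), max(top - pad, 0),
           min(right + pad, img.width), min(bottom + pad, img.height))
    grain = img.crop(box)
    canvas = Image.new("RGB", (size, size), (0, 0, 0))
    canvas.paste(grain, ((size - grain.width) // 2, (size - grain.height) // 2))
    return canvas

framed = crop_and_frame("raw_classifynder_image.png")  # hypothetical file name
```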

In the framework of convolutional neural networks it is common to derive an augmented image dataset by generating batches of new images, with optional preprocessing such as resizing, rotation, and reflection, based on the available images. Augmenting image data helps to prevent the network from over-fitting and memorizing the exact details of the training images. It also increases the effective size of the training data set.

Furthermore, the original image set for this experiment has classes with very different numbers of images, ranging from 1,700 in the most represented case to 40 in the least. In order to capture the greatest number of features of each class while maintaining a reasonable training time, we increased the number of images in each class to match that of the most represented class.

Hence, from the original preprocessed images, synthetic images are generated to complete the image dataset. The creation process relies only on the rotation of images; resizing was not considered appropriate, since the images would lose the aforementioned size ratio. Each new image is generated from a randomly chosen image of its class: a rotation angle, also chosen at random, is applied, producing a rotated version of the original that maintains the dimensions of the grain, even if the resulting canvas size differs.

This technique allows us to generate synthetic images that preserve the original proportions but present different characteristics, significantly increasing the variety of features presented to the convolutional neural network.
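A sketch of this rotation-only balancing step, building on the framed images produced by the `crop_and_frame` sketch above (function and variable names are ours):

```python
# Illustrative sketch: grow a class to `target_count` images using random
# rotations of randomly chosen originals, preserving the grain's size.
import random

def augment_class(framed_images, target_count):
    """framed_images: list of 227x227 PIL images belonging to one class."""
    augmented = list(framed_images)
    while len(augmented) < target_count:
        original = random.choice(framed_images)  # random source image of the class
        angle = random.uniform(0, 360)           # random rotation angle
        # Rotate about the centre on the same 227x227 black canvas,
        # so the grain keeps its original dimensions.
        augmented.append(original.rotate(angle))
    return augmented
```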

Results and discussion

As explained above, to properly evaluate model performance we use a validation set composed of images not seen by the model during the training process. These images provide a realistic test, since they are completely unknown to the model and allow us to anticipate its behavior on new observations.

Table 2 presents the results obtained by the model on the training and validation datasets, as an average over the 10 cross-validation partitions, with the standard deviation of each value in brackets. The CCR on the validation set reaches 97.86%, close to the 99.91% obtained during training.

Table 2. Results for the training and validation sets.

            CCR                Precision         Recall               F1 score
Training    99.91% (± 0.011)   0.998 (± 0.021)   1.000 (± 4.84e-04)   0.999 (± 0.011)
Validation  97.86% (± 0.252)   0.979 (± 0.030)   0.978 (± 0.031)      0.978 (± 0.027)

The small gap between training and validation values demonstrates the robustness of the model, ruling out overfitting during the training process. This is supported by the low standard deviations of the different measures, which also attest to the stability of the model.

However, the CCR alone does not provide sufficient information about the model’s behavior with respect to false positives and false negatives when predicting new observations. The values obtained for precision and recall, 0.979 and 0.978 respectively, confirm the consistency of the model with respect to false positives and false negatives, and the high F1 score validates the consistency of these measures.

To complete the picture and shed light on how the model behaves for the different classes, Fig 7 shows the confusion matrix obtained by the model over the 10 validation sets of the cross-validation procedure. This matrix was constructed by accumulating the individual cross-validation results: since each validation set contained 10% of the data for each type, and there were 10 such sets, the column-wise sums of the matrix equal the total number of images for each type. We can examine this matrix by looking at the aforementioned three sets of virtually indistinguishable pollen types.

Fig 7. Confusion matrix for the 46 pollen types through the 10-fold cross validation process.


For each type, 10% of the available images were saved for validation.
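The accumulation described above can be reproduced with a sketch like the following, where `fold_results` is a hypothetical list of (true labels, predicted labels) pairs, one per validation fold; summed over the 10 folds, each class's marginal total equals the full number of images of that type.

```python
# Illustrative sketch: accumulate one confusion matrix over the 10 folds.
import numpy as np
from sklearn.metrics import confusion_matrix

n_classes = 46
total_cm = np.zeros((n_classes, n_classes), dtype=int)
for y_true, y_pred in fold_results:   # hypothetical per-fold predictions
    total_cm += confusion_matrix(y_true, y_pred, labels=list(range(n_classes)))
```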

With regard to the trio of Olearia semidentata, Brachyglottis huntii and B. repanda, we see that the model wrongly predicts as O. semidentata 29 images that should have been classified as B. huntii, representing 9% of the total for the latter. Conversely, 16 images (8%) of O. semidentata are wrongly classified as B. huntii. These comprise every misclassification recorded for both classes. Furthermore, there was only one confusion between B. repanda and B. huntii.

When dealing with Fuscospora fusca and F. truncata, the model misclassifies 16 images of the former as the latter, representing 4.8% of the total; conversely, 12 images of F. truncata (6.7%) are misclassified as F. fusca.

Finally, concerning Leptospermum scoparium and Kunzea ericoides, 42 images of the latter (only 2.5%) are misclassified as L. scoparium, whereas 73 images of the former (4.3%) are misclassified as K. ericoides. Given the known similarities between the two types, these results can be considered very good (and are in fact far better than what a human operator can achieve).

Fig 8 shows a boxplot of the distribution of the F1 scores for the validation sets during the cross-validation process. In this figure we can observe the good general behavior of the model: two types are perfectly classified in all the cross-validation partitions (Carpodetus and Gunnera), 13 other types are perfectly classified in median (B. repanda, Carex, Geniostoma, Ixerba, Knightia, Lycopodium, Maonao, Muelhenbeckia, Plagianthus, Prumnopitys, Ranunuculus, Santalum and Typha), and medians are above 0.95 for all but 7 of the classes.

In Fig 8, the 46 pollen types are ordered according to the number of images available for each one. This reveals a pattern: leaving aside the indistinguishable pairs mentioned above, which of course present lower F1 scores than the others, there appears to be a relationship between the number of available images and the results. As expected, the error of the model decreases as the number of images increases: all the types with a median F1 below 0.95 are either part of the indistinguishable pairs or have fewer than 200 images.

This is evident in the case of the Poaceae class, which has the fewest original images, only 40 in total (and thus only 4 in each validation set). This of course implies a greater dependence on the selection of samples in the different cross-validation experiments. It is also clear that the difficulties with this particular class are responsible for a significant decrease in the model’s overall accuracy.

Conclusions

In this work we have presented a model for the classification of images of 46 different kinds of pollen grains captured with the Classifynder automatic microscope. We used different image pre-processing and data augmentation techniques to feed a pre-trained convolutional neural network, retrained by transfer learning to extract features from one of its deepest layers. Finally, these automatically extracted features are used to perform classification with a linear discriminant classifier.

The behavior of the model is excellent, with an accuracy higher than 97% on unseen sets of images. Furthermore, we have shown that it is able to correctly set apart pairs of pollen types considered indistinguishable by palynologists. Performance was slightly lower for those types with fewer images available, suggesting that even higher overall accuracies could be reached with larger, more complete datasets.

These groundbreaking results are of great interest for the automation of pollen counting, which has been identified as a key future goal for the field of palynology.

Supporting information

S1 File

(TXT)

Data Availability

The data underlying this study is available via Figshare: https://doi.org/10.6084/m9.figshare.12370307.

Funding Statement

The authors received no specific funding for this work.

References

1. D’Amato G, Cecchi L, Bonini S, Nunes C, Annesi Maesano I, Behrendt H, et al. Allergenic pollen and pollen allergy in Europe. Allergy. 2007;62(9):976–990. Available from: https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1398-9995.2007.01393.x
2. Bastl K, Berger U, Kmenta M. Evaluation of Pollen Apps Forecasts: The Need for Quality Control in an eHealth Service. J Med Internet Res. 2017 May;19(5):e152. Available from: http://www.jmir.org/2017/5/e152/
3. Ohe WVD, Oddo LP, Piana ML, Morlot M, Martin P. Harmonized methods of melissopalynology. Apidologie. 2004;35(Suppl. 1):S18–S25. Available from: https://doi.org/10.1051/apido:2004050
4. Bogotá-A RG, Groot MHM, Hooghiemstra H, Lourens LJ, Van der Linden M, Berrio JC. Rapid climate change from north Andean Lake Fúquene pollen records driven by obliquity: implications for a basin-wide biostratigraphic zonation for the last 284 ka. Quaternary Science Reviews. 2011 November;30(23):3321–3337. Available from: http://www.sciencedirect.com/science/article/pii/S0277379111002393
5. Odgaard BV. Fossil pollen as a record of past biodiversity. Journal of Biogeography. 1999;26(1):7–17. Available from: https://onlinelibrary.wiley.com/doi/abs/10.1046/j.1365-2699.1999.00280.x
6. Flantua SGA, Hooghiemstra H, Vuille M, Behling H, Carson JF, Gosling WD, et al. Climate variability and human impact in South America during the last 2000 years: synthesis and perspectives from pollen records. Climate of the Past. 2016;12:483–523. Available from: http://oro.open.ac.uk/45927/
7. Mildenhall DC, Wiltshire PEJ, Bryant VM. Forensic palynology: why do it and how it works. Forensic Science International. 2006 November;163(3):163–172. https://doi.org/10.1016/j.forsciint.2006.07.012
8. Flenley JR. The problem of pollen recognition. In: Clowes MB, Penny JP, editors. Problems in Picture Interpretation. Canberra: CSIRO; 1968. p. 141–145.
9. Holt KA, Bennett KD. Principles and methods for automated palynology. New Phytologist. 2014;203(3):735–742. Available from: https://nph.onlinelibrary.wiley.com/doi/abs/10.1111/nph.12848
10. Sevillano V, Aznarte JL. Improving classification of pollen grain images of the POLEN23E dataset through three different applications of deep learning convolutional neural networks. PLOS ONE. 2018 September;13(9):e0201807. Available from: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0201807
11. Stillman EC, Flenley JR. The needs and prospects for automation in palynology. Quaternary Science Reviews. 1996;15(1):1–5. Available from: http://www.sciencedirect.com/science/article/pii/0277379195000763
12. Gonçalves AB, Souza JS, Silva GGd, Cereda MP, Pott A, Naka MH, et al. Feature Extraction and Machine Learning for the Classification of Brazilian Savannah Pollen Grains. PLOS ONE. 2016 June;11(6):e0157044. Available from: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0157044
13. Mitsumoto K, Yabusaki K, Aoyagi H. Classification of pollen species using autofluorescence image analysis. Journal of Bioscience and Bioengineering. 2009 January;107(1):90–94. Available from: http://www.sciencedirect.com/science/article/pii/S1389172308000510
14. Dell’Anna R, Lazzeri P, Frisanco M, Monti F, Malvezzi Campeggi F, Gottardini E, et al. Pollen discrimination and classification by Fourier transform infrared (FT-IR) microspectroscopy and machine learning. Analytical and Bioanalytical Chemistry. 2009 July;394(5):1443–1452. Available from: https://doi.org/10.1007/s00216-009-2794-9
15. Ivleva NP, Niessner R, Panne U. Characterization and discrimination of pollen by Raman microscopy. Analytical and Bioanalytical Chemistry. 2005 January;381(1):261–267. Available from: https://doi.org/10.1007/s00216-004-2942-1
16. Lagerstrom R, Arzhaeva Y, Bischof L, Haberle S, Hopf F, Lovell D. A comparison of classification algorithms within the Classifynder pollen imaging system. AIP Conference Proceedings. 2013 October;1559(1):250–259. Available from: https://aip.scitation.org/doi/abs/10.1063/1.4825017
17. Daood A, Ribeiro E, Bush M. Pollen Grain Recognition Using Deep Learning. In: Advances in Visual Computing. Lecture Notes in Computer Science. Springer, Cham; 2016. p. 321–330. Available from: https://link.springer.com/chapter/10.1007/978-3-319-50835-1_30
18. Daood A, Ribeiro E, Bush M. Sequential Recognition of Pollen Grain Z-Stacks by Combining CNN and RNN. In: The Thirty-First International Flairs Conference; 2018. Available from: https://www.aaai.org/ocs/index.php/FLAIRS/FLAIRS18/paper/view/17642
19. Khanzhina N, Putin E, Filchenkov A, Zamyatina E. Pollen grain recognition using convolutional neural network. In: ESANN; 2018.
20. Menad H, Ben-Naoum F, Amine A. Deep Convolutional Neural Network for Pollen Grains Classification. In: JERI; 2019.
21. Daood A, Ribeiro E, Bush M. Pollen recognition using a multi-layer hierarchical classifier; 2016. p. 3091–3096.
22. Kaya Y, Pinar S, Erez E, Fidan M. An expert classification system of pollen of Onopordum using a rough set approach. Review of Palaeobotany and Palynology. 2014;189:50–56. https://doi.org/10.1016/j.revpalbo.2012.11.004
23. Ticay-Rivas JR, del Pozo-Baños M, Travieso CM, Arroyo-Hernández J, Pérez ST, Alonso JB, et al. Pollen Classification Based on Geometrical, Descriptors and Colour Features Using Decorrelation Stretching Method. In: Iliadis L, Maglogiannis I, Papadopoulos H, editors. Artificial Intelligence Applications and Innovations. Berlin, Heidelberg: Springer Berlin Heidelberg; 2011. p. 342–349.
24. Marcos JV, Nava R, Cristóbal G, Redondo R, Escalante-Ramírez B, Bueno G, et al. Automated pollen identification using microscopic imaging and texture analysis. Micron. 2015;68:36–46. Available from: http://www.sciencedirect.com/science/article/pii/S096843281400167X
25. Lagerstrom R, Holt K, Arzhaeva Y, Bischof L, Haberle S, Hopf F, et al. Pollen Image Classification Using the Classifynder System: Algorithm Comparison and a Case Study on New Zealand Honey. In: Sun C, Bednarz T, Pham TD, Vallotton P, Wang D, editors. Cham: Springer International Publishing; 2015. p. 207–226. Available from: https://doi.org/10.1007/978-3-319-10984-8_12
26. Li P, Treloar WJ, Flenley JR, Empson L. Towards automation of palynology 2: the use of texture measures and neural network analysis for automated identification of optical images of pollen grains. Journal of Quaternary Science. 2004;19(8):755–762. Available from: https://onlinelibrary.wiley.com/doi/abs/10.1002/jqs.874
27. Treloar WJ, Taylor GE, Flenley JR. Towards automation of palynology 1: analysis of pollen shape and ornamentation using simple geometric measures, derived from scanning electron microscope images. Journal of Quaternary Science. 2004;19(8):745–754. Available from: https://onlinelibrary.wiley.com/doi/abs/10.1002/jqs.871
28. Dhawale VR, Tidke JA, Dudul SV. Neural network based classification of pollen grains. In: 2013 International Conference on Advances in Computing, Communications and Informatics (ICACCI); 2013. p. 79–84.
29. Holt K, Allen G, Hodgson R, Marsland S, Flenley J. Progress towards an automated trainable pollen location and classifier system for use in the palynology laboratory. Review of Palaeobotany and Palynology. 2011 October;167(3):175–183. Available from: http://www.sciencedirect.com/science/article/pii/S0034666711001205
30. Erdtman G. The acetolysis method, a revised description. Svensk Bot Tidskr. 1960;54:561–564.
31. Holt KA, Bebbington MS. Separating morphologically similar pollen types using basic shape features from digital images: A preliminary study. Applications in Plant Sciences. 2014 August;2(8). Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4141716/
32. Moar NT. Pollen grains of New Zealand dicotyledonous plants. Lincoln, N.Z.: Manaaki Whenua Press; 1994. OCLC: 31198927.
33. Krizhevsky A, Sutskever I, Hinton GE. ImageNet Classification with Deep Convolutional Neural Networks. Commun ACM. 2017 May;60(6):84–90. Available from: http://doi.acm.org/10.1145/3065386

Decision Letter 0

Mudassar Raza

10 Apr 2020

PONE-D-20-04167

Precise automatic classification of 46 different pollen types with convolutional neural networks

PLOS ONE

Dear Dr. Aznarte,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

We would appreciate receiving your revised manuscript by May 25 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Mudassar Raza, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.


Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: N/A

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript presents the computational experiment regarding highly accurate classification of the pollen samples based on their microscopic images. The described methodology has gradually emerged from the authors’ previous works which is clearly described and referenced. All expected elements of explanation are provided, in particular: the motivation, new details as compared to the previous works, comparative features when confronted with state-of-the-art references, etc. There are two main areas referred to in the manuscript: a) biological data – i.e. pollen images – along with the need to recognize the type from the image, b) convolutional neural network computational tool. Both are introduced and described in the manner that seems clear enough to the other branch of readers. Some tutorial elements have been provided and the details that were somewhat out of scope have been properly referenced. The results are presented in the carefully designed manner – informative however not overdone. The overall impression is that even particularly keen readers looking for some specific details should be satisfied. In particular, I approve the presented computational and experimental methodology.

I have marked ‘no’ in the category of language quality but only because there were two options (yes/no), as there are only some few typos which need correction, in the manuscript – for example, in the Results and discussion section, right below Table 1: ‘The’.

1. In the Introduction section: some regular references, like [x], would be expected where only the year, like (1996) or (2016), is mentioned.

2. In Materials and Methods/Experimental design: CCR abrv. is not explained, while TP, TN, etc., are – and they all seem to belong to similar level of common knowledge.

3. I wonder if grouping the hard-to-distinguish taxa into ‘grouped categories’ could make any sense from the pollen classification point of view, but then the data presented in Table 1 might look even more promising – in the form of new Table 2 (The results in Table 1 should be kept). It seems that with the resulting data, already at hand, one could easily calculate all the quality assessing parameters for just 42 categories, when three of them would be composed of more than one taxon. The results of kind as in Fig. 8 could be also added for such case.

4. The Acknowledgements section is in Latin, which is a bit strange and, unfortunately, unreadable to me.

Reviewer #2: In the study, the authors propose using convolutional neural networks to classify pollen images produced by the Classifynder classification system and claim their proposed method outperforms all previous attempts both in accuracy, class number and indistinguishable classes. The manuscript is clear and easy to follow.

Though it has some merits, it suffers some severe problems, which unfortunately cannot be remedied from my point of view.

First of all, they ignore the fact that the Classifynder is embedded with a neural network used to classify pollen images. Since the class labels generated by this neural network are used as the ground truth, no other classification algorithms (including deep learning) can beat this embedded algorithm because its CCR, precision, recall and F1 score are always 100%. Unless there are other pollen image data sets labeled in other approaches (for example manually labeled pollen images), I don’t think they can demonstrate that their proposed method surpasses previous attempts.

Second, a method is claimed to be better than previous attempts on classification but without any quantitative comparison in its result and discussion, which in principle is faulty and not credible. In addition, their results are redundant. The Table 1 shows that the proposed method can achieve performances (CCR, precision, recall and F1 score) greater than 97%. However, they virtually repeat this information by digging into confusion matrix because CCR, precision, recall and F1 score are calculated summaries based on the confusion matrix. The confusion matrix can’t add more information from a different angle than the Table 1. Neither F1 score by type.

At last, the manuscript spends a large space to explain some basic machine learning concepts, such as 10-fold cross validation (Fig. 5) and performance measures etc. It gives me the impression of stuffing the manuscript with unnecessary content.

Overall, I don’t think this manuscript can be improved significantly, especially if there are no other pollen images with labels.

Reviewer #3: In this paper Sevillano et al. use convolutional neural networks for automatic classification of 46 pollen types. Deep learning methods improve the accuracy of the classification over manual methods from 67% to 98%. The authors apply the methodology used in a previous report (their reference [10]) to a larger data set that includes a larger number of pollen types. They use a large number of images (19,500) corresponding to the 46 pollen types. The number of images per taxon ranges from 40 to 1,700.

The authors use a standard pre-trained convolutional neural network (Alexnet) and a conventional experimental design to train their neural network and analyze its performance. The study shows an impressive increase in classification performance when using deep learning methodologies. It would be helpful that the authors also present a table comparing the performance of other published pollen classifiers including the number of images and pollen types used on those studies.

Below follow some comments and questions that may improve the readability and impact of this paper.

1) The authors should provide a reference for the method in paragraph between lines 54-60.

2) Please describe briefly what is a Z-stack (line 82).

3) To improve readability of the paper, perhaps you can provide the accuracy rate for low and high number of taxa of the Classifynder neural network. (line 93).

4) Figure 8 is discussed in the text after Figure 1. Should then Figure 8 be Figure 2?

5) It is not clear what the reader can learn from Figure 2 (line 183). Perhaps the authors could explain with a greater depth what is in the figure.

6) The authors assume that layer 5 (Figure 4) learns about complex characteristics of the pollen type. Can the authors provide some insight on what these characteristics could be from the activations shown in Figure 4?

7) Does the number of images used (19,500) include those produced by data augmentation, or 19,500 is the number of original images?

8) Can you clarify how the rotation of the original figures increases the features presented to the neural network? Which is the value added of rotating an image? Wouldn’t rotation just create redundant data?

9) How were the training and validation sets constructed? Since the classification performance depends on the number of images available (and therefore on the pollen type, Figure 8), the overall performance reported in Table 1 may depend critically on how the sets were selected.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Jun 23;15(6):e0229751. doi: 10.1371/journal.pone.0229751.r002

Author response to Decision Letter 0


6 May 2020

Response to Reviewer #1

Comment 0: I have marked ‘no’ in the category of language quality but only because there were two options (yes/no), as there are only some few typos which need correction, in the manuscript – for example, in the Results and discussion section, right below Table 1: ‘The’.

Response: We thank the reviewer for his/her call, and we have reviewed the grammar.

Comment 1: In the Introduction section: some regular references, like [x], would be expected where only the year, like (1996) or (2016), is mentioned.

Response: References and citations have been reviewed.

Comment 2: In Materials and Methods/Experimental design: CCR abrv. is not explained, while TP, TN, etc., are – and they all seem to belong to similar level of common knowledge

Response: The definition for CCR (Correct Classification Rate) has been included in the “Experimental design” section.

Comment 3: I wonder if grouping the hard-to-distinguish taxa into ‘grouped categories’ could make any sense from the pollen classification point of view, but then the data presented in Table 1 might look even more promising – in the form of new Table 2 (The results in Table 1 should be kept). It seems that with the resulting data, already at hand, one could easily calculate all the quality assessing parameters for just 42 categories, when three of them would be composed of more than one taxon.

The results of kind as in Fig. 8 could be also added for such case.

Response: As part of our future work, we are considering the probabilities of each taxa in reference to each of the different classes. In this way, for those classes that have similar probabilities, we will be able to indicate what classes they resemble instead of giving an absolute classification.

Comment 4: The Acknowledgements section is in Latin, which is a bit strange and, unfortunately, unreadable to me.

Response: In reference to this point, a section of the document template was unintentionally included. We have proceeded to eliminate it. We apologize for the inconvenience.

Response to Reviewer #2

Comment 1: First of all, they ignore the fact that the Classifynder is embedded with a neural network used to classify pollen images. Since the class labels generated by this neural network are used as the ground truth, no other classification algorithm (including deep learning) can beat this embedded algorithm, because its CCR, precision, recall and F1 score are always 100%. Unless there are other pollen image data sets labelled by other approaches (for example, manually labelled pollen images), I don’t think they can demonstrate that their proposed method surpasses previous attempts.

Response: Reviewer 2 appears to have assumed that we used the native neural network of the Classifynder to provide the initial identifications of the images. This is incorrect. We have not used the neural network of the Classifynder in any part of the research presented here, and this is made clear in the statement within the methods: "In this paper, we have used the Classifynder to generate our image set only."

Furthermore, we cite papers that clearly demonstrate that the native neural network simply cannot produce levels of classification accuracy similar to the approach used here, i.e. "While the Classifynder has produced satisfactory classification in previous experiments with low numbers of taxa (e.g. Holt et al. 2011), performance of the neural network classifier declines with greater numbers of taxa (e.g. Lagerstrom et al. 2015)". The latter paper demonstrates that precision from the native neural network is far from 100%.

This has been clarified in the Material & Methods section of the paper with the following text: "Images gathered from each reference slide were manually sorted, and were chosen with the intention of creating a set that was fully representative of the pollen type in question, i.e. accounting for variability in appearance resulting from different viewing angles. Images with irregularities such as debris, or images containing multiple grains were excluded".

Comment 2: Second, a method is claimed to be better than previous attempts at classification but without any quantitative comparison in its results and discussion, which in principle is faulty and not credible. In addition, their results are redundant. Table 1 shows that the proposed method can achieve performances (CCR, precision, recall and F1 score) greater than 97%. However, they virtually repeat this information by digging into the confusion matrix, because CCR, precision, recall and F1 score are summaries calculated from the confusion matrix. The confusion matrix can’t add more information from a different angle than Table 1, and neither can the F1 score by type.

Response: In the first version of the paper, we did not provide a detailed comparison with previous attempts in the discussion. We provided a summary of previous attempts in the ‘State of the Art’ section, which highlights some of the highest reported accuracy rates. But we also made the point that the majority of these previous studies work with smaller numbers of classes, so even if they achieve equal or higher precision, they are arguably less valuable in a practical sense, as they are less likely to be useful in solving real-world pollen classification problems. We did not think there was much point in performing a detailed quantitative comparison in the discussion, as we are not able to compare ‘apples with apples’: there are no other studies that have examined a pollen classification problem on the same scale as this one. However, we have added a new table to the paper, based on the one by Holt and Bennett (2014), which provides a summary of previous studies, including class sizes and accuracy/success rates.

With regard to the confusion matrix and the F1 score by type, we must disagree with Reviewer 2, since both figures do add information with respect to Table 3 (former Table 1). For example, they make it possible to determine which pollen types are difficult to distinguish and which are easy, something that is impossible if we stick to Table 3 (former Table 1) alone.
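To illustrate this point, the per-type figures are read directly off the rows and columns of the confusion matrix, which is exactly the type-level detail that the aggregate scores cannot show. The following is a minimal sketch, not the authors’ code; the function name is illustrative:

    # Minimal sketch (not the authors' code): per-class metrics from a confusion matrix.
    # cm[i, j] = number of validation images of true class i predicted as class j.
    import numpy as np

    def per_class_metrics(cm):
        cm = np.asarray(cm, dtype=float)
        tp = np.diag(cm)                                     # correct predictions per class
        precision = tp / np.maximum(cm.sum(axis=0), 1e-12)   # column sums = predicted counts
        recall = tp / np.maximum(cm.sum(axis=1), 1e-12)      # row sums = true counts
        f1 = 2 * precision * recall / np.maximum(precision + recall, 1e-12)
        return precision, recall, f1                         # one value per pollen type

A low recall for a given row immediately flags a hard-to-distinguish type, information that a single overall CCR or F1 figure cannot convey.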

Comment 3: Lastly, the manuscript spends a large amount of space explaining some basic machine learning concepts, such as 10-fold cross-validation (Fig. 5), performance measures, etc. It gives me the impression of stuffing the manuscript with unnecessary content.

Response: Since the paper is aimed at a broader audience than machine learning experts, we consider it necessary to clarify some of the main concepts used in our work. An example is cross-validation: as the review of the state of the art suggests, not all previous attempts were aware of the available error estimation procedures and their main drawbacks. We thus must insist on leaving that section as it is, despite the "stuffing the manuscript" remark by Reviewer 2.

Comment 4: Overall, I don’t think this manuscript can be improved significantly, especially if there are no other pollen images with labels.

Response: Given the fact that the main objection of Reviewer 2 is not sustained, we hope he or she will find the paper interesting.

Response to Reviewer #3

Comment 0: It would be helpful if the authors also presented a table comparing the performance of other published pollen classifiers, including the number of images and pollen types used in those studies.

Response: Thanks to Reviewer 3 for his/her indications. We have added a new table (Table 1) to the paper with a summary of the results of previous publications which are comparable with ours.

Comment 1: The authors should provide a reference for the method in the paragraph between lines 54-60.

Response: We added a reference for the method in the “State of the art” section.

Comment 2: Please briefly describe what a Z-stack is (line 82).

Response: We have included the definition in the article as follows: “A Z-stack is a set of images with different focus depths which show the different vertical details of a microscopic object.”

Comment 3: To improve the readability of the paper, perhaps you can provide the accuracy rates of the Classifynder neural network for low and high numbers of taxa (line 93).

Response: We modified the paragraph as follows: “While the Classifynder has produced satisfactory classification in previous experiments with 6 different types of taxa (Betula pendula, Dactylis glomerata, Cupressus macrocarpa, Ligustrum lucidum, Acacia dealbata and Pinus radiata) in [21], performance of the neural network classifier declines with 15 different types of taxa (Acacia Ramoissima, Atriplex Paludosa, Asteraceae, Casuarina Littoralis, Disphyma, Dracophyllum, Euphorbia Hirta, Eucalyptus Fasciculosa, Isoetes Pusilla, Myrsine, Nothofagus Cunninghamii, Nothofagus Discoidea, Olearia Algida, Phyllocladus Aspleniifolius) in [16]”.

Comment 4: Figure 8 is discussed in the text after Figure 1. Should then Figure 8 be Figure 2?

Response: Indeed, we have a problem with this, as the data corresponding to the number of images per taxon are included in Figure 8 (and implicitly in Figure 7), both in the Results section, but are mentioned earlier in the text. One alternative is to include in Materials and Methods a new table with the number of images per taxon, but we believe this would take an amount of space not justified by its interest. Thus, we decided to keep the reference, explaining in the paper the reason for this inconsistency and pointing to the page on which Figure 8 is displayed. If the editor or the reviewer considers that including this new table is a better solution, we can easily do so (it’s already coded in the source files).

Comment 5: It is not clear what the reader can learn from Figure 2 (line 183). Perhaps the authors could explain in greater depth what is in the figure.

Response: We added the following explanation for these filters in the Deep learning convolutional neural network section: “These filters allow deep layers to detect more complex patterns. The deeper layers can learn higher level combinations of features learned by the previous layers.”
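As a purely generic illustration of that idea (this is not the network used in the paper, and the channel sizes are arbitrary), stacking convolutional layers is what allows deeper layers to combine the simple features detected by earlier ones:

    # Generic sketch only - not the authors' architecture; layer sizes are illustrative.
    import torch.nn as nn

    feature_extractor = nn.Sequential(
        nn.Conv2d(3, 16, kernel_size=3, padding=1), nn.ReLU(),   # low-level edges and blobs
        nn.MaxPool2d(2),
        nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),  # local textures and motifs
        nn.MaxPool2d(2),
        nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(),  # larger-scale shape cues
    )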

Comment 6: The authors assume that layer 5 (Figure 4) learns about complex characteristics of the pollen type. Can the authors provide some insight on what these characteristics could be from the activations shown in Figure 4?

Response: We added an example of what the layer learns from positive activations. “Considering the whiter areas as positive activations, we can observe, for example, how the layer learns about the external shape of the grain.”

Comment 7: Does the number of images used (19,500) include those produced by data augmentation, or is 19,500 the number of original images?

Response: The 19,500 images constitute the original image dataset to which we apply the data augmentation techniques described in “Materials and methods”, section “Pollen image set”, as follows: “The original image dataset we use comprises a total of 19,500 images, from 46 different pollen types, representing 37 different families. This was the total number of taxa for which suitable Classifynder datasets were available at the time our research commenced. Data augmentation is applied on this original dataset.”

Comment 8: Can you clarify how the rotation of the original images increases the features presented to the neural network? What is the added value of rotating an image? Wouldn’t rotation just create redundant data?

Response: Rotating the images is a common technique in computer vision, used to balance datasets with very different numbers of images per class and, at the same time, to help the network identify features in different locations.

We added the following text in the Image preprocessing and augmentation section to clarify this: “This technique not only balances the number of images among the different classes but also provides a wider variety of spatial features. By rotating the images, relevant features can be identified in different locations, helping to provide spatial invariance. The result is equivalent to obtaining samples in different orientations under the microscope.”
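In code terms, the idea reduces to something like the following minimal sketch. It is illustrative only: the directory layout, file format, rotation angles and function name are assumptions, not the authors’ actual pipeline.

    # Illustrative sketch of rotation-based augmentation; paths, format and angles are assumed.
    from pathlib import Path
    from PIL import Image

    def augment_by_rotation(src_dir, dst_dir, angles=(90, 180, 270)):
        """Write rotated copies of every image so under-represented classes gain samples."""
        src, dst = Path(src_dir), Path(dst_dir)
        for img_path in src.rglob("*.png"):            # one subfolder per pollen type assumed
            out_dir = dst / img_path.parent.relative_to(src)
            out_dir.mkdir(parents=True, exist_ok=True)
            with Image.open(img_path) as img:
                img.save(out_dir / img_path.name)       # keep the original orientation
                for angle in angles:
                    # expand=True keeps the whole grain inside the rotated frame
                    rotated = img.rotate(angle, expand=True)
                    rotated.save(out_dir / f"{img_path.stem}_rot{angle}{img_path.suffix}")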

Comment 9: How were the training and validation sets constructed? Since the classification performance depends on the number of images available (and therefore, on the pollen type, Figure 8), the overall performance reported in Table 1 may depend critically on how the sets were selected.

Response: We have included the following text in the “Experimental design” section to explain the cross-validation process: “As we can see in this figure, the images are divided into two subsets, training and validation, at 90% and 10% respectively. This division is repeated ten times (folds), guaranteeing that all images have been present in both sets, training and validation. The data augmentation process is performed on the training set, once the validation images have been separated, to avoid bias in the model.”
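A minimal sketch of that protocol follows. It is illustrative only: augment_fn, train_fn and eval_fn are hypothetical placeholders for the real pipeline, and the use of stratified folds is an assumption rather than a detail taken from the paper.

    # Sketch of 10-fold cross-validation with augmentation applied after the split.
    # augment_fn, train_fn and eval_fn are hypothetical placeholders.
    import numpy as np
    from sklearn.model_selection import StratifiedKFold

    def cross_validate(images, labels, augment_fn, train_fn, eval_fn, n_folds=10, seed=0):
        skf = StratifiedKFold(n_splits=n_folds, shuffle=True, random_state=seed)
        scores = []
        for train_idx, val_idx in skf.split(images, labels):
            x_train, y_train = images[train_idx], labels[train_idx]
            x_val, y_val = images[val_idx], labels[val_idx]
            # Augment only the training fold, after the validation images are set aside,
            # so no augmented copy of a validation image leaks into training.
            x_train, y_train = augment_fn(x_train, y_train)
            model = train_fn(x_train, y_train)
            scores.append(eval_fn(model, x_val, y_val))
        return float(np.mean(scores)), float(np.std(scores))

Each of the ten folds reserves 10% of the images for validation, so every image is validated exactly once and the reported performance averages over all folds.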

Attachment

Submitted filename: ResponseToReviewers_R1.pdf

Decision Letter 1

Mudassar Raza

20 May 2020

Precise automatic classification of 46 different pollen types with convolutional neural networks

PONE-D-20-04167R1

Dear Dr. Aznarte,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

Mudassar Raza, Ph.D.

Academic Editor

PLOS ONE

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I definitely accept the authors response and the changes they have introduced, accordingly.

I have only one subtle comment, which I think should be left to the authors to decide about.

In Fig. 7 the graphical summaries of the resulting statistical distributions are presented, and this seems to be one of the most vital parts of the material. It therefore appears to be something of a waste of space that only the first plot determines the range (along the F1 axis) of the whole graph. Consequently, some of the plots lose their clarity or are simply unreadable. Could the authors think of some way of conveying that the whisker of the first plot extends beyond the graph, while adapting the range so that it covers the remaining plots only – obviously resulting in the relevant extension of all box plots?

I do not have any more comments and regard the manuscript as certainly good enough to be accepted.

Reviewer #3: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Przemysław Korohoda, AGH University of Science and Technology, Kraków, Poland

Reviewer #3: No

Acceptance letter

Mudassar Raza

10 Jun 2020

PONE-D-20-04167R1

Precise automatic classification of 46 different pollen types with convolutional neural networks

Dear Dr. Aznarte:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Mudassar Raza

Academic Editor

PLOS ONE

