Raman Spectroscopy in Open-World Learning Settings Using the Objectosphere Approach

Yaroslav Balytskyi; Justin Bendesky; Tristan Paul; Guy M Hagen; Kelly McNear

doi:10.1021/acs.analchem.2c02666

. Author manuscript; available in PMC: 2022 Dec 7.

Published in final edited form as: Anal Chem. 2022 Oct 24;94(44):15297–15306. doi: 10.1021/acs.analchem.2c02666

Raman Spectroscopy in Open-World Learning Settings Using the Objectosphere Approach

Yaroslav Balytskyi ¹, Justin Bendesky ², Tristan Paul ³, Guy M Hagen ⁴, Kelly McNear ⁵

PMCID: PMC9728505 NIHMSID: NIHMS1852771 PMID: 36279588

Abstract

Raman spectroscopy, combined with machine learning techniques, holds great promise for many applications as a rapid, sensitive, and label-free identification method. Such approaches perform well when classifying spectra of chemical species that were encountered during the training phase. That is, species that are known to the neural network. However, in real-world settings, such as in clinical applications, there will always be substances whose spectra have not yet been taken. When the neural network encounters these new species during the testing phase, the number of false positives becomes uncontrollable, limiting the usefulness of these techniques, especially in public safety applications. To overcome these barriers, we implemented the recently introduced Entropic Open Set and Objectosphere loss functions. To demonstrate the efficacy and efficiency of this approach, we compiled a database of hyperspectral Raman images of 40 chemical species separating them into three class categorizations. The known class consisted of 20 biologically relevant species comprising amino acids, the ignored class was 10 “irrelevant” species comprising bio-related chemicals, and the never seen before class was 10 various chemical species that the neural network had not seen before. We show that this approach not only enables the network to effectively separate the unknown species while preserving high accuracy on the known ones and reducing false positives but also performs better than the current gold standards in machine learning techniques. This opens the door to using Raman spectroscopy, combined with our novel machine learning algorithm, in a variety of practical applications. Availability and implementation: freely available on the web at https://github.com/BalytskyiJaroslaw/RamanOpenSet.git.

Graphical Abstract

graphic file with name nihms-1852771-f0001.jpg

INTRODUCTION

Raman spectroscopy was independently developed in 1928 by Raman¹ and Landsberg,² and further supplemented by the usage of laser equipped spectrometers.^3,4 This method is based on light scattering and its interaction with the chemical bonds of the material of interest and provides fingerprint-like spectrum of a given sample. The properties of Raman spectroscopy, combined with its sensitive and non-destructive nature, make it a reliable and universal tool for material analysis suited for both the laboratory as well as field experiments.⁵ There are a number of applications of Raman spectroscopy in different fields, see⁶ for a comprehensive review.

In brief, Raman spectroscopy enabled a detailed characterization of carbonaceous materials^7,8 and has a number of applications in bioanalytics⁹ and in the diagnostics of the cultural heritage.¹⁰ Raman analysis already has large-scale industrial applications including in the food¹¹ and textile industries.¹² Raman spectroscopy also serves as a powerful tool for planet exploration. In particular, the Perseverance rover has two Raman devices installed, SHERLOC (Scanning Environments with Raman and Luminescence for Organics and Chemicals), and SuperCam.¹³

Raman spectroscopy is an especially promising candidate for applications where public safety is a concern such as testing water quality,^14,15 identifying agricultural contaminants,^16,17 and in clinical settings for bacterial pathogen identification.¹⁸ Growing concern regarding contamination from chemicals such as per- and polyfluoroalkyl substances (PFAS)¹⁹ and lead²⁰ in food and water as well as the rise of antibiotic-resistant infections²¹ in our society requires efficient interventions in order to help mitigate public health crises. In these applications, there is a large amount of data or chemical species that need to be identified, which can be a challenge in the case of unknowns or samples with multiple chemicals present.

Raman spectroscopy has significant promise to meet this challenge as it is a sensitive and non-destructive probe providing a unique, fingerprint-like spectrum of a sample,^22,23 but the development of robust and reliable quantitative methods of the data analysis is needed.²⁴ Not only this, but there are often large amounts of data which are acquired, making it difficult and impractical to hard-code all of the rules of classification and analysis. Therefore, some form of machine learning (ML) is needed to ensure this process is practical and efficient.

As a note, we adopt the definition of ML from²⁵ as “the effort to automate intellectual tasks normally performed by humans”. Rather than being hard-coded, the ML system finds appropriate representations of the data which allows determining the rules of classification. “Deep learning (DL) is a specific subfield of machine learning: a new take on learning representations from data that puts an emphasis on learning successive layers of increasingly meaningful representations”.²⁵ For our purposes, these layered representations are learned by the models called neural networks (NNs).

In the definitions of,²⁵ in contrast to the DL, shallow learning techniques only learn one or two consecutive representations of the data. One model of such kind, the principal component analysis (PCA), in combination with linear discriminant analysis (LDA) was applied to Raman spectra analysis.^26–30 Older DL methods have been shown to successfully distinguish between the spectra of different molecules^31–36 and do so more effectively than the typical linear regression methods for data analysis of Raman spectra.³⁷ However, attempts to boost the accuracy by naively stacking extra layers and making the models deeper are limited by the vanishing gradient problem.³⁸ The ResNet architecture overcomes this problem and boosts the accuracy even further by introducing skip connections between the layers.³⁹ Current state of the art deep learning algorithms for Raman spectroscopy favor the ResNet architecture because it can maintain high performance at a lower complexity than its competitors. Using these advances, Raman spectroscopy combined with ML has shown significant promise for real-world applications as a rapid, label-free identification method.¹⁸

However, a significant limitation of the aforementioned NNs is that they operate in “closed-world” settings, where they are only tested on the species on which they were initially trained. As a result, if the inputs for the NN include new, never seen before species, the NN’s behavior becomes ill-defined and will lead to errors and misclassifications. Simply put, if the NN is trained to identify either “cats” or “dogs” but sees a new sample of “fish” during testing, the false-positive rate becomes uncontrollable and the NN will misclassify “fish” as either “cat” or “dog”.

When we consider real-world conditions and practical applications, it is impossible to collect data on all possible chemical or pathogenic species, so there will always be unknowns. To avoid false-positives and misclassifications, the ideal NN should be able to not only accurately identify the samples of the known classes, but should also be able to handle any unknowns without misclassification. That is, it must have the ability to operate in an “open-world” setting and be able to manage new, never-before-seen species. Overcoming the limitation of the “closed-world” setting is the key to the adaptation of NN architectures into routine practices in different applications.

In this article, we implement ideas from Entropic Open-Set and Objectosphere approaches⁴⁰ and combine them with the ResNet26 architecture to show, for the first time, that the ResNet architecture can be modified to accurately and reliably reject samples it has not seen before while remaining highly accurate in its identification of known classes. This new, combined architecture significantly outperforms the naive approaches such as the thresholding softmax score^41–43 and background class methods.⁴¹ The general idea of implementing these approaches is to teach the NN to ignore the features of the irrelevant classes and only focus on the features of the relevant classes (i.e., identify classes that it knows while saying “I don’t know” when presented with unknowns).

To demonstrate the efficiency of our proof-of-concept approach, we compiled a database of spectra of 40 chemical species and separated them into distinct categories. Twenty of them are amino acids, the building blocks of proteins, and we test the classification accuracy of the NN on them. The remaining 20 species were divided into two groups of 10 each, and with their help, the rate of false positives for species not previously encountered in the “open-world” was tested.

The spectra of the 20 amino acids were categorized as a “known” (K) class as they are the class of interest, 10 species comprising biologically relevant chemicals were categorized as an “ignored” (I) class as they are known, but not of interest, and the last 10 species are other various chemicals that are classified as a “never seen before” (N) class. The latter classes are unique in that the NN was not previously trained on them and sees them for the first time during the testing stage. During the training stage, the NN is taught to be highly accurate on the K class and separate them from the I class. During the test stage, we test its accuracy on K as well as determine the rate number of false positives upon the introduction of N.

METHODS

The 20 amino acids were purchased from Carolina Biological Supply Company (Burlington, NC). All other chemicals were purchased from either Alfa Aesar or Sigma-Aldrich. All samples were measured in their crystalline form on glass-bottomed petri dishes. Raman maps were collected on a WITec alpha300 microscope in the confocal Raman mode, schematically shown in Figure 1. The excitation source was a 75 mW, 532 nm green laser, focused with a Zeiss EC-Epiplan 20 × 0.4 NA objective. A WITec UHTS 300 spectrometer with a 600 groove/mm grating was used to record the spectra. The detector was an Andor DV401-BV spectroscopic CCD. This spectrometer has a resolution of 3 cm⁻¹ or 0.009 nm. Raman maps were collected over an area of 50 μm by 50 μm. This area was divided into 100 × 100 pixels, each containing a Raman spectrum collected with an integration time of 0.1 s.

Figure 1. — Schematic drawing of our experimental setup. The incident 532 nm green laser interacts with the sample on a glass slide and the scattered light (red) is collected by the detector.

DATA PREPARATION AND MACHINE LEARNING

Of the collected spectra, 5/6 were used for training (2500 spectra per chemical species) and 1/6 were used for testing (500 spectra per chemical species). Training and testing data were collected separately from one another to ensure appropriate variation in the spectra. Before feeding the data into the NN, the spectra were processed such that the intensity was normalized between 0 and 1.0 and the Rayleigh peak from the laser was removed, as shown in a representative image in Figure 2. Once the spectra were recorded, they were categorized as described above into either “known”, “ignored”, or “never seen before”, further detailed in Table 1 below.

Figure 2. — Representative spectra for (a) raw and (b) normalized lysine data.

Table 1.

Table of Chemical Names for the Known (K), Ignored (I), and Never Seen Before (N) Class Identifications

class	chemical Name
K	dl-alanine, dl-aspartic acid, dl-isoleucine, dl-leucine, dl-methionine, dl-phenylalanine, dl-serine, dl-threonine, dl-tyrosine, dl-valine, glycine, l-arginine, l-asparagine, l-cystine, l-glutamic acid, l-histidine, l-lysine, l(+)-cysteine, l-proline, l-tryptophan
I	anthrone, beta estradiol, chloroquine, fluconazole, laminarin, lauric acid, MOPS, methyl viologen, progesterone, uridine
N	ampicillin, CHAPS, D(+) maltose monohydrate, forskolin, 1-Decanesulfonic acid, polyvinyl pyrrolidone, potato starch, sodium deoxycholate, sodium dodecylsulfate, silver nitrate

Open in a new tab

In the “open-world” settings, the N class is not seen during the training stage and instead appears for the first time during the testing stage. However, to ensure that our NN performed well in the “closed-world” setting, before going to the “open world”, we first exposed it to all classes, K, I, and N. The conventional ResNet26 architecture shown in Figure 3 was run five times to help boost the accuracy of the network. While a few misclassifications may occur during each separate run, we can average the predictions (e.g., majority voting), and boost the accuracy to 99.99%, as shown in Figure 4. The accuracy outputs for each of the five runs as well as the majority vote accuracy for the “closed-world” ResNet26 settings are as follows

{\begin{array}{l} accuracy [1] = 98.72 % \\ accuracy [2] = 99.28 % \\ accuracy [3] = 99.06 % \\ accuracy [4] = 99.98 % \\ accuracy [5] = 97.49 % \end{array} \Rightarrow \underset{prediction ensemble = 1 / 5 \sum_{i = 1}^{5} prediction [i]}{\underset{︸}{final accuracy = 99.99 %}}

Figure 3. — Schematic representation of the ResNet26 architecture. The processed spectra are fed into the NN in order to output the classification probabilities.

Figure 4. — Correlation table after the majority vote of five runs is taken. Accuracy = 99.99%.

Next, before delving into variations on the ResNet architecture, we tested the accuracy of a more conventional method of spectra identification, PCA-LDA as used in.^26–30 PCA-LDA is a shallow method (as opposed to the deep methods we’ll use later), meaning that it does not examine deep and abstract features of the data. Moreover, this method only works in “closed-world” settings. On our data set, the accuracy, on average, of PCA-LDA was

{accuracy}_{PCA - LDA} = 84.88 \pm 3.30 %

When comparing the accuracy of this “closed-world” method to the accuracy of our “closed-world” ResNet architecture, we note that ResNet performs much better. It should also be noted that PCA-LDA will only work in “closed-world” settings and cannot be adapted to the “open-world”. Therefore, in our further considerations, we focus solely on the ResNet architecture.

As previously discussed, there will always be the possibility of encountering unknowns in real-world settings. Though the NN performs well and with high accuracy in “closed-world” settings, it is not entirely realistic and limits its applications in clinical settings. Here, we explore various “open-world” approaches to demonstrate how well the NN performs when presented with new, never before seen chemical species.

We focused on K, chose to ignore I, and presented N only during the testing phase, effectively forcing the NN to expect the unexpected while maintaining high accuracy on the known classes. In the following sections, we investigate four different “open-world” approaches: (1) background class, (2) thresholding of the softmax score,^41–43 (3) Entropic Open Set, and (4) Objectosphere,⁴⁰ and demonstrate the advantages and limitations of each approach. We show that while all these approaches are highly accurate on the known class, with an average accuracy = 99.97 ± 0.02%, the Objectosphere approach is much more efficient in its handling of the N class and is thus better prepared for any unexpected inputs the NN may encounter.

RESULTS

Background Class Approaches and Thresholding the Softmax Score.

The NN has an initial input, x, which, in our case, corresponds to the values of the intensities found in the normalized spectra. The output corresponding to each chemical species is defined as, l_c(x). The deep feature of the NN, F(x), which is the output of the second to last layer of the NN, consists of the outputs of the convolution’s blocks, and the final output is obtained by multiplication by the weights W to the last layer

l_{c} (x) = W \cdot F (x)

(1)

The final probability of which chemical species, c, that a particular input will belong to is calculated by the “softmaxing” procedure

S_{c} (x) = \frac{e^{l_{c} (x)}}{\sum_{c} e^{l_{c} (x)}}

(2)

which lies in the range S_c(x)∈[0,1], and ∑_cS_c(x) = 1 and therefore is interpreted as a probability. In the case where the input sample belongs to the K class, it should be classified as whichever chemical species has the highest softmax score in eq 2. Cases in which the input corresponds to the I and N are described further in the text.

The output of the NN is high-dimensional, between 20 and 40 dimensions in our case, so to provide an illustration of the separability of the known and unknown classes, we use the Uniform Manifold Approximation and Projection (UMAP) dimensionality reduction technique.⁴⁴ This allows for the high-dimensional output of the NN to be visualized in a two-dimensional way, showing a clear overlap or separation between the classes. In all our runs, we used the default parameters of the algorithm (i.e., n_neighbors = 15, min_dist = 0.1). In this approach, n_neighbors limits the number of local neighbors that the algorithm considers while trying to determine the manifold structure of the data, and the min_dist parameter is a minimum distance between points that is allowed in the low-dimensional representation of the data. In a 2D plane, the different species correspond to different colors and if the NN is highly accurate, we should see distinct groupings of colors with no overlap between species (i.e., if there is overlap, there are mistakes and misclassifications). As an example, we show the UMAP output of the 40 species in the “closed-world” setting in Figure 5. Here, each of the 40 species corresponds to a color in the gradient shown in the legend to the right of the UMAP and the separation between the colors as well as the fact that each color is concentrated in a particular area of a 2D space rather than being scattered over all the plane is a good indication that the NN is classifying correctly. As we shift to the various “open-world” settings, we can use the UMAP technique to quickly visualize the efficiency of the NN. For these tests, the K class corresponds to the red to blue gradient and new, never seen before class corresponds to the violet dots on the UMAP diagrams.

Figure 5. — UMAP of 40 species in the closed-world setting.

When unknown, never seen before species are present, one naive approach is to include a trash or background class which corresponds to the background class method. Here, rather than having the network try to classify data that are far from the known class or it has not seen before, they will be categorized as “background” or “trash” class. For this approach, we implemented the following condition. The amino acid number i∈[0,C-1], and the {I,N} classes are encoded as

(amino acid) [i] = \underset{length = C + 1}{\underset{︸}{[0, \dots, \underset{i th position}{\underset{︸}{1}}, \dots, 0]}}

(3)

{I, N} \to \underset{length = C + 1}{\underset{︸}{[0, \dots, \underset{20th position}{\underset{︸}{1}}]}}

(4)

where C = 20—number of chemical species of interest (amino acids).

In this case, the output of the NN is the probability of the chemical species being identified as either a “known” or “ignored”/“never seen before”. Therefore, if the output is in the 20th position (as shown in eq 4), then it will be classified as “trash” or “background” otherwise, it will be classified as “known”. Although this scheme may seem effective, putting 10 separate chemicals in the category of “background” results in the NN needing to identify a large number of features as “background” which interferes with its efficiency.

Another naive approach we investigated was to threshold the softmax score. The softmax score, obtained by eq 2, converts a vector of numbers into a vector of probabilities. By implementing a threshold, the NN will essentially give a percentage corresponding to how certain it is that the data belong to a certain chemical species. However, this technique assumes that the probabilities of the K and {I,N} classes are sufficiently separated in the feature space.

To put this another way, it is assumed that Shannon’s entropy⁴⁵ of the NN’s output on the {I,N} classes is close to the maximal value log₂ (C)

{I, N} \to \underset{length = C}{\underset{︸}{[\frac{1}{C}, \dots, \frac{1}{C}]}}

(5)

while the entropy of the output of K class is close to zero. Assuming that this condition is fulfilled, one can introduce a cutoff (Λ), and if the maximum value of the softmax score is lower than the cutoff value, the input is classified as the new, never seen before class, N.

Though both the background class and thresholding of the softmax score methods achieve high accuracy on the K alone, they work poorly upon the introduction of the N class and the accuracy drops significantly. In both cases, overlap and mixing occurs, signifying mistakes and misclassifications by the NN, as seen in Figure 6. Furthermore, as illustrated in Figure 7, even in a case of introducing a high cutoff value of Λ = 0.99 (corresponding to the NN being confident by 99%), the false positive (FP) rate is 10.0%. In fact, regarding the K class, in 7.0% of the cases, the NN produces an inconclusive output, that is, classifies K as part of the unknown, N class.

Figure 6. — UMAP diagrams of (a) the background class and (b) the thresholding the softmax score methods. Class N (violet) is scattered over a large part of the diagram, mixing the known and the unknown.

Figure 7. — Even at the large cutoff value Λ = 0.99, the FP = 10.0% and the number of inconclusive outcomes on K class is 7.0%.

Though the absolute values of the deep features ‖F‖ of the known class tends to be slightly higher than those of ignored and never before seen classes, as shown in Figure 8, the amount of overlap between the three classes in the UMAP diagram indicates that the NN has difficulty distinguishing between K and N. To overcome these limitations, we investigated the Entropic Open Set and Objectosphere approaches and demonstrate their efficiency in the next section. These methods implement tuning parameters and numerical approaches aimed at increasing the separation between the probability distributions of the K and N classes.

Figure 8. — Absolute value of the deep feature ‖F‖ of K class tends to be higher than those of N and I.

Entropic Open Set and Objectosphere Methods.

Unlike the naive approaches which assume that the NN already produces sufficiently high entropy on the I and N classes, Entropic Open Set and Objectosphere approaches implement loss functions as a means to increase entropy on the I (i.e., ignoring it) and concentrate on the K. That is, by numerically changing the coefficients in the loss functions, the user can achieve greater separation between the probabilities rather than assuming the classes are already sufficiently separated (such as the case of thresholding the softmax score method). The Entropic Open Set loss function⁴⁰ is defined as

V_{E} (x) = {\begin{array}{l} - log (S_{c} (x)), if x \in K \\ - \frac{1}{C} \sum_{c = 1}^{C} log (S_{c} (x)), if x \in I \end{array}

(6)

where C = 20 is the number of chemical species of interest (amino acids). Note, for the K class, it reduces to a regular categorical cross entropy loss function. The N class is not involved here because the NN is not aware of it until the testing stage. For the samples belonging to I class, V_E(x) aims to maximize the entropy and uniformly spread the NN’s output over the known class K.

The Objectosphere loss function, on the other hand, involves the addition of the deep feature F(x) parameter. This allows for minimization of its absolute value for the ignored class and maximization of its absolute value for the known one, resulting in sufficient separation of the K and {I,N} classes

V_{O} (x) = V_{E} (x) + α {\begin{array}{l} max (β - {‖ F (x) ‖}^{2}, 0), if x \in K \\ {‖ F (x) ‖}^{2}, if x \in I \end{array}

(7)

where the ‖·‖ represents a regular Euclidean norm. The values of the α and β coefficients are adjusted numerically by minimizing the number of inconclusive outcomes on the K class. The result of this is that FP = 0% on the I class, and this property is generalized for the N class, even though the NN is not aware of it before the testing phase.

Both the Entropic Open Set and Objectosphere approaches provide a significant improvement over the naive approaches as shown in Figure 9. As shown in Figure 9a, for the Entropic Open Set loss function with the cutoff value Λ_entropic = 0.93 and FP = 0%, the NN has 0% wrong outcomes on the K class. Not only this, but the number of inconclusive outcomes on the known class is 2.37%. Compared to the naive and thresholding softmax approach, this approach reduced the inconclusive outcomes by about 66.1%. For the Objectosphere loss function, shown in Figure 9b, with the cutoff value Λ_{Objectosphere} = 0.91 and FP = 0%, we observe 0%, wrong outcomes on the K class. The number of inconclusive outcomes on the known classes is also drastically reduced to 0.26%, as shown in Figure 9. This shows that the Objectosphere reduces inconclusive outcomes by 89% from the Entropic Open Set loss and 96% from the naive approaches. This is a significant improvement and demonstrates that the Objectosphere approach successfully avoids misclassifications and false positives.

The corresponding UMAPs, shown in Figure 10, visually demonstrate the significant improvement in the treatment of the N class by the NN. The area taken by the N class is significantly reduced in comparison to the naive approaches and one can make a clear distinction between N and K classes. Finally, as demonstrated in Figure 11, the Objectosphere loss function further enhances the separation between the features of the N and K classes, minimizing the incorrect responses from the NN to the new, never seen before inputs.

Figure 11. — Distribution of the deep features of the known (K) and unknown/unexpected inputs (N) by the (a) Entropic Open Set and (b) Objectosphere loss functions. An improved separation of K/N classes for the case of Objectosphere is observed.

CONCLUSIONS AND FUTURE WORK

Raman spectroscopy, in combination with ML architectures, demonstrates significant promise as a rapid, label-free, chemical identification method. It has a strong potential to identify dangerous and toxic contaminants, mitigate public health crises, and save human lives. However, traditional ML architectures applied in this context only work in “closed-world” settings and become ill-defined when presented with unknowns, limiting their practical applications. Through this proof-of-concept work, we have demonstrated that we can overcome this challenge by modifying the ResNet26 architecture to operate in “open-world” settings in order to handle unknowns. Our application of the recently introduced Entropic Open Set and Objectosphere approaches enables the ResNet26 architecture to maintain a high accuracy on the known classes while successfully avoiding mistakes and misclassifications of new, never seen before classes. This method is an important step toward the translation of using Raman spectroscopy combined with ML methods in settings where accurate and efficient identification is of utmost importance. In future work, we plan to explore mixtures of chemical classes and spectra with low signal-to-noise ratios to better mimic real-world settings and further improve the performance of our NN.

ACKNOWLEDGMENTS

This work was supported by the National Institute of General Medical Sciences of the National Institutes of Health [grant number 1R15GM128166-01]. This work was also supported by the UCCS BioFrontiers Center. The funding sources had no involvement in study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the article for publication. This work was supported in part by the U.S. Civilian Research & Development Foundation (CRDF Global). The authors would like to thank Kyle Culhane, Van Hovenga, and Anatoliy Pinchuk for useful discussions.

Footnotes

Complete contact information is available at: https://pubs.acs.org/10.1021/acs.analchem.2c02666

The authors declare no competing financial interest.

Contributor Information

Yaroslav Balytskyi, Department of Physics and Energy Science and UCCS BioFrontiers Center, University of Colorado, Colorado Springs, Colorado 80918, United States;.

Justin Bendesky, Department of Chemistry, New York University, New York, New York 10003, United States.

Tristan Paul, Department of Physics and Energy Science and UCCS BioFrontiers Center, University of Colorado, Colorado Springs, Colorado 80918, United States.

Guy M. Hagen, UCCS BioFrontiers Center, University of Colorado, Colorado Springs, Colorado 80918, United States

Kelly McNear, UCCS BioFrontiers Center, University of Colorado, Colorado Springs, Colorado 80918, United States.

CODE AVAILABILITY AND REPRODUCIBILITY OF OUR RESULTS

Our code is publicly available on GitHub.⁴⁶ In order to reproduce our results, one needs to upload these data and code into Google Colab,⁴⁷ connect to the TPU, and follow the instructions in the Jupyter notebooks provided.

REFERENCES

(1).Raman CV Indian J. Phys 1928, 2, 387–398. [Google Scholar]
(2).Landsberg G Naturwissenschaften 1928, 16, 558. [Google Scholar]
(3).Platonenko V; Khokhlov R Sov. Phys. JETP 1964, 19, 378–381. [Google Scholar]
(4).Hendra PJ; Stratton P Chem. Rev 1969, 69, 325–344. [Google Scholar]
(5).Pelletier MJ; et al. Analytical Applications of Raman Spectroscopy; Blackwell science; Oxford, 1999; Vol. 427. [Google Scholar]
(6).Orlando A; Franceschini F; Muscas C; Pidkova S; Bartoli M; Rovere M; Tagliaferro A Chemosensors 2021, 9, 262. [Google Scholar]
(7).Tuinstra F; Koenig JL J. Chem. Phys 1970, 53, 1126–1130. [Google Scholar]
(8).Aqel A; El-Nour KM; Ammar RA; Al-Warthan A Arab. J. Chem 2012, 5, 1–23. [Google Scholar]
(9).Cialla-May D; Krafft C; Rösch P; Deckert-Gaudig T; Frosch T; Jahn IJ; Pahlow S; Stiebing C; Meyer-Zedler T; Bocklitz T; et al. Anal. Chem 2021, 94, 86–119. [DOI] [PubMed] [Google Scholar]
(10).Analytical Methods Committee; et al. Anal. Methods 2015, 7, 4844–4847. [DOI] [PubMed] [Google Scholar]
(11).Yang D; Ying Y Appl. Spectrosc. Rev 2011, 46, 539–560. [Google Scholar]
(12).V R; R V; Sathe U; et al. Sens. Actuators, B 2019, 281, 679–688. [Google Scholar]
(13).Beyssac O Elements: An International Magazine of Mineralogy, Geochemistry, and Petrology 2020, 16, 117–122. [Google Scholar]
(14).Li Z; Deen MJ; Kumar S; Selvaganapathy PR Sensors 2014, 14, 17275–17303. [DOI] [PMC free article] [PubMed] [Google Scholar]
(15).Li Z; Wang J; Li D Appl. Spectrosc. Rev 2016, 51, 333–357. [Google Scholar]
(16).Weng S; Zhu W; Zhang X; Yuan H; Zheng L; Zhao J; Huang L; Han P Recent advances in raman technology with applications in agriculture, food and biosystems: A review 2019, 3, 1–10. [Google Scholar]
(17).Yang Y; Creedon N; O’Riordan A; Lovera P Photonics 2021, 8, 568. [Google Scholar]
(18).Ho C-S; Jean N; Hogan CA; Blackmon L; Jeffrey SS; Holodniy M; Banaei N; Saleh AA; Ermon S; Dionne J Nat. Commun 2019, 10, 4927. [DOI] [PMC free article] [PubMed] [Google Scholar]
(19).Al Amin M; Sobhani Z; Liu Y; Dharmaraja R; Chadalavada S; Naidu R; Chalker JM; Fang C Environ. Technol. Innovat 2020, 19, 100879. [Google Scholar]
(20).Järup L Br. Med. Bull 2003, 68, 167–182. [DOI] [PubMed] [Google Scholar]
(21).Murray CJ; Ikuta KS; Sharara F; Swetschinski L; Aguilar GR; Gray A; Han C; Bisignano C; Rao P; Wool E; et al. Lancet 2022, 399, 629–655.35065702 [Google Scholar]
(22).Ellis DI; Goodacre R Analyst 2006, 131, 875–885. [DOI] [PubMed] [Google Scholar]
(23).Barman I; Dingari NC; Kang JW; Horowitz GL; Dasari RR; Feld MS Anal. Chem 2012, 84, 2474–2482. [DOI] [PMC free article] [PubMed] [Google Scholar]
(24).Rohleder DR; Kocherscheidt G; Gerber K; Kiefer W; Köhler W; Möcks J; Petrich WH J. Biomed. Opt 2005, 10, 031108. [DOI] [PubMed] [Google Scholar]
(25).Chollet F Deep Learning with Python; Simon and Schuster, 2021. [Google Scholar]
(26).Zheng X; Yin L; Lv G; Lv X; Chen C; Wu G Optik 2021, 226, 165687. [Google Scholar]
(27).Liu W; Sun Z; Chen J; Jing C J. Spectrosc 2016, 2016, 1. [Google Scholar]
(28).Houhou R; Rösch P; Popp J; Bocklitz T Anal. Bioanal. Chem 2021, 413, 5633–5644. [DOI] [PMC free article] [PubMed] [Google Scholar]
(29).Song D; Yu F; Chen S; Chen Y; He Q; Zhang Z; Zhang J; Wang S Biomed. Opt Express 2020, 11, 1061–1072. [DOI] [PMC free article] [PubMed] [Google Scholar]
(30).Kongklad G; Chitaree R; Taechalertpaisarn T; Panvisavas N; Nuntawong N Methods Protoc. 2022, 5, 49. [DOI] [PMC free article] [PubMed] [Google Scholar]
(31).Liu Y; Upadhyaya BR; Naghedolfeizi M Appl. Spectrosc 1993, 47, 12–23. [Google Scholar]
(32).Maruthamuthu MK; Raffiee AH; De Oliveira DM; Ardekani AM; Verma MS MicrobiologyOpen 2020, 9, No. e1122. [DOI] [PMC free article] [PubMed] [Google Scholar]
(33).Thrift WJ; Ronaghi S; Samad M; Wei H; Nguyen DG; Cabuslay AS; Groome CE; Santiago PJ; Baldi P; Hochbaum AI; et al. ACS Nano 2020, 14, 15336–15348. [DOI] [PubMed] [Google Scholar]
(34).Lussier F; Thibault V; Charron B; Wallace GQ; Masson J-F Trac. Trends Anal. Chem 2020, 124, 115796. [Google Scholar]
(35).Peiffer-Smadja N; Dellière S; Rodriguez C; Birgand G; Lescure F-X; Fourati S; Ruppé E Clin. Microbiol. Infect 2020, 26, 1300–1309. [DOI] [PubMed] [Google Scholar]
(36).Lu W; Chen X; Wang L; Li H; Fu YV Anal. Chem 2020, 92, 6288–6296. [DOI] [PubMed] [Google Scholar]
(37).Gniadecka M; Wulf H; Mortensen NN; Nielsen OF; Christensen DJ Raman Spectrosc. 1997, 28, 125–129. [Google Scholar]
(38).Hochreiter S; Bengio Y; Frasconi P; Schmidhuber J; et al. Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies; IEEE Press, 2001. [Google Scholar]
(39).He K; Zhang X; Ren S; Sun J Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016; pp 770–778. [Google Scholar]
(40).Dhamija AR; Günther M; Boult TE Reducing network agnostophobia. 2018, arXiv Preprint arXiv:1811.04110. [Google Scholar]
(41).Matan O; Kiang R; Stenard C; Boser B; Denker J; Henderson D; Howard R; Hubbard W; Jackel L; Le Cun Y Handwritten character recognition using neural network architectures. 4th USPS Advanced Technology Conference, 1990; Vol. 2, pp 1003–1011. [Google Scholar]
(42).De Stefano C; Sansone C; Vento M IEEE Trans. Syst. Man Cybern. C Appl. Rev 2000, 30, 84–94. [Google Scholar]
(43).Fumera G; Roli F Support vector machines with embedded reject option. International Workshop on Support Vector Machines; Springer, 2002; pp 68–82. [Google Scholar]
(44).McInnes L; Healy J; Melville J Umap: Uniform manifold approximation and projection for dimension reduction. 2018. arXiv Preprint arXiv:1802.03426. [Google Scholar]
(45).Shannon CE The Bell system technical journal 1948, 27, 379–423. [Google Scholar]
(46).Balytskyi Y Github repository. Available at. https://github.com/BalytskyiJaroslaw/RamanOpenSet.git (accessed on 11 11, 2021).
(47).Bisong E Building Machine Learning and Deep Learning Models on Google Cloud Platform; Springer, 2019. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

[R1] (1).Raman CV Indian J. Phys 1928, 2, 387–398. [Google Scholar]

[R2] (2).Landsberg G Naturwissenschaften 1928, 16, 558. [Google Scholar]

[R3] (3).Platonenko V; Khokhlov R Sov. Phys. JETP 1964, 19, 378–381. [Google Scholar]

[R4] (4).Hendra PJ; Stratton P Chem. Rev 1969, 69, 325–344. [Google Scholar]

[R5] (5).Pelletier MJ; et al. Analytical Applications of Raman Spectroscopy; Blackwell science; Oxford, 1999; Vol. 427. [Google Scholar]

[R6] (6).Orlando A; Franceschini F; Muscas C; Pidkova S; Bartoli M; Rovere M; Tagliaferro A Chemosensors 2021, 9, 262. [Google Scholar]

[R7] (7).Tuinstra F; Koenig JL J. Chem. Phys 1970, 53, 1126–1130. [Google Scholar]

[R8] (8).Aqel A; El-Nour KM; Ammar RA; Al-Warthan A Arab. J. Chem 2012, 5, 1–23. [Google Scholar]

[R9] (9).Cialla-May D; Krafft C; Rösch P; Deckert-Gaudig T; Frosch T; Jahn IJ; Pahlow S; Stiebing C; Meyer-Zedler T; Bocklitz T; et al. Anal. Chem 2021, 94, 86–119. [DOI] [PubMed] [Google Scholar]

[R10] (10).Analytical Methods Committee; et al. Anal. Methods 2015, 7, 4844–4847. [DOI] [PubMed] [Google Scholar]

[R11] (11).Yang D; Ying Y Appl. Spectrosc. Rev 2011, 46, 539–560. [Google Scholar]

[R12] (12).V R; R V; Sathe U; et al. Sens. Actuators, B 2019, 281, 679–688. [Google Scholar]

[R13] (13).Beyssac O Elements: An International Magazine of Mineralogy, Geochemistry, and Petrology 2020, 16, 117–122. [Google Scholar]

[R14] (14).Li Z; Deen MJ; Kumar S; Selvaganapathy PR Sensors 2014, 14, 17275–17303. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] (15).Li Z; Wang J; Li D Appl. Spectrosc. Rev 2016, 51, 333–357. [Google Scholar]

[R16] (16).Weng S; Zhu W; Zhang X; Yuan H; Zheng L; Zhao J; Huang L; Han P Recent advances in raman technology with applications in agriculture, food and biosystems: A review 2019, 3, 1–10. [Google Scholar]

[R17] (17).Yang Y; Creedon N; O’Riordan A; Lovera P Photonics 2021, 8, 568. [Google Scholar]

[R18] (18).Ho C-S; Jean N; Hogan CA; Blackmon L; Jeffrey SS; Holodniy M; Banaei N; Saleh AA; Ermon S; Dionne J Nat. Commun 2019, 10, 4927. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] (19).Al Amin M; Sobhani Z; Liu Y; Dharmaraja R; Chadalavada S; Naidu R; Chalker JM; Fang C Environ. Technol. Innovat 2020, 19, 100879. [Google Scholar]

[R20] (20).Järup L Br. Med. Bull 2003, 68, 167–182. [DOI] [PubMed] [Google Scholar]

[R21] (21).Murray CJ; Ikuta KS; Sharara F; Swetschinski L; Aguilar GR; Gray A; Han C; Bisignano C; Rao P; Wool E; et al. Lancet 2022, 399, 629–655.35065702 [Google Scholar]

[R22] (22).Ellis DI; Goodacre R Analyst 2006, 131, 875–885. [DOI] [PubMed] [Google Scholar]

[R23] (23).Barman I; Dingari NC; Kang JW; Horowitz GL; Dasari RR; Feld MS Anal. Chem 2012, 84, 2474–2482. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] (24).Rohleder DR; Kocherscheidt G; Gerber K; Kiefer W; Köhler W; Möcks J; Petrich WH J. Biomed. Opt 2005, 10, 031108. [DOI] [PubMed] [Google Scholar]

[R25] (25).Chollet F Deep Learning with Python; Simon and Schuster, 2021. [Google Scholar]

[R26] (26).Zheng X; Yin L; Lv G; Lv X; Chen C; Wu G Optik 2021, 226, 165687. [Google Scholar]

[R27] (27).Liu W; Sun Z; Chen J; Jing C J. Spectrosc 2016, 2016, 1. [Google Scholar]

[R28] (28).Houhou R; Rösch P; Popp J; Bocklitz T Anal. Bioanal. Chem 2021, 413, 5633–5644. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] (29).Song D; Yu F; Chen S; Chen Y; He Q; Zhang Z; Zhang J; Wang S Biomed. Opt Express 2020, 11, 1061–1072. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] (30).Kongklad G; Chitaree R; Taechalertpaisarn T; Panvisavas N; Nuntawong N Methods Protoc. 2022, 5, 49. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] (31).Liu Y; Upadhyaya BR; Naghedolfeizi M Appl. Spectrosc 1993, 47, 12–23. [Google Scholar]

[R32] (32).Maruthamuthu MK; Raffiee AH; De Oliveira DM; Ardekani AM; Verma MS MicrobiologyOpen 2020, 9, No. e1122. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] (33).Thrift WJ; Ronaghi S; Samad M; Wei H; Nguyen DG; Cabuslay AS; Groome CE; Santiago PJ; Baldi P; Hochbaum AI; et al. ACS Nano 2020, 14, 15336–15348. [DOI] [PubMed] [Google Scholar]

[R34] (34).Lussier F; Thibault V; Charron B; Wallace GQ; Masson J-F Trac. Trends Anal. Chem 2020, 124, 115796. [Google Scholar]

[R35] (35).Peiffer-Smadja N; Dellière S; Rodriguez C; Birgand G; Lescure F-X; Fourati S; Ruppé E Clin. Microbiol. Infect 2020, 26, 1300–1309. [DOI] [PubMed] [Google Scholar]

[R36] (36).Lu W; Chen X; Wang L; Li H; Fu YV Anal. Chem 2020, 92, 6288–6296. [DOI] [PubMed] [Google Scholar]

[R37] (37).Gniadecka M; Wulf H; Mortensen NN; Nielsen OF; Christensen DJ Raman Spectrosc. 1997, 28, 125–129. [Google Scholar]

[R38] (38).Hochreiter S; Bengio Y; Frasconi P; Schmidhuber J; et al. Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies; IEEE Press, 2001. [Google Scholar]

[R39] (39).He K; Zhang X; Ren S; Sun J Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016; pp 770–778. [Google Scholar]

[R40] (40).Dhamija AR; Günther M; Boult TE Reducing network agnostophobia. 2018, arXiv Preprint arXiv:1811.04110. [Google Scholar]

[R41] (41).Matan O; Kiang R; Stenard C; Boser B; Denker J; Henderson D; Howard R; Hubbard W; Jackel L; Le Cun Y Handwritten character recognition using neural network architectures. 4th USPS Advanced Technology Conference, 1990; Vol. 2, pp 1003–1011. [Google Scholar]

[R42] (42).De Stefano C; Sansone C; Vento M IEEE Trans. Syst. Man Cybern. C Appl. Rev 2000, 30, 84–94. [Google Scholar]

[R43] (43).Fumera G; Roli F Support vector machines with embedded reject option. International Workshop on Support Vector Machines; Springer, 2002; pp 68–82. [Google Scholar]

[R44] (44).McInnes L; Healy J; Melville J Umap: Uniform manifold approximation and projection for dimension reduction. 2018. arXiv Preprint arXiv:1802.03426. [Google Scholar]

[R45] (45).Shannon CE The Bell system technical journal 1948, 27, 379–423. [Google Scholar]

[R46] (46).Balytskyi Y Github repository. Available at. https://github.com/BalytskyiJaroslaw/RamanOpenSet.git (accessed on 11 11, 2021).

[R47] (47).Bisong E Building Machine Learning and Deep Learning Models on Google Cloud Platform; Springer, 2019. [Google Scholar]

PERMALINK

Raman Spectroscopy in Open-World Learning Settings Using the Objectosphere Approach

Yaroslav Balytskyi

Justin Bendesky

Tristan Paul

Guy M Hagen

Kelly McNear

Abstract

Graphical Abstract

INTRODUCTION

METHODS

Figure 1.

DATA PREPARATION AND MACHINE LEARNING

Figure 2.

Table 1.

Figure 3.

Figure 4.

RESULTS

Background Class Approaches and Thresholding the Softmax Score.

Figure 5.

Figure 6.

Figure 7.

Figure 8.

Entropic Open Set and Objectosphere Methods.

Figure 9.

Figure 10.

Figure 11.

CONCLUSIONS AND FUTURE WORK

ACKNOWLEDGMENTS

Footnotes

Contributor Information

CODE AVAILABILITY AND REPRODUCIBILITY OF OUR RESULTS

REFERENCES

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Raman Spectroscopy in Open-World Learning Settings Using the Objectosphere Approach

Yaroslav Balytskyi

Justin Bendesky

Tristan Paul

Guy M Hagen

Kelly McNear

Abstract

Graphical Abstract

INTRODUCTION

METHODS

Figure 1.

DATA PREPARATION AND MACHINE LEARNING

Figure 2.

Table 1.

Figure 3.

Figure 4.

RESULTS

Background Class Approaches and Thresholding the Softmax Score.

Figure 5.

Figure 6.

Figure 7.

Figure 8.

Entropic Open Set and Objectosphere Methods.

Figure 9.

Figure 10.

Figure 11.

CONCLUSIONS AND FUTURE WORK

ACKNOWLEDGMENTS

Footnotes

Contributor Information

CODE AVAILABILITY AND REPRODUCIBILITY OF OUR RESULTS

REFERENCES

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases