Discrimination between Carbapenem-Resistant and Carbapenem-Sensitive Klebsiella pneumoniae Strains through Computational Analysis of Surface-Enhanced Raman Spectra: a Pilot Study

Wei Liu; Jia-Wei Tang; Jing-Wen Lyu; Jun-Jiao Wang; Ya-Cheng Pan; Xin-Yi Shi; Qing-Hua Liu; Xiao Zhang; Bing Gu; Liang Wang

doi:10.1128/spectrum.02409-21

. 2022 Feb 2;10(1):e02409-21. doi: 10.1128/spectrum.02409-21

Discrimination between Carbapenem-Resistant and Carbapenem-Sensitive Klebsiella pneumoniae Strains through Computational Analysis of Surface-Enhanced Raman Spectra: a Pilot Study

Wei Liu ^a,^#, Jia-Wei Tang ^a,^#, Jing-Wen Lyu ^b,^#, Jun-Jiao Wang ^a, Ya-Cheng Pan ^c, Xin-Yi Shi ^c, Qing-Hua Liu ^d, Xiao Zhang ^a,^✉, Bing Gu ^c,^e,^✉, Liang Wang ^a,^✉

Editor: Karen C Carroll^f

Reviewed by: Katarína Rebrošová^g

PMCID: PMC8809336 PMID: 35107359

ABSTRACT

In clinical settings, rapid and accurate diagnosis of antibiotic resistance is essential for the efficient treatment of bacterial infections. Conventional methods for antibiotic resistance testing are time consuming, while molecular methods such as PCR-based testing might not accurately reflect phenotypic resistance. Thus, fast and accurate methods for the analysis of bacterial antibiotic resistance are in high demand for clinical applications. In this pilot study, we isolated 7 carbapenem-sensitive Klebsiella pneumoniae (CSKP) strains and 8 carbapenem-resistant Klebsiella pneumoniae (CRKP) strains from clinical samples. Surface-enhanced Raman spectroscopy (SERS) as a label-free and noninvasive method was employed for discriminating CSKP strains from CRKP strains through computational analysis. Eight supervised machine learning algorithms were applied for sample analysis. According to the results, all supervised machine learning methods could successfully predict carbapenem sensitivity and resistance in K. pneumoniae, with a convolutional neural network (CNN) algorithm on top of all other methods. Taken together, this pilot study confirmed the application potentials of surface-enhanced Raman spectroscopy in fast and accurate discrimination of Klebsiella pneumoniae strains with different antibiotic resistance profiles.

IMPORTANCE With the low-cost, label-free, and nondestructive features, Raman spectroscopy is becoming an attractive technique with great potential to discriminate bacterial infections. In this pilot study, we analyzed surfaced-enhanced Raman spectroscopy (SERS) spectra via supervised machine learning algorithms, through which we confirmed the application potentials of the SERS technique in rapid and accurate discrimination of Klebsiella pneumoniae strains with different antibiotic resistance profiles.

KEYWORDS: Klebsiella pneumoniae, carbapenems, surface-enhanced Raman spectroscopy, machine learning algorithm, antibiotic resistance profile

INTRODUCTION

Many microbial organisms are pathogenic to human beings and are able to cause infectious diseases (1). In addition, drug-resistant bacterial pathogens have been emerging due to the overuse and misuse of antibiotics (2), which leads to difficulty in bacterial control and imposes further threats upon global public health. Thus, fast and accurate detection of antibiotic-resistant bacteria is necessary for clinical treatment of bacterial infection and prevention of bacterial transmission (3). Klebsiella pneumoniae is an encapsulated Gram-negative and facultative anaerobic bacterium that was first described by Edwin Klebs in 1875 and belongs to the Enterobacteriaceae family (4). It is also an opportunistic bacterial pathogen that causes pneumonia-derived sepsis, leading to high morbidity and mortality (5). Due to the rapid dissemination of K. pneumoniae in the hospital environment, it is easy for the bacterial pathogen to cause nosocomial outbreaks (6). In fact, K. pneumoniae is reported to be responsible for around one-third of all Gram-negative infections in the hospital (7), which makes it the second most important opportunistic enterobacterium in nosocomial and community infections just after Escherichia coli (8).

In recent decades, with the increasing abuse of antibiotics in clinical settings, K. pneumoniae shows frequent acquisition of resistance to antibiotics, which makes the nosocomial infections caused by the pathogen particularly problematic (8). For example, extended-spectrum β-lactamases (ESBLs) mediate resistance to broad-spectrum cephalosporins and aztreonam, the coding genes of which are usually found on plasmids and harbored by K. pneumoniae (9). The increasing use of carbapenems has led to the abundant emergence of carbapenem-resistant K. pneumoniae (CRKP) strains (10), while the CRKP strains are defined as being resistant to at least one of the carbapenem agents, including ertapenem, meropenem, and imipenem (11, 12). Currently, there are many mechanisms in K. pneumoniae for carbapenem resistance, but few antimicrobial therapy options exist for infections caused by CRKP (13). Only tigecycline, colistin, and several aminoglycosides show favorable in vitro activities against CRKP, which leads to the emergence of strains with colistin resistance among CRKP strains (7). Due to the difficulty of clinical treatment of CRKP infection, it is therefore important to discriminate CRKP strains from carbapenem-sensitive Klebsiella pneumoniae (CSKP) strains with rapidity, cost-effectiveness, and accuracy, which is essential to instruct the initial antimicrobial use and effective control of the bacterial infection (14).

Surface-enhanced Raman spectroscopy (SERS) is a nondestructive chemical analysis technique that could improve the weak signals of regular Raman spectroscopy through interactions between sample molecules and surface plasmons of nanoscale-structured metal particles (15). In particular, signal-enhancing metal nanostructures, such as silver (Ag), copper (Cu), and gold (Au), can generate a plasmon resonance electromagnetic enhancement of the stimulating light, which could greatly increase the signal level of Raman spectroscopy up to several orders of magnitude (16). However, due to the complexity of Raman spectra, traditional linear analysis is not sufficient for the data-processing procedures, while machine learning (ML) algorithms are capable of extracting important features from the sophisticated SERS spectral data sets (15, 16). Thus, SERS provides a great potential for fast and sensitive microbial detection and identification with the assistance of appropriate ML algorithms (17). At present, few studies have applied and compared machine learning methods in terms of SERS spectral analysis in order to distinguish between CSKP and CRKP strains. In this pilot study, we isolated 7 CSKP and 8 CRKP strains from clinical samples, and their SERS spectra were analyzed via eight supervised machine learning algorithms. Among these algorithms, CNN achieved high-level accuracy in predicting CSKP and CRKP strains, with area under the curve (AUC) values reaching to 99.57% and 5-fold cross validation reaching to 99.78%. Taken together, this study showed that SERS spectra combined with a deep learning algorithm could effectively distinguish CSKP strains from CRKP strains, which reinforced its potential in real-world applications, such as bacterial diagnosis and antibiotic stewardship.

RESULTS

Raman spectra of CRKP and CSKP strains.

(i) Average Raman spectra. Average Raman spectra with standard error bands for CRKP and CSKP strains were generated through calculating the means of signal intensities at corresponding Raman shifts via both biological and technical repeats (Fig. 1). Although spectral profiles for the two K. pneumoniae groups were similar, different Raman intensities and characteristic peaks were identified, which suggested differences in biochemistry and could be used to discriminate strains of CRKP and CSKP. In addition, the standard error bands (shaded region) quantitatively reflected the good reproducibility of Raman spectra for CRKP and CSKP strains, respectively. In order to evaluate the repeatability of Raman spectra, an average Raman spectrum with a standard error band was also generated for each K. pneumoniae strain, together with the distributions of characteristic peaks in a dot matrix plot (Fig. S2 in the supplemental material), according to which Raman spectra were well repeated for each strain.

FIG 1 — Demonstration of the average SERS spectra for CSKP (N = 280) and CRKP (N = 280). Each average Raman spectrum was generated from multiple Raman spectra by calculating the mean Raman intensity at corresponding Raman shift. The characteristic peaks of each average Raman spectrum were marked with vertical black arrows. Different Raman spectra had their own combination of characteristic peaks. The shaded band of each Raman spectrum represents 20% standard error. The x axis shows the Raman shifts from 519.56 cm⁻¹ to 1,800.81 cm⁻¹, while the y axis shows the Raman spectral intensity in artificial units (a.u.).

Meanwhile, we also checked Raman spectra repeatability by calculating the average Raman spectrum and standard error band for each Klebsiella pneumoniae strain, which showed that repeatability of Raman spectra was well maintained (Fig. S2A and B).

(ii) Characteristic peaks. Different bacteria have their own combinations of characteristic peaks in Raman spectra due to their unique chemical compositions, which could be used to distinguish them from each other at different taxonomic levels, such as species and subspecies, etc. (18). However, raw Raman spectral data are not suitable for the identification of characteristic peaks due to the unwanted signals (noises) in the spectra (19). In order to reduce the influences of noises on the identification of characteristic peaks, we used the Savitzky-Golay (SG) smoothing filter algorithm to smooth the Raman spectral data and reduce noise interference (20). The software LabSpec 6 was then used to identify characteristic peaks with a Gaussian-Lorentzian function (21), which were marked with black arrows along the spectra (Fig. 1). In order to check whether K. pneumoniae strains could be successfully separated solely based on the unique profiles of characteristic peaks into CSKP and CRKP groups, we also performed a principal-component analysis (PCA) for the data, which showed that all the samples could be correctly classified except for two samples, 26-272 and 20-18 (Fig. S3). Thus, characteristic peaks could not be reliably used for the classification of antibiotic resistance phenotypes.

According to previous studies, characteristic peaks in Raman spectra corresponded to different biochemical molecules (22). In addition, the more complex the chemical composition of a bacterium, the richer its Raman spectrum (23). In this study, individual chemical components represented by characteristic peaks were sourced from previous reports in the literature and summarized in Table 1. According to the results, CSKP and CRKP had the same characteristic peaks at 566 cm⁻¹ (guanine/thymine/uridine), 654 cm⁻¹ (guanine), 723/725 cm⁻¹ (nucleic acids), 958 cm⁻¹ (carbon-carbon double bond [C = C]), 1,047 cm⁻¹ (phosphorus-oxygen bond [P-O]), 1,129 cm⁻¹ (CH₂), 1,452/1,455 cm⁻¹ (nitrogen-nitrogen double bond [N = N] aromatic and aliphatic), 1,581/1,582 cm⁻¹ (guanine/adenine), and 1,690 cm⁻¹ (carbon-oxygen double bond [C = O], C = C). For details of the biological meanings and the corresponding references of all the characteristic peaks, please refer to Table 1.

TABLE 1.

Characteristic peaks in the average Raman spectra of CSKP and CRKP strains and the corresponding chemical components

Wavenumber (cm⁻¹)	Band assignment	Reference
566	Guanine/thymine/uridine	40
654	Guanine	41
723/725	Nucleic acids	42
783	Thymine	43
890	Tryptophan	44
915	C-C (carbon–carbon single bond)	45
958	C = C	41
1,047	P-O	46
1,129	CH₂	41
1,209	Phenylalanine/tyrosine	47
1,215	CH	48
1,244	Amide III	49
1,321	Guanine	50
1,328	Adenine ring	51
1,452/1,455	N = N aromatic and aliphatic	52
1,581/1,582	Guanine/adenine	51
1,636	Amide I	50
1,690	C = O, C = C	52
1,898	C = O	52

Open in a new tab

Supervised machine learning algorithms.

(i) Algorithm comparison. The purpose of supervised machine learning analysis is to construct appropriate prediction models for recognizing Raman spectra in between CSKP and CRKP strains. In this study, we compared eight supervised machine learning algorithms in terms of their capacities in predicting Raman spectra of CSKP and CRKP strains, which included convolutional neural network (CNN), gradient boosting (GB), linear discriminant analysis (LDA), k-nearest neighbors (KNN), random forest (RF), adaptive boosting (Adaboost), decision tree (DT), and support vector machine (SVM). The process of a supervised learning algorithm is to divide the data into independent training and test sets. Data in the training set will be labeled first, which will then be trained to obtain an optimal prediction model that will be applied to unlabeled test data and mapped into output results. The performance of each algorithm was measured by four indicators, accuracy (ACC), precision, recall, and F1 (F1 is an overall measure of a model’s accuracy that combines precision and recall, a good value of which indicates low false positives and low false negatives). Cross-validation (CV) is also an efficient method for assessing effectiveness, overfitting, and stability of supervised machine learning models when sample size is small (24). Thus, in this study, we performed 5-fold cross-validation (5-fold CV) for all the supervised machine learning algorithms.

In specificity, according to the results, CNN had a prediction accuracy of 100%, and its 5-fold cross-validation reached to 99.78%, which made the algorithm best in predicting CSKP and CRKP strains compared with the other 7 supervised machine learning algorithms. As for GB and LDA, their performance measures were exactly the same (ACC = 99.4%, precision = 99.4%, recall = 99.38%, F1 = 99.4%). However, 5-fold cross-validation showed that GB (94.91%) had higher average accuracy than LDA (81.1%), which suggested that GB was more stable than LDA in terms of Raman spectral analysis. The other five algorithms also showed comparatively good performance in terms of prediction capacities, among which SVM (ACC = 93.54%) had the lowest accuracy. Taken together, CNN was found to be the best prediction model, while SVM was the worst. However, when using 5-fold CV as a measurement, LDA had the worst performance. For details of the performance measures for the eight algorithms, please refer to Table 2. Moreover, it was also noteworthy that two kernel functions of the SVM algorithm, linear function (linear) and radial basis function (rbf), were compared, according to which linear kernel function (ACC = 93.54%, precision = 93.54%, recall = 93.45%, and F1 = 93.49%) performed better than rbf kernel function (ACC = 80.36%, precision = 80.36%, recall = 80.44%, and F1 = 80.36%). Thus, SVM with linear kernel function was more appropriate for dealing with dichotomy problems than the rfb kernel function that was not included in Table 2.

TABLE 2.

Comparison of performance measures of eight different supervised machine learning algorithms

Algorithms	ACC^a	Precision	Recall	F1	5-Fold CV^b
CNN	100%	100%	100%	100%	99.78%
GB	99.40%	99.40%	99.38%	99.40%	94.91%
LDA	99.40%	99.40%	99.38%	99.40%	81.10%
KNN	98.21%	98.21%	98.23%	98.21%	94.90%
RF	98.21%	98.21%	98.28%	98.21%	94.65%
AdaBoost	97.62%	97.62%	97.57%	97.62%	95.17%
DT	96.43%	96.43%	96.47%	96.43%	93.63%
SVM	93.54%	93.54%	93.45%	93.49%	93.43%

Open in a new tab

Algorithms were ranked from high to low values of accuracy (ACC).

5-Fold CV, fivefold cross validation.

(ii) Receiver operating characteristic curves. To measure the advantages and disadvantages of each supervised machine learning model used in this study for the prediction of CSKP and CRKP strains, receiver operating characteristic (ROC) curves were used to compare the sensitivity and specificity of the prediction results of each model (Fig. 2). The x axis represents specificity, which is also called the false-positive rate. The closer the x axis is to zero, the higher the accuracy rate. The y axis represents sensitivity, which is also known as true positive rate (sensitivity). The larger the y axis, the better the accuracy. Therefore, the closer the ROC curve is to the upper left corner, the higher the accuracy of the experiment. Meanwhile, area under curve (AUC) was also calculated for each ROC curve in order to quantitatively measure the performance of each model. The larger the value of AUC, the better the performance of the model (Fig. 2). According to the results, CNN (AUC = 0.9957) had the best performance, which was tightly followed by LDA (AUC = 0.9745) and Adaboost (AUC = 0.9767). As for other models, their AUC values were also comparatively good and were all greater than 0.95, except for SVM (AUC = 0.9414), which had the lowest AUC value.

FIG 2 — ROC curves of eight supervised machine learning algorithms used in this study. Through comparison, it could be seen that the CNN model (AUC = 0.9957) had the best performance for predicting CSKP and CRKP strains in this study; TPR, true positive rate; FPR, false positive rate.

(iii) Confusion matrix. A confusion matrix is a visual display used to describe the performance of a classification model on a set of test data for which the true values are known. Each column of the matrix represents the sample predicted by the model, while each row of the matrix represents the true status of the sample. In this study, we drew a set of binary classification confusion matrices for the eight supervised machine learning models (Fig. 3). Compared with other models, the results showed that the final recognition accuracy of the CNN model for the two groups of Raman spectra, CSKP and CRKP, was the best and reached to 100%, which verified the feasibility of the CNN model for the prediction and classification of bacterial antibiotic-resistant and antibiotic-sensitive phenotypes based on SERS spectra.

FIG 3 — Confusion matrices for eight machine learning algorithms in terms of classification of CSKP and CRKP strains. For each confusion matrix, rows correspond to phenotypes (antibiotic resistance or sensitivity) identified by standard biochemical tests (true class), while columns correspond to phenotypes predicted by supervised machine learning algorithms (predicted class). Numbers in the confusion matrix stand for the percentage of correctly classified (diagonal) or misclassified (off-diagonal) spectra, respectively.

Effects of signal-to-noise ratio on machine learning accuracy.

During the generation of Raman spectra from bacterial samples, it was impossible to avoid the interference of many uncontrollable factors, including environmental noise, fluorescence, and radiation, etc. The existence of a variety of interference noises requires higher anti-interference ability and robustness of classification algorithm. In this study, we added artificial noises to raw Raman spectral data via Gaussian noise interference with different SNRs (1 db, 2 db, 3 db, 5 db, 15 db, 25 db, and 35 db [decibels]) and then compared the effects of SNR on the classification accuracy of eight supervised machine learning algorithms (Fig. 4). According to the results, the general trend was that the higher the SNR, the better the classification accuracy. In addition, CNN showed better stability and higher accuracy than the other seven tested algorithms. Thus, CNN had the best anti-interference ability when dealing with raw Raman spectra data.

FIG 4 — Quantitative analysis of the influences of different SNR on eight supervised machine learning algorithms. As seen in the figure, with the increment of SNR added to raw Raman spectra, accuracy of classification algorithms generally increased. Among all the models, CNN showed the strongest robustness in its antinoise ability. When 1-db SNR was added to the Raman spectra, CNN could still classify Raman spectral data with an accuracy rate over 90%.

DISCUSSION

In recent years, multidrug-resistant (MDR), extensively drug-resistant (XDR), and pan-drug-resistant (PDR) bacterial pathogens are increasingly being reported worldwide and are not uncommon to be identified in different bacterial species, such as Pseudomonas aeruginosa, Acinetobacter baumannii, and Klebsiella pneumoniae, which leads to higher outbreak potentials and international spread of bacterial pathogens (25). As an opportunistic pathogen, K. pneumoniae is a natural inhabitant of the gut microbiota and is also commonly encountered in hospital-acquired infections (7). In addition, K. pneumoniae can cause many serious diseases, such as pneumonia, urinary tract infections, and bloodstream infections, while increasing numbers of strains resistant to antibiotics have been reported due to antibiotics abuse. According to the CLSI guidelines, CRKP has been defined as the first clinical K. pneumoniae-positive culture from inpatients with resistance to at least one of the following carbapenems: meropenem, imipenem, and ertapenem (12). Since carbapenems are the last line of defense against multidrug-resistant Gram-negative infections (26), CRKP represents a great challenge for clinical practitioners (4). Thus, rapid and accurate identification of CRKP is crucial for prescribing antibiotic therapy and relevant treatment strategies (4). Since all the CRKP strains were multidrug resistant, we checked other antibiotic-resistant phenotypes for all the CSKP and CRKP strains isolated in this study in addition to carbapenem resistance. The complete antibiotic resistance profiles are presented in Fig. S1A in the supplemental material and are detailed in Table S1 (CRKP) and Table S2 (CSKP). A PCA algorithm separated CRKP and CSKP strains well into two independent groups (Fig. S1B), while hierarchical clustering analysis (HCA) clustered CRKP and CSKP into two different hierarchies except for one misclassification (Fig. S1A), that is, sample 16-2. The results indicated that the two groups of K. pneumoniae strains were intrinsically different in terms of their antibiotic resistance profiles, although capacity of statistical algorithms differed in separating the samples into corresponding groups solely based on antibiotic-resistant profiles.

Traditional methods for the detection of antibiotic resistance usually take at least 6 to 18 h for preliminary results and 48 h or longer for definitive results, which significantly delays the choice of an appropriate antimicrobial therapy (3), not even mentioning the fastidious and nonculturable bacterial pathogens. Thus, novel methods are needed for fast and reliable identification of bacterial antibiotic resistance. Although Raman spectroscopy (RS) has been considered a potential technique with label-free and noninvasive features for the analysis of bacterial pathogens, there is currently no real-world applications of RS in clinical settings for bacterial analysis due to various restrictions, such as weak Raman scattering effect and low reproducibility and repeatability, etc. (15). Currently, surface-enhanced Raman spectroscopy (SERS) has been extensively developed to overcome the weak Raman scattering effect, which uses metallic nanoparticles (gold, silver, and copper) to concentrate electromagnetic energy via surface plasmons (27), although the reproducibility and repeatability of SERS is also debatable (28). For example, Witkowska et al. systematically compared the differences between RS and SERS in terms of bacterial detection, according to which SERS spectra had much better quality than the normal Raman spectra for both Escherichia coli and Bacillus subtilis (29). In addition, low reproducibility and repeatability are caused by several uncontrollable external factors during an experiment (30), which could be partially reduced through increased number of biological and technical repeats. An averaged Raman spectrum with standard error was thus generated for the analysis of characteristic peaks (Fig. 1), which was also used as the representative spectrum of a specific bacterial strain (Fig. S2). Taken together, due to the greatly enhanced signal intensity, SERS was applied to all the K. pneumoniae strains for antibiotic resistance analysis in this study.

Due to the complexity of SERS spectra, traditional statistical methods are not sufficient to deal with the data analysis procedures (15). Thus, advanced computational methods such as supervised machine learning algorithms have been recruited for sample prediction. For example, Wang et al. (31) used CNN- and artificial neural network (ANN)-classified and predicted 18 Arcobacter species from clinical, environmental, and agri-food sources with an accuracy rate of 97.2%. In addition, Tang et al. also successfully identified a set of clinically isolated Staphylococcus species via the combination of surface-enhanced Raman spectral fingerprinting and machine learning algorithms, which also confirmed the potential applicability of the SERS technology in clinical diagnostics (17). In terms of the differentiation of antibiotic resistance and sensitivity in bacterial strains, a variety of studies have addressed this question. However, most of the studies used simple statistical models, such as linear discriminant analysis (LDA) and principal-component analysis (PCA) for data analysis. For example, Verma et al. used partial least squares-discriminant analysis (PLS-DA) to study Raman spectra of Escherichia coli strains treated with bacteriostatic and bactericidal antibiotics, which identified characteristic peaks that are altered by antibiotic concentrations (32). In addition, Cheong et al. analyzed drop-coating deposition SERS spectra via PCA and SVM to identify quinolone-resistant K. pneumoniae strains (33).

We applied an advanced SERS technique coupled with machine learning algorithms to clinically isolated K. pneumoniae strains, through which CSKP and CRKP strains were rapidly and accurately recognized. A total of eight commonly used supervised machine learning methods, including AdaBoost, CNN, DT, GB, KNN, LDA, RF, and SVM, were performed on SERS spectral data and compared in terms of their capacities in predicting CSKP and CRKP strains. Among these algorithms, CNN consistently performed best based on all the evaluation indicators (Table 2), ROC curves (Fig. 2), and confusion matrix (Fig. 3), achieving 99.78% accuracy during 5-fold cross-validation. Previously, Ho et al. (34) used convolutional neural network (CNN) and support vector machine (SVM) methods to successfully identify methicillin-resistant Staphylococcus aureus (MRSA) and methicillin-sensitive S. aureus (MSSA) with an accuracy of 89 ± 0.1%. Thus, the pilot study showed that CNN could be used for antibiotic resistance predictions in differential bacterial species with better performance. Moreover, by comparing with other machine learning methods used in this study, CNN could handle complex regression and classification problems without assumed mathematical equations between input and output, leading to high computational efficiency and strong fault tolerance (31). So far, few studies have paid attention to how Raman spectral preprocess influenced the analysis of machine learning algorithms. Our study also compared anti-interference capacities of these algorithms in terms of artificially added noises (Fig. 4), which consistently revealed that CNN performed the best compared with other supervised machine learning algorithms. Thus, we concluded that CNN has good robustness on low signal-to-noise ratio data in the SERS spectra.

In this pilot study, we performed comparative analyses of supervised machine learning algorithms on discriminating CSKP and CRKP strains via SERS spectra. Although all the methods achieved relatively high prediction accuracies, there are still many aspects that need to be improved for the potential application of the method in clinical settings. For example, the models that we constructed were not robust and sufficient for real-world applications due to the limited number of K. pneumoniae strains used in this study. In addition, since both CSKP and CRKP strains were multidrug-resistant bacteria, other antibiotic resistances rather than carbapenem resistance may also be involved in the identification of CSKP and CRKP strains because of their contributions to the generation of SERS spectra. Thus, more SERS spectra from clinically isolated CSKP and CRKP strains should be used for training the machine learning models, which would greatly improve the quality and robustness of the models. In addition, antibiotic resistance profiles during K. pneumoniae isolation should be strictly controlled, and those strains only with differences in carbapenem sensitivity and resistance should be used for SERS spectral analysis, while the profiles of other antibiotic resistance should be the same. In this way, machine learning models could reliably predict CSKP and CRKP strains solely based on carbapenem resistance rather than other antibiotic resistances. It should be noted that, although acquisition of the signal for a single SERS spectrum took only seconds, the method used in this study still required bacterial culture and isolation, which made the overall procedure time consuming. In future studies, we will aim to use machine learning models to recognize CSKP and CRKP strains from clinical samples directly, which will greatly improve the efficiency of the rapid diagnostics of carbapenem-resistant and carbapenem-sensitive K. pneumoniae strains.

Conclusion.

Surface-enhanced Raman spectroscopy has been widely studied in terms of its application potentials in the diagnosis of bacterial pathogens and detection of antibiotic resistance. In this study, we calculated the average SERS spectra for CSKP and CRKP strains, through which the profiles of their characteristic peaks were identified. We then explored supervised machine learning algorithms in terms of their capacities in predicting CSKP and CRKP strains via SERS spectra. According to the results, eight supervised machine learning methods could successfully predict carbapenem sensitivity and resistance in K. pneumoniae, with the CNN algorithm on top of all other methods. In addition, CNN also performed best on SERS spectra with low signal-to-noise ratios. Taken together, our study confirmed the application potentials of surface-enhanced Raman spectroscopy in fast and accurate discrimination of K. pneumoniae strains with different antibiotic resistance profiles.

MATERIALS AND METHODS

Bacterial strains.

Both CSKP (n = 7) and CRKP (n = 8) strains were directly isolated from clinical samples and cultured on Columbia blood agar plates (35°C, 18 to 24 h) at the Department of Laboratory Medicine, Affiliated Hospital of Xuzhou Medical University. It is noteworthy that all the clinical samples were previously deidentified, and only bacterial isolates were analyzed in this study. Drug susceptibility was identified through Vitek2 Compact, an automated microbial identification (ID)/antibiotic susceptibility testing (AST) instrument (bioMérieux, La Balme-les-Grottes, France) in the Department of Laboratory Medicine, Affiliated Hospital of Xuzhou Medical University. Carbapenem resistance together with other antibiotic resistance profiles were determined according to the MIC breakpoint standards of the CLSI Subcommittee on Antimicrobial Susceptibility Testing (M100-S30) (Tables S1 and S2 in the supplemental material) (35). All bacteria were confirmed with biochemical tests and matrix-assisted laser desorption ionization–time of flight mass spectrometry (MALDI-TOF MS) for strain typing and were then stored at −80°C (Thermo Fisher, USA). In addition, principal-component analysis (PCA) and hierarchical clustering analysis (HCA) methods were applied to group these bacteria into two groups based on their antibiotic resistance profiles. Distribution patterns of antibiotic resistance in the CSKP and CRKP strains were visualized through interactive Tree of Life (iTOL) and are presented in Fig. S1 (36).

Preparation of AgNO₃ solution.

AgNO₃ (33.72 mg; Sinopharm, Beijing, China) was weighed and gently mixed with 200 mL of deionized water (ddH₂O) in a clean sterile triangular flask, which was then heated on a magnetic stirrer (ZNCL-BS230, Shi-Ji-Hua-Ke Pty. Ltd., Beijing, China) until boiling; then, 8 mL of 1% (wt) sodium citrate was added into the mixture and stirred with a speed of 650 r/min. Heating was stopped, and stirring was continued until the mixture cooled to room temperature (RT). The final volume was set to 200 mL via addition of ddH₂O. One milliliter of the above-made solution was transferred to a 1.5-mL Eppendorf tube and centrifuged at 7,000 r/min for 7 min (centrifuge 5430 R, Eppendorf, USA); the supernatant was discarded after centrifugation, and the pellet was resuspended with 100 μL of ddH₂O to get a uniform milky gray solution. The solution is the negatively charged silver nanoparticle (AgNP) substrate. The solution was stored in the dark at RT for later use.

Surface-enhanced Raman spectroscopy.

After each K. pneumoniae strain was cultured on the agar plate overnight, a single colony was picked and inoculated into 15 μL of phosphate-buffered saline (PBS) with vigorous vortexing. Fifteen microliters of negatively charged AgNPs was then mixed with the PBS solution, which was then dropped onto a silicon wafer to air dry. The dried spot was then measured via a commercial i-Raman Plus Raman spectrometer BWS456-785H (B&W Tek, USA). The measurement settings were set as follows: helium-neon (HeNe) laser power, 20 mW; wavelength, 785 nm; detector type, high quantum efficiency charge-coupled device (CCD) array; Raman shift range, 175 to 2,700 cm⁻¹; spectral acquisition, 5 s; resolution, <3.5 cm⁻¹ at 912 nm. The software BWSpec (version 4.10) was used to generate Raman spectral data. Each spectrum consisted of 657 points measured in the range of 519.56 cm⁻¹ to 1,800.81 cm⁻¹. A total of 15 Klebsiella pneumoniae strains were included in this experiment, which includes 7 strains of CSKP (N = 280) and 8 strains of CRSP (N = 280). Thus, a total of 560 surface-enhanced Raman spectra were collected, which was denoted by the letter N within the parentheses for each group of K. pneumoniae strains. For details, please refer to Table S3.

Raman spectra data analysis.

Raman spectral analysis requires pretreatment of raw data in order to improve signal-to-noise ratio (SNR) and normalize spectral distributions, which includes curve smoothing and denoising, baseline correction, and spectral normalization. In particular, averaged Raman spectra were generated for CSKP and CRKP strains by calculating the averaged value of intensity with artificial units (a.u.) at each Raman shift in the range of from 519.56 cm⁻¹ to 1,800.18 cm⁻¹, respectively. LabSpec 6 (HORIBA Scientific, Japan) was then used for processing and smoothing the averaged Raman spectra. Characteristic peaks were calculated by following the following steps: (i) a “smoothing” function was used to smoothen averaged Raman spectra (degree of 4, size of 5, and height of 50); (ii) for baseline correction, the parameters type = polynom, degree = 6, and attach = NO were set, and “Auto” was selected to start searching for the characteristic peaks; (iii) LabSpec 6 was used to normalize the spectral data automatically in order to better compare the two curves of CSKP and CRKP; and (iv) the GaussLoren () function was used to search characteristic peaks with a level of 0% and size of 32, while other parameters were kept at default. All characteristic peaks were marked with black arrows. We then used the software Origin to generate error bands for the two averaged Raman spectra, which were based on the 20% standard deviation of Raman effect intensity corresponding to each Raman shift and could reflect the reproducibility of the experiment. An averaged Raman spectrum with a 20% standard error band together with characteristic peaks were also generated for each K. pneumoniae strain, which showed the repeatability of Raman spectroscopy for a single K. pneumoniae strain (Fig. S2). In addition, a PCA was performed based on the distribution of characteristic peaks of each bacterial strain in order to separate all the K. pneumoniae strains into different groups (Fig. S3).

Machine learning algorithms.

(i) Data preprocessing. Before machine learning analysis, the software Unscrambler X was used to perform baseline correction, smoothing, denoising, and normalization of the original Raman spectra (37). In this study, we used the Savitzky-Golay (S-G) smoothing filter for denoised smoothing of Raman spectra, in which the polynomial fitting order was set to 2. For baseline correction, the multiple scattering correction (MSC) method was used, which could effectively eliminate the scattering effect of the spectral data and enhance the spectral absorption information related to the molecular compositions (38). Normalization is a method for simplifying calculation, and a variety of methods have been developed to normalize the Raman spectrum (24). In this study, we normalized SERS spectra by column (Raman intensity values at a particular Raman shift). That is, the highest intensity value (peak value) in each column was selected as the maximum constant so that all the other measured spectral intensities in the same column were divided to the highest intensity value to realize the normalization of the spectral data (39).

(ii) Supervised machine learning. In this study, eight supervised machine learning methods, that is, adaptive boosting (AdaBoost), convolutional neural network (CNN), decision tree (DT), gradient boosting (GB), k-nearest neighbors (KNN), linear discriminant analysis (LDA), random forest (RF), and support vector machine (SVM), were compared for their capacities in classifying and predicting Raman spectral data by using the Python machine learning package “sklearn” (https://scikit-learn.org). Among the eight supervised machine learning algorithms, CNN is an artificial neural network, and the weights in CNN are trained through the backpropagation algorithm to achieve deep learning analysis. In this study, we used LeNet-5, a classical and efficient neural network model, for Raman spectral analysis. The schematic illustration of the network structure is shown in Fig. 5.

FIG 5 — Schematic illustration of LeNet-5 neural network architecture. LeNet-5 neural network classified different types of data through convolution and pooling steps and then via the full connection layer. The SoftMax activation function was finally used for the output layer.

(iii) Evaluation of supervised machine learning algorithms. In order to compare the classification ability of different machine learning algorithms on Raman spectrum data, we need an evaluation standard to measure the generalization ability of the model. In the identification of spectral signals, the most commonly used performance measurements are accuracy (acc) and error rate (error), which has the following relationship: acc = 1 – error. In the evaluation of machine learning models, precision (P) and recall (R) are a pair of mutually restrictive performance metrics. Normally, the precision rate and recall rate are measurements of predictive performance. When P is high, R is low and vice versa. Therefore, when evaluating the model, in order to more intuitively reflect the performance of the model, F1 is used as a metric, which is based on the harmonic average of precision and recall (24). Because, in this study, sample size is small, when the data are divided, overfitting may occur due to unbalanced data division. For the CNN model, overfitting is more likely to occur (24). For the optimal model, we used cross-validation to divide the data set, average the results of multiple evaluations, and eliminate the adverse effects caused by the unbalanced data division, which is easier to reflect on small data sets (Fig. 6).

FIG 6 — Schematic illustration of 5-fold cross-validation for supervised machine learning algorithms. The 5-fold cross-validation divided the data into 5 parts and selected 1 part as the test set and the other 4 parts as the training set at each time. The above steps were repeated five times, and different parts were selected as the test set at each time, which would provide average model accuracy, indicating how stable each model was.

Construction of confusion matrix.

A confusion matrix aims to summarize the performance of a machine learning algorithm. During the construction of a confusion matrix for each supervised machine learning algorithm, the CNN model was built on Keras architecture while the other seven supervised algorithms directly call the classifiers in the scikit-learn package, which included KNeighborsClassifier(), SVC(), DecisionTreeClassifier(), RandomForestClassifier(), AdaBoostClassifier(), GradientBoostingClassifier(), and LinearDiscriminantAnalysis(), respectively.

Effects of SNR on machine learning accuracy.

In order to improve the performance of the model, data enhancement is usually adopted to expand the sample data and enhance the diversity of data. In this experiment, seven random SNR Gaussian white noises with different intensity were added to the Raman spectral data. Then, eight supervised machine learning models trained in this study were used for testing the effects of different SNRs on the classification accuracy by following the same procedures as described above.

Supplementary Material

Reviewer comments

reviewer-comments.pdf^{(8.7MB, pdf)}

ACKNOWLEDGMENTS

We thank the anonymous reviewers for their thoughtful comments that greatly improved the quality of the manuscript.

L.W. gratefully acknowledges the financial support of National Natural Science Foundation of China (31900022), Natural Science Foundation of Jiangsu Province (BK20180997), Young Science and Technology Innovation Team of Xuzhou Medical University (TD202001), and Jiangsu Qing-Lan Project (2020). B.G. thanks the financial support of the National Natural Science Foundation of China (81871734, 82072380), Key R & D Program of Jiangsu Province (BE2020646), and Research Foundation for Advanced Talents of Guandong Provincial People’s Hospital (KJ012021097).

L.W., X.Z., B.G., and W.L. conceived and designed the experiments. L.W., X.Z., B.G., and W.L. contributed to the project administration. L.W., J.W.T., J.W.L., J.J.W., X.Y.S., and Y.C.P. carried out the computational and experimental work. L.W., J.W.T., and Q.H.L. wrote and revised the manuscript. L.W., X.Z., and B.G. provided the platform, resources, and student supervision. All authors read and approved the final manuscript.

Footnotes

Supplemental material is available online only.

SUPPLEMENTAL FILE 1

Supplemental material. Download SPECTRUM02409-21_Supp_1_seq9.pdf, PDF file, 5.3 MB^{(5.4MB, pdf)}

Contributor Information

Xiao Zhang, Email: changshui@hotmail.com.

Bing Gu, Email: gb20031129@163.com.

Liang Wang, Email: healthscience@foxmail.com.

Karen C. Carroll, Johns Hopkins Hospital

Katarína Rebrošová, Department of Microbiology, Faculty of Medicine of Masaryk University and St. Anne's University Hospital Brno.

REFERENCES

1.Piret J, Boivin G. 2020. Pandemics throughout history. Front Microbiol 11:631736. doi: 10.3389/fmicb.2020.631736. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Ventola CL. 2015. The antibiotic resistance crisis: part 1: causes and threats. P T 40:277–283. [PMC free article] [PubMed] [Google Scholar]
3.Burnham C-AD, Leeds J, Nordmann P, O’Grady J, Patel J. 2017. Diagnosing antimicrobial resistance. Nat Rev Microbiol 15:697–703. doi: 10.1038/nrmicro.2017.103. [DOI] [PubMed] [Google Scholar]
4.Reyes J, Aguilar AC, Caicedo A. 2019. Carbapenem-resistant Klebsiella pneumoniae: microbiology key points for clinical practice. Int J Gen Med 12:437–446. doi: 10.2147/IJGM.S214305. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Wu T, Xu F, Su C, Li H, Lv N, Liu Y, Gao Y, Lan Y, Li J. 2020. Alterations in the gut microbiome and cecal metabolome during Klebsiella pneumoniae-induced pneumosepsis. Front Immunol 11:1331. doi: 10.3389/fimmu.2020.01331. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Podschun R, Ullmann U. 1998. Klebsiella spp. as nosocomial pathogens: epidemiology, taxonomy, typing methods, and pathogenicity factors. Clin Microbiol Rev 11:589–603. doi: 10.1128/CMR.11.4.589. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Navon-Venezia S, Kondratyeva K, Carattoli A. 2017. Klebsiella pneumoniae: a major worldwide source and shuttle for antibiotic resistance. FEMS Microbiol Rev 41:252–275. doi: 10.1093/femsre/fux013. [DOI] [PubMed] [Google Scholar]
8.Maicas S, Moya C. 2020. Antimicrobial resistance in Klebsiella pneumoniae strains: mechanisms and outbreaks. Proceedings 66:11. doi: 10.3390/proceedings2020066011. [DOI] [Google Scholar]
9.Paterson DL, Hujer KM, Hujer AM, Yeiser B, Bonomo MD, Rice LB, Bonomo RA, International Klebsiella Study Group. 2003. Extended-spectrum β-lactamases in Klebsiella pneumoniae bloodstream isolates from seven countries: dominance and widespread prevalence of SHV- and CTX-M-type β-lactamases. Antimicrob Agents Chemother 47:3554–3560. doi: 10.1128/AAC.47.11.3554-3560.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Lee CH, Su LH, Tang YF, Liu JW. 2006. Treatment of ESBL-producing Klebsiella pneumoniae bacteraemia with carbapenems or flomoxef: a retrospective study and laboratory analysis of the isolates. J Antimicrob Chemother 58:1074–1077. doi: 10.1093/jac/dkl381. [DOI] [PubMed] [Google Scholar]
11.Li Y, Shen H, Zhu C, Yu Y. 2019. Carbapenem-resistant Klebsiella pneumoniae infections among ICU admission patients in central China: prevalence and prediction model. BioMed Res Int 2019:9767313. doi: 10.1155/2019/9767313. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Fang L, Xu H, Ren X, Li X, Ma X, Zhou H, Hong G, Liang X. 2020. Epidemiology and risk factors for carbapenem-resistant Klebsiella pneumoniae and subsequent MALDI-TOF MS as a tool to cluster KPC-2-producing Klebsiella pneumoniae, a retrospective study. Front Cell Infect Microbiol 10:462. doi: 10.3389/fcimb.2020.00462. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Wang C, Yuan Z, Huang W, Yan L, Tang J, Liu C-W. 2019. Epidemiologic analysis and control strategy of Klebsiella pneumoniae infection in intensive care units in a teaching hospital of People’s Republic of China. Infect Drug Resist 12:391–398. doi: 10.2147/IDR.S189154. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Banoub J, Huang T-S, Lee SS-J, Lee C-C, Chang F-C. 2020. Detection of carbapenem-resistant Klebsiella pneumoniae on the basis of matrix-assisted laser desorption ionization time-of-flight mass spectrometry by using supervised machine learning approach. PLoS One 15:e0228459. doi: 10.1371/journal.pone.0228459. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Wang L, Liu W, Tang J-W, Wang J-J, Liu Q-H, Wen P-B, Wang M-M, Pan Y-C, Gu B, Zhang X. 2021. Applications of Raman spectroscopy in bacterial infections: principles, advantages, and shortcomings. Front Microbiol 12:683580. doi: 10.3389/fmicb.2021.683580. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Bora T. 2018. Chapter 6. Recent developments on metal nanoparticles for SERS applications. In Seehra MS, Bristow AD (ed), Noble and precious metals – properties, nanoscale effects and applications. IntechOpen, London, UK. [Google Scholar]
17.Tang J-W, Liu Q-H, Yin X-C, Pan Y-C, Wen P-B, Liu X, Kang X-X, Gu B, Zhu Z-B, Wang L. 2021. Comparative analysis of machine learning algorithms on surface enhanced Raman spectra of clinical Staphylococcus species. Front Microbiol 12:696921. doi: 10.3389/fmicb.2021.696921. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.AlMasoud N, Muhamadali H, Chisanga M, AlRabiah H, Lima CA, Goodacre R. 2021. Discrimination of bacteria using whole organism fingerprinting: the utility of modern physicochemical techniques for bacterial typing. Analyst 146:770–788. doi: 10.1039/d0an01482f. [DOI] [PubMed] [Google Scholar]
19.Gautam R, Vanga S, Ariese F, Umapathy S. 2015. Review of multidimensional data processing approaches for Raman and infrared spectroscopy. EPJ Tech Instrum 2:8. doi: 10.1140/epjti/s40485-015-0018-6. [DOI] [Google Scholar]
20.Barton SJ, Ward TE, Hennelly BM. 2018. Algorithm for optimal denoising of Raman spectra. Anal Methods 10:3759–3769. doi: 10.1039/C8AY01089G. [DOI] [Google Scholar]
21.Yuan X, Mayanovic RA. 2017. An empirical study on Raman peak fitting and its application to Raman quantitative research. Appl Spectrosc 71:2325–2338. doi: 10.1177/0003702817721527. [DOI] [PubMed] [Google Scholar]
22.de Siqueira e Oliveira FS, da Silva AM, Pacheco MTT, Giana HE, Silveira L. 2021. Biochemical characterization of pathogenic bacterial species using Raman spectroscopy and discrimination model based on selected spectral features. Lasers Med Sci 36:289–302. doi: 10.1007/s10103-020-03028-9. [DOI] [PubMed] [Google Scholar]
23.Durickovic I. 2016. Chapter 19. Using Raman spectroscopy for characterization of aqueous media and quantification of species in aqueous solution. In Stauffer M (ed), Applications of molecular spectroscopy to current research in the chemical and biological sciences. IntechOpen, London, UK. [Google Scholar]
24.Chicco D. 2017. Ten quick tips for machine learning in computational biology. BioData Min 10:35. doi: 10.1186/s13040-017-0155-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Karakonstantis S, Kritsotakis EI, Gikas A. 2019. Pandrug-resistant Gram-negative bacteria: a systematic review of current epidemiology, prognosis and treatment options. J Antimicrob Chemother 75:271–282. doi: 10.1093/jac/dkz401. [DOI] [PubMed] [Google Scholar]
26.Adegoke AA, Fatunla OK, Okoh AI. 2020. Critical threat associated with carbapenem-resistant Gram-negative bacteria: prioritizing water matrices in addressing total antibiotic resistance. Ann Microbiol 70:43. doi: 10.1186/s13213-020-01579-4. [DOI] [Google Scholar]
27.Pérez-Jiménez AI, Lyu D, Lu Z, Liu G, Ren B. 2020. Surface-enhanced Raman spectroscopy: benefits, trade-offs and future developments. Chem Sci 11:4563–4577. doi: 10.1039/d0sc00809e. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Xiong M, Ye J. 2014. Reproducibility in surface-enhanced Raman spectroscopy. J Shanghai Jiaotong Univ (Sci) 19:681–690. doi: 10.1007/s12204-014-1566-7. [DOI] [Google Scholar]
29.Witkowska E, Niciński K, Korsak D, Szymborski T, Kamińska A. 2019. Sources of variability in SERS spectra of bacteria: comprehensive analysis of interactions between selected bacteria and plasmonic nanostructures. Anal Bioanal Chem 411:2001–2017. doi: 10.1007/s00216-019-01609-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Di Mascolo D, Coclite A, Gentile F, Francardi M. 2019. Quantitative micro-Raman analysis of micro-particles in drug delivery. Nanoscale Adv 1:1541–1552. doi: 10.1039/c8na00187a. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Wang K, Chen L, Ma X, Ma L, Chou KC, Cao Y, Khan IUH, Gölz G, Lu X, Master ER. 2020. Arcobacter identification and species determination using Raman spectroscopy combined with neural networks. Appl Environ Microbiol 86:e00924-20. doi: 10.1128/AEM.00924-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Verma T, Annappa H, Singh S, Umapathy S, Nandi D. 2021. Profiling antibiotic resistance in Escherichia coli strains displaying differential antibiotic susceptibilities using Raman spectroscopy. J Biophotonics 14:e202000231. doi: 10.1002/jbio.202000231. [DOI] [PubMed] [Google Scholar]
33.Cheong Y, Kim YJ, Kang H, Choi S, Lee HJ. 2017. Rapid label-free identification of Klebsiella pneumoniae antibiotic resistant strains by the drop-coating deposition surface-enhanced Raman scattering method. Spectrochim Acta A Mol Biomol Spectrosc 183:53–59. doi: 10.1016/j.saa.2017.04.044. [DOI] [PubMed] [Google Scholar]
34.Ho C-S, Jean N, Hogan CA, Blackmon L, Jeffrey SS, Holodniy M, Banaei N, Saleh AAE, Ermon S, Dionne J. 2019. Rapid identification of pathogenic bacteria using Raman spectroscopy and deep learning. Nat Commun 10:4927. doi: 10.1038/s41467-019-12898-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Weinstein MP, Lewis JS, Kraft CS. 2020. The clinical and laboratory standards institute subcommittee on antimicrobial susceptibility testing: background, organization, functions, and processes. J Clin Microbiol 58:e01864-19. doi: 10.1128/JCM.01864-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Letunic I, Bork P. 2021. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res 49:W293–W296. doi: 10.1093/nar/gkab301. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Martens H, Karstang T, Næs T. 1987. Improved selectivity in spectroscopy by multivariate calibration. J Chemometrics 1:201–219. doi: 10.1002/cem.1180010403. [DOI] [Google Scholar]
38.Laumer J, O’Leary SK. 2018. An approach to the spectral smoothing of Raman data applied to the specific case of thin-film carbon. J Mater Sci Mater Electron 29:10026–10036. doi: 10.1007/s10854-018-9046-8. [DOI] [Google Scholar]
39.Watt J, Borhani R, Katsaggelos A. 2020. Machine learning refined: foundations, algorithms, and applications, 2nd ed. Cambridge University Press, Cambridge, UK. [Google Scholar]
40.Mert S, Özbek E, Otunctemur A, Culha M. 2015. Kidney tumor staging using surface-enhanced Raman scattering. J Biomed Opt 20:047002. doi: 10.1117/1.JBO.20.4.047002. [DOI] [PubMed] [Google Scholar]
41.Ahmed M, Almagedi S, Yao W. 2013. SERS signatures of foodborne pathogenic zoonotic bacteria using gold colloid. Int J Eng Sci Technol 5:810. [Google Scholar]
42.Chao Y, Zhang T. 2012. Surface-enhanced Raman scattering (SERS) revealing chemical variation during biofilm formation: from initial attachment to mature biofilm. Anal Bioanal Chem 404:1465–1475. doi: 10.1007/s00216-012-6225-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Kim J, Park H, Kim J, Chang B, Park H-K. 2017. Label-free detection for a DNA methylation assay using Raman spectroscopy. Chin Med J (Engl) 130:1961–1967. doi: 10.4103/0366-6999.211874. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.De Luca AC, Dholakia K, Mazilu M. 2015. Modulated Raman spectroscopy for enhanced cancer diagnosis at the cellular level. Sensors (Basel) 15:13680–13704. doi: 10.3390/s150613680. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Zhang D, Neumann O, Wang H, Yuwono V, Barhoumi A, Perham M, Hartgerink J, Wittung-Stafshede P, Halas N. 2009. Gold nanoparticles can induce the formation of protein-based aggregates at physiological pH. Nano Lett 9:666–671. doi: 10.1021/nl803054h. [DOI] [PubMed] [Google Scholar]
46.Chen W, Lin J, Chen R, Feng S, Yu Y, Lin D, Huang M, Shi H, Huang H. 2015. Detection and identification of Huo–Xue–Hua–Yu decoction (HXHYD) using surface-enhanced Raman scattering (SERS) spectroscopy and multivariate analysis. Laser Phys Lett 12:e045602. doi: 10.1088/1612-2011/12/4/045602. [DOI] [Google Scholar]
47.Chaturvedi D, Balaji S, Bn VK, Ariese F, Umapathy S, Rangarajan A. 2016. Different phases of breast cancer cells: Raman study of immortalized, transformed, and invasive cells. Biosensors 6:57. doi: 10.3390/bios6040057. [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Suzuki T, Takahashi K, Uehara H, Yamanobe T. 2013. Application and analysis of a DSC-Raman spectroscopy for indium and poly(lactic acid). J Therm Anal Calorim 113:1543–1549. doi: 10.1007/s10973-013-3098-z. [DOI] [Google Scholar]
49.Rusciano G, Capriglione P, Pesce G, Del Prete S, Cennamo G, Di Cave D, Cerulli L, Sasso A. 2013. Raman microspectroscopy analysis in the treatment of acanthamoeba keratitis. PLoS One 8:e72127. doi: 10.1371/journal.pone.0072127. [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Zhang X, Yin H, Cooper J, Haswell S. 2008. Characterization of cellular chemical dynamics using combined microfluidic and Raman techniques. Anal Bioanal Chem 390:833–840. doi: 10.1007/s00216-007-1564-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Chisanga M, Muhamadali H, Ellis DI, Goodacre R. 2018. Surface-enhanced Raman scattering (SERS) in microbiology: illumination and enhancement of the microbial world. Appl Spectrosc 72:987–1000. doi: 10.1177/0003702818764672. [DOI] [PubMed] [Google Scholar]
52.Nguyen EP, Chrimes A, Brkljb a.R, Ou J, Berean K, Zhuiykov S, Kalantarzadeh K. 2013. Assessment of a Raman micro-spectroscopy/microfluidics unit using a model E. coli/glucose bio-system, p 157–162. In The 7th IEEE International Conference on Nano/Molecular Medicine and Engineering. IEEE, New York, NY. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Reviewer comments

reviewer-comments.pdf^{(8.7MB, pdf)}

SUPPLEMENTAL FILE 1

Supplemental material. Download SPECTRUM02409-21_Supp_1_seq9.pdf, PDF file, 5.3 MB^{(5.4MB, pdf)}

[B1] 1.Piret J, Boivin G. 2020. Pandemics throughout history. Front Microbiol 11:631736. doi: 10.3389/fmicb.2020.631736. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B2] 2.Ventola CL. 2015. The antibiotic resistance crisis: part 1: causes and threats. P T 40:277–283. [PMC free article] [PubMed] [Google Scholar]

[B3] 3.Burnham C-AD, Leeds J, Nordmann P, O’Grady J, Patel J. 2017. Diagnosing antimicrobial resistance. Nat Rev Microbiol 15:697–703. doi: 10.1038/nrmicro.2017.103. [DOI] [PubMed] [Google Scholar]

[B4] 4.Reyes J, Aguilar AC, Caicedo A. 2019. Carbapenem-resistant Klebsiella pneumoniae: microbiology key points for clinical practice. Int J Gen Med 12:437–446. doi: 10.2147/IJGM.S214305. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5] 5.Wu T, Xu F, Su C, Li H, Lv N, Liu Y, Gao Y, Lan Y, Li J. 2020. Alterations in the gut microbiome and cecal metabolome during Klebsiella pneumoniae-induced pneumosepsis. Front Immunol 11:1331. doi: 10.3389/fimmu.2020.01331. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B6] 6.Podschun R, Ullmann U. 1998. Klebsiella spp. as nosocomial pathogens: epidemiology, taxonomy, typing methods, and pathogenicity factors. Clin Microbiol Rev 11:589–603. doi: 10.1128/CMR.11.4.589. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B7] 7.Navon-Venezia S, Kondratyeva K, Carattoli A. 2017. Klebsiella pneumoniae: a major worldwide source and shuttle for antibiotic resistance. FEMS Microbiol Rev 41:252–275. doi: 10.1093/femsre/fux013. [DOI] [PubMed] [Google Scholar]

[B8] 8.Maicas S, Moya C. 2020. Antimicrobial resistance in Klebsiella pneumoniae strains: mechanisms and outbreaks. Proceedings 66:11. doi: 10.3390/proceedings2020066011. [DOI] [Google Scholar]

[B9] 9.Paterson DL, Hujer KM, Hujer AM, Yeiser B, Bonomo MD, Rice LB, Bonomo RA, International Klebsiella Study Group. 2003. Extended-spectrum β-lactamases in Klebsiella pneumoniae bloodstream isolates from seven countries: dominance and widespread prevalence of SHV- and CTX-M-type β-lactamases. Antimicrob Agents Chemother 47:3554–3560. doi: 10.1128/AAC.47.11.3554-3560.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B10] 10.Lee CH, Su LH, Tang YF, Liu JW. 2006. Treatment of ESBL-producing Klebsiella pneumoniae bacteraemia with carbapenems or flomoxef: a retrospective study and laboratory analysis of the isolates. J Antimicrob Chemother 58:1074–1077. doi: 10.1093/jac/dkl381. [DOI] [PubMed] [Google Scholar]

[B11] 11.Li Y, Shen H, Zhu C, Yu Y. 2019. Carbapenem-resistant Klebsiella pneumoniae infections among ICU admission patients in central China: prevalence and prediction model. BioMed Res Int 2019:9767313. doi: 10.1155/2019/9767313. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B12] 12.Fang L, Xu H, Ren X, Li X, Ma X, Zhou H, Hong G, Liang X. 2020. Epidemiology and risk factors for carbapenem-resistant Klebsiella pneumoniae and subsequent MALDI-TOF MS as a tool to cluster KPC-2-producing Klebsiella pneumoniae, a retrospective study. Front Cell Infect Microbiol 10:462. doi: 10.3389/fcimb.2020.00462. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B13] 13.Wang C, Yuan Z, Huang W, Yan L, Tang J, Liu C-W. 2019. Epidemiologic analysis and control strategy of Klebsiella pneumoniae infection in intensive care units in a teaching hospital of People’s Republic of China. Infect Drug Resist 12:391–398. doi: 10.2147/IDR.S189154. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B14] 14.Banoub J, Huang T-S, Lee SS-J, Lee C-C, Chang F-C. 2020. Detection of carbapenem-resistant Klebsiella pneumoniae on the basis of matrix-assisted laser desorption ionization time-of-flight mass spectrometry by using supervised machine learning approach. PLoS One 15:e0228459. doi: 10.1371/journal.pone.0228459. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B15] 15.Wang L, Liu W, Tang J-W, Wang J-J, Liu Q-H, Wen P-B, Wang M-M, Pan Y-C, Gu B, Zhang X. 2021. Applications of Raman spectroscopy in bacterial infections: principles, advantages, and shortcomings. Front Microbiol 12:683580. doi: 10.3389/fmicb.2021.683580. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B16] 16.Bora T. 2018. Chapter 6. Recent developments on metal nanoparticles for SERS applications. In Seehra MS, Bristow AD (ed), Noble and precious metals – properties, nanoscale effects and applications. IntechOpen, London, UK. [Google Scholar]

[B17] 17.Tang J-W, Liu Q-H, Yin X-C, Pan Y-C, Wen P-B, Liu X, Kang X-X, Gu B, Zhu Z-B, Wang L. 2021. Comparative analysis of machine learning algorithms on surface enhanced Raman spectra of clinical Staphylococcus species. Front Microbiol 12:696921. doi: 10.3389/fmicb.2021.696921. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B18] 18.AlMasoud N, Muhamadali H, Chisanga M, AlRabiah H, Lima CA, Goodacre R. 2021. Discrimination of bacteria using whole organism fingerprinting: the utility of modern physicochemical techniques for bacterial typing. Analyst 146:770–788. doi: 10.1039/d0an01482f. [DOI] [PubMed] [Google Scholar]

[B19] 19.Gautam R, Vanga S, Ariese F, Umapathy S. 2015. Review of multidimensional data processing approaches for Raman and infrared spectroscopy. EPJ Tech Instrum 2:8. doi: 10.1140/epjti/s40485-015-0018-6. [DOI] [Google Scholar]

[B20] 20.Barton SJ, Ward TE, Hennelly BM. 2018. Algorithm for optimal denoising of Raman spectra. Anal Methods 10:3759–3769. doi: 10.1039/C8AY01089G. [DOI] [Google Scholar]

[B21] 21.Yuan X, Mayanovic RA. 2017. An empirical study on Raman peak fitting and its application to Raman quantitative research. Appl Spectrosc 71:2325–2338. doi: 10.1177/0003702817721527. [DOI] [PubMed] [Google Scholar]

[B22] 22.de Siqueira e Oliveira FS, da Silva AM, Pacheco MTT, Giana HE, Silveira L. 2021. Biochemical characterization of pathogenic bacterial species using Raman spectroscopy and discrimination model based on selected spectral features. Lasers Med Sci 36:289–302. doi: 10.1007/s10103-020-03028-9. [DOI] [PubMed] [Google Scholar]

[B23] 23.Durickovic I. 2016. Chapter 19. Using Raman spectroscopy for characterization of aqueous media and quantification of species in aqueous solution. In Stauffer M (ed), Applications of molecular spectroscopy to current research in the chemical and biological sciences. IntechOpen, London, UK. [Google Scholar]

[B24] 24.Chicco D. 2017. Ten quick tips for machine learning in computational biology. BioData Min 10:35. doi: 10.1186/s13040-017-0155-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B25] 25.Karakonstantis S, Kritsotakis EI, Gikas A. 2019. Pandrug-resistant Gram-negative bacteria: a systematic review of current epidemiology, prognosis and treatment options. J Antimicrob Chemother 75:271–282. doi: 10.1093/jac/dkz401. [DOI] [PubMed] [Google Scholar]

[B26] 26.Adegoke AA, Fatunla OK, Okoh AI. 2020. Critical threat associated with carbapenem-resistant Gram-negative bacteria: prioritizing water matrices in addressing total antibiotic resistance. Ann Microbiol 70:43. doi: 10.1186/s13213-020-01579-4. [DOI] [Google Scholar]

[B27] 27.Pérez-Jiménez AI, Lyu D, Lu Z, Liu G, Ren B. 2020. Surface-enhanced Raman spectroscopy: benefits, trade-offs and future developments. Chem Sci 11:4563–4577. doi: 10.1039/d0sc00809e. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B28] 28.Xiong M, Ye J. 2014. Reproducibility in surface-enhanced Raman spectroscopy. J Shanghai Jiaotong Univ (Sci) 19:681–690. doi: 10.1007/s12204-014-1566-7. [DOI] [Google Scholar]

[B29] 29.Witkowska E, Niciński K, Korsak D, Szymborski T, Kamińska A. 2019. Sources of variability in SERS spectra of bacteria: comprehensive analysis of interactions between selected bacteria and plasmonic nanostructures. Anal Bioanal Chem 411:2001–2017. doi: 10.1007/s00216-019-01609-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B30] 30.Di Mascolo D, Coclite A, Gentile F, Francardi M. 2019. Quantitative micro-Raman analysis of micro-particles in drug delivery. Nanoscale Adv 1:1541–1552. doi: 10.1039/c8na00187a. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B31] 31.Wang K, Chen L, Ma X, Ma L, Chou KC, Cao Y, Khan IUH, Gölz G, Lu X, Master ER. 2020. Arcobacter identification and species determination using Raman spectroscopy combined with neural networks. Appl Environ Microbiol 86:e00924-20. doi: 10.1128/AEM.00924-20. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B32] 32.Verma T, Annappa H, Singh S, Umapathy S, Nandi D. 2021. Profiling antibiotic resistance in Escherichia coli strains displaying differential antibiotic susceptibilities using Raman spectroscopy. J Biophotonics 14:e202000231. doi: 10.1002/jbio.202000231. [DOI] [PubMed] [Google Scholar]

[B33] 33.Cheong Y, Kim YJ, Kang H, Choi S, Lee HJ. 2017. Rapid label-free identification of Klebsiella pneumoniae antibiotic resistant strains by the drop-coating deposition surface-enhanced Raman scattering method. Spectrochim Acta A Mol Biomol Spectrosc 183:53–59. doi: 10.1016/j.saa.2017.04.044. [DOI] [PubMed] [Google Scholar]

[B34] 34.Ho C-S, Jean N, Hogan CA, Blackmon L, Jeffrey SS, Holodniy M, Banaei N, Saleh AAE, Ermon S, Dionne J. 2019. Rapid identification of pathogenic bacteria using Raman spectroscopy and deep learning. Nat Commun 10:4927. doi: 10.1038/s41467-019-12898-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B35] 35.Weinstein MP, Lewis JS, Kraft CS. 2020. The clinical and laboratory standards institute subcommittee on antimicrobial susceptibility testing: background, organization, functions, and processes. J Clin Microbiol 58:e01864-19. doi: 10.1128/JCM.01864-19. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B36] 36.Letunic I, Bork P. 2021. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res 49:W293–W296. doi: 10.1093/nar/gkab301. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B37] 37.Martens H, Karstang T, Næs T. 1987. Improved selectivity in spectroscopy by multivariate calibration. J Chemometrics 1:201–219. doi: 10.1002/cem.1180010403. [DOI] [Google Scholar]

[B38] 38.Laumer J, O’Leary SK. 2018. An approach to the spectral smoothing of Raman data applied to the specific case of thin-film carbon. J Mater Sci Mater Electron 29:10026–10036. doi: 10.1007/s10854-018-9046-8. [DOI] [Google Scholar]

[B39] 39.Watt J, Borhani R, Katsaggelos A. 2020. Machine learning refined: foundations, algorithms, and applications, 2nd ed. Cambridge University Press, Cambridge, UK. [Google Scholar]

[B40] 40.Mert S, Özbek E, Otunctemur A, Culha M. 2015. Kidney tumor staging using surface-enhanced Raman scattering. J Biomed Opt 20:047002. doi: 10.1117/1.JBO.20.4.047002. [DOI] [PubMed] [Google Scholar]

[B41] 41.Ahmed M, Almagedi S, Yao W. 2013. SERS signatures of foodborne pathogenic zoonotic bacteria using gold colloid. Int J Eng Sci Technol 5:810. [Google Scholar]

[B42] 42.Chao Y, Zhang T. 2012. Surface-enhanced Raman scattering (SERS) revealing chemical variation during biofilm formation: from initial attachment to mature biofilm. Anal Bioanal Chem 404:1465–1475. doi: 10.1007/s00216-012-6225-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B43] 43.Kim J, Park H, Kim J, Chang B, Park H-K. 2017. Label-free detection for a DNA methylation assay using Raman spectroscopy. Chin Med J (Engl) 130:1961–1967. doi: 10.4103/0366-6999.211874. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B44] 44.De Luca AC, Dholakia K, Mazilu M. 2015. Modulated Raman spectroscopy for enhanced cancer diagnosis at the cellular level. Sensors (Basel) 15:13680–13704. doi: 10.3390/s150613680. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B45] 45.Zhang D, Neumann O, Wang H, Yuwono V, Barhoumi A, Perham M, Hartgerink J, Wittung-Stafshede P, Halas N. 2009. Gold nanoparticles can induce the formation of protein-based aggregates at physiological pH. Nano Lett 9:666–671. doi: 10.1021/nl803054h. [DOI] [PubMed] [Google Scholar]

[B46] 46.Chen W, Lin J, Chen R, Feng S, Yu Y, Lin D, Huang M, Shi H, Huang H. 2015. Detection and identification of Huo–Xue–Hua–Yu decoction (HXHYD) using surface-enhanced Raman scattering (SERS) spectroscopy and multivariate analysis. Laser Phys Lett 12:e045602. doi: 10.1088/1612-2011/12/4/045602. [DOI] [Google Scholar]

[B47] 47.Chaturvedi D, Balaji S, Bn VK, Ariese F, Umapathy S, Rangarajan A. 2016. Different phases of breast cancer cells: Raman study of immortalized, transformed, and invasive cells. Biosensors 6:57. doi: 10.3390/bios6040057. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B48] 48.Suzuki T, Takahashi K, Uehara H, Yamanobe T. 2013. Application and analysis of a DSC-Raman spectroscopy for indium and poly(lactic acid). J Therm Anal Calorim 113:1543–1549. doi: 10.1007/s10973-013-3098-z. [DOI] [Google Scholar]

[B49] 49.Rusciano G, Capriglione P, Pesce G, Del Prete S, Cennamo G, Di Cave D, Cerulli L, Sasso A. 2013. Raman microspectroscopy analysis in the treatment of acanthamoeba keratitis. PLoS One 8:e72127. doi: 10.1371/journal.pone.0072127. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B50] 50.Zhang X, Yin H, Cooper J, Haswell S. 2008. Characterization of cellular chemical dynamics using combined microfluidic and Raman techniques. Anal Bioanal Chem 390:833–840. doi: 10.1007/s00216-007-1564-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B51] 51.Chisanga M, Muhamadali H, Ellis DI, Goodacre R. 2018. Surface-enhanced Raman scattering (SERS) in microbiology: illumination and enhancement of the microbial world. Appl Spectrosc 72:987–1000. doi: 10.1177/0003702818764672. [DOI] [PubMed] [Google Scholar]

[B52] 52.Nguyen EP, Chrimes A, Brkljb a.R, Ou J, Berean K, Zhuiykov S, Kalantarzadeh K. 2013. Assessment of a Raman micro-spectroscopy/microfluidics unit using a model E. coli/glucose bio-system, p 157–162. In The 7th IEEE International Conference on Nano/Molecular Medicine and Engineering. IEEE, New York, NY. [Google Scholar]

PERMALINK

Discrimination between Carbapenem-Resistant and Carbapenem-Sensitive Klebsiella pneumoniae Strains through Computational Analysis of Surface-Enhanced Raman Spectra: a Pilot Study

Wei Liu

Jia-Wei Tang

Jing-Wen Lyu

Jun-Jiao Wang

Ya-Cheng Pan

Xin-Yi Shi

Qing-Hua Liu

Xiao Zhang

Bing Gu

Liang Wang

Roles

ABSTRACT

INTRODUCTION

RESULTS

Raman spectra of CRKP and CSKP strains.

FIG 1.

TABLE 1.

Supervised machine learning algorithms.

TABLE 2.

FIG 2.

FIG 3.

Effects of signal-to-noise ratio on machine learning accuracy.

FIG 4.

DISCUSSION

Conclusion.

MATERIALS AND METHODS

Bacterial strains.

Preparation of AgNO3 solution.

Surface-enhanced Raman spectroscopy.

Raman spectra data analysis.

Machine learning algorithms.

FIG 5.

FIG 6.

Construction of confusion matrix.

Effects of SNR on machine learning accuracy.

Supplementary Material

ACKNOWLEDGMENTS

Footnotes

Contributor Information

REFERENCES

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Preparation of AgNO₃ solution.