Summary
Molecular phenotypic variations in metabolites offer the promise of rapid profiling of physiological and pathological states for diagnosis, monitoring, and prognosis. Since present methods are expensive, time-consuming, and still not sensitive enough, there is an urgent need for approaches that can interrogate complex biological fluids at a system-wide level. Here, we introduce hyperspectral surface-enhanced Raman spectroscopy (SERS) to profile microliters of biofluidic metabolite extraction in 15 min with a spectral set, SERSome, that can be used to describe the structures and functions of various molecules produced in the biofluid at a specific time via SERS characteristics. The metabolite differences of various biofluids, including cell culture medium and human serum, are successfully profiled, showing a diagnosis accuracy of 80.8% on the internal test set and 73% on the external validation set for prostate cancer, discovering potential biomarkers, and predicting the tissue-level pathological aggressiveness. SERSomes offer a promising methodology for metabolic phenotyping.
Keywords: molecular phenotype, metabolite, surface-enhanced Raman spectroscopy, SERS, prostate cancer, diagnosis, biomarker
Graphical abstract

Highlights
-
•
SERSomes profile the metabolic phenotypes of various biofluids
-
•
SERSomes improve diagnostic accuracy with deep learning models
-
•
SERSomes facilitate metabolite biomarker discovery
Bi et al. develop a hyperspectral surface-enhanced Raman spectroscopy-based technique, SERSome, to profile the metabolic phenotypes of various biofluids, including cell culture medium and human serum, offering a methodology for accurate diagnosis and metabolite biomarker discovery with rapidity, low cost, and high sensitivity for fundamental studies and clinical applications.
Introduction
Blood-based testing has recently emerged as a genuinely viable alternative to tissue biopsies in cancer treatment1 due to its improved flexibility, reliability, and reduced invasiveness.2 Since only few diseases can be diagnosed by a single biomarker or a small handful of biomarkers, any technique that is low cost, rapid (ideally within minutes), and capable of detecting an extensive range of molecules should be highly desirable in clinics. A cheap and fast analysis with a small sample volume is a general goal of every blood test approach for large-scale and longitudinal testing in the clinical setting to enable swift therapeutic decisions and intra-operative adjustments.3,4
Molecular phenotyping, a multiplexed detection strategy, enables a system-wide readout of the physiological status5 and the fundamental feature of cancer development in general.6 Here, the molecular phenotype in the metabolite level can be leveraged to reflect the most downstream response to the carcinogenesis.7 However, among the types of biological information that have been investigated as potential candidates in blood-based tests, metabolite phenotyping has yet to achieve practical success, though some well-established approaches, such as mass spectrometry (MS) and nuclear magnetic resonance spectroscopy, are capable of generating multivariate metabolic profiles.8 These methods often require expensive instruments and extensive pretreatment procedures, impeding their clinical applicability.9,10,11
Raman spectroscopy, offering label-free and non-destructive molecular fingerprinting,12,13 has great potential for molecular phenotyping and has been employed to detect various biomolecules.14 Ramanome, a set of Raman spectra collected from a cell population, has been used to profile cellular responses to particular stresses and investigate biological mechanisms based on spectral signatures.15,16 However, it is not sensitive enough to examine small metabolites in bioliquids due to their intrinsically low Raman cross-sections. Surface-enhanced Raman spectroscopy (SERS), assisted by plasmonic nanomaterials, can detect molecules down to the single-molecule level and quantify a wide range of molecules, exhibiting high potential in metabolic phenotyping for pan-cancer or systems-wide characterizations.17,18
One of the major reasons for this limited success in metabolic phenotyping by present SERS approaches is, ultimately, poor reproducibility.19 The label-free method relies on stochastic adsorption of molecules to the metal plasmonic nanoparticles (NPs) for significant enhancement of the molecular signatures; generally leading to an uncontrollable random distribution of adsorbed molecules among the electromagnetic hotspots within the NPs.20 This hinders the comprehensive and unambiguous identification of the metabolic composition if only single or a couple of spectra have been acquired for each sample. Another non-negligible reason lies in the lack of proper pretreatment of the samples (specifically, large molecules, including proteins and nucleic acids) that were not removed in previous works.21 As a result, the metabolites are impeded from approaching the hotspots; thus, the signatures of metabolites cannot be fully presented.
In this work, we developed an ultrasensitive and tractable SERS-based technique to robustly profile molecular phenotypes at the metabolite level in complex biological fluids, such as cell culture medium and human serum. Rapid single-step filtration of the biological fluids is applied in advance to remove macromolecules so that the metabolites have maximal access to the hotspots, achieving a high signal-to-noise ratio of the metabolic signatures. We refer to the spectral set of each sample as a “SERSome,” comprising multiple SERS spectra (e.g., 200) acquired at various locations within the sample, as shown in Figure 1. In this way, the molecular composition is comprehensively fingerprinted, embodying rare events and weak signals that might be overlooked if only handful of spectra are acquired for one sample. In addition, this method requires no more than 10 μL of biofluidic metabolite extraction and completes measurements in 15 min. We demonstrate that this method exhibits remarkable sensitivity, from the nanomolar to the millimolar range, and allows the identification of subtle differences and correlations in SERS features between cultured cells during growth. Additionally, we reveal metabolic changes in the human serum of prostate cancer (PCa) and benign prostatic hyperplasia (BPH) patients using SERSomes, unveiling potential biomarkers. Furthermore, we show that classification based on SERSomes dramatically improves the diagnosis of PCa and the prediction of tissue-level aggressiveness compared to conventional biomarkers, like prostate-specific antigen (PSA) and other clinical measures. Thus, this work introduces SERSomes as a powerful tool for the rapid and sensitive characterization of metabolites in complex biological fluids, with immediate applications in cancer liquid biopsies and broader implications for basic research and clinical practice.
Figure 1.
Concept of SERSome
The SERS spectral set, collected pointwise from the bioliquid sample mixed with the SERS NPs, is referred to as a “SERSome,” effectively reflecting the molecular fingerprint phenotype at the metabolite level. Various analyses, including quantification and correlation assessments, can be conducted using SERSomes of different samples to unveil the underlying biological mechanisms and realize clinical diagnosis.
Results
Workflow of SERSome measurements
The goal of metabolic SERSome is the rapid generation of a comprehensive metabolite profile of a biological fluid with high sensitivity. Briefly, biofluid samples are first filtered to remove large molecules (such as proteins, nucleic acids, and polysaccharides) and then mixed with the metallic NPs. Metabolites adsorbed or adjacent to the NPs exhibit a significant enhancement in their Raman spectra. Multiple measurements of the sample are then obtained by pointwise excitation and detection across different regions in the sample using the confocal Raman system. Due to the high level of complexity of the biofluid composition, the molecules close to NPs differ in type and number in the different probed volumes, producing spectral variation across different acquisitions. Consequently, multiple measurements of the sample are required to obtain a spectral set as a SERSome specific to the sample that can robustly characterize its metabolite composition. For each spectral set, calculation of the Pearson correlation coefficient (PCC) between different averaged spectrum subsets is used to determine the minimum number of spectra needed for adequate robustness.22 This process is similar to the collection of a fingerprint as the security code on a cell phone, during which more complete information about the fingerprint is collected by constantly changing the angle of the fingers. Similarly, the SERSomes can be considered as the metabolite fingerprints of biofluids and leveraged to explore the metabolic variation with time and treatment. Correlation among the metabolic patterns and the heterogeneity among different samples can also been unveiled based on the SERSomes (Figure 1).
Typically, measurements were performed using a quartz capillary (inside diameter, 1 mm) containing a 10-μL mixture of plasmonic NPs and the biofluidic sample in a confocal Raman system equipped with a 638-nm laser (12.67 mW) and a 10× objective lens (Figure 2A). The spectra were collected in the pointwise scanning mode with adequate spacing between adjacent acquisitions. Specifically, 10 μm was chosen as the step size in this work, though larger ones were also feasible. This avoided measuring the same metabolite molecules repeatedly, thus enabling comprehensive metabolic profiling (Figure S1). For metabolite detection, citrate-reduced Ag NPs without additional surface modification were selected because they can provide surfaces capable of adsorbing a wide variety of metabolites.23 For certain types of samples, the same amount of Ag NPs (concentration, 0.4 nM as synthesized) was applied for reproducible sampling. Additionally, they are easy to synthesize (and possibly mass produced), highly stable in a variety of biofluids, and, most importantly, exhibit excellent Raman enhancement for single-molecule detectability (Figure S2). The Ag NPs showed an extinction peak at 443 nm (Figure 2B), a zeta potential of −29.6 mV, and a hydrodynamic diameter of approximately 89 nm (polydispersity index, 0.411) (Figure 2C). It is worth noting that the hydrodynamic diameter was slightly larger than the size (61 ± 13.8 nm) determined from transmission electron microscopy (TEM) (Figure 2D) owing to the contribution of larger NPs to light scattering.
Figure 2.
SERSome measurement
(A) A mixture of metabolites and SERS NPs is introduced into a quartz capillary and measured using a confocal Raman system operating in one-dimensional scanning mode.
(B–D) Characterizations of Ag NPs used for SERSome measurements: (B) absorbance spectrum, (C) histogram of the hydrodynamic diameters measured via dynamic light scattering, and (D) TEM image.
(E–J) Representative SERSomes derived from various biofluidic samples with diverse molecular compositions: (E) cysteamine + riboflavin, (F) phenylalanine + nicotinic acid, (G) adenine + uric acid, (H) HepG2 cell culture medium, (I) human serum, and (J), human urine. Each SERSome is represented by a heatmap of a spectral set consisting of 100 spectra. For clarity, a mean spectrum (black line, n = 100) is also shown at the bottom.
To provide an intuitive illustration of the capabilities of metabolic SERSomes, we initiated our study by preparing three artificial metabolite mixtures (Figures 2E–2G), such as cysteamine + riboflavin, as well as three types of genuine biofluid samples, including cell culture medium (Figure 2H), human serum (Figure 2I), and human urine (Figure 2J). These samples were combined with Ag NPs, and the measurements were conducted as described above. It can be seen that the SERSomes derived from these diverse samples effectively profiled distinct molecular phenotypes, in accordance with their respective molecular compositions. In addition, our approach unveiled rare fingerprints with the SERSomes, which might have been overlooked if relying solely on average spectra for metabolic profiling (Figure S3). Therefore, SERSomes show immense potential in a wide range of biological and clinical applications.
Quantification via SERSomes with specifically prepared samples
To demonstrate the effectiveness of SERSomes, we prepared three distinct samples, denoted S1, S2, and S3. Each sample contained 20 common metabolites, such as cytosine and adenine, acquired from commercial sources at various concentrations within a range of 0.015–1.5 mM (see Table S1 for precise compositions). This setup aimed to mimic biofluids with relatively simple compositions (e.g., serum-free cell culture). The metabolite concentration range was chosen mainly in consideration of detection sensitivity of the benchmark MS technique. As shown in Figure 3A, 200 Raman spectra were collected within each sample along a designated line, a process completed in approximately 5 min. It is evident that, for each sample, notable differences in peak positions and intensities among the spectra were observed, as expected due to the inherent stochasticity in molecular enhancement across the probed volumes (Figures 3A and 3B). To determine the minimum number of spectra needed for adequate robustness in the final profile, we computed the PCC between two averaged spectra derived from two randomly selected subsets of the entire spectral set. This analysis revealed that PCC curves rapidly converged to values close to 1 when approximately 40 spectra were averaged together (Figure 3C). Consequently, to ensure maximal robustness in final averaged spectra, we analyzed the full complement of 200 spectra for each sample. Next, we monitored the Raman intensities at specific wavenumbers (649, 723, 786, and 1,492 cm−1) corresponding to SERS-specific peaks of four metabolites: guanine (ring breathing mode24), adenine (ring breathing mode25), cytosine (ring breathing mode26), and tyrosine (ring vibration27), respectively (Figures 3A and 3B). Although the intensity at specific bands of these four components varied among the 200 spectra within each sample, the collection of values for each component obtained from the 200 measurements in each sample (e.g., the mean value) demonstrated the expected trends among different samples (Figure 3D). To further validate this observation, the concentrations of each of the four metabolites were determined using MS. As displayed in Figure 3D, the intensities of the specific peaks exhibited a high degree of correlation with the MS results (R2 ≥ 0.90), reinforcing the accuracy of our approach (also refer to Figure S4).
Figure 3.
Metabolic SERSomes of specifically prepared biofluidic samples
(A) SERSome heatmaps (top, consisting of 200 spectra) of three artificial samples (S1, S2, and S3), showing variations in metabolite concentrations, including guanine, adenine, cytosine, and tyrosine, and the corresponding averaged spectra (bottom). The composition of S1–S3 is shown in Table S1.
(B) Typical SERS spectra (black) from S3 as indicated in (A). The standard SERS spectra of guanine (red), adenine (orange), cytosine (green), and tyrosine (blue) are presented at the bottom, and their specific SERS bands are marked with color-matched stripes here and in (A).
(C) Pearson correlation coefficient (PCC) versus the number of spectra per spectral set.
(D) The correlation between the mean intensity of the SERS bands associated with the four metabolites (n = 200) and the concentration measured by mass spectrometry (MS), indicating the quantification potential of SERS spectra.
Next, we aimed to ascertain the minimal resolvable concentration achievable with SERSomes. It is well known that SERS can detect molecules at much lower concentrations compared to conventional MS. Indeed, our findings indicate that concentrations as low as nanomolar levels could be resolved by this method, a capability beyond the detection limits of conventional MS (Figure S5; Table S2).28 Thus, overall, these results illustrate that SERSomes are capable of robustly determining the identity and relative abundance of multiple metabolites in biological fluids from concentrations spanning from the nanomolar to the millimolar range.
Cell metabolic variation revealed by SERSomes
Comprehensive characterization of the metabolic changes in cell cultures under various conditions is important for a number of applications and particularly relevant for drug development endeavors.29 Prior to analyzing actual samples, we assessed the effects of different amounts of originally included or xenobiotic metabolite molecules doped into the culture medium (i.e., DMEM without fetal bovine serum). Our results showed that concentration variations of these molecules can be well reflected by the intensities at characteristic peaks obtained from SERSome measurements (Figure S6). We also confirmed stable spectral fingerprints of metabolites under typical working conditions (i.e., pH and ionic strength). Especially for the salt concentration far exceeding the normal conditions (>100 mS/cm), reproducible profiling can still be guaranteed by using our method (Figure S7). To this end, we used SERSomes to examine the metabolic differences in the extracellular milieu surrounding liver cancer HepG2 cells as a function of time, both in the presence or absence of methotrexate (MTX), a common clinical anti-cancer drug that disrupts the synthesis of tetrahydrofolate (Figure 4A).30 After the initial day, the cells were further cultured for 3 days only (i.e., D0–D3) in order to prevent contact inhibition. Then, 400 μL of the culture medium samples with cell incubation were collected daily, and macromolecules potentially interfering with metabolites approaching the hotspots on the NP surface due to factors such as protein corona were removed by filtration using a 3-kDa-cutoff membrane (Figure S8).31,32 Next, 10 μL of each sample was mixed with Ag NPs at a ratio of 1:9 for optimal enhancement (Figure S9A). Again, for Raman measurement, we obtained 200 spectra and confirmed that this number was sufficient for subsequent SERSome analysis (Figure S10). Heatmaps of the complete spectral sets are shown for the HepG2 group and HepG2 + MTX group from D0 to D3 (Figure 4B), along with several individual spectra from the set of HepG2+MTX on D1 (Figure 4C). Both indicate significant fluctuations in peak positions and intensities among the different spectra in this measurement, emphasizing the necessity of analyzing them only in toto. This observation has also been confirmed by the t-distributed stochastic neighbor embedding (t-SNE), a method for visualizing high-dimensional spectral data in a lower-dimensional space (i.e., 2D). As shown in Figure 4D, increasing the number of spectra in one SERSome from 5 to 200 spectra per set substantially enhances the classification accuracy among different samples. With 200 spectra acquired for each SERSome, overlapped distributions are observed between two individually cultivated plates of cells during the initial 2 days (D0 and D1). However, following MTX treatment, the SERSomic patterns started to segregate into two distinct groups on D2, further separating on the subsequent day (D3) because of the enhanced response to MTX. This emphasizes the significance of collecting multiple spectra in the spectral set for SERSome analysis.
Figure 4.
Differences in cell metabolism in cultured cells
(A) Bright-field images of HepG2 cells during culture (D0, no cells, not provided; D1; D2; and D3). MTX was introduced to the HepG2+MTX group after D1. Scale bar: 100 μm.
(B) SERSome (200 spectra) heatmaps for each sample.
(C) Representative spectra from D1 in the HepG2+MTX group, as indicated in (B).
(D) t-SNE plots generated using varying numbers of spectra per SERSome (i.e., 5, 10, 50, and 200 spectra).
(E) Correlation analysis among the SERS peaks. Line colors and endpoints indicate changes during the culture process, with orange denoting increased intensities, blue indicating decreased intensities, and purple signifying intensity independence with time. The darkness of the line color and the size of the endpoints represent the extent of correlation, with the darker lines and larger endpoints indicating stronger correlation between two bands.
(F) ANOVA between HepG2 and HepG2 groups on each day. Colors represent the maximum p values at each wavenumber from three parallel experiments after Bonferroni correction.
(G) The intensity fold change (FC) of HepG2+MTX compared with HepG2 (). Only FCs with a consistent trend in the three parallel experiments are displayed as numbers with minimum absolute value, while others are assigned as zero.
(H) Histogram illustrating the percentage of wavenumbers with corrected p values smaller than 0.05 and nonzero FCs.
(I) Screened SERS bands with corrected p values smaller than 0.05 and nonzero FCs (black, HepG2; red, HepG2+MTX). Error bars, standard deviation, n = 200.
Notably, significant correlations can be found in each group throughout the culture process and among different SERS bands, as demonstrated by the Pearson correlation analysis involving all SERSomes (D0–D3) (Figure 4E). To elucidate the potential metabolic variations, we inferred the detailed metabolites based on the important SERS bands according to the vibrational modes reported in the literature (Table 1). For example, SERS bands at 1,103, 1,108, 1,118, 1,354, and 1,399 cm−1, primarily attributed to phenylalanine,33 threonine,34 glucose,35 lipids,36 adenine,37 and guanine,36 respectively, exhibit decreasing intensities in both HepG2 and HepG2+MTX because of the consumption of essential amino acids and other chemicals from the culture medium for cell proliferation.38,39,40 Metabolite conversions can also be evidenced through negatively correlated SERS bands during culture.41 The decrease in intensity at 1,118 cm−1, accompanied by an increase at 1,399 cm−1, aligns with glucose consumption in the tricarboxylic acid (TCA) cycle, which drives biosynthetic pathways (e.g., fatty acids synthesis) in cancer cells.42 This negative correlation was weakened in the MTX-treated group because of reduced TCA cycle activity induced by MTX.43,44 In the HepG2+MTX group, the SERS bands at 1,266 cm−1 intensify over time, accompanied by several negative correlations (e.g., 1,354 cm−1), indicating MTX-induced adenosine release45 into the extracellular space for anti-proliferation effects,46,47 which was consistent with the result obtained from the cell viability test (Figure S11).
Table 1.
Assignments to some typical SERS bands for HepG2 SERSomes
| Wavenumber | Vibration mode | Assignment |
|---|---|---|
| 1,103 cm−1 | symmetric ring breathing | phenylalanine33 |
| 1,108 cm−1 | C-C-O out-of-phase stretch | threonine34 |
| 1,118 cm−1 | C-C stretching | glucose35 |
| 1,252 cm−1 | alkyl = C–H cis stretching | lipid48 |
| 1,266 cm−1 | H bending and ring stretching | adenosine45 |
| 1,278 cm−1 | symmetric stretching of PO43– | lipid49 |
| 1,317 cm−1 | C=C ring breathing | guanine34 |
| 1,339 cm−1 | C=C ring breathing | adenine50 |
| 1,354 cm−1 | CH3CH2 wagging | guanine, adenine, tryptophan36,37,51 |
| 1,399 cm−1 | COO− symmetric stretching | fatty acids36 |
| 1,592 cm−1 | C-N stretching | nucleotides52,53 |
| 1,658 cm−1 | C-NH3+ angle bending deformation | glucosamine54 |
Obvious differences can also be found between groups with or without MTX treatment. Therefore, we further conducted a comparative analysis of SERSomes from the two groups on a daily basis using two statistical methods: one-way analysis of variance (ANOVA) and fold change (FC). Here, data from three parallel experiments were used to minimize randomness. As shown in Figure 4F, in the initial 2 days, few bands exhibit ANOVA p < 0.05, calculated by the maximum p values among the three experiments after Bonferroni correction. However, starting from D2, more bands present a different distribution in intensity between HepG2 and HepG2+MTX, with a substantial increase on D3. Similar trends can be observed in the FC analysis (Figure 4G). Figure 4H summarizes the percentage of significantly varied SERS wavenumbers each day, illustrating drastic metabolic changes caused by MTX treatment. By the combination of ANOVA and FC, we screened five major bands with corrected p < 0.05 and the nonzero FCs on the last 2 days (D2 and D3) (Figure 4I). Here, reduced intensities at 1,244–1,253 and 1,275–1,280 cm−1 may indicate decelerated lipid production48,49,55 resulting from suppressed proliferation process caused by MTX. Reduced utilization of nucleotides52,53 and glucosamine54 may also be inferred by the less reduced intensity at 1,555–1,627 and 1,651–1,664 cm−1 on D2, respectively. It is worth mentioning that some small peaks, such as 1,275–1,280 cm−1, which are observable in only a subset of spectra, underscore the superiority of SERSomes in capturing rare signal events and enhancing the information obtained from label-free SERS metabolite detection (Figure S12).
Collectively, these findings confirm the accuracy and effectiveness of using SERSomes for molecular phenotyping, particularly in elucidating intriguing metabotype differences arising from MTX presence. This method, characterized by its multiplex nature and high sensitivity, provides a more comprehensive understanding of the metabolic changes induced by this drug.
PCa diagnosis using serum SERSomes
Inspired by the successful applications of SERSomes in cultured cell medium, we conducted experiments to assess their capability to extract metabolic information from human serum for the diagnosis of PCa, which is one of the most common cancers. Currently, though serum PSA is now used as the only biomarker for PCa screening,56 it is still challenging to effectively distinguish between PCa and BPH using the current preoperative methods.57 Therefore, we proposed an intelligent system with SERSomes for PCa diagnosis and pathological stratification prediction based on serum that is relatively invulnerable to external interfering factors and minimally invasive to obtain. Over approximately 2 years, we collected serum samples from 241 participants (85/83/73, PCa/BPH/control) as the training cohort, 26 participants (7/9/10, PCa/BPH/control) as the validation cohort, and another distinct 68 participants (24/23/21, PCa/BPH/control) as the test cohort (Figure 5A). Here, for the training cohort, power analysis has been performed in advance with a pilot cohort of 21 participants (7/7/7, PCa/BPH/control) and indicate a minimum number of 75 for each group (75/75/75, PCa/BPH/control) required for classification with the prediction power of 0.8 at a false discovery rate of 0.1. This confirmed that the current sample size of the training cohort was sufficiently large (Figure S13).
Figure 5.
Diagnosis and biomarker discovery of PCa using serum samples
(A) The demographic information for the training, validation, and test cohorts. Age distribution is summarized by the maximum, mean, and minimum values.
(B) A schematic of the algorithm for PCa diagnosis based on SERSomes. The workflow involves (i) colloidal background subtraction using BRNet, followed by (ii) generation of auxiliary SERSomes using DANet. Subsequently, (iii) PCDNet is trained for diagnosis, and (iv) model interpretation is implemented to identify salient bands.
(C) Confusion matrices of the diagnosis result using the proposed approach and PSA values in the test group.
(D) A 2D visualization of deep features extracted from PCDNet using t-SNE.
(E) ROC curves depicting the performance of PCa diagnosis under different implementation settings for the test cohort.
(F) Ablation studies in the test cohort on data augmentation and background removal and the relation between accuracy/PCC and the number of spectra per SERSome. PCC is displayed as the mean (solid orange line) with the standard deviation (yellow shade) from all SERSomes.
(G) Confusion matrices illustrating the pathological stratification prediction of GSs based on multimodal data (SERSomes and clinical data), SERSomes only, and clinical data only.
(H) ROC curves corresponding to pathological stratification prediction based on different data modalities.
(I) Saliency spectra for layers 1, 2, and 3 for PCa diagnosis, indicated by the solid line for the mean value and the shade for the standard deviation (PCa, n = 22; BPH, n = 14; control, n = 19).
To ensure a robust measurement of human serum SERSomes, we sequentially collected 200 spectra for each sample using 10-μL metabolite extraction obtained from 100 μL serum by ultra-filtration. To characterize the sensitivity of this method in serum, a spike-in experiment was also performed in advance, showing that a nanomolar level of metabolite concentration difference can be well profiled (Figure S14). In the first step, we employed an end-to-end measurement calibration deep model, Background Removal Network (BRNet) (see details in Table S3) to eliminate the background signals generated by Ag NPs. This calibration is essential to minimize systematic errors introduced during sample collection and measurement in real-world applications.58 BRNet was trained by the BR dataset consisting of artificially constructed spectrum pairs with a known background to expose and subtract the background from real measurements (Figure 5Bi). Notably, BRNet outperformed existing methods that determine the background contribution by non-negative linear squares and the intensities of characteristic peaks (Figure S15).59,60 It is also worth mentioning that this approach can be applied to other SERS research with different diseases and sample types as long as background removal is required to eliminate the systematic error and improve the data quality. Further, to address training data scarcity issues, we introduced a class-by-class trained 1-dimension diffusion model, Data Augmentation Network (DANet), to augment the training set by generating auxiliary SERSomes61,62,63 (Figure 5Bii).
Considering the SERSomes embodied both spatial and spectral features in a 2D spectrogram format (matrix size = 200 spectra × 724 wavenumbers/spectra), in analogy to a Mel spectrogram,64 we employed the Prostate Cancer Diagnosis Network (PCDNet) (see details in Table S4), which utilizes a convolutional neural network to extract spectrogram texture for the multiple classifications of PCa, BPH, and control (Figure 5Biii).
As a result, SERSomes empowered by deep learning exhibited an obvious improvement in discriminating PCa and BPH compared to other screening methods, such as PSA and the prostate imaging-reporting and data system, achieving an overall accuracy of 80.8% (Figures 5C and S16). In the multi-class receiver operating characteristic (ROC) analysis, the areas under the curve are 0.868 for PCa, 0.835 for BPH, and 0.994 for control, indicating that SERSomes from PCa patients are more likely to be correctly predicted as PCa with a probability of 0.868. In addition, the accuracy of distinguishing between PCa and non-cancers (BPH and control) using SERSomes combined with our system (82.35%) has also been significantly improved (19.12%) compared with using PSA (63.23%) (Figure 5C) Specifically, the sensitivity, specificity, and positive and negative predictive values for PCa were 91.67%, 77.27%, 68.75%, and 94.44%, respectively (Figure S17). Moreover, to demonstrate the clinical generalizability, we recruited additional samples from two different institutions as an external validation set, which were measured using the identical protocol followed by BRNet, Gaussian normalization in the spectral dimension, and calibration by the Combining Batch algorithm.65 As the result, our method presented consistently superior predictability of PCa (Figure S18). All of the above highlights the potential of our system for more accurate and minimally invasive preoperative screening of PCa, which could mitigate overdiagnosis and overtreatment.66 We visualized the distribution of the 6,144-dimension feature vector extracted by the spectral-spatial encoder of PCDNet using t-SNE (Figure 5D), demonstrating that spectrogram features form stable and separatable clusters. Furthermore, the consistency between the deep feature clustering and the PSA value distribution (the area of the dots related to the PSA value) validated the reliability of PCDNet.
To further verify the necessity of each module in the proposed system, in the test cohort, we carried out ablation studies by implementing the above operations without background removal or data augmentation, which can only achieve an overall average accuracy of 58.8% and 69.1%, respectively (Figures 5E and 5F). Specifically, the diagnosis accuracy increased sharply (5.8% ± 2.5%) as the number of spectra per SERSome increased from 1 to 25 and gradually (3.4% ± 1.3%) from 50 to 200, consistent with the result of the PCC (Figure 5F), emphasizing the importance of acquiring an adequate number of spectra in a single complex biofluid sample to comprehensively profile the metabolic phenotype. Moreover, the whole algorithm is robust, as proven by reproducible results from 9 repeated computations with consistent accuracy on reassigned training and test sets (Table S3). Results from the training cohort and the validation cohort can be found in Figure S19.
The Gleason score (GS) serves as a routine protocol for pathological stratification of PCa,67 typically obtained only from invasive tissue biopsy (see details in Figure S20). If a minimally invasive technique can help to predict the GS, it would be highly preferred. In this regard, we explored the fusion of spectral features with some clinical measures (T stage, N stage, and PSA value; Figure S21) to predict the GS (Figure 5Biii). As shown in Figures 5G and 5H, the accuracy of GS prediction using SERSomes fused with clinical indicators reached 71.4%, exhibiting a 33.3% improvement over using clinical indicators alone. Remarkably, using SERSomes alone also lead to a 28.6% increase in prediction accuracy. When distinguishing highly malignant PCa (GS = 8, 9) from the moderately malignant cases (GS = 6, 7), the accuracy of using SERSomes fused with clinical indicators was 90.48%, significantly outperforming the use of clinical indicators alone (71.43%). This not only demonstrated that SERSomes contain important information of pathological prediction but also established that our method is transferable and capable of extracting sufficient features. In summary, these findings, though serving as a first try and definitely requiring further improvement regarding accuracy by collecting more training samples, indicated the potential of predicting highly malignant PCa using minimally invasive methods in the future and underscored its clinical significance.
Uninterpretable “black boxes” have long been a fundamental issue for deep learning methods.68 In this regard, we implemented an interpretable network to score the importance of spectrogram features and infer potential metabolite biomarkers (Figure 5Biv). We applied Gradient-weighted Class Activation Mapping++ (Grad-CAM++) to retrospectively excavate the most important features contributing to the diagnosis model.69 For each SERSome, a saliency map was generated to reveal the significant features at each wavenumber and spectral position (Figure S22). By matching serum SERSomes with the detailed SERS bands indicated by saliency spectra compressed from the saliency maps on the spatial dimension (Figure 5I), we uncovered a potential PCa biomarker (Table 2). Specifically, bands at 480 and 1,197 cm−1 can be attributed to serum uric acid, an antioxidant associated with increased PCa risk due to the production of reactive oxygen species and impairment of the antioxidant defense mechanism.70,71,72 Hypoxanthine, inferred from peaks at 718, 1,368, and 1,394 cm−1, and uracil, with a peak at 1,190 cm−1, are closely relevant to the perturbed purine and pyrimidine metabolism pathways involved in DNA and RNA synthesis, consistent with previous research.73,74,75 Steroids, including testosterone, which might be evidenced by the peak at 1,540 cm−1,76,77 has also been found previously to be closely related to the risk of PCa.78 Salient bands at 417 and 1,570 cm−1 indicate the dysfunctional tryptamine metabolism in PCa, inducing a combined modulation of immune escape and DNA repair mechanisms with its downstream metabolites.79,80 Salient maps enhance the interpretability of the deep learning models, offering dual benefits in diagnosis and biomarker discovery.
Table 2.
Assignments to some typical SERS bands for serum SERSomes
| Wavenumber | Vibration mode | Assignment |
|---|---|---|
| 480 cm−1 | C–N–C ring vibrations | uric acid81 |
| 1,197 cm−1 | C-N stretching | uric acid82 |
| 718 cm−1 | C–H bending | hypoxanthine83 |
| 1,368 cm−1 | C=O stretching | hypoxanthine84 |
| 1,394 cm−1 | C-N stretching | hypoxanthine84 |
| 1,190 cm−1 | in-plane ring stretching | uracil85 |
| 417 cm−1 | out-of-plane benzene ring deformation | tryptamine86 |
| 1,125 cm−1 | C-H and NH3+ bending, in-plane benzene ring deformation | tryptamine86 |
| 1,540 cm−1 | C=O stretching | steroid76,77 |
| 1,570 cm−1 | NH2(NH3+) bending | tryptamine86 |
In short, our proposed system can be potentially integrated into the standard workflow for SERSome-based blood tests. It effectively addresses challenges such as systematic errors, data scarcity, complex spectral-spatial features, and relationship between SERSomes and physiological characteristics in clinical diagnosis and biomarker discovery. It is worth mentioning that, based on Grad-CAM++, we were able to visualize the important features for classification to evidence the potential biomarkers. This addressed the challenge of “black box” mode and lack of interpretation in current deep learning-based methods for analytical chemistry.87 In addition, our system has been proven to outperform the conventional algorithms (including the decision tree classifier, K-neighbors classifier, Gaussian naive Bayes classifier, and multi-layer perceptron classifier) based on the averaged spectra in diagnosis accuracy (Figure S23).
Discussion
In this work, we introduced the concept of metabolic SERSomes as a powerful tool for sensitive and reliable metabolic phenotyping in biological fluids. With proper pretreatment (i.e., ultra-filtration) of the biosamples, which has often been neglected by previous studies, we were able to focus on the metabolites without the interference from other macromolecules, such as proteins. The integration of SERSomes with deep learning represented an advancement in molecular phenotyping in (1) how to treat the multiple measurements for one certain sample and (2) how to use the spectral set for metabolic profiling.
For the first issue, the multiple measurements, instead of using the averaged spectra as in previous studies, were treated as different perspectives of one sample in its whole profile (i.e., SERSome) to fully capture the rare signals generated by trace molecules or molecules with low affinity and cross-sections. Here, the number of spectra in the SERSome was calculated in advance for reproducible profiling. This enabled a more robust and comprehensive metabolic phenotyping.
Another issue is realized with the assistance of deep learning models. The algorithm was specially designed and used the SERSome as a 2D image. Therefore, all features in the SERSome can be potentially excavated by the deep learning model, thus benefitting classification and biomarker discovery. This has been proven by comparison with other conventional AI methods based on the mean spectra or only a handful of spectra, showing improved diagnostic accuracy.
For a demonstration, we chose two scenarios: extracellular metabolism by culture medium and PCa by human serum. As a result, metabotypic variations in cell culture media in response to drug treatments and in the serum of individuals undergoing carcinogenesis were successfully detected and quantified with nanomolar sensitivity. SERSomes achieved an impressive accuracy of 80.8% on the internal test set and 73% on the external validation set for PCa diagnosis, outperforming conventional clinical measures. We also prospected the potential of incorporating SERSomes with clinical data for the prediction of tissue-level aggressiveness with minimally invasive methods. For real-world clinical use, the diagnostic accuracy should be further improved by recruiting more samples in the training cohort, developing more advanced deep learning algorithms, and incorporating other screening methods with little or no invasiveness.
Compared with existing methods such as MS, this technique requires less pretreatment and measurement time. In addition, the cost is relatively low, since only ordinary consumables and microliters of NP suspensions are required, and it is likely to be further reduced by using a portable Raman system or even a microspectrometer.88 In the future, this method should be further improved regarding sensitivity to detect a lower abundance of molecules and a comprehensive SERS database to facilitate model interpretation on the potential biomarkers. For real-world applications, urgency should be raised for the wide commercialization of Raman devices for improved spectral resolution with lower cost, broader manufacture, and downsizing. In a word, SERSome, offering a transformative approach to metabolite analysis, holds great promise for a wide range of applications in both fundamental research and clinical settings.
Limitations of the study
In terms of the biological mechanisms regarding the metabolic phenotypes, many more experimental validations should be performed in the future, such as investigating PCa-related cell lines to unveil the underlying cell-level pathology and explore the correlation between cell-level metabolic changes and systematic variations. Further efforts should also be made to incorporate our approach with other assays for higher diagnostic accuracy in future projects.
STAR★Methods
Key resources table
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Chemicals, peptides, and recombinant proteins | ||
| Citrate trisodium (98%) | Aladdin | S189183; CAS No.: 68-04-2 |
| Silver nitrate (AR, 99.8%) | Aladdin | S116264; CAS No.: 7761-88-8 |
| Potassium chloride (AR, 99.5%) | Aladdin | P112134; CAS No.: 7447-40-7 |
| Methotrexate | BBI Life Sciences | A600612; CAS No.: 59-05-2 |
| Deposited data | ||
| Code for serum data analysis | This paper | https://doi.org/10.5281/zenodo.11068334 |
| Experimental models: Cell lines | ||
| Human hepatocyte carcinoma cells | ATCC | HB-8065 |
| Software and algorithms | ||
| Python (version 3.9.16) | Python Software Foundation | https://www.python.org |
| PyTorch (version 1.13.1+cu116) | Python Software Foundation | https://www.python.org |
| Numpy (version 1.24.2) | Python Software Foundation | https://www.python.org |
| Matplotlib (version 3.7.1) | Python Software Foundation | https://www.python.org |
| pytorch_grad_cam (version 1.4.6) | Python Software Foundation | https://www.python.org |
| CUDA 11.4 | Nvidia Developer | https://developer.nvidia.com/ |
| cuDNN 8.0.5 | Nvidia Developer | https://developer.nvidia.com/ |
Resource availability
Lead contact
Further information and requests for resources should be directed to and will be fulfilled by the lead contact, Jian Ye (yejian78@sjtu.edu.cn).
Materials availability
This study did not generate new unique reagents.
Data and code availability
-
•
All data is available from the lead contact upon request.
-
•
The codes and data including the model, the training engine, the testing script as well as the model weight and partial test dataset have been deposited at Zenodo and are publicly available as of the date of publication. DOI is listed in the key resources table.
-
•
Any additional information required to reanalyze the data reported in this work paper is available from the Lead Contact upon request.
Experimental model and study participant details
Cell lines
Human hepatocyte carcinoma cells (HepG2) were obtained from the American Type Culture Collection (ATCC). Cells were maintained in Dulbecco’s Modified Eagle Medium (DMEM) supplemented with 10% fetal bovine serum (FBS) and 1:100 penicillin-streptomycin (104 units/mL) at 37°C and 5% CO2. 2 × 106 cells were cultured in T25 flasks in a total volume of 10 mL of medium, which was not changed during culture. After the adherence of cells, MTX was added to reach a final concentration of 5 μM at D1. Herein, MTX was directly resolved in DMEM without any other solvents. 400 μL of the culture medium was taken from each culture flask every day. For the analysis of extracellular metabolism, the culture medium was centrifuged at 200 g for 3 min to remove the dead cells and at 300 g for 20 min to get rid of the cell debris, subsequently. Both two groups (HepG2 and HepG2 + MTX) were performed in 3 replicates.
Human subjects
A total of 424 clinical samples were collected for this study (Table S4). Herein, for the training set, the validation set and the internal test set, 231 samples (training: 85 PCa and 83 BPH, validation: 7 PCa and 9 BPH, test: 24 PCa and 23 BPH) were obtained from the urology department in Ren ji Hospital, School of Medicine, Shanghai Jiao Tong University, and 104 samples (training: 73 controls, validation: 10 controls, test: 21 controls) were collected from the Physical Examination Center of Ren ji Hospital, School of Medicine, Shanghai Jiao Tong University. For the external validation set, 37 samples (15 PCa and 22 BPH) were obtained from the urology department in Ningbo Hangzhou Bay Hospital and 52 samples (25 PCa and 27 BPH) were obtained from the urology department in Jiading District Central Hospital Affiliated Shanghai University of Medicine and Health Sciences. All the samples were from distinct participants. The collection and usage of clinical samples have been approved by the ethics committee of Ren ji Hospital, School of Medicine, Shanghai Jiao Tong University (KY2021-030). Consent documents were obtained from all participants included in this study. All participants excluded other tumor history, anti-tumor medication history and metabolic syndrome. Clinical data of each participant including age and PSA value were collected for study analysis. PCa patients also underwent pelvic enhanced MRI and/or whole-body PET-CT examination. The T stage and N stage of PCa patients were determined by urologists according to the American Joint Committee on Cancer (AJCC) 8th staging system. Each BPH and PCa patient has been confirmed by transrectal prostate biopsy. The corresponding diagnosis was made by pathologists according to the H&E staining of the biopsy tissue. And the Gleason score was therewith determined for those with prostate cancer. The control participants have been excluded from BPH and PCa, and were aged from 50 to 70 years old which was comparable to the BPH and PCa group. To collect the serum for SERSome measurement, 3–5 mL fasting venous blood were collected by experienced nurses in the morning and placed at room temperature for stratification. After the centrifugation of the blood at 3000 rpm for 10 min under ambient condition, 500 μL serum was obtained and stored at −80°C until use. SERS measurements were acquired within 28 days upon serum collection. Before the SERS measurement, all samples were thawed in an ice box with crushed ice at 2°C–4°C. Power analysis was employed to compute the sample size of clinical serum samples based on the SERS spectra.
Method details
Instrumentations
A JEM-2100F transmission electron microscope (JEOL, Tokyo, Japan) was used to obtain the NPs’ morphology. UV-vis spectra were collected on a UV1900 UV-Vis spectrophotometer (Aucybest, Shanghai, China). The hydrodynamic diameter and the zeta potential of the NPs were characterized by a Zetasizer Nano ZSP (Malvern, UK). An NP tracking system (ZetaView, Merkel, German) was used to measure the concentration of the NPs. The mass spectroscopy for the 20-metabolite mixture was implemented with a UPLC-MS system (Acquity UPLC H-class/Xevo TQ-XS).
Synthesis of the SERS NPs
The citrate-reduced silver (Ag) NPs were synthesized according to Lee and Meisel’s method89 with minor modifications. Briefly, 9.2 mg AgNO3 was dissolved in 75 mL ultrapure water and heated to boiling under stirring. Then, 2 mL of citrate sodium solution (10.1 mg/mL) was added dropwise. The mixture was further kept boiling for 1 h without exposure to light and then cooled to room temperature under vigorous stirring. The product was stored in 4°C refrigerator and protected from light exposure before being used.
SERS measurement
For the specifically prepared metabolite mixtures, 20 metabolite molecules were revolved in pure water (Table S5). To collect the metabolic substances from the mixture with other macromolecules, the 3 kDa-cutoff filter (Millipore, Amicon Ultra-4, PLBC Ultracel-3) was used to centrifugate the culture medium and serum at 10000 rpm for 10 min. The metabolite extraction was mixed with the Ag NPs with the ratio of 1:9 for culture medium and 1:1 for serum in the first step. Then the mixtures were ultrasonicated for 10 s to ensure monodispersion and complete mixing and set still for full adsorption of molecules onto the surface of the Ag NPs, e.g., for 1 h used in this work. Before measurement, the mixtures were slightly ultrasonicated to avoid NP precipitation. Finally, 200 SERS spectra were collected for each sample on the confocal Raman system (Horiba, XploRA INV) for scanning along a quartz capillary (inside diameter: 1 mm, outside diameter: 2 mm) containing 10 μL of the NP-metabolite extraction mixture with the incident laser wavelength of 638 nm (power: 12.67 mW), the acquisition time of 5 s per spectrum and the step size of 10 μm between adjacent spectra through a 10× objective lens. Herein, each spectrum consisted of 724 datapoints within the spectral range of 200–2000 cm−1 by using the 600 gr/mm grating in the spectrometer.
Quantification and statistical analysis
Before analysis, all SERS spectra have been preprocessed by baseline removal with the software LabSpec 6 (method: poly; degree: 10; max points: 200). The Pearson correlation coefficient (PCC) is defined as , where represents the covariance between and , represents the variance of , and both and were the average of random sampling from the measured spectra.
The signals of cytosine, adenine, guanine and tyrosine in the synthetic samples were read by the max value within the band of 765–820 cm−1, 695-750 cm−1, 625-680 cm−1, and 1465-1520 cm−1, respectively.
For the cell culture medium, Pearson correlation was applied to evaluate the time-dependency and intra-independency of the SERS intensities. t-Distributed stochastic neighbor embedding (t-SNE) algorithm was used to visualize the distribution variability among different samples and the perplexity was optimized according to the spectrum number used for each sample (i.e., perplexity = 5 for 5 spectra/set, perplexity = 10 for 10 spectra/set, perplexity = 40 for 50 and 200 spectra/set). One-way analysis of variance and fold change were utilized to screen SERS bands with significant changes between two groups.
The SERSome data of human serum consists of 200 spectra scanning from different spatial positions. Each spectrum includes 724 spectral components (i.e., wavenumbers/data points collected by the Horiba instrument) ranging from 600 to 1800 . The whole process regarding the analysis of human serum SERSomes includes BRNet, DANet, PCDNet, explainable clinical deep learning and the prediction of Gleason score.
The NP background removal neural network—BRNet: In real applications, there is sometimes a dilemma about the NP backgrounds. Specifically, in order to obtain high signal-to-noise ratio signals of the target molecules, the NPs are designed to aggregated slightly thus to form more intensive hotspots. While at the same time, the NP background will also be enhanced most probability generated by the residue moieties of reagents on the NP surface and these NP background peaks might overlap with target signals and interfere with the downstream data analysis. The removal of the NP background has been a great challenge for a long time since the background signals vary from different spectra. Conventional background removal approaches, for example, estimate the background contribution in the mixed signal referred to one certain background pattern or some of the peaks,59,60 as the result, prone to overestimate or underestimate the background. Therefore, the BRNet (details in Table S6), which denoted as , was proposed to remove the background signal precisely and automatically. First, we constructed a BR dataset consisting of 100 pure background data (the signal of NPs only), 1,000 no (weak) background data (SERS spectra of real biological fluids with low or negligible NP signals) and 5,000,000 “mixed spectra”. Herein, the “mixed spectra” were artificially generated by linearly adding background signals on sample signals. Each data pair in the BR dataset consisted of “mixed spectra” and its corresponding background components, as the input and the output of the model, respectively. To accurately remove the NP background, we modified 1D-ResUnet as BRNet to understand the patterns of the NP background and precisely infer the background signal without any prior knowledge of background spectra. In detail, BRNet consists of three components, i.e., the 1D residual CNN encoder, 1D transpose convolutional up-sample decoder and layer-wise skip connection. The learning criterion was to minimize the Mean Squared Error (MSE) loss between the predicted background and the ground truth (GT) background. As shown in Equation 1, we obtained the background-removed spectrum () by subtracting the inferred background spectrum () from the original spectrum (). The background-removed SERSomes were concatenated by the background-removed spectra in the spatial dimension, as show in Equation 2.
| (Equation 1) |
| (Equation 2) |
The diffusion model data augmentation—DANet: The 1D-diffusion model was extended to synthesize spectra since the conventional augmentation methods for sequential data do not fit well with spectral format. In this study, a 1D-ResUnet with 8 Resnet blocks was implemented as the denoising model () which iteratively reduces the noise with = 2000 times from the 1D-random input to generate auxiliary spectra. The denoising models were trained class by class, i.e., three diffusion models of synthetic PCa, BPH and control. To accelerate the training process, the input spectra were down-sampled to shrink the input scale. The diffusion model has been trained by 7000 steps with Adam optimizer with the learning rate of . The synthetic spectra were then up-sampled to the original spectral length. The t-step shrinkage parameter was denoted as and the t-step noise variance as . As shown in Equation 3, the spectrum of ( 1)-step () is inferred from the -step spectrum () by denoising. The synthetic SERSomes were concatenated by the -step denoised spectra from Gaussian random inputs, as shown in Equation 4.
| (Equation 3) |
| (Equation 4) |
The SERSome-based deep learning diagnosis for prostate cancer—PCDNet: The medical information used for PCa diagnosis is consistently and coupled embedded in the spatial-spectral features of SERSome data. Fully understanding the spectral and spatial information and their association can strengthen the SERSome for diagnosis. Actually, this has been widely practiced in acoustics that the time-frequency feature of sound was jointly rephrased in the visual form of Mel spectrograms and processed by the computer vision methods.64 The PCDNet (details in Table S7) architecture was adapted from the typical Resnet, which was generally applied as a visual encoder (). It consists of an initial convolution block followed by 4 residual blocks, integrated with flattening operation as the spectral-spatial encoder () and a final fully connected classification layer. The residual blocks contained skip connections between the input and output of each residual block, benefiting stable training and gradient propagation.90 Herein, the learning criterion was to minimize the cross-entropy loss between the predicted classes and the ground truth (GT) classes (PCa, BPH and control). The prediction was inferred from the SERSome inputs, as shown in Equation 5. t-SNE was done based on 6144-D features vectors extracted by . Then, every SERSome was mapped to a 6144-D latent feature vector for t-SNE analysis and visualization.
| (Equation 5) |
The models were trained on the training sets and validated after every training epoch. The initial learning rate was 10−4 and decay every 10 epoch to 98% of previous learning rate. The models were saved at the epoch when validation accuracy was highest and training accuracy larger than 75%. Then the models were evaluated on the test sets.
Mining spectral features—explainable clinical deep learning: Explainable AI (XAI) is an emerging development in deep learning, which enables the knowledge that deep models learn from medical big data to be distillated to clinical experts. Spectral data have good interpretability because of the stable and explicit correlation between characteristic bands. However, traditional statistical methods, though having screened some differentiated bands, are not necessarily to find the features literally conducive to classification. Spectral data combined with deep learning can learn the principles for diagnosis, and interpretable models can visualize the features that the deep models pay attention to. The feature peaks from interpretable deep learning are more reliable. The Gradient Class Activation Map++ (Grad CAM++)69 is applied to the well-trained PCDNet to visually show the inference procedure of the deep model via saliency maps. The feature maps () were encoded by , as shown in Equation 6. The saliency maps were the weighted summation of , and the weights () were calculated by the first, second and third-order back propagation gradients, as shown in Equations 7 and 8. The saliency maps were then activated by each class (), denoted as . We calculated the spatial average of and obtained the saliency spectrum, as shown in Figure 5B.
| (Equation 6) |
| (Equation 7) |
| (Equation 8) |
Prediction of Gleason score: The clinical information (T-stages, N-stages and PSA) is used to predict the Gleason score via Multi-Layer Perceptron (MLP). The SERSome-based method employed the PCDNet. During the process of multimodal fusion, the spectral features were encoded by transferring from the PCDNet and fused with clinical indicators. The cases of GS = 6, 8 & 9 are insufficient so that operations of copy and noise addition are implemented to balance the training data.
A ROC analysis and AUC were used to evaluate model performance for diagnosis and pathological stratification of PCa. The multiclass classification accuracy is defined as , where is true-or-false indicator so that and . For multiclass tasks, the macro averages of ROC and AUC were used as the metrics for each class. The ROC curves were plotted by using the true-positive rate (TPR, namely, sensitivity) versus the false-positive rate (FPR, namely, 1 − specificity) with changing decision thresholds. For a model and a given ROC curve TPR = f(FPR), where FPR∈[0,1], the AUC is defined as: . Normalized confusion matrices were also applied to quantify the classification results. The above mentioned multivariate and statistical analyses were performed using scikit-learn (version 1.2.2) and scipy (version 1.9.1) packages in Python (version 3.9.16). All SERSomes were processed on Intel(R) Xeon(R) Platinum 8276M CPU @ 2.20GHz and a total of two Nvidia 3090 graphics processing units.
Acknowledgments
We gratefully acknowledge financial support from the National Natural Science Foundation of China (82272054, 81627801, 31971151, and 82373358), the Science and Technology Commission of Shanghai Municipality (21511102100 and BI0820067/002), Shanghai Jiao Tong University (YG2024LC09), the Clinical Research Plan of Shanghai Hospital Development Center (SHDC2020CR3014A), the “Clinic Plus” Outstanding Project (2023ZYA007) from the Shanghai Key Laboratory for Nucleic Acid Chemistry and Nanomedicine, the Shanghai Key Laboratory of Gynecologic Oncology, and the National Key Research and Development Program of China (2022YFB4702702).
Author contributions
J.Y., W.X., J.P., and C.J. conceived the research. X.B. synthesized silver particles, configured the synthetic mixture, and performed measurement and the subsequent data analyses of the synthetic mixtures and the cell culture medium. C.H. evaluated the spectral number for each sample. F.L. performed cell experiments and obtained cell culture medium. H.C. conducted the correlation analysis of the SERS spectra of the cell culture medium. J.W., B.D., and B.L. collected the serum samples. J.W. and X.B. measured SERS spectra of the serum. B.X. performed the data analysis of the serum SERS spectra and designed the AI system. J.Y. administrated the project and provided guidance on methodology. J.Y. and C.J. guided the analysis of the serum samples. All authors wrote and revised the manuscript. X.B., J.W., and B.X. contributed equally to this work.
Declaration of interests
The authors declare no competing interests.
Published: May 21, 2024
Footnotes
Supplemental information can be found online at https://doi.org/10.1016/j.xcrm.2024.101579.
Contributor Information
Cheng Jin, Email: chengjin520@sjtu.edu.cn.
Jiahua Pan, Email: panjiahua@renji.com.
Wei Xue, Email: xuewei@renji.com.
Jian Ye, Email: yejian78@sjtu.edu.cn.
Supplemental information
References
- 1.Marrugo-Ramírez J., Mir M., Samitier J. Blood-Based Cancer Biomarkers in Liquid Biopsy: A Promising Non-Invasive Alternative to Tissue Biopsy. Int. J. Mol. Sci. 2018;19:2877. doi: 10.3390/ijms19102877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Liu S., Wang J. Current and Future Perspectives of Cell-Free DNA in Liquid Biopsy. Curr. Issues Mol. Biol. 2022;44:2695–2709. doi: 10.3390/cimb44060184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Sathyanarayan P., Sloane H., Edelstein D., Jones F., Preston J., Wu S., Los J., Holtrup F., Quinn H., Feller-Kopman D. Rapid liquid biopsy genotyping in NSCLC patients. J. Thorac. Oncol. 2021;16:S788. [Google Scholar]
- 4.Jin C.E., Koo B., Lee T.Y., Han K., Lim S.B., Park I.J., Shin Y. Simple and Low-Cost Sampling of Cell-Free Nucleic Acids from Blood Plasma for Rapid and Sensitive Detection of Circulating Tumor DNA. Adv. Sci. 2018;5 doi: 10.1002/advs.201800614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Holmes E., Wilson I.D., Nicholson J.K. Metabolic phenotyping in health and disease. Cell. 2008;134:714–717. doi: 10.1016/j.cell.2008.08.026. [DOI] [PubMed] [Google Scholar]
- 6.Faubert B., Solmonson A., DeBerardinis R.J. Metabolic reprogramming and cancer progression. Science. 2020;368 doi: 10.1126/science.aaw5473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Nicholson J.K., Holmes E., Kinross J.M., Darzi A.W., Takats Z., Lindon J.C. Metabolic phenotyping in clinical and surgical environments. Nature. 2012;491:384–392. doi: 10.1038/nature11708. [DOI] [PubMed] [Google Scholar]
- 8.Blaise B.J., Correia G.D.S., Haggart G.A., Surowiec I., Sands C., Lewis M.R., Pearce J.T.M., Trygg J., Nicholson J.K., Holmes E., Ebbels T.M.D. Statistical analysis in metabolic phenotyping. Nat. Protoc. 2021;16:4299–4326. doi: 10.1038/s41596-021-00579-1. [DOI] [PubMed] [Google Scholar]
- 9.Deda O., Gika H., Raikos N., Theodoridis G. In: The Handbook of Metabolic Phenotyping. Lindon J.C., Nicholson J.K., Holmes E., editors. Elsevier; 2019. Chapter 4 - GC-MS-Based Metabolic Phenotyping; pp. 137–169. [Google Scholar]
- 10.Markley J.L., Brüschweiler R., Edison A.S., Eghbalnia H.R., Powers R., Raftery D., Wishart D.S. The future of NMR-based metabolomics. Curr. Opin. Biotechnol. 2017;43:34–40. doi: 10.1016/j.copbio.2016.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Graça G., Lau C.-H.E., Gonçalves L.G. In: Tumor Microenvironment : The Main Driver of Metabolic Adaptation. Serpa J., editor. Springer International Publishing; 2020. Exploring Cancer Metabolism: Applications of Metabolomics and Metabolic Phenotyping in Cancer Research and Diagnostics; pp. 367–385. [DOI] [PubMed] [Google Scholar]
- 12.Wang W., Shi B., He C., Wu S., Zhu L., Jiang J., Wang L., Lin L., Ye J., Zhang H. Euclidean distance-based Raman spectroscopy (EDRS) for the prognosis analysis of gastric cancer: A solution to tumor heterogeneity. Spectrochim. Acta. 2023;288 doi: 10.1016/j.saa.2022.122163. [DOI] [PubMed] [Google Scholar]
- 13.Wu Y., Wang Y., He C., Wang Y., Ma J., Lin Y., Zhou L., Xu S., Ye Y., Yin W., et al. Precise diagnosis of breast phyllodes tumors using Raman spectroscopy: Biochemical fingerprint, tumor metabolism and possible mechanism. Anal. Chim. Acta. 2023;1283 doi: 10.1016/j.aca.2023.341897. [DOI] [PubMed] [Google Scholar]
- 14.Bi X., Lin L., Chen Z., Ye J. Artificial Intelligence for Surface-Enhanced Raman Spectroscopy. Small Methods. 2024;8 doi: 10.1002/smtd.202301243. [DOI] [PubMed] [Google Scholar]
- 15.Teng L., Wang X., Wang X., Gou H., Ren L., Wang T., Wang Y., Ji Y., Huang W.E., Xu J. Label-free, rapid and quantitative phenotyping of stress response in E. coli via ramanome. Sci. Rep. 2016;6 doi: 10.1038/srep34359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cutshaw G., Uthaman S., Hassan N., Kothadiya S., Wen X., Bardhan R. The Emerging Role of Raman Spectroscopy as an Omics Approach for Metabolic Profiling and Biomarker Detection toward Precision Medicine. Chem. Rev. 2023;123:8297–8346. doi: 10.1021/acs.chemrev.2c00897. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Langer J., Jimenez de Aberasturi D., Aizpurua J., Alvarez-Puebla R.A., Auguié B., Baumberg J.J., Bazan G.C., Bell S.E.J., Boisen A., Brolo A.G., et al. Present and future of surface-enhanced Raman scattering. ACS Nano. 2020;14:28–117. doi: 10.1021/acsnano.9b04224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Liu F., Wu T., Tian A., He C., Bi X., Lu Y., Yang K., Xia W., Ye J. Intracellular metabolic profiling of drug resistant cells by surface enhanced Raman scattering. Anal. Chim. Acta. 2023;1279 doi: 10.1016/j.aca.2023.341809. [DOI] [PubMed] [Google Scholar]
- 19.Panneerselvam R., Liu G.-K., Wang Y.-H., Liu J.-Y., Ding S.-Y., Li J.-F., Wu D.-Y., Tian Z.-Q. Surface-enhanced Raman spectroscopy: bottlenecks and future directions. Chem. Commun. 2017;54:10–25. doi: 10.1039/c7cc05979e. [DOI] [PubMed] [Google Scholar]
- 20.Willets K.A., Van Duyne R.P. Localized surface plasmon resonance spectroscopy and sensing. Annu. Rev. Phys. Chem. 2007;58:267–297. doi: 10.1146/annurev.physchem.58.032806.104607. [DOI] [PubMed] [Google Scholar]
- 21.Bonifacio A., Cervo S., Sergo V. Label-free surface-enhanced Raman spectroscopy of biofluids: fundamental aspects and diagnostic applications. Anal. Bioanal. Chem. 2015;407:8265–8277. doi: 10.1007/s00216-015-8697-z. [DOI] [PubMed] [Google Scholar]
- 22.Schober P., Boer C., Schwarte L.A. Correlation coefficients: appropriate use and interpretation. Anesth. Analg. 2018;126:1763–1768. doi: 10.1213/ANE.0000000000002864. [DOI] [PubMed] [Google Scholar]
- 23.Xu Y., Zhang Y., Li C., Ye Z., Bell S.E.J. SERS as a Probe of Surface Chemistry Enabled by Surface-Accessible Plasmonic Nanomaterials. Acc. Chem. Res. 2023;56:2072–2083. doi: 10.1021/acs.accounts.3c00207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Oh W.S., Kim M.S., Suh S.W. Surface-enhanced Raman scattering (SERS) of nucleic acid components in silver sol: Guanine series. J. Raman Spectrosc. 1987;18:253–258. [Google Scholar]
- 25.Kim S.K., Joo T.H., Suh S.W., Kim M.S. Surface-enhanced Raman scattering (SERS) of nucleic acid components in silver sol: Adenine series. J. Raman Spectrosc. 1986;17:381–386. [Google Scholar]
- 26.Liu Y., Lyu N., Rajendran V.K., Piper J., Rodger A., Wang Y. Sensitive and direct DNA mutation detection by surface-enhanced Raman spectroscopy using rational designed and tunable plasmonic nanostructures. Anal. Chem. 2020;92:5708–5716. doi: 10.1021/acs.analchem.9b04183. [DOI] [PubMed] [Google Scholar]
- 27.Królikowska A., Bukowska J. Self-assembled monolayers of mercaptosuccinic acid monolayers on silver and gold surfaces designed for protein binding. Part II: vibrational spectroscopy studies on cytochrome c immobilization. J. Raman Spectrosc. 2007;38:943–949. [Google Scholar]
- 28.Popp M., Hann S., Koellensperger G. Environmental application of elemental speciation analysis based on liquid or gas chromatography hyphenated to inductively coupled plasma mass spectrometry--a review. Anal. Chim. Acta. 2010;668:114–129. doi: 10.1016/j.aca.2010.04.036. [DOI] [PubMed] [Google Scholar]
- 29.Heath J.R., Ribas A., Mischel P.S. Single-cell analysis tools for drug discovery and development. Nat. Rev. Drug Discov. 2016;15:204–216. doi: 10.1038/nrd.2015.16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Cronstein B.N., Aune T.M. Methotrexate and its mechanisms of action in inflammatory arthritis. Nat. Rev. Rheumatol. 2020;16:145–154. doi: 10.1038/s41584-020-0373-9. [DOI] [PubMed] [Google Scholar]
- 31.Casals E., Pfaller T., Duschl A., Oostingh G.J., Puntes V. Time Evolution of the Nanoparticle Protein Corona. ACS Nano. 2010;4:3623–3632. doi: 10.1021/nn901372t. [DOI] [PubMed] [Google Scholar]
- 32.Walkey C.D., Olsen J.B., Song F., Liu R., Guo H., Olsen D.W.H., Cohen Y., Emili A., Chan W.C.W. Protein Corona Fingerprinting Predicts the Cellular Interaction of Gold and Silver Nanoparticles. ACS Nano. 2014;8:2439–2455. doi: 10.1021/nn406018q. [DOI] [PubMed] [Google Scholar]
- 33.Akbar S., Majeed M.I., Nawaz H., Rashid N., Tariq A., Hameed W., Shakeel S., Dastgir G., Bari R.Z.A., Iqbal M., et al. Surface-enhanced Raman spectroscopic (SERS) characterization of low molecular weight fraction of the serum of breast cancer patients with principal component analysis (PCA) and partial least square-discriminant analysis (PLS-DA) Anal. Lett. 2022;55:1588–1604. [Google Scholar]
- 34.Anwar ul H., Muhammad Irfan M., Haq N., Nosheen R., Muhammad Rizwan J., Ali R., Muhammad S., Syeda Takmeel Z., Lubna M., Asima P., et al. Surface-enhanced Raman spectroscopy for monitoring antibacterial activity of imidazole derivative (1-benzyl-3-(sec-butyl)-1H-imidazole-3-ium bromide) against Bacillus subtilis and Escherichia coli. Photodiagn. Photodyn. 2023;42 doi: 10.1016/j.pdpdt.2023.103533. [DOI] [PubMed] [Google Scholar]
- 35.Wang W., Rahman A., Kang S., Vikesland P.J. Investigation of the Influence of Stress on Label-Free Bacterial Surface-Enhanced Raman Spectra. Anal. Chem. 2023;95:3675–3683. doi: 10.1021/acs.analchem.2c04636. [DOI] [PubMed] [Google Scholar]
- 36.Nargis H.F., Nawaz H., Bhatti H.N., Jilani K., Saleem M. Comparison of surface enhanced Raman spectroscopy and Raman spectroscopy for the detection of breast cancer based on serum samples. Spectrochim. Acta. 2021;246 doi: 10.1016/j.saa.2020.119034. [DOI] [PubMed] [Google Scholar]
- 37.Shao L., Zhang A., Rong Z., Wang C., Jia X., Zhang K., Xiao R., Wang S. Fast and non-invasive serum detection technology based on surface-enhanced Raman spectroscopy and multivariate statistical analysis for liver disease. Nanomedicine. 2018;14:451–459. doi: 10.1016/j.nano.2017.11.022. [DOI] [PubMed] [Google Scholar]
- 38.Santos C.R., Schulze A. Lipid metabolism in cancer. FEBS J. 2012;279:2610–2623. doi: 10.1111/j.1742-4658.2012.08644.x. [DOI] [PubMed] [Google Scholar]
- 39.Sakagami H., Satoh M., Yokote Y., Takano H., Takahama M., Kochi M., Akahane K. Amino acid utilization during cell growth and apoptosis induction. Anticancer Res. 1998;18:4303–4306. [PubMed] [Google Scholar]
- 40.Kim H.-Y., Lee K.-M., Kim S.-H., Kwon Y.-J., Chun Y.-J., Choi H.-K. Comparative metabolic and lipidomic profiling of human breast cancer cells with different metastatic potentials. Oncotarget. 2016;7:67111–67128. doi: 10.18632/oncotarget.11560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.He Y., Huang S., Zhang P., Ji Y., Xu J. Intra-Ramanome correlation analysis unveils metabolite conversion network from an isogenic population of cells. mBio. 2021;12 doi: 10.1128/mBio.01470-21. e01470–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.DeBerardinis R.J., Mancuso A., Daikhin E., Nissim I., Yudkoff M., Wehrli S., Thompson C.B. Beyond aerobic glycolysis: Transformed cells can engage in glutamine metabolism that exceeds the requirement for protein and nucleotide synthesis. Proc. Natl. Acad. Sci. USA. 2007;104:19345–19350. doi: 10.1073/pnas.0709747104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Al-Natour M.A., Alazzo A., Ghaemmaghami A.M., Kim D.-H., Alexander C. LC-MS metabolomics comparisons of cancer cell and macrophage responses to methotrexate and polymer-encapsulated methotrexate. Int. J. Pharm. X. 2019;1 doi: 10.1016/j.ijpx.2019.100036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Ijaz M.U., Shahzadi S., Ashraf A., Mahboob S., Al-Ghanim K.A., Riaz M.N., Al-Misned F., Sultana S. Protective effect of baicalin on methotrexate-induced mitochondrial damage in testicular tissues of rats. J. King Saud Univ. Sci. 2022;34 [Google Scholar]
- 45.Muniz-Miranda M., Gellini C., Pagliai M., Innocenti M., Salvi P.R., Schettino V. SERS and computational studies on microRNA chains adsorbed on silver surfaces. J. Phys. Chem. C. 2010;114:13730–13735. [Google Scholar]
- 46.Chan E.S.L., Cronstein B.N. Methotrexate—how does it really work? Nat. Rev. Rheumatol. 2010;6:175–178. doi: 10.1038/nrrheum.2010.5. [DOI] [PubMed] [Google Scholar]
- 47.Chan E.S.L., Cronstein B.N. Mechanisms of action of methotrexate. Bull. Hosp. Jt. Dis. 2013;71:S5–S8. [PubMed] [Google Scholar]
- 48.Lee C., Carney R., Lam K., Chan J.W. SERS analysis of selectively captured exosomes using an integrin-specific peptide ligand. J. Raman Spectrosc. 2017;48:1771–1776. [Google Scholar]
- 49.Jaworska A., Wojcik T., Malek K., Kwolek U., Kepczynski M., Ansary A.A., Chlopicki S., Baranska M. Rhodamine 6G conjugated to gold nanoparticles as labels for both SERS and fluorescence studies on live endothelial cells. Mikrochim. Acta. 2015;182:119–127. doi: 10.1007/s00604-014-1307-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Giese B., McNaughton D. Surface-Enhanced Raman Spectroscopic and Density Functional Theory Study of Adenine Adsorption to Silver Surfaces. J. Phys. Chem. B. 2002;106:101–112. [Google Scholar]
- 51.Wang Z., Hong Y., Yan H., Luo H., Zhang Y., Li L., Lu S., Chen Y., Wang D., Su Y., Yin G. Fabrication of optoplasmonic particles through electroless deposition and the application in SERS-based screening of nodule-involved lung cancer. Spectrochim. Acta. 2022;279 doi: 10.1016/j.saa.2022.121483. [DOI] [PubMed] [Google Scholar]
- 52.Zhao H., Fu H., Tian C., Ren Z., Tian G. Fabrication of silver nanoparticles/single-walled carbon nanotubes composite for surface-enhanced Raman scattering. J. Colloid Interface Sci. 2010;351:343–347. doi: 10.1016/j.jcis.2010.08.004. [DOI] [PubMed] [Google Scholar]
- 53.He X.-N., Wang Y.-N., Wang Y., Xu Z.-R. Accurate quantitative detection of cell surface sialic acids with a background-free SERS probe. Talanta. 2020;209 doi: 10.1016/j.talanta.2019.120579. [DOI] [PubMed] [Google Scholar]
- 54.Mohaček-Grošev V., Brljafa S., Škrabić M., Marić I., Blažek Bregović V., Amendola V., Ropret P., Kvaček Blažević A. Glucosamine to gold nanoparticles binding studied using Raman spectroscopy. Spectrochim. Acta. 2022;264 doi: 10.1016/j.saa.2021.120326. [DOI] [PubMed] [Google Scholar]
- 55.Milewska A., Sigurjonsson O.E., Leosson K. SERS imaging of mesenchymal stromal cell differentiation. ACS Appl. Bio Mater. 2021;4:4999–5007. doi: 10.1021/acsabm.1c00286. [DOI] [PubMed] [Google Scholar]
- 56.Duffy M.J. Biomarkers for prostate cancer: prostate-specific antigen and beyond. Clin. Chem. Lab. Med. 2020;58:326–339. doi: 10.1515/cclm-2019-0693. [DOI] [PubMed] [Google Scholar]
- 57.Turkbey B., Rosenkrantz A.B., Haider M.A., Padhani A.R., Villeirs G., Macura K.J., Tempany C.M., Choyke P.L., Cornud F., Margolis D.J., et al. Prostate Imaging Reporting and Data System Version 2.1: 2019 Update of Prostate Imaging Reporting and Data System Version 2. Eur. Urol. 2019;76:340–351. doi: 10.1016/j.eururo.2019.02.033. [DOI] [PubMed] [Google Scholar]
- 58.Geeurickx E., Hendrix A. Targets, pitfalls and reference materials for liquid biopsy tests in cancer diagnostics. Mol. Aspect. Med. 2020;72 doi: 10.1016/j.mam.2019.10.005. [DOI] [PubMed] [Google Scholar]
- 59.Tan Z., Zhang Y., Thackray B.D., Ye J. Improvement of surface-enhanced Raman scattering detection and imaging by multivariate curve resolution methods. J. Appl. Phys. 2019;125 [Google Scholar]
- 60.LaLone V., Aizenshtadt A., Goertz J., Skottvoll F.S., Mota M.B., You J., Zhao X., Berg H.E., Stokowiec J., Yu M., et al. Quantitative chemometric phenotyping of three-dimensional liver organoids by Raman spectral imaging. Cell Rep. Methods. 2023;3 doi: 10.1016/j.crmeth.2023.100440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Ruan L., Ma Y., Yang H., He H., Liu B., Fu J., Jing Yuan N., Jin Q., Guo B. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023. MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation; pp. 10219–10228. [Google Scholar]
- 62.Trabucco B., Doherty K., Gurinas M., Salakhutdinov R. Effective Data Augmentation With Diffusion Models. arXiv. 2023 doi: 10.48550/arXiv.2302.07944. Preprint at. [DOI] [Google Scholar]
- 63.Pavlou E., Kourkoumelis N. Deep adversarial data augmentation for biomedical spectroscopy: Application to modelling Raman spectra of bone. Chemometr. Intell. Lab. 2022;228 [Google Scholar]
- 64.Kaneko T., Tanaka K., Kameoka H., Seki S. ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022. ISTFTNET: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform; pp. 6207–6211. [Google Scholar]
- 65.Behdenna A., Colange M., Haziza J., Gema A., Appé G., Azencott C.-A., Nordor A. pyComBat, a Python tool for batch effects correction in high-throughput molecular data using empirical Bayes methods. BMC Bioinf. 2023;24:459. doi: 10.1186/s12859-023-05578-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Loeb S., Bjurlin M.A., Nicholson J., Tammela T.L., Penson D.F., Carter H.B., Carroll P., Etzioni R. Overdiagnosis and Overtreatment of Prostate Cancer. Eur. Urol. 2014;65:1046–1055. doi: 10.1016/j.eururo.2013.12.062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Gleason D.F., Mellinger G.T. Prediction of Prognosis for Prostatic Adenocarcinoma by Combined Histological Grading and Clinical Staging. J. Urol. 1974;111:58–64. doi: 10.1016/s0022-5347(17)59889-4. [DOI] [PubMed] [Google Scholar]
- 68.Azodi C.B., Tang J., Shiu S.-H. Opening the Black Box: Interpretable Machine Learning for Geneticists. Trends Genet. 2020;36:442–455. doi: 10.1016/j.tig.2020.03.005. [DOI] [PubMed] [Google Scholar]
- 69.Chattopadhay A., Sarkar A., Howlader P., Balasubramanian V.N. 2018 IEEE winter conference on applications of computer vision (WACV) 2018. Grad-CAM++: Generalized Gradient-Based Visual Explanations for Deep Convolutional Networks; pp. 839–847. [Google Scholar]
- 70.Oh B., Figtree G., Costa D., Eade T., Hruby G., Lim S., Elfiky A., Martine N., Rosenthal D., Clarke S., Back M. Oxidative stress in prostate cancer patients: A systematic review of case control studies. Prostate Int. 2016;4:71–87. doi: 10.1016/j.prnil.2016.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Deng Y., Huang J., Wong M.C.S. Association between serum uric acid and prostate cancer risk in East Asian populations: a Mendelian randomization study. Eur. J. Nutr. 2023;62:1323–1329. doi: 10.1007/s00394-022-03076-7. [DOI] [PubMed] [Google Scholar]
- 72.Benli E., Cirakoglu A., Ayyıldız S.N., Yüce A. Comparison of serum uric acid levels between prostate cancer patients and a control group. Cent. European J. Urol. 2018;71:242–247. doi: 10.5173/ceju.2018.1619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Kumar D., Gupta A., Mandhani A., Sankhwar S.N. NMR spectroscopy of filtered serum of prostate cancer: A new frontier in metabolomics. Prostate. 2016;76:1106–1119. doi: 10.1002/pros.23198. [DOI] [PubMed] [Google Scholar]
- 74.Clendinen C.S., Gaul D.A., Monge M.E., Arnold R.S., Edison A.S., Petros J.A., Fernández F.M. Preoperative Metabolic Signatures of Prostate Cancer Recurrence Following Radical Prostatectomy. J. Proteome Res. 2019;18:1316–1327. doi: 10.1021/acs.jproteome.8b00926. [DOI] [PubMed] [Google Scholar]
- 75.Lima A.R., Carvalho M., Aveiro S.S., Melo T., Domingues M.R., Macedo-Silva C., Coimbra N., Jerónimo C., Henrique R., Bastos M.d.L., et al. Comprehensive Metabolomics and Lipidomics Profiling of Prostate Cancer Tissue Reveals Metabolic Dysregulations Associated with Disease Development. J. Proteome Res. 2022;21:727–739. doi: 10.1021/acs.jproteome.1c00754. [DOI] [PubMed] [Google Scholar]
- 76.Krysa M., Szymańska-Chargot M., Zdunek A. FT-IR and FT-Raman fingerprints of flavonoids – A review. Food Chem. 2022;393 doi: 10.1016/j.foodchem.2022.133430. [DOI] [PubMed] [Google Scholar]
- 77.Chen J., Liu M., Yuan H., Huang S., Zhao J., Tao J., Xu N. Surface-enhanced Raman spectroscopy for classification of testosterone propionate and nandrolone residues in chicken. Vib. Spectrosc. 2018;99:7–12. [Google Scholar]
- 78.Schatzl G., Madersbacher S., Thurridl T., Waldmüller J., Kramer G., Haitel A., Marberger M. High-grade prostate cancer is associated with low serum testosterone levels. Prostate. 2001;47:52–58. doi: 10.1002/pros.1046. [DOI] [PubMed] [Google Scholar]
- 79.Li Z., Ding B., Ali M.R.K., Zhao L., Zang X., Lv Z. Dual Effect of Tryptamine on Prostate Cancer Cell Growth Regulation: A Pilot Study. Int. J. Mol. Sci. 2022;23 doi: 10.3390/ijms231911087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Kannen V., Bader M., Sakita J.Y., Uyemura S.A., Squire J.A. The Dual Role of Serotonin in Colorectal Cancer. Trends Endocrinol. Metabol. 2020;31:611–625. doi: 10.1016/j.tem.2020.04.008. [DOI] [PubMed] [Google Scholar]
- 81.Goodall B.L., Robinson A.M., Brosseau C.L. Electrochemical-surface enhanced Raman spectroscopy (E-SERS) of uric acid: a potential rapid diagnostic method for early preeclampsia detection. Phys. Chem. Chem. Phys. 2013;15:1382–1388. doi: 10.1039/c2cp42596c. [DOI] [PubMed] [Google Scholar]
- 82.Pucetaite M., Velicka M., Pilipavicius J., Beganskiene A., Ceponkus J., Sablinskas V. Uric acid detection by means of SERS spectroscopy on dried Ag colloidal drops. J. Raman Spectrosc. 2016;47:681–686. [Google Scholar]
- 83.Lin X., Jia X., Lin J.Y., Wu P.H., Weng Y., Feng S. A comparative study based on serum SERS spectra in and on the coffee ring for high precision breast cancer detection. J. Raman Spectrosc. 2022;53:1371–1379. [Google Scholar]
- 84.Jeyavijayan S., Arivazhagan M. Study of density functional theory and vibrational spectra of hypoxanthine. NOPR. 2010;48:869–874. [Google Scholar]
- 85.Singh J.S. Thermodynamic functions with other properties and vibrational spectra of pyrimidine ring of uracil for RNA and bio-molecule 5-aminouracil. Heliyon. 2019;5 doi: 10.1016/j.heliyon.2019.e02955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Hussain S., Pang Y. Surface geometry of tryptophan adsorbed on gold colloidal nanoparticles. J. Mol. Struct. 2015;1096:121–128. [Google Scholar]
- 87.Debus B., Parastar H., Harrington P., Kirsanov D. Deep learning in analytical chemistry. TrAC, Trends Anal. Chem. 2021;145 [Google Scholar]
- 88.Yang Z., Albrow-Owen T., Cai W., Hasan T. Miniaturization of optical spectrometers. Science. 2021;371 doi: 10.1126/science.abe0722. [DOI] [PubMed] [Google Scholar]
- 89.Lee P.C., Meisel D. Adsorption and surface-enhanced Raman of dyes on silver and gold sols. J. Phys. Chem. 1982;86:3391–3395. [Google Scholar]
- 90.He K., Zhang X., Ren S., Sun J. Proceedings of the IEEE conference on computer vision and pattern recognition. 2016. Deep residual learning for image recognition; pp. 770–778. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
-
•
All data is available from the lead contact upon request.
-
•
The codes and data including the model, the training engine, the testing script as well as the model weight and partial test dataset have been deposited at Zenodo and are publicly available as of the date of publication. DOI is listed in the key resources table.
-
•
Any additional information required to reanalyze the data reported in this work paper is available from the Lead Contact upon request.





