Abstract
Osteoarthritis (OA) is the most common degenerative joint disease, presented as wearing down of articular cartilage and resulting in pain and limited mobility for 1 in 10 adults in the UK [Osteoarthr. Cartil. 28(6), 792 (2020) 10.1016/j.joca.2020.03.004]. There is an unmet need for patient friendly paradigms for clinical assessment that do not use ionizing radiation (CT), exogenous contrast enhancing dyes (MRI), and biopsy. Hence, techniques that use non-destructive, near- and shortwave infrared light (NIR, SWIR) may be ideal for providing label-free, deep tissue interrogation. This study demonstrates multimodal “spectromics”, low-level abstraction data fusion of non-destructive NIR Raman scattering spectroscopy and NIR-SWIR absorption spectroscopy, providing an enhanced, interpretable “fingerprint” for diagnosis of OA in human cartilage. This is proposed as method level innovation applicable to both arthro- or endoscopic (minimally invasive) or potential exoscopic (non-invasive) optical approaches. Samples were excised from femoral heads post hip arthroplasty from OA patients (n = 13) and age-matched control (osteoporosis) patients (n = 14). Under multivariate statistical analysis and supervised machine learning, tissue was classified to high precision: 100% segregation of tissue classes (using 10 principal components), and a classification accuracy of 95% (control) and 80% (OA), using the combined vibrational data. There was a marked performance improvement (5 to 6-fold for multivariate analysis) using the spectromics fingerprint compared to results obtained from solely Raman or NIR-SWIR data. Furthermore, clinically relevant tissue components were identified through discriminatory spectral features – spectromics biomarkers – allowing interpretable feedback from the enhanced fingerprint. In summary, spectromics provides comprehensive information for early OA detection and disease stratification, imperative for effective intervention in treating the degenerative onset disease for an aging demographic. This novel and elegant approach for data fusion is compatible with various NIR-SWIR optical devices that will allow deep non-destructive penetration.
1. Introduction
Osteoarthritis (OA) presents a major public health challenge recognized as a serious burden for the individuals affected, healthcare systems, and resulting in significant national and global socioeconomic costs [1,2]. Current modes of assessment of articular cartilage (AC) in the clinic are typically invasive (endoscopy, arthroscopy), destructive (biopsy, histochemistry), incorporating ionising radiation and/or exogenous contrast (CT, X-ray, MRI) [3,4]. Each modality provides different levels of qualitative morphological information of the tissue to assess the health of the patient but typically necessitate clinical evaluation and treatment prognosis. Critically, none of these indicated techniques provide a definitive diagnosis for OA, which needs to be confirmed by secondary methods.
Current gold standard approaches for diagnosing OA rely on the detection of pain, radiologically presenting morphological changes (joint space narrowing, osteophytes formation), or accumulation of synovial fluids [5,6]. OA presents degradation and loss of articular cartilage, the lubricating and shock absorbing inter-joint layer, developed over many years, and which can pre-date symptoms over decades [7]. Hence, early diagnosis is crucial for effective and timely intervention to reduce pain, improve mobility, and patient quality of life. To date, there is no cure for OA, rather treatments are focused on alleviating inflammatory symptoms or interventional surgery including arthroplasty and prosthetic joint implant [8–10]. Detection of pre-pathomorphological and biochemical changes will inform new and earlier forms of pharmacological and lifestyle interventions to alter the course and progression of the disease [11,12].
Near- and Shortwave Infrared (NIR, SWIR) absorption and spontaneous Raman scattering are highly sensitive to structural and biochemical changes offering an innovative approach for early detection associated with the onset of OA, using spectral biomarkers. This can be ideal as both Raman and NIR-SWIR absorption spectroscopy can be carried out with minimal sample preparation, conducive to native in situ tissue assessment. Moreover, both NIR-SWIR absorption and Raman spectroscopies can utilize the biological ‘optical’ transparency windows. Such windows exist in various native human and animal tissue types and are quantified by the absorptive and scattering effect of common endogenous chromophores. In these regions, non-ionising optical light undergoes reduced scattering and absorption, facilitating deep penetration [13–15]. Depth penetrations in AC tissue are recorded up to 5 mm, indicating high suitability for optical assessment [16,17]. Since the normal thickness of human cartilage is 1–3 mm, optical interrogation in the spectral range of 1.4–2.5 µm is considered optimal for AC tissue assessment [17,18]. Thus, spectroscopic techniques such as NIR-SWIR absorption and Raman spectroscopy (if carried out with NIR or SWIR excitation) can offer valuable chemometric and structural information at depth for non-destructive, potentially non-invasive, in vivo clinical assessment.
NIR-SWIR absorption spectroscopy has been shown to be sensitive to structural and compositional changes resulting from loss or alteration of the tissue extracellular matrix (ECM) [19]. The spectral response provides information relevant to structural and functional characteristics, important in the assessment of degeneration of cartilage [20–23]. Specifically, absorbance bands in the NIR and SWIR are overtones and combinations of the fundamental vibrations of O-H, C-H, N-H, and S-H bonds which form the molecular framework of the tissue [16,19,24]. As such, NIR-SWIR absorption spectroscopy offers a non-destructive method to determine thickness, biomechanical properties, and composition of articular cartilage especially water fractions for evaluation, prediction, and monitoring of OA progression [17,18,21,22,25,26].
Spontaneous Raman scattering spectroscopy, is insensitive to water, and the spectral region between 1800 - 800 cm–1 is particularly sensitive to structural and skeletal vibration modes, thus ideal for biological tissue characterization [27]. Raman spectroscopy using NIR excitation has been employed by a number of groups in the diagnosis of osteoarthritis, with biochemical and biomechanical change in human and preclinical models correlating with gold standard assays [18,25,28]. Spectral features including those associated with collagen, GAG, and PG (major proteins of the ECM), water fraction, lipid and amides have proved efficacious as preclinical and prepathomorphological biomarkers [29–31]. Specific biochemical distributions have been mapped up to depths of 0.5 mm with NIR excitations [27,32–34] and OA relevant signals (depth and GAG) collected at depths >10 mm under spatially offset Raman spectroscopy (SORS) geometries [35].
NIR-SWIR absorption and NIR-excited Raman spectroscopy have been used to interrogate cartilage and shown potential for OA diagnosis, especially through evaluation of depth-dependent features [36]. Since these techniques are mediated by different optical phenomena, namely changes in dipole moment (overtones of vibrational modes) and changes in polarization, respectively, they offer complementary information. As such, these modalities can be combined to yield a more holistic chemical ‘fingerprint’ of the sample of interest. This study presents the first demonstration of an elegant combination of vibrational spectroscopy techniques operating in the transparency windows to augment their diagnostic assessment potential of human articular cartilage. Raman scattering and NIR-SWIR absorption signatures are concatenated through low-level abstraction data fusion, into a new “spectromic” fingerprint [37–39]. The fused data was subjected to statistical and machine learning analysis, namely supervised Principal Component Analysis – Linear Discriminant Analysis (PCA-LDA) and supervised support vector machine (SVM) [37]. The new fingerprint facilitated improved classification accuracy and delineation between control and osteoarthritic AC tissue compared to spectra from each technique individually. The low-level abstracted data is directly interpretable since significant spectral features used to classify the tissue can highlight clinically relevant biomarkers, and is compatible with various statistical and machine learning assessments. The current studies demonstrate the efficacy and power of a spectromics approach in its ability to provide a holistic assessment of human cartilage tissue for OA diagnosis, with therapeutic implications for an increasing aging population. This is directly compatible with minimally invasive procedures (arthroscopy, endoscopy) currently, and potentially with completely non-invasive implementations using NIR-SWIR wavelengths.
2. Method
2.1. Cartilage samples and preparation
Articular cartilage samples, obtained with full ethical approval and patient consent (REC reference 18/NW/0231), were excised from human femoral heads manually using a scalpel blade. Cartilage tissue slices were taken parallel to the femoral head surface, as deep as the subchondral bone. Cartilage slices were fixed in 4% paraformaldehyde (PFA) for 72 hrs and stored, refrigerated, in phosphate buffered saline.
Tissue storage conditions were found to be compatible with vibrational spectroscopy characterization, with consistent Raman spectral responses reproduced despite storage periods approaching 6 months.
For spectroscopic and imaging analysis, AC samples were cut into square slices, side lengths of the order of 10’s of mm, thickness of ∼1 mm. For each sample, the “superficial surface” describes the outermost layer of cartilage on the femoral head (in contact with the acetabular cup) and “deep side” the layer proximal to the subchondral bone. This terminology reflects the zonal stratified structure reported for cartilage wherein the superficial-, middle-, and deep zones (SZ, MZ, DZ) and calcified zones contain varying collagen fiber orientation and composition [4,17,36,40].
Samples obtained from osteoarthritic (OA) femoral heads serve as the diseased AC model, while samples from osteoporotic (OP) femoral heads served as the control model. Both classes of tissue were obtained as consequence of interventional arthroplasty surgery. Analysis of human samples enabled paradigm development to inform and model clinical application. Patient information included only patient age and gender, and classification between OA and OP. Classification was confirmed by the consultant orthopaedic surgeon with OA samples typically Grade 3 & 4 (late stage) OA progression. Lack of access to healthy mature human cartilage necessitated the use of AC from OP femoral heads as the “healthy” control model tissue. OA pathology results in severe thinning of AC around the femoral head such that the subchondral bone is typically exposed in large regions. OA samples were taken from areas where AC remained present.
Samples were selected in order to match and be equally distributed across anthropometric parameters. Spectra were collected from AC of n = 13 OA patients and n = 14 OP patients for this proof-of-concept study.
2.2. Raman scattering microspectroscopy
Raman spectroscopy of the cartilage samples was carried out via modification of a previously reported protocol [36]. Samples of AC were placed on a quartz slide, “superficial” side up, and their spectra centred at 1200 cm–1 (614–1722cm–1) captured in reflectance geometry. Measurements were carried out on a Renishaw InVia microscope system with samples excited using a 785 nm laser focused through a Leica 50x (0.75 NA) short working distance (∼200 µm) objective. Renishaw WiRE 4.1 software was used to collect data and set measuring parameters. The system was calibrated to the 520 cm–1 peak of a silicon standard before each experiment and cosmic rays removed after acquisition. Background-subtraction was carried out by subtraction of the spectrum of a blank quartz slide, taken with the same acquisition parameters as experimental samples. Spectral resolution was recorded as Δλ ∼1.1 cm–1.
Spectral mapping of each sample was achieved by movements of the sample stage in random steps of the order of 100’s µm to measure across the tissue surface. For each position, a mean average spectrum of 3 acquisitions with exposure time of 5 seconds was recorded. A modal average of 3 samples were investigated for each patient with 10 spectra measured for each sample.
2.3. NIR-SWIR absorption spectroscopy
NIR-SWIR spectroscopy of the cartilage samples was carried out on a homemade benchtop system. Samples of AC were placed on a gold-coated mirror slide, “superficial” side up, and spectra between 11,127–3993 cm–1 (899–2504 nm) captured in transreflectance geometry. Incident excitation light was provided by a broadband halogen lamp (HL-2000-FHSA-LL, Ocean Insight) emitting as a blackbody across the NIR-SWIR range, and signal collected via an OceanOptics NIR Quest 2.5 + spectrometer. Both were coupled to a ferrule fiber optic reflectance probe with a profile of 6 annular fibers for excitation and 1 central fiber for collection [22]. Two planoconvex uncoated lenses collimated and focused light onto sample, allowing for contact-less measurements. Spectral resolution of the spectrometer was quoted by the manufacturer at Δλ ∼6.3 nm.
Spectral mapping of each sample was achieved by manual sample scanning in random steps of the order of 100’s µm across the surface of the tissue. For each position, a mean average spectrum of 100 acquisitions with exposure time of 10 milliseconds was recorded. A modal average of 3 random samples were investigated for each patient and 10 spectra measured for each sample.
2.4. Spectral pre-processing
Spectra were labeled with the patient’s age, sex, and OA/OP classification. The modal average was 30 spectra per patient (10 spectra per sample, 3 samples per patient). Spectral data underwent pre-processing transformations prior to classification via multivariate analysis, carried out in iRootLab (0.15.07.09-v) toolbox within MATLAB R2020a software (MathWorks) [41].
Raman scattering spectra were treated with 5th-order polynomial baseline correction, to eliminate slow varying offset attributed to interference of fluorescence and Mie scattering [42–44]. Wavelet de-noising via 6 level Haar wavelet thresholding minimized random spectral noise without affecting signal quality [41,45]. Rubber-banding baseline correction anchored the primary and terminal ends of each spectra to the horizontal axis, before vector normalization [43,46,47].
NIR-SWIR absorbance spectra were first treated with a 1st-derivative transformation to elucidate subtle features, as well as eliminate baseline offset, linear trends, and interference from light scattering [16,44]. To mitigate subsequently increased noise (reduced SNR), a 2nd order Savitsky-Golay smoothing filter was applied (suitable for vibrational spectroscopy data) with 9 filter coefficients (reducing noise whilst preserving information) [16,44,47]. A rubber-banding baseline correction was also applied, suitable for later concatenation since the terminal value of the Raman region could marry with the starting value of NIR-SWIR. Finally, a further application of a 6 level, Haar wavelet de-noising transformation before vector normalization served to make Raman and NIR-SWIR spectral peaks a compatible magnitude.
2.5. Concatenation
The spectromics spectral fingerprint was built by concatenation of data from the mean average pre-processed Raman spectra to the mean average pre-processed NIR-SWIR data from a given patient. This was motivated by the fact that the Raman and NIR-SWIR measurements could not be spatially correlated on the femoral head of a given patient since these were performed on separate optical systems. As such one average Raman, one average NIR-SWIR and one average Spectromics spectrum was produced for each patient, representative of the whole femoral head cartilage. Concatenation is thus an abstraction of the data wherein the overall spectral shape is preserved (Raman scattering and NIR-SWIR absorbance) and the independent variable becomes a reference point in the spectromics fingerprint. Previously we have reported on the low level fusion between two Raman spectra excited at different wavelengths for accurate, label-free characterization of bacterial pathogens [45,48]. This concatenation process resulted in data fusion with Raman spectra accounting for the first 1011 independent variables (data points) and NIR-SWIR the last 512 points [37–39].
2.6. Multivariate analysis
2.6.1. PCA-LDA
Each of the pre-processed Raman, NIR-SWIR, and spectromics data set was mean centred before Principal Component Analysis (PCA) [16]. PCA scores represented variance in the sample direction and highlighted clustering patterns related to chemical similarities/dissimilarities between samples. The cumulative explained variance for increasing number of principal components considered for each spectral modality is displayed in Supplement 1 (1.8MB, pdf) . Loadings describe the contributions (weightings) across the original independent variable (wavenumbers) for a given principal component. The highest absolute loading coefficients (positive and negative peaks) can then identify the spectral regions most responsible for the PCA scores distribution [44,49]. PCA was employed for dimensionality reduction of the spectral data, from many hundred wavenumbers to a limited number of principal components to match the scale of the patient cohort. PCA could not systematically classify samples alone and instead required further classification techniques [50]. Herein the first 20, 10, and 5 PC’s (responsible for the majority of variance in the dataset, see Supplement 1 (1.8MB, pdf) ) were selected for supervised classification via Linear Discriminant Analysis (LDA). PCA-LDA assigns the cartilage samples to their predicted groups, “Control” and “OA” cartilage [51]. This algorithm calculated the Mahalanobis distance between samples for each class as a measure of tissue class segregation [44]. Classification for diagnosis here was supervised, labeled a priori via gold standard assessment from the orthopaedic surgeon. Though LDA is a parametric method and assumes samples hold a normal distribution, it was considered robust enough for spectroscopic data, and by applying to the foremost PCA scores maintains that the number of spectral variables was smaller than the number of samples [44].
2.6.2. SVM
Support Vector Machine (SVM), a supervised binary linear machine-learning classifier, was applied to each the Raman, NIR-SWIR, and spectromic fingerprints to quantify classification accuracy. A k-fold cross validation method was used to train the Gaussian classifier to model “Control vs OA Cartilage”, with the optimal values for the parameters for c and γ determined via a grid search function [41,51]. This, was carried out for k = 3 and leave-one-out cross validation to determine training and test sub-datasets, chosen from the 27 patient spectra (one spectrum for each patient, labeled “Control” or “OA”). As such, the model training had no sight of the test subset, nor of data from the same patient, validation carried out on unseen data. Confusion matrices of each SVM model, built on the Raman, NIR-SWIR and spectromics fingerprints, describe the rate of correct group assignation when applying the trained model to the test dataset.
2.6.3. Quality parameters
Quality parameters were calculated using the accumulative hits of the classification model, describing number of true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN). This quantified the classification accuracy of the respective models (Raman, NIR-SWIR, spectromics) on a test dataset (no a priori indication) [44]. The comparator metrics, equations, and relevance to the technique are summarized in Table 1.
Table 1. Quality parameters used to evaluate model classification performance a .
| Quality Parameter | Equation | Meaning |
|---|---|---|
| Accuracy / % | No. of samples correctly classified considering true & false negatives. Optimal = 100% | |
| Sensitivity (SENS) / % | Proportion of positive samples (e.g., OA) correctly classified. Optimal = 100% | |
| Specificity (SPEC) / % | Proportion of negative samples (e.g., control) correctly classified. Optimal = 100% | |
| F-Score | Model performance considering imbalanced classes. Optimal = 100% |
Quality parameters used to evaluate model classification performance. Here TP means True Positive, FP false positives, TN true negatives, and FN false negatives. Adapted from Medeiros-De-morais et al. [44].
2.7. Feature extraction: spectral biomarkers
Feature extraction was performed on the spectromics data set to ascertain potential spectral biomarkers that account for the biggest differences between the OA and control tissue groups. The key biomarkers were identified by observing agreement between the highest weighted results of several independent statistical tests. Each process was carried out within IRootLab [41].
PCA-LDA Loadings allowed feature extraction by first identifying the 3 highest absolute loading coefficients of the first LD component corresponding to the highest contributing PC’s. The loadings of the top 3 discriminant PC’s were in turn plotted to identifying the highest absolute peaks in the wavenumber direction. Cluster Vector Analysis following PCA-LDA highlighted PC’s that represent the best samples’ clustering and their loading vectors were combined [41,44]. Both approaches highlighted the most pertinent spectral regions for tissue classification in the PCA-LDA model, and the top 20 wavenumbers were recorded for each.
Differences Between Mean Spectra (DBMS) identified biochemical alterations between the mean spectra of the control AC tissue (reference) and of the OA tissue (investigated sample). The top 20 most distinguishing features were recorded.
The Student’s T-Test and Mann-Whitney U-Test were each applied to the spectromics fingerprint to test the probability of correct classification. The former assumed a normal (Gaussian) distribution and the latter non-parametric (arguably with less bias), both carried out for completeness [41,44]. The -log10 of the P-value for the T- and U-Test for each wavenumber was plotted. Feature extraction was carried out by identifying the 20 largest peaks above a threshold of p = 0.01 (99% confidence interval) [44].
Finally, Feature Forward Selection was carried out via standard protocol in the iRootLab toolbox. Here a binary classification was iterated to find the optimal features for class segregation, trained using a random 90% portion of the data, tested on the remaining 10%. Feature histograms counted how many times particular wavelength were selected, and displayed the most important features for distinguishing between the condition (OA) and reference class (control) [41,52]. The 7 most segregating peaks were recorded from the FFS histogram.
Candidate biomarker spectral features from the concatenated fingerprint were mathematically translated to the corresponding Raman and NIR-SWIR spectra for assignment [44]. NIR-SWIR features were corroborated through comparison to the nearest 1st derivative and corresponding 2nd derivative transformed spectral peaks. This allowed correct attribution to the zero-order NIR-SWIR spectra.
3. Results and discussion
3.1. Deeply penetrating vibrational spectroscopy of articular cartilage
Representative spectra obtained with back-scattered Raman scattering and transreflected SWIR absorption spectroscopy of articular cartilage are shown in Fig. 1. Raman spectra are displayed following the convention of low to high wavenumber, NIR-SWIR following the convention of high to low wavenumber. For concatenation, high to low wavenumber was chosen for consistency in both modes, though we anticipate the analysis should be agnostic to this. The class means for control and OA tissue from Raman spectra and the corresponding 1st order derivative NIR-SWIR spectra for the same patient are shown, used to form the spectromics concatenated fingerprint. For NIR-SWIR absorption spectra it was found that transreflectance geometry measurements (non-contact, highly reflective substrate) produced the same spectral response as for transmission geometry measurements (illumination and collection from either side of the sample), recorded in-house, and for backscattered fiber probe measurements (in contact, diffuse reflectance), recorded in literature [16,21,22].
Fig. 1.
Typical vibrational spectroscopy fingerprints of articular cartilage. (a.) Represents Raman spectroscopy (614–1722cm–1) pre-processed to correct background signal and random noise, and vector normalized. (b.) Represents typical NIR-SWIR absorption (11,127–3993 cm–1) minimally pre-processed to remove extraneous signal, and normalized. (c.) Represents NIR-SWIR spectra with 1st Derivative pre-treatment to elucidate subtle peaks, smoothed and normalized. (d.) Concatenated Spectromics Fingerprint showing abstracted Raman spectra (1011 data points) fused to NIR-SWIR spectra (512 data points).
Spatial sampling at 100's µm distances for vibrational spectroscopy of cartilage was considered a suitable spectral sampling size since clinical use of MRI for diagnosis of early OA would employ in-plane spatial resolutions of 300 × 300 µm2 (high powered 3.0 T scans) to assess radiologically presenting anatomical features such as joint space narrowing and some limited macromolecule compositional assessments of the soft tissue [53–56]. From another perspective, Afara et al. have reported NIR spectroscopic analysis of animal and human knee joint cartilage with a macroscale optical fiber probe, sufficient to differentiate healthy vs diseased tissue and track biochemical changes. Here the probe was pressed against tissue samples with tip window diameter 2 mm (total ø 3.25 mm) to take an average of 10 co-added scans (30 ms exposure) from the centre of each sample. Samples were chosen from biopsy plugs 4 or 8 mm in diameter, or measurements taken from intact tissue with 3.25 mm separation [57–59].
Our methodology is consistent with the above as in our case slices of cartilage tissue up to 10 mm in diameter were taken from across the intact femoral head surface, and 10 single-point spectral measurements captured about the centre of each slice, with separation of the order of 100’s µm. However, transreflectance measurements necessitate the use of a highly reflective substrate. Native in vivo tissue classification will require diffuse reflectance measurements (subchondral bone as natural substrate), nevertheless, NIR-SWIR results presented here enable us to provide proof-of-concept.
3.2. Multivariate analysis modeling: improved tissue classification using spectromics
Raman scattering spectra, NIR-SWIR absorption spectra, and Concatenated spectra were assessed under multivariate statistical analysis to determine tissue classification accuracy. The data represented a mean average spectra characteristic for each of n = 13 osteoarthritis and n = 14 control model patients (one spectrum per patient).
Classification via PCA clustering alone proved inconclusive for each spectral modality since significant overlap existed between groupings of control and OA patients This could be due to the relatively small number of total patient samples, exacerbated by the lack of spatially correlated Raman and NIR-SWIR data, motivating the use of patient averaged spectra. LDA based on the first (most informative) 20, 10, and 5 PC’s showed sufficient inter-group differences, between control (negative class) and OA cartilage (positive), measured in Mahanobolis distance, displayed in Fig. 2 [45]. Quality parameters were calculated from the corresponding proportion of TP, FN, TN, and FP segregation of samples, summarized in Table 2 [44].
Fig. 2.
Principal Component Analysis - Linear Discriminant Analysis (PCA-LDA) classification of Raman scattering spectra, NIR-SWIR absorption scattering and the concatenated spectra. LDA based on (a.) 20 (b.) 10 and (c.) 5 principal components display marked improvement in classification for the combined spectra compared to individual analysis, illustrated by the lack of overlap and/or greater degree of separation (greater Mahanolobis distance).
Table 2. Quality parameters to quantify classification performance of PCA-LDA models a .
| Accuracy | Sensitivity | Specificity | F-Score | |
|---|---|---|---|---|
| 20 Principal Components | ||||
|
| ||||
| Raman | 100.0 | 100.0 | 100.0 | 100.0 |
| NIR-SWIR | 100.0 | 100.0 | 100.0 | 100.0 |
| Concatenated | 100.0 | 100.0 | 100.0 | 100.0 |
|
| ||||
| 10 Principal Components | ||||
|
| ||||
| Raman | 77.8 | 76.9 | 78.6 | 77.7 |
| NIR-SWIR | 62.3 | 53.8 | 71.4 | 61.4 |
| Concatenated | 100.0 | 100.0 | 100.0 | 100.0 |
|
| ||||
| 5 Principal Components | ||||
|
| ||||
| Raman | 55.6 | 84.6 | 28.6 | 42.7 |
| NIR-SWIR | 33.3 | 7.7 | 57.1 | 13.6 |
| Concatenated | 85.2 | 92.3 | 78.6 | 84.9 |
Quality parameters to quantify classification performance of PCA-LDA models built upon the foremost 20, 10 and 5 principal components of the Raman, NIR-SWIR and concatenated fingerprints. Ideal result is 100% for each.
Results showed a marked performance enhancement in modeling accuracy for the concatenated fingerprint over Raman and NIR-SWIR fingerprints alone. Initially, models built upon 20 PC’s were investigated to assess the effect of dimensionality reduction on classification performance. These showed classes completely segregated (100%) for all spectral modes and so acted as our positive control. We were motivated to work with a reduced number of features in order to avoid susceptibility to overfitting when modeling data with a high number of predictors (wavenumbers) to samples (patients), which likely explain the very high performance when using 20 PC’s.
Still, should tissue samples become more complex, approaches to maximize segregation are valuable. Samples with smaller variance, such as when investigating disease type/severity, or factors such as comorbidities, sex, and age, will require more sensitive segregation for clinical application. Thus the mean and median Mahanolobis distance were quantified as a measure of tissue classification performance between the spectral modes. The greater the difference, the greater the discrimination between tissue classes. A marked improvement for the concatenated fingerprint over Raman and NIR-SWIR fingerprints alone was observed (summarized in Table 3).
Table 3. Difference between mean and median Mahalobian distance in PCA-LDA a .
| Raman | NIR-SWIR | Concatenated | ||
|---|---|---|---|---|
| 20 PC’s PCA-LDA Difference in Mahalobian distance (x10−2) | ||||
|
| ||||
| Median | 1.58 | 1.37 | 8.98 | |
| (570/ 654% increase) | ||||
| Mean | 1.54 | 0.07 | 8.81 | |
| (572/ 1220% increase) | ||||
Difference between mean and median Mahalobian distance to quantify tissue classification achieved via PCA-LDA modeling of articular cartilage built upon the 20 foremost principal components of the Raman, NIR-SWIR and concatenated fingerprints. Ideal result is a larger distance between clusters.
An approximately 5- to 6-fold improvement in classification was demonstrated with the concatenated fingerprint over Raman and NIR-SWIR alone. These results indicate the augmented information content from both Raman and NIR-SWIR active vibrations may be responsible for improvement in tissue classification.
For models built on 10 PC’s, accuracy increased from 77.8% and 62.3% for Raman and NIR-SWIR, respectively, to 100.0% segregation of the concatenated fingerprint spectra, showing proof-of-principal for the added information afforded by multimodal Spectromics.
Finally, for models built using only 5 PC’s, both Raman and NIR-SWIR failed to show effective tissue classification, describing poor accuracies (and F-Scores) at 55.6% (42.7%) and 33.3% (13.6%), respectively. By contrast, Spectromics showed significantly high classification performance with 85.2% accuracy and an F-Score of 84.9%.
This highlights the beneficial application of multimodal Spectromics, suitable to work with a reduced number of PC’s, a stand-in for poorer fidelity data, showing a spectral analysis approach compatible with reduced computational expense and headroom for greater clinical complexity.
3.3. Machine learning modeling: improved tissue classification under spectromics
Spectral fingerprints formed from Raman scattering spectra, NIR-SWIR absorption spectra, and concatenated spectra were used to build a support vector machine for model “Control vs OA Cartilage” to determine tissue classification accuracy. These represented one spectrum for each of 13 osteoarthritis and 14 control model patients. Classification accuracies of SVM with 3-fold cross validation with leave-one-out cross validation shown in Fig. 3 and summarized in Table 4.
Fig. 3.
Support Vector Machine classification model (OA vs control tissue) for Raman scattering, NIR-SWIR absorption scattering and the concatenated spectra. SVM confusion matrices (correct classification in green, incorrect in red) and associated classification rates displayed for the test dataset generated with optimal tuning parameters (c,γ) extracted from a grid search of the training dataset. The algorithm was carried out on a (a.) 3-fold and (b.) leave-one-out cross validator and shows improved classification when employing the concatenated spectra over Raman and NIR-SWIR alone.
Table 4. Percentage classification accuracies rating performance of SVM models “Control vs OA Cartilage” a .
| Control/ Control | Control/ OA | OA/ Control | OA/OA | |
|---|---|---|---|---|
| 3-fold Cross Validator | ||||
|
| ||||
| Raman | 85.7 | 14.3 | 18.9 | 81.1 |
| NIR-SWIR | 73.3 | 26.7 | 15.0 | 85.0 |
| Concatenated | 95.2 | 4.8 | 20.0 | 80.0 |
|
| ||||
| Leave-One-Out Cross Validator | ||||
|
| ||||
| Raman | 92.9 | 7.1 | 23.1 | 76.9 |
| NIR-SWIR | 85.7 | 14.3 | 23.1 | 76.9 |
| Concatenated | 100.00 | 0.00 | 23.1 | 76.9 |
Percentage classification accuracies rating performance of SVM models “Control vs OA Cartilage” based on data of the Raman, NIR-SWIR and concatenated fingerprints. Ideal result is 100% for Control/Control and OA/OA classes, and 0% for mismatched classes.
The results clearly describe the improved classification under the spectromics fingerprint over modeling with Raman or NIR-SWIR alone. Specifically, recall for the control class (specificity) showed improvement from 85.7% and 73.3% for Raman and NIR-SWIR, respectively, up to 95.2% for the spectromics fingerprint under 3-fold cross validation SVM.
Under leave-one-out cross validation, where each spectrum in turn is treated as the test data and the remaining used to train the model. This approach showed an improvement again from 92.9% and 85.7%, for Raman and NIR-SWIR, respectively, to 100% accuracy for spectromics. However, recall for the positive OA class (sensitivity) showed mixed results with slightly lower rates for 3-fold SVM (Raman at 81.1%, NIR-SWIR at 85.0%, down to 80.0% for spectromics) and no difference under leave-one-out SVM observed.
Since the above evaluations were taken for each class in isolation, quality parameters were calculated from the accumulated hits to assess the classification model as a whole. Here the number of TP, TN, FP, and FN classified spectra was determined from the accumulated hits (summarized in Table 5).
Table 5. Quality parameters based on accumulated hits of Support Vector Machine model “Control vs OA” a .
| Accuracy | Sensitivity | Specificity | F-Score | |
|---|---|---|---|---|
| 3-fold Cross Validator | ||||
|
| ||||
| Raman | 77.8 | 76.9 | 78.6 | 77.7 |
| NIR-SWIR | 77.8 | 84.6 | 71.4 | 77.5 |
| Concatenated | 85.2 | 76.9 | 92.9 | 84.1 |
|
| ||||
| Leave-One-Out Cross Validator | ||||
|
| ||||
| Raman | 85.2 | 76.9 | 92.9 | 84.1 |
| NIR-SWIR | 81.5 | 76.9 | 85.7 | 81.1 |
| Concatenated | 88.9 | 76.9 | 100.0 | 87.0 |
Quality parameters based on accumulated hits of Support Vector Machine model “Control vs OA” to assess modeling performance. Ideal results are 100% for each.
Accuracy and F-Score of the whole model show marked improvement, considering the imbalanced classes, namely n = 14 control and n = 13 OA [44]. Under 3-fold cross validated training, accuracy improved from 77.8% for both Raman and NIR-SWIR, to 85.2% for spectromics; F-Score from 77.7% for Raman and 77.5% for NIR-SWIR, to 84.1% for spectromics. Similar improvements were seen under leave-one-out training, with accuracy improvement from 85.2% for Raman, 81.5% for NIR-SWIR, to 88.89% for spectromics; F-score from 84.1% for Raman, 81.1% for NIR-SWIR, to 87.0% for spectromics.
Although the reported improvements were relatively modest (∼10% in specificity) this nonetheless highlights the rich chemometric information afforded by vibrational spectroscopy and the added benefit of employing the enhanced fingerprint, with specificity as high as 100%. This latter result suggests the control cartilage spectra contained greater inter-sample consistency, drawing attention perhaps to the disordering influence OA has on the tissue, resulting in relatively low sensitivity. Indeed, damage to the ECM, loss of vital PG and water, as the complex effects of the disease, would likely produce heterogeneity between patients.
Although the current data set clearly provides sufficient proof-of-concept, the work will require further validation, primarily through correlated spectromics data and an increase of number of patient samples. An increase in number of spectra will improve the training and subsequently the accuracy of both the PCA-LDA and SVM models. Thus, considering the low number of samples in each class, for which only one representative spectra (mean average) was used to represent each patient in each of the Raman, NIR-SWIR and Spectromics investigations, respectively, the proof-of-concept demonstrates high classification performance.
3.4. Identifying spectral regions most pertinent to OA diagnosis
To determine which regions of the fingerprint were the most significant contributors to discerning inter-class variation, a series of independent statistical tests were performed on the concatenated spectra. The results from each test (Fig. 4) were corroborated by identifying agreement between the highest scoring features (Supplement 1 (1.8MB, pdf) ). These corresponded to spectral regions most pertinent to tissue classification and in turn candidates for spectral biomarkers to distinguish control vs. OA AC tissue. These were then assigned to the chemical vibration/AC tissue constituent (Supplement 1 (1.8MB, pdf) ).
Fig. 4.
Candidate spectral biomarkers identified by corroboration between the highest scoring features extracted by separate statistical tests. (a.) Highest scoring features presented for Difference Between Mean Spectra, Cluster Vector Analysis, Student’s T-test, Mann-Whitney U-Test, Feature Forward Selection, and weightings for PCA-LDA. (b.) Corroborating the statistical tests sees certain spectral regions in agreement, displayed as regions scoring four or three hits.
Relevant Raman spectral features included peaks at 1343-1334 cm–1 (CH wagging) indicative of presence of glycosaminoglycan (GAG) proteins, and the typically quoted amide III band around here at 1255-1246 cm–1 (C-N stretching), both shown to correlate with progression of OA [18,25,27–29,60]. The amide II band has been employed as a prominent spectral delineator between human healthy and OA knee AC, and the red shift as an indicator of mechanical impact compression in porcine AC [25,31].
The peak at 1617-1616 cm–1 is representative of the amide I band (1612-1696 cm–1), with magnitude shown to decrease consistently with increasing grade of OA in human knee chondrocytes [61]. Spectral markers identified at 1064-1063 cm–1 agree strongly with the symmetric SO3– stretching of sulphated-GAG (sGAG) protein, the quantification of which can determine the degeneration state of ex vivo human AC [18,27,29].
Relevant NIR-SWIR spectral features included bands at 6543-6472 cm–1, attributed to the N-H stretch (-CONH, 1st overtone) related to proteoglycan (PG) protein absorptions. This is regarded indicative of the ECM, and found to be the best spectral region for prediction of OA severity via Mankin scoring (6411-6496 cm–1) [22]. The 4402-4366 cm–1 band can be attributed to the 2nd overtone of the C-H bend, indicative of ECM protein content, pertinent since Partial Least Squares (PLS) modeling of the region of 4000-6000 cm–1 has predicted relative content of collagen and PG proteins in AC with error as low as 6% [25].
The features at 7159/7094 cm–1 and 8767/8744 cm–1 describe O-H stretching (1st overtone) and C-H stretching (2nd overtone), respectively [18,22]. These have been attributed to water and collagen and PG proteins, their ratio forming a useful metric of relative water content (increasing during OA onset), Mankin score, and mechanical stiffness. The band centred about 5200 cm–1 has been used to accurately predict bound and free water through PLS regression modeling, represented by the spectromics biomarker found at 5357-5333 cm–1 [18,19].
Interestingly, the fundamental vibrational modes of O-H and C-H stretching have been explicitly identified as Raman AC disease markers and shown to correlate with the increase in total cartilage hydration and loss of tissue constituents, respectively, as indicators of lesions [27]. This highlights the benefit of the complimentary and encompassing nature of the spectromics approach.
Also highlighted are potential “spectromics-specific” biomarkers, namely at the discontinuous transition from Raman to NIR-SWIR data and at around 4402-4367 cm–1 in NIR-SWIR absorbance. Though these cannot be attributed to pre-existing spectral markers of cartilage, these still offer a systematic feature unique to each spectra, and potentially a discriminatory metric to aid tissue classification. Further implementation of the approach could elucidate such features.
4. Conclusions
This proof-of-concept study examined Raman scattering and NIR-SWIR absorption spectroscopy for chemometric assessment of articular cartilage tissue in an elegant low-level abstraction data fusion approach termed Spectromics. The improved classification potential of the enhanced spectromic fingerprint has been demonstrated and the potential for diagnosis of OA. This was proposed as method level innovation directly compatible with minimally invasive approaches with optical fibers and, conceivably, to enhance non-invasive deep NIR-SWIR approaches.
Binary classification models based on PCA-LDA and SVM have been assessed, which exhibited notable performance improvement when built upon the holistic Spectromics fingerprint over Raman or NIR-SWIR fingerprints alone.
PCA-LDA employing 20 PC’s presented high performance accuracy (acting as a positive control) with complete class segregation. This was likely susceptible to overfitting artefacts but, nevertheless, showed a 5-6 fold improvement under Spectromics, valuable as an approach to maximize class differences, useful for disease stratification.
PCA-LDA employing 10 PC’s showed an increase in accuracies of 77.8% and 62.3%, for Raman and NIR-SWIR, respectively, to 100% for Spectromics. Only the Spectromics approach showed sufficient classification over Raman and NIR-SWIR for models using 5 PC’s, showing robust performance when employing lower fidelity data, suitable also for reduced computational requirements.
SVM modeling similarly showed performance improvement ∼10% for class prediction using Spectromics over Raman and NIR-SWIR alone. Spectral features most contributing to the classification elucidated clinically relevant tissue components, proposed as potential label-free, non-invasive OA biomarkers, and demonstrated interpretable feedback from the Spectromics approach.
Future work will seek to improve performance of the approach by adopting a spatially correlated spectromic fingerprinting strategy and increasing the number of patient samples. In summary, the current proof-of-concept study demonstrates the potential of the spectromics approach for OA diagnosis with significant therapeutic diagnostic implications for an aging demographic.
Supplemental information
Acknowledgments
SM, RO and HC conceptualized the study. AC carried out the Raman spectroscopy work. HC acquired the NIR-SWIR spectra and developed the spectromics approach and carried out all of the analysis. DD collected the samples. KNB assisted in developing the setups for spectral acquisition. HC was supervised by both SM and RO. SM oversaw the project. HC wrote the first draft of the manuscript. SM and RO revised it. All authors contributed to the revised version and approved the final submission.
Funding
Engineering and Physical Sciences Research Council10.13039/501100000266 (EP/N509747/1, EP/T020997/1, EP/V038036/1).
Disclosures
The authors declare no conflicts of interest.
Data availability
Data underlying the results presented in this paper are available upon reasonable request.
Supplemental document
See Supplement 1 (1.8MB, pdf) for supporting content.
References
- 1.Swain S., Sarmanova A., Mallen C., et al. , “Trends in incidence and prevalence of osteoarthritis in the United Kingdom: findings from the Clinical Practice Research Datalink (CPRD),” Osteoarthr. Cartil. 28(6), 792–801 (2020). 10.1016/j.joca.2020.03.004 [DOI] [PubMed] [Google Scholar]
- 2.Safiri S., Kolahi A. A., Smith E., et al. , “Global, regional and national burden of osteoarthritis 1990-2017: A systematic analysis of the Global Burden of Disease Study 2017,” Ann. Rheum. Dis. 79(6), 819–828 (2020). 10.1136/annrheumdis-2019-216515 [DOI] [PubMed] [Google Scholar]
- 3.Novakofski K. D., Pownder S. L., Koff M. F., et al. , “High-resolution methods for diagnosing cartilage damage in vivo,” Ann. Rheum. Dis. 7(1), 39–51 (2016). 10.1177/1947603515602307 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bhosale A. M., Richardson J. B., “Articular cartilage: Structure, injuries and review of management,” Br. Med. Bull. 87(1), 77–95 (2008). 10.1093/bmb/ldn025 [DOI] [PubMed] [Google Scholar]
- 5.Braun H. J., Gold G. E., “Diagnosis of osteoarthritis: Imaging,” Bone 51(2), 278–288 (2012). 10.1016/j.bone.2011.11.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Zhang W., Doherty M., Peat G., et al. , “EULAR evidence-based recommendations for the diagnosis of knee osteoarthritis,” Ann. Rheum. Dis. 69(3), 483–489 (2010). 10.1136/ard.2009.113100 [DOI] [PubMed] [Google Scholar]
- 7.Ryd L., Brittberg M., Eriksson K., et al. , “Pre-Osteoarthritis,” Cartilage 6(3), 156–165 (2015). 10.1177/1947603515586048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Sinusas K., “Osteoarthritis: diagnosis and treatment,” Am. Fam. Physician 85(1), 49–56 (2012). [PubMed] [Google Scholar]
- 9.Michael J. W. P., Schlüter-Brust K. U., Eysel P., “Epidemiologie, ätiologie, diagnostik und therapie der gonarthrose,” Dtsch. Arztebl. 107(9), 152–162 (2010). 10.3238/arztebl.2010.0152 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gademan M. G. J., Hofstede S. N., Vliet Vlieland T. P. M., et al. , “Indication criteria for total hip or knee arthroplasty in osteoarthritis: a state-of-the-science overview,” BMC Musculoskelet. Disord. 17(1), 463 (2016). 10.1186/s12891-016-1325-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Chu C. R., Williams A. A., Coyle C. H., et al. , “Early diagnosis to enable early treatment of pre-osteoarthritis,” Arthritis Res. Ther. 14(3), 212 (2012). 10.1186/ar3845 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Li X., Wang S., Liu W., et al. , “Causal effect of physical activity and sedentary behaviors on the risk of osteoarthritis: a univariate and multivariate Mendelian randomization study,”Sci. Rep. 1–10 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Golovynskyi S., Golovynska I., Stepanova L. I., et al. , “Optical windows for head tissues in near-infrared and short-wave infrared regions: approaching transcranial light applications,” J. Biophotonics 11(12), 1–12 (2018). 10.1002/jbio.201800141 [DOI] [PubMed] [Google Scholar]
- 14.Sordillo L. A., Sordillo D. C., Shi L., et al. , “SWIR windows as an adjunctive to biopsy for distinguishing and monitoring benign and malignant tissues,” February 11234, 50 (2020). 10.1117/12.2550311 [DOI] [Google Scholar]
- 15.Sordillo L. A., Shi L., Sordillo D., et al. , “Advances in medical applications using SWIR light in the wavelength range from 1000 to 2500 nm,” (March 2019), 28 (2019).
- 16.Afara I. O., Shaikh R., Nippolainen E., et al. , “Characterization of connective tissues using near-infrared spectroscopy and imaging,” Nat. Protoc. 16(2), 1297–1329 (2021). 10.1038/s41596-020-00468-z [DOI] [PubMed] [Google Scholar]
- 17.Padalkar M. V., Pleshko N., “Wavelength-dependent penetration depth of near infrared radiation into cartilage,” Analyst 140(7), 2093–2100 (2015). 10.1039/C4AN01987C [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Rieppo L., Töyräs J., Saarakkala S., “Vibrational spectroscopy of articular cartilage,” Appl. Spectrosc. Rev. 52(3), 249–266 (2017). 10.1080/05704928.2016.1226182 [DOI] [Google Scholar]
- 19.Afara I. O., Oloyede A., “Resolving the Near-Infrared Spectrum of Articular Cartilage,” Cartilage 13(1_suppl), 729S–737S (2021). 10.1177/19476035211035417 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Afara I. O., Hauta-Kasari M., Jurvelin J. S., et al. , “Optical absorption spectra of human articular cartilage correlate with biomechanical properties, histological score and biochemical composition,” Physiol. Meas. 36(9), 1913–1928 (2015). 10.1088/0967-3334/36/9/1913 [DOI] [PubMed] [Google Scholar]
- 21.Afara I., Prasadam I., Crawford R., et al. , “Non-destructive evaluation of articular cartilage defects using near-infrared (NIR) spectroscopy in osteoarthritic rat models and its direct relation to Mankin score,” Osteoarthr. Cartil. 20(11), 1367–1373 (2012). 10.1016/j.joca.2012.07.007 [DOI] [PubMed] [Google Scholar]
- 22.Afara I. O., Prasadam I., Arabshahi Z., et al. , “Monitoring osteoarthritis progression using near infrared (NIR) spectroscopy,” Sci. Rep. 7(1), 11463 (2017). 10.1038/s41598-017-11844-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Olumegbon I. A., Oloyede A., Afara I. O., “Near-infrared (NIR) spectroscopic evaluation of articular cartilage: A review of current and future trends,” Appl. Spectrosc. Rev. 52(6), 541–559 (2017). 10.1080/05704928.2016.1250010 [DOI] [Google Scholar]
- 24.ElMasry G., Nakauchi S., “Prediction of meat spectral patterns based on optical properties and concentrations of the major constituents,” Food Sci. Nutr. 4(2), 269–283 (2016). 10.1002/fsn3.286 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Yu C., Zhao B., Li Y., et al. , “Vibrational spectroscopy in assessment of early osteoarthritis—a narrative review,” Int. J. Mol. Sci. 22(10), 5235 (2021). 10.3390/ijms22105235 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Afara I. O., Sarin J. K., Ojanen S., et al. , “Machine learning classification of articular cartilage integrity using near infrared spectroscopy,” Cell. Mol. Bioeng. 13(3), 219–228 (2020). 10.1007/s12195-020-00612-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Gaifulina R., Nunn A. D. G., Draper E. R. C., et al. , “Intra-operative Raman spectroscopy and ex vivo Raman mapping for assessment of cartilage degradation,” Clin. Spectrosc. 3(March), 100012 (2021). 10.1016/j.clispe.2021.100012 [DOI] [Google Scholar]
- 28.Casal-Beiroa P., Balboa-Barreiro V., Oreiro N., et al. , “Optical biomarkers for the diagnosis of osteoarthritis through raman spectroscopy: Radiological and biochemical validation using ex vivo human cartilage samples,” Diagnostics 11(3), 546 (2021). 10.3390/diagnostics11030546 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Richardson W., Wilkinson D., Wu L., et al. , “Ensemble multivariate analysis to improve identification of articular cartilage disease in noisy Raman spectra,” J. Biophotonics 8(7), 555–566 (2015). 10.1002/jbio.201300200 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Unal M., Akkus O., “Shortwave-infrared Raman spectroscopic classification of water fractions in articular cartilage ex vivo,” J. Biomed. Opt. 23(01), 1 (2018). 10.1117/1.JBO.23.1.015008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Oshima Y., Kiyomatsu H., Miura H., et al. , “Evaluation of degenerative changes in articular cartilage of osteoarthritis by Raman spectroscopy,” in Imaging, Manipulation, and Analysis of Biomolecules, Cells, and Tissues XVI , Farkas D. L., Nicolau D. V., Leif R. C., eds. (SPIE, 2018), p. 45. [Google Scholar]
- 32.Mason D., Murugkar S., Speirs A. D., “Measurement of cartilage sub-component distributions through the surface by Raman spectroscopy-based multivariate analysis,” J. Biophotonics 14(1), e202000289 (2021). 10.1002/jbio.202000289 [DOI] [PubMed] [Google Scholar]
- 33.Albro M. B., Bergholt M. S., St-Pierre J. P., et al. , “Raman spectroscopic imaging for quantification of depth-dependent and local heterogeneities in native and engineered cartilage,” npj Regen. Med. 3(1), 3–11 (2018). 10.1038/s41536-018-0042-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bergholt M. S., St-Pierre J. P., Offeddu G. S., et al. , “Raman spectroscopy reveals new insights into the zonal organization of native and tissue-engineered articular cartilage,” ACS Cent. Sci. 2(12), 885–895 (2016). 10.1021/acscentsci.6b00222 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Raj P., Wu L., Almeida C., et al. , “Shining Light on Osteoarthritis: Spatially Offset Raman Spectroscopy as a Window into Cartilage Health,” bioRxiv, (2023). 10.1101/2023.08.14.553328 [DOI] [PubMed]
- 36.Crisford A., Cook H., Bourdakos K., et al. , “Harnessing Raman spectroscopy and Multimodal Imaging of Cartilage for Osteoarthritis Diagnosis,” medRxiv, (2023). 10.1101/2023.09.05.23294936 [DOI]
- 37.Huang X., Teye E., Sam-Amoah L. K., et al. , “Rapid measurement of total polyphenols content in cocoa beans by data fusion of NIR spectroscopy and electronic tongue,” Anal. Methods 6(14), 5008–5015 (2014). 10.1039/C4AY00223G [DOI] [Google Scholar]
- 38.Li M., Xue J., Du Y., et al. , “Data fusion of raman and near-infrared spectroscopies for the rapid quantitative analysis of methanol content in methanol-gasoline,” Energy and Fuels 33(12), 12286–12294 (2019). 10.1021/acs.energyfuels.9b03021 [DOI] [Google Scholar]
- 39.Zhu S., Song Z., Shi S., et al. , “Fusion of near-infrared and raman spectroscopy for in-line measurement of component content of molten polymer blends,” Sensors (Switzerland 19(16), 3463 (2019). 10.3390/s19163463 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Buckwalter J. A., Mow V. C., Ratcliffe A., “Restoration of injured or degenerated articular cartilage,” J. Am. Acad. Orthop. Surg. 2(4), 192–201 (1994). 10.5435/00124635-199407000-00002 [DOI] [PubMed] [Google Scholar]
- 41.Trevisan J., Angelov P. P., Scott A. D., et al. , “IRootLab: a free and open-source MATLAB toolbox for vibrational biospectroscopy data analysis,” Bioinformatics 29(8), 1095–1097 (2013). 10.1093/bioinformatics/btt084 [DOI] [PubMed] [Google Scholar]
- 42.Devitt G., Howard K., Mudher A., et al. , “Raman spectroscopy: an emerging tool in neurodegenerative disease research and diagnosis,” ACS Chem. Neurosci. 9(3), 404–420 (2018). 10.1021/acschemneuro.7b00413 [DOI] [PubMed] [Google Scholar]
- 43.Devitt G., Crisford A., Rice W., et al. , “Conformational fingerprinting of tau variants and strains by Raman spectroscopy,” RSC Adv. 11(15), 8899–8915 (2021). 10.1039/D1RA00870F [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Morais C. L. M., Lima K. M. G., Singh M., et al. , “Tutorial: multivariate classification for vibrational spectroscopy in biological samples,” Nat. Protoc. 15(7), 2143–2162 (2020). 10.1038/s41596-020-0322-8 [DOI] [PubMed] [Google Scholar]
- 45.Lister A. P., Highmore C. J., Hanrahan N., et al. , “Multi-excitation Raman spectroscopy for label-free, strain-level characterization of bacterial pathogens in artificial sputum media,” Anal. Chem. 94(2), 669–677 (2022). 10.1021/acs.analchem.1c02501 [DOI] [PubMed] [Google Scholar]
- 46.Shen X., Ye S., Xu L., et al. , “Study on baseline correction methods for the Fourier transform infrared spectra with different signal-to-noise ratios,” Appl. Opt. 57(20), 5794 (2018). 10.1364/AO.57.005794 [DOI] [PubMed] [Google Scholar]
- 47.Martyna A., Menżyk A., Damin A., et al. , “Improving discrimination of Raman spectra by optimising preprocessing strategies on the basis of the ability to refine the relationship between variance components,” Chemom. Intell. Lab. Syst. 202, 104029 (2020). 10.1016/j.chemolab.2020.104029 [DOI] [Google Scholar]
- 48.Lister A., Avershina E., Ali J., et al. , “Multi-excitation Raman spectroscopy complements whole genome sequencing for rapid detection of bacterial infection and resistance in WHO priority pathogens,” bioRxiv (2022). 10.1101/2022.02.08.479540 [DOI]
- 49.Martin F. L., German M. J., Wit E., et al. , “Identifying variables responsible for clustering in discriminant analysis of data from infrared microspectroscopy of a biological sample,” J. Comput. Biol. 14(9), 1176–1184 (2007). 10.1089/cmb.2007.0057 [DOI] [PubMed] [Google Scholar]
- 50.Camilo L., Medeiros De Morais, “Novel chemometric approaches towards handling biospectroscopy datasets,”PhD diss., University of Central Lancashire; (2020). [Google Scholar]
- 51.Callery E. L., Morais C. L. M., Paraskevaidi M., et al. , “New approach to investigate common variable immunodeficiency patients using spectrochemical analysis of blood,” Sci. Rep. 9(1), 7239 (2019). 10.1038/s41598-019-43196-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Guyon I., Elisseeff A., “An introduction to variable and feature selection,” J. Mach. Learn. Res. 3(7–8), 1157–1182 (2003). [Google Scholar]
- 53.Liu C., Liu C., Ren X., et al. , “Quantitative evaluation of subchondral bone microarchitecture in knee osteoarthritis using 3 T MRI,” BMC Musculoskelet. Disord. 18(1), 496 (2017). 10.1186/s12891-017-1865-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Li X., Majumdar S., “Quantitative MRI of articular cartilage and its clinical applications,” J. Magn. Reson. Imaging 38(5), 991–1008 (2013). 10.1002/jmri.24313 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Aguilar H. N., Battié M. C., Jaremko J. L., “MRI-based hip cartilage measures in osteoarthritic and non-osteoarthritic individuals: a systematic review,” RMD Open 3(1), e000358 (2017). 10.1136/rmdopen-2016-000358 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Mourad C., Vande Berg B., “Osteoarthritis of the hip: is radiography still needed?” Skeletal Radiol. 52(11), 2259–2270 (2023). 10.1007/s00256-022-04270-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Nippolainen E., Shaikh R., Virtanen V., et al. , “Near infrared spectroscopy enables differentiation of mechanically and enzymatically induced cartilage injuries,” Ann. Biomed. Eng. 48(9), 2343–2353 (2020). 10.1007/s10439-020-02506-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Sarin J. K., Nykänen O., Tiitu V., et al. , “Arthroscopic determination of cartilage proteoglycan content and collagen network structure with near-infrared spectroscopy,” Ann. Biomed. Eng. 47(8), 1815–1826 (2019). 10.1007/s10439-019-02280-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Linus A., Tanska P., Sarin J. K., et al. , “Visible and near-infrared spectroscopy enables differentiation of normal and early osteoarthritic human knee joint articular cartilage,” Ann. Biomed. Eng. 51(10), 2245–2257 (2023). 10.1007/s10439-023-03261-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Lim N. S. J., Hamed Z., Yeow C. H., et al. , “Early detection of biomolecular changes in disrupted porcine cartilage using polarized Raman spectroscopy,” J. Biomed. Opt. 16(1), 017003 (2011). 10.1117/1.3528006 [DOI] [PubMed] [Google Scholar]
- 61.Kumar R., Singh G., Grønhaug K., et al. , “Single cell confocal raman spectroscopy of human osteoarthritic chondrocytes: a preliminary study,” Int. J. Mol. Sci. 16(5), 9341–9353 (2015). 10.3390/ijms16059341 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data underlying the results presented in this paper are available upon reasonable request.




