Skip to main content
PLOS One logoLink to PLOS One
. 2025 Jan 30;20(1):e0316522. doi: 10.1371/journal.pone.0316522

Application of attenuated total reflection–Fourier transform infrared spectroscopy in semi-quantification of blood lipids and characterization of the metabolic syndrome

Tz-Ping Gau 1,2,3, Jen-Hung Wen 4, I-Wei Lu 5, Pei-Yu Huang 6, Yao-Chang Lee 6,7,8, Wei-Po Lee 3, Hsiang-Chun Lee 9,10,11,*
Editor: Rajesh Kumar Singh12
PMCID: PMC11781649  PMID: 39883744

Abstract

Background/Purpose

Dyslipidemia, a hallmark of metabolic syndrome (MetS), contributes to atherosclerotic and cardiometabolic disorders. Due to days-long analysis, current clinical procedures for cardiotoxic blood lipid monitoring are unmet. This study used AI-assisted attenuated total reflectance Fourier transform infrared (ATR-FTIR) spectroscopy to identify MetS and precisely quantify multiple blood lipid levels with a blood sample of 0.5 µl and the assaying time is approximately 10 minutes.

Methods

ATR-FTIR spectroscopy with 1738 data points in the spectral range of 4000–650 cm−1 was used to analyze the blood samples. An adaptive synthetic technique was used to establish a prevalence-balanced dataset. LDL-C, HDL-C, TG, VLDL-C, and cholesterol levels were defined as the predicted targets of lipid absorption profiles. Linear regression (LR), gradient boosting regression tree (GBT), and histogram-based gradient boosting regression tree (HGBTR) were used to train the models. Lipid profile value prediction was evaluated using R2 and MAE, whereas MetS prediction was evaluated using area under the ROC curve.

Results

A total of 150 blood samples from 25 individuals without MetS and 25 with MetS yielded 491 spectral measurements. In the regression models, HGBT best predicted the targets of TG, CHOL, HDL-C, LDL-C, and VLDL-C with R2 values of 0.854 (0.12), 0.684 (0.08), 0.758 (0.10), and 0.419 (0.11), respectively. The classification model with the greatest AUC was RF (0.978), followed by HGBT (0.972) and GBT (0.967).

Conclusion

The results of this study revealed that predicting MetS and determining blood lipid levels with high R2 values and limited errors are feasible for monitoring during therapy and intervention.

Introduction

Metabolic syndrome (MetS) is the leading cause of cardiometabolic diseases and has become a significant health concern worldwide [1,2]. According to the National Cholesterol Education Program (NCEP), MetS is diagnosed based on elevated glucose levels, reduced HDL cholesterol, elevated triglycerides (TG), obesity, and hypertension [3]. Beyond low-density lipoprotein cholesterol (LDL-C), TG-rich lipoproteins, such as very-low-density lipoprotein (VLDL), have been shown to cause cardiac remodeling [46], particularly by inducing atrial dilation, which is a hallmark of atrial myopathy and represents a risk for atrial fibrillation (AF). Other hazardous lipids, such as diacylglycerols and ceramides, can promote atrial myopathy by promoting mitochondrial dysfunction and apoptosis [7]. Some saturated free fatty acids can affect cardiac ion channels by activating macrophages and IL-6 [8], and postprandial VLDL is correlated with atrial remodelling [5], suggesting that dietary fat and postprandial circulatory lipids can affect arrhythmogenicity [9]. Moreover, VLDL is toxic to the cardiovascular system and can induce vulnerability to AF in patients with MetS but not in healthy subjects [10,11].

Circulatory lipids, except for some albumin-binding fatty acids, are carried by lipoproteins. VLDL lipotoxicity is primarily affected by the lipid core, which consists of a variable abundance of lipid species. Quantification of lipid species in lipoproteins can be performed using nuclear magnetic resonance (NMR), mass spectrometry (MS), gas chromatography (GC), and Fourier transform-ion cyclotron resonance (FT-ICR) [12]. However, these costly and time-consuming methods are not feasible for daily applications in clinical scenarios. Moreover, none of them can be used as a monitoring assay when repeated testing is required, for instance, when assessing blood lipid changes after a nutritional intervention or exercise workout.

Attenuated total reflectance Fourier transform infrared (ATR-FTIR) spectroscopy is a technique for analyzing the chemical composition of materials by detecting their characteristic absorption of mid-infrared light. In ATR-FTIR, molecules absorb mid-infrared light at specific wavenumbers (or frequencies), depending on their chemical bonds and structure. Each type of bond (e.g., C = O and C–H) has a unique absorption pattern, creating a molecular fingerprint. ATR-FTIR can quickly analyze the molecular fingerprint of an organic sample and reveal its principal biomolecular composition and interactions and has been validated in a broad spectrum of biomedical applications, such as SARS-CoV-2 detection [13,14], urinalysis for assessing renal hyperfiltration [15], and the detection of HIV infection [16]. The advantages of FTIR, with its quick assaying time and demand for tiny blood samples (0.5 µL, equivalent to a finger-prick sampling amount) prove that it is an ideal method for blood lipid analysis.

In ATR-FTIR, lipids in the blood samples produce clustered signals representing IR reflections from structures with a range of molecules conjugated to various proteins. Conventional algorithms for spectral analysis cannot categorize lipid species. Algorithms are used to directly estimate and quantify the amount of specific lipids to generate raw reflection data with complex interactions. Therefore, an effective analytical method for ATR-FTIR is required to facilitate its application in lipid analyses.

Data-driven processing enables rapid development and application of artificial intelligence (AI). The introduction of the gradient boosting tree model (GBT), linear regression model (LR), and histogram-based gradient boosting tree model (HGBT), an enhanced variant of the GBT model, makes machine learning useful in complex biomedical prediction. Some examples include the prediction of progression-free survival in prostate cancer [17], estimation of delivery date in pregnancy [18], and evaluation of one-year mortality in patients receiving coronary stenting [19]. AI-assisted ATR-FTIR spectroscopy has great potential as a rapid, economical, and resilient diagnostic platform for detecting and monitoring metabolic disorders. Thus, it can be widely used in the clinical setting. In this study, we combined machine learning and ATR-FTIR data to develop a valuable method for distinguishing between patients with and without MetS and for predicting conventional laboratory test results.

Methods

Setting and participants

The study protocol was approved by the Kaohsiung Medical University Hospital Institutional Review Board (KMUHIRB-F(II)-20210091) and was implemented in accordance with the Good Clinical Practice Guidelines, the principles of the Declaration of Helsinki, and all applicable legislation and regulations of the Kaohsiung Medical University Hospital. This retrospective study accessed data from 2021/05/25, and all authors had access to information that could identify individual information after data collection. All study participants provided their informed written consent. Two blood samples were collected from each subject and centrifuged to keep the serum frozen for the ATR-FTIR experiments. Clinical and laboratory data were obtained. The study participants were divided into two groups based on the presence or absence of MetS. The clinical diagnosis of MetS is defined by the existence of at least three criteria as follows: waist circumference (greater than or equal to 90 cm in men and greater than or equal to 80 cm in women), blood pressure (≥ 130/85 mmHg), blood glucose level (fasting state greater than or equal to 100 mg/dl), TG (greater than 150 mg/dL), and high-density lipoprotein cholesterol (HDL-C level, for male less than or equal to 40 mg/dL, for female less than or equal to 50 mg/dL).

ATR-FTIR spectral data acquisition

All ATR-FTIR spectral data were obtained at Taiwan Light Source (TLS) BL14A1 of the National Synchrotron Radiation Research Center (NSRRC) in Taiwan. This study applied a small sample volume (0.5 µL to a Ge ATR crystal, as shown in Fig 1(A). A modulated mid-infrared beam was directed into the Ge crystal, generating an evanescent wave at the crystal surface, allowing the light to interact with the sample. The sample absorbs light at characteristic wavenumbers, and the remaining light is detected using a liquid nitrogen-cooled mercury cadmium telluride (MCT) infrared detector. The resulting spectrum shows peaks corresponding to specific molecular bonds, providing a unique chemical profile of the sample. ATR-FTIR samples must be stored in a −80 °C freezer to ensure sample stability before measurement and brought to room temperature naturally before measurement. The samples were placed on the ATR crystal surface to allow moisture to evaporate naturally before the signal acquisition, ensuring that the moisture did not interfere with the spectra. If the moisture in the samples was not completely removed, interference appeared at 3500 cm−1, 2100 cm−1, and 1640 cm−1. A measurement time of approximately 5 min (~300 s) did not cause any degradation of the components.

Fig 1.

Fig 1

(A). The schematic depicts the attenuated total reflection for acquiring the FTIR spectra of a blood sample. The mid-infrared light is modulated using the Michaelson interferometer of an FTIR spectrometer. (B). Representative ATR-FTIR spectrometry. Raw data are obtained from plasma samples from one MetS and one non-MetS individual. The absorption regions of the major biomolecules are indicated.

Each ATR-FTIR spectral dataset of a blood sample contained 1738 data points in the spectral range 4000–650 cm−1 (Fig 1(B)). Three spectral ranges, 3000–2800 cm−1, 1800–1650 cm−1, and 1500–1350 cm−1, of characteristic lipid absorption were selected as inputs. The characteristic absorption bands of the blood samples for subsequent analysis included absorption bands (1) in the spectral range of 3000–2800 cm−1 and 1500–1350 cm−1 assigned to the methyl and methylene groups of lipid molecules, (2) at ~ 1740 cm−1 for the carbonyl group of lipids, (3) at ~ 1650 cm−1 assigned to C = O stretching of the peptide bond, and (4) in the spectral range of 1500–1000 cm−1 for the phosphodiester groups of nucleic acids, phospholipids, and side chains of peptides, and glycans [2023] as shown in Fig 1(B).

ATR-FTIR spectral analysis

All analyses and model calculations were performed using Python 3.9, NumPy, scikit-learn, imbalanced-learn, and matplotlib packages. The ATR-FTIR spectral data were used mainly to (1) determine the presence of MetS (classification task) and (2) predict the blood levels of several lipids (regression task) as follows: TG, total cholesterol (CHOL), HDL-C, low-density lipoprotein-cholesterol (LDL-C), and very-low-density lipoprotein cholesterol (VLDL-C). In conventional data analysis, data points that deviate significantly from the distribution are typically identified as outliers and eliminated to optimize the performance of the models. However, in this study, data outliers were not removed because all blood lipid levels were obtained from a trustworthy and standardized laboratory at the medical center.

Models used for data preparation

Based on the previously reported prevalence of MetS, approximately 33% [23] oversampling was performed to generate a balanced dataset. An adaptive synthetic approach (ADASYN) [24] was used to generate calibration data. The performance and credibility of the model were assessed using 10-fold cross-validation.

Models used for data analysis

The models used in the regression task include basic linear regression (LR), gradient boosting regression tree (GBTR), random forest regression tree, and histogram-based gradient boosting tree classifier (HGBTR), which have the advantage of starting the estimation quickly without loss of accuracy [25]. For the classification task, that is, the determination of MetS, logistic regression, random forest tree classifier (RF), gradient boosting tree classifier (GBT), and histogram-based gradient boosting tree classifier (HGBT) [25]. The model for each task was individually tested to determine the best target prediction results.

Metrics for model explanation

The relationships between the actual laboratory values and model-predicted values for each prediction target were presented in the residual plot to evaluate the regression task results, along with the calculated values of R squared (R2), mean absolute error (MAE), and mean absolute percentage error (MAPE), which were calculated by taking the average of the absolute differences between the predicted and actual values, divided by the actual values, and then expressed as a percentage. This can provide a clear and interpretable measure of prediction accuracy by expressing errors as a percentage of actual values. Subsequently, 10-fold cross-validation was used to calculate the mean and standard deviation (SD) of the above metrics across 10 models. A smaller standard deviation suggests that the performance of the model is relatively consistent across different folds.

The receiver operating characteristic curve (ROC) and confidence intervals (CI) were presented for 10-fold cross-validation to evaluate the classification task results. Other evaluation metrics included area under the curve (AUC), F1, precision, and recall. The mean and SD were also calculated.

The results of the regression models were plotted as the weights of the best-performing GBT model over a 10-fold range. The scaled weighting was from −1 to 1, greater than 0.5, or smaller than −0.5 for the selected absorption wavenumbers for further evaluation.

To determine MetS, t-distributed stochastic neighbor embedding (TSNE) [26] was used to verify the data distribution in a high-dimensional 2D plane. Regarding the model explanation for the classification task, we applied the SHAP (SHapley Additive exPlanations) values with a tree-based explainer [27] to determine the nine wavenumbers with the greatest impact. This method enabled us to assess the consistency between our findings and those in the current literature.

Results

Study subjects

This study included 25 non-MetS and 25 MetS patients (Table 1). The two groups were age- and sex-matched. The MetS group had higher incidences of hypertension, diabetes mellitus, body weight, waist circumference, and hip circumference. In the MetS group, 56% of the subjects were taking lipid-lowering drugs. The mean MetS score was 3.96 for the MetS group and 0.32 for the non-MetS group. In the laboratory data, the MetS group had greater TG, VLDL-C, non-HDL-C, uric acid, alanine aminotransferase (AST), and aspartate aminotransferase (ALT). HDL-C was significantly lower in the MetS group (Table 1). Upon examining the distribution of blood lipid values, we discovered that some patients with MetS exhibited extreme TG, VLDL, and LDL values. Prior to running the model, our data analysis indicated significant differences between the MetS and non-MetS groups for all parameters except CHOL (more information in S1 Appendix).

Table 1. Demographic and laboratory data.

Items Non-MetS
(n = 25)
MetS
(n = 25)
P value
Female (n)/ Male (n) 19/ 6 12/ 13 0.08
Age (years), mean (SD) 49.12 (6.66) 52.84 (7.20) 0.064
MetS score, mean (SD) 0.32 (0.75) 3.96 (0.89) <0.001
Hypertension, n (%) 0 (0.0) 24 (96.0) <0.001
Diabetes mellitus, n (%) 2 (8.0) 15 (60.0) <0.001
Height (cm), mean (SD) 161.66 (8.16) 162.52 (7.46) 0.699
Body weight (Kg), mean (SD) 58.68 (11.80) 75.69 (12.67) <0.001
BMI (Kg/m2), mean (SD) 22.38 (3.59) 28.52 (3.34) <0.001
Waist (cm), mean (SD) 75.07 (9.23) 94.60 (9.07) <0.001
Hip circumference (cm), mean (SD) 94.58 (7.69) 101.56 (7.10) 0.002
Systolic BP (mmHg), mean (SD) 115.60 (9.34) 147.88 (22.46) <0.001
Diastolic BP (mmHg), mean (SD) 70.32 (7.51) 90.20 (16.21) <0.001
Pulse rate (/min), mean (SD) 73.56 (9.96) 80.40 (13.26) 0.045
Use of lipid-lowering drugs* (%) 0 (0.0) 14 (56.0) <0.001
Laboratory data
Cholesterol (mg/dL), mean (SD) 194.28 (33.56) 185.29 (30.90) 0.329
Triglyceride (mg/dL), mean (SD) 89.82 (25.17) 226.30 (133.96) <0.001
HDL-C (mg/dL), mean (SD) 68.20 (12.03) 41.16 (10.00) <0.001
LDL-C (mg/dL), mean (SD) 122.59 (32.91) 119.33 (28.02) 0.708
VLDL-C (mg/dL), mean (SD) 5.90 (29.39) 24.80 (19.16) 0.01
Non-HDL-C (mg/dL), mean (SD) 69.62 (26.08) 144.13 (28.25) <0.001
Glucose (mg/dL), mean (SD) 85.41 (11.83) 118.59 (31.18) <0.001
HbA1C (%), mean (SD) 5.44 (0.32) 6.41 (1.15) 0.074
BUN (mg/dL), mean (SD) 14.02 (4.76) 15.55 (4.11) 0.231
Creatinine (mg/dL), mean (SD) 0.72 (0.14) 0.84 (0.27) 0.048
AST (IU/L), mean (SD) 20.20 (5.92) 25.41 (10.12) 0.031
ALT (IU/L), mean (SD) 17.48 (7.01) 29.38 (16.52) 0.002
Uric acid (mg/dL), mean (SD) 4.69 (0.99) 6.19 (1.47) <0.001

MetS, metabolic syndrome; BMI, body mass index; BP, blood pressure; HDL-C, high-density lipoprotein cholesterol; LDL-C, low-density lipoprotein cholesterol; VLDL-C, very-low-density lipoprotein cholesterol; HbA1c, hemoglobin A1c; BUN, blood urea; AST, aspartate aminotransferase; ALT, alanine aminotransferase.

*The lipid-lowering drugs include statins (atorvastatin, rosuvastatin, lovastatin), niacin, fenofibrate, and ezetimibe.

ATR-FTIR spectral data characteristics

Spectral data were collected following a standard calibration procedure, using 150 blood samples from 50 individuals. Each blood sample was repeatedly used to collect FTIR spectral data to generate 491 FTIR data (300 from MetS and 191 from non-MetS subjects). The ADASYN method was used to match the prevalence balance with 33% of MetS cases, and the dataset numbers were 300 for MetS and 609 for non-MetS cases. We categorized lipid values into five equally divided groups from maximum to minimum to examine potential spectral differences among varying lipid levels (S2 Appendix). The mean ATR-FTIR spectrum for each group is plotted as a line, with the shaded regions representing plus one standard deviation. This visualization highlights the trends in spectral differences across lipid groups. However, although these trends suggest potential relationships, they do not allow for direct quantitative conclusions or inferences. Overlapping or minimal differences in FTIR spectral absorption across lipid ranges further suggest that nonlinear associations may exist which are not directly observable in raw spectral data. All the figures are listed in S2 Appendix. Grouped Spectrum with Specified Lipid Value Ranges.

Regression task results

The correlation between the predicted values derived from the different regression models and actual laboratory values is shown in Fig 2 and Table 2. Prediction errors (residuals = actual values – predicted values) were also examined for TG, CHOL, HDL-C, LDL-C, and VLDL-C (Fig 2A2E). The best model for predicting all targets based on R2 was HGBT, whereas the model with the best MAE value was RF (Table 2). In our analysis, we observed that deviance decreased across most of the folds during the cross-validation training and testing phases. However, a few folds exhibited irregular patterns, possibly due to outliers in the data distribution (S3 Appendix for more information).

Fig 2. FTIR spectrometry values predict the plasma concentrations of cholesterol (CHOL) and triglycerides (TG).

Fig 2

Correlation between the predictive values of FTIR spectrometry and the reference data (left panel). Residual plots (right panels) of IR absorbance versus the blood concentrations of (A)TG, (B) total cholesterol, (C) HDL-C, (D) LDL-C, and (E) VLDL-C.

Table 2. The result of the regression task and classification task.

Regression task
Models LR RF GBT HGBT
Metrics R2 MAE MAPE (%) R2 MAE MAPE (%) R2 MAE MAPE (%) R2 MAE MAPE (%)
Prediction targets TG 0.13
(1.03)
67.73
(9.50)
41.9 0.82
(0.20)
21.33
(2.78)
12.3 0.80
(0.24)
23.69
(5.89)
14.1 0.85
(0.12)
21.58
(5.06)
12.5
CHOL 0.11
(0.63)
12.01
(2.05)
6.9 0.68
(0.09)
6.19
(1.00)
3.4 0.65
(0.11)
7.36
(1.05)
3.7 0.68
(0.08)
6.47
(0.96)
3.5
HDL-C 0.04
(0.58)
6.19
(1.06)
13.4 0.74
(0.13)
3.18
(0.54)
6.9 0.72
(0.15)
3.59
(0.57)
8.1 0.76
(0.10)
3.28
(0.56)
7.0
LDL-C 0.22
(0.68)
15.40
(3.20)
12.2 0.39
(0.15)
9.70
(1.21)
7.3 0.35
(0.16)
12.51
(1.27)
10.0 0.42
(0.11)
10.40
(1.23)
7.9
VDLD-C 0.04
(0.71)
9.36
(1.12)
109.9 0.79
(0.24)
3.18
(0.65)
38.0 0.78
(0.25)
3.46
(0.78)
42.1 0.80
(0.19)
3.43
(0.86)
41.6
Classification task
Precision Recall AUC F1 Accuracy
LR 0.613 (0.11) 0.809 (0.048) 0.852 (0.047) 0.694 (0.084) 0.772 (0.046)
RF 0.955 (0.037) 0.833 (0.054) 0.982 (0.01) 0.889 (0.035) 0.934 (0.014)
GBT 0.938 (0.066) 0.836 (0.038) 0.971 (0.018) 0.883 (0.033) 0.930 (0.013)
HGBT 0.931 (0.049) 0.854 (0.059) 0.976 (0.017) 0.889 (0.032) 0.932 (0.017)

LR, linear regression; RF, random forest tree classifier; GBT, gradient boosting tree classifier; HGBT, histogram-based gradient boosting tree classifier; TG, triglyceride; HDL-C, high-density lipoprotein cholesterol; LDL-C, low-density lipoprotein cholesterol; VLDL-C, very-low-density lipoprotein cholesterol; CHOL, cholesterol; R2, R squared; MAE, mean absolute error. AUC, area under the curve; F1, F1 score. All values are shown as the mean (SD). The best performances are indicated in bold.

In the residual evaluation (right panels of Fig 2A2E), the performance of LR (shown by green dots) was poor (R2 < 0.3) with large MAE values, whereas the performance of RF/GBT/HGBT was similar. Among the different lipid parameters, the prediction performance was the best for TG and the worst for LDL-C, particularly with the LR model. The widely used Friedewald formula for calculating LDL-C is known to have inherent errors when compared with direct blood measurements, although it has been reported to be the most accurate way to calculate fasting LDL-C in Chinese people [28]. For example, the MAPE and MAE for LDL-C, calculated using the Friedewald formula, were 77.5% and 92.3 mg/dL, respectively. In this study, our best model (Random Forest) achieves a significantly lower MAPE of 7.3% and an MAE of 9.7 mg/dL for LDL-C prediction.

Next, coefficient modeling was performed. The wavenumbers that were strongly correlated in predicting specific lipid parameters in the GBT model are shown in the heatmap (Fig 3). To predict TG, the data points were 2991, 1724, and 1722 cm−1. For LDL-C, the data included points at 2933, 2931, 2906, and 1766 cm−1. For HDL-C, the data included points at 2906, 2904, 2902, and 2898 cm−1. For Chol were at 2931, 2916, 2906, and 1755 cm−1. Furthermore, the data points were 2991 and 1722 cm−1 for VLDL-C, respectively.

Fig 3. Feature weights of the GBT prediction model for TG, CHOL, HDL-C, LDL-C, and VLDL-C.

Fig 3

The blue line represents the mean value of the FTIR spectra of total MetS, the mean value of the FTIR spectra of total MetS, and the orange dotted line represents the mean value of the FTIR spectra of total non-MetS. The degree of darkness in the vertical line indicates the magnitude of the correction. The unselected frequency bands are blank. The frequencies with an absolute coefficient larger than 0.5 are specifically labeled.

We further investigated the effect of MetS on the lipid value predictions of the model. The error distribution for our predictions showed that MetS subjects had more prediction errors than non-MetS subjects, indicating that the prediction accuracy was more accurate in the absence of MetS. To explore this further, we tested models trained and validated separately for MetS and non-MetS groups. The results indicated that the models trained on non-MetS data performed consistently well on non-MetS test data, whereas those trained on MetS data exhibited significant variability (S3 Appendix for more information).

These findings suggest that, while our model is adequate for general rapid scans, additional data and validation are necessary with a sufficient population size for patients with MetS. This also highlights the substantial variability in lipid disorders among patients with MetS. Moreover, FTIR provides an efficient and visualized screen to demonstrate lipid abnormalities in MetS (S4 Appendix for detailed information).

Classification task results

These models were used to examine their ability to classify MetS and non-MetS groups, as shown in Fig 3 and Table 2. The best model was RF, with an AUC of 0.978, followed by HGBT (AUC, 0.972), GBT (AUC, 0.967), and LR (AUC, 0.851) (Fig 4A). The F1 score was similar, approximately around 0.88 for RF, HBGT, and GBT, but 0.69 for LR. The results suggest that RF, HBGT, and GBT can distinguish between MetS and non-MetS using ATR-FTIR data.

Fig 4. MetS is distinct from non-MetS in the ATR-FTIR spectrometry analysis.

Fig 4

(A) ROCs curves of the different models in the MetS prediction task. The color bands represent the CI ranges. (B) The t-SNE method shows separated clustering between MetS and non-MetS spectrometry data. Light-blue dots represent non-MetS and red dots represent MetS. (C) The SHAP values for the GBT model in the MetS prediction task are presented in descending order, with features ranked from top to bottom based on their influence on prediction. (D) SHAP waterfall plots of the METS predictions. (E) SHAP partial-dependence plots for the absorption at 1660 cm−1. (F) SHAP partial-dependence plots for the absorption at 1653 cm−1. (G) SHAP partial-dependence plots for the absorption at 1655 cm−1.

Projection verification of all the samples was performed using the t-SNE method (Fig 4B). Each projection point indicates high-dimensional data on the 2D plane; the color marks MetS, and the point density reflects the similarity of the data. In the 2D projection, the distinction between most MetS and non-MetS samples was well illustrated; however, a few data points remained relatively similar and overlapped. The excellent AUC and F1 ratings of all three models (Table 2) indicate that MetS and non-MetS could be correctly classified using an appropriately constructed nonlinear model with FTIR spectral data. Concurrently, as shown in Fig 4C, the SHAP value analysis revealed that the top nine wavenumbers were 2843, 1496, 1653, 1699, 1655, 2891, 2935, 1410, and 2902 cm−1. Notably, four wavenumbers were within the 2800–3000 cm−1 range, two wavenumbers were within the 1350–1500 cm−1 range, and three were within the 1600–1700 cm−1 range. This finding indicates that when the task requires a comprehensive assessment rather than a single numerical value, the model considers all relevant lipid-associated frequency bands. SHAP waterfall plots identified bands that significantly contributed to MetS predictions (Fig 4D). For instance, wavenumbers of 1660, 1655, and 1657 cm−1 were found to be the most significant contributors to the model’s prediction of MetS (Fig 4EG). These plots revealed that wavenumbers of 1653, 1655, and 1660 cm−1 exhibited a stronger association with MetS predictions once they exceeded a certain threshold, further supporting the idea that FTIR detects long-chain lipid structures in the blood, which are crucial features for our model’s predictions.

Discussion

This study has proven our hypothesis that ATR-FTIR spectral data and appropriate machine learning methods can be used to accurately predict lipid levels in blood samples and distinguish between the absence and presence of MetS.

With the emergence of new lipid-lowering drugs based on various pharmaceutical targets, stratification of patients with MetS has become increasingly important. The more comprehensive the lipid disorders are determined, the more precisely the treatment will be chosen to optimize the efficacy and outcome benefits. Lipids are a broad group of organic molecules with structures that produce different infrared absorptions or emissions, which can be detected using FTIR spectrometry. FTIR spectrometry has been used to quantify the oil and fat content of food [2931]. For instance, it precisely detects different lipid fractions and free fatty acids in oleaginous yeasts [30]. Bel’skaya et al. published their work on using model solutions in applying FTIR spectroscopy for the quantitative analysis of blood serum [32]. Portaccio et al. published their study results on ATR-FTIR spectra of commercial lipid samples and demonstrated specific fingerprint regions in ATR-FTIR spectra for phospholipids (phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine, and phosphatidylinositol) from 3500 to 2800 cm−1 and 1800 to 600 cm−1, sphingolipids (ceramide, sphingosine 1-phosphate, ceramide 1-phosphate, and sphingomyelin) from 3500 to 2800 cm−1 and 1800 to 600 cm−1, and CHOL from 3150–2800 cm−1.[33] Their study proved the usefulness of FTIR spectra for analyzing complex lipids, and this application can ultimately contribute to lipid research and its implications for human health. Similarly, FTIR could precisely predict TG, CHOL, and different-density lipoproteins in this study. The best spectral region(s) for predicting total TG were 2927, 2925, 1718, and 1500 cm−1, 1718 cm−1 for CHOL, and 2937 cm−1 for HDL-C. Those were 2895, 2806, 2800, 1745, 1743, and 1716 cm−1 for LDL-C, and were 2931 and 1500 cm−1 for VLDL-C. The regions are close to those in a previous study, which evaluated 31 healthy individuals and suggested spectral ranges for lipids as 1741, 1747, 2854, and 2929 cm−1 [34]. Jessen et al. used FTIR spectrometry to analyze blood plasma without any sample preparation to achieve stable and accurate measurements for concentrations of clinically relevant chemical constituents, including TG and CHOL [35]. Similarly, Li et al. used FTIR spectrometry to determine glucose and CHOL in whole blood and a partial least squares (PLS) regression fusion modeling method was used [36].

The choice of machine learning method affects the accuracy of the prediction of blood lipids, and this phenomenon was clearly demonstrated in this study. Linear and logistic regression models perform poorly with high-dimensional data. It can be inferred that our model did not initially extract the features using PCA. However, RF, GPT, and HGPT continue to perform well for TG/CHOL/HDL-C, where a specific algorithm alone is sufficient to extract information from the original input and map the results.

The prediction performance also varied among different blood lipids. For LDL-C, which had a relatively wide concentration range, the R2 value was relatively low. Nevertheless, the MAE value was between 9 and 12, and our model achieved a considerably low prediction error (as low as 7.3%) (Table 2). For instance, with an actual LDL level of 100 mg/dL, our model’s prediction ranged from 93.7 to 107.3 mg/dL. Previous studies suggested a desirable bias of ± 6.8% and a desirable total allowable error (TEa) of ± 13.7% for LDL laboratory data and calculated errors by analyzing the European Federation of Clinical Chemistry and Laboratory Medicine database [37]. Therefore, the prediction error for LDL-C in this study would not affect the medication decisions or clinical judgment. In the case of TG, all the machine learning models exhibited superior prediction accuracy, with R2 values of 0.801–0.851 (Table 2).

Consistent with the existing knowledge, the model’s interpretation revealed that the focus of HGBT lies between 3000 and 2800 cm−1. We also attempted to restrict the input variables to the theoretical spectrum and entire spectral range in wavenumbers; however, the performance did not increase considerably, indicating that the interval chosen for this experiment was adequate. Using the high-dimensional projection technique, t-SNE, FTIR spectral analysis can effectively identify MetS samples. The SHAP values represent the contribution of each feature (in our case, each wavenumber) to the model’s output relative to the average prediction (Fig 4). Essentially, SHAP values explain how a particular feature impacts the model’s prediction, serving as a simplified, interpretable model to elucidate the workings of a more complex machine learning model. However, interpretability challenges can occasionally be introduced because the SHAP uses a linear approach to explain nonlinear models. Literature indicates that the range of 1645–1675 cm−1 primarily corresponds to non-cyclic carbon-carbon double bonds (C = C), which may be related to the structure of long-chain lipids in the blood. Notably, the combined contribution of the other 252 parameters was greater than that of the single wavenumber at 1660 cm−1, indicating the advantage of using SHAP in the FTIR spectrometry analysis.

This study has several limitations. Firstly, the limited sample size hindered our ability to evaluate the effectiveness of FTIR spectrometry in determining the severity of MetS. Secondly, although differences between plasma and serum FTIR spectra have been reported, this aspect was not investigated here. Lastly, the selection and exclusion of absorption bands in FTIR spectra may have introduced some degree of bias. Although the number of data items was sufficient for machine learning calculations, evaluating the model will require additional data. As we employed the ADASYN approach to balance the prevalence of the data, since additional data were created to compensate, the impact on actual applications must be further investigated. Fourth, verification using liquid chromatography-mass spectrometry (LC-MS) standards is lacking. The Taiwan Accreditation Foundation approved the laboratory for the fulfillment of ISO 15189 (Medical Laboratories Requirements for Quality and Competence), which supports the reliability of blood lipid concentrations. Future studies should comprehensively compare the accuracy, sensitivity, and specificity of the results from the AI-assisted FTIR spectrometry method against established methods with lipid standards. And we may explore causation-focused models to investigate molecular associations further and validate these predictions. However, given the study’s current scope, our machine learning approach offers valuable insights into broad lipid level trends without requiring the extensive molecular characterization that lies beyond this study’s objectives.

The developed AI-assisted FTIR spectrometry method is a nondestructive, reagent-free, and robust technology capable of accurately predicting lipid concentrations and identifying MetS in a prepared plasma sample. Macroscopic assessment is an unmet clinical requirement for diseases associated with metabolic disorders. This method provides a complete scout view of all MetS-related biochemical constituents, including all lipid species. Therefore, residual lipid-related risks can be revealed entirely. This comprehensive feature provides the potential for establishing an assessment system to stratify individuals with MetS to guide medical decisions and monitor therapeutic effects. In conclusion, AI-assisted FTIR spectrometry blood sample analysis is worth future large population studies to be applicable in clinical settings.

Supporting information

S1 Appendix. Distribution of blood lipid values in MetS and non-MetS groups.

(PDF)

pone.0316522.s001.pdf (147.7KB, pdf)
S2 Appendix. Grouped spectrum with specified lipid value ranges.

(PDF)

pone.0316522.s002.pdf (1.6MB, pdf)
S3 Appendix. Training deviance of the GBT model on different targets.

(PDF)

pone.0316522.s003.pdf (225.9KB, pdf)
S4 Appendix. The effect of the presence of MetS on the model’s lipid values prediction.

(PDF)

pone.0316522.s004.pdf (756.1KB, pdf)
S5 File. The de-identified supporting data.

(CSV)

pone.0316522.s005.csv (7.3MB, csv)

Acknowledgments

The authors thank Miss Wun-Jyun Jhuang for her skillful handling of the blood sample preparation and on the ATR-FTIR platform. The authors also thank Dr. Yi-Hsiung Lin for his assistance with research coordination at the beginning.

Data Availability

All relevant data are within the paper and its Supporting information files.

Funding Statement

We received funding to support this study and updated the Funding Information on the Submission page as follows: National Health Research Institutes (NHRI-EX107-10724SC, NHRI-EX108-10724SC, NHRI-EX109-10724SC, and NHRI-EX110-10724SC), Kaohsiung Medical University Hospital (KMUH111-1R07 and KMUH110-0R11), Taiwan Ministry of Science and Technology/National Sciences, and Technology Council Grants (MOST 109-2314-B-037-111-MY3) NSTC 112-2314-B037-069. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Saklayen MG. The global epidemic of the metabolic syndrome. Curr Hypertens Rep. 2018;20(2):12. Epub 2018/02/26. doi: 10.1007/s11906-018-0812-z ; PMCID: PMC5866840 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Deng H, Guo P, Zheng M, Huang J, Xue Y, Zhan X, et al. Epidemiological characteristics of atrial fibrillation in Southern China: results from the Guangzhou heart study. Sci Rep. 2018;8(1):17829. Epub 2018/12/13. doi: 10.1038/s41598-018-35928-w ; PMCID: PMC6292893 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ahmed M, Kumari N, Mirgani Z, Saeed A, Ramadan A, Ahmed MH, et al. Metabolic syndrome; definition, pathogenesis, elements, and the effects of medicinal plants on it’s elements. J Diabetes Metab Disord. 2022;21(1):1011–22. Epub 2022/03/15. doi: 10.1007/s40200-021-00965-2 ; PMCID: PMC9167315 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Lee H-C, Lin Y-H. The pathogenic role of very low density lipoprotein on atrial remodeling in the metabolic syndrome. Int J Mol Sci. 2020;21(3):891. Epub 2020/01/30. doi: 10.3390/ijms21030891 ; PMCID: PMC7037013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lee H-C, Shin S-J, Huang J-K, Lin M-Y, Lin Y-H, Ke L-Y, et al. The role of postprandial very-low-density lipoprotein in the development of atrial remodeling in metabolic syndrome. Lipids Health Dis. 2020;19(1):210. Epub 2020/09/22. doi: 10.1186/s12944-020-01386-5 ; PMCID: PMC7507670 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Lee H-C, Akhmedov A, Chen C-H. Spotlight on very-low-density lipoprotein as a driver of cardiometabolic disorders: Implications for disease progression and mechanistic insights. Front Cardiovasc Med. 2022;9:993633. Epub 2022/10/04. doi: 10.3389/fcvm.2022.993633 ; PMCID: PMC9577298 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Robinson H, Molendijk J, Shah AK, Rahman T, Anderson GJ, Hill MM. Rapid assessment of lipidomics sample purity and quantity using fourier-transform infrared spectroscopy. Biomolecules. 2022;12(9):1265. Epub 2022/09/08. doi: 10.3390/biom12091265 ; PMCID: PMC9496531 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Alí A, Boutjdir M, Aromolaran AS. Cardiolipotoxicity, inflammation, and arrhythmias: role for interleukin-6 molecular mechanisms. Front Physiol. 2019;9:1866. Epub 2019/01/07. doi: 10.3389/fphys.2018.01866 ; PMCID: PMC6330352 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Sacks FM, Lichtenstein AH, Wu JHY, Appel LJ, Creager MA, Kris-Etherton PM, et al. Dietary fats and cardiovascular disease: a presidential advisory from the american heart association. Circulation. 2017;136(3):e1–23. Epub 2017/06/15. doi: 10.1161/CIR.0000000000000510 ; [DOI] [PubMed] [Google Scholar]
  • 10.Lee H-C, Lin H-T, Ke L-Y, Wei C, Hsiao Y-L, Chu C-S, et al. VLDL from metabolic syndrome individuals enhanced lipid accumulation in atria with association of susceptibility to atrial fibrillation. Int J Mol Sci. 2016;17(1):134. doi: 10.3390/ijms17010134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lee H-C, Cheng W-C, Ma W-L, Lin Y-H, Shin S-J, Lin Y-H. Association of lipid composition and unsaturated fatty acids of VLDL with atrial remodeling in metabolic syndrome. Sci Rep. 2023;13(1):6575. Epub 2023/04/21. doi: 10.1038/s41598-023-33757-0 ; PMCID: PMC10121655 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Rosqvist F, Fridén M, Vessby J, Rorsman F, Lind L, Risérus U. Circulating fatty acids from high-throughput metabolomics platforms as potential biomarkers of dietary fatty acids. Clin Nutr. 2022;41(12):2637–43. Epub 2022/10/13. doi: 10.1016/j.clnu.2022.10.005 ; [DOI] [PubMed] [Google Scholar]
  • 13.Heino H, Rieppo L, Männistö T, Sillanpää MJ, Mäntynen V, Saarakkala S. Diagnostic performance of attenuated total reflection Fourier-transform infrared spectroscopy for detecting COVID-19 from routine nasopharyngeal swab samples. Sci Rep. 2022;12(1):20358. Epub 2022/11/27. doi: 10.1038/s41598-022-24751-z ; PMCID: PMC9701801 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Martinez-Cuazitl A, Vazquez-Zapien GJ, Sanchez-Brito M, Limon-Pacheco JH, Guerrero-Ruiz M, Garibay-Gonzalez F, et al. ATR-FTIR spectrum analysis of saliva samples from COVID-19 positive patients. Sci Rep. 2021;11(1):19980. Epub 2021/10/07. doi: 10.1038/s41598-021-99529-w ; PMCID: PMC8497525 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Kurultak İ, Sarigul N, Kodal NS, Korkmaz F. Urinalysis of individuals with renal hyperfiltration using ATR-FTIR spectroscopy. Sci Rep. 2022;12(1):20887. Epub 2022/12/03. doi: 10.1038/s41598-022-25535-1 ; PMCID: PMC9719484 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Silva LG, Péres AFS, Freitas DLD, Morais CLM, Martin FL, Crispim JCO, et al. ATR-FTIR spectroscopy in blood plasma combined with multivariate analysis to detect HIV infection in pregnant women. Sci Rep. 2020;10(1):20156. Epub 2020/11/19. doi: 10.1038/s41598-020-77378-3 ; PMCID: PMC7677535 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Jia Y, Quan S, Ren J, Wu H, Liu A, Gao Y, et al. MRI radiomics predicts progression-free survival in prostate cancer. Front Oncol. 2022;12:974257. Epub 2022/08/30. doi: 10.3389/fonc.2022.974257 ; PMCID: PMC9468743 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Wu Y, Zhang Y, Zou X, Yuan Z, Hu W, Lu S, et al. Estimated date of delivery with electronic medical records by a hybrid GBDT-GRU model. Sci Rep. 2022;12(1):4892. Epub 2022/03/22. doi: 10.1038/s41598-022-08664-5 ; PMCID: PMC8941136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Lin H, Xue Y, Chen K, Zhong S, Chen L. Acute coronary syndrome risk prediction based on gradient boosted tree feature selection and recursive feature elimination: a dataset-specific modeling study. PLoS One. 2022;17(11):e0278217. Epub 2022/11/29. doi: 10.1371/journal.pone.0278217 ; PMCID: PMC9707772 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Kacuráková M, Mathlouthi M. FTIR and laser-Raman spectra of oligosaccharides in water: characterization of the glycosidic bond. Carbohydr Res. 1996;284(2):145–57. doi: 10.1016/0008-6215(95)00412-2 [DOI] [PubMed] [Google Scholar]
  • 21.Banyay M, Sarkar M, Gräslund A. A library of IR bands of nucleic acids in solution. Biophys Chem. 2003;104(2):477–88. doi: 10.1016/s0301-4622(03)00035-8 [DOI] [PubMed] [Google Scholar]
  • 22.Barth A. Infrared spectroscopy of proteins. Biochim Biophys Acta. 2007;1767(9):1073–101. Epub 2007/06/28. doi: 10.1016/j.bbabio.2007.06.004 [DOI] [PubMed] [Google Scholar]
  • 23.Casal HL, Mantsch HH. Polymorphic phase behaviour of phospholipid membranes studied by infrared spectroscopy. Biochim Biophys Acta. 1984;779(4):381–401. doi: 10.1016/0304-4157(84)90017-0 [DOI] [PubMed] [Google Scholar]
  • 24.Haibo H, Yang B, Garcia EA, Shutao L. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence); 2008:1–8. [Google Scholar]
  • 25.Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W. LightGBM: a highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems; 2017;30. [Google Scholar]
  • 26.van der Maaten LJP, Hinton GE. Visualizing data using t-SNE. J Mac Lear Res. 2008;9:2579–605. [Google Scholar]
  • 27.Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2(1):56–67. doi: 10.1038/s42256-019-0138-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Xu J, Du X, Zhang S, Xiang Q, Zhu L, Liu L. The accuracy of four formulas for LDL-C calculation at the fasting and postprandial states. Front Cardiovasc Med. 2022;9:944003. Epub 2022/08/18 doi: 10.3389/fcvm.2022.944003 ; PMCID: PMC9433804 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ye Q, Meng X. Highly efficient authentication of edible oils by FTIR spectroscopy coupled with chemometrics. Food Chem. 2022;385:132661. Epub 2022/03/08. doi: 10.1016/j.foodchem.2022.132661 ; [DOI] [PubMed] [Google Scholar]
  • 30.Shapaval V, Brandenburg J, Blomqvist J, Tafintseva V, Passoth V, Sandgren M, et al. Biochemical profiling, prediction of total lipid content and fatty acid profile in oleaginous yeasts by FTIR spectroscopy. Biotechnol Biofuels. 2019;12:140. Epub 2019/06/06. doi: 10.1186/s13068-019-1481-0 ; PMCID: PMC6551905 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Mousa MAA, Wang Y, Antora SA, Al-Qurashi AD, Ibrahim OHM, He H-J, et al. An overview of recent advances and applications of FT-IR spectroscopy for quality, authenticity, and adulteration detection in edible oils. Crit Rev Food Sci Nutr. 2022;62(29):8009–27. Epub 2021/05/12. doi: 10.1080/10408398.2021.1922872 [DOI] [PubMed] [Google Scholar]
  • 32.Bel’skaya LV, Sarf EA, Solomatin DV. Application of FTIR Spectroscopy for quantitative analysis of blood serum: a preliminary study. Diagnostics (Basel). 2021;11(12):2391. doi: 10.3390/diagnostics11122391 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Portaccio M, Faramarzi B, Lepore M. Probing biochemical differences in lipid components of human cells by means of ATR-FTIR spectroscopy. Biophysica. 2023;3(3):524–38. doi: 10.3390/biophysica3030035 [DOI] [Google Scholar]
  • 34.Huber M, Kepesidis KV, Voronina L, Božić M, Trubetskov M, Harbeck N, et al. Stability of person-specific blood-based infrared molecular fingerprints opens up prospects for health monitoring. Nat Commun. 2021;12(1):1511. Epub 2021/03/08. doi: 10.1038/s41467-021-21668-5 ; PMCID: PMC7940620 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Jessen TE, Höskuldsson AT, Bjerrum PJ, Verder H, Sørensen L, Bratholm PS, et al. Simultaneous determination of glucose, triglycerides, urea, cholesterol, albumin and total protein in human plasma by Fourier transform infrared spectroscopy: direct clinical biochemistry without reagents. Clin Biochem. 2014;47(13–14):1306–12. Epub 2014/06/02. doi: 10.1016/j.clinbiochem.2014.05.064 ; [DOI] [PubMed] [Google Scholar]
  • 36.Li Z, Lv H, Li T, Si G, Wang Q, Lv J, et al. Reagent-free simultaneous determination of glucose and cholesterol in whole blood by FTIR-ATR. Spectrochim Acta A Mol Biomol Spectrosc. 2017;178:192–7. Epub 2017/02/03. doi: 10.1016/j.saa.2017.02.002 [DOI] [PubMed] [Google Scholar]
  • 37.Steyn N, Muller Rossouw H, Pillay TS, Martins J. Comparability of calculated LDL-C with directly measured LDL-C in selected paediatric and adult cohorts. Clin Chim Acta. 2022;537:158–66. doi: 10.1016/j.cca.2022.10.003 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Muhammad Iqhrammullah

8 May 2024

PONE-D-24-11405Application of Attenuated Total Reflection–Fourier Transform Infrared spectroscopy in semi-quantification of blood lipids and characterization of the metabolic syndromePLOS ONE

Dear Dr. Lee,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

Please improve the novelty statement and sampling handling explanation.

==============================

Please submit your revised manuscript by Jun 22 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols .

We look forward to receiving your revised manuscript.

Kind regards,

Muhammad Iqhrammullah, Ph.D

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, all author-generated code must be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match.

When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section.

4. In this instance it seems there may be acceptable restrictions in place that prevent the public sharing of your minimal data. However, in line with our goal of ensuring long-term data availability to all interested researchers, PLOS’ Data Policy states that authors cannot be the sole named individuals responsible for ensuring data access (http://journals.plos.org/plosone/s/data-availability#loc-acceptable-data-sharing-methods ).

Data requests to a non-author institutional point of contact, such as a data access or ethics committee, helps guarantee long term stability and availability of data. Providing interested researchers with a durable point of contact ensures data will be accessible even if an author changes email addresses, institutions, or becomes unavailable to answer requests.

Before we proceed with your manuscript, please also provide non-author contact information (phone/email/hyperlink) for a data access committee, ethics committee, or other institutional body to which data requests may be sent. If no institutional body is available to respond to requests for your minimal data, please consider if there any institutional representatives who did not collaborate in the study, and are not listed as authors on the manuscript, who would be able to hold the data and respond to external requests for data access? If so, please provide their contact information (i.e., email address). Please also provide details on how you will ensure persistent or long-term data storage and availability.

Additional Editor Comments:

Please improve the novelty statement and sampling handling explanation.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: 1. After carefully reviewing the introduction section, I did not find a clear and explicit statement highlighting the novelty or newness of the study. The authors do not appear to directly state what makes their work novel compared to previous research in the field.

2. The authors emphasize the advantages of their ATR-FTIR method, such as rapid assay times and the need for small blood samples (0.5 μL). These benefits make the method potentially better for frequent monitoring and clinical applications compared to established techniques. Unfortunately, the authors lack a comprehensive comparison of the accuracy, sensitivity, and specificity of their approach against established methods. A deeper analysis of the performance metrics and potential limitations of their technique, in relation to existing gold standards, would better support their case for the usefulness and innovation of their method.

3. The conclusion should explicitly reiterate the potential impact of early detection and management of MetS on improving patient outcomes and quality of life, as highlighted in the introduction.

4. What do you mean by the brackets in table 2 as standard deviation (SD)?

5. However, there are a few minor points that could be improved in the reporting of the SHAP analysis: The authors could provide more details on how exactly the SHAP values were calculated (e.g., using the TreeExplainer for tree-based models like GBT) and what they represent (e.g., the contribution of each wavelength to the model's output relative to the average prediction). They could also discuss any potential limitations or caveats of using SHAP for model interpretation in this specific context.

6. From a purely statistical perspective, an R2 value of 0.419 indicates that only 41.9% of the variance in LDL-C levels can be explained by the ATR-FTIR spectral data using the HGBT model. This suggests that the model's predictions for LDL-C are not as accurate as those for other lipid parameters like TG (R2 = 0.854) or HDL-C (R2 = 0.758). In many scientific contexts, an R2 value below 0.5 would be considered poor model performance. I'm uncertain about the authors' argument, which claims that despite the low R² value, the mean absolute error (MAE) for LDL-C prediction was between 9 and 12 mg/dL.

Reviewer #2: Interesting work that discusses the application of ATR-FTIR in biological samples. The study is needed and the use of AI algorithms added value. Just some comments are needed to clarify:

-How were the samples stored?

-How did the authors ensure stability of samples?

-Was any heating of the sample applied prior to measurement? To prevent water interference?

-Comment on transferability of the models to other matrices in the discussion

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy .

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/ . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org . Please note that Supporting Information files do not need this step.

PLoS One. 2025 Jan 30;20(1):e0316522. doi: 10.1371/journal.pone.0316522.r002

Author response to Decision Letter 0


28 Jul 2024

28-July-2024

Editor-in-Chief

PLOS ONE

Dear Editors:

We would like to thank all the reviewers and editors for their comments on the manuscript (PONE-D-24-11405) entitled “Application of attenuated total reflection–Fourier transform Infrared spectroscopy in semi-quantification of blood lipids and characterization of metabolic syndrome”.

Following the reviewers’ comments, we revised the manuscript and provided responses to describe the manuscript amendments. Point-to-point responses have been included in this submission.

We apologize that the grant information we provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match. One of our authors forgot to add a Financial Disclosure Statement before the primary submission. We received funding to support this study and updated the Funding Information on the Submission page as follows: National Health Research Institutes (NHRI-EX107-10724SC, NHRI-EX108-10724SC, NHRI-EX109-10724SC, and NHRI-EX110-10724SC), Kaohsiung Medical University Hospital (KMUH111-1R07 and KMUH110-0R11), Taiwan Ministry of Science and Technology/National Sciences, and Technology Council Grants (MOST 109-2314-B-037-111-MY3) NSTC 112-2314-B037-069. Please change the online submission form on our behalf. Thank you.

Regarding raw data sharing, all authors have decided on a data-sharing plan, and our entire data will be made freely accessible if our manuscript is accepted for publication. We will adhere to your open-data policy. To meet your requirement that all data presented in the study be made publicly available at or before the time of acceptance, the file named “encoded_data” is uploaded to the Submission System as a Supporting Information.

We believe that, after incorporating the valuable reviewers’ and editors’ opinions, the manuscript is worthy of consideration for publication in PLOS ONE.

Thank you for your consideration.

Sincerely yours,

Hsiang-Chun Lee, MD, PhD

Professor, College of Medicine, Kaohsiung Medical University

Attending Physician, Division of Cardiology, Department of Internal Medicine, Kaohsiung Medical University Hospital

100 Tzyou 1st Rd, Kaohsiung 807, Taiwan

Tel.: +886-7-3121101#7741 Fax: +886-7-3234845

E-mail: hclee@kmu.edu.tw

Attachment

Submitted filename: Specific Responses to Reviewers.docx

pone.0316522.s006.docx (23.3KB, docx)

Decision Letter 1

Rajesh Kumar Singh

5 Nov 2024

PONE-D-24-11405R1Application of attenuated total reflection–Fourier transform Infrared spectroscopy in semi-quantification of blood lipids and characterization of metabolic syndromePLOS ONE

Dear Dr. Lee,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Dec 20 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols .

We look forward to receiving your revised manuscript.

Kind regards,

Rajesh Kumar Singh, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #3: All comments have been addressed

Reviewer #4: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #3: Yes

Reviewer #4: (No Response)

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #3: Yes

Reviewer #4: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #3: Yes

Reviewer #4: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #3: Yes

Reviewer #4: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #3: The study based on Application of attenuated total reflection–Fourier transform Infrared spectroscopy in

semi-quantification of blood lipids and characterization of metabolic syndrome is highly significant. However I would suggest to authors please expand this study by using molecular methods.

Reviewer #4: • Provide additional context on ATR-FTIR: Add a brief explanation about how ATR-FTIR spectroscopy works, specifically for readers less familiar with the technique. This could improve the accessibility of the study’s technical approach.

• Add a visual comparison: Consider adding a figure that visually compares the ATR-FTIR spectrum with traditional spectra of individual lipid classes or a figure showing predicted vs. actual lipid levels for better visual interpretation.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy .

Reviewer #3: No

Reviewer #4: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/ . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org . Please note that Supporting Information files do not need this step.

PLoS One. 2025 Jan 30;20(1):e0316522. doi: 10.1371/journal.pone.0316522.r004

Author response to Decision Letter 1


10 Dec 2024

PONE-D-24-11405R1

Application of attenuated total reflection–Fourier transform Infrared spectroscopy in semi-quantification of blood lipids and characterization of metabolic syndrome

PLOS ONE

Comments to the Author

Reviewer #3: The study based on Application of attenuated total reflection–Fourier transform Infrared spectroscopy in semi-quantification of blood lipids and characterization of metabolic syndrome is highly significant. However I would suggest to authors please expand this study by using molecular methods.

Response:

Thank you for your comments. We agree that molecular methods are also very important for applying non-destructive ATR-FTIR spectrometry in serum analysis, particularly for lipids. The following studies demonstrate the use of ATR-FTIR spectrometry to analyze blood samples. Bel’skaya et al. published their work on using model solutions in applying FTIR spectroscopy for the quantitative analysis of blood serum.[32] Portaccio et al. published their study results on ATR-FTIR spectra of commercial lipid samples and demonstrated specific fingerprint regions in ATR-FTIR spectra for phospholipids (phosphatidylcholine, phosphatidylethanolamine, phosphatidylserine, and phosphatidylinositol) from 3500 to 2800 cm−1 and 1800 to 600 cm−1), sphingolipids (ceramide, sphingosine 1-phosphate, ceramide 1-phosphate, and sphingomyelin) from 3500 to 2800 cm−1 and 1800 to 600 cm−1), and cholesterol (3150–2800 cm−1) [33]. Their study proved the usefulness of FTIR spectra for analyzing complex lipids, and this application can ultimately contribute to lipid research and its implications for human health. (Line 319-328)

Based on the known advantage of FTIR spectra in quantitative analysis and sensitivity to differentiate various lipids, our study approach was directed toward linkage to clinical application. Consistent with the aforementioned literature, our results revealed the reliability of ATR-FTIR for semi-quantitative lipid analysis of human serum samples.

Unlike those publications, our study emphasizes the predictive capability of machine learning models. The core concept is to uncover the correlations between FTIR spectra, clinically derived lipid values, and diagnostic classifications.

Since this approach centers on data-driven correlations rather than direct molecular causation, it inherently lacks molecular specificity, which remains a limitation of ML-based predictions.

In future work, we may explore causation-focused models to investigate molecular associations further and validate these predictions. However, given the study’s current scope, our ML approach offers valuable insights into broad lipid level trends without requiring the extensive molecular characterization that lies beyond this study’s objectives. (Line 380-384)

Two new citations have been added.

32. Bel’skaya LV, Sarf EA, Solomatin DV. Application of FTIR Spectroscopy for Quantitative Analysis of Blood Serum: A Preliminary Study. Diagnostics. 2021;11(12):2391. PubMed PMID: doi:10.3390/diagnostics11122391.

33. Portaccio M, Faramarzi B, Lepore M. Probing Biochemical Differences in Lipid Components of Human Cells by Means of ATR-FTIR Spectroscopy. Biophysica. 2023;3(3):524-38. PubMed PMID: doi:10.3390/biophysica3030035.

 Reviewer #4: • Provide additional context on ATR-FTIR: Add a brief explanation about how ATR-FTIR spectroscopy works, specifically for readers less familiar with the technique. This could improve the accessibility of the study’s technical approach.

Response:

Thank you for your comments. We have added a paragraph to the Introduction and Methods section to briefly explain how ATR-FTIR spectroscopy works. The context is as follows.

ATR-FTIR spectroscopy is a technique for analyzing the chemical composition of materials by detecting their characteristic absorption of mid-infrared light. In ATR-FTIR, molecules absorb mid-infrared light at specific wavenumbers (or frequencies), depending on their chemical bonds and structure. Each type of bond (e.g., C=O and C–H) has a unique absorption pattern, creating a molecular fingerprint. (Line 77-81)

This study applied a small sample volume (0.5 µL to a Ge ATR crystal, as shown in Figure 1(A). A modulated mid-infrared beam was directed into the Ge crystal, generating an evanescent wave at the crystal surface, allowing the light to interact with the sample. The sample absorbs light at characteristic wavenumbers, and the remaining light is detected using a liquid nitrogen-cooled mercury cadmium telluride (MCT) infrared detector. The resulting spectrum shows peaks corresponding to specific molecular bonds, providing a unique chemical profile of the sample. (Line 122-127)

Figure 1(A). The schematic depicts the attenuated total reflection for acquiring the FTIR spectra of a blood sample. The mid-infrared light is modulated using the Michaelson interferometer of an FTIR spectrometer. (Line 142-144)

• Add a visual comparison: Consider adding a figure that visually compares the ATR-FTIR spectrum with traditional spectra of individual lipid classes or a figure showing predicted vs. actual lipid levels for better visual interpretation.

Response:

Thank you for your comments. To illustrate the ability of spectral values to quantify traditionally categorized lipid classes (e.g., cholesterol and triglycerides), we categorized lipid values into five equally divided groups from maximum to minimum to examine potential spectral differences among varying lipid levels (see Figures below). The X-axis represents the input ATR-FTIR wavenumbers, the Y-axis represents the input ATR-FTIR spectral values, and the color group corresponds to the measurement group obtained from conventional biochemical assays. The average ATR-FTIR spectrum for each group is shown as a line, with the shaded regions representing plus one standard deviation. This visualization highlights the trends in spectral differences across lipid groups. However, although these trends suggest potential relationships, they do not allow for direct quantitative conclusions or inferences. Overlapping or minimal differences in FTIR spectral absorption across lipid ranges further suggest that nonlinear associations may exist that are not directly observable in raw spectral data. By applying machine learning models, we can efficiently capture and interpret these complex patterns, offering precise and meaningful lipid-level predictions beyond what is visually discernible. All figures are listed in Appendix 2. Grouped Spectrum with Specified Lipid Value Ranges (Line 211-223)

In response to your suggestion, we have also included Figure 2 to illustrate the comparison of predicted versus actual lipid levels. In this figure, the x-axis represents the lipid values predicted by the model based on the input ATR-FTIR spectral data, whereas the y-axis shows the actual measured lipid values. Additionally, we included a visualization of the distribution of error values to further elucidate the prediction accuracy and deviations of the model.

Attachment

Submitted filename: Response to Reviewers.docx

pone.0316522.s007.docx (2.6MB, docx)

Decision Letter 2

Rajesh Kumar Singh

13 Dec 2024

Application of attenuated total reflection–Fourier transform Infrared spectroscopy in semi-quantification of blood lipids and characterization of metabolic syndrome

PONE-D-24-11405R2

Dear Dr. Lee,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager®  and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Rajesh Kumar Singh, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Rajesh Kumar Singh

PONE-D-24-11405R2

PLOS ONE

Dear Dr. Lee,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Rajesh Kumar Singh

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Appendix. Distribution of blood lipid values in MetS and non-MetS groups.

    (PDF)

    pone.0316522.s001.pdf (147.7KB, pdf)
    S2 Appendix. Grouped spectrum with specified lipid value ranges.

    (PDF)

    pone.0316522.s002.pdf (1.6MB, pdf)
    S3 Appendix. Training deviance of the GBT model on different targets.

    (PDF)

    pone.0316522.s003.pdf (225.9KB, pdf)
    S4 Appendix. The effect of the presence of MetS on the model’s lipid values prediction.

    (PDF)

    pone.0316522.s004.pdf (756.1KB, pdf)
    S5 File. The de-identified supporting data.

    (CSV)

    pone.0316522.s005.csv (7.3MB, csv)
    Attachment

    Submitted filename: Specific Responses to Reviewers.docx

    pone.0316522.s006.docx (23.3KB, docx)
    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0316522.s007.docx (2.6MB, docx)

    Data Availability Statement

    All relevant data are within the paper and its Supporting information files.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES