Skip to main content
Sensors (Basel, Switzerland) logoLink to Sensors (Basel, Switzerland)
. 2026 Feb 18;26(4):1304. doi: 10.3390/s26041304

Detection and Quantification of Corn Starch and Wheat Flour as Adulterants in Milk Powder by Raman Spectroscopy Coupled with Chemometric Routines

Edwin R Caballero-Agosto 1, Louang D Cruz-Dorta 1, Samuel P Hernandez-Rivera 2, Leonardo C Pacheco-Londoño 2,3, Ricardo Infante-Castillo 1,*
Editors: Francisco Ferrero, Ana Soldado
PMCID: PMC12944320  PMID: 41755243

Abstract

Highlights

What are the main findings?

  • Portable Raman spectroscopy has been shown to quantify adulterants in milk powder using chemometric routines.

  • Results show errors of 0.76 and 0.77 %w/w for quantifying corn starch and wheat flour as adulterants, respectively, using partial least squares models.

What are the implications of the main findings?

  • Study shows that portable Raman spectroscopy is an effective tool for quantifying adulterants in milk powder.

  • The portable Raman modality can be used as a complementary in situ rapid screening technique by non-academic users to detect adulterants in milk and other foods.

Abstract

Adulteration of milk powder (MP) is performed, especially in underdeveloped countries, by adding corn starch (CS) or wheat flour (WF) without mentioning it. Multiple techniques have been established to reduce these deceptive methods. Most of these techniques require samples to be sent to the laboratory for results through a time-consuming, expert-requiring, and destructive procedure. Raman spectroscopy (RS) has seen application due to the availability of portable modalities and its non-destructive, water-insensitive nature. Using principal component analysis (PCA), the differences and similarities between MP and the adulterants (CS and WF) have been evaluated. To quantify the percentages of CS and WF binary mixtures independently with MP, partial least squares regression (PLSR) has been employed. A total of 70 MP samples independently adulterated with CS and WF were prepared. Thirteen chemometric modes were developed by combining the first and second derivatives with Standard Normal Variate (SNV) and Multiplicative Scatter Correction (MSC) to quantify adulteration. The results obtained for CS and WF mixtures show errors of 0.76 and 0.77 %w/w, respectively, with the optimized math pretreatment. These results demonstrate that the portable RS modality can be used as an effective technique for detecting adulterants in milk powder.

Keywords: Raman spectroscopy (RS), milk powder, adulteration, partial least squares (PLS), principal component analysis (PCA)

1. Introduction

Food adulteration remains a persistent global challenge. It compromises food safety, economic fairness, and consumer confidence. Economically motivated adulteration is particularly problematic in powdered food commodities. In these commodities, visual inspection provides limited information about authenticity. As a result, rapid, reliable, and non-destructive analytical methods applicable throughout the food supply chain have become a research priority [1,2]. Milk powder (MP) is a critical case study. It has high commercial value, extensive international trade, and is frequently used in infant nutrition and processed foods.

Over the past five years, vibrational spectroscopy—including near-infrared (NIR), mid-infrared (MIR/FTIR), and Raman spectroscopy—has become a key tool for authenticating food and detecting adulteration [3,4,5,6]. These techniques provide molecular fingerprints tied to fundamental vibrational modes and enable rapid analysis with minimal sample preparation. When combined with chemometric and machine-learning methods, vibrational spectroscopy enables both qualitative classification and quantitative measurement of adulterants in complex food samples [7,8,9]. As a result, these methods are widely used for oils, spices, honey, meat, dairy, and powdered foods [10,11,12,13,14].

Milk powder adulteration often involves adding inexpensive bulking agents or nitrogen-rich compounds to inflate apparent mass or protein content. Infrared spectroscopy has played a key role in addressing this issue; recent studies show that FTIR and NIR spectroscopy, combined with multivariate analysis, can accurately detect and quantify adulteration—including starches, flours, melamine, and reconstituted milk solids [15,16,17,18]. However, debate persists regarding the robustness and transferability of targeted chemometric models when adulterant type, processing conditions, or natural compositional variability change. As a result, interest has grown in non-targeted and anomaly-detection approaches, which flag suspicious samples without prior knowledge of the adulterant [19,20].

Despite their analytical strengths, many infrared-based studies rely on benchtop laboratory instrumentation, creating centralized workflows that slow decision-making. This approach requires sample transport, specialized infrastructure, and trained staff—limitations that are especially critical in time-sensitive settings such as border inspection, warehouse screening, and on-site quality control. As a result, recent research emphasizes portable and handheld spectroscopic platforms that deliver actionable results directly at the point of need [21,22,23].

Portable Raman spectroscopy is particularly promising for analyzing powdered foods. It is non-destructive, relatively insensitive to water, and enables compact instrument designs. Recent studies demonstrate the feasibility of portable Raman systems for detecting food adulteration, including applications to milk powder and other dairy matrices [24,25,26]. Furthermore, surface-enhanced Raman scattering (SERS) has been explored to improve sensitivity to trace adulterants. However, challenges such as substrate reproducibility and field robustness remain active areas of discussion [27,28].

In this evolving research landscape, our group previously reported the detection and quantification of corn starch (CS) and wheat flour (WF) adulteration in milk powder using near- and mid-infrared spectroscopy with chemometric routines [29]. That work showed that infrared fingerprints, combined with optimized preprocessing and partial least squares regression (PLSR), could achieve low prediction errors for both adulterants. Building on this foundation, the present study extends the same analytical objective. We apply a portable Raman spectroscopy platform, providing a methodological continuation from benchtop infrared analysis to field-deployable vibrational sensing.

The primary aim of this work is to evaluate portable Raman spectroscopy, combined with chemometric routines, for detecting and quantifying CS and WF adulteration in milk powder. This study uses principal component analysis for exploratory assessment and partial least squares regression for quantification. Results show prediction errors below 1 %w/w for both adulterants. These findings highlight the promise of portable vibrational methodologies. Such methods can complement—and in some contexts replace—laboratory-based techniques, enabling faster decision-making and broader surveillance against food adulteration throughout the supply chain.

2. Materials and Methods

2.1. Samples

This study used Denia® whole milk powder (distributed by V. Suarez & Co., Bayamon, PR, USA), containing 26% milk fat from pasteurized cow’s milk. The whole milk powder (MP) contains palmitate, vitamin D3, calcium, riboflavin, and phosphorus. The corn starch (CS) used is Maizena® (distributed by Unilever, Englewood Cliffs, NJ, USA; product of Mexico), which contains sulfur dioxide as a preservative. The wheat flour (WF) used is Enriched Wheat Flour (distributed by Goya Foods, Inc., Jersey City, NJ, USA), containing niacin, iron, thiamine, riboflavin, and folic acid. These specific brands were used due to their affordability and market accessibility, as well as their physical properties similar to those of the adulteration target. Figure 1 shows a schematic of the samples used in this study. From this scheme, one pure sample of MP, two pure samples of adulterants (CS and WF), and 70 different binary mixtures were prepared in this study.

Figure 1.

Figure 1

Sample preparation scheme (n = 73). Three samples of pure compounds (1 sample of milk powder, 1 sample of corn starch, and 1 sample of wheat flour), 35 samples of milk powder adulterated with corn starch, and 35 samples of milk powder adulterated with wheat flour.

In Table 1, adulterated samples were prepared by adding varying amounts of adulterants (CS and WF) to milk powder. The procedure was performed as follows: each sample of milk powder was manually mixed with the adulterant powder in a test tube, then placed on a high-speed stirrer with a glass rod to reduce particle-size variation. Samples were prepared by varying the proportions of milk powder and adulterant, and the range of percentage of adulterant, in this case, is between 2.5 and 45 %w/w adulteration. A total of 35 binary samples were mixed using CS and MP, and another 35 binary samples were mixed for CS and WF in the same manner.

Table 1.

Percentage distribution of binary (milk powder and one adulterant) mixtures.

Range Component Weight-Weight Percentage (%w/w)
Low Milk Powder 97.5 96.3 95.0 93.7 92.5 91.2 89.9 88.7 87.5 86.2 85.0 83.7
Adulterant 2.5 3.7 5.0 6.3 7.5 8.8 10.1 11.3 12.5 13.8 15.0 16.3
Medium Milk Powder 82.5 81.3 80.0 78.7 77.5 76.2 75.0 73.7 72.5 71.3 70.0 -
Adulterant 17.5 17.7 20.0 21.3 22.5 23.8 25.0 26.3 27.5 28.7 30.0 -
High Milk Powder 68.7 67.5 66.3 65.0 63.8 62.5 61.3 60.0 58.8 57.5 56.2 55.0
Adulterant 31.3 32.5 33.7 35.0 36.2 37.5 38.7 40.0 41.2 42.5 43.8 45.0

2.2. Raman Spectroscopy Analysis

The milk powder mixtures were analyzed without pretreatment at room temperature (25 °C). The surface of the mixture was scanned using a handheld Raman spectrometer, ProgenyTM (Rigaku Analytical Devices, Wilmington, MA, USA). The spectrometer uses a 1064-nm laser and thermoelectrically cooled InGaAs 512-pixel detector with a spectral range of 145–2500 cm−1 at 8 cm−1 spectral resolution. The selection of the 1064 nm excitation wavelength was guided by the need to minimize fluorescence interference and ensure stable, reproducible Raman spectra for complex organic and powdered food matrices. Shorter excitation wavelengths, such as 532 nm and 785 nm, provide higher Raman scattering efficiency but are more susceptible to fluorescence background, which can obscure Raman features and adversely affect quantitative analysis in samples containing fluorescent groups [30,31]. In contrast, near-infrared excitation at 1064 nm operates at lower photon energy, significantly reducing fluorescence contributions and improving baseline stability, although with reduced scattering intensity [32]. This trade-off is particularly advantageous for organic materials, where fluorescence suppression is critical for reliable multivariate analysis and robust chemometric model development. Consequently, the use of a 1064 nm laser in this study improved spectral quality and reproducibility, supporting accurate, transferable quantitative modeling for routine screening applications. The laser power was set at 490 mW, and the exposure time was 30 s, with three averages to maximize the signal-to-noise ratio without damaging the sample. Six spectra were collected for each sample from six different points on the surface of the mixture. The optical setup used was backscattering, with the handheld spectrometer placed perpendicular to the table and the surface of the mixture to acquire the sample.

2.3. Spectral Data Processing

The collected Raman spectra were transformed into a data matrix with predictor (X) and response (Y) variables: Raman spectra (X) comprising Raman shifts as variables and samples as objects, and adulterant weight percentages (Y). Chemometric models were developed using The Unscrambler XTM Ver. 10.5.1 software (The Unscrambler X, CAMO Analytics AS, Oslo, Norway). Principal component analysis (PCA) was used to identify the presence of adulterants in MP and detect outliers, while partial least squares regression (PLSR) was used to quantify the weight percentage of adulterants in MP.

All the developed models were optimized using a random cross-validation method included in the software package, with 20 segments, the NIPALS algorithm, and seven latent variables (principal components for PCA and factors for PLSR). The optimal number of latent variables was kept to 1 to ensure the model is as parsimonious as possible while minimizing under- and overfitting. To determine the mathematical treatment that effectively reduces spectral artifacts, twelve different combinations of pre-treatments were used, comprising Standard Normal Variate (SNV) and Multiplicative Scatter Correction (MSC) for correcting scattering and first (SG1) and second (SG2) order derivatives using Savitzky–Golay’s algorithm with a fixed polynomial order of 2 and 15 smoothing points. These combinations yield the following 12 mathematical treatments: MSC, SNV, SG1, SG2, MSC + SG1, MSC + SG2, SNV + SG1, SNV + SG2, SG1 + MSC, SG2 + MSC, SG1 + SNV, and SG2 + SNV. Including unprocessed spectral data (RAW), a total of 13 PLSR models were developed per adulterant, yielding 26 PLSR models overall.

The best mathematical pre-treatments were selected based on the statistical criteria for each adulterant. This selection was based on the lowest Root Mean Square Error of Prediction (RMSEP) and Relative Standard Error of Prediction (RSEP), as well as the highest Ratio of Prediction (RPD) and Range Error Ratio (RER). The 35 adulterant mixtures were divided into two sets: one for developing the PLSR model (CAL) and one for external testing of the developed model (TEST). The mixtures were divided using the 30/70 TEST/CAL split, ensuring that the TEST samples are between CAL samples and that the distribution is as symmetric as possible to ensure interpolation-based prediction while minimizing bias and extrapolation. This split can be viewed as a systematic, interleaved, space-filling external validation sampling procedure, with the CAL set at 70% (n = 26 samples) and the TEST set at 30% (n = 9 samples). External validation was performed using a systematic, interleaved, space-filling sampling strategy, chosen to ensure an unbiased and representative assessment of model performance. This design ensured that the TEST set spanned the full adulteration range (avoiding bias from concentration clustering) while remaining independent of calibration. For each binary system (MP–CS and MP–WF), the complete concentration series was first ordered from lowest to highest adulterant level. Samples were then assigned in an interleaved manner: approximately every third concentration was selected for the TEST set, while the remaining samples were used for calibration. This resulted in a 30/70 TEST/CAL split (9 TEST and 26 CAL samples). This procedure ensured that TEST samples were evenly distributed across the concentration range and fell within calibration levels, thereby enabling model evaluation via interpolation rather than extrapolation. The specific concentration values assigned to the CAL and TEST sets for both CS and WF are summarized in Figure S1 and Table S1 (Supplementary Materials), which visually represents the space-filling validation design.

3. Results and Discussion

All samples were scanned with the previously detailed acquisition settings and optical setup for the portable Raman spectrometer. Figure 2 shows the raw average spectra of pure MP, CS, and WF.

Figure 2.

Figure 2

2D line plot of milk powder: (a) raw Raman spectra overlapped with corn starch (CS) and wheat flour (WF), (b) pre-treated spectra overlapped with CS, and (c) pre-treated spectra overlapped with WF. The mathematical pretreatments are the same as those used for the optimal PLSR models (SG2 + SNV for CS and MSC + SG1 for WF).

3.1. Spectral Analysis

Figure 2 shows the FT-Raman spectra of milk powder (blue), corn starch (red), and wheat flour (green). A clear qualitative distinction among the samples is evident, primarily attributable to variations in the contributions of lipids, proteins, and carbohydrates. The spectral differences across the samples provide an initial indication of compositional variability, which is fundamental for subsequent chemometric modeling.

The main vibrational bands of milk powder, along with their assignments from previous reports [33,34], are listed in Table 2. The most distinctive feature appears at 1747 cm−1, corresponding to the ester carbonyl stretching vibration (ν) of triglycerides in milk fat. This band is intense in pure milk powder and markedly reduced in samples with starch or flour additives, confirming its diagnostic value for lipid content.

Table 2.

Main Raman shifts in cm−1, and their respective tentative assignments for milk powder.

Raman Shift (cm−1) Band Strength Assignment
355 weak Lactose
442 weak δ CCC+τ CO
872 weak δ CCH+δ COC
1002 weak Ring-breathing (phenylalanine)
1079 medium ν CO+ν CC+δ COH
1123 weak ν CO+ν CC+δ COH
1262 weak γ CH2
1303 medium τ CH2
1440 very strong δ CH2
1656 medium ν C=OAmide;ν C=C
1747 weak ν C=OEster

νstretcing vibrations, δscissoring (symmetric deformation) vibrations, τtwisting, γout-of-plane deformation.

Another important region, at 1656 cm−1, comes from amide carbonyl stretching (ν) vibrations tied to milk proteins. This band is clearly visible in milk powder, but its intensity decreases in the adulterated sample, indicating dilution of the protein upon addition of starch or flour. Protein-related information also comes from the 1002 cm−1 band, assigned to phenylalanine ring-breathing vibrations, a stable aromatic marker in protein-rich matrices.

The region between 1380 and 1450 cm−1, particularly the band at 1440 cm−1, is associated with C–H deformation vibrations (δ) of methyl and methylene groups from fatty acid chains. This band shows higher intensity in the milk powder spectrum and progressively decreases in wheat flour and corn starch, further confirming its association with lipid content and its usefulness as a qualitative indicator of milk fat dilution.

Bands at 1262 cm−1 and 1303 cm−1 are associated with mixed organic matrix vibrations, predominantly deformation modes (δ) involving C–H and C–O groups. These bands reflect contributions from proteins, lipids, and carbohydrates and vary with sample composition, reinforcing their role as secondary indicators of adulteration.

The carbohydrate-dominated spectral region (920–1160 cm−1) shows pronounced overlap among milk powder, corn starch, and wheat flour. In milk powder, this region is linked to lactose, the main sugar in milk, with strong C–O (carbon-oxygen) and C–C (carbon-carbon) stretching vibrations (ν), referring to the movement of atoms during bond stretching. The 1079 cm−1 and 1123 cm−1 bands, associated with glycosidic (sugar bond) and glucosyl (glucose unit) stretching modes, are also strong in milk powder. Because starch and flour have polysaccharide (complex carbohydrate) structures, they exhibit similar features, leading to noticeable spectral overlap in mixtures. As a result, this region is highly sensitive to adulteration, and reliable discrimination still requires multivariate analysis, a statistical method that uses multiple variables to distinguish between samples.

The 872 cm−1 band is attributed to deformation-type vibrations (δ) tied to carbohydrate skeletal modes. The 355 cm−1 band is characteristic of lactose and confirms the presence of milk-derived components. These lower-intensity, lower-wavenumber bands help define the fingerprint region and together enhance qualitative interpretation.

Qualitative FT-Raman spectral analysis shows that lipid (fat) and protein-specific (amino acid-based) spectral bands at 1747, 1656, 1440, and 1002 cm−1 are highly specific for identifying milk powder. In contrast, the carbohydrate-rich region (920–1160 cm−1), reflecting sugars and starches, is most affected by corn starch and wheat flour. As shown in Figure 2, these compositional differences are clearly visible in the Raman profiles, thereby supporting chemometric (statistical) classification and adulteration detection.

In addition to qualitative band assignments, the Raman spectra exhibited systematic, concentration-dependent intensity changes as CS and WF levels increased. Specifically, bands within carbohydrate-related regions (323–380 cm−1, 454–540 cm−1, and 1420–1500 cm−1) showed consistent intensity trends with adulterant concentration. This indicates their relevance for quantitative analysis. Individual peak intensity ratios can be sensitive to local heterogeneity, particle size, and baseline variability in powdered samples. Instead of relying on these ratios, we used multivariate analysis to capture correlated variations in intensity across the full spectral range. This confirms that the dominant spectral variation captured by PCA is strongly associated with adulterant concentration.

In Raman analysis of carbohydrate-rich mixtures, pronounced overlap between the 920–1160 cm−1 C–O and C–C vibrational modes of lactose and polysaccharides complicates spectral interpretation. To clarify the rationale for each preprocessing method, SNV and MSC are applied to normalize intensities and correct for scatter, specifically minimizing baseline differences to allow accurate comparison [35,36]. Savitzky–Golay (SG) derivatives enhance resolution by smoothing noise and highlighting inflection points and peak curvature, targeting the separation of overlapping carbohydrate features [37]. Ruffin et al. generated synthetic spectra composed of multiple overlapping Gaussian bands and showed that Savitzky–Golay derivatives accurately detect each band [38]. These tailored methods—SG2 + SNV for corn starch (CS) and MSC + SG1 for wheat flour (WF)—were selected to enhance discrimination in the 920–1160 cm−1 region, enabling more apparent PCA-based sample separation and more robust, stable PLS calibration models.

3.2. Principal Component Analysis (PCA)

After plotting the spectra, PCA was performed using the pure MP spectra and the corresponding adulterant mixtures. For the development of the PCA and PLSR models, two independent binary systems are being studied (MP-CS and MP-WF). The resulting PCA and PLS models cannot simultaneously discriminate or quantify both adulterants. After determining the optimal math pre-treatment for the PLSR models, PCA models were also developed utilizing the optimal pre-treatments. Figure 3 summarizes the best-obtained results.

Figure 3.

Figure 3

Principal component analysis of (yellow star) milk powder and their corresponding mixtures: (a) corn starch (CS) mixtures and (b) wheat flour (WF) mixtures. Mixtures were divided into three regions: (blue squares) 2.5–16.3, (red circles) 17.5–30.0, and (green triangles) 31.3–45.0 %w/w. A confidence ellipse with a 0.05 significance level was determined for each mixture cluster.

Figure 3 shows that it is possible to identify MP separated from the rest of the samples for both CS (a) and WF (b). The adulterant samples can be discriminated into three predetermined concentration regions, with well-defined, separated clusters. The CS mixtures (Figure 3a) appear to increase in concentration from bottom to top and left to right, with the middle range centered at the origin of the scores plot. Similarly, WF mixtures show a middle range centered on the origin, with samples increasing in concentration from bottom to top and right to left. For both scores, the pure MP sample aligns with the direction of the cluster with the least adulterant. For CS, a slight separation is observed between the pure MP mixture and the sample with the lowest adulterant level in the first component.

In contrast, a significantly larger gap is observed in the second component. For WF, the same ample space is observed; however, in the component and the second components, the MP sample is placed within the range with the highest adulterant concentration. The behavior shown in both score plots indicates that the first component separates the adulterant mixtures and the pure MP sample.

To verify mixture homogeneity, given the localized sampling nature of Raman spectroscopy, the six replicate spectra collected from six different positions within each prepared sample were used to quantify within-sample variability. For representative mixtures across the tested concentration range, peak intensities (or integrated band areas) were extracted from three prominent Raman regions (323–380 cm−1, 454–540 cm−1, and 1420–1500 cm−1), and the relative standard deviation (RSD, %) across the six spatial replicates was calculated for each region. An overall homogeneity indicator was then obtained by averaging the RSD values across the selected bands and concentrations for each binary system. Using this approach, the mixtures exhibited overall RSD values of no more than 12%, indicating that the manual mixing and high-speed stirring procedures produced sufficiently homogeneous samples for subsequent chemometric modeling.

3.3. Partial Least Squares Regression (PLSR) Results

This study evaluated two binary adulteration systems: milk powder–corn starch (MP–CS) and milk powder–wheat flour (MP–WF). The aim was not to compare corn starch and wheat flour but to test whether portable Raman spectroscopy and chemometric modeling could detect and quantify each adulterant separately in milk powder. For this reason, the models consider only binary adulteration, not cases with multiple adulterants or simultaneous identification of different adulterants. Distinguishing between CS and WF was outside the scope of this study.

After evaluating the classification procedure using raw spectra, the next step was to develop calibration models using all spectra with PLSR and cross-validation using random groups. As indicated in Section 2, different mathematical pre-treatments will be used to generate a total of thirteen PLSR models per adulterant. Table 3 summarizes the results obtained for each treatment and adulterant.

Table 3.

Test statistics of developed partial least squares regression (PLSR) models to quantify adulteration of milk powder with corn starch and wheat flour (n = 70).

Adulterant PLS
Parameters
RAW MSC SNV SG1 SG2 MSC + SG1 MSC + SG2 SNV + SG1 SNV + SG2 SG1 + MSC SG2 + MSC SG1 + SNV SG2 + SNV
Corn starch RMSEP [%w/w] 6.56 0.96 0.98 2.01 1.96 0.79 0.76 0.8 0.76 0.83 0.76 0.81 0.76
RSEP [%] 26.0 3.8 3.9 7.9 7.7 3.1 3.0 3.2 3.0 3.3 3.0 3.2 3.0
RPD 1.8 9.6 9.5 5.1 5.2 11.2 11.9 11.2 12.0 10.6 11.8 11.1 12.1
RER 5.8 26.1 25.7 14.7 15.2 31.4 33.0 31.7 33.1 31.3 33.1 32.5 33.6
Wheat flour RMSEP [%w/w] 3.54 0.83 0.83 1.10 0.92 0.77 0.80 0.84 0.86 1.13 1.18 1.12 1.16
RSEP [%] 14.0 3.3 3.3 4.4 3.6 3.0 3.2 3.3 3.4 4.5 4.6 4.4 4.6
RPD 2.5 11.0 11.1 7.7 9.5 11.6 11.3 10.7 10.6 7.9 7.7 8.0 7.8
RER 6.5 33.6 33.7 22.9 28.5 35.6 34.5 35.0 33.6 28.9 27.4 29.5 27.6

RAW—unprocessed spectral data, SNV—standard normal variate, SG1—Savitzky–Golay second order derivative first order polynomial with 15 smoothing points, SG2—Savitzky–Golay second order derivative second order polynomial with 15 smoothing points, MSC—multiplicative scatter correction, RMSEP—root mean square error of prediction, RSEP—relative standard error of prediction, RPD—ratio of prediction, RER—range error ratio.

Although using a single latent variable (LV = 1) in PLSR models may seem unusual, this choice is justified in the present study. The application involves highly collinear Raman spectral data, binary adulteration systems, and a small sample size. Additional latent variables were tested during model optimization (Tables S3 and S4 in the Supplementary Materials). Higher LVs yielded only marginal calibration improvements (no more than 30% with 5 LVs) and did not consistently improve external validation. With an interleaved external validation scheme and moderate sample size (n = 70), retaining only the first latent variable minimized overfitting and maintained predictive stability. Explained variance analysis showed that the first LV accounted for over 70% of both X- and Y-variance, confirming that it captured the main chemical information relevant to adulterant concentration without underfitting. This streamlined strategy prioritizes robustness in real-world spectral acquisitions by buffering against the effects of added spectral artifacts (baseline, scattering, noise, etc.).

When analyzing the optimized PCA models (Figure 3), the first principal component (PC1) explains 88% and 95% of the total X-variance for the CS and WF systems, respectively. PC2 explains an additional 9% for CS and 4% for WF, resulting in cumulative variances of 97% and 99%. The PCA score plots show separation of the concentration levels along both PC1 and PC2. It is important to note that PCA is an unsupervised method that maximizes the variance of the spectral data (X) without considering the response variable (Y). As a result, variance captured by PC2 may reflect orthogonal spectral variability—such as scattering effects, baseline variation, or sample heterogeneity. These factors aid visual clustering but do not necessarily contribute to predicting adulterant concentration. Figure 3b appears to show PC2 wider than PC1; however, PC2 spans from −2250 to 2250, while PC1 spans from −4200 to 2140. Tables S3 and S4 (Supplementary Materials) present the optimal LVs per PLSR model for CS-MP and WF-MP, along with a discussion of why model simplicity was favored over predictive power for both PLSR models. In contrast, PLSR LVs are extracted to maximize the covariance between X and Y, ensuring that retained components are directly relevant to concentration prediction. Consistent with this distinction, evaluating the X-variance and Y-variance explained versus LVs showed that the first LV alone captured more than 70% of the relevant information for both CS and WF. Additional LVs did not consistently improve external validation performance (Tables S3 and S4, Supplementary Materials). Given the limited dataset size and the interleaved external validation design, retaining a single latent variable may help reduce the expected decrease in robustness when using a portable Raman spectrometer outside a controlled environment by non-academic personnel. This approach minimizes overfitting while preserving predictive stability.

As can be seen in Table 3, for CS, all RMSEP values are below 6.56 %w/w with the median being 0.81 %w/w, RSEP are below 26.0% with the median being 3.2%, RPD are above 1.8 with the median being 11.1, and RER are above 5.8 with the median being 31.4. For WF, all RMSEP values are below 3.54 %w/w, with the median being 0.92 %w/w; RSEP values are below 14.0%, with the median being 3.6%; RPD values are above 2.5, with the median being 9.5; and RER values are above 6.5, with the median being 29.5. The lowest performance can be seen for the RAW PLSR models, while the best mathematical pre-treatment for adulterants was SG2 + SNV for CS and MSC + SG1 for WF. Table 4 summarizes the optimal PLSR model determined for both adulterants.

Table 4.

Test statistics of the optimally developed partial least squares regression (PLSR) models to quantify adulteration of milk powder with corn starch and wheat flour (n = 70).

Math Pre-Treatment Parameter Model Validation Stage RMSEP [%w/w] RSEP [%] RPD RER Factor
SG2 + SNV Corn starch Calibration 0.96 3.5 14.6 39.2 1
Cross-validation 1.04 3.8 13.4 42.8 1
Test 0.76 3.0 12.0 33.4 1
MSC + SG1 Wheat flour Calibration 0.90 3.3 15.6 47.7 1
Cross-validation 0.98 3.6 14.2 43.7 1
Test 0.77 3.0 11.6 35.6 1

SNV—standard normal variate, SG1—Savitzky–Golay second order derivative first order polynomial with 15 smoothing points, SG2—Savitzky–Golay second order derivative second order polynomial with 15 smoothing points, MSC—multiplicative scatter correction, RMSEP—root mean square error of prediction, RSEP—relative standard error of prediction, RPD—ratio of prediction, RER—range error ratio.

As shown in Table 4, the optimal PLSR test models had an average of 0.77 %w/w RMSEP, 3.0% RSEP, 11.8 RPD, and 34.5 RER for adulterants, with CS having the best-performing model by a slight margin. The calibration and cross-validation results were also shown in Table 4, with the test models showing surprisingly better performance in RMSEP and RSEP; however, the optimistic results of calibration and cross-validation appear in the RPD and RER values. As mentioned in Section 2.3, the number of factors used was 1 for all the models. After analyzing the PCA scores plot with the optimized mathematical pre-treatments in Figure 3, further evidence is provided for using a single latent variable as a reasonable selection. Figure 4 shows the predicted vs. reference plot for the optimal PLSR models for both adulterants.

Figure 4.

Figure 4

Predicted vs. reference plot showing the performance of the optimal PLSR model for (a) corn starch and (b) wheat flour. The models with samples and red line (model linear fit) close to the black dashed lines (ideal model) illustrate higher accuracy, while the shorter whiskers indicate higher precision.

The primary objective of this study was not trace-level detection. Instead, it aimed for reliable discrimination and quantification of economically meaningful adulteration levels in milk powder. In practical food fraud scenarios, adulterants such as CS and WF are introduced at concentrations well above trace levels. Very low percentages do not provide enough economic incentive, given the risk and processing effort [39]. The selected concentration range reflects realistic adulteration practices seen in routine quality control and screening. It does not represent the detection limits of the spectroscopic technique. Therefore, performance metrics such as prediction error (RMSEP) and model stability under external validation were considered more suitable.

Portable Raman spectroscopy offers key advantages for point-of-need analysis, including rapid measurements, minimal sample preparation, and in situ use. These features support this study’s aims and make portable systems suitable for food authentication and field screening. However, portable Raman instruments face operational limitations compared with laboratory benchtop systems: lower laser power, reduced optical throughput, and decreased detector sensitivity (due to eye-safety regulations, battery limitations, and compact design) can degrade signal-to-noise ratio and spectral resolution in complex samples [40]. Also, fluorescence interference is often higher in portable systems because of limited excitation wavelength options and simpler optics [41]. By contrast, benchtop Raman systems offer higher laser power, better detector cooling, and flexible data acquisition, resulting in improved fluorescence suppression and spectral resolution. Thus, portable Raman is a complementary tool for rapid screening when combined with robust chemometric analysis, as shown in this study.

Previous near- and mid-infrared (NIR/MIR) spectroscopy studies, combined with chemometric modeling, have demonstrated effective detection and quantification of CS and WF adulteration in MP under laboratory conditions. For example, optimized FT-NIR and FT-MIR models typically achieve prediction errors below or near 1 %w/w when benchtop instruments and controlled sampling geometries are used [29]. However, these infrared approaches generally require contact-based or ATR configurations and stable environmental conditions, which may limit their use for rapid, on-site analysis. In contrast, the present work shows that portable Raman spectroscopy can achieve similar prediction accuracy for individual binary adulteration systems. Furthermore, it offers practical advantages, such as non-contact measurements, reduced sensitivity to water absorption, and simplified sample handling. Taken together, these results position portable Raman spectroscopy as a complementary, field-deployable alternative to established NIR/MIR methods, rather than a replacement.

Recent literature shows that machine-learning and deep-learning methods can improve Raman spectroscopy, especially for complex or large-scale datasets, by automating preprocessing and delivering higher accuracy in certain classification and regression tasks [42]. Despite these benefits, reviews note key limitations, such as high computational cost, large data requirements, limited interpretability, and greater complexity. For studies with moderate data that require transparent and easily deployable models, classical chemometric methods such as PCA and PLS continue to offer direct advantages, including robustness, interpretability, compatibility with standard analysis software, accessibility, and reproducibility. In this work, PCA and PLS were selected to ensure these advantages, in line with recent conclusions that deep learning is justified only when a problem’s complexity and data clearly outweigh the strengths of existing methods [40].

4. Conclusions

In this work, we determined whether a portable Raman spectrometer coupled with chemometric routines (PCA and PLSR) could be used for fraud detection in milk powder. Detecting fraud in solid samples, such as milk powder, requires the collection of robust spectra that provide relevant information. All classification and calibration models showed satisfactory results. The optimal PLSR model for quantifying CS was developed using SG2 + SNV math pre-treatment combination, while WF was developed using MSC + SG1. For all adulterants, the RMSEP is 0.77 %w/w on average, with an RSEP of 3.0%. These results confirm that portable Raman spectrometers can be used to detect food fraud in milk powder samples and open the possibility of detecting other solid mixtures in the food industry, as well as malicious adulteration of solid samples in other relevant fields.

Acknowledgments

The authors are grateful to the students who provided their time to acquire spectral data, as well as to the laboratory technicians, Lourdes Dorta and Marilyn Rivera, for their constant and fruitful contributions. Generative artificial intelligence tools (ChatGPT, version 5.2; OpenAI) were used solely as an auxiliary aid during the manuscript preparation stage to support language editing and exploratory literature searching. In this context, the tool was used to suggest general topics, keywords, and potentially relevant research areas, which the authors subsequently evaluated. All scientific concepts, data interpretation, methodological decisions, and conclusions were developed and validated by the authors. Importantly, all cited references were independently identified, manually reviewed, and verified by the authors using primary literature sources; no citations generated by the AI tool were included in the final manuscript without direct author verification. The generative AI tool did not generate, analyze, or modify experimental data, nor did it influence the reported results or conclusions.

Abbreviations

The following abbreviations are used in this manuscript:

%w/w Weight-weight percentage
CAL Calibration set
CS Corn starch
CV Cross-validation
FTIR Fourier Transform infrared
GenAI Generative artificial intelligence
MIR Mid-infrared
MP Milk powder
MSC Multiplicative scatter correction
N Total samples
NIPALS Non-iterative partial least squares
NIR Near-infrared
PCA Principal component analysis
PLSR Partial-least squares regression
RAW Unprocessed spectral data
RER Range error ratio
RMSEP Root mean square error of prediction
RPD Ratio of prediction
RS Raman spectroscopy
RSEP Relative standard error of prediction
SERS Surface-enhanced Raman spectroscopy
SG1 Savitzky–Golay first derivative
SG2 Savitzky–Golay second derivative
SNV Standard normal variate
TEST External testing set
WF Wheat flour
X Predictor variables (Raman spectra)
Y Response variable (adulterant concentration)

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/s26041304/s1, Figure S1: Space-filling validation design utilized for the development of the PLSR models with calibration (blue squares) and test (orange squares) samples ordered from lowest to highest adulterant level; Table S1: Percentage distribution of calibration and test samples for the development of the PLSR models; Table S2: Thirteen matrices with all Raman spectra used for developing the PLSR models with different data preprocessing (DP) methods; Table S3: Optimal latent variable (LV) per partial least squares regression (PLSR) chemometrics model for the independent corn starch and milk powder (CS-MP) binary mixtures; Table S4: Optimal latent variable (LV) per partial least squares regression (PLSR) chemometrics model for the independent wheat flour and milk powder (WF-MP) binary mixtures. Table S5. Performance results for the PLSR model with optimal parameters (MSC) and the model with the optimal parameters using only one latent variable (MSC + SG1) for the independent wheat flour and milk powder (WF-MP) binary mixtures.

sensors-26-01304-s001.zip (180.4KB, zip)

Author Contributions

Conceptualization, R.I.-C. and S.P.H.-R.; methodology, R.I.-C., L.C.P.-L. and E.R.C.-A.; software, R.I.-C. and S.P.H.-R.; validation, R.I.-C., L.C.P.-L. and E.R.C.-A.; formal analysis, L.C.P.-L. and E.R.C.-A.; investigation, R.I.-C., L.C.P.-L. and E.R.C.-A.; resources, S.P.H.-R. and R.I.-C.; data curation, E.R.C.-A.; writing—original draft preparation, E.R.C.-A., R.I.-C. and L.D.C.-D.; writing—review and editing, R.I.-C. and E.R.C.-A.; visualization, E.R.C.-A.; supervision, S.P.H.-R. and R.I.-C.; project administration, S.P.H.-R. and R.I.-C.; funding acquisition, R.I.-C. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are available upon request from the corresponding authors.

Conflicts of Interest

The authors declare no conflicts of interest.

Funding Statement

This research was funded by the Center for Research and Creation (CIC) of the University of Puerto Rico at Arecibo (Seed funds).

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Moore J.C., Spink J., Lipp M. Development and application of a database of food ingredient fraud and economically motivated adulteration from 1980 to 2010. J. Food Sci. 2012;77:R118–R126. doi: 10.1111/j.1750-3841.2012.02657.x. [DOI] [PubMed] [Google Scholar]
  • 2.Spink J., Moyer D.C. Defining the public health threat of food fraud. J. Food Sci. 2011;76:R157–R163. doi: 10.1111/j.1750-3841.2011.02417.x. [DOI] [PubMed] [Google Scholar]
  • 3.Lohumi S., Lee S., Lee H., Cho B.-K. A review of vibrational spectroscopic techniques for the detection of food authenticity and adulteration. Trends Food Sci. Technol. 2015;46:85–98. doi: 10.1016/j.tifs.2015.08.003. [DOI] [Google Scholar]
  • 4.Ellis D.I., Brewster V.L., Dunn W.B., Allwood J.W., Golovanov A.P., Goodacre R. Fingerprinting food: Current technologies for the detection of food adulteration and contamination. Chem. Soc. Rev. 2012;41:5706–5727. doi: 10.1039/c2cs35138b. [DOI] [PubMed] [Google Scholar]
  • 5.Nascimiento C.F., Santos P.M., Pereira-Filho E.R., Rocha F.R. Recent advances on determination of milk adulterants. Food Chem. 2017;221:1232–1244. doi: 10.1016/j.foodchem.2016.11.034. [DOI] [PubMed] [Google Scholar]
  • 6.Ceniti C., Spina A.A., Piras C., Oppedisano F., Tilocca B., Roncada P., Britti D., Morittu V.M. Recent advances in the determination of milk adulterants and contaminants by mid-infrared spectroscopy. Foods. 2023;12:2917. doi: 10.3390/foods12152917. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Berghian-Grosan C., Magdas D.A. Raman spectroscopy and machine-learning for edible oils evaluation. Talanta. 2020;218:121176. doi: 10.1016/j.talanta.2020.121176. [DOI] [PubMed] [Google Scholar]
  • 8.Kazlagić A., Omanović-Mikličanin E. CMBEBIH 2019. Volme 73. Springer; Cham, Switzerland: 2020. Application of Raman Spectroscopy in Food Forensic: A Review; pp. 257–263. IFMBE Proceedings. [DOI] [Google Scholar]
  • 9.Vafakhah M., Asadollahi-Baboli M., Hassaninejad-Darzi S.K. Raman spectroscopy and chemometrics for rice quality control and fraud detection. J. Consum. Prot. Food Saf. 2023;18:403–413. doi: 10.1007/s00003-023-01435-y. [DOI] [Google Scholar]
  • 10.Melendreras C., Soldado A., Costa-Fernández J.M., López A., Valledor M., Campo J.C., Ferrero F. An Affordable NIR Spectroscopic System for Fraud Detection in Olive Oil. Sensors. 2023;23:1728. doi: 10.3390/s23031728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Galvin-King P., Haughey S.A., Elliott C.T. The Detection of Substitution Adulteration of Paprika with Spent Paprika by the Application of Molecular Spectroscopy Tools. Foods. 2020;9:944. doi: 10.3390/foods9070944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Karunathilaka S.R., Yakes B.J., He K., Brückner L., Mossoba M.M. First use of handheld Raman spectroscopic devices and on-board chemometric analysis for the detection of milk powder adulteration. Food Control. 2018;92:137–146. doi: 10.1016/j.foodcont.2018.04.046. [DOI] [Google Scholar]
  • 13.Limm W., Karunathilaka S.R., Yakes B.J., Mossoba M.M. A portable mid-infrared spectrometer and a non-targeted chemometric approach for the rapid screening of economically motivated adulteration of milk powder. Int. Dairy J. 2018;85:177–183. doi: 10.1016/j.idairyj.2018.06.005. [DOI] [Google Scholar]
  • 14.Yegon D., Ojijo N.K., Tybussek T., Owino W. Application of portable near-infrared spectroscopy for rapid detection and quantification of adulterants in baobab fruit pulp. Int. J. Food Sci. Technol. 2023;58:1465–1473. doi: 10.1111/ijfs.16313. [DOI] [Google Scholar]
  • 15.Balabin R.M., Smirnov S.V. Melamine detection by mid- and near-infrared (MIR/NIR) spectroscopy: A quick and sensitive method for dairy products analysis including liquid milk, infant formula, and milk powder. Talanta. 2011;85:562–568. doi: 10.1016/j.talanta.2011.04.026. [DOI] [PubMed] [Google Scholar]
  • 16.Du L. New insights into raw milk adulterated with milk powder identification: ATR-FTIR spectroscopic fingerprints combined with machine learning and feature selection approaches. J. Food Compos. Anal. 2024;133:106443. doi: 10.1016/j.jfca.2024.106443. [DOI] [Google Scholar]
  • 17.Chu C., Wang H., Luo X., Fan Y., Nan L., Du C., Gao D., Wen P., Wang D., Yang Z., et al. Rapid detection and quantification of melamine, urea, sucrose, water, and milk powder adulteration in pasteurized milk using Fourier transform infrared (FTIR) spectroscopy coupled with modern statistical machine learning algorithms. Heliyon. 2024;10:e32720. doi: 10.1016/j.heliyon.2024.e32720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Kohler A., Afseth N.K., Jørgensen K., Randby Å., Martens H. Handbook of Vibrational Spectroscopy. John Wiley & Sons; Hoboken, NJ, USA: 2010. Quality Analysis of Milk by Vibrational Spectroscopy. [DOI] [Google Scholar]
  • 19.Bergana M.M., Adams K.M., Harnly J., Moore J.C., Xie Z. Non-targeted detection of milk powder adulteration by 1H NMR spectroscopy and Conformity Index Analysis. J. Food Compos. Anal. 2019;78:49–58. doi: 10.1016/j.jfca.2019.01.016. [DOI] [Google Scholar]
  • 20.Karunathilaka S.R., Farris S., Mossoba M.M., Moore J.C., Yakes B.J. Non-targeted detection of milk powder adulteration using Raman spectroscopy and chemometrics: Melamine case study. Food Addit. Contam. Part A. 2017;34:170–182. doi: 10.1080/19440049.2016.1260168. [DOI] [PubMed] [Google Scholar]
  • 21.Ellis D.I., Muhamadali H., Haughey S.A., Elliott C.T., Goodacre R. Point-and-shoot: Rapid quantitative detection methods for on-site food fraud analysis—Moving out of the laboratory and into the food supply chain. Anal. Methods. 2015;7:9401–9414. doi: 10.1039/C5AY02048D. [DOI] [Google Scholar]
  • 22.Chen J., Liu W., Cao X., Zhang Q., Zou X. Rapid identification of milk powder adulteration based on surface-enhanced Raman spectroscopy. AIP Adv. 2024;14:105126. doi: 10.1063/5.0228449. [DOI] [Google Scholar]
  • 23.Li L., Chin W.S. Rapid and sensitive SERS detection of melamine in milk using Ag nanocube array substrate coupled with multivariate analysis. Food Chem. 2021;357:129717. doi: 10.1016/j.foodchem.2021.129717. [DOI] [PubMed] [Google Scholar]
  • 24.Huang Y., Lu D., Zhang S., Jiang S., Zhou Y., Xu J. Portable Raman spectroscopy combined with machine learning for highly sensitive and rapid detection of food pollutants with flexible Ag@TiO2@polyester SERS substrate. Sustain. Food Technol. 2025;428 doi: 10.1039/D5FB00802F. [DOI] [Google Scholar]
  • 25.Santos P.M., Pereira-Filho E.R., Rodriguez-Saona L.E. Rapid detection and quantification of milk adulteration using infrared microspectroscopy and chemometrics analysis. Food Chem. 2013;138:19–24. doi: 10.1016/j.foodchem.2012.10.024. [DOI] [PubMed] [Google Scholar]
  • 26.Saji R., Ramani A., Gandhi K., Seth R., Sharma R. Application of FTIR spectroscopy in dairy products: A systematic review. Food Humanit. 2024;2:100239. doi: 10.1016/j.foohum.2024.100239. [DOI] [Google Scholar]
  • 27.Xiao G., Li L., Yan A., He X. Direct detection of melamine in infant formula milk powder solution based on SERS effect of silver film over nanospheres. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2019;223:117269. doi: 10.1016/j.saa.2019.117269. [DOI] [PubMed] [Google Scholar]
  • 28.Pilot R. SERS detection of food contaminants by means of portable Raman instruments. J. Raman Spectrosc. 2018;49:954–981. doi: 10.1002/jrs.5400. [DOI] [Google Scholar]
  • 29.Caballero-Agosto E.R., Sierra-Vega N.O., Rolon-Ocasio Y., Hernandez-Rivera S.P., Infante-Degró R.A., Fontalvo-Gomez M., Pacheco-Londoño L.C., Infante-Castillo R. Detection and quantification of corn starch and wheat flour as adulterants in milk powder by near- and mid-infrared spectroscopy coupled with chemometric routines. Food Chem. Adv. 2024;4:100582. doi: 10.1016/j.focha.2023.100582. [DOI] [Google Scholar]
  • 30.Gouvinhas I., Machado N., Carvalho T., De Almeida J.M.M.M., Barros A.I.R.N.A. Short Wavelength Raman Spectroscopy Applied to the Discrimination and Characterization of Three Cultivars of Extra Virgin Olive Oils in Different Maturation Stages. Talanta. 2015;132:829–835. doi: 10.1016/j.talanta.2014.10.042. [DOI] [PubMed] [Google Scholar]
  • 31.Ellis D.I., Muhamadali H., Xu Y., Eccles R., Goodall I., Goodacre R. Rapid Through-Container Detection of Fake Spirits and Methanol Quantification with Handheld Raman Spectroscopy. Analyst. 2019;144:324–330. doi: 10.1039/C8AN01702F. [DOI] [PubMed] [Google Scholar]
  • 32.Meyer M.W., Lupoi J.S., Smith E.A. 1064nm Dispersive Multichannel Raman Spectroscopy for the Analysis of Plant Lignin. Anal. Chim. Acta. 2011;706:164–170. doi: 10.1016/j.aca.2011.08.031. [DOI] [PubMed] [Google Scholar]
  • 33.McGoverin C.M., Clark A.S.S., Holroyd S.E., Gordon K.C. Raman spectroscopic quantification of milk powder constituents. Anal. Chim. Acta. 2010;673:26–32. doi: 10.1016/j.aca.2010.05.014. [DOI] [PubMed] [Google Scholar]
  • 34.Rodrigues-Júnior P.H., de Sá Oliveira K., de Almeida C.E.R., De Oliveira L.F.C., Stephani R., da Silva Pinto M., de Carvalho A.F., Perrone Í.T. FT-Raman and chemometric tools for rapid determination of quality parameters in milk powder: Classification of samples for the presence of lactose and fraud detection by addition of maltodextrin. Food Chem. 2016;196:584–588. doi: 10.1016/j.foodchem.2015.09.055. [DOI] [PubMed] [Google Scholar]
  • 35.Barnes R.J., Dhanoa M.S., Lister S.J. Standard Normal Variate Transformation and De-Trending of Near-Infrared Diffuse Reflectance Spectra. Appl. Spectrosc. 1989;43:772–777. doi: 10.1366/0003702894202201. [DOI] [Google Scholar]
  • 36.Geladi P., Macdougall D., Martens H. Linearization and Scatter-Correction for Near-Infrared Reflectance Spectra of Meat. Appl. Spectrosc. 1985;39:491–500. doi: 10.1366/0003702854248656. [DOI] [Google Scholar]
  • 37.Savitzky A., Golay M.J.E. Smoothing and Differentiation of Data by Simplified Least Squares Procedures. Anal. Chem. 1964;36:1627–1639. doi: 10.1021/ac60214a047. [DOI] [Google Scholar]
  • 38.Ruffin C., King R.L., Younan N.H. A Combined Derivative Spectroscopy and Savitzky-Golay Filtering Method for the Analysis of Hyperspectral Data. GISci. Remote Sens. 2008;45:1–15. doi: 10.2747/1548-1603.45.1.1. [DOI] [Google Scholar]
  • 39.Kuranathilaka S.R., Yakes B.J., He K., Chung J.K., Mossoba M. Non-Targeted NIR Spectroscopy and SIMCA Classification for Commercial Milk Powder Authentication: A Study Using Eleven Potential Adulterants. Heliyon. 2008;4:e00806. doi: 10.1016/j.heliyon.2018.e00806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Tripathy S., Chavva S., Coté G.L., Mabbott S. Modular and Handheld Raman Systems for SERS-Based Point-of-Care Diagnostics. Curr. Opin. Biomed. Eng. 2023;28:100488. doi: 10.1016/j.cobme.2023.100488. [DOI] [Google Scholar]
  • 41.Vulchi R.T., Morgunov V., Junjuri R., Bocklitz T. Artifacts and Anomalies in Raman Spectroscopy: A Review on Origins and Correction Procedures. Molecules. 2024;29:4748. doi: 10.3390/molecules29194748. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Boateng D. Advances in Deep Learning-Based Applications for Raman Spectroscopy Analysis: A Mini-Review of Progress and Challenges. Microchem. J. 2025;209:112692. doi: 10.1016/j.microc.2025.112692. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

sensors-26-01304-s001.zip (180.4KB, zip)

Data Availability Statement

Data are available upon request from the corresponding authors.


Articles from Sensors (Basel, Switzerland) are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES