Summary
Spectroscopic techniques are indispensable for material characterization, yet their weak signals remain highly prone to interference from environmental noise, instrumental artifacts, sample impurities, scattering effects, and radiation-based distortions (e.g., fluorescence and cosmic rays). These perturbations not only significantly degrade measurement accuracy but also impair machine learning–based spectral analysis by introducing artifacts and biasing feature extraction. This review provides a systematic evaluation of critical spectral preprocessing methods—encompassing cosmic ray removal, baseline correction, scattering correction, normalization, filtering and smoothing, spectral derivatives, and advanced techniques like 3D correlation analysis—highlighting their theoretical underpinnings, performance trade-offs, and optimal application scenarios. The field is undergoing a transformative shift driven by three key innovations: context-aware adaptive processing, physics-constrained data fusion, and intelligent spectral enhancement. These cutting-edge approaches enable unprecedented detection sensitivity achieving sub-ppm levels while maintaining >99% classification accuracy, with transformative applications spanning pharmaceutical quality control, environmental monitoring, and remote sensing diagnostics.
Subject areas: Physics, Computer science, Engineering
Graphical abstract

Physics; Computer science; Engineering
Introduction
Spectral analysis is a fundamental tool for material characterization,1,2,3,4,5,6,7,8,9,10 yet its effectiveness is consistently challenged by both intrinsic signal limitations (e.g., low photon yields or peak shifts from molecular interactions) and extrinsic perturbations (e.g., environmental fluctuations inducing baseline drifts or tilts), all of which undermine quantification accuracy. For instance, in gamma-ray spectroscopy, cosmic ray artifacts obscure faint astrophysical signals11; in mid-infrared flame spectroscopy, thermal radiation introduces nonlinear baselines that impede gas concentration retrieval12; while in ocean color remote sensing, instrument noise exacerbates errors in atmospheric correction for parameters like total suspended solids (TSS).13 Beyond domain-specific noise, cross-instrument variability and methodological inconsistencies further amplify these distortions, impairing both traditional analyses (e.g., Beer-Lambert–based quantification) and machine learning models (principal component analysis (PCA),14 partial least squares (PLS),15 support vector machines (SVMs),16 k-nearest neighbors (KNN),17 and convolutional neural networks (CNNs)18). Despite the critical role of preprocessing, existing review articles and methodological discussions often focus narrowly on domain-specific applications or specialized spectral processing techniques, lacking a unified mathematical framework or broad generalizability.19,20,21,22,23,24,25,26
At the quantum level, spectroscopic signals arise from electron/phonon transitions (emission or absorption27,28; Figure 1A), manifesting as emission spectra (e.g., laser-induced breakdown spectroscopy (LIBS) or Raman; Figure 1B) or absorption spectra (e.g., UV-Vis or IR; Figure 1C). While absorption spectra obey the Beer-Lambert law, their practical measurement via dispersion techniques (prisms, gratings, Fourier transform interferometry, or tunable filters29; Figure 1D) decomposes raw signals into three components: target peaks (physicochemical information), background interference (e.g., scattering or thermal effects), and stochastic noise (e.g., detector readout errors; Figure 1E). Whether confronting cosmic ray spikes in astrophysics or fluorescence-dominated Raman spectra, these artifacts invariably mask intrinsic spectral features (Figure 1G), necessitating systematic preprocessing to recover latent material signatures.
Figure 1.
Schematic diagram of spectral technology principle
The figure summarizes fundamental energy transitions (A), spectral classifications (B-C), instrumentation (D), spectral data structure and preprocessing (E and F), machine learning methods (G), and applications (H).
(A) Energy level diagrams of spontaneous emission, stimulated emission, and stimulated absorption.
(B) Emission spectra classifications: LIBS (laser-induced breakdown spectroscopy), XRF (X-ray fluorescence spectroscopy), MFS (molecular fluorescence spectroscopy), and RS (Raman spectroscopy).
(C) Absorption spectra classifications: XANES (X-ray absorption near-edge structure spectroscopy), UV (ultraviolet), Vis (visible), IR (infrared), AS (absorption spectroscopy), and TAS (terahertz absorption spectroscopy).
(D) Spectrometer types: FPI (Fabry–Perot interferometer) and AOTF (acousto-optic tunable filter).
(E) Spectral data structure with interference factors: Noise, baseline drift, cosmic rays, and fluorescent background.
(F) Preprocessing methods: Normalization, BC (baseline correction), S&F (smoothing and filtering), CRR (cosmic ray removal), SD (spectral derivative), and 3days-CM (three-dimensional correlation method).
(G) Machine learning methods: PCA (principal component analysis), PLS (partial least squares), SVM (support vector machine), KNN (k-nearest neighbor), RF (random forest), and CNN (convolutional neural networks).
(H) Applications: QA (quantitative analysis) and C&P (classification and prediction).
To address these challenges, we propose a hierarchy-aware preprocessing framework (Figure 1F) comprising: (1) Localized artifact removal (cosmic ray/spike filtering), (2) baseline correction for low-frequency drift suppression, (3) scattering correction, (4) intensity normalization to mitigate systematic errors, (5) noise filtering and smoothing, (6) feature enhancement via spectral derivatives, and (7) information mining by three-dimensional correlation method. This pipeline (Figures 1H; Table 1) synergistically bridges raw spectral fidelity and downstream analytical robustness, ensuring reliable quantification and machine learning compatibility.
Table 1.
Comparison of spectral data preprocessing techniques: mechanisms, validation requirements, and applications
| Category | Method | Core Mechanism | Validation Need | Advantages | Disadvantages | Primary Role & Application Context |
|---|---|---|---|---|---|---|
| Cosmic Ray Removal | Moving Average Filter (MAF)30 | Detects cosmic rays via MAD-scaled Z score and first-order differences; corrects with outlier rejection and windowed averaging (Equations 1 and 2). | Assumes isolated CRAs and single-scan spectral data. Threshold needs to be calibrated to 3–5. | Fast real-time processing with better spectral preservation than uniform averaging. | Blurs adjacent features due to uniform-weight averaging; sensitive to window size () tuning. | Real-time single-scan correction for Raman/IR spectra without replicate measurements. |
| Missing-Point Polynomial Filter (MPF)31 | Explicitly excludes central outlier (zero-weighted convolution ), fits quadratic polynomial via least squares (Equation 3). | Assumes sparse CRAs and uniform spectral sampling (e.g., Raman/IR). | Preserves fidelity by excluding corrupted points; faster than Savitzky-Golay. | Ineffective for clustered CRAs; requires uniform spectral sampling (e.g., Raman/IR). | Optimized for real-time cosmic ray removal in Raman/IR spectra (single-scan, uniform sampling), prioritizing local feature fidelity. | |
| Multistage Spike Recognition (MSR)32 | Forward differences () + dynamic threshold (). Shape validation (sharp rise/fall, width ≤30 pixels) (Equation 4). | Requires ≥40 sequential spectra. adaptive threshold performance verification. | Automated: Dynamic thresholds auto-adjust. Accurate: shape constraints ensure precision. Robust: drift resilience maintains signal integrity. | May miss broad anomalies (>30 px) due to rigid width constraints. | Reliable cosmic ray removal for time-resolved Raman spectra (40+ scans) with drift/variable spikes. | |
| Polynomial Filtering (PF)33 | Global third-order (or higher) polynomial fit per wavenumber; outliers rejected at > noise deviation. | Requires manual validation or median filtering for heterogeneous data (avoids distortion). | Spectral-preserving, Robust automation, Scalable for high-throughput (homogeneous data). | Struggles with local anomalies in heterogeneous data. requires manual validation. | Optimized for high-throughput Raman applications where manual inspection is impractical. | |
| Nearest Neighbor Comparison (NNC)34,35 | Normalized covariance similarity () + Savitzky-Golay noise estimation (); dual thresholds ( primary, secondary) (Equations 5 and 6). | Requires validation of single-scan robustness, dual-threshold efficacy, preprocessing dependence, and computational speed. | Single-scan avoids read noise; auto-dual thresholds optimize sensitivity/specificity. | Assumes spectral similarity; SG smoothing affects low-SNR regions. | Real-time hyperspectral imaging or time-sensitive spectroscopic analysis under low SNR conditions. | |
| Wavelet Transform (DWT+K-means)36,37,38 | DWT decomposition (depth ) + K-means clustering; Allan deviation threshold () (Equations 7 and 8). | Requires testing on datasets with CR/Raman FWHM overlap. | Multi-scale wavelet analysis preserves spectral details; automated for large datasets (single-scan). | Limited efficacy when CRA width overlaps Raman peaks (low-resolution data). | Single-scan CRA correction for Raman spectra (high-frequency artifacts). | |
| Kernel PCA Residual Diagnosis (KPCARD)39,40 | Gaussian KPCA () + dual-threshold residual diagnosis (, ); reconstruction via PCA components (Equations 9, 10, 11, and 12). | Requires clear contrast between artifacts and true spectra | KPCA preserves subtle features, robust to noise, retains true spectral profiles. | Computationally intensive; requires manual optimization of and PCA order. | High-precision artifact removal for nonlinearly distorted spectra (e.g., high-resolution Raman). | |
| Baseline Correction | Wavelength Artificial Shift Subtraction (WASS)41 | Artificial wavelength shift () isolates signal via least-squares solution. Shift magnitude must avoid noise amplification ( too large) or insufficient separation ( too small). (Equations 14, 15, 16, and 17). | Requires precise calibration (avoid under/over-shifting). | No empirical fitting. Robust for obscured signals (e.g., underwater LIBS). | Suboptimal degrades performance; sensitive to shift magnitude. | Underwater LIBS: Targets faint emission lines and Resists obscuration & noise. |
| Piecewise Polynomial Fitting (PPF)42,43,44,45,46,47 | Segmented polynomial fitting (orders adaptively optimized per segment) with S-ModPoly variant for iterative refinement (Equation 18). | Poor segmentation in PPF leads to over/underfitting → adaptive methods (e.g., S-ModPoly) needed for robust baselines. | Adaptive & Fast: No physical assumptions, handles complex baselines. Rapid (<20 ms, Raman) with smooth continuity (S-ModPoly). | Sensitive to segment boundaries and polynomial degree (over/underfitting). Requires adaptive methods (e.g., S-ModPoly) for robustness. | High-accuracy soil analysis: 97.4% land-use classification via chromatography. | |
| B-Spline Fitting (BSF)48,49,50,51 | Local polynomial control via knots () and recursive basis (). Least-squares optimizes control points (Equations 19, 20, 21, and 22). | Tune knots & degree ()—accuracy drops if suboptimal. | Local control avoids overfitting, boosting sensitivity 3.7× for gases (NH3/O3/CO2). | Scales poorly with large datasets unless optimized; knot degree tuning critical. | Robust trace gas analysis—resolves overlapping peaks & irregular baselines. | |
| Two-Side Exponential (ATEB)52 | Bidirectional exponential smoothing (forward/backward passes) with adaptive weights (Equation 23). | Tunable smoothing: Optimize (smoothing) and (convergence) for balance between noise reduction and signal fidelity. | Fast & automatic: Linear time, scalable for large data. Self-adjusting—no manual peak tuning. | Less effective for sharp fluctuations (B-splines preferred). | Best for: High-throughput data with smooth/moderate baselines. | |
| Morphological Operations (MOM)53,54,55,56,57,58,59 | Erosion/dilation with structural element (width ). Averaged opening/closing. Mollifier convolution (Equations 24, 25, 26, and 27). | Structural element width () must match peak/trough dimensions; RCR threshold controls adaptivity. | Key Benefits: Maintains spectral peaks/troughs (geometric integrity) and Optimized for Pharma PCA workflows (classification-ready). | Trade-offs: ① SE too wide: Blurs sharp peaks; ② SE too narrow: Underfits baseline; and ③ Accuracy depends on window alignment. | Pharmaceutical tablet analysis, enhancing peak-valley distinction while suppressing noise. | |
| Singular Value Decomposition (SVD)59,60,61 | Rank- SVD decomposition () isolates baseline (noise suppressed); residual enhances features (Sharp peaks/valleys) (Equations 28, 29, and 30). | Optimal Rank- Selection: via Variance threshold or Fuzzy GSVD. | Preserves signal features (e.g., boosts from 0.55 to 0.88). Fast for large datasets (GC-TOF-MS, NIR). | Performance sensitive to selection (misestimation impacts correction). | Automated baseline correction (NIR, DPPH/ABTS) with high accuracy. | |
| Wavelet Transforms (DWT)62,63,64,65,66,67,68 | DWT decomposes signals into low-frequency baselines () and high-frequency details (), using filter banks () for exact reconstruction (Mallat) (Equations 31 and 32). | Requires wavelet choice (e.g., db4) and thresholding for artifact suppression. | No shape assumptions—localized analysis with strong noise resistance (e.g., 21× lower RMSE vs. polynomial fitting). | Potential minor artifacts (e.g., negative lobes) from wavelet oscillations. | Terahertz & complex spectra—precise baseline/peak separation. | |
| Least Squares Methods (LSM)69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85 | Weighted Penalized Least Squares (wPLS): Combines weighted residuals (fidelity ) and curvature smoothing (roughness penalty via 2nd-order differences) (Equations 33, 34, 35, and 36). | Requires tuning (L-curve/cross-validation). | Explicit baseline modeling, robust to drift-dominated noise (Raman/IR). | Limited flexibility for mixed-frequency artifacts. | Raman/IR spectroscopy with smooth baselines. | |
| Asymmetric Least Squares (AsLS): Applies asymmetric weights () to penalize peaks while fitting smooth baselines via second-order difference regularization () in least squares optimization. (Equation 37). | Tune via L-curve; validate against peak suppression in noisy regions. | Effective for gentle baselines (e.g., weak Raman drift). Preserves peaks via asymmetric penalties. | Oversmooths weak peaks near baseline. Struggles with mixed-frequency artifacts (jumps + drift). | Baseline correction for spectra with smooth/moderate drift and distinct peaks (e.g., Raman, weak signals). | ||
| Improved AsLS (IAsLS): extends AsLS by incorporating dual regularization alongside asymmetric weights to address mixed-frequency baseline artifacts. (Equations 37 and 38). | Empirical trade-off between (jump suppression) and (curvature control) via cross-validation. | Handles mixed artifacts (e.g., NMR drift + Raman jumps). Adaptive to both sharp and smooth distortions. | Requires manual tuning of . Computationally heavier than AsLS. | Complex baselines with combined distortions (e.g., IR spectra with abrupt baseline shifts + gradual drift). | ||
| Multiple Constrained AsLS (mcaLS): Enforces peak symmetry via left/right residual equalization (Equation 39). | Optimize / to balance peak symmetry and smoothness. | Improved RMSE (e.g., 0.09 vs. exponential baselines). | Computationally intensive. | NIR spectra with complex baselines (e.g., maize). | ||
| Adaptive Iteratively Reweighted PLS (airPLS) & Improved airPLS (IairPLS): Exclude peaks () and fit baselines via adaptive weights (exponential in airPLS, sigmoid in IairPLS), balancing residuals and smoothness with penalty term (Equations 40, 41, and 42). | Requires noisy-data validation (e.g., XRF) for baseline RMSE and peak-interference robustness. | airPLS: Simple, fast, effective for mild noise. IairPLS: Higher accuracy (e.g., RMSE 0.0187 vs. 0.0531 in XRF), fewer edge artifacts. | airPLS: Underestimates noisy baselines; -sensitive. IairPLS: Slower; needs sigmoid tuning. | Smart baseline fitting for spectra/chromatograms (XRF/Raman/HPLC). | ||
| Asymmetrically Reweighted PLS (arPLS): Logistic sigmoid weighting () dynamically suppresses negative residuals (noise) while preserving peaks (Equation 43). | Tune (smoothness) via L-curve; validate against false peak suppression in noisy regions. | Automated noise adaptation. Effective for Gaussian/symmetric noise (e.g., XRF). | Overfits faint peaks near baseline. | General-purpose correction for medium-SNR spectra (e.g., XRF, NIR, UV-Vis) with smooth baselines, symmetrical noise, and moderate drift. | ||
| Inverse Square Root Unit (ISRU): Inverse square root () smooths transitions, better capturing weak peaks. (Equation 44). | Validate trade-off between peak sharpness and noise robustness in baseline correction. | Superior weak-peak retention. | N/A | N/A | ||
| Sparse Bayesian Learning (SBL/BrPLS): Bayesian weight adaptation () to separate baseline () and signal, using noise () and amplitude ()parameters (Equations 45 and 46). | N/A | Avoids heuristic tuning (adaptive ). Rigorous -norm minimization via Bayesian inference. Outperforms arPLS/airPLS/AsSL in noise resilience and accuracy. | Computationally heavy (Bayesian updates). Needs prior estimation (signal probability). | Probabilistic baseline correction for noisy, overlapping-peak spectra, excelling in accuracy and robustness over traditional methods. | ||
| Sparse Representation Method (SRM)86,87,88,89,90 | Simultaneous Spectrum Fitting and Baseline Correction via Sparse Representation (SSFBCSP): Sparsity-based joint fitting of baseline and peaks with smoothness/sparsity constraints (Equations 47 and 48). | Requires rigorous validation for highly overlapping peaks (e.g., Raman) and irregular baselines (e.g., fluorescence/drift). | High precision (RMSE 7.83 × 10−4). Adaptable dictionaries handle complex peak shapes. | Slow (dictionary learning); sensitive to prior knowledge in dictionary selection. | Gold-standard for baseline/peak separation. Ideal for biomedical spectroscopy, Raman/NIR, and complex spectral data. | |
| Fast Burst-Sparse Learning for Baseline Correction (FBSL-BC): Fast baseline correction via downsampling and adaptive sparse coding (Voigt dictionary), optimized for burst-sparse signals (Equation 49). | Requires high-throughput validation for real-time processing of large datasets (e.g., agricultural/industrial spectrometry). | 3–5× faster than SSFBCSP. Balanced accuracy-speed trade-off (e.g., on corn NIR dataset). | Lower accuracy with weak signals; struggles with irregular baselines. | Fast, scalable solution for portable spectrometers, agri-NIR, and industrial monitoring. | ||
| Convex Optimization (COF)91,92,93,94 | Unified high-pass fidelity (),adaptive sparsity (), and smoothness(), via MM optimization Equations 50, 51, 52, 53, and 54. | N/A | Automation: Reduces manual tuning. Feature retention: Preserves critical peaks (Raman/LIBS) while removing baselines. Adaptivity: Dynamic sparsity-smoothness trade-off via . | Complex parameter tuning (, , ); requires inversion . | Spectral signal enhancement, especially for LIBS/Raman with baseline drift and overlapping peaks (e.g., steel composition analysis). | |
| Bézier Curve Fitting87,95,96 | Parametric curve defined by control points and Bernstein polynomials , satisfying convex hull containment, endpoint interpolation, and tangent alignment (Equation 55). | N/A | Stable (strict convexity) and simple (endpoint interpolation). | No local control (less flexible than B-splines); constrained to global convexity. | Convex spectral fitting: prioritizes stability over flexibility. | |
| Genetic Algorithm (GA)97,98,99,100,101,102,103 | Evolutionary optimization of fitness function balancing peak retention () and baseline distortion () under noise constraints (Equation 56). | N/A | Self-optimizing automated baseline correction: dynamically balances peak preservation and artifact removal with noise-aware precision. | Computationally demanding for large datasets or high-generation optimizations. | Automated high-throughput spectral analysis (Raman/clinical/agricultural) outperforms manual methods (e.g., Bézier) in precision, robustness, and efficiency. | |
| Deep Learning (1D-CNN/GAN/VAE)104,105,106,107,108,109,110,111,112 | 1D Convolutional Neural Networks (1D-CNNs): ReLU-activated convolutions with MSE optimization (Equations 57 and 58). | N/A | Automated, high-accuracy feature extraction (95.09% in MIR breast cancer classification). | Poor interpretability of learned filters. Requires large datasets for kernel optimization. | MIR spectroscopy for biomedical classification. | |
| Enhanced GAN (EGAN): Adversarial training with generator () and LSGAN discriminator (); composite loss includes reconstruction and local peak preservation (Equations 59, 60, and 61). | N/A | Handles high variability ( for XRF alloys) with transferability to new domains (e.g., soils). | Dual-network training is computationally intensive and sensitive to . | Robust XRF spectroscopy for alloys and soils, handling drifting baselines. | ||
| Variational Autoencoders (VAEs): Gaussian latent space (, ); optimizes ELBO (reconstruction + KL loss) (Equations 62, 63, 64, 65, and 66). | N/A | Uncertainty-aware VAE: Reconstructed ( , ) with edge deployment (TinyML-VAE NRMSE: 0.53–0.66). | VAE issues: Over-smoothing, parameter sensitivity. | Error-Sensitive Spectroscopy in Embedded Sensing (e.g., Magnetometers). | ||
| Enhanced U-Net & Hybrid Models: Learned Feature Fusion (LFF) + Residual Dense Blocks (RDBs). Three-stage training: unpaired pretraining, synthetic augmentation, contrastive fine-tuning. | N/A | Near-perfect regression (), preserving features without over-smoothing. | N/A | High-order baseline correction (7th-degree poly) for Raman/seismic spectra. | ||
| Scattering Correction | Multiplicative Scatter Correction (MSC)113,114,115,116 | Linear correction of additive (), & multiplicative () effects via OLS regression against reference spectrum (Equations 67, 68, 69, 70, and 71). | Validate linearity (e.g., spectral homogeneity) on representative data. | Robustly corrects additive & multiplicative effects (e.g., baseline shifts, path-length). Preserves chemical features (e.g., FTIR oil analysis: ). | Nonlinear limitation: Ineffective for scattering (e.g., complex biological tissues). | Empirical pre-processing for scatter-prone FTIR data (e.g., oil/moisture analysis). |
| Extended MSC (EMSC)117,118,119,120,121 | Extends MSC with polynomial () and PCA terms () for interferents & nonlinear baselines (Equations 72, 73, and 74). | Cross-validation selects polynomial order () and PCA components to avoid overfitting. | Superior to MSC for complex samples (e.g., tissues, multicomponent systems). High specificity (e.g., 97% for Crohn’s). Reduces artifacts (e.g., temperature effects). | Requires careful tuning. Computationally heavier than MSC due to extra terms and PCA. Requires >95% cumulative variance in PCA. | Advanced spectral correction for complex signals (nonlinearities, noise, interference). Used in biomedical (FTIR), polymers, and systems with confounders (e.g., moisture, temperature). | |
| Detrending (DT)122,123 | Subtracts least-squares-fitted polynomial from to eliminate drift (Equation 76). | Requires optimizing polynomial order () to avoid overfitting while removing drift. | Simple, effective for low-frequency drift (e.g., 97.4% TCM species ID, enhanced NIRS accuracy). | Risk of overfitting; baseline-only correction (needs pairing, e.g., SNV). | Corrects baseline drift via polynomial fitting, often paired with SNV (e.g., NIRS nutrient prediction) and Traditional Chinese Medicine spectral analysis. | |
| Standard Normal Variate (SNV)124,125,126 | Normalizes spectra (mean = 0, variance = 1) to reduce scatter/amplitude effects (Equation 77). | Assess risk of feature suppression (e.g., geometric influences). | Robust sample-wise processing handles heterogeneity (scattering/baselines). Highly effective (e.g., 99% ginseng authentication accuracy). | Excessive scaling may suppress useful variance (geometric methods can outperform). Limited to single-spectrum normalization. | Optimized for high-frequency, sample-specific noise (e.g., scattering, NIRS, LIBS) in amplitude-driven data. | |
| Data Normalization | Mean Centering (MCN)127,128,129,130,131 | Subtracts arithmetic mean from each to center data (Equation 78). | Preserves discriminative features while removing offsets. | Removes instrumental drift, preserves spectral integrity—critical for PCA (e.g., 99.7% NIR accuracy). | Fixes global shifts, not local/multiplicative effects. | Key for PCA/PLS baseline alignment—vital in agriculture and pharmaceuticals analysis. |
| Orthogonal Signal Correction (OSC)132,133,134,135,136,137,138,139,140,141 | Removes -orthogonal noise via PCA/SVD projections of residual matrix (Equations 79, 80, 81, 82, 83, 84, and 85). | Targeted non-correlated noise removal, preserves predictive signals. | Boosts robustness via structured noise removal (e.g., rice bran oil analysis: ). | Higher compute cost vs. univariate methods. | Applies to spectroscopy noise reduction (e.g., oil analysis, glucose monitoring, soil measurement). | |
| Min-Max Normalization (MMN)127,142,143,144,145,146,147 | Normalizes data to [0,1] via min-max scaling (Equation 86). | N/A | Simple, universally beneficial, and performance-enhancing. Universal scaling for fixed-range models (e.g., honey classification: 97%). | Highly sensitive to outliers (min/max-dependent). | Applied in spectroscopy, environmental analysis, and biomedical models for generic feature scaling. | |
| Z Score Normalization (ZSN)148,149,150,151,152 | Standardizes data to zero mean and unit variance. Standardizes to , via (Equations 87 and 88). | Verify ① and , ② outlier robustness (vs. min-max), and ③cross-domain consistency (e.g., radiomics harmonization). | Robust to outliers (vs. MMN), suppresses noise (reduces multicollinearity), and adapts across domains (supports LSNV variants). | Assumes Gaussianity (non-normal distortion risk). Global scaling (may disrupt local patterns). | Applied in spectroscopy (artifact removal, classification), medical imaging (cross-scanner harmonization), high-dimensional data (multicollinearity reduction). | |
| 3-Sigma Normalization (3-SN)127,153 | Truncates outliers beyond , scales inliers to [0,1] via (Equation 89). | Truncation Bounds: Confirm ( threshold). Normalization: Verify scaled values ∈[0, 1] (post-truncation). | Outlier robustness (vs. Z score). Noise filtering. High classification accuracy (97.8% in breast tumors). Cross-scanner stability. | Information loss (eliminates extreme values). Less theoretical foundations (heuristic approach). | Robust spectroscopic tumor classification method: Cross-scanner harmonization + outlier robustness + signal fidelity. | |
| L1 Normalization154 | scales feature by Manhattan norm (sum of absolute values) to preserve sparsity (Equation 90). | Verify normalization preserves sparsity (e.g., NILM accuracy: 96.42–97.65%). | Sparsity retention (key for NILM). Preserves appliance signatures (96.42–97.65% accuracy, PLAID/WHITED). | Sensitive to low-magnitude noise (non-zero values may be amplified). Ignores distributional properties (vs. ZSN/3-SN). | Non-intrusive load monitoring (NILM) via sparse signal decomposition (e.g., energy disaggregation). | |
| L2 (Vector) Normalization155,156 | Scales by Euclidean norm: (Equation 91). | Ensure normalization enhances discrimination without distorting angular relationships. | Preserves cosine similarity. Enhances contrast (100% PPCA accuracy). Compacts data (88.6% lossless spectral reduction). | Ignores magnitude (angle-focused). Noise-sensitive (non-sparse-optimized). | Angular similarity tasks (e.g., cosine distance). Dimensionality reduction (e.g., VN-CARS-PLSR for kiwifruit sugar prediction). Direction-sensitive classification (e.g., brain MR analysis). | |
| Logarithmic Transformation (LT)157,158 | Converts nonlinear (multiplicative) relationships into additive space (Equation 92). | Ensure input stability (requires ) | Stabilizes variance, normalizes skewness, and compresses dynamic ranges (e.g., RMSEP = 128 in pulp quality analysis). | Excludes non-positive data (requires ). Less flexible than alternatives (e.g., Box-Cox for power-law data). | Spectral/exponential data (NIR absorbance, soil carbon prediction). Variance stabilization for skewed distributions. | |
| 1/n Power Transformation (1/n-PT)159 | Nonlinearly compresses values with parametric control: (Equation 93). | Ensure non-negative inputs (requires ); shift data if negative ().Hyperparameter tuning for optimal . | Reduces outliers, normalizes skewness, stabilizes variance, and optimizes feature distributions. | Limited to non-negative data (requires shifting for negatives). Compression efficacy depends on , requiring careful tuning. | Spectroscopic analysis (e.g., Cu(II)-DOM binding quantification, humic substance enrichment). Parametric compression for skewed data (e.g., TOC removal: 89%). | |
| Filtering & Smoothing | Gaussian Filtering (GF)160,161 | Convolves spectra with Gaussian kernel . Cutoff frequency (Equation 94). | Optimal tuning: Avoid under-smoothing (residual noise) or over-smoothing (peak distortion). | Reduces high-frequency noise while retaining spectral features. Adaptive cutoff: . | Balancing noise suppression and peak distortion requires careful tuning of . | LIBS preprocessing: Resolves emission lines (e.g., Li I 670.8 nm). Enables 100% accurate Dendrobium species classification. |
| Savitzky-Golay Filter (SGF)162,163,164,165,166,167,168 | Sliding-window local polynomial fitting (size ): smoothed output corresponds to the central fitted value (Equation 95). | Optimal tuning of window size () and polynomial degree (). | Preserves spectral moments up to order (unlike fixed GF). Improves SNR (e.g., 2.35× in methane detection) without peak distortion. | Polynomial degree overfits noise if too high; window size blurs features if too large. | Methane detection (LITES), CO monitoring (Vis-NIR), soil phosphorus prediction via remote sensing (), and gas analysis (TDLAS, 0.53 ppm accuracy). | |
| Wiener Filtering (WF)169,170,171,172 | optimally estimates signals by adaptively weighting observations based on noise and system response in the frequency domain (Equations 96 and 97). | Requires accurate and estimation. | Superior noise suppression (vs. SGF in SNR-adaptive cases). 100 ppm CH4 detection limit (vs. SGF’s 120 ppm). 40% higher resolution (vs. SGF). | Requires accurate (signal power spectrum) and and (noise power spectrum) estimation. | Precision spectroscopy: CH4 detection, hyperspectral imaging. Miniature spectrometers: Better abrupt noise suppression vs. SGF. Motion artifact removal: NIR spectroscopy. | |
| Kalman Filtering (KF)173,174,175,176,177,178,179 | Recursive estimation: Time update (model prediction) + measurement update (sensor data). Optimal weighting: Kalman gain minimizes error covariance (Equations 98, 99, and 100). | Verify , , , accuracy; recursive updates minimize posterior error covariance . | Minimizes MMSE via Kalman gain , reduces NO2 detection limits 9.12×, accelerating isotope analysis (CO2 δ13C/δ18O) 7-fold, and cutting PLS RMSE by 40% (0.38 → 0.17 g/L). | Model sensitivity: Performance affected by inaccurate , , , . Computational load: Higher than static methods (e.g., WF) due to recursive updates. | NO2 cavity ring-down (transient noise suppression). Glucose Raman (real-time noise reduction). LIBS steel analysis (drift correction). QCL isotopologues (high-speed precision). | |
| Neural Network Adaptive Filter (NNAF)180,181,182 | LMS weight updates with sigmoid-step control: (Equations 101, 102, 103, and 104) | Requires tuning of Step-size bounds (, ), Sigmoid steepness (), and Error exponent (). Must satisfy for convergence. | Higher adaptability in nonlinear environments; 3–4 dB SNR gain (UV-Vis) and f for concentration prediction. | Sensitive to , , and step-size. Higher computational cost than LMS/KF. | Spectroscopic denoising (Zn extraction, Cu/Co impurity analysis). Nonlinear signal processing (beyond KF/WF assumptions). | |
| Wavelet Shrinkage Denoising (WSD)183,184,185,186 | Applies universal threshold or level-dependent to wavelet coefficients. (Equations 105, 106, 107, 108, 109, and 110). | Requires: Wavelet basis selection (e.g., Daubechies), Optimal decomposition level (), Noise estimation (e.g., MAD). Threshold tuning (noise removal vs. signal fidelity) | Employs MAD-based noise estimation, reducing GFRP thickness error to 3.7% (↓16.4%→3.7% via THz-TDS) with fast sparse processing; achieves high noise reduction (+32 dB SNR gain in ESR spectroscopy). | Basis dependency (sensitivity to wavelet choice). Transient distortion (potential oversmoothing of sharp features). | GFRP thickness analysis. ESR/THz-TDS signal enhancement. | |
| Hilbert Vibration Decomposition (HVD)187,188,189,190,191 | Adaptive demodulation (Hilbert-based component extraction). Iterative decomposition (dominant component and residual processing). Stopping criterion (Iterates until frequency estimate SD ) (Equations 111, 112, 113, 114, and 115). | Requires Reference frequency () selection, Convergence threshold () tuning, and AM/FM signal suitability verification. | SNR improvement: 3.53 → 130.64 (spectroscopy). Classification accuracy: 40% → 90.25%. | Limited to AM/FM signals; sensitive to reference frequency selection. | Structural vibration analysis. Spectroscopic signal enhancement. | |
| Spectral Derivatives | Finite Difference Method (FDM)22 | Discrete differentiation: Approximates derivatives using adjacent points (, , ). First derivative (,error ) and Second derivative (,error ) (Equation 116). | Requires Constant sampling interval () and Low-noise input data to avoid artifact amplification. | Computational efficiency: Simple, non-iterative calculations. Suitable for smooth, high-resolution spectra. | Noise sensitivity: Amplifies fluctuations. Truncation error: Accuracy limited by step size (). | Primarily useful for clean, high-resolution spectra where computational simplicity is prioritized over robustness. |
| Savitzky-Golay Derivatives (SGD)192,193,194,195,196,197 | Polynomial fitting over sliding windows for smoothing + derivatives. Least-squares fitting. Convolution-based: Computes smoothed values and derivatives via predefined coefficient sets (e.g., , , ) (Equations 117, 118, 119, 120, 121, and 122). | Requires tuning Window size (, typically ≤25) and Polynomial order (, usually 2ND or 3rd). | Outperforming FDM in noise robustness while preserving spectral features (e.g., enabling 100% SVM accuracy in NIR wood ID), with tunable parameters (window size and polynomial order ) for optimal smoothing-feature resolution. | Require careful selection of and to avoid parameter sensitivity and edge artifacts, necessitating dataset-specific optimization to maintain accuracy, particularly near spectral boundaries. | Spectral preprocessing: Baseline correction (competitive vs. NW derivatives), Real-time crystallization monitoring (2ND derivatives). Classification tasks: Paired with ML (e.g., SVM). | |
| Norris-Williams Derivatives (NWD)198,199 | Smoothing: moving-average filter (-points window) for . First derivative: , Second derivative: derivative (Equation 123). | Requires optimized (smoothing window) and (gap size). | Strong noise suppression, high SNR. Fast computation (real-time capable). Superior trace detection (e.g., enzyme granules, segregation index 0.71). | Potential blurring of fine features vs. polynomial methods (e.g., SGD). Parameter sensitivity: Suboptimal / degrades performance. | Superior for trace component detection in particulate systems (e.g., enzyme placebo granules). Real-time particulate system monitoring (balances noise suppression & speed). Outperforms derivatives & SNV in low-SNR conditions. | |
| Grünwald-Letnikov Fractional-Order Derivative (GL-FOD)62,200,201,202,203,204 | Employs an fractional differential matrix with gamma-weighted finite difference coefficients derived from , implemented as a sparse lower-triangular matrix truncated at for computational efficiency) (Equations 124, 125, and 126). | Optimize order to Balance resolution and noise sensitivity. Tune step size and truncation limit . Examples: 0.3-order for chlorophyll, 0.6-order for soil organic matter | Captures nonlinear features better than integer derivatives (+17% correlation in some cases). Validated on hyperspectral (crops, soil, oil) and chemometric models. | Higher computational cost (matrix ops, gamma functions). Requires empirical tuning of . | Agriculture: Cadmium detection (), chlorophyll estimation (). Geoscience: Soil organic matter prediction (), reservoir volatility modeling. Strengths: Superior for nonlinear, low-contrast spectral features where integer derivatives fail. | |
| Data Information Mining | Three-Dimensional Correlation Method (3dCM)205,206 | Uses sequential Hilbert transforms (HT) and tensor products to convert 1D spectra into 3D correlation matrices, Final output has equivalent resolution and augmented sample size (Equations 127, 128, 129, 130, and 131). | Confirm performance gains (<10% → >99% accuracy) in models like PCA-LR, PLS-LR, KNN, RF, CNN. | Captures latent correlations missed by removal-based methods. Significantly enhances ML performance (e.g., paper authentication). | Computationally heavy due to 3D tensor operations. Requires sufficient samples per class () for stable tensor construction. | Authentication: Handwritten paper analysis (>99% accuracy with ML). Hyperspectral: Extends 2D correlation for deeper feature extraction. |
The table systematically evaluates seven key dimensions of common spectral preprocessing methods, emphasizing their theoretical foundations, practical trade-offs, and domain-specific applicability.Columns Description:1. Category: Broad classification of preprocessing methods, encompassing cosmic ray removal, baseline correction, scattering correction, normalization, filtering and smoothing, spectral derivatives, and data information mining.2. Method: Names of specific techniques within each category (e.g., Savitzky–Golay smoothing, multiplicative scatter correction).3. Core Mechanism: Fundamental principle or mathematical operation (e.g., Least-squares fitting, adaptive sparse coding).4. Validation Need: Empirical validation requirement (Yes/No) with typical approaches (e.g., noisy-data validation, manual validation).5. Advantages: Key strengths of each method (e.g., preserves spectral details, computationally efficient).6. Disadvantages: Limitations and trade-offs (e.g., computationally intensive, poor interpretability of learned filters).7. Primary Role & Application Context: Dominant use cases (e.g., robust XRF spectroscopy for alloys and soils).
The removal of cosmic ray artifacts
Building upon the spectral data challenges, we focus on cosmic ray spikes (CRSs)—among the most abrupt and localized artifacts—whose immediate removal is essential to avoid propagating errors through downstream preprocessing. CRSs arise from high-energy particle collisions, manifesting as stochastic, ultra-narrowband (<1-2 cm−1) unipolar spikes that disproportionately distort spectral features despite their sparse occurrence. Unlike broadband noise, CRSs induce catastrophic localized distortions, corrupting both qualitative spectral interpretation (e.g., peak misidentification) and quantitative metrics (e.g., integrated intensity errors).
Convolutional smoothing (e.g., Savitzky-Golay filters) inherently fails for CRSs: their global operation sacrifices either spike suppression or genuine feature preservation—a trade-off untenable for precision spectroscopy. Although multi-scan averaging is the gold standard, its dependence on stationary samples and replicate measurements excludes real-time applications, while repeated scans compound readout noise—a critical limitation for transient phenomena.
These constraints demand algorithmic solutions. We systematically categorize and evaluate computational CRA removal methods, progressing from heuristic to model-based approaches.
Moving average filtering
For single-scan spectral data where replicate measurements are unavailable, cosmic ray artifacts (CRAs) must be corrected in real-time. The moving average filter (MAF) achieves this by statistically detecting and excluding CRA-corrupted points before performing localized smoothing. While computationally efficient, MAF’s uniform-weight averaging inherently blunts legitimate sharp spectral features adjacent to CRAs—a limitation intrinsic to linear filtering approaches.
The algorithm first computes a first-order difference to amplify CRA-induced transient spikes while suppressing low-frequency baseline drift. A modified Z score (Equation 1) then isolates these anomalies30:
| (Equation 1) |
where represents the spectral intensity at wavenumber point , the 0.6745 factor calibrates the median absolute deviation (MAD) to match the standard deviation of Gaussian-distributed noise. A threshold (typically 3–5) is applied to flag and exclude CRA-corrupted points from smoothing. The final filtered spectrum is reconstructed via selective windowed averaging (Equation 2):
| (Equation 2) |
where the indicator function ensures only non-CRA points contribute, and the half-width adjusts smoothing intensity. This approach achieves real-time cosmic ray correction with robust artifact removal and computational efficiency, but trades off spectral resolution (near affected regions) and sensitivity to window size tuning.
Missing-point polynomial filters
While MAF provides a foundational approach to spike rejection through linear averaging, the Missing-Point Polynomial Filter (MPF) advances this paradigm by strategically excluding CRA-corrupted points during polynomial regression, thereby preserving spectral features adjacent to artifacts with higher fidelity31:
| (Equation 3) |
Unlike conventional Savitzky-Golay smoothing, which applies symmetric convolution to all points, MPF explicitly omits the central outlier (e.g., ) by enforcing zero convolution weights (). By fitting a quadratic polynomial to the remaining points within , MPF computes a corrected value purely from neighboring data, minimizing distortion while retaining computational efficiency. The regression coefficients are derived via least-squares minimization, where is the Vandermonde matrix of terms , and represents residuals.
This approach is particularly suited for isolated cosmic ray spikes in uniformly sampled spectra (e.g., Raman or IR), eliminating artifacts without the spectral broadening associated with MAF.
Multistage spike recognition
Multistage spike recognition (MSR) algorithm32 improves upon standard cosmic ray removal methods (e.g., MPF) by integrating adaptive dynamic thresholds, iterative statistical validation, and signal-shape constraints. This automated technique accurately discriminates and corrects cosmic ray artifacts in time-resolved Raman spectra without distorting authentic chemical variations.
The algorithm processes spectral data (where = temporal sampling points, = wavenumbers) by first computing forward differences to enhance transient signals:
| (Equation 4) |
Potential cosmic ray spikes are identified through iterative statistical filtering, where deviations in are compared against a dynamic threshold . Here, is the locally recalculated standard deviation after excluding outlier to avoid bias. False positives are minimized by requiring spikes to exhibit a sharp rise followed by an immediate return to baseline. For apodized spectra, a secondary threshold (empirically set to ) corrects spike shoulders while enforcing a maximum width constraint (≤30 pixels) to prevent overcorrection.
Validated across diverse experimental conditions, MSR demonstrates robust performance—particularly in datasets with 40–50+ sequential spectra—due to its adaptive thresholding, which reduces sensitivity to parameter selection and baseline drift.
Polynomial filters
After MSR identifies cosmic ray spikes through adaptive thresholds and signal-shape constraints (e.g., sharp rise/fall within ≤30 pixels), polynomial filtering (PF)33 corrects the affected regions via localized polynomial interpolation. This automated method preserves underlying spectral features while minimizing artifacts by leveraging dataset-wide consistency. PF achieves this by:
-
(1)
Global polynomial fitting – Fitting a third-order (or higher) polynomial across all spectra at each wavenumber to establish the expected intensity distribution.
-
(2)
Outlier rejection – Flagging data points exceeding 3× the standard noise deviation as cosmic ray artifacts while retaining genuine Raman peaks.
-
(3)
Parameter efficiency – Requiring only two adjustable parameters (polynomial order and threshold multiplier), enabling batch processing of homogeneous datasets.
For heterogeneous datasets, supplemental techniques (e.g., manual validation or median filtering) ensure accuracy without distorting true spectral features. PF’s robustness and minimal user intervention make it ideal for high-throughput Raman applications where manual inspection is impractical.
Nearest neighbor comparison (NNC)
PF relies on global polynomial fits for CRA removal, whereas the nearest neighbor comparison (NNC)34,35 method enhances robustness by incorporating spectral covariance-based cross-validation, dynamically identifying anomalies through spatial comparison to maintain inter-spectral coherence in single-scan datasets. The algorithm quantifies spectral similarity using normalized covariance:
| (Equation 5) |
where, “·” denotes the dot product between spectra and , with higher values indicating greater similarity. For each spectrum, NNC computes its closest match by maximizing , then estimates local noise characteristics via Savitzky-Golay (SG) smoothing:
| (Equation 6) |
Here, represents the SG-smoothed signal, and is the noise standard deviation. CRAs are flagged where the raw-smoothed residual exceeds (primary threshold), while a secondary threshold at captures broader, low-intensity anomalies in neighboring pixels. Artifacts are replaced using values from the best-matched spectrum, ensuring minimal distortion of genuine features. Unlike multi-scan methods, NNC’s single-scan design eliminates cumulative read noise, optimizing efficiency without compromising signal integrity—critical for time-sensitive studies. The method’s dual-threshold approach balances sensitivity and specificity, while its automated, statistics-driven workflow minimizes false positives even in spectrally variable datasets when combined with preprocessing (e.g., baseline correction).
Wavelet transform methods
Wavelet-transform methods complement spatial-correlation-based techniques like NNC by employing multi-resolution decomposition to isolate CRAs while preserving fine spectral features across frequency scales. These artifacts manifest as narrow, high-intensity distortions in Raman spectra, obscuring underlying chemical information.36 The hybrid approach integrates discrete wavelet transform (DWT) and k-means clustering (KMC): each spectrum undergoes -level wavelet decomposition, splitting into approximation () and detail coefficients (), with decomposition depth determined by (where is spectral length). Unlike genuine Raman signals (characterized by Lorentzian/Gaussian/Voigt profiles), CR spikes generate random high-frequency wavelet coefficients. K-means clustering then minimizes intra-cluster variance37 to isolate outliers:
| (Equation 7) |
where, represents cluster centroids and their coefficient counts. An adaptive threshold —scaling linearly for broad Raman peaks or with a square root function for sharp features—ensures dynamic artifact detection. The algorithm incrementally increases cluster count until the largest cluster’s radius falls below , classifying non-clustered coefficients as CR spikes; these are replaced with centroid values, while unclustered coefficients retain original values to preserve spectral diversity. Finally, inverse DWT reconstructs the corrected spectrum.
For enhanced detection of multi-pixel CR events, lag plots further discriminate artifacts: Raman peaks form lemniscate patterns along , whereas CR-induced deviations () appear as statistical outliers. A DWT filter extracts residuals () and the Allan deviation (ADEV) provides a noise-robust threshold38:
| (Equation 8) |
Spikes exceeding (98% confidence) are interpolated, with adjacent points flagged to capture broader artifacts. This method excels for CR spikes narrower than Raman peaks but faces limitations in low-resolution datasets where their widths overlap. Automated parameter optimization ensures scalability for large spectroscopic datasets, offering a robust alternative to spatial-correction methods like NNC without requiring multi-scan averaging
Kernel principal component analysis residual diagnosis
Wavelet transforms decompose signals in the frequency domain to isolate noise. In contrast, Kernel PCA Residual Diagnosis (KPCARD)39 adopts a machine learning framework, leveraging kernel-induced feature spaces to detect and correct nonlinear artifacts while preserving subtle spectral features—an essential capability for high-precision applications. The foundation of this approach lies in principle component analysis (PCA), which decomposes a spectral dataset (with spectra and channels) via singular value decomposition (SVD) of its covariance matrix expressing the data as:
| (Equation 9) |
where contains scores, denotes loadings (principal components), and represents residuals. The principal components are derived from the eigenvalue problem:
| (Equation 10) |
where, and correspond to explained variance and eigenvector directions, respectively.
However, while PCA excels at modeling smooth spectral variations, it falters with sharp CRAs due to its linearity constraint. To address this, KPCARD extends the framework to Kernel PCA (KPCA)40 by nonlinearly mapping the data into a high-dimensional space via a Gaussian kernel:
| (Equation 11) |
where adjusts the sensitivity to spectral distances. KPCA then applies PCA in , enabling the extraction of nonlinear principal components that better isolate CRAs from genuine spectral bands.
The residual diagnosis stage introduces a dual-threshold mechanism (global and local) to optimize artifact detection:
| (Equation 12) |
where the global threshold leverages confidence intervals (), while the local noise threshold incorporates second-derivative estimates (), ensuring robustness against noise fluctuations. Residuals exceeding both thresholds are flagged as CRAs; corrupted regions are then reconstructed via:
| (Equation 13) |
where denotes a neighboring uncorrupted reference spectrum, dynamically scales the intensity for spectral matching, corrects baseline deviations, and reconstructs fine spectral features through a weighted () linear combination of retained principal components (). By synergizing adaptive threshold detection with PCA-based reconstruction, this approach ensures robust artifact removal while faithfully preserving the intrinsic Raman spectral profiles.
Cosmic ray removal: algorithm performance comparison
CRA correction methods span a spectrum balancing efficiency and precision. Rapid single-scan techniques (e.g., MAF) utilize statistical thresholding for real-time applications but may distort adjacent spectral features. Localized fitting approaches (e.g., missing-point polynomial filters, MPF) preserve fidelity by excluding corrupted points during interpolation, while multistage spike recognition (MSR) employs adaptive dynamic thresholds for robust iterative validation in time-resolved datasets.
For global optimization, PF fits homogeneous data efficiently but struggles with complex spectral variations. NNC leverages spectral covariance for artifact replacement, proving effective in dynamic systems. Wavelet-based methods excel in discriminatory power through multiscale decomposition and clustering, albeit requiring parameter tuning. At the highest precision tier, KPCARD enables nonlinear artifact detection at increased computational cost.
Method selection hinges on a triad of factors: artifact morphology (isolated spikes vs. distributed noise), dataset dynamics (static homogeneity vs. temporal variability), and accuracy-speed trade-offs—from rapid preprocessing (MAF/MPF) to critical applications requiring maximal feature preservation (wavelet/KPCARD).
Baseline correction
After addressing high-frequency CRAs through thresholding or wavelet-based removal, spectroscopic preprocessing shifts to correcting baseline distortions—systematic low-frequency shifts caused by scattering, fluorescence, or detector drift. Unlike sparse CRA spikes, these broad interferences obscure true spectral features and skew quantitative analysis. Effective correction must disentangle analyte signals from background without distorting peak morphology, a task approached through physical/statistical models (e.g., polynomial fitting and asymmetric least squares), adaptive algorithms (e.g., penalized splines and iterative optimization), and machine learning (e.g., autoencoder-based detrending). The choice hinges on spectral complexity: simple baselines may succumb to linear models, while complex interferences demand data-driven adaptability. Below, we dissect these methods, balancing mathematical rigor with practical trade-offs.
Wavelength artificial shift subtraction
Wavelength artificial shift subtraction (WASS) is a computational baseline suppression technique that isolates spectral signals by leveraging artificial wavelength shifts instead of empirical baseline fitting. The method decomposes an observed spectrum into background and signal :
| (Equation 14) |
The key innovation lies in applying a small spectral displacement via a translation matrix , generating a shifted spectrum . The difference between the original and shifted spectra is governed by a transfer matrix :
| (Equation 15) |
where is the identity matrix. The signal component is then derived through least-squares minimization of the residual error:
| (Equation 16) |
yielding the closed-form solution:
| (Equation 17) |
WASS is particularly effective in underwater LIBS, where it enhances faint emission lines obscured by broadband scattering.41 However, its performance hinges on careful calibration of —excessive shifts may amplify noise, while insufficient displacements fail to separate the baseline.
Polynomial fitting
Unlike WASS, which relies on physical spectral shifts, polynomial fitting (PF) provides a model-free approach to baseline estimation. However, traditional single-polynomial fitting suffers from order sensitivity—high-order polynomials introduce oscillations, while low-order polynomials underfit complex baselines.42,43 Piecewise polynomial fitting (PPF) addresses this limitation by adaptively segmenting the spectrum and fitting independent polynomials to each region44,45:
| (Equation 18) |
where is the fitted baseline, define the spectral boundaries, partition the spectrum, and are individually optimized polynomial orders per segment.
Recent refinements, such as S-ModPoly, integrate sliding-window segmentation and iterative refinement, reducing processing times to <20 ms for Raman spectra while eliminating inter-segment discontinuities.46 PPF’s flexibility has proven especially powerful in chromatographic soil analysis, where coupling polynomial baselining with vector normalization achieves 97.4% accuracy in land-use classification.47
B-spline fitting method
B-spline fitting (BSF) offers a flexible and robust approach to spectral baseline correction, combining low-degree polynomials with localized control to avoid overfitting while maintaining smoothness. Unlike traditional polynomial fitting, BSF decomposes the spectral domain into segments defined by a knot vector , ensuring each segment is influenced only by nearby control points.48 This localized adaptability makes BSF particularly effective in handling complex spectral variations, such as overlapping peaks and irregular baselines.
The BSF curve for an - point dataset , is constructed using piecewise polynomial basis functions , where denotes the polynomial degree (e.g., cubic splines with ). The fitted curve is expressed as 48,49,50:
| (Equation 19) |
where are control coefficients, and the basis functions are defined recursively. Starting from piecewise constant functions for :
| (Equation 20) |
higher-order B-splines () are computed via:
| (Equation 21) |
The fitting process employs least-squares optimization to determine the optimal control points (The control polygon formed by ):
| (Equation 22) |
minimizing the error , where is the B-spline basis matrix and the observed spectral intensities.
By adjusting the knot density and polynomial degree, BSF balances smoothness and flexibility, outperforming conventional methods in scenarios like trace gas detection (demonstrating 3.7× higher sensitivity for NH3, O3, and CO2).51 Its local control prevents distortions from isolated noise spikes, while the recursive basis ensures computational efficiency—critical for processing large-scale spectral datasets.
Two-side exponential fitting
The automatic two-side exponential (ATEB) method provides an effective solution for baseline correction through its bidirectional exponential smoothing approach. While B-spline fitting’s geometric flexibility makes it ideal for modeling smooth baselines, ATEB’s dynamic adaptation mechanism specializes in handling baseline drift while effectively eliminating edge artifacts, though it faces challenges with rapid signal fluctuations compared to localized methods like B-splines.
At the core of ATEB’s operation lies a weighted moving average process that self-adjusts based on spectral characteristics. The method calculates the baseline estimate at each point through the recursive relation:
| (Equation 23) |
where represents the raw signal intensity and the smoothing factor controls the trade-off between noise suppression and baseline tracking responsiveness. Lower values of provide stronger smoothing, while higher values allow quicker adaptation to baseline variations.
ATEB’s distinctive feature is its bidirectional processing, combining results from forward (from to ) and backward (from to 1) passes to cancel out edge distortions that commonly plague one-sided smoothing methods. The algorithm iterates this process until the baseline stabilizes, typically when changes between iterations fall below a predefined threshold .
With linear time complexity per iteration, ATEB offers computational efficiency suitable for processing large multivariate datasets. Its completely automated operation eliminates the need for user-defined peak detection or prior knowledge of spectral features, making it particularly useful for high-throughput applications. It is effective in resolving complex baselines while preserving analytical signal integrity.52
However, the method’s reliance on averaging makes it less suited for signals with rapid fluctuations, where B-spline approaches with localized control demonstrate superior performance. This trade-off positions ATEB as an excellent choice for smooth or moderately varying baselines, while suggesting alternative methods might be preferable for more volatile spectral features.
Morphological operations and mollification
Compared to ATEB’s statistical smoothing, which lacks spatial awareness, morphological operations—erosion and dilation—leverage a localized, topology-driven approach to signal processing. These operations excel in pharmaceutical spectral analysis, though their performance depends on carefully tuning the structural element (SE) width .53 Rooted in set theory, mathematical morphology inherently captures geometric features, making it highly effective for tasks such as baseline correction.53,54,55,56,57,58 The four fundamental operators—erosion, dilation, opening, and closing—use a sliding window of points centered at :
| (Equation 24) |
Here, erosion suppresses narrow peaks while broadening valleys, whereas dilation enhances peak prominence while narrowing valleys. The opening operation (erosion followed by dilation) smooths signals by eliminating small fluctuations but may distort sharp peaks. Conversely, closing (dilation followed by erosion) fills gaps in the baseline but risks overestimation. To balance these effects, the Morphological Operation and Mollification (MOM) method computes their average:
| (Equation 25) |
To ensure smoothness, the estimated baseline is convolved with a mollifier kernel , an exponentially decaying function with compact support on :
| (Equation 26) |
This normalized convolution refines the baseline while suppressing artificial oscillations. The smoothed baseline is expressed as follows:
| (Equation 27) |
The process terminates when either the maximum iterations are reached or the relative change rate (RCR) falls below a threshold (typically 10−3 to 10−6).
MOM has demonstrated strong performance in pharmaceutical applications, particularly in enhancing PCA-based classification58,59 of active compounds in tablets—provided the SE width matches the spectral feature sizes.
SVD method
SVD provides an effective low-rank approximation framework for automated baseline correction in spectroscopic analysis. This approach fundamentally differs from morphology-based methods by employing orthogonal matrix decomposition to separate low-rank baseline variations from high-rank signal components. Given a spectral data matrix (where represents wavelength points and denotes samples), SVD factorizes the matrix as:
| (Equation 28) |
where, (left singular vectors) and (right singular vectors) are orthogonal matrices, and contains the singular values in descending order. The algorithm’s effectiveness stems from its ability to capture baseline artifacts in the first singular vectors (associated with the largest ). The reconstructed signal aggregates high-rank residual components (from to ) as a weighted sum:
| (Equation 29) |
where scales significance, are sample weights (from ), and are baseline-corrected spectral basis functions (derived from truncated left singular vectors). Truncation at isolates the true signal by excluding dominant baseline contributions.59
The decomposition explicitly separates the measured signal into baseline and corrected spectrum through:
| (Equation 30) |
where represents the non-Raman background. Optimal selection can be achieved via variance analysis or advanced methods like fuzzy generalized SVD (FGSVD),60 which achieves 100% classification accuracy in NIR spectroscopy. Compared to alternatives, SVD demonstrates superior performance in feature preservation—e.g., improving from 0.55 to 0.88 in DPPH/ABTS assays—while remaining computationally efficient for large datasets (e.g., GC-TOF-MS).61
Wavelet transforms method
The wavelet transform (WT)62,63,64 overcomes the global spectral mixing of SVD by providing localized multiscale decomposition. In this framework, a signal is partitioned into low-frequency approximations (, representing baseline trends) and high-frequency details (, encoding peaks and noise) through the DWT. Mathematically, this is expressed as:
| (Equation 31) |
where the wavelet function extracts high-frequency components while the scaling function captures low-frequency dynamics. The coefficients and are computed via iterated filtering:
| (Equation 32) |
with (low-pass) and (high-pass) constituting a two-channel filter bank designed for exact reconstruction, where for orthogonal wavelets. Implemented via the Mallat algorithm,65 the DWT achieves exact reconstruction by recursively decomposing and reconstructing the signal through down-sampling and up-sampling operations. Crucially, this method requires no prior assumptions about baseline or peak morphology,66 enabling robust separation even for complex spectra. For example, in terahertz spectroscopy, the db4 wavelet reduced baseline RMSE to 0.57%—21-fold lower than polynomial fitting’s 12.38%.67 While minor artifacts (e.g., negative lobes68) can arise from wavelet oscillations, these are readily suppressed via thresholding, preserving the DWT’s dominance in noise-resistant spectral analysis.
Least squares methods
Unlike wavelet-based methods that implicitly address baseline distortion, the weighted penalized least squares (wPLS) framework explicitly models baseline correction through a regularized optimization approach.69,70,71,72,73 This method is particularly effective for Raman and IR spectroscopy, where baseline drift often dominates noise. Given an observed spectrum composed of the true signal , baseline , and random noise , the wPLS objective function balances fidelity term amd roughness penalty :
| (Equation 33) |
| (Equation 34) |
where are adaptive weights that downweight spectral regions with peaks or noise, ensuring baseline fidelity while mitigating undue influence from signal features, is a second-order difference matrix (a Toeplitz matrix with diagonals) that enforces smoothness by penalizing sharp curvature in the baseline estimate . The combined objective function is formulated as:
| (Equation 35) |
where is the regularization parameter, chosen via L-curve analysis74,75,76 or cross-validation to trade off baseline smoothness () against fitting accuracy (). The closed-form solution is derived via normal equations:
| (Equation 36) |
Iterative reweighting refines until convergence, typically controlled by a threshold on relative weight changes .
One foundational advancement in this field is the Asymmetric Least Squares (AsLS) method,77 which employs asymmetric weighting to differentiate between peaks (signal) and baseline deviations. The weights are dynamically assigned as follows:
| (Equation 37) |
where . This asymmetry suppresses baseline drift while preserving high-frequency peaks. However, its reliance solely on second-order differences () limits flexibility in handling mixed-frequency artifacts (e.g., abrupt jumps or linear ramps).
The Improved AsLS (IAsLS)78,79 addresses this by combining first-order () and second-order () penalties:
| (Equation 38) |
where penalizes residual gradients to mitigate abrupt transitions (e.g., step artifacts), controls curvature to smooth slow drifts, suppresses sharp jumps but may oversmooth weak peaks, and flattens global curvature but risks attenuating genuine low-frequency signals. Optimal balance is empirically determined via cross-validation, prioritizing for sharp artifacts (e.g., Raman baselines) and for undulating offsets (e.g., NMR drift). IAsLS achieves local adaptability (peak preservation) and global smoothness (artifact rejection), making it robust across techniques with diverse distortion profiles.
To achieve superior peak symmetry in spectral baseline correction, the multiple constrained asymmetric least squares (mcaLS)80 method employs a tripartite objective function that simultaneously minimizes fitting residuals, penalizes excessive baseline curvature, and enforces symmetry in peak boundaries. The mathematical formulation is given by:
| (Equation 39) |
Here, the first term represents a weighted sum of squared residuals, where assigns importance to each data point. The second term, regulated by the smoothing parameter , suppresses abrupt changes in the second derivative of the baseline (), ensuring smooth approximation. The third term, controlled by , minimizes asymmetry in peak boundaries by equalizing the left () and right () residuals. When applied to maize near-infrared (NIR) spectra, mcaLS demonstrated superior performance, achieving RMSEs of 0.13 and 0.09 for quadratic polynomial and exponential baselines, respectively—significantly outperforming AsLS and ARPLS.80
For adaptive baseline fitting, the Adaptive Iteratively Reweighted Penalized Least Squares (airPLS)69,72 method employs a dynamic reweighting scheme that excludes peak regions while optimizing smoothness. Its objective function balances residual minimization against baseline roughness:
| (Equation 40) |
Here, governs the smoothness penalty to prevent overfitting, while the adaptive weights () distinguish baseline and peak regions. At each iteration , airPLS and its improved variant (IairPLS) update weights as follows:
| (Equation 41) |
| (Equation 42) |
The dynamic weighting mechanism operates in three critical steps: (1) peak exclusion (zero weight for ), (2) baseline refinement, where weights for points below the baseline ()decay exponentially (airPLS) or follow a moderated sigmoid (IairPLS), scaled by the residual magnitude (), and (3) termination when residuals fall below or after maximal iterations.
IairPLS improves upon airPLS by smoothing transitions at peak boundaries, reducing artifacts in complex datasets. In X-ray fluorescence spectra of soil samples, IairPLS achieved an RMSE of 0.0187, outperforming not only mcaLS (0.3576), AsLS (0.6566), but also the original airPLS (0.0531).69 This highlights its robustness for applications demanding high-fidelity baseline correction.
The airPLS method, while effective for standard datasets, often underestimates baselines in non-peak regions and artificially inflates peak amplitudes when spectra are contaminated with noise. To address this limitation, the asymmetrically reweighted penalized least squares (arPLS)81 method was developed, employing a semi-balanced weighting strategy that accounts for noise distribution symmetry around peak-free baselines. Unlike airPLS, which aggressively suppresses all deviations, arPLS assigns intermediate weights to noise fluctuations near the baseline while zero-weighting signals that significantly exceed it (likely peaks). The weighting function is defined as:
| (Equation 43) |
where denotes the negative residuals (), and and represent their standard deviation and mean, respectively. The logistic function ensures gradual weight transitions, reducing abrupt artifacts. However, arPLS remains prone to overfitting faint noise in Raman spectra, occasionally misclassifying low-intensity peaks as baseline variations. To mitigate this, the inverse square root unit (ISRU) function78 was introduced as a replacement for the logistic component. ISRU’s smoother gradient better discriminates subtle features:
| (Equation 44) |
Experimental validation on simulated Raman spectra (SNR = 30) confirmed ISRU’s superiority, achieving an RMSE of 7.71—significantly lower than arPLS (9.06), airPLS (10.25), and AsLS (9.49).78
For even greater precision, derivative-aware methods such as Adaptive Gaussian Derivative PLS (agdPLS)82 integrate spectral and baseline curvature penalties to distinguish peaks from noise. When applied to methane and ethane gas spectra, agdPLS increased values from 0.88 to >0.99 (methane) and 0.33 to >0.99 (ethane), showcasing its efficacy for gas-phase analysis. Further refinements emerged with the Spectral Estimation-based Asymmetrically Reweighted Least Squares (SEALS),83 which incorporates additive noise uniformity and signal energy distributions relative to the baseline. By dynamically adapting to local SNR conditions, SEALS achieves higher accuracy in low-SNR environments, making it indispensable for complex datasets.
Traditional baseline correction methods often suffer from empirical weight dependencies that render them sensitive to experimental variations and prone to inconsistent performance. To overcome these limitations, the Sparse Bayesian Learning (SBL) framework, also known as Bayesian reweighted penalized least squares (BrPLS), unifies spectral fitting and baseline correction through a probabilistic approach. Unlike conventional methods that approximate the norm through regularization, SBL directly addresses the fundamental -norm minimization problem, achieving superior accuracy without requiring heuristic parameter tuning.84 The mathematical core of SBL employs Bayesian inference to derive adaptive weights85:
| (Equation 45) |
with the auxiliary function:
| (Equation 46) |
In this formulation, the weight function provides a probabilistic measure of whether spectral data point belongs to the baseline or contains signal components. The parameter represents the prior probability of signal presence, while denotes baseline dominance likelihood. The error-function-based adaptation mechanism in enables automatic adjustment to local noise characteristics ( being residual standard deviation) and signal amplitudes , preventing overfitting common in deterministic approaches.
Experimental validation demonstrates SBL/BrPLS’s exceptional performance, delivering values of 0.9990 for sinusoidal baselines compared to 0.9985 (arPLS), 0.8452 (airPLS), and 0.6222 (AsSL).85 This significant improvement stems from the framework’s ability to rigorously distinguish between noise and signal through statistical inference rather than amplitude thresholds, making it particularly robust against noise-dependent biases in spectral analysis.
Sparse representation
While traditional baseline correction approaches like weighted least-squares methods (LSMs) achieve smoothing through curvature penalties, sparse representation techniques offer a fundamentally different paradigm—enforcing structural parsimony through adaptive dictionary learning, albeit with greater computational demands. This sparse representation framework has emerged as a particularly powerful approach for baseline correction, capitalizing on the inherent sparsity of analytical signals when projected onto properly designed dictionaries. The observed signal is decomposed into three components:
| (Equation 47) |
where represents the low-frequency baseline, captures structured signals (e.g., spectral peaks), and accounts for noise. This decomposition is formalized in the simultaneous spectrum fitting and baseline correction via sparse representation (SSFBCSP)86 method as an optimization problem:
| (Equation 48) |
Here, , with being a dictionary (e.g., polynomial or wavelet basis), and the sparse coefficient vector encoding the signal’s structure. The regularization terms (smoothness control via differential operator ) and (sparsity enforcement) balance fidelity and simplicity. While SSFBCSP excels in spectroscopy and biomedical applications due to its precision and noise robustness, its computational complexity remains a limitation for high-dimensional data.86
For instance, SSFBCSP achieves a prediction RMSE of 7.83 × 10−4 for Gaussian noise with sinusoidal baselines,42 outperforming alternatives like asLS (0.0062), airPLS (0.0052), IIPFAT (0.0104), CC (0.0094),87 and ATEB (0.0056).52 On the corn moisture dataset, it attains an of 0.9726, surpassing MSC (0.8881) and SNV (0.8082).86
To mitigate computational costs, downsampling88 techniques reduce data dimensionality without sacrificing accuracy. Leveraging Bayesian frameworks and independent vector decomposition, these methods decouple signal components adaptively and refine solutions via grid optimization, achieving performance comparable to sparse methods with lower overhead.88
Further advances include the fast burst-sparsity learning (FBSL-BC)89 method, which combines down-sampling for speed with burst-sparse modeling for accuracy. FBSL-BC employs a redundant dictionary of Voigt-like functions to represent spectral peaks:
| (Equation 49) |
where, is the peak position, controls width, and adjusts line shape ( for Lorentzian, for Gaussian). By constructing with parameter variations (, , etc.), FBSL-BC achieves 3–5× faster processing than traditional methods while maintaining high accuracy (e.g., on corn NIR data89).
The Adaptive Sparse Decomposition Denoising (ASDD)90 algorithm dynamically builds dictionaries tailored to noise characteristics, demonstrating efficacy in applications like Raman spectroscopy for pesticide detection. These innovations highlight the trade-offs between accuracy, interpretability, and computational efficiency in modern baseline correction pipelines.
Convex optimization framework
While sparse representation methods rely on manually designed dictionaries—limiting adaptability and requiring expert intervention—the Convex Optimization Framework (COF)91,92,93 provides a systematic alternative by unifying sparsity promotion, structural smoothness, and data fidelity through automated convex regularization. Unlike polynomial fitting or dictionary-based approaches, COF minimizes manual parameter tuning while preserving critical spectral features (e.g., Raman/LIBS peaks). Its core objective function is94:
| (Equation 50) |
The first term (data fidelity) ensures alignment with observed spectra, where acts as a high-pass filter. is a baseline kernel and extracts high-frequency features. The inversion recovers suppressed high-frequency components by the baseline model.
The asymmetric penalty (adaptive sparse regularization) dynamically switches between -norm (sparsity) and -norm (smoothness) via threshold and asymmetry parameter :
| (Equation 51) |
The third term of Equation 50 (structural preservation) penalizes noise while retaining peak structures using derivative operators (e.g., 1st/2nd differences) and a differentiable penalty .
COF leverages majorization-minimization (MM) to iteratively convexify the problem. The surrogate function simplifies computation via an adaptive weighting matrix (diagonal entries scale penalties by signal magnitude ) and a quadratic majorizer for :
| (Equation 52) |
| (Equation 53) |
The final surrogate combines all terms into a convex problem per iteration:
| (Equation 54) |
Here, introduces asymmetry, weights derivative penalties for smoothness, and provides a scalar offset.
In LIBS steel analysis, COF improved for Mn/Ni concentration predictions from 0.943/0.958 to 0.982/0.971,94 demonstrating superior baseline removal and feature retention compared to traditional methods.
Bezier curve fitting
While COF’s mathematical rigor provides a systematic framework, Bezier curves offer parametric simplicity through iterative baseline interpolation using control points. This contrast is further enhanced by Bayesian approaches, which automate parameter selection by modeling signal sparsity probabilistically. Mathematically, a Bezier curve is defined as a parametric curve whose shape is determined by weighted combinations of Bernstein basis polynomials. Specifically, the position of an -th degree Bezier curve at parameter is given by87,95:
| (Equation 55) |
where, are the Bernstein polynomials, is the binomial coefficient, and denotes the -th control point. The curve exhibits three critical properties for optimization-based fitting: it exactly interpolates the endpoints and , maintains endpoint tangents aligned with the control polygon edges, and lies strictly within the convex hull of , guaranteeing stability when used as constraints in convex programs.
These properties make Bezier curves naturally compatible with regularized least-squares and iterative corner-cutting algorithms,87 enabling adaptive baseline estimation with guaranteed convergence. For example, in terahertz spectra of ErFeO3 (0.93–5.11 mm thickness), this approach achieves 49–52.6% noise reduction while preserving convexity constraints. However, B-splines offer superior adaptability in optimization-driven tasks due to local control and piecewise polynomial continuity, enabling targeted knot placement that suppresses ∼71% of noise without global constraints.96 This positions Bézier curves as a globally convex but less flexible alternative for spectral preprocessing when strict convexity is prioritized over local adaptability.
Genetic algorithm
Unlike Bezier curves, which demand manual control point placement and user intervention to iteratively adjust baseline shapes, genetic algorithms (GA) automate the optimization process entirely through evolutionary fitness criteria. This eliminates subjectivity by dynamically balancing peak preservation and artifact removal via a formalized objective function . For spectral analysis, quantifies the trade-off between Raman peak retention () and baseline distortion ()97,98:
| (Equation 56) |
Here, the term penalizes overcorrection by normalizing intensity variations against noise, a feature absents in Bezier’s fixed parametric approach. The automation and adaptability of GA deliver comprehensive improvements across three critical dimensions: analytical performance, processing efficiency, and real-world robustness.
In complex spectral analysis, GA achieves superior precision with an RMSE = 0.0004—significantly outperforming conventional methods like AsLS (0.028) or airPLS (0.0007).99,100 GA’s adaptive framework enables seamless integration with advanced analytical workflows, demonstrating 16% lower PLS errors in Gibberella fujikuroi studies101 and enabling high-precision agricultural predictions (e.g., GA-CARS synergy yielding for apple Brix values).102 The method’s robustness is further evidenced in clinical applications—when coupled with Savitzky-Golay smoothing, GA attains 5.7 μM detection limits for methotrexate serum with 15.6% RSD, surpassing Bezier implementations by 30–110% in both sensitivity and accuracy.103 Together, these advantages position GA as the preferred choice for modern, high-throughput spectral analysis where Bezier’s manual paradigm falls short.
Deep learning method
Traditional baseline correction methods like GA rely on computationally intensive heuristic searches, often requiring manual parameter tuning and iterative optimization. In contrast, deep learning (DL) automates this process through hierarchical feature extraction, delivering superior classification accuracy with minimal preprocessing. DL revolutionizes spectral processing by leveraging neural networks’ unparalleled capacity to model complex nonlinear relationships directly from raw spectroscopic data.
1D convolutional neural networks (1D-CNNs)104 exemplify this paradigm, replacing heuristic searches with learned spectral filters. Each convolutional layer applies localized kernels to input spectra , followed by rectified linear unit (ReLU) activation (defined as )105:
| (Equation 57) |
by minimizing MSE between predictions and target spectra , enabling joint baseline removal and feature extraction:
| (Equation 58) |
In MIR breast tissue classification, this achieved 95.09% accuracy by modeling nonlinear variations directly from raw spectra.106
For highly variable baselines (e.g., XRF of alloys), the enhanced GAN (EGAN) framework outperforms deterministic methods by jointly optimizing its generator and discriminator through adversarial training. The generator processes raw spectra and noise via convolutional layers with skip connections, producing baseline-corrected outputs. Meanwhile, the discriminator enforces distributional consistency by minimizing a least-squares adversarial loss (LSGAN):
| (Equation 59) |
Here, learns to push real corrected spectra toward 1 and generated spectra toward 0 through squared-error-driven training. The discriminator distinguishes real corrected spectra from generated ones , while the generator optimizes a composite objective107:
| (Equation 60) |
This equation combines -norm reconstruction () for global fidelity and local peak optimization () to critical spectral regions. is a specialized local peak loss, scaled by a factor of , that prioritizes accurate correction by minimizing squared errors within a local range ( channels) of each peak position (total peaks):
| (Equation 61) |
EGAN achieved on XRF data,107 with further improvements via transfer learning from alloys to soil samples.
For error-sensitive applications, variational autoencoders (VAEs) probabilistically encode input spectra into a latent space by defining a Gaussian posterior distribution:
| (Equation 62) |
where and are the mean and variance learned by the encoder. To preserve differentiability during training, sampling employs the reparameterization trick:
| (Equation 63) |
Here, stochasticity is decoupled into a fixed noise variable , enabling gradient propagation through and . The decoder then reconstructs a denoised spectrum by modeling its distribution conditioned on and decoder weights :
| (Equation 64) |
Here, and quantify both the reconstructed signal and its uncertainty, critical for error-aware applications. Training optimizes the VAE loss , which combines reconstruction error with Kullback–Leibler (KL) divergence to regularize the latent space:
| (Equation 65) |
For diagonal-covariance Gaussian latent variables, the term simplifies to:
| (Equation 66) |
A TinyML-VAE variant demonstrated real-time performance (NRMSE 0.53–0.66) on embedded magnetometers,108 highlighting edge-compatibility. While traditional VAE-based approaches suffer from spectral over-smoothing and parameter sensitivity, modern DL methods leverage domain-specific architectures and multitask training to achieve unprecedented precision and robustness.
For spectroscopic applications, an enhanced U-Net model integrates Learned Feature Fusion (LFF) and Residual Dense Blocks (RDBs) in a three-stage training framework (unpaired pretraining, synthetic data augmentation, and contrastive fine-tuning), achieving near-perfect regression () while preserving critical spectral features.109 In seismic data processing, TraceNet combines synthetic multistage drifts with real earthquake records for automated baseline correction (MSE <10−4), effectively recovering displacement signals without filtering artifacts.110 Similarly, hybrid ResNet-U-Net architectures for Raman spectroscopy111,112 handle high-order polynomial baselines (up to 7th degree) and complex noise, reducing errors by >85% (RMSE: 5.40 × 10−4) compared to asPLS while maintaining robustness on real-world samples.
These advancements—automated processing, quantitative precision, and feature preservation—demonstrate the superiority of data-driven DL over conventional techniques across analytical domains, laying the foundation for adaptive, high-fidelity baseline correction in complex scenarios.
Baseline correction: precision-automation trade-offs in algorithms
Traditional baseline correction methods span a spectrum of accuracy, automation, and computational trade-offs: parametric fits (e.g., polynomials and B-splines) offer simplicity but require careful parameter tuning, while adaptive least-squares methods (AsLS and arPLS) improve robustness through iterative reweighting yet struggle with complex baselines. WTs excel in multiscale separation but may introduce artifacts, and morphological operations efficiently remove narrow distortions using geometric filters yet lack flexibility. Advanced approaches address these limitations at higher computational costs—sparse representation and convex optimization balance accuracy and efficiency but face scalability challenges, while evolutionary algorithms (e.g., genetic algorithms) dynamically optimize corrections for intricate spectra at significant runtime expense. DL models (CNNs and GANs) automate correction via data-driven feature learning, achieving state-of-the-art accuracy at the cost of large datasets and interpretability.
In practice, hybrid methods that integrate complementary algorithmic strengths (e.g., combining mathematical operators with iterative optimization or data-driven modules) are increasingly adopted to bridge these gaps, enhancing adaptability, precision, and scalability for real-world deployment.
Scattering correction
While baseline correction addresses low-frequency distortions to reveal true spectral features, complete signal recovery requires further accounting for scattering effects—a wavelength-dependent phenomenon that introduces intensity variations and obscures chemical information. Scattering correlations elucidate these perturbations, particularly distinguishing between Rayleigh scattering () and Mie scattering (), which dominate wavelength-dependent noise and systematic calibration errors. To mitigate these effects, field-standard techniques like multiplicative scatter correction (MSC) were developed as foundational solutions, later advancing to more sophisticated approaches.
Multiplicative scattering correction
MSC is a chemometric technique that empirically mitigates additive (e.g., baseline shifts) and multiplicative (e.g., path length variations) scattering effects in spectral data.113 The method follows a three-step workflow.114
Step 1 is reference spectrum calculation, where a scatter-free reference spectrum is computed as the mean intensity across all samples at each wavelength :
| (Equation 67) |
Step 2 is linear regression correction, where each sample spectrum is modeled as a linear transform of the reference:
| (Equation 68) |
where (intercept) corrects additive effects, (slope) compensates for multiplicative effects, and denotes residuals. The coefficients and are estimated via ordinary least squares (OLS) by minimizing the residual sum of squares (RSS) with respect to and :
| (Equation 69) |
This yields the closed-form solutions expressed as follows:
| (Equation 70) |
where is the ratio of covariance between and to the variance of , and adjusts for the mean offset.
Step 3 is spectral alignment, where the corrected spectrum is derived by normalizing the original signal:
| (Equation 71) |
MSC demonstrates robust performance in standardized applications, such as FTIR-based crude oil API gravity prediction ()115 and moisture analysis in freeze-dried mushrooms ().116 However, its linearity assumption limits effectiveness for complex samples (e.g., biological tissues with nonlinear scattering or multiple interferents), prompting the adoption of advanced alternatives.
Extended multiplicative signal correction
Building upon the framework of multiplicative scatter correction (MSC), extended multiplicative signal correction (EMSC) addresses limitations in conventional MSC—such as its reliance on linear assumptions—by introducing additional correction terms tailored to complex samples (e.g., biological tissues or moisture-rich environments).117,118 For multicomponent systems, EMSC models the corrected spectrum using119:
| (Equation 72) |
Here, is the mean reference spectrum, scales multiplicative effects (e.g., path length variations), represents interfering signals (e.g., water vapor bands) with coefficients optimized via least-squares regression, and captures residuals. This selectively removes confounders while preserving analyte-specific features (e.g., disease biomarkers).
For nonlinear baseline distortions (e.g., fluorescence in Raman spectroscopy), EMSC incorporates a polynomial model:
| (Equation 73) |
where accounts for baseline shifts, are polynomial coefficients, and the optimal order is determined through cross-validation to avoid overfitting. For systematic noise (e.g., temperature-induced peak shifts in NIR), EMSC integrates PCA-based corrections:
| (Equation 74) |
Here, are PCA loadings, and their weights, with principal components (PCs) typically retained to explain >95% cumulative variance. Residuals are further decomposed to unmask subtle spectral features:
| (Equation 75) |
In practice, EMSC achieves high fidelity: it distinguishes Crohn’s disease via FTIR spectra with 97% specificity120 and reduces temperature artifacts in polymers from to .121 Its modular design enables protocol customization but demands careful parameter optimization and computational resources.
Detrending
While EMSC addresses multiplicative, additive, and structured noise through polynomial and PCA-based corrections, detrending (DT) offers a specialized solution for scenarios where baseline drift dominates spectral disturbances. By removing low-frequency variations via polynomial fitting, DT calculates the corrected spectrum as:
| (Equation 76) |
Here, is the raw spectrum, the detrended output, and a polynomial baseline fitted via least-squares regression. The order controls flexibility—higher orders model complex drifts but risk overfitting.
In NIRS analysis of pearl millet genotypes, DT combined with SNV improved nutrient prediction () using PLS models,122 while for Traditional Chinese Medicine, DT achieved 97.4% species identification accuracy (KNN).123 DT’s focused approach—ideally paired with complementary techniques—makes it indispensable for baseline correction in spectral analysis.
Standard normal variate
While DT eliminates low-frequency baseline drift through polynomial fitting, standard normal variate (SNV) specifically targets higher-frequency, sample-specific variations by centering each spectrum around zero and scaling it to unit variance. This makes SNV particularly effective for datasets where sample-to-sample variability—such as scattering effects, amplitude shifts, or localized baseline deviations—dominates spectral differences.124
The method operates independently on each spectrum, ensuring robustness under heterogeneous conditions (e.g., varying scattering or baseline levels). Mathematically, given a raw spectrum , SNV transforms each intensity value at wavelength as:
| (Equation 77) |
Here, is the sample-wise mean (averaged across wavelengths), and is the standard deviation. Although SNV resembles Z score normalization (ZSN) in form, its per-spectrum application and spectral-specific design differentiate it from global normalization approaches, as it avoids assumptions of uniform statistics across samples.
SNV excels in applications requiring localized spectral correction, such as geographical origin tracing (99% accuracy in Panax ginseng authentication via LIBS-SVM125) and defect detection (enhancing NIR-HSI water-absorption peaks at 1450 nm for apple bruise analysis126). However, excessive scaling may suppress discriminative features, as seen when geometric methods outperformed SNV in hyperspectral edge-bruise detection.
Scattering correction: a benchmark of methods
Scattering correction techniques—such as MSC, EMSC, detrending (DT), and SNV—address spectral artifacts through distinct yet complementary approaches. MSC efficiently corrects linear additive and multiplicative scattering effects but lacks robustness against complex interferents. EMSC, an advanced variant of MSC, incorporates polynomial baselines, interferent modeling, and PCA-based denoising, making it particularly effective in biological systems (e.g., serum or tissue spectra), though it requires careful parameter optimization. DT specifically targets baseline drift by fitting a low-order polynomial, offering a simple yet flexible solution for spectral detrending. SNV, on the other hand, normalizes each spectrum independently by scaling to zero mean and unit variance, which is highly effective for heterogeneous datasets (e.g., LIBS or NIR-HSI) but may overcorrect subtle spectral features.
The choice of method depends on the dominant spectral artifacts: MSC for basic scattering, EMSC for complex interferents or nonlinear effects, DT for baseline shifts, and SNV for sample-specific variations. For challenging datasets, hybrid approaches (e.g., EMSC + DT) often yield superior results by leveraging the strengths of multiple techniques. No single method universally dominates, emphasizing the need for method selection based on spectral characteristics and analytical objectives.
Data normalization
While scattering correction methods address wavelength-dependent intensity distortions, spectral normalization techniques standardize sample-specific scaling effects, enabling robust comparisons across heterogeneous datasets.
In fields ranging from agricultural monitoring to pharmaceutical quality control, spectral variability arises from diverse sources—including measurement conditions, instrument response, and sample heterogeneity. These variations manifest as intensity shifts, baseline deviations, and variance disparities, which can critically degrade the performance of machine learning models during training and inference. Effective normalization addresses three core challenges: (1) preserving relative spectral features while standardizing numerical scales, (2) accelerating optimization convergence, and (3) enhancing model generalizability by reducing systematic biases.
The following sections systematically detail nine essential normalization methods, categorized by their mathematical principles and interdependent applications.
Mean centering
Mean centering normalization (MCN)127 aligns spectral baselines by subtracting the arithmetic mean of the dataset from each data point. For a spectral vector , the MCN-transformed value is computed as:
| (Equation 78) |
where is the mean intensity across all wavelengths and is the spectral data length.
By removing global intensity offsets, MCN mitigates spectrometer drift and baseline shifts while retaining relative spectral features—a property critical for PCA, where centering ensures principal components reflect true variance rather than artificial offsets.128,129 This is exemplified in agricultural NIR spectroscopy, where MCN preprocessing elevated prediction accuracy beyond 99.7% by accentuating wavelength-specific crop quality signatures,130 and in pharmaceutical quality control, where it resolved overlapping peaks in complex formulations (e.g., sodium hyaluronate mixtures) with correlation coefficients exceeding 0.999.131
Orthogonal signal correction
Orthogonal signal correction (OSC)132,133,134,135,136 extends MCN’s variance-centric philosophy into multivariate space, addressing a critical limitation: while MCN removes global offsets, OSC systematically eliminates signal components orthogonal to target variables (e.g., analyte concentrations), leading to more robust modeling.
The Direct OSC (DOSC)137 algorithm accomplishes this through a three-step projection and decomposition process. First, the target response variable (e.g., concentration) is decomposed into its spectral-space projection and an orthogonal residual :
| (Equation 79) |
where represents the predictable variations in captured by , while contains noise unrelated to . Crucially, the same decomposition is applied to the spectral data itself, splitting it into a component aligned with and an orthogonal subspace :
| (Equation 80) |
Here, (satisfying ) isolates spectral variations uncorrelated with , and denotes the Moore-Penrose pseudoinverse, ensuring numerical stability in ill-conditioned cases.
To extract and remove structured noise, DOSC applies PCA/SVD to , deriving its dominant variations (). These are then mapped back to the original data space through a correction weight matrix , computed as:
| (Equation 81) |
where again ensures robust inversion. The final projection loadings () for noise subtraction are given by:
| (Equation 82) |
With these, the corrected spectral data () is obtained by removing the noise subspace:
| (Equation 83) |
New prediction samples () are transformed analogously, retaining consistency between training and future data. The correction first projects onto the orthogonal space to capture -irrelevant variations via:
| (Equation 84) |
Here, captures the interference components in aligned with the training set’s orthogonal subspace . Then removes them by:
| (Equation 85) |
where is the corrected spectral data ready for prediction.
The correction method demonstrates broad spectroscopic utility, transforming predictive accuracy across applications: By projecting data into orthogonal space () and removing interference (), it elevates rice bran oil analysis () through humidity elimination,138 enables single-component glucose prediction in bioreactors,139 and enhances soil monitoring (, RMSE↓3.203 g/kg).140,141 While outperforming univariate approaches, its computational overhead necessitates tradeoff evaluation in resource-constrained settings.
Min-max normalization
While methods like OSC selectively remove structured noise through orthogonal projections, min-max normalization (MMN) operates as a domain-agnostic scaling tool, universally bounding data ranges without distinguishing signal from interference. MMN is a linear transformation method that preserves relative feature distributions by mapping data to [0,1]127,142:
| (Equation 86) |
Its simplicity belies broad applicability: Raman-PLS models achieve 97% honey origin classification accuracy (vs. 85% unscaled),143 random forests identify microplastics at 99% precision,144 Terahertz-PLS models predicted caffeine/quinic acid/niacin mixtures with 0.0866 RMSE reduction,145 and biomedical models (e.g., heart disease detection146 and stress level analysis147) benefit from standardized intensities. However, MMN’s dependency on extremal values (, ) renders it susceptible to outlier distortion—a limitation absent in OSC’s covariance-aware decomposition.
Z score normalization
ZSN148 addresses the outlier sensitivity of MMN by standardizing data based on distributional statistics rather than extreme values. It transforms features to have a mean of 0 and variance of 1. For a dataset , ZSN generates the normalized vector :
| (Equation 87) |
where is an -dimensional unit vector, is the population standard deviation. ZSN projects data onto a hyperplane orthogonal to and scales it to a hypersphere of radius , enforcing:
| (Equation 88) |
This ensures mean-centered, unit-variance distributions while mitigating biases from unknown minima/maxima or singularities (e.g., spectroscopic scatter effects).
ZSN’s robustness is demonstrated across diverse domains. In spectroscopy, it enables PLS/RF-based tea identification149 and 2D stellar classification via attention mechanisms,150 while simultaneously eliminating scatter artifacts caused by particle heterogeneity in diffuse reflectance studies. In medical imaging, ZSN achieves superior cross-scanner harmonization of radiomic features (p < 0.001),151 and its noise suppression capability helps resolve multicollinearity in high-dimensional data.124 Additionally, the method’s mathematical simplicity and compatibility with localized adaptations (e.g., LSNV152) contribute to its analytical precision.
3-Sigma normalization
While ZSN standardizes data to a zero-mean, unit-variance distribution, 3-Sigma Normalization (3-SN) implements a more robust heuristic by truncating values outside and linearly transforming the inlier range to [0,1] via:
| (Equation 89) |
Here, and denote the dataset’s mean and standard deviation. The numerator centers each relative to the lower bound, while the denominator scales the result by the total inlier range ().
This dual mechanism—statistical truncation followed by linear scaling—proves invaluable in clinical spectroscopy, where it achieves 97.8% accuracy in breast tumor classification153 while mitigating scanner-to-scanner variability.127 By suppressing technical noise (e.g., instrumental artifacts) and preserving biological signals, 3-SN offers a principled balance between outlier resilience and data harmonization, complementary to ZSN’s Gaussian-centric approach.
normalization
While statistical scaling methods (e.g., Z Score, 3-Sigma) focus on distributional properties, vector space methods like Normalization operate on geometric constraints. By scaling components relative to the Manhattan norm (sum of absolute values), Normalization preserves sparsity patterns—a critical requirement in applications such as non-intrusive load monitoring (NILM). Its formulation is given by:
| (Equation 90) |
Here, is the normalized value of the -th feature, is the raw value, and (the Manhattan or Taxicab norm) computes the sum of absolute values across all features.
This approach is particularly effective in NILM, where appliance signatures often exhibit sparse energy consumption patterns (e.g., inactive phases with zero current). By preserving these zero-value components through scaling, the method achieves 96.42–97.65% accuracy on benchmark datasets (PLAID/WHITED).154
Vector normalization ( normalization)
Normalization techniques like and address scale invariance but prioritize different properties: While normalization (sum of absolute values) enforces sparsity by favoring feature selection, normalization (Euclidean norm) preserves directional relationships by projecting vectors onto a unit hypersphere—this makes it ideal for comparing angular similarity (e.g., cosine distance) rather than magnitude. Mathematically, normalization scales a vector by its Euclidean norm:
| (Equation 91) |
Its discriminative power is evident in diverse fields: Combined with probabilistic PCA (PPCA), it achieves 100% accuracy in brain MR classification (5×5-fold cross-validation) by enhancing directional contrast,155 while in agricultural spectroscopy, it enables non-destructive kiwifruit sugar prediction () through the VN-CARS-PLSR model, reducing spectral data volume by 88.6% without sacrificing accuracy (RMSEP = 0.498).156
Logarithmic transformation
While / normalization achieves scale invariance through linear operations, logarithmic transformation (LT) resolves nonlinear distortions by converting multiplicative relationships into additive space—particularly valuable for spectral data with exponential dynamics or heavy skewness. The transformation is defined as:
| (Equation 92) |
where denotes the logarithmic base (typically 10 or ) adjusts scale sensitivity (with base natural for theory-driven models and base 10 preferred for human-readable scales), and (>0) ensures stability at . By compressing dynamic ranges and suppressing extreme values, LT stabilizes variance and promotes normality, with demonstrated success in pulp quality analysis (RMSEP = 128, NIR absorbance)157 and soil organic carbon prediction ().158 Unlike normalization’s focus on directional consistency, LT prioritizes distributional symmetry, though its requirement of may necessitate alternatives (e.g., Box-Cox) for non-positive or power-law-optimized data.
1/n Power transformation
Compared to normalization (directional scaling) or log transforms (multiplicative symmetry), the 1/n power transformation (1/n-PT) provides parametric control over value compression, balancing flexibility against data constraints. The transformation is defined as:
| (Equation 93) |
where determines compression strength: larger (e.g., for square roots) yields milder adjustments. By nonlinearly suppressing large values, 1/n-PT mitigates outliers and normalizes right-skewed distributions, proving advantageous for regression (variance stabilization) and machine learning (feature distribution optimization). However, its limitations include a requirement for , necessitating data shifting (e.g., ) for negative values. Moreover, its compression efficacy depends on , necessitating careful hyperparameter tuning.
In spectroscopic analysis, 1/n-PT improves Cu(II)-DOM binding quantification. Coupled with biological treatment, it achieves 89% TOC removal, 55% humic substance enrichment, and enhanced complex stability, collectively mitigating environmental hazards.159
Data normalization: linear, geometric, and nonlinear approaches
Normalization techniques optimize data for downstream analysis by addressing scale, distribution, and noise, with nine prominent methods falling into three functional categories.
For linear adjustments, mean centering (MCN) removes baseline offsets and is fundamental to PCA, though it ignores scale standardization. Building on this, MMN constrains values to [0,1] for intuitive scaling but suffers under outliers—a weakness addressed by Z Score (ZSN), which enforces and through statistical scaling. To further robustify ZSN, 3-Sigma (3-SN) applies probabilistic Winsorization, automatically clipping extreme deviations beyond . OSC extends linear approaches by projection pursuit, stripping multivariate noise orthogonal to targets (e.g., correcting spectral interferents), though at higher computational cost.
When geometric properties dominate, normalization prioritizes sparsity via Manhattan distances (key for feature selection), while normalization preserves directional relationships through Euclidean norms, proving superior in tasks like spectral discrimination.
Nonlinear dynamics demand specialized transforms: Logarithmic (LT) compression handles exponential-scale data (e.g., pixel intensities or highly skewed features), whereas 1/n-PT stabilizes heteroscedastic variance in regression via parametric scaling—e.g., square roots () for Poisson-like distributions.
Method selection balances data traits and analytical goals: outlier resilience favors ZSN or 3-SN over MMN; high-dimensional noise necessitates OSC; nonlinear skewness calls for LT or 1/n-PT; and geometric needs dictate (sparsity) versus (directionality). Real-world pairings include MMN+LT for image preprocessing, +LT for NLP embeddings, and ZSN+3-SN for environmental sensor normalization, each combo addressing domain-specific challenges.
Filtering and smoothing
Building upon normalized spectral data (scaled and distribution-optimized via Z score, OSC, etc.), filtering and smoothing methods form the subsequent critical layer of preprocessing. These techniques target structured noise—high-frequency fluctuations, baseline drift, or stochastic artifacts—while preserving spectral fingerprints (e.g., peak positions, relative intensities, and band shapes) essential for downstream tasks like classification or regression. Here, we systematically present seven core methods, ordered by computational complexity and methodological interdependencies. Each approach negotiates the trade-off between noise suppression and feature fidelity, with selection guidelines provided for diverse spectroscopic modalities (e.g., Raman, NIR, and MS).
Gaussian filtering
Gaussian filtering (GF) applies weighted averaging via a Gaussian kernel to suppress high-frequency noise while preserving spectral signatures:
| (Equation 94) |
The kernel’s width () directly controls the trade-off between noise attenuation and feature distortion—smaller retains sharp peaks but may leave residual noise, whereas larger risks oversmoothing.
In practice, GF operates as a convolution-based low-pass filter, with its cutoff frequency naturally excluding high-frequency artifacts (e.g., detector noise). For LIBS data, proved critical for resolving emission lines (e.g., Li I 670.8 nm)—a value too small fails to attenuate noise, while too large merges adjacent peaks. When combined with stacked correlation feature selection, this approach enabled 100% accuracy in Dendrobium species classification—outperforming SVM/RF/KNN by 10–20%.160 Further, GF’s integration with baseline removal and normalization ensured robust mineral classification (e.g., lithium ores in complex matrices161), underscoring its versatility in spectral preprocessing.
Savitzky-Golay filtering
Savitzky-Golay filter (SGF) extends GF’s concept by fitting local degree-N polynomials within a sliding window of points. Its smoothed output at point is the central value of the fitted polynomial 162,163:
| (Equation 95) |
where the finite impulse response (FIR) coefficients derive from least-squares regression on the Vandermonde matrix of polynomial bases , computed as the 0th row of . The filter preserves spectral moments up to order , unlike GF’s fixed spectral attenuation. The trade-offs are controlled by window size () and polynomial degree (). Larger enhances noise suppression but blurs transient features (cf. GF’s parameter), while higher retains sharp peaks but overfits noise (GF lacks this adaptability).
Applications leverage SGF’s balance between noise removal and feature fidelity. It achieves 2.35× SNR enhancement in methane detection (LITES),164 improves soil phosphorus prediction ( with adaptive MRLFOSGF165) in Vis-NIR datasets,166 reduces CO spectral errors from 25.2% to 5.9% (Vis-NIR),167 and maintains 0.53 ppm accuracy in TDLAS gas monitoring,168 with its noise suppression and peak-preserving capabilities making it indispensable across spectroscopy and industrial sensing applications.
Wiener filtering
While SGF excels in local smoothing and spectral peak preservation, Wiener filtering (WF) provides a statistically optimal framework by minimizing the total mean-square error (MSE) across all frequencies:
| (Equation 96) |
where defines the estimated signal. For a noisy observation (with system distortion and noise ), WF’s closed-form solution adaptively suppresses degradation by incorporating both the system response and local SNR169:
| (Equation 97) |
where it explicitly leverages signal and noise power spectra (). Assuming and are uncorrelated (), WF outperforms heuristic methods like SGF in noise-modeled scenarios.
In precision spectroscopy, WF reduced CH4 detection limits from 150 ppm to 100 ppm (vs. SGF’s 120 ppm),170 while in hyperspectral imaging, it achieved comparable SNR (17 dB) with superior robustness to non-uniform noise171; For miniature spectrometers, where SGF suffers abrupt noise transitions due to fixed-window smoothing, WF improved resolution by 40% through SNR-adaptive suppression.172 Unlike SGF’s static polynomial fitting, WF dynamically adjusts to local SNR conditions—proving indispensable for applications like motion artifact removal in NIR spectroscopy.
Kalman filtering
While WF provides an optimal frequency-domain solution for stationary processes, Kalman filtering (KF)173 extends this framework to non-stationary signals by recursively updating state estimates in real-time.
The KF algorithm operates via two core stages: the time update (prediction) and measurement update (correction). Time update propagates the prior state estimate and error covariance forward equations174,175:
| (Equation 98) |
Here, and represent the predicted state and error covariance, respectively, with as the state transition matrix and as process noise covariance.
Measurement update integrates new observations to refine predictions:
| (Equation 99) |
Here, maps state to measurement space, is measurement noise (covariance ), and Kalman gain optimally balances prediction and observation. The covariance matrices and quantify estimation errors (, ):
| (Equation 100) |
KF recursively minimizes the posterior error covariance in the minimum mean square error (MMSE) sense, adapting dynamically to new data. KF’s versatility is demonstrated across spectroscopic techniques, where it enhances precision by fusing model predictions with noisy sensor data.
The recursive nature of KF allows it to adapt to dynamic systems, making it particularly effective in spectroscopic applications where real-time noise suppression and drift correction are critical. This is demonstrated in the following cases. In NO2 cavity ring-down spectroscopy, it improved detection limits 9.12-fold by suppressing transient noise.176 For glucose Raman spectroscopy, it reduced PLS model RMSE from 0.38 to 0.17 g/L. In LIBS steel analysis,177 it corrected instrumental drift, lowering Mn prediction RSD from 35% to 11%.178 Remarkably, for QCL CO2 isotopologues, it achieved δ13C (−9.19 ± 0.29‰) and δ18O (−1.46 ± 0.34‰) measurements with 7-fold better precision in just 1 s, matching 95-s averaging.179
Neural network adaptive filter
While KF provides optimal state estimation under linear Gaussian assumptions, Neural Network Adaptive Filtering (NNAF)180,181 extends this framework by learning nonlinear dynamics and noise characteristics directly from data, enabling robust performance in complex environments. The NNAF method combines the stability of least mean squares (LMS) filtering with the adaptability of neural networks through backpropagation optimization, where key parameters are carefully designed to balance convergence speed and steady-state error.
The core algorithm operates through three fundamental components:
First (Error Calculation), the instantaneous prediction error quantifies the discrepancy between the desired signal and filter output182:
| (Equation 101) |
where represents the desired signal (reference measurement), denotes the input vector (current signal observations), and is the adaptive weight vector containing the filter coefficients.
Next (Weight Update Mechanism), the filter coefficients are adjusted using a modified LMS rule:
| (Equation 102) |
The adaptive learning factor dynamically controls the update magnitude, preventing instability while maintaining fast convergence.
Finally (Adaptive Step Size Control), the critical adaptation component is governed by:
| (Equation 103) |
To ensure the convergence, must satisfy , where is the maximum eigenvalue and is positive. To speed up convergence, the core adaptation logic combines a sigmoid-shaped controller (instantaneous error function) and an error-driven exponential term (error factor):
| (Equation 104) |
The term regulates adaptation sensitivity, where parameter controls steepness. The term introduces nonlinear error-driven adjustments, with exponent governing growth intensity.
The NNAF demonstrates superior denoising capabilities in practical applications, including spectroscopic analysis: achieved an SNR improvement of 3–4 dB in predicting copper and cobalt impurity concentrations in UV-Vis spectra during wet zinc extraction processes; and prediction accuracy: delivered a goodness-of-fit (R2) exceeding 0.99182, confirming highly reliable modeling performance.
Wavelet shrinkage denoising method
While NNAF relies on learned nonlinear mappings for adaptive denoising, wavelet shrinkage denoising (WSD) employs transform-domain processing, offering computationally efficient and theoretically optimal noise removal through thresholding of sparse wavelet representations. WSD is particularly effective for piecewise-smooth signals with transient features, where noise and signal components can be separated via rigorously derived thresholds. A standard approach is the universal threshold from VisuShrink,183,184 defined as:
| (Equation 105) |
where estimates the noise standard deviation at decomposition level , and is the signal length. For improved adaptability, level-dependent thresholds may be used185:
| (Equation 106) |
where represents the number of coefficients at level . More sophisticated methods apply locally adaptive thresholds based on coefficient statistics184:
| (Equation 107) |
where, and denote the mean and standard deviation of wavelet coefficients at level :
| (Equation 108) |
Thresholds are applied via hard thresholding:
| (Equation 109) |
or soft thresholding, which ensures smoother transitions:
| (Equation 110) |
where preserves the coefficient’s sign.
The denoising process involves: Selecting a wavelet basis (e.g., Daubechies and Symlet), Decomposing the signal to an optimal level , Estimating (e.g., via MAD), thresholding detail coefficients, and reconstructing via inverse discrete wavelet transform (IDWT).
This approach demonstrates remarkable performance: In THz-TDS measurements of GFRPs, it reduced thickness errors from 16.4% to 3.7%.186 For ESR spectroscopy, it achieved 32 dB SNR improvement versus conventional methods (10 dB).184
The method’s strength lies in its statistically informed thresholding, adaptively distinguishing signal from noise—crucial for precision-demanding applications like materials science and spectral analysis.
Hilbert vibration decomposition (HVD)
While WSD employs fixed wavelet bases for noise suppression, Hilbert vibration decomposition (HVD) adaptively extracts signal components through envelope demodulation—leveraging the analytic signal derived from the Hilbert transform of the original signal 187,188,189:
| (Equation 111) |
where denotes the Cauchy principal value. The analytic signal is constructed as:
| (Equation 112) |
with instantaneous frequency , where and represent the instantaneous amplitude and phase respectively. The decomposition algorithm proceeds as follows:
First, extract the dominant component through demodulation with reference frequency :
| (Equation 113) |
Next, reconstruct the vibrational mode:
| (Equation 114) |
Finally, obtain residual signal:
| (Equation 115) |
The process repeats on until standard deviation of successive frequency estimates falls below .190
This method has demonstrated remarkable performance in applications such as structural vibration analysis and spectroscopic signal processing, where it can enhance signal-to-noise ratios from 3.53 to 130.64189 and improve classification accuracy from 40% to 90.25%.191 While particularly effective for oscillatory AM/FM signals, complementary approaches may be required for non-harmonic components.
Spectroscopic filtering: toward a domain-adaptive framework
Filtering and smoothing methods offer distinct advantages for spectral data preprocessing: GF employs weighted averaging to suppress noise, excelling at high-frequency noise reduction though requiring careful parameterization to avoid over-smoothing; KF, through state-space modeling, provides optimal recursive estimation for dynamic systems with time-varying signals, while WF operates in the frequency domain to minimize mean-square error by leveraging signal-to-noise statistics in stationary processes. For complex datasets, the neural network adaptive filter (NNAF) combines LMS stability with neural network flexibility for robust denoising, whereas Savitzky-Golay filtering preserves peak integrity via local polynomial regression to balance noise removal and feature retention. Meanwhile, WSD exploits signal sparsity in multiscale domains through adaptive thresholding, and HVD specializes in nonstationary multicomponent signal analysis via instantaneous frequency extraction. Ultimately, the choice among these techniques hinges on noise characteristics, signal dynamics, and the critical trade-off between smoothing intensity and feature fidelity.
Spectral derivatives
While smoothing suppresses noise in spectral data, it often obscures subtle features. Spectral derivative preprocessing22 addresses this by computing derivatives to enhance slope variations while suppressing baseline drift. Unlike smoothing, derivatives sharpen inflection points, improving spectral resolution—critical for hyperspectral imaging and vibrational spectroscopy. This approach balances noise resilience with feature discrimination. Derivative techniques amplify weak spectral features while reducing noise and scattering artifacts. Their effectiveness depends on the differentiation method, which governs the resolution-noise tradeoff. We systematically evaluate four key derivative methods by their mathematical sophistication and practical utility.
Finite difference method
As the most fundamental numerical differentiation approach, the finite difference method (FDM) computes derivatives by approximating discrete differences between adjacent spectral points. Assuming the spectral signal is sampled at discrete points , with a constant sampling interval , FDM can be mathematically expressed as follows22:
| (Equation 116) |
FDM approximates the first derivative by calculating the difference between consecutive spectral data points and , normalized by the sampling interval . Similarly, the second derivative is computed using a centered difference formula involving , , and . While computationally efficient, FDM suffers from inherent noise sensitivity, where minor fluctuations lead to amplified derivative artifacts. Truncation errors further limit accuracy, scaling linearly () and quadratically () for the first and second derivatives, respectively. This makes FDM primarily useful for clean, high-resolution spectra where computational simplicity is prioritized over robustness.
Savitzky-Golay derivatives
An extension of FDM, the Savitzky-Golay derivatives (SGD) addresses noise amplification by incorporating polynomial smoothing prior to differentiation.192,193 The SGD method operates by selecting a window of (where typically ranges up to 25) consecutive points within a spectral interval of points, with the central point designated as . These points are initially fitted to a -th order polynomial, , expressed as follows:
| (Equation 117) |
Then, the coefficients corresponding to its various derivatives are derived through polynomial fitting. Subsequently, a least squares fitting procedure is applied to the spectrum within the window, with the fitting error denoted as :
| (Equation 118) |
Minimization of this error is achieved by solving the partial derivative condition . Curve smoothing in SGD is centered at , where a set of coefficients is determined to compute the smoothed value of the -th point. Alternatively, distinct sets of coefficients can be utilized to calculate the smoothed values of the -th derivative, as represented by , where :
| (Equation 119) |
| (Equation 120) |
| (Equation 121) |
| (Equation 122) |
where, and correspond to the coefficients for the zero and second derivatives of SGD, respectively, obtained through fitting with second- or third-order polynomials. Similarly, denotes the coefficient for the first derivative of SGD, derived from a second-order polynomial fit.194
The Savitzky-Golay (SG) filter shares a convolution-based approach with mean filtering but includes polynomial fitting for simultaneous smoothing and derivative computation. Its ability to preserve spectral features makes it particularly effective in classification tasks; for instance, SG first derivatives paired with SVM classifiers achieved 100% accuracy in NIR-based wood species identification.195 In baseline correction of Rhodiola FT-NIR spectra, the Norris-Williams (NW) derivatives showed slightly higher values (98.5 vs. 96.5 for first derivatives; 97.4 vs. 93.1 for second derivatives), but SG derivatives offer greater flexibility in optimizing window size and polynomial order.196 SG second derivatives excel in crystallization monitoring, balancing noise suppression, feature enhancement, and computational efficiency for real-time analysis.197 The method’s strength lies in its dual smoothing-differentiation capability, which retains spectral features better than simple filters. However, performance hinges on careful parameter selection (polynomial order, window size), requiring dataset-specific tuning.
Norris-Williams derivatives
Like SGD, the Norris-Williams derivatives (NWD)198,199 integrates smoothing with derivative computation but replaces polynomial regression with a moving-average filter. Assuming spectral signals are sampled at discrete points with a constant interval , the method constructs its mathematical model as follows:
| (Equation 123) |
In this method, spectral smoothing is first achieved by averaging points within a window centered on measurement point , where defines the window size. The smoothed signal then serves as the basis for derivative calculations. The first derivative is computed as the difference between consecutive smoothed values (spaced by ), while the second derivative incorporates a second-order difference to capture curvature.
The NWD method effectively combines smoothing with gap-size adjustment to reduce noise and improve the signal-to-noise ratio (SNR), outperforming traditional finite difference methods (FDMs). It excels in detecting segregation in particulate systems, particularly for trace components (e.g., enzyme placebo granules in complex mixtures). Unlike standard first/second derivatives, SNV, or SG smoothing, NWD minimizes noise interference while preserving spectral details, making it particularly effective for low-concentration detection.
In NIR spectral analysis, NWD achieved a high segregation index (0.71) with minimal error (low MAE and MAPE), proving reliable for industrial quality control.199 However, NWD’s performance depends on optimizing the smoothing window () and gap size () for each application. Its simplicity enables fast computation—beneficial for real-time industrial processing.
Compared to SG’s polynomial flexibility, NWD’s uniform smoothing may occasionally obscure fine spectral features. Nonetheless, it remains superior in applications prioritizing noise suppression and rapid analysis, as demonstrated in enzyme granule segregation studies.
Grunwald-Letnikov fractional order derivation
Building on NWD’s robustness of in industrial applications, the Grunwald-Letnikov fractional-order derivative (GL-FOD) enhances spectral processing flexibility through fractional-order differentiation. By tuning the derivative order , GL-FOD optimizes feature resolution for complex spectral datasets—capturing nonlinear patterns inaccessible to integer-order derivatives. For a continuous function , GL-FOD is rigorously defined as200,201,202:
| (Equation 124) |
where is the step size, and are the lower and upper limits of differentiation, refers to the integer part of , and denotes the gamma function for interpolation, given by:
| (Equation 125) |
For discrete spectral signals , the discrete implementation uses an fractional differential matrix :
| (Equation 126) |
where the series truncation at (due to for ) balances accuracy and computational efficiency.
GL-FOD has demonstrated remarkable efficacy in extracting nonlinear features from hyperspectral data, significantly enhancing analytical accuracy across diverse applications. In agriculture, GL-FOD coupled with PLS regression achieved an of 0.84 for cadmium detection in rice leaves,201 while a 0.6-order derivative combined with random forest regression improved soil organic matter prediction ().62 For winter wheat monitoring, a 0.3-order derivative with CARS-ETsR modeling attained a validation of 0.8667 in chlorophyll density estimation, outperforming integer-order derivatives by 17% in Na+ correlation enhancement.203 Notably, in reservoir characterization, GL-FOD’s recursive formulation with Hausdorff accumulation kernels demonstrated superior volatility modeling and long-term trend capture (higher , lower RMSE versus conventional models).204
Spectral derivatives: balancing resolution and noise
The four derivative approaches—FDM, NWD, SG, and GL-FOD—offer distinct advantages and trade-offs in spectral analysis. FDM provides straightforward implementation but suffers from significant noise sensitivity, limiting its utility in low-SNR environments. NWD (Norris-Williams derivatives) excels in noise suppression and real-time processing, making it particularly suitable for industrial quality control where baseline stability is critical. SG (Savitzky-Golay) achieves a balance between smoothing and feature preservation through polynomial fitting, though its fixed-window design restricts adaptability to dynamic spectral features. In contrast, GL-FOD leverages fractional calculus to enable fine-tunable differentiation, delivering superior multi-scale resolution for complex datasets such as nonlinear mineral-organic mixtures. While NWD and SG are efficient for routine applications, GL-FOD represents a more advanced solution when precision and flexibility are paramount. Ultimately, method selection depends on the interplay of noise tolerance, computational efficiency, and spectral detail requirements—with GL-FOD emerging as the preferred choice for applications demanding high feature discrimination.
Data information mining
Conventional spectral preprocessing relies on removal-based techniques (e.g., noise filtering, baseline correction), often overlooking intrinsic data relationships. In contrast, correlation-driven approaches (e.g., 2D/3D correlation) exploit interdependencies among spectral points to extract latent information.
The three-dimensional correlation method (3dCM)205 enhances spectral analysis through sequential Hilbert transforms (HT) and tensor product. Given a spectral dataset with sample types, each type contains spectra (total samples ) and frequency points (matrix ). The process involves three key steps:
Step 1: Orthogonal Signal Generation via HT. For each sample type , apply HT to spectral matrix to obtain a 90° phase-shifted matrix :
| (Equation 127) |
Step 2: 2D Correlation Matrix Construction. The tensor product of and generates a 2D correlation matrix (2dCM206):
| (Equation 128) |
After vectorization to (), a second HT is applied to produce :
| (Equation 129) |
Step 3: 3D Tensor Expansion. The final 3D correlation matrix is computed by:
| (Equation 130) |
Vectorization of yields ()with cubically expanded resolution () and augmented sample size :
| (Equation 131) |
In practical applications like Chinese handmade paper authentication, this preprocessing boosted machine learning accuracy from <10% to >99% across PCA-LR, PLS-LR, KNN, RF, and CNN models by effectively increasing both spectral resolution and training data volume.142
Discussion and conclusion
The evolution of spectral preprocessing has progressed from basic noise removal to sophisticated information enhancement strategies, with modern techniques now targeting specific analytical challenges through optimized approaches. Cosmic ray removal exemplifies the precision-speed trade-off, spanning rapid moving average filtering to advanced WSD and kernel PCA residual diagnosis (KPCARD), while baseline correction has advanced from polynomial fitting to adaptive asymmetric least squares (AsLS) and interpretation-guided DL models. Scattering correction techniques like MSC and its extended variant (EMSC)—especially effective for biological samples via combined PCA denoising—along with SNV normalization demonstrate method specialization for distinct interference types. Normalization methods such as Z score standardization and L2 normalization provide fundamental variance stabilization, whereas derivative analysis now benefits from fractional-order differentiation (GL-FOD)’s superior multicomponent resolution compared to traditional finite differences.
Conventional spectral preprocessing relies on subtraction-based techniques (e.g., noise reduction and baseline correction), which often compromise underlying data correlations. In contrast, emerging correlation-based methods—particularly three-dimensional correlation analysis (3dCM)—explicitly preserve spectral interdependencies while providing deeper insights into spectral relationships. By leveraging Hilbert transforms and tensor operations, this approach expands spectral resolution from [] to [] dimensions, significantly enhancing feature discrimination, as evidenced by benchmark classification accuracy improvements from <10% to >99% in material authentication studies.
Challenges remain, however: manual parameter tuning impedes high-throughput implementation, computational demands vary widely (from efficient MAF to intensive WSD/KPCARD), and the opacity of DL models hinders interpretability. Addressing these limitations is critical for broader adoption of advanced spectral analysis.
Future progress in spectroscopic data preprocessing will depend on three critical advancements.
-
(1)
Context-aware algorithms capable of dynamic adaptation to varying data conditions;
-
(2)
Hybrid modeling frameworks that integrate physics-based constraints (e.g., correlation-preserving methods such as 3dCM) to enhance the robustness of data-driven approaches;
-
(3)
Reinforcement learning-driven automation to optimize the trade-off between computational efficiency and information fidelity.
These innovations—illustrated, for example, by 3dCM’s Hilbert-transform-enhanced tensor operations (improving classification accuracy from <10% to >99% in benchmark cases)—signal a paradigm shift from mere distortion correction to structured feature enhancement. By unifying computational efficiency, physical fidelity (via spectral relationship encoding), and autonomous optimization, next-generation preprocessing methods will transform spectroscopic analysis into a powerful, information-enriching platform for both scientific and industrial applications.
Limitations of the study
This review primarily focuses on high-impact publications to ensure methodological rigor, though we acknowledge that certain niche or industry-specific approaches may receive less coverage, and some methods may be inherently tailored to specific spectral techniques (e.g., Raman vs. NIR) or application domains (e.g., biomedical vs. agricultural). Given the rapid advancements in AI-driven spectral analysis, some cutting-edge developments might not yet be fully reflected in the existing literature. Direct comparisons between preprocessing techniques (e.g., baseline correction vs. smoothing filters) are inherently challenging due to methodological incompatibilities, where each approach is typically optimized and validated against distinct metrics or task-specific objectives. Lab-scale results may require further validation under real-world conditions. While our systematic approach aims to mitigate bias, the broader literature landscape exhibits a natural inclination toward widely cited methodologies, whereas hardware implementation challenges (e.g., computational efficiency and cost) merit deeper exploration in future work. Additionally, the scarcity of embedded-system validations highlights opportunities for extending this research direction in subsequent studies.
Acknowledgments
This research received no specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Author contributions
C.S.Y.: Conceptualization, methodology, investigation, formal analysis, writing – original draft, writing – review and editing.
Declaration of interests
There are no competing interests.
References
- 1.Zhang W., Kasun L.C., Wang Q.J., Zheng Y., Lin Z. A Review of Machine Learning for Near-Infrared Spectroscopy. SENSORS. 2022;22:9764. doi: 10.3390/s22249764. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hao Z., Liu K., Lian Q., Song W., Hou Z., Zhang R., Wang Q., Sun C., Li X., Wang Z. Machine learning in laser-induced breakdown spectroscopy: A review. Front. Phys. 2024;19 doi: 10.1007/s11467-024-1427-2. [DOI] [Google Scholar]
- 3.Luo R., Popp J., Bocklitz T. Deep Learning for Raman Spectroscopy: A Review. Analytica. 2022;3:287–301. doi: 10.3390/analytica3030020. [DOI] [Google Scholar]
- 4.Zhang D., Zhang H., Zhao Y., Chen Y., Ke C., Xu T., He Y. A brief review of new data analysis methods of laser-induced breakdown spectroscopy: machine learning. Appl. Spectrosc. Rev. 2022;57:89–111. doi: 10.1080/05704928.2020.1843175. [DOI] [Google Scholar]
- 5.van de Sande D.M.J., Merkofer J.P., Amirrajab S., Veta M., van Sloun R.J.G., Versluis M.J., Jansen J.F.A., van den Brink J.S., Breeuwer M. A review of machine learning applications for the proton MR spectroscopy workflow. Magn. Reson. Med. 2023;90:1253–1270. doi: 10.1002/mrm.29793. [DOI] [PubMed] [Google Scholar]
- 6.Carter J.B., Huffaker R., Singh A., Bean E. HUM: A review of hydrochemical analysis using ultraviolet-visible absorption spectroscopy and machine learning. Sci. Total Environ. 2023;901 doi: 10.1016/j.scitotenv.2023.165826. [DOI] [PubMed] [Google Scholar]
- 7.Valente P.A., Mota S.I., Teixeira A., Ferreiro E., Sarmento H., Cipriano I., Campos J.R., Rama L., Oliveira P.J. Fourier Transform Infrared (FTIR) Spectroscopy as a Tool to Characterize Exercise and Physical Activity: A Systematic Review. Sports Med. 2025;55:459–472. doi: 10.1007/s40279-024-02139-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Srivastava S., Wang W., Zhou W., Jin M., Vikesland P.J. Machine Learning-Assisted Surface-Enhanced Raman Spectroscopy Detection for Environmental Applications: A Review. Environ. Sci. Technol. 2024;58:20830–20848. doi: 10.1021/acs.est.4c06737. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Saeidfirozeh H., Kubelik P., Laitl V., Krivkova A., Vrabel J., Rammelkamp K., Schroder S., Gornushkin I.B., Kepes E., Zabka J., et al. Laser-induced breakdown spectroscopy in space applications: Review and prospects. TRAC-TRENDS Anal. Chem. 2024;181 doi: 10.1016/j.trac.2024.117991. [DOI] [Google Scholar]
- 10.Khan M.A., Asadi H., Zhang L., Qazani M.R.C., Oladazimi S., Loo C.K., Lim C.P., Nahavandi S. Application of artificial intelligence in cognitive load analysis using functional near-infrared spectroscopy: A systematic review. Expert Syst. Appl. 2024;249 doi: 10.1016/j.eswa.2024.123717. [DOI] [Google Scholar]
- 11.Li J., Lv H., Liu Y., Huang J., Wang Y., Lin W. Application of Machine Learning to Background Rejection in Very-high-energy Gamma-Ray Observation. Astrophys. J. Suppl. Ser. 2025;276:24. doi: 10.3847/1538-4365/ad9581. [DOI] [Google Scholar]
- 12.Li H., Wei C., Zhang C., Tao R. Retrieval of Multiple Flame Parameters Based on Physics-Based Neural Network and Emission Spectrum Measurement: Model Development and Experimental Validation. Acta Opt. Sin. 2025;45 doi: 10.3788/AOS241456. [DOI] [Google Scholar]
- 13.Balasubramanian S.V., O’Shea R.E., Saranathan A.M., Begeman C.C., Gurlin D., Binding C., Giardino C., Tomlinson M.C., Alikas K., Kangro K., et al. Mixture density networks for re-constructing historical ocean-color products over inland and coastal waters: demonstration and validation. Front. Remote Sens. 2025;6 doi: 10.3389/frsen.2025.1488565. [DOI] [Google Scholar]
- 14.Barnett T.P., Preisendorfer R. Origins and Levels of Monthly and Seasonal Forecast Skill for United-States Surface Air Temperatures Determined by Canonical Correlation-Analysis. Mon. Weather Rev. 1987;115:1825–1850. doi: 10.1175/1520-0493(1987)115<1825:OALOMA>2.0.CO;2. [DOI] [Google Scholar]
- 15.Wold S., Sjöström M., Eriksson L. PLS-regression:: a basic tool of chemometrics. Chemom. Intell. Lab. Syst. 2001;58:109–130. doi: 10.1016/S0169-7439(01)00155-1. [DOI] [Google Scholar]
- 16.Cortes C., Vapnik V. Support-Vector Networks. Mach. Learn. 1995;20:273–297. doi: 10.1007/BF00994018. [DOI] [Google Scholar]
- 17.Fix E., Hodges J.L., join' Discriminatory Analysis - Nonparametric Discrimination - Consistency Properties. Int. Stat. Rev. 1989;57:238–247. doi: 10.2307/1403797. [DOI] [Google Scholar]
- 18.Shin H.-C., Roth H.R., Gao M., Lu L., Xu Z., Nogues I., Yao J., Mollura D., Summers R.M. Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning. IEEE Trans. Med. Imag. 2016;35:1285–1298. doi: 10.1109/TMI.2016.2528162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Mishra P., Biancolillo A., Roger J.M., Marini F., Rutledge D.N. New data preprocessing trends based on ensemble of multiple preprocessing techniques. TRAC-TRENDS Anal. Chem. 2020;132 doi: 10.1016/j.trac.2020.116045. [DOI] [Google Scholar]
- 20.Renner G., Nellessen A., Schwiers A., Wenzel M., Schmidt T.C., Schram J. Data preprocessing & evaluation used in the microplastics identification process: A critical review & practical guide. TRAC-TRENDS Anal. Chem. 2019;111:229–238. doi: 10.1016/j.trac.2018.12.004. [DOI] [Google Scholar]
- 21.Lee L.C., Liong C.-Y., Jemain A.A. A contemporary review on Data Preprocessing (DP) practice strategy in ATR-FTIR spectrum. Chemometr. Intell. Lab. Syst. 2017;163:64–75. doi: 10.1016/j.chemolab.2017.02.008. [DOI] [Google Scholar]
- 22.Rinnan Å., Berg F.V.D., Engelsen S.B. Review of the most common pre-processing techniques for near-infrared spectra. TrAC Trends Anal. Chem. 2009;28:1201–1222. doi: 10.1016/j.trac.2009.07.007. [DOI] [Google Scholar]
- 23.Vincenzo Palleschi V.P. 1st ed. Wiley; 2022. Chemometrics and Numerical Methods in LIBS. [Google Scholar]
- 24.Brunnbauer L., Gajarska Z., Lohninger H., Limbeck A. A critical review of recent trends in sample classification using Laser-Induced Breakdown Spectroscopy (LIBS) TrAC Trends Anal. Chem. 2023;159 doi: 10.1016/j.trac.2022.116859. [DOI] [Google Scholar]
- 25.Duponchel L., Fabre C., Bousquet B., Motto-Ros V. Statistical comparison of predictive models for quantitative analysis and classification in the framework of LIBS spectroscopy: A tutorial. Spectrochim. Acta Part B At. Spectrosc. 2023;208 doi: 10.1016/j.sab.2023.106776. [DOI] [Google Scholar]
- 26.Werner de Vargas V., Schneider Aranda J.A., Dos Santos Costa R., Jorge Luis Victória Barbosa. Victória Barbosa J.L. Imbalanced data preprocessing techniques for machine learning: a systematic mapping study. Knowl. Inf. Syst. 2023;65:31–57. doi: 10.1007/s10115-022-01772-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Cohen-Tannoudji C., Diu B., Laloe F. Wiley-VCH; 1992. Quantum Mechanics. [Google Scholar]
- 28.DEMTRÖDER W. Second Enlarged Edition. Springer-Verlag; 1998. Laser Spectroscopy: Basic Concepts and Instrumentation. [Google Scholar]
- 29.Yan C.S., Chen Y.W., Yang H.M., Ahokas E. Optical spectrum analyzers and typical applications in astronomy and remote sensing. Rev. Sci. Instrum. 2023;94 doi: 10.1063/5.0138963. [DOI] [PubMed] [Google Scholar]
- 30.Whitaker D.A., Hayes K. A simple algorithm for despiking Raman spectra. Chemometr. Intell. Lab. Syst. 2018;179:82–84. doi: 10.1016/j.chemolab.2018.06.009. [DOI] [Google Scholar]
- 31.Phillips G.R., Harris J.M. Polynomial Filters for Data Sets with Outlying or Missing Observations - Application to Charge-Coupled-Device-Detected Raman-Spectra Contaminated by Cosmic-Rays. Anal. Chem. 1990;62:2351–2357. doi: 10.1021/ac00220a017. [DOI] [Google Scholar]
- 32.Mozharov S., Nordon A., Littlejohn D., Marquardt B. Automated Cosmic Spike Filter Optimized for Process Raman Spectroscopy. Appl. Spectrosc. 2012;66:1326–1333. doi: 10.1366/12-06660. [DOI] [PubMed] [Google Scholar]
- 33.Schulze H.G., Turner R.F.B. A Fast, Automated, Polynomial-Based Cosmic Ray Spike-Removal Method for the High-Throughput Processing of Raman Spectra. Appl. Spectrosc. 2013;67:457–462. doi: 10.1366/12-06839. [DOI] [PubMed] [Google Scholar]
- 34.Cappel U.B., Bell I.M., Pickard L.K. Removing Cosmic Ray Features from Raman Map Data by a Refined Nearest Neighbor Comparison Method as a Precursor for Chemometric Analysis. Appl. Spectrosc. 2010;64:195–200. doi: 10.1366/000370210790619528. [DOI] [PubMed] [Google Scholar]
- 35.Barton S.J., Hennelly B.M. An Algorithm for the Removal of Cosmic Ray Artifacts in Spectral Data Sets. Appl. Spectrosc. 2019;73:893–901. doi: 10.1177/0003702819839098. [DOI] [PubMed] [Google Scholar]
- 36.Ehrentreich F., Sümmchen L. Spike removal and denoising of Raman spectra by wavelet transform methods. Anal. Chem. 2001;73:4364–4373. doi: 10.1021/ac0013756. [DOI] [PubMed] [Google Scholar]
- 37.Tian Y., Burch K.S. Automatic Spike-Removal Algorithm for Raman Spectra. Appl. Spectrosc. 2016;70:1861–1871. doi: 10.1177/0003702816671065. [DOI] [PubMed] [Google Scholar]
- 38.Maury A., Revilla R.I. Autocorrelation Analysis Combined with a Wavelet Transform Method to Detect and Remove Cosmic Rays in a Single Raman Spectrum. Appl. Spectrosc. 2015;69:984–992. doi: 10.1366/14-07834. [DOI] [PubMed] [Google Scholar]
- 39.Li B., Calvet A., Casamayou-Boucau Y., Ryder A.G. Kernel principal component analysis residual diagnosis (KPCARD): An automated method for cosmic ray artifact removal in Raman spectra. Anal. Chim. Acta. 2016;913:111–120. doi: 10.1016/j.aca.2016.01.042. [DOI] [PubMed] [Google Scholar]
- 40.Schölkopf B., Smola A., Müller K.-R. Nonlinear Component Analysis as a Kernel Eigenvalue Problem. Neural Comput. 1998;10:1299–1319. doi: 10.1162/089976698300017467. [DOI] [Google Scholar]
- 41.Liu Z., Zheng R., Tian Y., Wang B., Guo J., Lu Y. A new approach for baseline correction in laser induced breakdown spectroscopy. J. Anal. At. Spectrom. 2022;37:1134–1140. doi: 10.1039/D1JA00464F. [DOI] [Google Scholar]
- 42.Gan F., Ruan G., Mo J. Baseline correction by improved iterative polynomial fitting with automatic threshold. Chemometr. Intell. Lab. Syst. 2006;82:59–65. doi: 10.1016/j.chemolab.2005.08.009. [DOI] [Google Scholar]
- 43.Zhao J., Lui H., McLean D.I., Zeng H. Automated autofluorescence background subtraction algorithm for biomedical Raman spectroscopy. Appl. Spectrosc. 2007;61:1225–1232. doi: 10.1366/000370207782597003. [DOI] [PubMed] [Google Scholar]
- 44.Hu H., Bai J., Xia G., Zhang W., Ma Y. Improved Baseline Correction Method Based on Polynomial Fitting for Raman Spectroscopy. Photonic Sens. 2018;8:332–340. doi: 10.1007/s13320-018-0512-y. [DOI] [Google Scholar]
- 45.Tian C., Li J., Weng G., Zhu J., Zhao J. Improved Num-Local Piecewise Polynomial Fitting Algorithm for Accurate Correction of Raman Spectroscopy Baselines. Spectrosc. Spectr. Anal. 2024;44:1073–1080. doi: 10.3964/j.issn.1000-0593(2024)04-1073-08. [DOI] [Google Scholar]
- 46.Ma J., Zuo M., Wu B., Wang W. An Automated Baseline Correction Method for Raman Spectra Based on Piecewise Polynomial Fitting With Adaptive Window. J. Raman Spectrosc. 2025;56:337–344. doi: 10.1002/jrs.6772. [DOI] [Google Scholar]
- 47.Mohd Rosdi N.A.N.B., Abdul Halim N.I.H., Sashidharan J.A., Abd Hamid N., Abdul Halim A., Sino H., Lee L.C. Land-use classification of Malaysian soils by ultra-high performance liquid chromatography (UHPLC)-based untargeted data combined with chemometrics for forensic provenance. Microchem. J. 2024;199 doi: 10.1016/j.microc.2024.110030. [DOI] [Google Scholar]
- 48.Wang X., Fan X.g., Xu Y.j., Wang X., He H., Wang X.f. A baseline correction algorithm for Raman spectroscopy by adaptive knots B-spline. Meas. Sci. Technol. 2015;26 doi: 10.1088/0957-0233/26/11/115503. [DOI] [Google Scholar]
- 49.Wang X., Kang Z.-M., Liu L., Fan X.-G. Baseline correction algorithm for Raman spectra based on median filtering and un-uniform B-spline. ACTA Phys. Sin. 2020;69 doi: 10.7498/aps.69.20200552. [DOI] [Google Scholar]
- 50.Eilers P.H.C., Marx B.D. Flexible smoothing with B-splines and penalties. Stat. Sci. 1996;11:89–121. doi: 10.1214/ss/1038425655. [DOI] [Google Scholar]
- 51.Song Y., Peng W., Li Z., Yu B., Zhou S., Li J. EC-QCL sensing system-incorporated adaptive baseline correction algorithm for simultaneous detection of multiple gas components. Microchem. J. 2024;201 doi: 10.1016/j.microc.2024.110605. [DOI] [Google Scholar]
- 52.Liu X., Zhang Z., Liang Y., Sousa P.F.M., Yun Y., Yu L. Baseline correction of high resolution spectral profile data based on exponential smoothing. Chemometr. Intell. Lab. Syst. 2014;139:97–108. doi: 10.1016/j.chemolab.2014.09.018. [DOI] [Google Scholar]
- 53.Chen H., Xu W., Broderick N.G.R. An Adaptive and Fully Automated Baseline Correction Method for Raman Spectroscopy Based on Morphological Operations and Mollification. Appl. Spectrosc. 2019;73:284–293. doi: 10.1177/0003702818811688. [DOI] [PubMed] [Google Scholar]
- 54.Perez-Pueyo R., Soneira M.J., Ruiz-Moreno S. Morphology-Based Automated Baseline Removal for Raman Spectra of Artistic Pigments. Appl. Spectrosc. 2010;64:595–600. doi: 10.1366/000370210791414281. [DOI] [PubMed] [Google Scholar]
- 55.Liu H., Zhang Z., Liu S., Yan L., Liu T., Zhang T. Joint Baseline-Correction and Denoising for Raman Spectra. Appl. Spectrosc. 2015;69:1013–1022. doi: 10.1366/14-07760. [DOI] [PubMed] [Google Scholar]
- 56.Koch M., Suhr C., Roth B., Meinhardt-Wollweber M. Iterative morphological and mollifier-based baseline correction for Raman spectra. J. Raman Spectrosc. 2017;48:336–342. doi: 10.1002/jrs.5010. [DOI] [Google Scholar]
- 57.Chen Y., Dai L. An Automated Baseline Correction Method Based on Iterative Morphological Operations. Appl. Spectrosc. 2018;72:731–739. doi: 10.1177/0003702817752371. [DOI] [PubMed] [Google Scholar]
- 58.Li Z., Zhan D.-J., Wang J.-J., Huang J., Xu Q.-S., Zhang Z.-M., Zheng Y.-B., Liang Y.-Z., Wang H. Morphological weighted penalized least squares for background correction. Analyst. 2013;138:4483–4492. doi: 10.1039/c3an00743j. [DOI] [PubMed] [Google Scholar]
- 59.Palacký J., Mojzeš P., Bok J. SVD-based method for intensity normalization, background correction and solvent subtraction in Raman spectroscopy exploiting the properties of water stretching vibrations: SVD based method for preprocessing Raman spectra. J. Raman Spectrosc. 2011;42:1528–1539. doi: 10.1002/jrs.2896. [DOI] [Google Scholar]
- 60.Wu X., Wang Y., Wu B., Sun J. Classification of Fritillaria using a portable near-infrared spectrometer and fuzzy generalized singular value decomposition. Ind. Crops Prod. 2024;218 doi: 10.1016/j.indcrop.2024.119032. [DOI] [Google Scholar]
- 61.Takatsuka M., Goto S., Kobayashi K., Otsuka Y., Shimada Y. Evaluation of pure antioxidative capacity of antioxidants: ESR spectroscopy of stable radicals by DPPH and ABTS assays with singular value decomposition. Food Biosci. 2022;48 doi: 10.1016/j.fbio.2022.101714. [DOI] [Google Scholar]
- 62.Geng J., Lv J., Pei J., Liao C., Tan Q., Wang T., Fang H., Wang L. Prediction of soil organic carbon in black soil based on a synergistic scheme from hyperspectral data: Combining fractional-order derivatives and three-dimensional spectral indices. Comput. Electron. Agric. 2024;220 doi: 10.1016/j.compag.2024.108905. [DOI] [Google Scholar]
- 63.Yan C. Image reconstruction algorithms of computed tomography. Chin. Opt. 2013;6:617–632. [Google Scholar]
- 64.Xi Y., Li Y., Duan Z., Lu Y. A Novel Pre-Processing Algorithm Based on the Wavelet Transform for Raman Spectrum. Appl. Spectrosc. 2018;72:1752–1763. doi: 10.1177/0003702818789695. [DOI] [PubMed] [Google Scholar]
- 65.Mallat S.g. A Theory for Multiresolution Signal Decomposition - the Wavelet Representation. IEEE Trans. Pattern Anal. Mach. Intell. 1989;11:674–693. doi: 10.1109/34.192463. [DOI] [Google Scholar]
- 66.Shao L., Griffiths P.R. Automatic baseline correction by wavelet transform for quantitative open-path Fourier transform infrared spectroscopy. Environ. Sci. Technol. 2007;41:7054–7059. doi: 10.1021/es062188d. [DOI] [PubMed] [Google Scholar]
- 67.Xiong W., Jiang Y., Huang X., Cao L. Baseline offset correction technique for terahertz signals based on improved wavelet multiresolution analysis. Phys. Scr. 2024;99 doi: 10.1088/1402-4896/ad7bf4. [DOI] [Google Scholar]
- 68.Hu Y., Jiang T., Shen A., Li W., Wang X., Hu J. A background elimination method based on wavelet transform for Raman spectra. Chemometr. Intell. Lab. Syst. 2007;85:94–101. doi: 10.1016/j.chemolab.2006.05.004. [DOI] [Google Scholar]
- 69.Jiang X., Li F., Wang Q., Luo J., Hao J., Xu M. Baseline correction method based on improved adaptive iteratively reweighted penalized least squares for the x-ray fluorescence spectrum. Appl. Opt. 2021;60:5707–5715. doi: 10.1364/AO.425473. [DOI] [PubMed] [Google Scholar]
- 70.Cobas J.C., Bernstein M.A., Martín-Pastor M., Tahoces P.G. A new general-purpose fully automatic baseline-correction procedure for 1D and 2D NMR data. J. Magn. Reson. 2006;183:145–151. doi: 10.1016/j.jmr.2006.07.013. [DOI] [PubMed] [Google Scholar]
- 71.Zhang Z.-M., Chen S., Liang Y.-Z., Liu Z.-X., Zhang Q.-M., Ding L.-X., Ye F., Zhou H. An intelligent background-correction algorithm for highly fluorescent samples in Raman spectroscopy. J. Raman Spectrosc. 2010;41:659–669. doi: 10.1002/jrs.2500. [DOI] [Google Scholar]
- 72.Zhang Z.-M., Chen S., Liang Y.-Z. Baseline correction using adaptive iteratively reweighted penalized least squares. Analyst. 2010;135:1138–1146. doi: 10.1039/b922045c. [DOI] [PubMed] [Google Scholar]
- 73.Ma S., Xu S., Chen Y., Dou Z., Xia Y., Ding W., Dong J., Hu Y. A LIBS spectrum baseline correction method based on the non-parametric prior penalized least squares algorithm. Anal. Methods. 2024;16:4360–4372. doi: 10.1039/D4AY00679H. [DOI] [PubMed] [Google Scholar]
- 74.Per Christian Hansen . SIAM-Society for Industrial and Applied Mathematics; 2010. Discrete Inverse Problems : Insight and Algorithms (Fundamentals of Algorithms) [Google Scholar]
- 75.Calvetti D., Morigi S., Reichel L., Sgallari F. Tikhonov regularization and the L-curve for large discrete ill-posed problems. J. Comput. Appl. Math. 2000;123:423–446. doi: 10.1016/S0377-0427(00)00414-3. [DOI] [Google Scholar]
- 76.Hansen P.C., O’Leary D.P. The Use of the L-Curve in the Regularization of Discrete Ill-Posed Problems. SIAM J. Sci. Comput. 1993;14:1487–1503. doi: 10.1137/0914086. [DOI] [Google Scholar]
- 77.Eilers P.H.C. A perfect smoother. Anal. Chem. 2003;75:3631–3636. doi: 10.1021/ac034173t. [DOI] [PubMed] [Google Scholar]
- 78.Ye J., Tian Z., Wei H., Li Y. Baseline correction method based on improved asymmetrically reweighted penalized least squares for the Raman spectrum. Appl. Opt. 2020;59:10933–10943. doi: 10.1364/AO.404863. [DOI] [PubMed] [Google Scholar]
- 79.He S., Zhang W., Liu L., Huang Y., He J., Xie W., Wu P., Du C. Baseline correction for Raman spectra using an improved asymmetric least squares method. Anal. Methods. 2014;6:4402–4407. doi: 10.1039/c4ay00068d. [DOI] [Google Scholar]
- 80.Yang G., Dai J., Liu X., Chen M., Wu X. Multiple Constrained Reweighted Penalized Least Squares for Spectral Baseline Correction. Appl. Spectrosc. 2020;74:1443–1451. doi: 10.1177/0003702819885002. [DOI] [PubMed] [Google Scholar]
- 81.Baek S.-J., Park A., Ahn Y.-J., Choo J. Baseline correction using asymmetrically reweighted penalized least squares smoothing. Analyst. 2015;140:250–257. doi: 10.1039/c4an01061b. [DOI] [PubMed] [Google Scholar]
- 82.Li X., Tang X., Wang B., Lu Y., Chen H. An adaptive extended Gaussian peak derivative reweighted penalised least squares method for baseline correction. Anal. Methods. 2023;15:6048–6060. doi: 10.1039/d3ay01389h. [DOI] [PubMed] [Google Scholar]
- 83.Guo Y., Jin W., Wang W., He Y., Qiu S. Baseline correction for Raman spectra using a spectral estimation-based asymmetrically reweighted penalized least squares method. Appl. Opt. 2023;62:4766–4776. doi: 10.1364/AO.489478. [DOI] [PubMed] [Google Scholar]
- 84.Li H., Dai J., Pan T., Chang C., So H.C. Sparse Bayesian learning approach for baseline correction. Chemometr. Intell. Lab. Syst. 2020;204 doi: 10.1016/j.chemolab.2020.104088. [DOI] [Google Scholar]
- 85.Wang Q., Yan X.-L., Chen X.-C., Shuai P., Wang M., Zhang Y.-H. Spectral baseline estimation using penalized least squares with weights derived from the Bayesian method. Nucl. Sci. Tech. 2022;33:148. doi: 10.1007/s41365-022-01132-9. [DOI] [Google Scholar]
- 86.Han Q., Xie Q., Peng S., Guo B. Simultaneous spectrum fitting and baseline correction using sparse representation. Analyst. 2017;142:2460–2468. doi: 10.1039/c6an02341j. [DOI] [PubMed] [Google Scholar]
- 87.Liu Y., Zhou X., Yu Y. A concise iterative method using the Bezier technique for baseline construction. Analyst. 2015;140:7984–7996. doi: 10.1039/c5an01184a. [DOI] [PubMed] [Google Scholar]
- 88.Hu Y., Cao Z., Fu H., Dai J. Spectral Baseline Correction Method Based on Down-Sampling. Spectrosc. Spectr. Anal. 2025;45:351–357. doi: 10.3964/j.issn.1000-0593(2025)02-0351-07. [DOI] [Google Scholar]
- 89.Li H., Chen S., Dai J., Zou X., Chen T., Pan T., Holmes M. Fast Burst-Sparsity Learning-Based Baseline Correction (FBSL-BC)Algorithm for Signals of Analytical Instruments. Anal. Chem. 2022;94:5113–5121. doi: 10.1021/acs.analchem.1c05443. [DOI] [PubMed] [Google Scholar]
- 90.Sun B., Zhai J., Wang Z., Wu T., Yang S., Xie Y., Li Y., Liang P. Sparse decomposition enables adaptive and accurate Raman spectral denoising. Talanta (Oxf.) 2024;266 doi: 10.1016/j.talanta.2023.125120. [DOI] [PubMed] [Google Scholar]
- 91.Ning X., Selesnick I.W., Duval L. Chromatogram baseline estimation and denoising using sparsity (BEADS) Chemometr. Intell. Lab. Syst. 2014;139:156–167. doi: 10.1016/j.chemolab.2014.09.014. [DOI] [Google Scholar]
- 92.Ke K., Lu Y., Yi C. Improvement of Convex Optimization Baseline Correction in Laser-Induced Breakdown Spectral Quantitative Analysis. Spectrosc. Spectr. Anal. 2018;38:2256–2261. doi: 10.3964/j.issn.1000-0593(2018)07-2256-06. [DOI] [Google Scholar]
- 93.He Z., Zhang Y., Wang H., Pan P. Baseline correction method of acceleration time history based on residual displacement and convex optimization. Soil Dynam. Earthq. Eng. 2023;165 doi: 10.1016/j.soildyn.2022.107676. [DOI] [Google Scholar]
- 94.Yi C., Lv Y., Xiao H., Ke K., Yu X. A novel baseline correction method using convex optimization framework in laser-induced breakdown spectroscopy quantitative analysis. Spectrochim. Acta Part B At. Spectrosc. 2017;138:72–80. doi: 10.1016/j.sab.2017.10.014. [DOI] [Google Scholar]
- 95.Koch A., Weber J.-V. Baseline Correction of Spectra in Fourier Transform Infrared: Interactive Drawing with Bezier Curves. Appl. Spectrosc. 1998;52:970–973. doi: 10.1366/0003702981944607. [DOI] [Google Scholar]
- 96.Zeng X., Zhang H., Xi X., Li B., Zhou J. Numerically Denoising Thermally Tunable and Thickness-Dependent Terahertz Signals in ErFeO3 Based on Bézier Curves and B-Splines. Ann. Phys. 2021;533 doi: 10.1002/andp.202000464. [DOI] [Google Scholar]
- 97.Guo S., Bocklitz T., Popp J. Optimization of Raman-spectrum baseline correction in biological application. Analyst. 2016;141:2396–2404. doi: 10.1039/c6an00041j. [DOI] [PubMed] [Google Scholar]
- 98.Brunel B., Alsamad F., Piot O. Toward automated machine learning in vibrational spectroscopy: Use and settings of genetic algorithms for pre-processing and regression optimization. Chemometr. Intell. Lab. Syst. 2021;219 doi: 10.1016/j.chemolab.2021.104444. [DOI] [Google Scholar]
- 99.He S., Fang S., Liu X., Zhang W., Xie W., Zhang H., Wei D., Fu W., Pei D. Investigation of a genetic algorithm based cubic spline smoothing for baseline correction of Raman spectra. Chemometr. Intell. Lab. Syst. 2016;152:1–9. doi: 10.1016/j.chemolab.2016.01.005. [DOI] [Google Scholar]
- 100.Gorski L., Jakubowska M., Bas B., Kubiak W.W. Application of genetic algorithm for baseline optimization in standard addition voltammetry. J. Electroanal. Chem. 2012;684:38–46. doi: 10.1016/j.jelechem.2012.08.014. [DOI] [Google Scholar]
- 101.Jarvis R.M., Goodacre R. Genetic algorithm optimization for pre-processing and variable selection of spectroscopic data. Bioinformatics. 2005;21:860–868. doi: 10.1093/bioinformatics/bti102. [DOI] [PubMed] [Google Scholar]
- 102.Lin Y., Fan R., Wu Y., Zhan C., Qing R., Li K., Kang Z. Combining hyperspectral imaging technology and visible-near infrared spectroscopy with a data fusion strategy for the detection of soluble solids content in apples. J. Food Compost. Anal. 2025;137 doi: 10.1016/j.jfca.2024.106996. [DOI] [Google Scholar]
- 103.He P., Dumont E., Göksel Y., Slipets R., Schmiegelow K., Chen Q., Zor K., Boisen A. SERS mapping combined with chemometrics, for accurate quantification of methotrexate from patient samples. Spectrochim. Acta Mol. Biomol. Spectrosc. 2024;305 doi: 10.1016/j.saa.2023.123536. [DOI] [PubMed] [Google Scholar]
- 104.Lecun Y., Bottou L., Bengio Y., Haffner P. Gradient-based learning applied to document recognition. Proc. IEEE. 1998;86:2278–2324. doi: 10.1109/5.726791. [DOI] [Google Scholar]
- 105.Kiranyaz S., Avci O., Abdeljaber O., Ince T., Gabbouj M., Inman D.J. 1D convolutional neural networks and applications: A survey. Mech. Syst. Signal Process. 2021;151 doi: 10.1016/j.ymssp.2020.107398. [DOI] [Google Scholar]
- 106.Shang H., Wu Q., Wu J., Zhou S., Wang Z., Wang H., Yin J. Study on breast cancerization and isolated diagnosis in situ by HOF-ATR-MIR spectroscopy with deep learning. Spectrochim. Acta Mol. Biomol. Spectrosc. 2024;319 doi: 10.1016/j.saa.2024.124546. [DOI] [PubMed] [Google Scholar]
- 107.He X., Zhao Y., Li F. A new technique for baseline calibration of soil X-ray fluorescence spectra based on enhanced generative adversarial networks combined with transfer learning. J. Anal. Spectrom. 2023;38:2486–2498. doi: 10.1039/D3JA00235G. [DOI] [Google Scholar]
- 108.Siddique T., Mahmud M.S. 2024 International Conference on Computing, Networking and Communications (ICNC) 2024. A Surrogate Tiny Machine Learning Model of Variational Autoencoder For Real-Time Baseline Correction of Magnetometer Data; pp. 579–583. [DOI] [Google Scholar]
- 109.Jiao Q., Cai B., Liu M., Dong L., Hei M., Kong L., Zhao Y. A three-stage deep learning-based training frame for spectra baseline correction. Anal. Methods. 2024;16:1496–1507. doi: 10.1039/d3ay02062b. [DOI] [PubMed] [Google Scholar]
- 110.Dong S., Li Z., Hu F., Yu Z., Chen X. TraceNet: An Effective Deep-Learning-Based Method for Baseline Correction of Near-Field Acceleration Records. Seismol Res. Lett. 2023;94:1656–1669. doi: 10.1785/0220220272. [DOI] [Google Scholar]
- 111.Chen T., Son Y., Park A., Baek S.-J. Baseline correction using a deep-learning model combining ResNet and UNet. Analyst. 2022;147:4285–4292. doi: 10.1039/D2AN00868H. [DOI] [PubMed] [Google Scholar]
- 112.Kazemzadeh M., Martinez-Calderon M., Xu W., Chamley L.W., Hisey C.L., Broderick N.G.R. Cascaded Deep Convolutional Neural Networks as Improved Methods of Preprocessing Raman Spectroscopy Data. Anal. Chem. 2022;94:12907–12918. doi: 10.1021/acs.analchem.2c03082. [DOI] [PubMed] [Google Scholar]
- 113.Helland I., Naes T., Isaksson T. Related Versions of the Multiplicative Scatter Correction Method for Preprocessing Spectroscopic Data. Chemom. Intell. Lab. Syst. 1995;29:233–241. doi: 10.1016/0169-7439(95)00031-1. [DOI] [Google Scholar]
- 114.Chen Y.-C., Thennadil S.N. Insights into information contained in multiplicative scatter correction parameters and the potential for estimating particle size from these parameters. Anal. Chim. Acta. 2012;746:37–46. doi: 10.1016/j.aca.2012.08.006. [DOI] [PubMed] [Google Scholar]
- 115.Cauduro V.H., Alessio K.O., Gomes A.O., Flores E.M.M., Muller E.I., Duarte F.A. A simple, fast and green method for API gravity, density, sulfur and nitrogen determination in crude oil by ATR-FTIR. Microchem. J. 2024;200 doi: 10.1016/j.microc.2024.110348. [DOI] [Google Scholar]
- 116.Bai S., Xiao K., Liu Q., Mariga A.M., Yang W., Fang Y., Hu Q., Gao H., Chen H., Pei F. Prediction of moisture content of Agaricus bisporus slices as affected by vacuum freeze drying using hyperspectral imaging. Food Control. 2024;159 doi: 10.1016/j.foodcont.2024.110290. [DOI] [Google Scholar]
- 117.Afseth N.K., Kohler A. Extended multiplicative signal correction in vibrational spectroscopy, a tutorial. Chemometr. Intell. Lab. Syst. 2012;117:92–99. doi: 10.1016/j.chemolab.2012.03.004. [DOI] [Google Scholar]
- 118.Martens H., Stark E. Extended Multiplicative Signal Correction and Spectral Interference Subtraction - New Preprocessing Methods for Near-Infrared Spectroscopy. J. Pharm. Biomed. Anal. 1991;9:625–635. doi: 10.1016/0731-7085(91)80188-F. [DOI] [PubMed] [Google Scholar]
- 119.Jochemsen A., Alfredsen G., Martens H., Burud I. Exploring the use of extended multiplicative scattering correction for near infrared spectra of wood with fungal decay. Chemometr. Intell. Lab. Syst. 2024;252 doi: 10.1016/j.chemolab.2024.105187. [DOI] [Google Scholar]
- 120.Prada P., Brunel B., Moulin D., Rouillon L., Netter P., Loeuille D., Slimano F., Bouche O., Peyrin-Biroulet L., Jouzeau J.-Y., Piot O. Identification of circulating biomarkers of Crohn’s disease and spondyloarthritis using Fourier transform infrared spectroscopy. J. Biophoton. 2023;16:e202200200. doi: 10.1002/jbio.202200200. [DOI] [PubMed] [Google Scholar]
- 121.Goncalves M., Paiva N.T., Ferra J.M., Martins J., Magalhaes F., Carvalho L. Effect of temperature and age on near infrared spectra of amino resins. J. Near Infrared Spectrosc. 2021;29:84–91. doi: 10.1177/0967033520982365. [DOI] [Google Scholar]
- 122.Yadav S., Tomar M., Singhal T., Joshi N., Bhargavi H.A., Aavula N., Langyan S., Joshi T., Satyavathi C.T., Rana J.C., et al. Near-infrared reflectance spectroscopy (NIRS): An innovative, rapid, economical, easy and non-destructive whole grain analysis method for nutritional profiling of pearl millet genotypes. J. Food Compost. Anal. 2025;142 doi: 10.1016/j.jfca.2025.107373. [DOI] [Google Scholar]
- 123.Liu X., Huang J., Li W., Chen R., Cao L., Pan T., Liu F. Fast quality assessment and origin identification of Gentianae Macrophyllae Radix using fourier transform infrared photoacoustic spectroscopy coupled with chemometrics. J. Pharm. Biomed. Anal. 2025;259 doi: 10.1016/j.jpba.2025.116774. [DOI] [PubMed] [Google Scholar]
- 124.Barnes R.J., Dhanoa M.S., Lister S.J. Standard Normal Variate Transformation and De-Trending of Near-Infrared Diffuse Reflectance Spectra. Appl. Spectrosc. 1989;43:772–777. doi: 10.1366/0003702894202201. [DOI] [Google Scholar]
- 125.Ping J., Hao N., Guo X., Miao P., Guan Z., Chen H., Liu C., Bai G., Li W. Rapid and accurate identification of Panax ginseng origins based on data fusion of near-infrared and laser-induced breakdown spectroscopy. Food Res. Int. 2025;204 doi: 10.1016/j.foodres.2025.115925. [DOI] [PubMed] [Google Scholar]
- 126.Li B., Ma T., Inagaki T., Tsuchikawa S. Enhanced detection of early bruises in apples using near-infrared hyperspectral imaging with geometrical influence correction for universal size adaptation. Postharvest Biol. Technol. 2025;219 doi: 10.1016/j.postharvbio.2024.113282. [DOI] [Google Scholar]
- 127.Giannini V., Panic J., Regge D., Balestra G., Rosati S. Could normalization improve robustness of abdominal MRI radiomic features? Biomed. Phys. Eng. Express. 2023;9 doi: 10.1088/2057-1976/ace4ce. [DOI] [PubMed] [Google Scholar]
- 128.Agelet L.E., Hurburgh C.R., Jr A Tutorial on Near Infrared Spectroscopy and Its Calibration. Crit. Rev. Anal. Chem. 2010;40:246–260. doi: 10.1080/10408347.2010.515468. [DOI] [Google Scholar]
- 129.Lamptey F.P., Amuah C.L.Y., Boadu V.G., Abano E.E., Teye E. Smart classification of organic and inorganic pineapple juice using dual NIR spectrometers combined with chemometric techniques. Appl. Food Res. 2024;4 doi: 10.1016/j.afres.2024.100471. [DOI] [Google Scholar]
- 130.Boadu V.G., Teye E., Lamptey F.P., Amuah C.L.Y., Sam-Amoah L.K. Novel authentication of African geographical coffee types (bean, roasted, powdered) by handheld NIR spectroscopic method. Heliyon. 2024;10 doi: 10.1016/j.heliyon.2024.e35512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Alwaili M.A., Alminderej F.M., Alsehli B.R., Rudayni H.A., Allam A.A., Saleh S.M., Mohamed M.A. State-of-the-art mean centering of ratio spectra and HPLC methodologies for comprehensive analysis of some dry eyes drugs: investigation of white and green practices. Green Chem. Lett. Rev. 2025;18 doi: 10.1080/17518253.2025.2473487. [DOI] [Google Scholar]
- 132.Sjöblom J., Svensson O., Josefson M., Kullberg H., Wold S. An evaluation of orthogonal signal correction applied to calibration transfer of near infrared spectra. Chemom. Intell. Lab. Syst. 1998;44:229–244. doi: 10.1016/S0169-7439(98)00112-9. [DOI] [Google Scholar]
- 133.Wold S., Antti H., Lindgren F., Öhman J. Orthogonal signal correction of near-infrared spectra. Chemometr. Intell. Lab. Syst. 1998;44:175–185. doi: 10.1016/S0169-7439(98)00109-9. [DOI] [Google Scholar]
- 134.Xie L., Hong M., Yu Z. A Wavelength Selection Method Combining Direct Orthogonal Signal Correction and Monte Carlo. Spectrosc. Spectr. Anal. 2022;42:440–445. doi: 10.3964/j.issn.1000-0593(2022)02-0440-06. [DOI] [Google Scholar]
- 135.Fearn T. On orthogonal signal correction. Chemometr. Intell. Lab. Syst. 2000;50:47–52. doi: 10.1016/S0169-7439(99)00045-3. [DOI] [Google Scholar]
- 136.Andersson C.A. Direct orthogonalization. Chemometr. Intell. Lab. Syst. 1999;47:51–63. doi: 10.1016/S0169-7439(98)00158-0. [DOI] [Google Scholar]
- 137.Westerhuis J.A., de Jong S., Smilde A.K. Direct orthogonal signal correction. Chemometr. Intell. Lab. Syst. 2001;56:13–25. doi: 10.1016/S0169-7439(01)00102-2. [DOI] [Google Scholar]
- 138.Lomarat P., Phechkrajang C., Sunghad P., Anantachoke N. Raman spectroscopy coupled with the PLSR model: A rapid method for analyzing gamma-oryzanol content in rice bran oil. Food Chem. X. 2024;24 doi: 10.1016/j.fochx.2024.101923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Pavurala N., Madhavarao C.N., Lee J., Das J., Ashraf M., O’Connor T. Cell Culture Media and Raman Spectra Preprocessing Procedures Impact Glucose Chemometrics. J. Chemom. 2025;39 doi: 10.1002/cem.70005. [DOI] [Google Scholar]
- 140.Wang D., Tan Y., Li C., Xin J., Wang Y., Hou H., Gao L., Zhong C., Pan J., Li Z. Recognizing and reducing effects of moisture-salt coexistence on soil organic matter spectral prediction:From laboratory to satellite. Soil Tillage Res. 2025;248 doi: 10.1016/j.still.2024.106397. [DOI] [Google Scholar]
- 141.Ding S., Zhang X., Shang K., Xiao Q., Wang W., UR Rehman A. Removal of environmental influences for estimating soil texture fractions based on ZY1 satellite hyperspectral images. Catena (Cremling.) 2024;236 doi: 10.1016/j.catena.2023.107713. [DOI] [Google Scholar]
- 142.Yan C., Cheng Z., Luo S., Huang C., Han S., Han X., Du Y., Ying C. Analysis of handmade paper by Raman spectroscopy combined with machine learning. J. Raman Spectrosc. 2022;53:260–271. doi: 10.1002/jrs.6280. [DOI] [Google Scholar]
- 143.David M., Magdas D.A. Authentication of honey origin and harvesting year based on Raman spectroscopy and chemometrics. Talanta Open. 2024;10 doi: 10.1016/j.talo.2024.100342. [DOI] [Google Scholar]
- 144.Cai Z.-Q., Feng W.-W., Wang H.-Q., Lang X.-H., Yang J.-L., Wu X., Wang Q. In: OPTICAL DESIGN AND TESTING XII Proceedings of SPIE. Wang Y., Kidger T.E., Wu R., editors. Spie-Int Soc Optical Engineering; 2023. Identification method of microplastics based on Raman - Infrared spectroscopy fusion. [DOI] [Google Scholar]
- 145.Loahavilai P., Datta S., Prasertsuk K., Jintamethasawat R., Rattanawan P., Chia J.Y., Kingkan C., Thanapirom C., Limpanuparb T. Chemometric Analysis of a Ternary Mixture of Caffeine, Quinic Acid, and Nicotinic Acid by Terahertz Spectroscopy. ACS Omega. 2022;7:35783–35791. doi: 10.1021/acsomega.2c03808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 146.Aathilakshmi S., Balasubramaniam S., Sivakumar T.A., Chetana V.L. Min-Max Filtering and Exponential Fossa Optimization Algorithm-Based Parallel Convolutional Neural Network for Heart Disease Detection. Int. J. Intell. Syst. 2025;2025 doi: 10.1155/int/1409684. [DOI] [Google Scholar]
- 147.Ponugoti S., Rajagopalan S. MNAFH-Net: Maxout neuron attention forward harmonic network for human stress level detection. Biomed. Signal Process Control. 2025;103 doi: 10.1016/j.bspc.2024.107442. [DOI] [Google Scholar]
- 148.Fei N., Gao Y., Lu Z., Xiang T. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) IEEE; 2021. Z-Score Normalization, Hubness, and Few-Shot Learning; pp. 142–151. [DOI] [Google Scholar]
- 149.Jiang J., Tan X., Zhang L., Zhu Q., Li H., Qiu B. Hybrid N-way Partial Least Squares and Random Forest Model for Brick Tea Identification Based on Excitation–emission Matrix Fluorescence Spectroscopy. Food Bioproc. Tech. 2023;16:1335–1342. doi: 10.1007/s11947-023-03006-3. [DOI] [Google Scholar]
- 150.Li R., Cao G., Pu Y., Qiu B., Wang X., Yan J., Wang K. TDSC-Net: A Two-Dimensional Stellar Spectra Classification Model Based on Attention Mechanism and Feature Fusion. Spectrosc. Spectr. Anal. 2024;44:1968–1973. doi: 10.3964/j.issn.1000-0593(2024)07-1968-06. [DOI] [Google Scholar]
- 151.Duenweg S.R., Bobholz S.A., Lowman A.K., Winiarz A., Nath B., Barrett M.J., Kyereme F., Vincent-Sheldon S., Laviolette P. Comparison of intensity normalization methods in prostate, brain, and breast cancer multi-parametric magnetic resonance imaging. Front. Oncol. 2025;15 doi: 10.3389/fonc.2025.1433444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 152.Bi Y., Yuan K., Xiao W., Wu J., Shi C., Xia J., Chu G., Zhang G., Zhou G. A local pre-processing method for near-infrared spectra, combined with spectral segmentation and standard normal variate transformation. Anal. Chim. Acta. 2016;909:30–40. doi: 10.1016/j.aca.2016.01.010. [DOI] [PubMed] [Google Scholar]
- 153.Ardakani A.A., Gharbali A., Mohammadi A. Classification of Breast Tumors Using Sonographic Texture Analysis. J. Ultrasound Med. 2015;34:225–231. doi: 10.7863/ultra.34.2.225. [DOI] [PubMed] [Google Scholar]
- 154.Yang N.-C., Sung K.-L. Non-Intrusive Load Classification and Recognition Using Soft-Voting Ensemble Learning Algorithm With Decision Tree, K-Nearest Neighbor Algorithm and Multilayer Perceptron. IEEE Access. 2023;11:94506–94520. doi: 10.1109/ACCESS.2023.3311641. [DOI] [Google Scholar]
- 155.Nayak D.R., Dash R., Majhi B. Brain MR image classification using two-dimensional discrete wavelet transform and AdaBoost with random forests. Neurocomputing. 2016;177:188–197. doi: 10.1016/j.neucom.2015.11.034. [DOI] [Google Scholar]
- 156.Guo Z., Zhang B., Zeng Y. Study on Sugar Content Detection of Kiwifruit Using Near-Infrared Spectroscopy Combined With Stacking Ensemble Learning. Spectrosc. Spectr. Anal. 2024;44:2932–2940. doi: 10.3964/j.issn.1000-0593(2024)10-2932-09. [DOI] [Google Scholar]
- 157.Luo Y., Liu Y., Wei Q., Strlič M. NIR spectroscopy in conjunction with multivariate analysis for non-destructive characterization of Xuan paper. Herit. Sci. 2024;12:175. doi: 10.1186/s40494-024-01287-1. [DOI] [Google Scholar]
- 158.Jakkan D.A., Ghare P., Sakode C. Multi-parameter Soil Property Prediction Incorporating Mid-infrared Spectroscopy and Dropout Sequential Artificial Neural Network. Water Air Soil Pollut. 2023;234:694. doi: 10.1007/s11270-023-06726-6. [DOI] [Google Scholar]
- 159.Dai B., Peng Y., Zhang M., Yang M., Wu Y., Guo X. Insight into the effects of biological treatment on the binding properties of copper onto dissolved organic matter derived from coking wastewater. Ecotoxicol. Environ. Saf. 2022;238 doi: 10.1016/j.ecoenv.2022.113567. [DOI] [PubMed] [Google Scholar]
- 160.Zhang T., Liu Z., Ma Q., Hu D., Dai Y., Zhang X., Zhou Z. Identification of Dendrobium Using Laser-Induced Breakdown Spectroscopy in Combination with a Multivariate Algorithm Model. Foods. 2024;13:1676. doi: 10.3390/foods13111676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 161.Capela D., Lopes T., Dias F., Ferreira M.F.S., Teixeira J., Lima A., Jorge P.A.S., Silva N.A., Guimarães D. Advancing automated mineral identification through LIBS imaging for lithium-bearing mineral species. Spectrochim. Acta Part B At. Spectrosc. 2025;223 doi: 10.1016/j.sab.2024.107085. [DOI] [Google Scholar]
- 162.Schafer R. What Is a Savitzky-Golay Filter? IEEE Signal Process. Mag. 2011;28:111–117. doi: 10.1109/MSP.2011.941097. [DOI] [Google Scholar]
- 163.Lee J., Lee W. Peak-aware adaptive denoising for Raman spectroscopy based on machine learning approach. J. Raman Spectrosc. 2024;55:525–533. doi: 10.1002/jrs.6648. [DOI] [Google Scholar]
- 164.Liu X., Qiao S., Ma Y. Highly sensitive methane detection based on light-induced thermoelastic spectroscopy with a 2.33 μm diode laser and adaptive Savitzky-Golay filtering. Opt. Express. 2022;30:1304–1313. doi: 10.1364/OE.446294. [DOI] [PubMed] [Google Scholar]
- 165.Zheng K.-Y., Zhang X., Tong P.-J., Yao Y., Du Y.-P. Pretreating near infrared spectra with fractional order Savitzky-Golay differentiation (FOSGD) Chin. Chem. Lett. 2015;26:293–296. doi: 10.1016/j.cclet.2014.10.023. [DOI] [Google Scholar]
- 166.Zhang J., Mouazen A.M. Fractional-order Savitzky-Golay filter for pre-treatment of on-line vis-NIR spectra to predict phosphorus in soil. Infrared Phys. Technol. 2023;131 doi: 10.1016/j.infrared.2023.104720. [DOI] [Google Scholar]
- 167.Lu Y., Liu W., Zhang Y., Zhang K., He Y., You K., Li X., Liu G., Tang Q., Fan B., et al. An Adaptive Hierarchical Savitzky-Golay Spectral Filtering Algorithm and Its Application. Spectrosc. Spectr. Anal. 2019;39:2657–2663. doi: 10.3964/j.issn.1000-0593(2019)09-2657-07. [DOI] [Google Scholar]
- 168.Chen S., Tian X., Mu T., Yuan J., Cao X., Cheng G. Enhancement of Methane Detection in Tunable Diode Laser Absorption Spectroscopy Using Savitzky-Golay Filtering. Photonics. 2024;12:2. doi: 10.3390/photonics12010002. [DOI] [Google Scholar]
- 169.Jacob A.M., Menten K.M., Wiesemeyer H., Lee M.-Y., Güsten R., Durán C.A. Fingerprinting the effects of hyperfine structure on CH and OH far infrared spectra using Wiener filter deconvolution. Astron. Astrophys. 2019;632:A60. doi: 10.1051/0004-6361/201936037. [DOI] [Google Scholar]
- 170.Yang S., Li Y., Song F., Wang H., Wang Y., Zheng C., Tittel F.K. Non-differential dual-channel mid-infrared absorption spectroscopy for CH4 detection using Wiener-Homomorphic filtering technique. Infrared Phys. Technol. 2020;104 doi: 10.1016/j.infrared.2019.103105. [DOI] [Google Scholar]
- 171.Ibrahim I., AlRowaily M.H., Arof H., Abu Talip M.S. Performance Comparison of Selected Filters in Fast Denoising of Oil Palm Hyperspectral Data. Appl. Sci. (Basel). 2024;14:8895. doi: 10.3390/app14198895. [DOI] [Google Scholar]
- 172.Hotra O., Firago V., Levkovich N., Shuliko K. Investigation of the Possibility of Using Microspectrometers Based on CMOS Photodiode Arrays in Small-Sized Devices for Optical Diagnostics. Sensors. 2022;22:4195. doi: 10.3390/s22114195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 173.Kalman R.E. A New Approach to Linear Filtering and Prediction Problems. J. Basic Eng. 1960;82:35–45. [Google Scholar]
- 174.Izzetoglu M., Chitrapu P., Bunce S., Onaral B. Motion artifact cancellation in NIR spectroscopy using discrete Kalman filtering. Biomed. Eng. Online. 2010;9:16. doi: 10.1186/1475-925X-9-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 175.Spotts I., Brodie C.H., Saeedkia D., Gadsden S.A., Collier C.M. Improved Terahertz Time-Domain Spectroscopy via the Extended Kalman Filter. IEEE J. Sel. Top. Quantum Electron. 2023;29:1–12. doi: 10.1109/JSTQE.2023.3269048. [DOI] [Google Scholar]
- 176.Li D., Wang D., Li Z., Chen H. Application of Kalman Filter in Gas Detection by Cavity Ring-Down Spectroscopy. Spectrosc. Spectr. Anal. 2024;44:2727–2732. doi: 10.3964/j.issn.1000-0503(2024)10-2727-06. [DOI] [Google Scholar]
- 177.Dong X., He Z., Yan X., Gao D., Jiao J., Sun Y., Wang H., Qu H. Real-time model correction using Kalman filter for Raman-controlled cell culture processes. Chin. J. Chem. Eng. 2024;70:251–260. doi: 10.1016/j.cjche.2024.03.016. [DOI] [Google Scholar]
- 178.Lu Y., Liu L., Wu Z., Xu Z., Zhao Z., Hao Z., Shi J., He X. Long-term reproducibility detection method for quantitative LIBS using Kalman filtering. J. Anal. At. Spectrom. 2023;38:2619–2624. doi: 10.1039/D3JA00275F. [DOI] [Google Scholar]
- 179.Cui X., Cui X., Zhu Q., Yin S., Shi X., Zhang L., Yu B., Hong Y., Chen W. Rapid and precise measurement of atmospheric CO2 and its isotopic ratios using a mid-infrared gas sensor. Sens. Actuators B Chem. 2025;430 doi: 10.1016/j.snb.2025.137329. [DOI] [Google Scholar]
- 180.Ashkezari-Toussi S., Sadoghi-Yazdi H. Robust diffusion LMS over adaptive networks. Signal Process. 2019;158:201–209. doi: 10.1016/j.sigpro.2019.01.004. [DOI] [Google Scholar]
- 181.Chu Y.J., Chan S.C., Zhou Y., Wu M. A new diffusion variable spatial regularized LMS algorithm. Signal Process. 2021;188 doi: 10.1016/j.sigpro.2021.108207. [DOI] [Google Scholar]
- 182.Wu B., Zhou F. Application of neural network adaptive filter method to simultaneous detection of polymetallic ions based on ultraviolet-visible spectroscopy. Front. Chem. 2024;12 doi: 10.3389/fchem.2024.1409527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 183.Donoho D.L., Johnstone I.M. Ideal Spatial Adaptation by Wavelet Shrinkage. Biometrika. 1994;81:425–455. doi: 10.1093/biomet/81.3.425. [DOI] [Google Scholar]
- 184.Srivastava M., Anderson C.L., Freed J.H. A New Wavelet Denoising Method for Selecting Decomposition Levels and Noise Thresholds. IEEE Access. 2016;4:3862–3877. doi: 10.1109/ACCESS.2016.2587581. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 185.Johnstone I.M., Silverman B.W. Wavelet threshold estimators for data with correlated noise. J. R. Stat. Soc. Ser. B Stat. Methodol. 1997;59:319–351. doi: 10.1111/1467-9868.00071. [DOI] [Google Scholar]
- 186.Nsengiyumva W., Zhong S., Luo M., Wang B. Terahertz Spectroscopic Characterization and Thickness Evaluation of Internal Delamination Defects in GFRP Composites. Chin. J. Mech. Eng. 2023;36:6. doi: 10.1186/s10033-022-00829-7. [DOI] [Google Scholar]
- 187.Feldman M. Time-varying vibration decomposition and analysis based on the Hilbert transform. J. Sound Vib. 2006;295:518–530. doi: 10.1016/j.jsv.2005.12.058. [DOI] [Google Scholar]
- 188.Yan C. Research and development on Kramers-Kronig relationship. Chin. Opt. 2019;12:179–198. doi: 10.3788/CO.20191202.0179. [DOI] [Google Scholar]
- 189.Zhao X.Y., Liu G.Y., Sui Y.T., Xu M., Tong L. Denoising method for Raman spectra with low signal-to-noise ratio based on feature extraction. Spectrochim. Acta Mol. Biomol. Spectrosc. 2021;250 doi: 10.1016/j.saa.2020.119374. [DOI] [PubMed] [Google Scholar]
- 190.Shukla U.P., Nanda S.J. Denoising hyperspectral images using Hilbert vibration decomposition with cluster validation. IET Image Process. 2018;12:1736–1745. doi: 10.1049/iet-ipr.2017.1234. [DOI] [Google Scholar]
- 191.Zhao X., Xu M., Zhang W., Liu G., Tong L. Identification of zinc pollution in rice plants based on two characteristic variables. Spectrochim. Acta Mol. Biomol. Spectrosc. 2021;261 doi: 10.1016/j.saa.2021.120043. [DOI] [PubMed] [Google Scholar]
- 192.Savitzky A., Golay M.J.E. Smoothing + Differentiation of Data by Simplified Least Squares Procedures. Anal. Chem. 1964;36:1627–1639. doi: 10.1021/ac60214a047. [DOI] [Google Scholar]
- 193.Steinier J., Termonia Y., Deltour J. Smoothing and Differentiation of Data by Simplified Least Square Procedure. Anal. Chem. 1972;44:1906–1909. doi: 10.1021/ac60319a045. [DOI] [PubMed] [Google Scholar]
- 194.Madden H.H. Comments on Savitzky-Golay Convolution Method for Least-Squares Fit Smoothing and Differentiation of Digital Data. Anal. Chem. 1978;50:1383–1386. doi: 10.1021/ac50031a048. [DOI] [Google Scholar]
- 195.Hao Y., Wang Q., Zhang S. Rapid Identification of Wood Species Based on Portable Near-Infrared Spectrometry and Chemometrics Methods. Spectroscopy. 2021;36:7–13. [Google Scholar]
- 196.Li T., Su C. Authenticity identification and classification of Rhodiola species in traditional Tibetan medicine based on Fourier transform near-infrared spectroscopy and chemometrics analysis. Spectrochim. Acta Mol. Biomol. Spectrosc. 2018;204:131–140. doi: 10.1016/j.saa.2018.06.004. [DOI] [PubMed] [Google Scholar]
- 197.Jong C.Y., Tristan G., Felix L.J.J., Yeap E.W.Q., Dubbaka S.R., Rao H.N., Wong S.Y. Systematic Assessment of Calibration Strategies in Spectroscopic Analysis: A Case Study of Paracetamol Crystallization. Org. Process Res. Dev. 2025;29:503–520. doi: 10.1021/acs.oprd.4c00496. [DOI] [Google Scholar]
- 198.Norris K., Williams P. Optimization of Mathematical Treatments of Raw Near-Infrared Signal in the Measurement of Protein in Hard Red Spring Wheat .1. Influence of Particle-Size. Cereal Chem. 1984;61:158–165. [Google Scholar]
- 199.Asachi M., Hassanpour A., Ghadiri M., Bayly A. Assessment of Near-Infrared (NIR) spectroscopy for segregation measurement of low content level ingredients. Powder Technol. 2017;320:143–154. doi: 10.1016/j.powtec.2017.07.003. [DOI] [Google Scholar]
- 200.He N., Shan P., He Z., Wang Q., Li Z., Wu Z. Study on the Fractional Baseline Correction Method of ATR-FTIR Spectral Signal in the Fermentation Process of Sodium Glutamate. Spectrosc. Spectr. Anal. 2022;42:1848–1854. doi: 10.3964/j.issn.1000-0593(2022)06-1848-07. [DOI] [Google Scholar]
- 201.Zhang S., Fei T., Chen Y., Hong Y. Estimating cadmium-lead concentrations in rice blades through fractional order derivatives of foliar spectra. Biosyst. Eng. 2022;219:177–188. doi: 10.1016/j.biosystemseng.2022.04.023. [DOI] [Google Scholar]
- 202.Song G., Wang Q., Jin J. Estimation of leaf photosynthetic capacity parameters using spectral indices developed from fractional-order derivatives. Comput. Electron. Agric. 2023;212 doi: 10.1016/j.compag.2023.108068. [DOI] [Google Scholar]
- 203.Tian A., Zhao J., Tang B., Zhu D., Fu C., Xiong H. Study on the Pretreatment of Soil Hyperspectral and Na+ Ion Data under Different Degrees of Human Activity Stress by Fractional-Order Derivatives. REMOTE Sens. 2021;13:3974. doi: 10.3390/rs13193974. [DOI] [Google Scholar]
- 204.Wang Y., Wang Y., Zhang Z., Sun L., Yang R., Sapnken F.E., Xiao W. A novel fractional-order kernel regularized non-homogeneous grey Riccati model and its application in oil reserves prediction. Energy (Calg.) 2025;316 doi: 10.1016/j.energy.2025.134675. [DOI] [Google Scholar]
- 205.Yan C., Cheng Z., Cao L., Wen Y. Enhanced 3-D asynchronous correlation data preprocessing method for Raman spectroscopy of Chinese handmade paper. Spectrochim. Acta Mol. Biomol. Spectrosc. 2024;310 doi: 10.1016/j.saa.2024.123866. [DOI] [PubMed] [Google Scholar]
- 206.Yan C., Luo S., Cao L., Cheng Z., Zhang H. Tensor product based 2-D correlation data preprocessing methods for Raman spectroscopy of Chinese handmade paper. Spectrochim. Acta Mol. Biomol. Spectrosc. 2023;302 doi: 10.1016/j.saa.2023.123033. [DOI] [PubMed] [Google Scholar]

