Skip to main content
Frontiers in Bioengineering and Biotechnology logoLink to Frontiers in Bioengineering and Biotechnology
. 2026 Feb 23;14:1774782. doi: 10.3389/fbioe.2026.1774782

Rapid and reagent-free screening of occult hepatitis B virus infection based on plasma Vis-NIR spectral pattern recognition

Linbin Huang 1,, Xiaoting Huang 2,, Jingjing Xia 1, Lining Huang 1, Huanjie Zhou 1, Min Chen 2, Baoren He 3, Meijun Chen 3, Qiuhong Mo 3, Tao Pan 2,*, Chao Ou 1,*
PMCID: PMC12968168  PMID: 41808763

Abstract

Introduction

Occult hepatitis B virus infection (OBI) is a specific form of hepatitis B virus (HBV) infection characterized by testing negative for Hepatitis B surface antigen (HBsAg) with the presence of HBV DNA in the blood. Due to the complexity and high cost of HBV DNA testing, which is rarely included in routine physical examinations, leading to underdiagnosis of OBI. In this study, plasma visible-near-infrared (Vis-NIR) spectroscopy pattern recognition was employed to develop the discriminant analysis models for distinguishing between OBI from healthy (normal controls) plasma.

Methods

A total of 444 plasma samples from voluntary blood donors (OBI 204, normal controls 240) were collected, and their Vis-NIR spectra were measured. The samples were rigorously divided into training, prediction, and independent external validation sets. Partial least squares-discriminant analysis (PLS-DA) and k-nearest neighbor (kNN) were used as spectral classifiers; standard normal variate (SNV) and norris derivative filtering (NDF) were applied for spectral preprocessing. The integrated algorithm combining separation degree priority combination (SDPC) with wavelength step-by-step phase-out (WSP) was utilized for the optimal wavelength selection.

Results

The plasma spectral discriminant models for OBI and normal control were successfully established. Based on the optimal SNV-NDF preprocessed spectra, the SDPC-WSP-kNN and SDPC-WSP-PLS-DA methods determined the optimal number of wavelengths N to be 5 and 26, respectively. When evaluated on the independent external validation set, the SDPC-WSP-kNN model demonstrated better robustness, achieving sensitivity, specificity, and total recognition accuracy rates of 96.6%, 100%, and 98.7%, respectively. By introducing a grey judgment zone, both SEN and SPE reached 100%, with a detection recovery rate of 96.8%.

Conclusion

These results indicated that Vis-NIR spectroscopy pattern recognition can accurately discriminate between OBI and normal controls’ plasma samples. This method is reagent-free, rapid, and simple, making it suitable for large-scale, low-cost rapid screening of OBI. In particular, the proposed few-wavelength model can provide an important reference for the development of small specialized blood analyzers for OBI detection.

Keywords: blood screening, multi-wavelength, norris derivative filtering, occult hepatitis B virus infection, partial least squares-discriminant analysis, separation degree priority combination, step-by-step phase-out, visible-near-infrared spectralpattern recognition

1. Introduction

Hepatitis B virus (HBV) infection is a serious infectious disease and a global public health problem (Jeng et al., 2023). The main route of transmission of HBV is blood-borne (blood transfusions, blood products, etc.). Transfusion-transmitted HBV infection may lead to hepatitis, cirrhosis, hepatocellular carcinoma (HCC), and even death in patients (Justiz Vaillant et al., 2025). Preventing transfusion-transmitted infections (TTIs) is a priority in the management of conditions such as surgeries, and trauma (Candotti et al., 2019).

The progression of HBV infection is often jointly determined by the host’s innate and/or adaptive immune responses, which subsequently influence viral clearance, disease progression, and clinical outcomes. This process is accompanied by dynamic changes in serological markers, including HBV DNA, Hepatitis B surface antigen (HBsAg), and related antibodies (Hong et al., 2021; Kramvis et al., 2022). Occult hepatitis B virus infection (OBI) represents a specific form of HBV infection. Under the influence of multiple factors such as strong immune suppression, viral mutations, or low-level replication, OBI is characterized by undetectable HBsAg in blood despite the presence of HBV DNA, often with low and fluctuating viral loads (Raimondo et al., 2019; Huang et al., 2025). OBI is difficult to identify through routine HBsAg screening and is frequently missed. Importantly, although OBI is not currently incorporated into HBV treatment guidelines, it is considered a recognized key risk factor in transfusion medicine, as undetected HBV infection can lead to transfusion transmission (Raimondo et al., 2019). Studies have estimated that 8%–29% of transfusion-associated HBV infections may be attributable to OBI (Im et al., 2022). In contexts such as immunosuppressive therapy, chemotherapy, or organ transplantation, OBI may lead to viral reactivation and increase the risk of HCC (Peng et al., 2022; Schwarz et al., 2023; Hussein and Mahfodh, 2025). Thus, OBI detection is an essential component of HBV infection control (Ye et al., 2022). However, identifying OBI requires nucleic acid testing (NAT) for HBV DNA in addition to HBsAg screening (Ye et al., 2022; Saitta et al., 2022). In particular, in some low-income countries, NAT for OBI is not available in the blood collection/supply system, increasing the risk of OBI transfusion transmission (Cooke et al., 2024; Saravanan et al., 2024; Alshrari et al., 2025).

Visible-near-infrared (Vis-NIR) spectroscopy offers the advantages of rapid and straightforward analysis, holding significant potential for applications in the biomedical field (Wang et al., 2022). Blood samples, which are rich in clinical information, contain numerous components that exhibit characteristic absorption features in the Vis-NIR region. Vis-NIR spectroscopy has already been utilized for the quantitative analysis of several critical blood clinical indicators.

Spectral pattern recognition is based on the similarity within the same category of spectra and the differences between distinct categories, utilizing pattern recognition algorithms for classification. For classifying samples with similar compositions but low concentrations of characteristic components, spectral pattern recognition based on population characteristics is often simpler and more effective than quantitative analysis, which typically requires complex and expensive micro-quantitative analysis. Vis-NIR spectral pattern recognition represents a prominent application direction in the biomedical field (Wang et al., 2022)and has been successfully applied in clinical blood discrimination. These include the identification of human and animal blood (Fonseca et al., 2022), discrimination between β-thalassemia and normal controls (Yao et al., 2018), diagnosis of ovarian cancer from serum (Gajjar et al., 2013), classification of breast cancer and normal controls’ serum (Yuan et al., 2021), and identification of HBV-DNA excess in serum (Chen et al., 2023). These advances demonstrate the feasibility of using Vis-NIR spectroscopy for clinical blood classification.

Given that both OBI individuals and healthy individuals (normal controls) test negative for HBsAg, the accurate identification of OBI among HBsAg-negative blood samples is of critical importance. The fundamental distinction between plasma from OBI individuals and normal controls lies in the presence of HBV DNA in the blood of OBI individuals, which serves as a marker of viral replication after HBV infection.

The HBV DNA genome contains four open reading frames (ORFs)—S, C, P, and X—and nucleic acid testing for HBV often targets specific ORF fragments in the blood as detection markers (Saitta et al., 2022; Nguyen et al., 2020). In this study, the C and S gene fragments of HBV genome were amplified and tested by NAT. Any positive result for these gene fragments indicated OBI, whereas the normal controls were tested negative. According to the National Center for Biotechnology Information (NCBI) database, the primary amino acids encoded by the various HBV gene fragments include Proline (molecular formula C5H9NO2), Leucine (C6H13NO2), Serine (C3H7NO3), Alanine (C3H7NO2), Glycine (C2H5NO2), Arginine (C6H14N4O2), and others (Wu et al., 2022; Seeger and Mason, 2015). Compared with normal controls’ plasma, OBI samples exhibit altered levels of these amino acids. Furthermore, multi-omics studies have revealed significant differences between OBI individuals and normal controls in various metabolic pathways and enzyme activities, including protein metabolism, fatty acid metabolism, retinol metabolism, alcohol dehydrogenase (ADH), and UDP-glucuronosyltransferase (UGT) (Jiang et al., 2024).

The aforementioned amino acids, metabolites, and enzymes all contain hydrogen-containing groups (X-H), which exhibit characteristic absorption in the Vis-NIR spectral region. Therefore, this may lead to differences in the overall Vis-NIR spectral characteristics of OBI plasma compared with normal controls. Accordingly, establishing a discrimination model for OBI and normal control plasma samples using Vis-NIR spectral pattern recognition has a certain molecular mechanism basis. Subsequent experiments and modeling will indirectly validate this feasibility.

In recent years, near-infrared to mid-infrared (NIR-MIR) spectral pattern recognition has been applied to discriminate serum samples with elevated HBV DNA levels (Chen et al., 2023), providing preliminary evidence of an association between HBV DNA concentration and infrared spectral signals. Given that the presence of HBV DNA in the blood represents the key distinction between OBI and normal controls’ plasma, the aforementioned study indirectly supports the feasibility of applying Vis-NIR spectral pattern recognition for discriminating OBI from normal controls’ plasma samples. However, according to the literature search, there has not been reported about the spectroscopic discrimination of OBI and normal controls’ blood samples. Therefore, the development of a rapid Vis-NIR spectroscopy-based screening method for OBI in blood samples is of great significance for infectious disease prevention and control.

In this study, Vis-NIR spectroscopy combined with chemometrics was used to establish the discrimination models of the OBI and normal controls’ plasma samples. The research focused on the integrated optimization of classifier algorithms, spectral preprocessing methods, and multi-wavelength combination models for plasma spectral pattern recognition. Partial least squares-discriminant analysis (PLS-DA) (Wang et al., 2022; Chi et al., 2024) and k-nearest neighbor (kNN) (Mabwa et al., 2021; Ciloglu et al., 2020) were employed as classifier algorithms; standard normal variate (SNV) transformation (Barnes MSD and Lister, 1989; Dhanoa et al., 1994) and Norris derivative filtering (NDF) with multiple parameters (Yang et al., 2019; Norris, 2001) were applied for spectral preprocessing. A two-stage wavelength selection strategy was implemented: separation degree priority combination (SDPC) (Yuan et al., 2023; Pan et al., 2023)was used in the first stage, followed by wavelength step-by-step phase-out (WSP) (Yang et al., 2019; Tan et al., 2020) in the second stage. This study aims to develop a rapid, reagent-free, and accurate spectral pattern recognition method for screening occult hepatitis B virus infection.

2. Experiments and methods

2.1. Samples and reference methods

Plasma samples were collected from voluntary blood donors at the Nanning Blood Center in Guangxi, China. As part of routine screening, each plasma sample was initially tested twice for HBsAg by two independent operators using enzyme-linked immunosorbent assays (ELISA). To avoid detection variability, two different commercial ELISA kits (Bio-Rad, France; Wantai, Beijing, China; analytical sensitivity: 1 IU/mL) and detection systems (MICROLAB® F.A.M.E, Hamilton, Switzerland; URANUS AE 368, Aikon, China) were used. All samples that tested negative for HBsAg by ELISA were further confirmed using high-sensitivity electrochemiluminescence immunoassay (ECLIA; Roche Diagnostics, Germany; detection limit: 0.05 IU/mL) to rule out low-level HBsAg positivity. Only samples confirmed as HBsAg-negative by ECLIA were included in subsequent analyses. For HBV DNA testing, the cobas 6800 nucleic acid testing system (Roche Diagnostics, Germany; detection limit: 1.4 IU/mL) was used, targeting the C and S gene regions of the HBV genome. OBI was defined as the detection of HBV DNA in plasma in the absence of HBsAg (confirmed by high-sensitivity ECLIA). Healthy controls were defined as individuals negative for both HBsAg (confirmed by ECLIA) and HBV DNA.

Through the above rigorous testing procedures, a total of 204 OBI samples (166 males, 38 females) and 240 normal controls’ samples (180 males, 60 females) were collected. The discriminant criteria for OBI and normal controls are listed in Table 1.

TABLE 1.

Sample categories based on HBsAg and HBV DNA detection.

Category/Test items HBsAg HBV DNA
Normal control - -
OBI - +

The test results are positive and negative, recorded as + and - respectively.

2.2. Experimental materials, instrumentation, and measurement methods

Spectra were acquired using a QE65Pro spectrometer (Ocean Optics, United States); Hamamatsu S7031-1006 thin back-illuminated CCD array detector was adopted; the spectral scanning range was 333–1118 nm (wavelength interval 0.79 nm, 1044 wavelengths); a 10 mm quartz cuvette was employed as the liquid transmission accessory. The integration time for spectral measurement was set to 8 ms, and the number of scans was set to 10. Each plasma sample was measured five times, and the average spectrum was used for subsequent spectral modeling and validation. All measurements were conducted under controlled ambient conditions (25 °C ± 1 °C, 46% ± 1% relative humidity).

2.3. Training-prediction-external validation procedure and model evaluation indexes

A rigorous training-prediction-external validation design was implemented. All plasma samples were first stratified into four subgroups according to category (OBI, normal control) and gender (male, female). Each subgroup was then randomly allocated to training, prediction, and independent external validation sets, which were subsequently pooled. The types and numbers of training, prediction, and external validation set samples are shown in Table 2. The training and prediction sets were used for modeling and parameter optimization. While in the external validation set, the independent samples that did not participate in modeling were used for model validation.

TABLE 2.

Sample types and number assigned to the training, prediction, and external validation sets.

Type Training Prediction External validation Total
OBI
Male 70 50 46 166
Female 15 10 13 38
Normal control
Male 70 50 60 180
Female 15 10 35 60
Total 170 120 154 444

The performance evaluation indicators of the plasma discrimination model for OBI (positive) and the normal control (negative) are as follows: in the modeling process, the predicted category of each spectrum (positive, negative) was determined; by comparing the results with the clinically true categories, the positive, negative, and total recognition-accuracy rates ( RART+ ; RART , RART ; RARP+ ; RARP , RARP ;%) on the training and prediction sets were calculated according to the following Formulas 1, 2:

RART+=NT+NT+,RART=NTNT,RART=NT++NTNT++NT, (1)
RARP+=NP+NP+,RARP=NPNP,RARP=NP++NPNP++NP (2)

Among them, the number of positive and negative spectra in the training and prediction sets by clinical diagnosis were NT+ , NP+ , NT and NP , respectively; the number of correctly identified spectra were NT+ , NP+ , NT and NP , respectively.

In the external validation process, the number of positive and negative spectra in the external validation set by clinical diagnosis were NV+ , NV , respectively; the number of correctly identified spectra were NV+ , NV , respectively. The corresponding number of incorrectly identified spectra (false negatives and false positives) were NV+NV+ and NVNV , respectively. The sensitivity (SEN), specificity (SPE), false negative rate (FNR, i.e., missed diagnosis rate), and false positive rate (FPR, i.e., misdiagnosis rate) of the external validation set were calculated as the following Formula 3:

SEN=NV+NV+,SPE=NVNV,FNR=NV+NV+NV+,FPR=NVNVNV (3)

The total recognition-accuracy rate ( RARv ) on the external validation sets was calculated, as the following Formula 4:

RARV=NV++NVNV++NV (4)

2.4. Norris derivative filtering

Spectral preprocessing helps to correct noise interference such as spectral baseline-drift and tilt. Norris derivative filtering NDF is an effective multi-parameter spectral preprocessing algorithm (Yang et al., 2019; Norris, 2001; Pan et al., 2020), which includes moving average smoothing and difference derivation, and used three parameters: derivative order (d), number of smoothing points (s, odd number) and number of difference gaps (g). The specific calculation steps were as follows.

2.4.1. Moving average smoothing

Smoothing window with s wavelengths was selected, s=1,3,,S , SN0 , N0 was the total number of wavelengths in the entire scanning region, S = 21 was set. In the smoothing window (center-symmetric), the absorbance of the central wavelength was replaced by the mean of all absorbances and the window was moved from left to right to complete the smoothing of the entire spectrum (except the leftmost and rightmost half-broadband wavelengths). The absorbance of the leftmost half-broadband wavelengths (leftmost s12 wavelengths) cannot be smoothed symmetrically. To ensure continuity, left-decreasing smoothing window 1,2,,k+s12 was used to smooth absorbance of kth wavelength xk , k=1,2,,s12 . The calculation formula was as the following Formula 5 (Yang et al., 2019; Pan et al., 2020):

xk=i=1k+s12xik+s12,k=1,2,,s12 (5)

Similarly, for the absorbance of the rightmost half-broadband wavelengths (rightmost s12 wavelengths), they can be smoothed using right-decreasing smoothing window.

2.4.2. Difference derivative

Variable wavelength gap was used as the difference derivative gap (g), g=1,2,,G , G=20 was set. For absorbance xk of kth wavelength, the 1st derivative ( Dxk ) of xk can be calculated by using central difference. The derivative values of all wavelengths were obtained by calculating from left to right (except the leftmost and rightmost half-broadband wavelengths). The absorbance of the leftmost half-broadband wavelengths (leftmost g wavelengths) cannot be differentiated symmetrically, and the 1st derivative was calculated by forward difference, as the following Formula 6 (Yang et al., 2019; Pan et al., 2020):

Dxk=xk+gxkg,k=1,2,,g (6)

Similarly, for the absorbance of the rightmost half-broadband wavelengths (rightmost g wavelengths), the 1st derivative was calculated by backward difference.

Based on the above 1st derivative spectrum, the above difference derivation process was performed again to obtain the 2nd derivative spectrum, and so on. Considering that the absolute value of higher-order derivatives above the 3rd was small and the spectral information content was low, it was generally not recommended to use. The derivative order was set as d=0,1,2 . In particular, d=0 means no difference derivation, that is, only moving average smoothing was performed.

2.4.3. NDF parameter optimization

For any parameter combination (d, s, g), there was a corresponding Norris derivative mode. For d=0,1,2 ; s=1,3,,21 ; g=1,2,,20 , there are a total of 11 + 2 × 11 × 20 = 451 algorithm modes. The three variable parameters with different functions made the Norris derivative spectra more diverse than the raw spectra, thus having a wider applicability. The NDF method was used to preprocess the plasma spectra. On this basis, the PLS-DA models corresponding to all NDF modes were established, and the optimal NDF parameters (d, s, g) were determined according to the recognition-accuracy rates on the prediction set ( RARP ). In addition, the NDF preprocessed spectra were also applied to the kNN modeling.

2.5. Separation degree priority combination

For the given wavelength range (number of wavelengths: n), all wavelengths were represented in ascending order as: λ1,λ2,,λn . For any wavelength, the separation degree between the two types of spectra was calculated using the following Formula 7 (Yuan et al., 2023; Pan et al., 2023):

Sλ=AMinλAMax+λ,AAveλAAve+λAMin+λAMaxλ,AAveλ<AAve+λ (7)

The spectral separation degree quantified the discriminative capacity between two spectral populations. The wavelengths with higher separation degrees were expected to contribute more effectively to classification performance. Based on this principle, the Separation Degree Priority Combination (SDPC) method was proposed for wavelength selection, as described below: (Jeng et al., 2023): All wavelengths were sorted by separation degree values from largest to smallest, as follows: λ1,λ2,,λn ; (Justiz Vaillant et al., 2025); According to the priority of separation degree, the combination of the first i wavelengths from 1 to i was selected; there were a total of n combinations as follows: Ωi=λ1,λ2,,λi,i=1,2,,n ; (Candotti et al., 2019); Using the above n wavelength combinations, PLS-DA and kNN as the basis elemental methods, the SDPC-PLS-DA and SDPC-kNN integrated modeling algorithms were established, respectively. According to the predicted RARP of the model, the optimal i and the corresponding wavelength combination Ωi was determined.

The determined optimal wavelength combination Ωi is typically a combination of multi-waveband. The SDPC-PLS-DA and SDPC-kNN methods were employed in wavelength selection of the first stage.

2.6. Wavelength step-by-step phase-out

Wavelength step-by-step phase-out is a backward selection method (Yang et al., 2019; Tan et al., 2020). In an existing wavelength combination, by traversing all wavelengths, the interference wavelength with the best modeling effect is determined, based on the deleted remaining wavelength model. Phase-out based on the above rules until only one wavelength remains; the local optimal model obtained from each round was evaluated to comprehensively determine the global optimal model (according to the predicted RARP of the model, the optimal wavelength combination was determined). This method can effectively remove redundant wavelengths, and combined with the PLS-DA algorithm, WSP-PLS-DA has been established and successfully applied (Yang et al., 2019; Tan et al., 2020). In the current study, kNN was also used as a basic algorithm to establish the WSP-kNN model. The WSP-PLS-DA and WSP-kNN methods were used in the wavelength selection of the second stage.

The above-mentioned algorithms were constructed using MATLAB version 2020b software.

3. Results and discussion

3.1. Full spectral PLS-DA and kNN models with and without spectral preprocessing

The Vis-NIR spectra of OBI and normal controls’ plasma samples in the full spectral range (333–1118 nm) are shown in Figure 1a. The green and red curves represent the spectra of the two types of samples, respectively. Significant baseline drift in the spectra was observed. Using raw spectra, PLS-DA and kNN discriminant models for OBI-normal control were established. Their training and prediction discrimination results are summarized in Tables 3, 4, respectively.

FIGURE 1.

Grouped figure with three line graphs comparing absorbance versus wavelength for OBI (red lines) and normal control (green lines). Panel (a) displays raw absorbance spectra, while panels (b) and (c) show processed data with reduced vertical scale, emphasizing subtle spectral differences between groups.

Vis-NIR spectra of all OBI and normal control plasma samples: (a) Raw spectra; (b) SNV-NDF combination preprocessing spectra (d = 2, s = 21, g = 2); (c) SNV-NDF combination preprocessing spectra (d = 2, s = 3, g = 3).

TABLE 3.

Modeling effects of PLS-DA models for discriminating OBI-normal control with and without spectral preprocessing.

Method N Lv RART RART+ RART RARP RARP+ RARP
Raw spectra 522 8 94.1% 100.0% 97.1% 85.0% 96.7% 90.8%
Preprocessed spectra 522 5 91.8% 100.0% 95.9% 93.3% 100.0% 96.7%

TABLE 4.

Modeling effects of kNN models for discriminating OBI-normal control with and without spectral preprocessing.

Method N k RARP RARP+ RARP
Raw spectra 522 3 73.3% 76.7% 75.0%
Preprocessed spectra 522 3 96.7% 100.0% 98.3%

The SNV was first used for spectral preprocessing; based on the SNV spectra, the NDF was used for further preprocessing, and the SNV-NDF combined spectra based on all the NDF modes were used to establish PLS-DA and kNN models. According to the optimal discriminative effect, the optimal Norris parameter combination was selected, and the optimal NDF parameter combinations for the PLS-DA and kNN models were d = 2, s = 21, g = 2, and d = 2, s = 3, g = 3, respectively. The two sets of preprocessed spectra are shown in Figures 1b,c, and the spectral baseline drifts were significantly improved. The corresponding training and prediction discrimination results are also summarized in Tables 3, 4, respectively. It can be seen that after spectral preprocessing, the RARP were improved substantially to 96.7% and 98.3%, respectively. However, the number of wavelengths N in the full spectrum was as high as 522, necessitating simplification of the wavelength model.

3.2. SDPC-PLS-DA and SDPC-kNN models

Based on SNV-NDF spectra, the SDPC method was used to establish SDPC-PLS-DA and SDPC-kNN models to determine the wavelength combinations with the optimal discriminative effect. Using the separation degree method, the separation spectra (333–1118 nm) of the two types of modeling samples were calculated, as shown in Figures 2a,c. For the optimal SNV-NDF spectra for the PLS-DA and kNN models, the wavelengths with the highest separation degree, 592 nm and 584 nm, were taken as examples to observe the separation of the two types of spectra. The preprocessed absorbance values at 592 nm and 584 nm for all 290 modeling samples were sorted in ascending order and assigned new numbers. The preprocessed absorbance values for all samples are shown with new numbering in Figures 3a,b, where the red and green dots represent OBI and normal controls’ plasma samples. It can be clearly seen that the two types of samples are almost separated at wavelengths of 592 nm and 584 nm.

FIGURE 2.

Four line graphs labeled a, b, c, and d show separation degree values versus wavelength in nanometers. Panels b and d have a green-shaded area indicating a specific value range above the x-axis. All graphs exhibit sharp dips at lower wavelengths and relatively stable or slightly negative values at higher wavelengths.

Separation degree spectrum of the two spectral populations after preprocessing in modeling: (a) 333–1118 nm (d = 2, s = 21, g = 2); (b) Selected waveband combination of the optimal SDPC-PLS-DA model (d = 2, s = 21, g = 2); (c) 333–1118 nm (d = 2, s = 3, g = 3); (d) Selected waveband combination of the optimal SDPC-kNN model (d = 2, s = 3, g = 3).

FIGURE 3.

Two side-by-side scatter plots compare preprocessed absorbance for OBI (green circles) and normal controls (red stars) across serial sample numbers, with similar trends in both panels (a) and (b) showing OBI values rising at higher sample numbers.

Preprocessed absorbance at a wavelength in order from smallest to largest for two types of modeling spectra: (a) 592 nm (d = 2, s = 21, g = 2); (b) 584 nm (d = 2, s = 3, g = 3).

Using the SDPC-PLS-DA method, the optimal waveband combination (N = 398) selected was {333–344, 357, 366–373, 396, 550–1117} (nm), which can be seen in the green background part of Figure 2b, and the modeling discrimination effect is shown in Table 5. Using the SDPC-kNN method, the optimal waveband combination (N = 19) selected was {569, 584, 606–618, 635–638, 644, 680–685, 692–694} (nm), which can be seen in the green background part of Figure 2d, and the modeling discrimination effect is shown in Table 6. Compared with the full-spectrum model, the RARP of the optimal SDPC-PLS-DA and the SDPC-kNN models were both improved, while the numbers of wavelengths were greatly reduced.

TABLE 5.

Prediction discriminant effect of optimal SDPC-PLS-DA models.

Waveband combination (nm) N Lv RARP RARP+ RARP
333–344, 357, 366–373, 396, 550–1117 398 2 96.7% 100% 98.3%

TABLE 6.

Prediction discriminant effect of optimal SDPC-kNN models.

Waveband combination (nm) N k RARP RARP+ RARP
569, 584, 606–618, 635–638, 644, 680–685, 692–694 19 3 100% 100% 100%

The SDPC is a recently proposed novel spectral analysis method (Yuan et al., 2023; Pan et al., 2023). It calculates the separation spectra of two spectral populations based on the characteristics of the two types of sample spectra and sequentially combines the selected wavelengths in order of decreasing separation. The optimal wavelength combinations are determined according to the modeling performance, as shown by the green sections in Figures 2b,d, which correspond to the wavelength selections for the SDPC-PLS-DA and SDPC-kNN methods, respectively. Among these, 592 nm and 584 nm were the wavelengths with the greatest separation in the two aforementioned combinations. In Figure 3, 592 nm and 584 nm were used as examples to demonstrate the effective separation between the OBI and normal control spectral populations. Of course, information from a single wavelength is insufficient to support the classification in the OBI-normal control discriminant model. The optimal SDPC-kNN model achieved significantly better modeling performance (prediction recognition-accuracy RARP = 100%), with the corresponding wavelength combination of 584, 610, 612, 683, 607, 569, 682, 609, 692, 636, 680, 685, 606, 694, 638, 635, 644, 613, 618 (nm, N = 19, sorted by separation degree from high to low). According to literature reports, in the visible spectrum region, the absorption of hemoglobin (including oxyhemoglobin HbO2 and deoxyhemoglobin HHb) is mainly located in the 500–650 nm range (Prahl, 1999). In the current study, most of the selected wavelength combinations fall within the hemoglobin absorption bands, with the remainder corresponding to absorptions of other components, such as those related to lipid metabolism and bile acid metabolism.

3.3. SDPC-WSP-PLS-DA and SDPC-WSP-kNN models

The SDPC-based SDPC-kNN and SDPC-PLS-DA algorithms are forward optimization wavelength selection methods aimed at screening informative wavelengths; the WSP-based WSP-kNN and WSP-PLS-DA algorithms are backward optimization wavelength selection methods aimed at eliminating redundant wavelengths. SDPC and WSP were successively used in the first and second stages of wavelength selection. This wavelength selection strategy combined the advantages of forward optimization (selecting informative wavelengths) and backward optimization (eliminating redundant wavelengths). As a result, the integrated two-stage SDPC-WSP-kNN and SDPC-WSP-PLS-DA algorithms were used for modeling the plasma discrimination analysis of OBI-normal controls.

The WSP method was employed to further optimize the SDPC-PLS-DA and SDPC-kNN models. The number of wavelengths N of the optimal SDPC-WSP-PLS-DA and SDPC-WSP-kNN models significantly reduced to 26 and 5, respectively. Their modeling discrimination effects (RARP) were substantially improved to 100%. As shown in Tables 7, 8.

TABLE 7.

Prediction discriminant effect of the optimal SDPC-WSP-PLS-DA models.

Wavelengths (nm) N Lv RARP RARP+ RARP
373, 603–604, 853–854, 955, 963–967,969, 973–979, 1048–1052, 1054,1073, 1100–1107 26 2 100% 100% 100%

TABLE 8.

Prediction discriminant effect of the optimal SDPC-WSP- kNN models.

Wavelengths (nm) N k RARP RARP+ RARP
569, 584, 613, 618, 644 5 1 100% 100% 100%

3.4. External validation of the models

Plasma samples from an external validation set that were not involved in the modeling process were used to verify the optimal discriminant models of SDPC-WSP-PLS-DA and SDPC-WSP-kNN for OBI and normal controls. The accuracy of the discriminant model was calculated based on the categories of the spectral discriminant and clinical diagnosis. For the optimal SDPC-WSP-PLS-DA and SDPC-WSP-kNN models, the sensitivity SEN, specificity SPE, false negative rate FNR, false positive rate FPR, and total recognition-accuracy rate RARv on the external validation sets are summarized in Table 9. Among them, the RARv of the two models reached 98.1% and 98.7%, respectively.

TABLE 9.

Validation effect of the optimal SDPC-WSP-PLS-DA and SDPC-WSP-kNN models with SNV-NDF preprocessing.

Method SEN SPE FNR FPR RARv
SDPC-WSP-PLS-DA 96.6% 98.9% 3.4% 1.1% 98.1%
SDPC-WSP-kNN 96.6% 100% 3.4% 0% 98.7%

For the SDPC-WSP-PLS-DA model with SNV-NDF preprocessing, the predicted class variable values (S, ≥0.5, <0.5) for OBI and normal controls’ spectra and the discrimination effect are shown in Figure 4a. For the SDPC-WSP-kNN model with SNV-NDF preprocessing, the predicted class variable labels (1,0) and the discrimination effect are shown in Figure 5.

FIGURE 4.

Paired scatter plots compare sample classification results with a dividing line at categorical value zero point five. Panel a and panel b show serial number versus value, marking true positives, false positives, true negatives, and false negatives. Panel b highlights an uncertainty region around the classification line.

External validation results of the SDPC-WSP-PLS-DA model with SNV-NDF preprocessing: (a) Prediction value of class variable for OBI and normal control; (b) Schematic diagram of discrimination after introducing the gray area (G, 4.6 < S < 5.2).

FIGURE 5.

Scatter plot showing classification results for a categorical variable across samples. Points are labeled as true positive, false positive, true negative, and false negative, separated by a dotted classification line at 0.5.

Prediction class variable labels in the SDPC-WSP-kNN model with SNV-NDF preprocessing.

Notably, for the PLS-DA method, the classification line for the categorical variables of the two sample classes was established, with a threshold of 0.5. Due to the diversity of samples and random interference, it is difficult to avoid misclassifying samples near the threshold. Therefore, during validation and application, a grey judgment zone G = {4.6 < S <5.2} was introduced, as shown in the gray area in Figure 4b. Considering that false negatives lead to missed diagnoses while false positives cause misdiagnoses, false negatives pose greater harm from the perspective of disease transmission risk. Thus, it is reasonable to set the negative portion of the gray zone wider. Based on this, the clear positive and negative judgment zones were defined as D + = {S ≥ 4.6}, D = {S ≤5.2}, respectively. Under this configuration, out of 154 external validation samples, 5 samples fell into the gray zone G and will be further confirmed using other methods; the remaining 149 samples fall into the positive and negative zones D + and D , with both SEN and SPE reaching 100%, and FNR and FPR both at 0%. The method’s detection recovery rate reaches 96.8%.

The external validation results of both models demonstrated that plasma Vis-NIR spectral pattern recognition can be used to accurately discriminate between OBI and normal control.

In the current study, a real-time rapid OBI detection method based on plasma Vis-NIR spectral pattern recognition was developed. This method requires no reagents and represents an environmentally friendly green detection approach. The wavelength range for spectral measurement was 333–1118 nm, covering the entire visible and short-wave NIR spectral regions. Many components of clinical blood indicators have absorption characteristics within this spectral range (Nishadini Premasinghe et al., 2025). Moreover, photodetectors operating in this range can use high-efficiency and low-cost silicon (Si) detectors, making it possible to develop affordable related spectral systems, which facilitates the application of this technology for large-scale screening of HBV.

On the other hand, this study also proposes a high-precision detection model that requires only a few wavelength data points, providing a core solution for developing small OBI-specific spectral detection instruments. In recent years, with the rapid advancement of spectral detection equipment, the hardware cost in the visible to short-wave near-infrared spectral range has decreased significantly (even below 10,000 RMB), which is far lower than that of the main equipment currently used for OBI detection (nucleic acid testing). Moreover, this method does not require reagents, allows for rapid measurement, can instantly transfer spectral data to algorithm software, and displays results through embedded media, completing the entire process in just a few minutes. This is something that the vast majority of current clinical testing methods find difficult to achieve.

Due to the low cost of the hardware and the fact that no reagents are required, the corresponding testing costs will also decrease significantly. This makes the method suitable for rapid testing of large populations, which is difficult to achieve with nucleic acid methods. Of course, this method needs to be calibrated using standard clinical testing methods (hepatitis B surface antigen and nucleic acids). It cannot replace nucleic acid methods, but in terms of application, it is a novel and effective supplement, mainly in terms of rapid testing and suitability for large populations.

Regarding spectral pattern recognition, PLS-DA and kNN were used as the basic classifier algorithms. PLS combines the advantages of principal component analysis (PCA) and multiple linear regression (MLR). When combined with categorical variable discriminant analysis (DA), PLS-DA has demonstrated excellent modeling performance in many spectral discriminant analysis applications. kNN is a supervised learning method based on the Euclidean distance of spectral curves, offering the advantages of geometric interpretability and algorithmic simplicity. Combined with spectral preprocessing, kNN has also shown excellent modeling performance in many spectral discriminant analysis applications. In the current study, PLS-DA and kNN were used as the base classifier algorithms, and combined with wavelength selection, the plasma spectral discriminant models for OBI-normal control achieved 100% modeling accuracy, indicating that the choice of base algorithms was appropriate. On the other hand, once the wavelength combination model and the parameters of the base algorithms (N, k, Lv) are determined, the PLS-DA and kNN algorithms are very simple. These algorithms can be integrated into the embedded media of the spectrometer, allowing for rapid implementation of spectral measurements and discriminant model computation.

4. Conclusion

Occult hepatitis B virus infection individuals test negative for HBsAg, which requires additional HBV DNA testing for detection. Due to the relative complexity and cost of HBV DNA testing, it is typically excluded from general health examinations, leading to potential underdiagnosis of OBIs. In this study, Vis-NIR spectroscopy combined with novel chemometric methods was employed to establish a plasma discriminant analysis model for distinguishing OBI from normal controls in the HBsAg-negative population.

kNN and PLS-DA were employed as base classifiers; combination optimization of SNV-NDF was applied for spectral preprocessing; SDPC combined with WSP was used for two-stage wavelength selection. Consequently, the SDPC-WSP-KNN and SDPC-WSP-PLS-DA integrated optimization models were established and successfully applied for precise discrimination between OBI and normal controls’ plasma samples. Based on the optimal SNV-NDF spectra, the SDPC-WSP-KNN and SDPC-WSP-PLS-DA methods determined the optimal number of wavelengths N to be 5 and 26, respectively. Using independent external validation samples, the SDPC-WSP-kNN model demonstrated better robustness, with sensitivity, specificity, and total recognition-accuracy rate of 96.6%, 100%, and 98.7%, respectively. By introducing a grey judgment zone, both SEN and SPE reached 100%, with a detection recovery rate of 96.8%. In particular, the number of wavelengths used for modeling was smaller (N = 5), resulting in a simpler wavelength model.

The results indicate that plasma Vis-NIR spectral pattern recognition can be used for accurate discrimination between OBI and normal control. This method is rapid, reagent-free, and convenient, making it suitable for large-scale, low-cost rapid screening of OBI. The integrated modeling strategy proposed in this study, which considers both forward (SDPC) and backward (WSP) wavelength selection, may also apply to spectral analysis of other subjects. The proposed wavelength combination model can provide an important reference for the development of small dedicated blood analyzers for OBI.

Funding Statement

The author(s) declared that financial support was received for this work and/or its publication. This work was funded by The National Natural Science Foundation of China (82460611; 62476112), Guangxi Science and Technology Program under Grant (No.2025FNFN96416; No.AB19110007), the Innovation Project of Guangxi Graduate Education (No.YCSW2025253) and Wu Jieping Medical Foundation Research Special Grant Fund (No.320.6750.2025-06-161).

Footnotes

Edited by: Helena Maria Rodrigues Gonçalves, Chemistry and Technology Network (REQUIMTE), Portugal

Reviewed by: Song Shi, National Center for Clinical Laboratories (NCCL), China

Jessenia Sabrina Navas Castillo, Hospital Roosevelt, Guatemala

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Ethics Committee of the Nanning Blood Center. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

LbH: Writing – original draft, Writing – review and editing, Data curation, Formal Analysis, Investigation, Validation, Visualization. XH: Writing – original draft, Data curation, Formal Analysis, Investigation, Software, Validation, Visualization, Writing – review and editing. JX: Writing – original draft, Data curation, Visualization. LiH: Writing – original draft, Data curation, Visualization. HZ: Writing – original draft, Data curation, Resources, Writing – review and editing. MiC: Writing – original draft, Data curation, Formal Analysis, Investigation, Software, Validation, Visualization. BH: Writing – original draft, Conceptualization, Resources. MeC: Writing – original draft, Resources. QM: Writing – review and editing, Resources. TP: Writing – review and editing, Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft. CO: Funding acquisition, Investigation, Writing – original draft, Writing – review and editing, Conceptualization, Formal Analysis, Methodology, Project administration, Supervision.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  1. Alshrari A. S., Hudu S. A., Shinkafi S. H., Tahir A., Raji H. Y., Jimoh A. O. (2025). Prevalence and transfusion risks of occult hepatitis B infection among HBcAb-Positive blood donors in a high-endemic region. Diagn. (Basel) 15 (4), 486. 10.3390/diagnostics15040486 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Barnes Msd R. J., Lister S. J. (1989). Standard normal variate transformation and De-trending of near-infrared diffuse reflectance spectr. Appl. Spectrosc. 43, 772–777. 10.1366/0003702894202201 [DOI] [Google Scholar]
  3. Candotti D., Assennato S. M., Laperche S., Allain J. P., Levicnik-Stezinar S. (2019). Multiple HBV transfusion transmissions from undetected occult infections: revising the minimal infectious dose. Gut 68 (2), 313–321. 10.1136/gutjnl-2018-316490 [DOI] [PubMed] [Google Scholar]
  4. Chen J., Ma J., Han X., Zhou Y., Xie B., Huang F., et al. (2023). Rapid detection of hepatitis B virus DNA level based on interval-point data fusion of infrared spectra. J. Biophot. 16 (3), e202200251. 10.1002/jbio.202200251 [DOI] [PubMed] [Google Scholar]
  5. Chi K., Lin J., Chen M., Chen J., Chen Y., Pan T. (2024). Changeable moving window-standard normal variable transformation for visible-NIR spectroscopic analyses. Spectrochim. Acta A Mol. Biomol. Spectrosc. 308, 123726. 10.1016/j.saa.2023.123726 [DOI] [PubMed] [Google Scholar]
  6. Ciloglu F. U., Saridag A. M., Kilic I. H., Tokmakci M., Kahraman M., Aydin O. (2020). Identification of methicillin-resistant Staphylococcus aureus bacteria using surface-enhanced raman spectroscopy and machine learning techniques. Analyst 145 (23), 7559–7570. 10.1039/d0an00476f [DOI] [PubMed] [Google Scholar]
  7. Cooke G. S., Flower B., Cunningham E., Marshall A. D., Lazarus J. V., Palayew A., et al. (2024). Progress towards elimination of viral hepatitis: a lancet gastroenterology and hepatology commission update. Lancet Gastroenterol. Hepatol. 9 (4), 346–365. 10.1016/S2468-1253(23)00321-7 [DOI] [PubMed] [Google Scholar]
  8. Dhanoa M. S., Lister S. J., Sanderson R., Barnes R. J. (1994). The link between multiplicative scatter correction (MSC) and standard normal variate (SNV) transformations of NIR spectra. J. Near Infrared Spectrosc. 2, 43, 47. 10.1255/jnirs.30 [DOI] [Google Scholar]
  9. Fonseca A. C. S., Pereira J. F. Q., Honorato R. S., Bro R., Pimentel M. F. (2022). Hierarchical classificationik models and handheld NIR spectrometer to human blood stains identification on different floor tiles. Spectrochim. Acta A Mol. Biomol. Spectrosc. 267 (Pt 1), 120533. 10.1016/j.saa.2021.120533 [DOI] [PubMed] [Google Scholar]
  10. Gajjar K., Trevisan J., Owens G., Keating P. J., Wood N. J., Stringfellow H. F., et al. (2013). Fourier-transform infrared spectroscopy coupled with a classification machine for the analysis of blood plasma or serum: a novel diagnostic approach for ovarian cancer. Analyst. 138 (14), 3917–3926. 10.1039/c3an36654e [DOI] [PubMed] [Google Scholar]
  11. Hong X., Luckenbaugh L., Mendenhall M., Walsh R., Cabuang L., Soppe S., et al. (2021). Characterization of hepatitis B Precore/core-related antigens. J. Virol. 95 (3). 10.1128/JVI.01695-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Huang L., He B., Mo Q., Li B., Yan X., Lai R., et al. (2025). Transfusion-transmitted occult hepatitis B virus infection: current understanding, challenges, and its implication in blood safety. Front. Immunol. 16, 1663306. 10.3389/fimmu.2025.1663306 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Hussein N. R., Mahfodh D. S. (2025). The current situation of organ donation from occult hepatitis B infection (OBI) donors: a perspective from Iraq. EClinicalMedicine 84, 103284. 10.1016/j.eclinm.2025.103284 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Im Y. R., Jagdish R., Leith D., Kim J. U., Yoshida K., Majid A., et al. (2022). Prevalence of occult hepatitis B virus infection in adults: a systematic review and meta-analysis. Lancet Gastroenterol. Hepatol. 7 (10), 932–942. 10.1016/S2468-1253(22)00201-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Jeng W. J., Papatheodoridis G. V., Lok A. S. F. (2023). Hepatitis B. Lancet. 401 (10381), 1039–1052. 10.1016/S0140-6736(22)01468-4 [DOI] [PubMed] [Google Scholar]
  16. Jiang X., Tian J., Song L., Meng J., Yang Z., Qiao W., et al. (2024). Multi-omic molecular characterization and diagnostic biomarkers for occult hepatitis B infection and HBsAg-positive hepatitis B infection. Front. Endocrinol. (Lausanne) 15, 1409079. 10.3389/fendo.2024.1409079 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Justiz Vaillant A. A., Zubair M., Sticco K. I. L. (2025). Transfusion transmitted disease. StatPearls. Available online at: https://www.ncbi.nlm.nih.gov/pubmed/30860698 (November 1, 2025). [Google Scholar]
  18. Kramvis A., Chang K. M., Dandri M., Farci P., Glebe D., Hu J., et al. (2022). A roadmap for serum biomarkers for hepatitis B virus: current status and future outlook. Nat. Rev. Gastroenterol. Hepatol. 19 (11), 727–745. 10.1038/s41575-022-00649-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Mabwa D., Gajjar K., Furniss D., Schiemer R., Crane R., Fallaize C., et al. (2021). Mid-infrared spectral classification of endometrial cancer compared to benign controls in serum or plasma samples. Analyst 146 (18), 5631–5642. 10.1039/d1an00833a [DOI] [PubMed] [Google Scholar]
  20. Nguyen M. H., Wong G., Gane E., Kao J. H., Dusheiko G. (2020). Hepatitis B virus: advances in prevention, diagnosis, and therapy. Clin. Microbiol. Rev. 33 (2). 10.1128/CMR.00046-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Nishadini Premasinghe C., Broeckelmann C., Kolobaric A., Cozzolino D., Chapman J. (2025). Artificial intelligence–enhanced near infrared (NIR) spectroscopy for rapid diagnostics of infectious diseases. Appl. Spectrosc. Rev., 1–28. 10.1080/05704928.2025.2570816 [DOI] [Google Scholar]
  22. Norris K. H. (2001). Applying norris derivatives. Understanding and correcting the factors which affect diffuse transmittance spectra. NIR News 12 (3), 6–9. 10.1255/nirn.613 [DOI] [Google Scholar]
  23. Pan T., Zhang J., Shi X. (2020). Flexible vitality of near-infrared spectroscopy–talking about norris derivative filter. NIR News 31 (1-2), 24–27. 10.1177/0960336019889587 [DOI] [Google Scholar]
  24. Pan T. C. J., Li J., Yao L. (2023). A method for combining spectral wavelengths based on spectral separation degree, CN 111474128 B. March. 14. [Google Scholar]
  25. Peng J., Yao X., Yuan C., Liu X., Xia R., He J., et al. (2022). The investigation of hepatitis B vaccine immune responses in occult hepatitis B virus-infected patients. Front. Immunol. 13, 903685. 10.3389/fimmu.2022.903685 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Prahl S. (1999). Optical absorption of hemoglobin. Available online at: https://omlc.org/spectra/hemoglobin/ (November 1, 2025).
  27. Raimondo G., Locarnini S., Pollicino T., Levrero M., Zoulim F., Lok A. S., et al. (2019). Update of the statements on biology and clinical impact of occult hepatitis B virus infection. J. Hepatol. 71 (2), 397–408. 10.1016/j.jhep.2019.03.034 [DOI] [PubMed] [Google Scholar]
  28. Saitta C., Pollicino T., Raimondo G. (2022). Occult hepatitis B virus infection: an update. Viruses 14 (7), 1504. 10.3390/v14071504 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Saravanan S., Shankar E. M., Vignesh R., Ganesh P. S., Sankar S., Velu V., et al. (2024). Occult hepatitis B virus infection and current perspectives on global WHO 2030 eradication. J. Viral Hepat. 31 (7), 423–431. 10.1111/jvh.13928 [DOI] [PubMed] [Google Scholar]
  30. Schwarz C., Morel A., Matignon M., Grimbert P., Rondeau E., Ouali N., et al. (2023). Hepatitis B virus reactivation in kidney transplant recipients treated with belatacept. Kidney Int. Rep. 8 (8), 1531–1541. 10.1016/j.ekir.2023.05.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Seeger C., Mason W. S. (2015). Molecular biology of hepatitis B virus infection. Virology 479-480, 672–686. 10.1016/j.virol.2015.02.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Tan H., Liao S., Pan T., Zhang J., Chen J. (2020). Rapid and simultaneous analysis of direct and indirect bilirubin indicators in serum through reagent-free visible-near-infrared spectroscopy combined with chemometrics. Spectrochim. Acta A Mol. Biomol. Spectrosc. 233, 118215. 10.1016/j.saa.2020.118215 [DOI] [PubMed] [Google Scholar]
  33. Wang H.-P., Chen P., Dai J.-W., Liu D., Li J.-Y., Xu Y.-P., et al. (2022). Recent advances of chemometric calibration methods in modern spectroscopy: algorithms, strategy, and related issues. TrAC Trends Anal. Chem. 153, 116648. 10.1016/j.trac.2022.116648 [DOI] [Google Scholar]
  34. Wu T., Zheng X., Yang M., Zhao A., Xiang H., Chen T., et al. (2022). Serum amino acid profiles predict the development of hepatocellular carcinoma in patients with chronic HBV infection. ACS Omega 7 (18), 15795–15808. 10.1021/acsomega.2c00885 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Yang Y., Lei F., Zhang J., Yao L., Chen J., Pan T. (2019). Equidistant combination wavelength screening and step-by-step phase-out method for the near-infrared spectroscopic analysis of serum urea nitrogen. J. Innovative Opt. Health Sci. 12 (06), 1950018. 10.1142/s1793545819500184 [DOI] [Google Scholar]
  36. Yao L., Xu W., Pan T., Chen J. (2018). Moving-window bis-correlation coefficients method for visible and near-infrared spectral discriminant analysis with applications. J. Innovative Opt. Health Sci. 11 (02), 1850005. 10.1142/s1793545818500050 [DOI] [Google Scholar]
  37. Ye X., Liu L., Chen L., Nie X., Huang L., Ye D., et al. (2022). High-frequency notable HBV mutations identified in blood donors with occult hepatitis B infection from heyuan city of southern China. Front. Immunol. 13, 754383. 10.3389/fimmu.2022.754383 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Yuan L., Zhang J., Xu J., Yao L., Wang D., Pan T. (2021). “Vis-NIR spectroscopic discriminant analysis applied to serum breast cancer screening,” in The international conference on NIR, 187–192. 10.1007/978-981-19-4884-8_19 [DOI] [Google Scholar]
  39. Yuan L., Chen X., Huang Y., Chen J., Pan T. (2023). Spectral separation degree method for Vis-NIR spectroscopic discriminant analysis of milk powder adulteration. Spectrochimica Acta Part A Mol. Biomol. Spectrosc. 301, 122975. 10.1016/j.saa.2023.122975 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.


Articles from Frontiers in Bioengineering and Biotechnology are provided here courtesy of Frontiers Media SA

RESOURCES