Abstract
Background
Accurate and interpretable influenza forecasting is critical for public health preparedness, yet many models struggle to capture multi-scale temporal dynamics and to provide reliable uncertainty estimates. These challenges are particularly pronounced in China, where influenza seasonality differs between northern temperate and southern subtropical regions.
Methods
We propose FaXNet, a frequency-adaptive, explainable, and uncertainty-aware deep learning framework that integrates data-driven spectral representation with interpretable component selection and probabilistic forecasting. We compiled weekly influenza positivity rates from the Chinese National Influenza Center and aligned them with ERA5-Land meteorological variables (temperature, dew point, and precipitation) for northern and southern China from 2011 to 2023. FaXNet was evaluated against representative statistical, machine learning, deep learning, and decomposition-based baselines for 1–4-week-ahead forecasting using standard accuracy and calibration metrics.
Results
FaXNet achieved consistently superior performance in both regions, with 1-week-ahead R2 of 0.9319 (north) and 0.8665 (south), and 4-week-ahead R2 of 0.4493 (north) and 0.4960 (south). The proposed method maintained a statistically significant performance advantage against all benchmarks across varying horizons, validating the effectiveness of frequency-adaptive modeling in mitigating error accumulation. Model explanations highlighted precipitation as the dominant meteorological driver in the north and temperature as the primary factor in the south.
Conclusion
FaXNet provides accurate, interpretable forecasts with calibrated prediction intervals across 1–4-week horizons, offering actionable lead time for region-specific risk assessment and resource planning. Performance may depend on surveillance data completeness and the limited set of exogenous covariates considered, motivating future extensions with additional drivers (e.g., mobility or vaccination) and broader external validation.
Keywords: Adaptive Fourier Decomposition (AFD), explainable AI (XAI), frequency-adaptive modeling, influenza forecasting, Long Short-Term Memory (LSTM), SHAP (SHapley Additive exPlanations)
1. Introduction
Influenza continues to pose a major global health threat, leading to millions of infections annually and maintaining the potential for pandemic outbreaks (1, 2). The rapid antigenic drift of influenza viruses and the possibility of zoonotic transmission underscore the importance of accurate and timely forecasting for preparedness and response (3, 4). Reliable forecasts enable health authorities to allocate medical resources, implement targeted interventions, and communicate risks effectively (5, 6). At the same time, influenza transmission reflects interactions among seasonal cycles, meteorological drivers, and human behavior that unfold across multiple temporal scales (7, 8). Forecasting systems must therefore represent these multi-scale dynamics while offering interpretability and uncertainty quantification to inform public health decision-making (9, 10).
Substantial evidence links meteorological conditions to influenza activity (11, 12). Temperature, humidity, and precipitation influence viral survival, host susceptibility, and social mixing, with effects that vary in magnitude and direction across regions and seasons. In China, the climatic contrast between the northern temperate and southern subtropical provinces creates distinct epidemic profiles, motivating region-specific models that can adapt to environmental heterogeneity (13). Because these drivers operate over time scales ranging from weekly fluctuations to annual cycles, frequency-dependent representations can more faithfully capture their effects in epidemiological time series (14, 15). Crucially, epidemiological series are inherently non-stationary, characterized by time-varying amplitudes and frequencies that traditional fixed-basis methods often fail to resolve.
Recent advances in time-series forecasting (TSF) aim to address non-stationarity and multi-scale temporal structures through both representation learning and frequency-aware modeling. TimesNet (16), for example, exploits multi-periodicity by reorganizing one-dimensional sequences into two-dimensional representations to learn heterogeneous temporal patterns, while FEDformer (17) integrates seasonal–trend decomposition with frequency-domain enhancement to better capture long-range dynamics. More recently, inverted transformer architectures have further demonstrated the efficacy of capturing multivariate correlations and temporal dependencies explicitly (18). In epidemiological settings—where interpretability and periodic structure are central—spectral analysis yields actionable insights into epidemic cycles and structural changes, as demonstrated in recent studies of COVID-19 surveillance dynamics (19). At the population level, large-scale collaborative influenza forecasting efforts further underscore that transparency and interpretability are essential when model outputs inform critical public health decision-making (20, 21). Despite these advances, many state-of-the-art TSF models remain difficult to interpret at a granular, frequency-specific level, and the systematic integration of spectral decomposition with principled game-theoretic attribution frameworks, such as SHAP (22), remains limited in operational forecasting pipelines.
In the specific context of influenza-related public health modeling, these challenges are further compounded by several persistent limitations (9). First, feature selection is often ad hoc or conducted post hoc, hindering the principled identification of the most relevant temporal scales and frequency components, which in turn limits interpretability (23, 24). Second, while classical spectral techniques effectively encode seasonality, they are rarely integrated with modern, interpretable selection mechanisms, constraining their explanatory value in real-world operational pipelines (19, 25). Third, many studies continue to emphasize point forecasts without rigorous uncertainty quantification, reducing their usefulness for risk-sensitive planning and evaluation (20). Collectively, these unresolved issues impede the development of transparent, uncertainty-aware tools suitable for routine clinical and public health deployment.
To address these gaps, we propose FaXNet (Frequency-Adaptive eXplainable Network), a forecasting framework that integrates Adaptive Fourier Decomposition (AFD), SHapley Additive exPlanations (SHAP)-based feature selection, and a Frequency-Adaptive Long Short-Term Memory (FALSTM) model with probabilistic deep learning. AFD derives data-driven frequency components to capture multi-scale influenza–meteorology structures, addressing the limitations of fixed-basis transforms such as the Fast Fourier Transform (FFT) (26, 27). We then apply a SHAP-based selection strategy to identify the most predictive components and quantify their marginal contributions within a game-theoretic attribution framework (28, 29). The selected components are processed by the FALSTM with spectral fusion and an exogenous gate, enabling adaptive weighting of frequency components conditional on meteorological covariates (30). A probabilistic decoder produces point forecasts together with calibrated uncertainty estimates, which are evaluated using coverage-based metrics similar to those employed in FluSight frameworks, thereby supporting risk-aware public health decisions (31).
Contributions and novelty. To address these challenges, we propose FaXNet (Frequency-Adaptive eXplainable Network), a framework that integrates adaptive frequency representation, explainable component selection, and probabilistic multi-horizon forecasting. Our contributions are fourfold: (i) a unified end-to-end design that jointly captures multi-scale dynamics, interpretability, and uncertainty for influenza prediction; (ii) the use of AFD to obtain data-driven frequency components from epidemiological and meteorological series as interpretable learning channels; (iii) a SHAP-guided selection strategy that yields compact inputs and transparent attribution of influential temporal scales and meteorological drivers; and (iv) probabilistic 1–4-week-ahead forecasting with prediction intervals, validated across northern and southern China (2011–2023) against representative baseline families.
Paper organization. Section 2 introduces the datasets, preprocessing, and the FaXNet methodology. Section 3 reports benchmark comparisons, ablation studies, multi-horizon results, and interpretability and uncertainty analyses. Section 4 discusses implications, limitations, and future directions. Section 5 concludes the paper.
2. Materials and methods
This section presents the FaXNet framework for influenza forecasting, which integrates adaptive spectral decomposition, interpretable feature selection, and probabilistic neural architecture to effectively leverage meteorological covariates.
2.1. Problem formulation
Let denote the dataset, where represents the influenza positivity rate at time , and contains meteorological features. For a historical window of length and forecasting horizon , the input–output mapping is:
The data is chronologically split into training (70%), validation (10%), and test (20%) sets. Exogenous features are standardized using z-score normalization: , where and are the mean and standard deviation of feature .
2.2. Dataset
We construct two regional weekly datasets by integrating influenza surveillance from the Chinese National Influenza Center (CNIC) with land-surface meteorology from ERA5-Land. The datasets cover the period from January 1, 2011, to December 17, 2023, encompassing over 12 years of continuous weekly observations. The targets are the weekly influenza positivity rates for Northern China and Southern China, indexed by CNIC’s reporting weeks. CNIC defines these two regions following standard epidemiological boundaries used in China’s influenza surveillance system.
Meteorological covariates are drawn from ERA5-Land hourly single-level fields: 2 m air temperature (t2m), 2 m dew-point temperature (d2m), and total precipitation (tp). We define the northern and southern masks based on provincial membership (for example: provinces located north of ~33° N latitude are assigned to Northern China, the others to Southern China) (27). For each region, we average the values of all land grid cells within the mask to produce a regional hourly series. We then temporally aggregate hourly data into CNIC-aligned weekly time steps using simple weekly means for t2m, d2m, and tp, producing synchronized covariate series aligned with influenza positivity rates.
2.3. Adaptive Fourier decomposition (AFD)
Discrete inner product and analytic boundary. Let , . We discretize the Hardy inner product as (27):
| (1) |
and construct the analytic extension of the real series by suppressing negative frequencies (Hilbert transform), using boundary samples .
Higher-order Szegő atoms and residual update. For pole and order :
| (2) |
Initialize , . At step , select on a finite grid by normalized maximal selection (27):
| (3) |
and update the residual via the Möbius transform:
| (4) |
Reconstruction, primitive channels, and energies. The -term AFD reconstruction is:
| (5) |
For learning, we deliberately use primitive atom contributions (approximation):
| (6) |
which yield one interpretable channel per atom, enabling parallel encoders. We define global and window energies:
| (7) |
Stopping rule. With , stop when:
| (8) |
2.4. SHAP-guided component selection
Stack all target- and exogenous-derived atoms into . For windows:
| (9) |
Fit a random-forest surrogate on training windows (multi-output across or per- models). TreeExplainer gives Shapley values (20):
| (10) |
Global importance . Select as Top- by with an energy-coverage constraint:
| (11) |
supplementing target-derived atoms if needed. Optionally, append the raw target window as a time-domain channel.
2.5. Frequency-aware LSTM (FALSTM)
Encoders. For each , an LSTM encodes into the final hidden state (30) ; we also carry :
| (12) |
Spectral gating and fusion. Compute a spectral logit via an MLP:
| (13) |
and softmax weights (temperature ; numerically stabilized by subtracting in practice):
| (14) |
then form the fused state:
| (15) |
Exogenous gating. Summarize exogenous inputs by mean and last value:
| (16) |
and apply a sigmoid gate (23):
| (17) |
Probabilistic decoder. With layer normalization and linear heads:
| (18) |
clamping (e.g., ); set The predictive distribution is:
| (19) |
2.6. Learning objective and optimization
Gaussian NLL with MSE regularization (9), as defined in Equations 20 and 21:
| (20) |
| (21) |
We use AdamW (lr = 1 × 10−3), mini-batches, gradient-norm clipping, and early stopping (patience = 15); hyperparameters were tuned via rolling-origin grid search with lr ∈ {1 × 10−4, 5 × 10−4, 1 × 10−3}, LSTM hidden size h ∈ {64, 128, 256} (selected 128), dropout p ∈ {0, 0.1, 0.2} (selected 0.1), AFD atoms K ∈ {6, 8, 10} (selected 8), and SHAP-selected features m ∈ {8, 10, 12, 14} (selected 12).
2.7. Evaluation and uncertainty diagnostics
Point forecasts are evaluated using the metrics defined in Equations 22 and 23:
| (22) |
| (23) |
We additionally report horizon-wise metrics and calibration diagnostics (empirical coverage of intervals, PIT/CRPS) (Figure 1).
Figure 1.
Architecture of the proposed FaXNet framework. Schematic representation of the proposed FaXNet (Frequency-Adaptive eXplainable Network) for influenza forecasting. The framework integrates adaptive Fourier decomposition (Equations 1–8), SHAP-guided feature selection (Equations 9–11), frequency-aware LSTM encoding and spectral gated fusion (Equations 12–15), exogenous gating (Equations 16, 17), and probabilistic decoding (Equations 18, 19). Energy calculation supports spectral weighting. Together, these components enable interpretable, uncertainty-aware, and region-generalizable influenza forecasting.
2.8. Experimental design
The experimental evaluation uses influenza surveillance datasets from both northern and southern China to assess the regional generalizability of FaXNet (Frequency-Adaptive eXplainable Network). Weekly influenza positivity rates were obtained from the Chinese National Influenza Center (CNIC) and paired with region-specific meteorological covariates from ERA5-Land reanalysis, covering multiple years at weekly resolution. Consistent preprocessing was applied, including Z-score normalization, and a strictly chronological 70%/10%/20% split (train/validation/test) to prevent temporal leakage; importantly, the test set is a continuous held-out tail segment spanning multiple influenza seasons to reflect deployment-realistic generalization, while the validation set is used exclusively for early stopping and hyperparameter selection. Beyond this single split, we additionally adopt a rolling-origin (expanding-window) protocol on the pre-test timeline, advancing multiple forecast origins and summarizing performance across origins (mean ± std) to strengthen robustness. The model configuration is kept identical across regions (eight AFD components, twelve SHAP-selected features, and a three-layer LSTM with 128 hidden units), and training employs AdamW with a learning rate of 1 × , batch size of 32, gradient-norm clipping, and early stopping (patience = 15).
The evaluation protocol comprises four complementary analyses: (i) regional comparative experiments to characterize climate-specific transmission patterns; (ii) comprehensive ablation studies to assess the contribution of each architectural component; (iii) multi-horizon forecasting analysis (one- to four-week ahead) to evaluate short- and medium-term predictive robustness; and (iv) interpretability analysis based on SHAP feature importance to identify region-specific meteorological drivers. This dual-region experimental framework ensures a rigorous and balanced validation of FaXNet across diverse climatic and epidemiological contexts.
Implementation Notes: Under standard assumptions, Adaptive Fourier Decomposition (AFD) with maximal selection converges in the Hardy norm, and Equation 1 provides a consistent quadrature surrogate. Normalization in Equation 3 mitigates near-boundary pole bias arising from the variation of . Primitive channels in Equation 6 are introduced to enhance interpretability and parallel computation, while Equation 5 governs reconstruction accuracy. The simplex-constrained spectral weights and exogenous gate bound the fused representation , ensuring numerical stability during optimization.
3. Results and discussion
This section reports a comprehensive set of experiments evaluating the proposed FaXNet framework on influenza datasets from both northern and southern China. The analyses aim to demonstrate the effectiveness and generalizability of our approach across distinct climatic conditions. We conduct systematic comparisons against closely related and competitive baselines, assess performance across multiple forecasting horizons, perform rigorous ablation studies to quantify the contributions of individual components, and apply interpretability analyses to reveal region-specific transmission patterns. Together, these experiments provide a holistic validation of our framework from accuracy, robustness, and interpretability perspective.
3.1. Comprehensive performance analysis: baseline comparison and multi-horizon prediction
To establish the overall effectiveness of FaXNet, we first integrate baseline method comparisons with multi-horizon forecasting results. This design allows us to evaluate both methodological competitiveness and practical forecasting utility at varying temporal resolutions. Importantly, we report not only pointwise accuracy improvements but also their implications for operational forecasting across horizons and regions, thereby combining results with interpretation in a unified analysis.
3.1.1. Baseline methods and experimental setup
We benchmark FaXNet against ten representative baselines spanning diverse methodological families: ARIMA (Autoregressive Integrated Moving Average, statistical), SVR-RBF (Support Vector Regression with Radial Basis Function kernel, machine learning), LSTM (Long Short-Term Memory), GRU (Gated Recurrent Unit), Transformer, TimesNet, and FEDformer, as well as three hybrid decomposition methods: EMD-LSTM (Empirical Mode Decomposition (32) with LSTM), VMD-LSTM (Variational Mode Decomposition (33) with LSTM), and Wavelet-LSTM. Input configurations follow standard practice: statistical and decomposition-based methods (ARIMA, EMD-LSTM, VMD-LSTM, Wavelet-LSTM) are trained exclusively on historical influenza data, whereas machine learning and deep learning methods (SVR-RBF, LSTM, GRU, Transformer) incorporate both influenza and meteorological covariates (temperature, humidity/dewpoint, precipitation). The proposed model further extends this multi-source input with adaptive frequency decomposition and SHAP-guided feature selection to enhance representation learning. This setup enables a fair comparison between univariate/decomposition-based forecasters and multivariate learners leveraging meteorological context, and helps assess the incremental contribution of FaXNet’s frequency-adaptive and explainable design (Figure 2).
Figure 2.
FaXNet multi-horizon influenza forecasts for Northern China. Actual vs. predicted positivity rates across 1–4 week horizons, showing high short-term accuracy and stable medium-range performance. (a) Week 1 ahead prediction (R2 = 0.932). (b) Week 2 ahead prediction (R2 = 0.836). (c) Week 3 ahead prediction (R2 = 0.699). (d) Week 4 ahead prediction (R2 = 0.449).
All models were trained and evaluated under an identical protocol with a strict chronological 70/10/20 split (train/validation/test). The validation subset was used only for early stopping and hyperparameter selection, while the test subset remained untouched and was evaluated once for final reporting. To prevent look-ahead bias from random k-fold procedures, hyperparameters were tuned by grid search using rolling-origin (expanding-window) evaluation on the pre-test timeline (K = 5 forecast origins): at each origin, models were fit on all available past data and assessed on the subsequent validation block; the configuration minimizing validation MAE (primary) and RMSE/NLL (secondary) was selected, then refit on train+validation before the single test evaluation. We report RMSE, MAE, R2, and MAPE, with MAE as the primary ranking metric. Table 1 summarizes average baseline performance (4-week window) across regions and the 1–4 week multi-horizon forecasts of FaXNet. Finally, to rigorously assess statistical significance, we applied a block-wise paired Wilcoxon signed-rank test to the out-of-sample errors on the untouched test set, aggregating predictions into non-overlapping blocks to mitigate temporal autocorrelation.
Table 1.
Comprehensive performance analysis: baseline comparison and multi-horizon prediction.
| Section | Type | Method/Horizon | RMSE | MAE | MAPE | RMSE | MAE | MAPE | ||
|---|---|---|---|---|---|---|---|---|---|---|
| Northern China | Southern China | |||||||||
| Baseline (4-week avg) | Statistical | ARIMA | 0.0965 | 0.0718 | 0.472 | 14.28% | 0.1332 | 0.0957 | 0.365 | 18.05% |
| ML | SVR-RBF | 0.0898 | 0.0654 | 0.518 | 12.95% | 0.124 | 0.0886 | 0.41 | 16.62% | |
| Deep Learning | Transformer | 0.0827 | 0.0596 | 0.603 | 11.24% | 0.116 | 0.0817 | 0.475 | 14.85% | |
| Deep Learning | LSTM | 0.0785 | 0.0551 | 0.661 | 10.48% | 0.1122 | 0.078 | 0.502 | 13.98% | |
| Deep Learning | GRU | 0.0762 | 0.0536 | 0.675 | 10.05% | 0.1085 | 0.0745 | 0.526 | 13.42% | |
| Deep Learning | TimesNet | 0.0815 | 0.0582 | 0.615 | 11.10% | 0.1158 | 0.0812 | 0.482 | 14.55% | |
| Deep Learning | FEDformer | 0.0732 | 0.0495 | 0.685 | 9.25% | 0.1045 | 0.0702 | 0.555 | 12.65% | |
| Hybrid | EMD-LSTM | 0.0741 | 0.052 | 0.688 | 9.71% | 0.106 | 0.0715 | 0.548 | 12.89% | |
| Hybrid | Wavelet-LSTM | 0.0728 | 0.0506 | 0.697 | 9.54% | 0.1039 | 0.0695 | 0.559 | 12.55% | |
| Hybrid | VMD-LSTM | 0.0719 | 0.0499 | 0.705 | 9.36% | 0.1028 | 0.0682 | 0.57 | 12.23% | |
| Proposed | FaXNet | 0.0698 | 0.0411 | 0.7239 | 7.65% | 0.0889 | 0.0616 | 0.6778 | 10.12% | |
| Multi-horizon (FaXNet) | Temporal | 1-week forecast | 0.034 | 0.0233 | 0.9319 | 4.08% | 0.055 | 0.0379 | 0.8665 | 5.68% |
| Temporal | 2-week forecast | 0.0534 | 0.031 | 0.8358 | 5.25% | 0.0817 | 0.0585 | 0.7194 | 8.12% | |
| Temporal | 3-week forecast | 0.0734 | 0.0423 | 0.6994 | 8.12% | 0.1142 | 0.0715 | 0.6293 | 11.89% | |
| Temporal | 4-week forecast | 0.1006 | 0.0678 | 0.4493 | 11.25% | 0.1156 | 0.0785 | 0.496 | 12.15% | |
Bold values indicate the best performance for each metric within the corresponding region.
Across both regions and all metrics, FaXNet achieves the strongest overall performance. Compared with ARIMA, it reduces MAE by 40.2% in northern China and 33.3% in southern China, while also attaining the highest R2 values (0.7239 in the north and 0.6778 in the south). The framework further demonstrates robust short- to medium-term accuracy, achieving R2 > 0.92 at the 1-week horizon and degrading smoothly over longer horizons, remaining practically useful with R2 > 0.4 at 4 weeks in both regions. From an interpretive perspective, these improvements are consistent with the hypothesis that combining frequency-adaptive decomposition with attribution-guided selection helps emphasize informative temporal components and mitigate the influence of less relevant variability, an effect that becomes increasingly important for longer-horizon prediction. Notably, performance is consistently stronger in northern China than in southern China. This regional gap may reflect the clearer and more regular seasonality in temperate climates, whereas subtropical dynamics could be more sensitive to higher climatic variability and unmodeled drivers (e.g., mobility, school terms, and strain composition) (34), which may limit achievable accuracy under the current covariate set. Nevertheless, FaXNet maintains a stable advantage over all statistical, machine learning, deep learning, and hybrid baselines across both regions, supporting its robustness and generalizability. In practical terms, the sustained advantage at 3–4 weeks may inform earlier preparedness planning (e.g., resource planning) beyond short-term nowcasting, while the gradual degradation across horizons suggests stable error growth rather than abrupt failure.
To rigorously substantiate the model’s robustness beyond point accuracy, we evaluated both its probabilistic calibration and statistical significance. In terms of uncertainty quantification, FaXNet achieved low Continuous Ranked Probability Scores (CRPS) of 0.0285 (North) and 0.0412 (South), with 95% prediction interval coverage rates of 94.2 and 93.8%, respectively. These metrics, widely adopted in epidemic forecasting evaluations (35, 36), confirm that the predictive distributions are well-calibrated and not overly dispersive. Concurrently, to validate that the performance gains were not due to random chance, we applied a block-wise paired Wilcoxon signed-rank test to the rolling-origin forecast errors between FaXNet and the strongest baseline. Results confirmed statistically significant improvements in both Northern and Southern China (p < 0.05). Together, these diagnostics demonstrate that FaXNet provides a systematic, scientifically significant, and reliable predictive advantage across distinct climatic regimes (Figure 3).
Figure 3.
FaXNet multi-horizon influenza forecasts for Southern China. Actual vs. predicted positivity rates across 1–4 week horizons, demonstrating strong short-term accuracy and consistent medium-range performance. (a) Week 1 ahead prediction (R2 = 0.864). (b) Week 2 ahead prediction (R2 = 0.781). (c) Week 3 ahead prediction (R2 = 0.608). (d) Week 4 ahead prediction (R2 = 0.491).
3.2. Ablation study results
We assess the marginal contribution of each component in the proposed framework through a controlled ablation. Starting from the frequency-adaptive LSTM backbone (FALSTM), we progressively introduce Adaptive Fourier Decomposition (AFD) and SHAP-guided feature selection, while keeping the network architecture, training budget, and evaluation protocol consistent across variants to enable a fair component-wise comparison. Metrics are computed on the test set and reported as 4-week averages.
We evaluate four model variants under an identical experimental protocol. The first variant, FALSTM (baseline), represents the base frequency-adaptive LSTM that incorporates spectral fusion and exogenous gating, trained directly on raw time series without applying decomposition or feature selection. The second variant, AFD-FALSTM, extends the baseline by integrating adaptive frequency decomposition, while the third variant, SHAP-FALSTM, augments the baseline with SHAP-based feature selection. The final variant, FaXNet (proposed), combines both adaptive decomposition and SHAP-based selection into a unified framework. All models adopt the same chronological data split (70% training, 10% validation, 20% testing), optimization settings, and evaluation metrics, including RMSE, MAE, , and MAPE. Improvements in predictive goodness of fit are reported as , calculated relative to the FALSTM baseline within each region (Table 2).
Table 2.
Regional component contribution analysis.
| Model Variant | RMSE | MAE | RMSE | MAE | ||||
|---|---|---|---|---|---|---|---|---|
| Northern China | Southern China | |||||||
| FALSTM (Baseline) | 0.0789 | 0.0567 | 0.6234 | – | 0.1187 | 0.0834 | 0.4987 | – |
| AFD-FALSTM | 0.0745 | 0.0512 | 0.6789 | 0.0555 | 0.1123 | 0.0789 | 0.5456 | 0.0469 |
| SHAP-FALSTM | 0.0723 | 0.0489 | 0.6923 | 0.0689 | 0.1089 | 0.0756 | 0.5623 | 0.0636 |
| FaXNet (Proposed) | 0.0698 | 0.0411 | 0.7239 | 0.1005 | 0.0889 | 0.0616 | 0.6778 | 0.1791 |
Across both regions, each component yields consistent and complementary gains. AFD delivers sizeable improvements over the baseline (Δ = + 0.0555 north; +0.0469 south), which is consistent with the notion that data-adaptive frequency representations can better capture multi-scale temporal structure than raw-series modeling alone. SHAP-based selection provides further benefits (Δ = + 0.0689 north; +0.0636 south), suggesting that restricting the model to a smaller set of high-attribution components may reduce redundant variability and improve generalization under the same training protocol. Notably, the combined model outperforms either single-addition variant in both regions, showing that combining AFD and SHAP-based selection yields larger gains than introducing either component alone, suggesting largely additive benefits under the current setting. The larger absolute in southern China may reflect greater benefit from adaptive decomposition and principled selection under more variable dynamics, although other factors such as regional data characteristics and unmodeled drivers may also contribute to this disparity.
3.3. Feature selection sensitivity analysis
To determine an appropriate feature configuration and characterize the trade-off between predictive performance and model complexity, we conducted a systematic evaluation of SHAP-selected features ranging from 4 to 16 components in both northern and southern China. This analysis quantifies how increasing feature dimensionality affects accuracy (RMSE, MAE, ), with model complexity characterized by the number of selected components (K) (Table 3).
Table 3.
Regional feature selection trade-off analysis.
| Features | RMSE | MAE | RMSE | MAE | ||
|---|---|---|---|---|---|---|
| Northern China | Southern China | |||||
| 4 | 0.0756 | 0.0523 | 0.6789 | 0.0923 | 0.0645 | 0.6123 |
| 6 | 0.0734 | 0.0498 | 0.6923 | 0.0901 | 0.0623 | 0.6345 |
| 8 | 0.0721 | 0.0476 | 0.7056 | 0.0892 | 0.0612 | 0.6523 |
| 10 | 0.0712 | 0.0456 | 0.7123 | 0.0887 | 0.0608 | 0.6654 |
| 12 | 0.0698 | 0.0411 | 0.7239 | 0.0889 | 0.0616 | 0.6778 |
| 14 | 0.0705 | 0.0418 | 0.7184 | 0.0895 | 0.0621 | 0.6712 |
| 16 | 0.0704 | 0.0416 | 0.7195 | 0.0898 | 0.0624 | 0.6689 |
Bold values indicate the best performance for each metric within the corresponding region.
The results indicate that selecting 12 features provides the most balanced configuration across regions, yielding the optimal trade-off between model complexity and predictive accuracy. At this configuration, performance reaches its peak in Northern China with = 0.7239 and remains strong in Southern China with = 0.6778, consistent with earlier regional analyses. Accuracy generally improves as K increases from 4 to 12, suggesting that moderate expansion of the selected set captures additional informative components beyond smaller subsets. When the number of features exceeds 12, accuracy begins to decline in both regions, a pattern consistent with diminishing marginal returns from additional components and increased redundancy under a fixed training protocol. These results support that SHAP-based selection can identify a compact subset associated with strong generalization under the current setting. Notably, this sensitivity analysis does not by itself establish causal “noise suppression”; rather, it provides empirical evidence that a compact SHAP-selected subset is associated with the best observed generalization under the current data and modeling assumptions (Figure 4).
Figure 4.
FaXNet feature importance analysis for Northern China. SHAP-based feature attribution illustrating the relative contributions of frequency components and meteorological variables. Panels show (a) individual feature importance ranking, (b) type-wise importance distribution between target and weather-derived components, and (c) cumulative contribution analysis identifying the optimal number of features for accurate prediction with a compact feature set.
3.4. Interpretability analysis
Understanding the factors associated with influenza predictions is crucial for both model validation and public health applications. Our framework’s SHAP-based interpretability analysis provides quantitative insights into the relative importance of different meteorological and temporal features across diverse climatic regions, supporting transparent model auditing and region-specific interpretation under a consistent attribution protocol (Figure 5).
Figure 5.
FaXNet feature importance analysis for Southern China. SHAP-based feature attribution illustrating the relative contributions of frequency components and meteorological variables. Panels show (a) individual feature importance ranking, (b) type-wise importance distribution between target and weather-derived components, and (c) cumulative contribution analysis identifying the optimal number of features for accurate prediction with a compact feature set.
3.4.1. Regional SHAP feature importance analysis
To systematically evaluate feature contributions, we employ SHAP (SHapley Additive exPlanations) analysis on the trained models for both regions. This game-theoretic approach quantifies the marginal contribution of each feature to the final prediction, providing interpretable and theoretically grounded feature importance rankings (Table 4).
Table 4.
Top feature importance scores by region.
| Rank | Component | Series | Importance | Component | Series | Importance |
|---|---|---|---|---|---|---|
| Northern China | Southern China | |||||
| 1 | 5 | north_tp | 0.0396 | 7 | south_t2m | 0.0198 |
| 2 | 3 | target | 0.0111 | 7 | target | 0.0129 |
| 3 | 3 | north_d2m | 0.0108 | 6 | target | 0.0121 |
| 4 | 6 | north_d2m | 0.0082 | 5 | target | 0.007 |
| 5 | 1 | target | 0.0075 | 4 | target | 0.0068 |
3.4.2. Regional transmission mechanism analysis
The SHAP rankings highlight regional differences in the relative roles of meteorological covariates and target-derived temporal components. In northern China, precipitation (north_tp, component 5) is the most influential feature (importance = 0.0396), exceeding the next-ranked feature by a large margin. This concentration of importance is consistent with a stronger reliance on meteorological conditions for predictive signal in the temperate north, where seasonal regularity is often stronger and meteorological covariates can be more informative under the current feature set (37). Importantly, SHAP importance reflects association within the model and dataset rather than causal effect, and should therefore be interpreted as indicating which variables the model most relies upon for prediction given the available inputs.
In southern China, temperature (south_t2m, component 7) is the top-ranked meteorological driver (importance = 0.0198), while several target-derived components also appear among the highest-ranked features (e.g., target component 7: 0.0129; target component 6: 0.0121). The more distributed importance pattern indicates that the model’s predictions are informed by both meteorological covariates and endogenous temporal structure captured by the target-series components. This pattern does not by itself establish a distinct “transmission mechanism,” but is consistent with the possibility that subtropical influenza dynamics are shaped by multiple interacting factors and may be less dominated by a single meteorological proxy under the current covariate set.
3.4.3. Implications for surveillance and decision support
The regional differences in feature importance have implications for influenza surveillance and intervention planning. The precipitation-dominant attribution pattern in northern China suggests that meteorology-informed monitoring may be particularly useful for contextualizing forecast changes in temperate settings. In contrast, the more distributed importance pattern in southern China suggests that incorporating additional contextual signals (e.g., mobility, school terms, vaccination uptake, or virological indicators) may further improve interpretability and robustness, especially if these drivers explain variability not captured by the current covariates.
Overall, these interpretability results support the view that FaXNet can produce region-sensitive attributions within a unified modeling pipeline, enabling stakeholders to inspect which inputs are most influential for forecasts in different climatic contexts. SHAP-based explanations enhance transparency for model auditing and communication, but should be interpreted alongside predictive performance, uncertainty estimates, and domain knowledge rather than as standalone causal evidence.
4. Conclusion
This study proposed FaXNet, a unified framework integrating adaptive spectral decomposition, SHAP-guided selection, and probabilistic deep learning to deliver transparent influenza forecasts. Extensive experiments demonstrated that FaXNet achieved superior accuracy and robustness across northern and southern China compared to state-of-the-art baselines. Crucially, the interpretability analysis elucidated distinct regional drivers, specifically precipitation dominance in the north versus complex thermal-social interactions in the south, providing a transparent scientific basis for risk-sensitive public health interventions.
Future research should prioritize three actionable directions to enhance operational viability. First, to enable real-time large-scale deployment, engineering accelerations such as warm-start pole selection or approximate SHAP algorithms are recommended to mitigate computational overhead. Second, to deepen causal understanding beyond meteorological correlations, future iterations should incorporate socio-behavioral covariates, including inter-regional mobility data and vaccination coverage, to better capture the complex human-driven factors shaping transmission dynamics. Finally, we propose extending this modality-agnostic framework to other multi-scale pathogens, such as dengue or RSV, to rigorously test its generalizability, ultimately advancing the paradigm of explainable and uncertainty-aware precision epidemiology.
Glossary
Glossary
- AFD
Adaptive Fourier Decomposition
- ARIMA
Autoregressive Integrated Moving Average
- AdamW
Adaptive Moment Estimation with decoupled weight decay
- CNIC
Chinese National Influenza Center
- CRPS
Continuous Ranked Probability Score
- d2m
2-m Dew-Point Temperature
- ERA5-Land
ECMWF Reanalysis v5-Land
- FaXNet
Frequency-Adaptive eXplainable Network
- FALSTM
Frequency-Adaptive Long Short-Term Memory
- FFT
Fast Fourier Transform
- GELU
Gaussian Error Linear Unit
- GRU
Gated Recurrent Unit
- LN
Layer Normalization
- LSTM
Long Short-Term Memory
- MAE
Mean Absolute Error
- MAPE
Mean Absolute Percentage Error
- MLP
Multilayer Perceptron
- MSE
Mean Squared Error
- NLL
Negative Log-Likelihood
- PIT
Probability Integral Transform
Coefficient of Determination
- ReLU
Rectified Linear Unit
- RMSE
Root Mean Squared Error
- SHAP
SHapley Additive exPlanations
- SVR-RBF
Support Vector Regression with Radial Basis Function kernel
- t2m
2-m Air Temperature
- tp
Total Precipitation
- EMD
Empirical Mode Decomposition
- VMD
Variational Mode Decomposition
- XAI
Explainable Artificial Intelligence
Funding Statement
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the self-supporting Program of Guangzhou Laboratory (Grant No. SRPG22-007), the Science and Technology Development Fund of Macau SAR (Grant No. FDCT 0002/2024/RDP), the Engineering Technology Research (Development) Center of Ordinary Colleges and Universities in Guangdong Province (Grant No. 2024GCZX010), the Guangdong Engineering Technology Research Center (Grant No. 2024A137), the National Natural Science Foundation of China (Grant No. 82361168672), and the Science and Technology Development Fund of Macau SAR (Grant No. FDCT0111/2023/AFJ).
Footnotes
Edited by: Wenbiao Hu, Queensland University of Technology, Australia
Reviewed by: Bin Hu, Xuzhou Medical University, China
Luttfi A. Al-Haddad, University of Technology, Iraq
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
WH: Writing – review & editing, Conceptualization, Formal analysis, Writing – original draft, Software, Data curation, Validation, Methodology. XuL: Validation, Software, Formal analysis, Writing – original draft. XiL: Writing – review & editing, Investigation, Visualization. ZL: Writing – review & editing, Resources, Data curation. ZZ: Formal analysis, Writing – review & editing, Methodology, Supervision. ZY: Funding acquisition, Validation, Project administration, Writing – review & editing. CH: Conceptualization, Writing – review & editing, Supervision, Project administration, Funding acquisition.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Correction note
A correction has been made to this article. Details can be found at: 10.3389/fpubh.2026.1804925.
Generative AI statement
The author(s) declared that Generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
- 1.Wang X, Li Y, O'Brien KL, Madhi SA, Widdowson MA, Byass P, et al. Global burden of respiratory infections associated with seasonal influenza in children under 5 years in 2018: a systematic review and modelling study. Lancet Glob Health. (2020) 8:e497–510. doi: 10.1016/S2214-109X(19)30545-5, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Uyeki TM, Hui DS, Zambon M, Wentworth DE, Monto AS. Influenza. Lancet. (2022) 400:693–706. doi: 10.1016/S0140-6736(22)00982-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Mostafa A, Abdelwhab EM, Mettenleiter TC, Pleschka S. Zoonotic potential of influenza a viruses: a comprehensive overview. Viruses. (2018) 10:497. doi: 10.3390/v10090497, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Huddleston J, Barnes JR, Rowe T, Xu X, Kondor R, Wentworth DE, et al. Integrating genotypes and phenotypes improves long-term forecasts of seasonal influenza a/H3N2 evolution. eLife. (2020) 9:e60067. doi: 10.7554/eLife.60067 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Doms C, Kramer SC, Shaman J. Assessing the use of influenza forecasts and epidemiological modeling in public health decision making in the United States. Sci Rep. (2018) 8:12406. doi: 10.1038/s41598-018-30378-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ning S, Yang S, Kou SC. Accurate regional influenza epidemics tracking using internet search data. Sci Rep. (2019) 9:5238. doi: 10.1038/s41598-019-41559-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Guo D, Li KC, Peters TR, Snively BM, Poehling KA, Zhou X. Multi-scale modeling for the transmission of influenza and the evaluation of interventions toward it. Sci Rep. (2015) 5:8980. doi: 10.1038/srep08980 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Pei S, Kandula S, Yang W, Shaman J. Forecasting the spatial transmission of influenza in the United States. Proc Natl Acad Sci USA. (2018) 115:2752–7. doi: 10.1073/pnas.1708856115, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Morris M, Hayes P, Cox IJ, Lampos V. Neural network models for influenza forecasting with associated uncertainty using web search activity trends. PLoS Comput Biol. (2023) 19:e1011392. doi: 10.1371/journal.pcbi.1011392, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Reich NG, McGowan C, Yamana TK, Tushar A, Ray EL, Osthus D, et al. Accuracy of real-time multi-model ensemble forecasts for seasonal influenza in the U.S. PLoS Comput Biol. (2019) 15:e1007486. doi: 10.1371/journal.pcbi.1007486, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Yan ZL, Liu WH, Long YX, Ming BW, Yang Z, Qin PZ, et al. Effects of meteorological factors on influenza transmissibility by virus type/subtype. BMC Public Health. (2024) 24:494. doi: 10.1186/s12889-024-17961-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Chen C, Zhang X, Jiang D, Yan D, Guan Z, Zhou Y, et al. Associations between temperature and influenza activity: a national time series study in China. Int J Environ Res Public Health. (2021) 18:10846. doi: 10.3390/ijerph182010846 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Si X, Wang L, Mengersen K, Hu W. Epidemiological features of seasonal influenza transmission among 11 climate zones in Chinese mainland. Infect Dis Poverty. (2024) 13:4. doi: 10.1186/s40249-024-01173-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Du M, Zhu H, Yin X, Ke T, Gu Y, Li S, et al. Exploration of influenza incidence prediction model based on meteorological factors in Lanzhou, China, 2014–2017. PLoS One. (2022) 17:e0277045. doi: 10.1371/journal.pone.0277045, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Chen S, Xu J, Wu Y, Wang X, Fang S, Cheng J, et al. Predicting temporal propagation of seasonal influenza using improved Gaussian process model. J Biomed Inform. (2019) 93:103144. doi: 10.1016/j.jbi.2019.103144 [DOI] [PubMed] [Google Scholar]
- 16.Wu H, Hu T, Liu Y, Zhou H, Wang J, Long M. TimesNet: Temporal 2D-variation modeling for general time series analysis. In: Proceedings of the International Conference on Learning Representations (ICLR). 2023. [Google Scholar]
- 17.Zhou T, Ma Z, Wen Q, Wang X, Sun L, Jin R. FEDformer: frequency enhanced decomposed transformer for long-term series forecasting. In: Proceedings of the 39th international conference on machine learning (ICML). Proc Mach learn res. 2022;162:27268–27286. [Google Scholar]
- 18.Liu Y, Hu T, Zhang H, Wu H, Wang S, et al., iTransformer: inverted transformers are effective for time series forecasting. In: Proceedings of the International Conference on Learning Representations (ICLR). 2024. [Google Scholar]
- 19.Demongeot J, Magal P. Spectral method in epidemic time series: application to COVID-19 pandemic. Biology. (2022) 11:1825. doi: 10.3390/biology11121825, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Reich NG, Brooks LC, Fox SJ, Kandula S, McGowan CJ, Moore E, et al. A collaborative multiyear, multimodel assessment of seasonal influenza forecasting in the United States. Proc Natl Acad Sci USA. (2019) 116:3146–54. doi: 10.1073/pnas.1812594116, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Cramer EY, Ray EL, Lopez VK, Bracher J, Brennen A, Castro Rivadeneira AJ, et al. Evaluation of individual and ensemble probabilistic forecasts of COVID-19 mortality in the United States. Proc Natl Acad Sci USA. (2022) 119:e2113561119. doi: 10.1073/pnas.2113561119 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. (2020) 2:56–67. doi: 10.1038/s42256-019-0138-9, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Lim B, Arik SÖ, Loeff N, Pfister T. Temporal fusion transformers for interpretable multi-horizon time series forecasting. Int J Forecast. (2021) 37:1748–64. doi: 10.1016/j.ijforecast.2021.03.012 [DOI] [Google Scholar]
- 24.Ozyegen O, Ilic I, Cevik M. Evaluation of interpretability methods for multivariate time series forecasting. Appl Intell. (2022) 52:4727–43. doi: 10.1007/s10489-021-02662-2, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Nsoesie EO, Brownstein JS, Ramakrishnan N, Marathe MV. A systematic review of studies on forecasting the dynamics of influenza outbreaks. Influenza Other Respir Viruses. (2014) 8:309–16. doi: 10.1111/irv.12226, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Yu L, Wang Z, Dai R, Wang W. Daily runoff prediction based on the adaptive Fourier decomposition method and multiscale temporal convolutional network. Environ Sci Pollut Res Int. (2023) 30:95449–63. doi: 10.1007/s11356-023-28936-5, [DOI] [PubMed] [Google Scholar]
- 27.Wang Z, Wong CM, Rosa A, Qian T, Wan F. Adaptive Fourier decomposition for multi-channel signal analysis. IEEE Trans Signal Process. (2022) 70:903–18. doi: 10.1109/tsp.2022.3143723 [DOI] [Google Scholar]
- 28.Wang H, Liang Q, Hancock JT, Khoshgoftaar TM. Feature selection strategies: a comparative analysis of SHAP-value and importance-based methods. J Big Data. (2024) 11:44. doi: 10.1186/s40537-024-00905-w [DOI] [Google Scholar]
- 29.Mohanty PK, Francis SAJ, Barik RK, Roy DS, Saikia MJ. Leveraging shapley additive explanations for feature selection in ensemble models for diabetes prediction. Bioengineering. (2024) 11:1215. doi: 10.3390/bioengineering11121215, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. (1997) 9:1735–80. doi: 10.1162/neco.1997.9.8.1735, [DOI] [PubMed] [Google Scholar]
- 31.Mathis SM, Webber AE, León TE, Murray EL, Sun M, White LA, et al. Evaluation of flusight influenza forecasting in the 2021–22 and 2022–23 seasons with a new target: laboratory-confirmed influenza hospitalizations. Nat Commun. (2024) 15:6289. doi: 10.1038/s41467-024-50601-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Huang NE, Shen Z, Long SR, Wu MC, Shih HH, Zheng Q, et al. The empirical mode decomposition and the hilbert spectrum for nonlinear and non-stationary time series analysis. Proc R Soc A. (1998) 454:903–95. doi: 10.1098/rspa.1998.0193 [DOI] [Google Scholar]
- 33.Dragomiretskiy K, Zosso D. Variational mode decomposition. IEEE Trans Signal Process. (2014) 62:531–44. [Google Scholar]
- 34.Yu H, Alonso WJ, Feng L, Tan Y, Shu Y, Yang W, et al. Characterization of regional influenza seasonality patterns in China and implications for vaccination strategies: spatio-temporal modeling of surveillance data. PLoS Med. (2013) 10:e1001552. doi: 10.1371/journal.pmed.1001552, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bracher J, Ray EL, Gneiting T, Reich NG. Evaluating epidemic forecasts in an interval format. PLoS Comput Biol. (2021) 17:e1008618. doi: 10.1371/journal.pcbi.1008618, [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Funk S, Camacho A, Kucharski AJ, Lowe R, Eggo RM, Edmunds WJ. Assessing the performance of real-time epidemic forecasts: a case study of Ebola in Sierra Leone, 2014–15. PLoS Comput Biol. (2019) 15:e1006785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Shaman J, Kohn M. Absolute humidity modulates influenza survival, transmission, and seasonality. Proc Natl Acad Sci USA. (2009) 106:3243–8. doi: 10.1073/pnas.0806852106, [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.





