Skip to main content
PLOS One logoLink to PLOS One
. 2025 Jun 24;20(6):e0326587. doi: 10.1371/journal.pone.0326587

An integrated IKOA-CNN-BiGRU-Attention framework with SHAP explainability for high-precision debris flow hazard prediction in the Nujiang river basin, China

Hao Yang 1, Tianlong Wang 2,3,*, Nikita Igorevich Fomin 1, Shuoting Xiao 1, Liang Liu 1
Editor: Linwei Li4
PMCID: PMC12186985  PMID: 40554568

Abstract

Debris flows represent a persistent challenge for disaster prediction in mountainous regions due to their highly nonlinear and multivariate triggering mechanisms. This study proposes an explainable deep learning framework, the Improved Kepler Optimization Algorithm-Convolutional Neural Network-Bidirectional Gated Recurrent Unit-Attention (IKOA-CNN-BiGRU-Attention) model, for precise debris flow hazard prediction in the Yunnan section of the Nujiang River Basin, China. The model is developed and validated using data from 159 debris flow-prone gullies, integrating deep convolutional, recurrent, and attention-based architectures, with hyperparameters autonomously optimized by IKOA. Model explainability is enhanced using SHapley Additive exPlanations (SHAP), which quantify the influence of key factors. The IKOA-CNN-BiGRU-Attention framework consistently outperforms 13 benchmark models, achieving a root mean square error of 2.33 × 10−6, mean absolute error of 1.51 × 10−6, and mean absolute percentage error of 0.006%. The model maintains high stability across 50 repeated experiments, strong resilience to 20% input noise, and robust generalizability under five-fold cross-validation. Interpretability analysis identifies potential source energy and maximum 24-hour rainfall as primary determinants and uncovers a dual-threshold physical mechanism underlying debris flow initiation. These findings provide a quantitative basis for adaptive early warning and targeted risk mitigation, and establish a transferable framework for explainable geohazard prediction.

1 Introduction

Debris flows constitute a catastrophic geophysical phenomenon that epitomizes the complex interplay between gravitational forces, material rheology, and hydrological dynamics in mountainous environments. These rapid mass movements pose an escalating threat to human settlements, critical infrastructure, and ecological integrity globally, with disproportionate impacts on developing regions with expanding mountain populations [1]. The socioeconomic burden is substantial: between 2000 and 2020, debris flows accounted for approximately 17% of global landslide-related fatalities while generating annual economic losses exceeding 20 billion USD. Unlike other natural hazards with extended warning periods, debris flows manifest with minimal antecedent indicators and propagate at velocities reaching 10 m/s, severely constraining evacuation timeframes and necessitating high-precision predictive frameworks for effective risk reduction [2]. The multifactorial triggering mechanisms underlying these events, characterized by nonlinear threshold responses and complex factor interactions, present formidable challenges for contemporary hazard forecasting.

The epistemological evolution of debris flow prediction has traversed distinct methodological paradigms, each characterized by specific conceptual frameworks and analytical limitations. Initial approaches relied on empirical correlations derived from historical event databases, establishing phenomenological relationships for inundation area delineation [3] and precipitation threshold determination [4]. Such frameworks, while operationally accessible, frequently exhibited limited transferability beyond their calibration domains. Subsequent advances introduced physically-based numerical models that simulated the complete process chain from material mobilization through deposition, incorporating rheological principles and granular flow mechanics [5]. These sophisticated simulations demonstrably enhanced impact assessment capabilities [6] and runout predictions in topographically complex terrain. Nevertheless, their widespread implementation remains constrained by intensive parameterization requirements, computational demands, and fundamental uncertainties in constitutive relationships governing material behavior during extreme mobilization events.

Intermediate methodological frameworks emerged to address these epistemological limitations through probabilistic formulations and multi-criteria decision architectures. Statistical models employing Bayesian inference frameworks quantified conditional hazard probabilities under varying environmental scenarios, enabling the establishment of robust rainfall thresholds with explicit uncertainty quantification [7]. Concurrently, multi-attribute decision frameworks provided structured methodologies for integrating heterogeneous hazard factors [8], with recent innovations applying Analytic Hierarchy Process techniques to susceptibility mapping across diverse geological contexts [9] and multi-criteria analysis approaches to flood risk assessment in coastal regions [10]. Despite their theoretical elegance, these approaches frequently encounter fundamental constraints in high-dimensional feature space integration and often introduce systematic biases through subjective parameter weighting schemes that compromise prediction reliability.

The integration of machine learning (ML) into geohazard assessment represents a paradigmatic transformation in debris flow prediction, facilitating the identification of complex, nonlinear patterns within high-dimensional environmental datasets that remain inaccessible to conventional analytical approaches [11]. Contemporary research has demonstrated the efficacy of artificial neural networks in characterizing rainfall-triggered debris flow probability distributions and advanced information-theoretic coupling methods for susceptibility assessment [12]. These computational frameworks excel in capturing intricate multivariate relationships without requiring explicit physical process formulation [13]. Furthermore, ensemble learning techniques enhance predictive stability and generalization capabilities across heterogeneous environmental conditions previously inaccessible to traditional modeling frameworks.

Despite their transformative potential, current ML applications to debris flow hazard assessment exhibit critical epistemological and methodological limitations that constrain their scientific impact and operational utility. First, model performance demonstrates an acute sensitivity to hyperparameter configurations, with suboptimal parameterization inducing overfitting phenomena that compromise generalization to novel geomorphological contexts [14]. Second, conventional optimization algorithms frequently converge prematurely to local extrema, resulting in suboptimal exploration of the high-dimensional parameter space [15]. Third, and most fundamentally, deep learning architectures manifest intrinsic opacity that creates a profound disconnect between predictive capacity and mechanistic understanding [16]. This explainability deficit, frequently characterized as the “black-box problem,” represents a critical epistemological barrier in hazard forecasting applications. The black-box problem specifically refers to the inherent inability to trace how deep learning models transform input features into output predictions through their complex internal architectures. This opacity manifests through (1) algorithmic complexity, where numerous interconnected neurons and non-linear activation functions obscure input-output relationships; (2) latent feature representation, where models develop abstract internal representations that lack direct physical interpretation; and (3) stochastic learning behavior, where training procedures yield models whose internal configurations cannot be deterministically predicted. Contemporary neural networks, while achieving unprecedented predictive accuracy, typically obscure the contribution of specific variables to prediction outcomes, thereby inhibiting scientific elucidation of causative mechanisms and undermining the implementation of targeted mitigation strategies.

Three fundamental research gaps persist in contemporary literature that motivate this study: (1) the predominant prioritization of predictive accuracy over model explainability creates an artificial tension between computational performance and theoretical advancement; (2) current methodological frameworks overwhelmingly employ single-algorithm approaches that inadequately capture the complex spatiotemporal characteristics of debris flow triggering conditions; and (3) the systematic integration of advanced optimization techniques with explainable artificial intelligence frameworks remains largely unexplored in geohazard prediction contexts, particularly for multi-parameter debris flow risk assessment. Explainable Artificial Intelligence (XAI) methodologies, particularly the SHapley Additive exPlanations (SHAP) approach, offer promising solutions to these challenges by providing mathematically rigorous attribution mechanisms derived from cooperative game theory principles [17,18]. Despite preliminary applications identifying dominant controlling factors in debris flow research, comprehensive XAI integration within holistic debris flow assessment frameworks remains substantially underdeveloped [19].

Motivated by these research gaps and the potential of XAI approaches, this study aims to develop and validate an integrated deep learning framework that maximizes both predictive accuracy and scientific explainability for debris flow hazard assessment. The specific research objectives include: (1) identifying optimal architectural configurations for capturing complex spatiotemporal characteristics of debris flow triggering factors; (2) quantifying the relative contribution of environmental, geological, and meteorological variables; (3) establishing a theoretical framework connecting physical processes to prediction outcomes through explainable AI; and (4) determining quantitative thresholds governing debris flow risk to support evidence-based intervention strategies.

To address these objectives, this study implements an IKOA-CNN-BiGRU-Attention framework that integrates physics-inspired optimization algorithms with deep learning architectures and explainability mechanisms. This framework combines convolutional networks for spatial feature extraction, bidirectional recurrent units for temporal dynamics modeling, and attention mechanisms for feature importance weighting, specifically designed to capture debris flow triggering mechanisms while providing SHAP-based scientific insights. The framework advances debris flow hazard prediction methodologically, theoretically, analytically, and practically, offering actionable insights for precision disaster risk management. The overall workflow of the study is illustrated in Fig 1.

Fig 1. Flow chart of this study.

Fig 1

2 Methodology

2.1 Comprehensive weighting strategy

Precise determination of influencing factor weights constitutes a fundamental challenge in debris flow hazard assessment. This study proposes a novel game theory-based comprehensive weighting framework that integrates three complementary methodologies: the coefficient of variation method (CVM), entropy weight method (EWM), and CRITIC method [2022]. Each methodology captures distinct aspects of factor importance: data dispersion, information entropy, and inter-factor correlations, respectively. The proposed framework applies cooperative game theory principles to optimize the collective strengths of these methods, conceptualizing each as a strategic element in a cooperative game to determine optimal weight distribution. The complete framework is illustrated in Fig 2.

Fig 2. Framework of the game theory-based comprehensive weighting strategy.

Fig 2

2.2 Improved parameter adjustment strategy for swarm intelligence algorithms

2.2.1 The KOA algorithm.

The Kepler Optimization Algorithm (KOA), introduced by Mohamed Abdel-Basset in 2023, serves as the foundation for model optimization in this study. KOA represents a physics-based metaheuristic algorithm inspired by Kepler’s three laws of planetary motion [23]. This algorithm conceptualizes the Sun as the optimal solution and planets as candidate solutions, with planetary motion mechanisms driving the evaluation of solution quality. Compared to established optimization algorithms such as Grey Wolf Optimizer (GWO) and Sparrow Search Algorithm (SSA), KOA exhibits enhanced exploration capabilities within the solution space through innovative search strategies, enabling effective avoidance of local optima entrapment while ensuring stability and robustness in hyperparameter optimization for complex models. From a mathematical perspective, KOA operates through seven sequential steps:

Step 1: Initialization process. The algorithm begins with a random generation of candidate solutions representing planets moving within the search space. Each planet possesses a position and velocity representing potential optimization solutions. Each planetary position is formally defined as shown in Equation (1). In Equation (1), Xi represents the ith candidate solution in the model; d represents the number of dimensions; N indicates the number of candidate solutions; Xji,up and Xji,low determine the boundary range of the search space, with rand[0,1] randomly selected values between 0 and 1. The eccentricity ei and period Ti are initialized according to Equations (2) and (3). In Equations (2) – (3), r is a random number based on a normal distribution.

Xij=Xi,lowj+rand[0,1]×(Xi,upjXi,lowj),{i=1,2,,Nj=1,2,,d  (1)
 ei=rand[0,1],i=1,2,,N (2)
Ti=|r|,i=1,2,,N (3)

Step 2: Defining the gravitational force (F). Planets experience varying velocities at different orbital positions due to differential gravitational pull from the Sun along elliptical planetary orbits. The gravitational force experienced by a planet is formulated as in Equation (4), while the Euclidean distance is represented in Equation (5).

Fgi=ei×μ(t)×Ms×miR2+ε+r1 (4)
Ri(t)=Xs(t)+Xi(t)2=j=1d(XSj(t)Xij(t))2 (5)

Step 3: Calculating object velocity. Based on gravitational force and Sun-planet distance, the velocity of a planet at any position on its elliptical orbit is expressed as Equation (6).

Vi(t)={1×(2r4XiXb)+(1Rinorm(t))×τ×U1×r5(Xi,upXi,low),if Rinorm(t)0.5 r4×δ×(Xaxi)+(1Rinorm(t))×τ×U2×r5×(r3Xi,upXi,low),Else  (6)

Step 4: Escaping from local optima. The parameter π in the velocity equation modulates search direction, simulating varying orbital directions of planets within the Solar System. This mechanism prevents convergence on local optima, as shown in Equation (7).

τ={1,ifr40.51,Else  (7)

Step 5: Updating object positions. Planetary motion occurs in two distinct phases: approaching and receding from the Sun. These correspond to the exploitation and exploration phases respectively. During exploration, the algorithm discovers new candidate solutions. During exploitation, solutions near the optimum are refined, as shown in Equation (8).

Xi(t+1)=Xi(t)+τ×Vi(t)+(Fgi(t)+| r|) ×U×(Xs(t)Xi(t)) (8)

Step 6: Updating Sun-planet distance. Manipulation of parameter h enhances exploration and exploitation capabilities, as shown in Equation (9).

Xi(t+1)=Xi(t)×U1+(1U1)×(Xi(t)+XS+Xa(t)3.0+h×(Xi(t)+XS+Xa(t)3.0Xb(t))) (9)

Step 7: Elitism mechanism. The algorithm evaluates fitness values between current and newly calculated positions, retaining superior solutions while discarding inferior ones, as shown in Equation (10).

Xi,new(t+1)={Xi(t+1),if f(Xi(t+1))f(Xi(t))Xi(t),Else  (10)

2.2.2 The IKOA algorithm.

Despite the promising capabilities of physics-inspired metaheuristics, optimization algorithms frequently encounter fundamental challenges including premature convergence, local optima entrapment, and suboptimal exploration-exploitation balance. To overcome these intrinsic limitations in the standard KOA framework, this study proposes a systematically enhanced variant, the Improved Kepler Optimization Algorithm (IKOA), incorporating three complementary methodological refinements: (1) Chebyshev mapping for ergodic population initialization, (2) golden sine operator for solution space adaptation, and (3) dynamic weight coefficient for balanced search dynamics. This integrative approach ensures exceptional parameter optimization robustness across complex hypersurfaces. The complete algorithmic architecture is illustrated in Fig 3.

Fig 3. Flow chart of the Improved Kepler Orbital Algorithm (IKOA).

Fig 3

  • 1

    Improved strategy for population initialization based on Chebyshev mapping

Chaos mapping represents an uncertain and random methodology within nonlinear dynamic systems. In the process of initializing position updates, chaotic variables replace the random variables traditionally used in intelligent algorithm processes, thereby offering a broader search range in the solution space compared to probability-based random search strategies. Consequently, employing chaos mapping for population initialization in the KOA enhances the diversity of the population and the convergence speed of the optimization algorithm. This study utilizes the Chebyshev chaos mapping method [24], with its mathematical description presented in Equation (11), where k denotes the order.

xn+1=cos(k arccosxn),xn[1,1] (11)
  • 2

    Rolling position update strategy based on golden sine algorithm

The Golden Sine Algorithm (Gold-SA), proposed by Tanyildizi et al., is a novel metaheuristic algorithm [25]. Gold-SA differs from swarm intelligence algorithms by focusing on enhancing search efficiency and optimization precision in the later stages through the gradual narrowing of the current solution space. This means that the current search space continuously converges towards the optimal target value. In the traditional KOA algorithm, the efficiency of local and global searches by planetary individuals is relatively low, making the algorithm more susceptible to becoming trapped in local optima as it progresses. By incorporating the golden sine operator into Step 5, and leveraging the properties of the golden ratio coefficient to reduce the solution space, the overall performance of the algorithm is significantly improved. The position update formula with the added golden sine operator is presented in Equation (12). In Equation (15), R1∈[0, 2π]is a random number determining the distance of the individual’s movement in the next iteration, R2∈[0, π] is a random number determining the direction of the next movement, and x1 = -π+(1-π)*2π, x2 = -π + τ*2πare coefficients derived from the golden section ratio τ=(5-1)/2, This ratio narrows the algorithm’s search space and guides individuals gradually towards the optimum.

Xi(t+1)=Xi(t)×|sinR1|+τ×Vi(t)+R2×sinR1×(Fgi(t)+| r|)×U×(x1× Xs(t)x2×Xi(t))  (12)
  • 3

    Position update strategy based on dynamic weight coefficient

To address the issue of insufficient global exploration capability during the position updates for planets close to the Sun in Step 6, a dynamic weight coefficient based on the number of iterations is introduced in Equation (9). This addition provides a balanced capability for global exploration and local exploitation of the algorithm, as detailed in the improved formula shown in Equation (13). In Equation (13), k1 = 1-t3/T3, k2 = t3/T3. Within Equation (13), k1 is larger in the early stages of iteration, enhancing the planets’ early exploration ability, while k2 gradually increases in the later stages, ensuring the planets’ local exploitation capability in the latter phase.

Xi(t+1)=Xi(t)×U1×k1+(1U1)×(Xi(t)+XS+Xa(t)3.0+h×(Xi(t)+XS+Xa(t)3.0Xb(t)))×k2 (13)

2.2.3 Benchmark function test.

To evaluate IKOA’s optimization capabilities, this study implemented a benchmark analysis using nine standard test functions from the CEC 2005 benchmark suite [26]. These functions provide standardized performance metrics across multiple dimensions: unimodal functions f1-f5 assess local exploitation efficiency; multimodal functions f6-f7 examine global exploration capabilities; and fixed-dimension multimodal functions f8-f9 evaluate performance in complex search spaces. Table 1 presents the mathematical formulations and characteristics of each function. Experimental dimensionality was set at 30 for functions f1-f7 and 2 for functions f8-f9, following standard protocols. The comparative analysis included IKOA, original KOA, and three contemporary algorithms: Grey Wolf Optimizer (GWO), Sparrow Search Algorithm (SSA), and Whale Optimization Algorithm (WOA) [2729]. Each function underwent 30 independent optimization trials, with convergence trajectories illustrated in Fig 4. The analysis demonstrates IKOA’s superior performance across optimization metrics. IKOA consistently exhibits accelerated convergence compared to reference algorithms, achieving superior early-stage exploration and late-stage exploitation precision while maintaining an exceptional balance between search diversification and intensification. These qualities establish IKOA’s theoretical foundation for application in debris flow hazard assessment modeling, where its enhanced optimization capabilities directly translate to improved predictive performance.

Table 1. Benchmark function information.
Function Dim Space Min
f1(x)=i=1nxi2 30 [−100,100] 0
f2(x)= i=1n|xi|+i=1nxi 30 [−10,10] 0
f3(x)=i=1n(j=1ixj2) 30 [−100,100] 0
f4(x)=max{|xi|,1in} 30 [−100,100] 0
f5(x)=i=1nxisin(|xi|) 30 [−500,500] 0
f6(x)=418.9829ni=1nxisin(|xi|) 30 [−500,500] 0
f7(x)=sin2(πω1)+i=1d1(ω11)2[1+10sin2(πω1+1)
                (ωd1)2]+[1+sin2(2πωd)],ωi=1+(xi1)4,i=1,2,d
30 [−10,10] 0
f8(x)=sin2(3πx1)+(x11)2[1+sin2(3πx2)
+(x21)2[1+sin2(2πx2)]
2 [−10,10] 0
f9(x)=(1.5x1+x1x2)2+(2.625x1+x1x23)2 2 [-4.5,4.5] 0
Fig 4. The convergence curve of each optimization algorithm under the test function.

Fig 4

2.3. The IKOA-CNN-BiGRU-Attention model

This study presents a novel hierarchical deep learning architecture for debris flow hazard assessment that integrates optimization algorithms with specialized neural network components. The IKOA-CNN-BiGRU-Attention framework, illustrated in Fig 5, implements a cascaded processing pipeline for multidimensional geophysical data. The CNN module [30] employs convolutional operations to extract spatially invariant features across multiple abstraction levels, identifying complex topographical signatures that serve as precursors to debris flow events. The BiGRU component [31] extends representation capacity through bidirectional recurrent processing, capturing non-linear dependencies in sequential geophysical processes by propagating information in forward and reverse temporal directions. The Attention mechanism [32] implements adaptive computational focus through learned importance weighting, dynamically modulating feature contributions based on contextual relevance. A distinctive contribution is the meta-optimization approach implemented through IKOA, which systematically refines four foundational hyperparameters: Learning Rate, GRU Units m, CNN Filters n, and Batch Size b. These parameters critically influence model performance, balancing expressiveness, computational efficiency, and generalization capability. The implementation proceeds through four systematic phases:

Fig 5. Architectural framework of the integrated IKOA-CNN-BiGRU-Attention model.

Fig 5

Step 1: Data Preprocessing. Statistical standardization establishes numerical stability while K-Nearest Neighbors imputation preserves data integrity through the reconstruction of missing values.

Step 2: Model Initialization. The preprocessed dataset undergoes chronological partitioning with 70% allocated for parameter estimation and feature learning.

Step 3: Hyperparameter Optimization. IKOA explores the hyperparameter space to identify optimal architectural configurations through iterative performance evaluation.

Step 4: Model Deployment. The optimized architecture is applied to the remaining 30% for validation and hazard prediction generation.

3 Materials

The study area encompasses the Yunnan section of the Nujiang River Basin in China (Fig 6), spanning approximately 33,500 km2 and constituting 8.7% of Yunnan Province’s total area. This geographically significant region extends 650 km along the Nujiang (Salween) River, bordered by the Lancang River Basin to the east and Myanmar to the west, spanning from the southern margins of the Tibetan Plateau to the northern extent of the Yunnan-Guizhou Plateau. The study area is situated between 24°07’ to 28°23’ N latitude and 98°07’ to 100°00’ E longitude. The geomorphology exhibits extreme topographic heterogeneity characterized by dramatic alpine gorges and precipitous mountainous terrain. The landscape features a pronounced north-south elevation gradient, intersected by the Hengduan Mountain Range. The area contains one of the most significant geomorphological features in the region, the Nujiang Grand Canyon, which ranks among the world’s deepest canyon systems. This exceptional topographic amplitude creates optimal conditions for gravitational mass movements, particularly debris flows with substantial kinetic energy and destructive potential.

Fig 6. Geographical distribution of the study area.

Fig 6

The region’s climate is classified as humid subtropical monsoon, characterized by abundant yet spatially heterogeneous precipitation. Annual rainfall demonstrates significant variability, with maximum precipitation reaching approximately 1500 mm in higher elevation zones and diminishing to below 1000 mm in lower elevation areas. Temporal distribution exhibits marked seasonality, with over 51% of annual precipitation concentrated during the summer monsoon period [27]. This concentrated rainfall pattern constitutes the primary triggering mechanism for debris flow mobilization. The geological framework displays exceptional complexity, dominated by metamorphic complexes and sedimentary formations intersected by numerous fault systems. This intricate structural geology, combined with intense neotectonic activity, has produced extensive zones of fractured and weathered lithological material that serves as source material for debris flows. The synergistic interaction between fractured bedrock, steep topographic gradients, and concentrated seasonal precipitation creates ideal conditions for debris flow initiation and propagation. These distinctive geoenvironmental characteristics contribute to the elevated frequency and magnitude of geohazard events throughout the region. The prevalence of these hazards necessitates systematic investigation for effective disaster risk reduction strategies and sustainable regional development initiatives. This research utilizes comprehensive data from 159 debris flow prone gullies distributed throughout the Nujiang River Basin, applying advanced deep learning methodologies to analyze and predict debris flow hazard intensities across diverse geomorphological contexts.

Elevation data are based on SRTM DEM (Shuttle Radar Topography Mission, NASA/USGS). Administrative boundaries and rivers are from Natural Earth (public domain). Map generated by the authors.

4 Hazard-causing factors and weighting calculation

4.1 Selection of hazard-causing factors

This investigation established a comprehensive parametric foundation for debris flow susceptibility analysis through systematic integration of multidisciplinary data sources, including extensive literature synthesis [28], satellite remote sensing explanation, field-based geomorphological investigations, and archival geological and hydrological documentation. Advanced geospatial technologies facilitated rigorous extraction of topographical variables from 30m-resolution SRTM DEM data, while meteorological parameters underwent spatial interpolation through geostatistical inverse distance weighting algorithms calibrated with observational data from meteorological stations distributed throughout the study domain.

The extracted factors for the study area include: maximum 24-hour rainfall (X1), annual average rainfall (X2), annual average temperature (X3), annual average wind speed (X4), annual average vapor pressure (X5), annual average relative humidity (X6), potential source energy (X7), catchment area (X8), average gradient of the main channel (X9), length ratio of the sediment supply section (X10), length of the main channel (X11), sinuosity of the main channel (X12), drainage density of the catchment (X13), valley profile roundness (X14), vegetation cover rate (X15), average slope (X16), maximum relative elevation difference within the catchment (X17), lithological impact (X18), and dynamic storage capacity (X19). A total of 19 factors were selected, sequentially numbered as X1 to X19. In line with the principles of systematicity, completeness, effectiveness, and comparability in the selection of evaluation indicators, and considering the accessibility and operability of the data, the debris flow outbreak system in the study area was divided into three subsystems: the hydrometeorological subsystem, material source subsystem, and geographic subsystem, encompassing the aforementioned 19 factors (X1-X19). The rationale for selecting each factor is detailed in Table 2.

Table 2. Hazard causation systems and their selection criteria.

Subsystem Factor Unit Selection basis
Hydrometeorological subsystem X1 mm Lee et al. [33] introduced an ANN model for predicting debris flows, identifying a nonlinear relationship between rainfall and the occurrence of debris flows. Higher annual average precipitation can create geotechnical conditions prone to debris flows, while high-intensity rainfall serves as a significant direct trigger for debris flow events.
X2 103mm
X3 °C Kidron et al. [34], Eppes et al. [35,36], and Deng et al. [37] highlighted the impacts of temperature, vapor pressure, humidity, and wind on rock weathering. During debris flows, the loosening of rocks due to weathering provides a substantial source of dynamic storage. Bai et al. [38,39] demonstrated that temperature and humidity affect the movement of soil particles.
X4 m/s
X5 hPa
X6 %
Material source subsystem X7 109J/m2 Li [39] utilized this factor which integrates the impacts of elevation differences and dynamic source storage. Debris flows fundamentally involve transforming the gravitational potential energy of source material into kinetic energy, which quantifies the debris flow’s destructive potential.
X10 Zhang et al. [40] employed the ratio of lengths in the sediment supply zone as a predictor for debris flow hazard levels. This metric indicates how unconsolidated materials are distributed within the riverbed, with homogeneous distribution increasing material availability.
X19 104/km2 Ma et al. [41] demonstrated that post-earthquake debris flow scale and frequency significantly increased due to augmented volumes of loose material sources caused by seismic activity.
Geographic subsystem X8 km2 Heiser et al. [42], Zhao et al. [43], and Li et al. [44] utilized catchment area as a key topographic factor. The catchment area determines the watershed’s water-gathering capacity and sediment yield, making it a critical parameter universally considered in debris flow prediction models.
X9 Di et al. [45] incorporated the average gradient of the main channel in debris flow prediction. This parameter reflects channel slope, where a sufficient gradient is necessary for debris flow propagation, as low-slope conditions impede downward movement.
X11 km Heiser et al. [42], Zhao et al. [43], and Li et al. [44] identified main channel length as a key parameter. Extended channels increase the potential for water/debris flows to transport larger quantities of loose material during descent.
X12 Kattel et al. [46] demonstrated through fluid dynamics analysis how meandering channels affect impact distance and solid material transport due to reduced flow velocity in curved sections. This coefficient quantifies channel sinuosity.
X13 Km/km2 Panchal et al. [47] established this factor’s linear relationship with landslide probability using the Analytic Hierarchy Process (AHP) methodology.
X14 Shi et al. [48] applied contour roundness in debris flow analysis to characterize watershed geomorphology and hydrological properties.
X15 % Zhao et al. [43] established through machine learning that vegetation cover significantly influences debris flow probability, with higher coverage substantially reducing occurrence likelihood.
X16 ° Bertrand et al. [49] demonstrated that slope functions as a critical gravitational condition for debris flow formation while enhancing catchment water-collecting capacity.
X17 km Heiser et al. [42], Zhao et al. [43], and Li et al. [44] utilized maximum relative elevation difference as a key parameter. This factor directly determines the gravitational potential energy available for conversion to kinetic energy, thus controlling the destructive capacity of resulting debris flows.
X18 Brardinoni et al. [50] established that lithology functions as a primary control mechanism for the generation rate of dynamic source materials in debris flow systems.

4.2 Diagnosis of multicollinearity

Statistical independence among predictor variables constitutes a critical prerequisite for robust hazard modeling. The research implemented Variance Inflation Factor (VIF) analysis to systematically diagnose multicollinearity and optimize variable selection. VIF quantifies variance inflation of regression coefficients due to linear dependencies, calculated as 1/(1-R2), where R2 represents the coefficient when regressing a variable against all other predictors. While conventional guidelines suggest a VIF threshold of 10, the current investigation adopted a more conservative threshold of 5 to ensure model stability and parameter precision for complex environmental systems [51]. Multicollinearity diagnosis identified four factors exceeding this threshold: X10 (length ratio of the sediment supply section: 7.83), X14 (valley profile roundness: 5.62), X17 (maximum relative elevation difference: 8.14), and X19 (dynamic storage capacity: 6.95). These variables were systematically excluded to maximize model precision. Fig 7 presents the VIF analysis results for all hazard-causing factors, illustrating that the final variable selection optimizes unique information content while minimizing statistical redundancy.

Fig 7. VIF analysis results for hazard-causing factors.

Fig 7

4.3 Hazard level assessment through comprehensive weighting

The inherent complexity of debris flow systems necessitates sophisticated factor weighting approaches that accommodate the multidimensional interactions among diverse variables. This study implements the game theory-based comprehensive weighting methodology detailed in section 2.1, achieving optimal integration of complementary weighting perspectives. The resulting factor weights derived from this approach are presented in Fig 8. The hazard level assessment procedure follows a systematic framework:

Fig 8. Results of Comprehensive weighting.

Fig 8

Step 1: Normalize the original dataset to eliminate the dimensional impact of different factors. The normalization formulas for maximum-type indicators and minimum-type indicators are shown in Equations (14) and (15), respectively. In Equations (14) – (15), xij represents the jth influencing factor of the ith gully, i = 1,2,…,n and j = 1,2,…,m, n denotes the number of data groups, and m denotes the number of factors.

rij=xijmin(xij)max(xij)min(xij) (14)
rij=max(xij)xijmax(xij)min(xij) (15)

Step 2: For each debris flow gully, sum the weighted data of influencing factors to obtain its hazard assessment value. The calculation formula is shown in Equation (16). In Equation (16), wj represents the weight of each factor through Fig 8, and rij denotes the normalized value of the jth influencing factor for the ith gully.

Ri=jrij×ωj (16)

5 Results

To ensure statistical robustness, experimental parameters were standardized with population size fixed at 30 and iteration count at 100 across all optimization algorithms. The dataset underwent a 70%/30% training-testing partition, with results averaged across 50 independent runs. Notably, 12 test samples with distinctive geomorphological characteristics were deliberately included to evaluate model generalization capabilities across diverse debris flow conditions. Performance evaluation employed three complementary metrics: RMSE, MAE, and MAPE. The proposed IKOA-CNN-BiGRU-Attention model demonstrated exceptional predictive accuracy when benchmarked against 13 established machine learning regression models [5255], as quantified in Table 3. Performance comparisons among the top five models are visualized in Fig 9. The hybrid model achieved unprecedented prediction precision with RMSE of 2.33 × 10−6, MAE of 1.51 × 10−6, and MAPE of 0.006%, representing multiple orders of magnitude improvement over conventional approaches.

Table 3. The prediction error of each model.

No. Model RMSE MAE MAPE
1 IKOA-CNN-BiGRU-Attention 2.33E-06 1.51E-06 0.006%
2 PSO-SVM 0.0691 0.0352 26.74%
3 GWO-ELM 0.0017 0.0008 1.98%
4 IDBO-VMD-BiLSTM 0.0636 0.0451 60.57%
5 DT 0.1024 0.0522 43.24%
6 Adaboost 0.1030 0.0520 117.84%
7 IAO-BiLSTM 0.0582 0.0333 73.20%
8 RF 0.1155 0.0610 78.35%
9 GA-BP 0.3155 0.2359 17.01%
10 GBDT 0.1060 0.0540 133.58%
11 Extra trees 0.1060 0.0540 132.10%
12 CatBoost 0.1170 0.0630 319.26%
13 KNN 0.1152 0.0630 298.79%
14 XGBoost 0.0990 0.0500 99.22%

Fig 9. Predictive performance of the training and test sets.

Fig 9

This superior performance reflects systematic hyperparameter optimization through the improved Krill Herd algorithm. The learning rate underwent progressive refinement from 0.0061 to 0.0014, while GRU units converged to 116, CNN filters stabilized at 54, and batch size optimized at 31. The hyperparameter evolution across iterations, detailed in Table 4, demonstrates the algorithm’s capacity to navigate complex parameter spaces while avoiding local optima. Architecturally, the bidirectional GRU component captures temporal dependencies in multidimensional feature sequences, while the attention mechanism adaptively prioritizes critical factors including slope characteristics, precipitation parameters, and drainage properties. Unlike models such as GWO-ELM and PSO-SVM, which exhibit performance degradation with geomorphologically distinct test cases, the IKOA-CNN-BiGRU-Attention model maintains exceptional prediction stability across diverse geological contexts, validating its effectiveness in modeling the inherently nonlinear and multidimensional nature of debris flow hazard systems.

Table 4. Results for model hyperparameter values.

Iteration Learning Rate GRU Units CNN Filters Batch Size Loss
5 0.0061 90 42 26 0.2501
15 0.0048 105 48 28 0.2157
28 0.0032 112 50 30 0.1954
40 0.0024 116 52 31 0.1878
60 0.0019 120 54 32 0.1843
80 0.0014 116 54 31 0.1825
95 0.0014 116 54 31 0.1825
Optimization range [0.0001, 0.01] [32, 128] [16, 64] [16, 64]

6 Results explanation and disaster mitigation implications

6.1 Feature importance analysis and threshold identification in debris flow genesis

Through systematic application of SHAP methodology, this study deconstructs the internal decision processes of the IKOA-CNN-BiGRU-Attention model, revealing that the architecture captures both physical threshold effects and multifactorial regulatory relationships governing debris flow initiation. Fig 10(a) presents a quantitative hierarchical ranking of feature importance wherein potential source energy X7 emerges as the predominant factor with a mean absolute SHAP value of 1.07 × 10−2, approximately 4.6-fold greater than the second-ranked parameter. Annual mean temperature X3, annual mean vapor pressure X5, and average slope X16 constitute the second tier of importance, while annual mean rainfall X2, vegetation coverage X15, and main channel length X11 form a distinct third tier.

Fig 10. SHAP analysis of feature importance and threshold effects in debris flow formation.

Fig 10

Fig 10(b) elucidates the dependency relationship between potential source energy X7 and its corresponding SHAP values, demonstrating a threshold effect in debris flow triggering mechanisms. The relationship manifests as a sigmoidal nonlinear curve, optimally fitted by the cubic function y = 1.25 × 10-4x3 - 9.37 × 10-4x2 + 1.83 × 10-2x − 2.21 × 10−2 with an R2 value of 0.98. A critical inflection point is identified at X7 ≈ 3.12 × 109 J/m2, effectively partitioning the risk domain into two distinct regions: a low-risk zone where X7 < 3.12 × 109 J/m2 and a high-risk zone where X7 > 3.12 × 109 J/m2. Within the low-risk zone, SHAP values consistently approximate zero or exhibit slightly negative values, indicating insufficient energy accumulation for debris flow mobilization; within the high-risk zone, SHAP values demonstrate an accelerating positive trend as X7 increases, reflecting the nonlinear amplification characteristic of debris flow initiation.

The SHAP value distribution in Fig 10(c) further illuminates the distinct influence patterns of various factors on model predictions. The distribution for X7 exhibits pronounced bimodal segregation: low-value samples cluster tightly around the zero SHAP value region, while high-value samples display substantial positive shifts, reaching SHAP values exceeding 0.2, surpassing the influence range of all other features. This polarized distribution provides statistical confirmation of the threshold phenomenon. High values of maximum 24-hour rainfall X1 and annual mean relative humidity X6 similarly exhibit positive SHAP value tendencies, albeit with considerably greater dispersion, indicating that these parameters function as conditional triggers whose efficacy is contingent upon the antecedent state of source energy accumulation.

Based on the comprehensive SHAP analysis, a physically explainable mechanistic framework for debris flow formation emerges: potential source energy X7 constitutes the fundamental prerequisite condition with a quantified threshold of 3.12 × 109 J/m2; environmental parameters such as annual mean temperature X3 and annual mean vapor pressure X5 function as regulatory modulators, exerting indirect effects by altering source material stability; hydrological factors, primarily maximum 24-hour rainfall X1, operate as activation triggers that initiate debris flow mobilization only when the prerequisite energy threshold condition has been satisfied.

6.2 Factor interaction dynamics and risk response characterization

Building upon the identification of potential source energy X7 as the dominant factor, this section systematically explores factor interactions and their impact on debris flow risk prediction, revealing complex relationships with implications for monitoring and early warning systems optimization. Fig 11 illustrates the interaction effects between source energy and key environmental variables. The interaction between X7 and annual mean temperature X3 reveals a significant correlation r = 0.06, p < 0.05, with risk contribution substantially increasing when the temperature exceeds 16°C in high source energy areas. This temperature-energy interaction suggests that thermal conditions may accelerate material destabilization processes in areas with significant loose material accumulation. The practical implication for disaster prevention is that temperature monitoring should be integrated into early warning systems, particularly for regions with high material accumulation. Warning thresholds may require seasonal adjustments with heightened vigilance during warmer periods when temperatures exceed 16°C. Similarly, the interaction between X7 and annual mean vapor pressure X5 demonstrates a stronger correlation r = 0.19, p < 0.01, with high X7 samples manifesting significantly elevated SHAP values when X5 exceeds 15 hPa. This finding indicates that humidity sensors should be incorporated into monitoring networks alongside traditional rainfall gauges, particularly in areas with substantial loose material accumulation, as atmospheric moisture conditions appear to play an important role in destabilizing source materials.

Fig 11. SHAP interactions between key geomorphological and meteorological variables.

Fig 11

The interaction between maximum 24-hour rainfall X1 and X7 provides particularly valuable insights for risk management. Despite statistical non-significance in overall correlation r = 0.02, p > 0.05, clear nonlinear patterns emerge: rainfall intensity substantially impacts risk only when source energy exceeds approximately 5 × 109 J/m2, while having minimal effect below this threshold. This conditional relationship fundamentally challenges the conventional rainfall-threshold approach to debris flow warning. For early warning system design, this finding suggests implementing a hierarchical approach where areas are first classified by source energy potential, with advanced rainfall monitoring prioritized specifically in high-energy zones. For low-energy zones, basic rainfall monitoring may be sufficient regardless of precipitation intensity. Rainfall thresholds should be dynamic rather than fixed, adjusting based on source material conditions.

Fig 12 refines the threshold identification through a detailed SHAP response analysis. The upper panel presents a higher-order polynomial analysis of the X7 SHAP response curve with R2 = 0.99, identifying a critical inflection point at X7 = 3.00 × 109 J/m2. This value provides a quantitative criterion for classifying terrain vulnerability in hazard mapping exercises and prioritizing areas for source material control measures such as check dams, retaining walls, and slope stabilization projects. The upper right panel identifies a distinct inflection point in the X1 SHAP dependency curve at X1 = 27.01 mm, providing a data-driven basis for rainfall warning standards. This specific threshold offers a quantitative reference for evaluating and potentially adjusting current warning thresholds in operational early warning systems. The lower panel integrates these insights into a comprehensive two-dimensional risk contour map with three distinct risk domains that translate directly to operational warning levels. The blue region represents normal conditions where predicted risk values fall below 0.03, warranting routine monitoring procedures. The yellow zone indicates heightened alert conditions with predicted risk values between 0.03 and 0.20, necessitating increased monitoring frequency and preliminary emergency preparations. The red domain signifies emergency response conditions where the predicted risk exceeds 0.20, requiring immediate protective actions including potential evacuations.

Fig 12. Threshold identification for debris flows through SHAP response analysis.

Fig 12

Fig 13 extends this analysis into three dimensions, revealing a comprehensive risk response surface that captures the compound effects of X7 and X1. The visualization of threshold planes creates four distinct quadrants that translate to operational intervention priority zones. The first quadrant, characterized by values below both X7 and X1 thresholds, indicates low-priority zones warranting only basic monitoring. The second quadrant, where X7 exceeds its threshold while X1 remains below its critical value, demarcates source-control priority zones where engineering measures to reduce material accumulation should be emphasized. The third quadrant, characterized by X1 values above the threshold while X7 remains below its critical level, identifies hydrological management priority zones where drainage improvements and runoff control should be prioritized. The fourth quadrant, where both X7 and X1 exceed their respective thresholds, delineates comprehensive intervention zones requiring integrated approaches combining multiple strategies.

Fig 13. Three-dimensional risk surface with critical threshold planes.

Fig 13

6.3 Intervention efficacy assessment and risk reduction strategy optimization

Building on identified risk factors and thresholds, this section evaluates the comparative efficacy of different risk reduction approaches, providing quantitative insights for evidence-based disaster management decision-making. Figs 14(a) and 14(b) present comparative risk response curves for interventions targeting X7 and X1, respectively. The analysis reveals that reducing maximum 24-hour rainfall X1 from 29.5 mm to 27.0 mm yields a 58.8% risk reduction, while reducing potential source energy X7 from 8.0 × 109 J/m2 to 3.0 × 109 J/m2 achieves a 25.0% risk diminution. This substantial difference in intervention efficacy provides clear guidance for resource allocation: rainfall control measures may yield faster risk reduction returns compared to source control measures. Figs 14(c) and 14(d) further refine this analysis through spatially-resolved risk reduction percentage contour maps, revealing that vulnerability mapping should identify transition zones where risk factors are marginally above threshold values, as modest interventions in these areas may yield substantial risk reduction.

Fig 14. Comparative efficiency of source energy versus rainfall control interventions.

Fig 14

Fig 15 presents SHAP force plots from representative data points selected through stratified random sampling, revealing substantial heterogeneity in localized risk factor compositions. This heterogeneity indicates that effective risk management requires location-specific approaches tailored to the dominant local risk factors. For implementation, this suggests customizing monitoring systems based on local risk factor compositions. Locations dominated by meteorological factors would benefit from enhanced weather-based warning systems with meteorological monitoring, while areas where topographical factors predominate would likely respond better to physical infrastructure interventions rather than precipitation-based warning systems.

Fig 15. Site-specific risk factor compositions in contrasting debris flow scenarios.

Fig 15

Synthesizing these analytical results, a practical framework for precision debris flow risk management emerges with several actionable components. Early warning systems should implement rainfall thresholds that adjust based on source energy conditions rather than universal rainfall values, with lower rainfall thresholds applied in high source energy areas and higher thresholds in low energy areas. Beyond traditional rainfall monitoring, temperature and humidity sensors should be incorporated into monitoring networks, especially in areas where source energy exceeds the identified threshold. In resource-constrained environments, interventions should be sequenced according to threshold proximity, prioritizing areas just above threshold values where modest investments yield substantial risk reduction. Different zones require specific approaches: high-source energy zones warrant engineering interventions to reduce source material accumulation; critical rainfall zones require drainage improvements and runoff control; dual-threshold exceedance zones necessitate comprehensive management strategies incorporating both engineering measures and enhanced monitoring systems. Warning thresholds and monitoring intensity should be adjusted seasonally where temperature appears as a significant risk modifier, with heightened vigilance during periods exceeding 16°C. While the specific threshold values identified in this study are calibrated to the study area’s geological and climatological context, the methodological framework for threshold identification and factor interaction analysis remains transferable across diverse geographical settings, allowing regional calibration of specific thresholds while maintaining the structural integrity of the factor interaction paradigm.

7 Discussions

7.1. Stability analysis

The stability of a model is crucial for reliable debris flow hazard prediction, as it ensures consistent results across multiple independent runs. To evaluate the stability of the IKOA-CNN-BiGRU-Attention model, 50 independent experiments were conducted, recording the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and their standard deviations (SD). Lower SD values indicate higher stability and consistency in predictions. As shown in Table 5 the IKOA-CNN-BiGRU-Attention model demonstrated minimal performance fluctuations, with an RMSE of 2.33 × 10−6 ± 1.51 × 10−7 and an MAE of 1.51 × 10−6 ± 1.10 × 10−7, outperforming all comparative models. These results highlight the model’s reliability and reproducibility, making it highly suitable for practical applications in debris flow hazard prediction. Its stable performance improves the credibility of risk assessments and provides robust technical support for disaster prevention and mitigation.

Table 5. Stability analysis of each model.

No. Model RMSE±SD MAE ± SD
1 IKOA-CNN-BiGRU-Attention 2.33E-06 ± 1.51E-07 1.51E-06 ± 1.10E-07
2 PSO-SVM 0.0691 ± 0.0020 0.0352 ± 0.0015
3 GWO-ELM 0.0017 ± 0.00015 0.0008638 ± 0.00012
4 IDBO-VMD-BiLSTM 0.0636 ± 0.0012 0.0451 ± 0.0011
5 DT 0.1024 ± 0.0030 0.0522 ± 0.0023
6 Adaboost 0.103 ± 0.0035 0.052 ± 0.0028
7 IAO-BiLSTM 0.0582 ± 0.0015 0.0333 ± 0.0013
8 RF 0.1155 ± 0.004 0.0610 ± 0.0032
9 GA-BP 0.3155 ± 0.007 0.2359 ± 0.0055
10 GBDT 0.106 ± 0.0038 0.054 ± 0.0030
11 Extra trees 0.106 ± 0.0039 0.054 ± 0.0031
12 CatBoost 0.117 ± 0.0041 0.063 ± 0.0034
13 KNN 0.115 ± 0.0040 0.063 ± 0.0034
14 XGBoost 0.099 ± 0.0028 0.050 ± 0.0025

7.2. Robustness analysis

Robustness is essential for ensuring a model’s reliability when handling data disturbances such as noise or outliers, which are common in real-world geological and meteorological datasets. To evaluate the robustness of the IKOA-CNN-BiGRU-Attention model, random noise levels of 5%, 10%, and 20% were added to the test dataset to simulate possible data disturbances. The changes in performance metrics, including Root Mean Square Error (RMSE) and Mean Absolute Error (MAE), were observed to determine the model’s ability to handle noisy data. As shown in Table 6, the IKOA-CNN-BiGRU-Attention model showed minimal performance degradation. When the noise level increased from 0% to 20%, the RMSE rose by only 7.9 × 10−7, and the MAE increased by approximately 4.9 × 10−7. Compared to other models, these slight increases demonstrate the strong robustness of the proposed model, which ensures high predictive accuracy even under noisy data conditions. This robustness confirms the model’s reliability in complex environments and enhances its applicability to practical disaster prevention and mitigation efforts.

Table 6. Robustness Analysis of each model.

Model 5%Noise 10%Noise 20%Noise
RMSE MAE RMSE MAE RMSE MAE
IKOA-CNN-BiGRU-Attention 2.60E-06 1.65E-06 2.95E-06 1.80E-06 3.12E-06 2.00E-06
PSO-SVM 0.0720 0.0370 0.0780 0.0410 0.0850 0.0450
GWO-ELM 0.0018 0.0008 0.0019 0.0009 0.0021 0.0011
IDBO-VMD-BiLSTM 0.0650 0.0470 0.0720 0.0510 0.0810 0.0570
DT 0.0610 0.0370 0.0680 0.0420 0.0760 0.0490
Adaboost 0.1100 0.0560 0.1250 0.0630 0.1440 0.0740
IAO-BiLSTM 0.1080 0.0540 0.1200 0.0610 0.1320 0.0690
RF 0.1220 0.0640 0.1380 0.0710 0.1550 0.0820
GA-BP 0.3250 0.2420 0.3600 0.270 0.4100 0.3150
GBDT 0.1120 0.0570 0.1250 0.0630 0.1400 0.0710
Extra trees 0.1150 0.0590 0.1300 0.0650 0.1470 0.0750
CatBoost 0.1280 0.0670 0.1400 0.0750 0.1600 0.0840
KNN 0.1250 0.0650 0.1380 0.0730 0.1550 0.0830
XGBoost 0.1050 0.0540 0.1180 0.0600 0.1350 0.0700

7.3. Applicability analysis

The applicability of a model is crucial for assessing its practical value, particularly its suitability across different regions and datasets. To evaluate the applicability of the IKOA-CNN-BiGRU-Attention model, a 5-fold cross-validation experiment was conducted. The Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) were recorded for each fold to assess performance consistency. As shown in the results in Table 7, the RMSE ranged from 2.35 × 10−6 to 2.50 × 10−6, and the MAE ranged from 1.55 × 10−6 to 1.65 × 10−6, with minimal fluctuations across the folds. This consistent performance demonstrates the model’s strong applicability and stability, making it suitable for predicting debris flow hazards not only in the specific study area but also in other geographical regions. Its generalizability provides robust technical support for global disaster risk assessment and early warning efforts.

Table 7. Applicability analysis of the IKOA-CNN-BiGRU-Attention model.

Number of cross-validations RMSE MAE
1st Fold 2.40E-06 1.60E-06
2nd Fold 2.50E-06 1.65E-06
3rd Fold 2.35E-06 1.55E-06
4th Fold 2.42E-06 1.62E-06
5th Fold 2.45E-06 1.63E-06

8 Conclusions

This study proposes an integrated IKOA-CNN-BiGRU-Attention framework with SHAP explainability for high-precision debris flow hazard prediction that combines advanced optimization techniques, multidimensional deep learning architectures, and explainable AI mechanisms. Through systematic implementation and validation in the Nujiang River Basin, China, the study yields five principal contributions:

  • (1)

    The Improved Kepler Optimization Algorithm was developed with three key innovations: Chebyshev mapping for ergodic population initialization, golden sine operator for solution space narrowing, and dynamic weight coefficient for balanced search dynamics. Benchmark function testing demonstrated IKOA’s superior performance across standard optimization cases, consistently outperforming conventional algorithms including GWO, SSA, and WOA. This optimization algorithm, when integrated with the CNN-BiGRU-Attention architecture, achieved exceptional prediction performance with an RMSE of 2.33 × 10−6, MAE of 1.51 × 10−6, and MAPE of 0.006%, outperforming all 13 benchmark models.

  • (2)

    Comprehensive validation confirmed the framework’s reliability and generalizability for real-world implementation. Stability analysis through 50 independent experimental runs demonstrated consistent performance with RMSE of 2.33 × 10−6 ± 1.51 × 10−7, MAE of 1.51 × 10−6 ± 1.10 × 10−7. Robustness assessment under noise perturbation showed minimal performance degradation with RMSE increasing by only 7.9 × 10−7 at 20% noise injection, confirming the model’s resilience to data uncertainty. Five-fold cross-validation further validated generalizability with RMSE ranging from 2.35 × 10−6 to 2.50 × 10−6, suggesting good applicability across diverse geomorphological contexts.

  • (3)

    SHAP explainability analysis revealed a previously unrecognized hierarchical triggering mechanism in debris flow genesis. Potential source energy emerged as the primary factor with a mean absolute SHAP value of 1.07 × 10−2, approximately 4.6-fold greater than other parameters. The relationship between source energy and debris flow hazard exhibits a sigmoidal pattern fitted by a cubic function (R2 = 0.98) with a critical threshold at 3.00 × 109 J/m2. This threshold partitions the risk domain into two distinct regions: a low-risk zone where hazard potential remains minimal regardless of other factors, and a high-risk zone where hazard potential increases nonlinearly as source energy rises.

  • (4)

    The analysis identified a “dual-threshold conditional triggering” framework for debris flow hazard prediction. The maximum 24-hour rainfall threshold was established at 27.01 mm; however, this threshold’s effectiveness appears to depend on source energy conditions. When source energy remains below 5 × 109 J/m2, rainfall intensity shows minimal impact on hazard potential; once this energy threshold is exceeded, rainfall appears to function as an activation trigger.

  • (5)

    Intervention efficacy assessment quantified the differential impact of alternative risk reduction strategies. Rainfall control measures produced a 58.8% risk reduction when decreasing maximum 24-hour rainfall from 29.5 mm to 27.0 mm, compared to source energy interventions which achieved a 25.0% risk reduction when decreasing potential source energy from 8.0 × 109 J/m2 to 3.0 × 109 J/m2. These findings provide evidence-based guidance for resource allocation in disaster risk reduction programs, particularly for identifying transition zones where modest interventions may yield substantial risk reduction.

Acknowledgments

The authors would like to acknowledge the support provided by the Information Technology Center, Zhejiang University.

Data Availability

The complete dataset supporting all findings in this study is fully and openly accessible without any restrictions on Zenodo. All data is contained in a single downloadable Excel file (19.6 kB): Wang, T. (2025). Debris Flow Hazard-Causing Factors Dataset for the Yunnan Section of the Nujiang River Basin, China [Data set]. Zenodo. https://doi.org/10.5281/zenodo.15050116.

Funding Statement

This work was supported by the China Scholarship Council (No. 202406320358) and the 1st Doctoral Student Program of the Young Elite Scientists Sponsorship Program by CAST (no award number). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Gong W, Juang CH, Wasowski J. Geohazards and human settlements: Lessons learned from multiple relocation events in Badong, China – Engineering geologist’s perspective. Eng Geology. 2021;285:106051. doi: 10.1016/j.enggeo.2021.106051 [DOI] [Google Scholar]
  • 2.Wang T, Ge Q, Ma T, Chen H, Luo R, Wang X, et al. A novel method for predicting debris flow hazard: a multi-strategy fusion approach based on the light gradient boosting machine framework. Stoch Environ Res Risk Assess. 2025. doi: 10.1007/s00477-025-02955-9 [DOI] [Google Scholar]
  • 3.Kim JW, Kim HG. Simulating the extent of landslide damage using LAHARZ: a case study for yecheon-gun, gyeongsangbuk-do. J People Plants Environ. 2025;28(1):35–44. doi: 10.11628/ksppe.2025.28.1.35 [DOI] [Google Scholar]
  • 4.Dixit S, Siva Subramanian S, Srivastava P, Yunus AP, Martha TR, Sen S. Numerical-model-derived intensity–duration thresholds for early warning of rainfall-induced debris flows in a Himalayan catchment. Nat Hazards Earth Syst Sci. 2024;24(2):465–80. doi: 10.5194/nhess-24-465-2024 [DOI] [Google Scholar]
  • 5.Bolliger D, Schlunegger F, McArdell BW. Comparison of debris flow observations, including fine-sediment grain size and composition and runout model results, at Illgraben, Swiss Alps. Nat Hazards Earth Syst Sci. 2024;24(3):1035–49. doi: 10.5194/nhess-24-1035-2024 [DOI] [Google Scholar]
  • 6.K. C. D, Naqvi MW, Hu L. A case study and numerical modeling of post-wildfire debris flows in Montecito, California. Water. 2024;16(9):1285. doi: 10.3390/w16091285 [DOI] [Google Scholar]
  • 7.Patton AI, Luna LV, Roering JJ, Jacobs A, Korup O, Mirus BB. Landslide initiation thresholds in data-sparse regions: application to landslide early warning criteria in Sitka, Alaska, USA. Nat Hazards Earth Syst Sci. 2023;23(10):3261–84. doi: 10.5194/nhess-23-3261-2023 [DOI] [Google Scholar]
  • 8.Onaopemipo Akintola M. Enhancing disaster response and resilience through near-time GIS for flood monitoring and analysis in Niger River Basin, Nigeria. Int Arch Photogramm Remote Sens Spatial Inf Sci. 2024;XLVIII-3–2024:377–85. doi: 10.5194/isprs-archives-xlviii-3-2024-377-2024 [DOI] [Google Scholar]
  • 9.Barman J, Das J. Assessing classification system for landslide susceptibility using frequency ratio, analytical hierarchical process and geospatial technology mapping in Aizawl district, NE India. Adv Space Res. 2024;74(3):1197–224. doi: 10.1016/j.asr.2024.05.007 [DOI] [Google Scholar]
  • 10.Mohseni U, Jat PK, Siriteja V. Multi-criteria analysis-based mapping of the cyclone-induced pluvial flooding in coastal areas of India. DYSONA - Appl Sci. 2025;6:309–21. doi: 10.30493/das.2025.490282 [DOI] [Google Scholar]
  • 11.Wang N, Zhang H, Dahal A, Cheng W, Zhao M, Lombardo L. On the use of explainable AI for susceptibility modeling: Examining the spatial pattern of SHAP values. Geosci Front. 2024;15(4):101800. doi: 10.1016/j.gsf.2024.101800 [DOI] [Google Scholar]
  • 12.Li M, Li L, Lai Y, He L, He Z, Wang Z. Geological hazard susceptibility analysis based on RF, SVM, and NB models, using the puge section of the Zemu river valley as an example. Sustainability. 2023;15(14):11228. doi: 10.3390/su151411228 [DOI] [Google Scholar]
  • 13.Effects of flow regimes on the interaction between granular flow and flexible barrier. - document - gale academic OneFile. [cited 17 Apr 2025]. Available: https://go-gale-com.remotexs.ntu.edu.sg/ps/i.do?p=AONE&u=nantecun&id=GALE%7CA762819892&v=2.1&it=r [Google Scholar]
  • 14.Ho LS, Tran VQ. Evaluation and estimation of compressive strength of concrete masonry prism using gradient boosting algorithm. PLoS One. 2024;19(3):e0297364. doi: 10.1371/journal.pone.0297364 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.DGS-SCSC: enhancing sand cat swarm optimization with dynamic pinhole imaging and golden sine algorithm for improved numerical optimization performance. Sci Rep. 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hasanah SA, Pravitasari AA, Abdullah AS, Yulita IN, Asnawi MH. A deep learning review of ResNet architecture for lung disease identification in CXR image. Appl Sci. 2023;13(24):13111. doi: 10.3390/app132413111 [DOI] [Google Scholar]
  • 17.Dahal A, Lombardo L. Explainable artificial intelligence in geoscience: A glimpse into the future of landslide susceptibility modeling. Comput Geosci. 2023;176:105364. doi: 10.1016/j.cageo.2023.105364 [DOI] [Google Scholar]
  • 18.Lundberg SM, Lee S-I. A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems. Curran Associates, Inc.; 2017. Available: https://proceedings.neurips.cc/paper/2017/hash/8a20a8621978632d76c43dfd28b67767-Abstract.html [Google Scholar]
  • 19.Okolie C, Mills J, Adeleke A, Smit J, Maduako I. The explainability of gradient-boosted decision trees for Digital Elevation Model (DEM) error prediction. Int Arch Photogramm Remote Sens Spatial Inf Sci. 2023;XLVIII-M-3–2023:161–8. doi: 10.5194/isprs-archives-xlviii-m-3-2023-161-2023 [DOI] [Google Scholar]
  • 20.Sepehri M, Malekinezhad H, Hosseini SZ, Ildoromi AR. Assessment of flood hazard mapping in urban areas using entropy weighting method: a case study in Hamadan city, Iran. Acta Geophys. 2019;67(5):1435–49. doi: 10.1007/s11600-019-00342-x [DOI] [Google Scholar]
  • 21.Chaudhari K, Thakkar A. Neural network systems with an integrated coefficient of variation-based feature selection for stock price and trend prediction. Expert Sys Appl. 2023;219:119527. doi: 10.1016/j.eswa.2023.119527 [DOI] [Google Scholar]
  • 22.Singh T, Singh V, Ranakoti L, Kumar S. Optimization on tribological properties of natural fiber reinforced brake friction composite materials: Effect of objective and subjective weighting methods. Polymer Testing. 2023;117:107873. doi: 10.1016/j.polymertesting.2022.107873 [DOI] [Google Scholar]
  • 23.Abdel-Basset M, Mohamed R, Azeem SAA, Jameel M, Abouhawwash M. Kepler optimization algorithm: A new metaheuristic algorithm inspired by Kepler’s laws of planetary motion. Knowledge-Based Sys. 2023;268:110454. doi: 10.1016/j.knosys.2023.110454 [DOI] [Google Scholar]
  • 24.Agrawal P, Ganesh T, Mohamed AW. Chaotic gaining sharing knowledge-based optimization algorithm: an improved metaheuristic algorithm for feature selection. Soft Comput. 2021;25(14):9505–28. doi: 10.1007/s00500-021-05874-3 [DOI] [Google Scholar]
  • 25.Tanyildizi E, Demir G. Golden sine algorithm: a novel math-inspired algorithm. Adv Electr Comp Eng. 2017;17(2):71–8. doi: 10.4316/aece.2017.02010 [DOI] [Google Scholar]
  • 26.Suganthan PN, Hansen N, Liang JJ, Deb K. Problem definitions and evaluation criteria for the CEC 2005 special session on real-parameter optimization. CEC. 2005. [Google Scholar]
  • 27.Mirboluki A, Mehraein M, Kisi O, Kuriqi A, Barati R. Groundwater level estimation using improved deep learning and soft computing methods. Earth Sci Inform. 2024;17(3):2587–608. doi: 10.1007/s12145-024-01300-y [DOI] [Google Scholar]
  • 28.Mirjalili S, Lewis A. The whale optimization algorithm. Adv Eng Software. 2016;95:51–67. doi: 10.1016/j.advengsoft.2016.01.008 [DOI] [Google Scholar]
  • 29.Xue J, Shen B. A novel swarm intelligence optimization approach: sparrow search algorithm. Sys Sci Control Eng. 2020;8(1):22–34. doi: 10.1080/21642583.2019.1708830 [DOI] [Google Scholar]
  • 30.Hosseinpour-Zarnaq M, Omid M, Sarmadian F, Ghasemi-Mobtaker H. A CNN model for predicting soil properties using VIS–NIR spectral data. Environ Earth Sci. 2023;82(16). doi: 10.1007/s12665-023-11073-0 [DOI] [Google Scholar]
  • 31.She D, Jia M. A BiGRU method for remaining useful life prediction of machinery. Measurement. 2021;167:108277. doi: 10.1016/j.measurement.2020.108277 [DOI] [Google Scholar]
  • 32.Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention Is All You Need. In: Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S, et al., editors. Advances in neural information processing systems 30 (NIPS 2017). La Jolla: Neural Information Processing Systems (nips); 2017. Available: https://www.webofscience.com/wos/woscc/full-record/WOS:000452649406008 [Google Scholar]
  • 33.Lee D-H, Cheon E, Lim H-H, Choi S-K, Kim Y-T, Lee S-R. An artificial neural network model to predict debris-flow volumes caused by extreme rainfall in the central region of South Korea. Eng Geol. 2021;281:105979. doi: 10.1016/j.enggeo.2020.105979 [DOI] [Google Scholar]
  • 34.Kidron GJ, Kronenfeld R, Starinsky A. Wind as a cooling agent: substrate temperatures are responsible for variable lithobiont‐induced weathering patterns on west‐ and east‐facing limestone bedrock of the Negev. Earth Surf Processes Landf. 2016;41(14):2078–84. doi: 10.1002/esp.3973 [DOI] [Google Scholar]
  • 35.Eppes MC, Magi B, Scheff J, Warren K, Ching S, Feng T. Warmer, wetter climates accelerate mechanical weathering in field data, independent of stress‐loading. Geophysical Res Letters. 2020;47(24). doi: 10.1029/2020gl089062 [DOI] [Google Scholar]
  • 36.Eppes M, Keanini R. Mechanical weathering and rock erosion by climate‐dependent subcritical cracking. Rev Geophysics. 2017;55(2):470–508. doi: 10.1002/2017rg000557 [DOI] [Google Scholar]
  • 37.Deng K, Yang S, Guo Y. A global temperature control of silicate weathering intensity. Nat Commun. 2022;13(1):1781. doi: 10.1038/s41467-022-29415-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Bai B, Xu T, Nie Q, Li P. Temperature-driven migration of heavy metal Pb2+ along with moisture movement in unsaturated soils. Int J Heat and Mass Transfer. 2020;153:119573. doi: 10.1016/j.ijheatmasstransfer.2020.119573 [DOI] [Google Scholar]
  • 39.Bai B, Zhou R, Yang G, Zou W, Yuan W. The constitutive behavior and dissociation effect of hydrate-bearing sediment within a granular thermodynamic framework. Ocean Eng. 2023;268:113408. doi: 10.1016/j.oceaneng.2022.113408 [DOI] [Google Scholar]
  • 40.Zhang W, Chen J, Wang Q, An Y, Qian X, Xiang L, et al. Susceptibility analysis of large-scale debris flows based on combination weighting and extension methods. Nat Hazards. 2012;66(2):1073–100. doi: 10.1007/s11069-012-0539-0 [DOI] [Google Scholar]
  • 41.Ma C, Hu K, Tian M. Comparison of debris-flow volume and activity under different formation conditions. Nat Hazards. 2013;67(2):261–73. doi: 10.1007/s11069-013-0557-6 [DOI] [Google Scholar]
  • 42.Heiser M, Scheidl C, Eisl J, Spangl B, Hübl J. Process type identification in torrential catchments in the eastern Alps. Geomorphology. 2015;232:239–47. doi: 10.1016/j.geomorph.2015.01.007 [DOI] [Google Scholar]
  • 43.Zhao Y, Meng X, Qi T, Qing F, Xiong M, Li Y, et al. AI-based identification of low-frequency debris flow catchments in the Bailong River basin, China. Geomorphology. 2020;359:107125. doi: 10.1016/j.geomorph.2020.107125 [DOI] [Google Scholar]
  • 44.Li L, Ni B, Qiang Y, Zhang S, Zhao D, Zhou L. Risk assessment of debris flow disaster based on the cloud model-Probability fusion method. PLoS One. 2023;18(2):e0281039. doi: 10.1371/journal.pone.0281039 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.DI BF, CHEN NS, CUI P, LI ZL, HE YP, GAO YC. GIS-based risk analysis of debris flow: an application in Sichuan, southwest China. Int J Sediment Res. 2008;23(2):138–48. doi: 10.1016/s1001-6279(08)60013-x [DOI] [Google Scholar]
  • 46.Modeling multiphase debris floods down straight and meandering channels. JAFM. 2024;17(1). doi: 10.47176/jafm.17.1.1917 [DOI] [Google Scholar]
  • 47.Panchal S, Shrivastava AKr. Landslide hazard assessment using analytic hierarchy process (AHP): A case study of National Highway 5 in India. Ain Shams Engineering Journal. 2022;13(3):101626. doi: 10.1016/j.asej.2021.10.021 [DOI] [Google Scholar]
  • 48.Shi M, Chen J, Song Y, Zhang W, Song S, Zhang X. Assessing debris flow susceptibility in Heshigten Banner, Inner Mongolia, China, using principal component analysis and an improved fuzzy C-means algorithm. Bull Eng Geol Environ. 2015;75(3):909–22. doi: 10.1007/s10064-015-0784-z [DOI] [Google Scholar]
  • 49.Bertrand M, Liébault F, Piégay H. Debris-flow susceptibility of upland catchments. Nat Hazards. 2013;67(2):497–511. doi: 10.1007/s11069-013-0575-4 [DOI] [Google Scholar]
  • 50.Brardinoni F, Church M, Simoni A, Macconi P. Lithologic and glacially conditioned controls on regional debris-flow sediment dynamics. Geology. 2012;40(5):455–8. doi: 10.1130/g33106.1 [DOI] [Google Scholar]
  • 51.O’brien RM. A caution regarding rules of thumb for variance inflation factors. Qual Quant. 2007;41(5):673–90. doi: 10.1007/s11135-006-9018-6 [DOI] [Google Scholar]
  • 52.Dharmarathne G, Waduge AO, Bogahawaththa M, Rathnayake U, Meddage DPP. Adapting cities to the surge: A comprehensive review of climate-induced urban flooding. Results in Eng. 2024;22:102123. doi: 10.1016/j.rineng.2024.102123 [DOI] [Google Scholar]
  • 53.Wang T, Luo R, Ma T, Chen H, Zhang K, Wang X, et al. Study and verification on an improved comprehensive prediction model of landslide displacement. Bull Eng Geol Environ. 2024;83(3). doi: 10.1007/s10064-024-03581-5 [DOI] [Google Scholar]
  • 54.Wang T, Peng D, Wang X, Wu B, Luo R, Chu Z, et al. Study on wavelet multi-scale analysis and prediction of landslide groundwater. J Hydroinformatics. 2023;26(1):237–54. doi: 10.2166/hydro.2023.299 [DOI] [Google Scholar]
  • 55.Tran HTT, Nguyen QH, Pham TH, Ngo GTH, Pham NTD, Pham TG, et al. Novel learning of bathymetry from landsat 9 imagery using machine learning, feature extraction and meta-heuristic optimization in a shallow Turbid Lagoon. Geosci. 2024;14(5):130. doi: 10.3390/geosciences14050130 [DOI] [Google Scholar]

Decision Letter 0

Linwei Li

Dear Dr. Wang,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

The review process is now complete, and we have two reports submitted by the expert reviewers. As can be seen from these reports, one of them requested revision before the manuscript was accepted. However, another explicitly recommended rejection without revision. The author must be aware that the comments of reviewers #2 are highly critical of the overall study. Therefore, I suggest the authors carefully revise this manuscript point by point according to all the comments. The resubmitted manuscript will be reviewed again.

In the end, I want to thank the reviewers for their great efforts on your manuscript and you for your submission.

==============================

Please submit your revised manuscript by May 17 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols .

We look forward to receiving your revised manuscript.

Kind regards,

Linwei Li

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, we expect all author-generated code to be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match.

When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section.

4. We note that Figure 6 in your submission contain [map/satellite] images which may be copyrighted. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For these reasons, we cannot publish previously copyrighted maps or satellite images created using proprietary data, such as Google software (Google Maps, Street View, and Earth). For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (1) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (2) remove the figures from your submission:

1. You may seek permission from the original copyright holder of Figure 6 to publish the content specifically under the CC BY 4.0 license. 

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission.

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

2. If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

The following resources for replacing copyrighted map figures may be helpful:

USGS National Map Viewer (public domain): http://viewer.nationalmap.gov/viewer/

The Gateway to Astronaut Photography of Earth (public domain): http://eol.jsc.nasa.gov/sseop/clickmap/

Maps at the CIA (public domain): https://www.cia.gov/library/publications/the-world-factbook/index.html and https://www.cia.gov/library/publications/cia-maps-publications/index.html

NASA Earth Observatory (public domain): http://earthobservatory.nasa.gov/

Landsat: http://landsat.visibleearth.nasa.gov/

USGS EROS (Earth Resources Observatory and Science (EROS) Center) (public domain): http://eros.usgs.gov/#

Natural Earth (public domain): http://www.naturalearthdata.com/

Additional Editor Comments:

None

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Yes

Reviewer #2: No

**********

2. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #1: Yes

Reviewer #2: No

**********

3. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy

Reviewer #1: Yes

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #1: Yes

Reviewer #2: Yes

**********

Reviewer #1: This manuscript presents an integrated framework that combines IKOA-CNN-BiGRU-Attention architecture with SHAP explainability analysis for debris flow hazard prediction in the Nujiang River Basin, China. The research demonstrates innovation in the field of debris flow hazard prediction, particularly in the integration of optimization algorithms, deep learning architectures, and explainability mechanisms. However, some methodological issues should be addressed, particularly regarding the validation of prediction accuracy, reconciliation of variable importance rankings, and justification of analytical choices. I recommend revision before the manuscript can be considered for publication:

(1) The introduction's exposition of five paradigms in debris flow prediction research requires structural refinement. The transition between the discussion of the "black box problem" (lines 87-94) and XAI methodologies (lines 95-103) lacks sufficient logical connectivity, thereby undermining the theoretical justification for adopting SHAP methodology in this investigation. I recommend incorporating appropriate transitional discourse to articulate explicitly why SHAP methodology constitutes an optimal approach for addressing the research questions, and to establish a more robust conceptual linkage between the chosen methodology and the research objectives.

(2) The experimental results presented in Table 3 indicate that the IKOA-CNN-BiGRU-Attention model achieves remarkably low error metrics: RMSE of 2.33×10-6, MAE of 1.51×10-6, and MAPE of 0.006%. Given the inherent complexity and stochasticity of debris flow phenomena, characterized by non-linear processes and substantial data uncertainties, such extraordinarily precise predictions warrant careful scrutiny. The authors should provide a comprehensive explanation of the validation procedures employed to ensure result reliability, discuss potential limitations of the model when deployed in real-world scenarios, and elucidate the practical significance of such high-precision forecasting capabilities for operational debris flow warning systems.

(3) A salient discrepancy exists between the SHAP global explanation analysis (lines 449-479) and the comprehensive weighting results (lines 408-425). Figure 8 identifies potential source energy (X7, 26.98%), catchment area (X8, 17.38%), and main channel length (X11, 9.65%) as the predominant influential variables, whereas Figure 10's SHAP analysis presents a divergent hierarchical ordering, with X1 (maximum 24-hour rainfall) and X2 (annual average rainfall) occupying the second and third positions, respectively. This inconsistency in factor importance ranking necessitates a thorough explication to reconcile these seemingly contradictory findings and to strengthen the credibility of the research conclusions.

(4) The SHAP local explanation analysis (lines 480-507) selectively examines only sample points 15 and 37, without providing any methodological rationale for this selection. If these particular sample points were chosen as representative case studies, their geological characteristics and typological representativeness should be explicitly delineated; alternatively, if they were randomly selected, the sampling methodology should be transparently documented to mitigate concerns regarding selection bias. Enhancement of this section will substantially augment the persuasiveness of the SHAP analytical findings.

(5) The multicollinearity diagnostic procedure (lines 385-394) employs VIF<5 as a threshold criterion, rather than the more conventionally accepted standard of VIF<10 in statistical literature. The authors should provide a theoretical or empirical justification for adopting this more stringent threshold, and should include the original VIF values for the eliminated variables (X10, X14, X17, and X19) to enable readers to independently evaluate the validity of the variable selection process. Such transparency will enhance the methodological rigor and reproducibility of the research.

(6) The bibliographic apparatus exhibits significant inconsistencies in formatting protocol. References 2, 3, and 4 lack requisite Digital Object Identifier information, whereas subsequent entries incorporate complete digital identifiers. It is imperative that the authors systematically incorporate standardized DOI information for all references in accordance with established journal citation guidelines. Such standardization not only conforms to contemporary bibliographic conventions but also facilitates efficient access to primary literature sources for the scholarly audience.

(7) Citation methodologies within the manuscript text demonstrate concerning inconsistency in application. Line 44 employs the author-date format "Berti and Simoni (2007)," while alternative sections utilize numerical references enclosed in square brackets. Scholarly writing necessitates methodological consistency; therefore, the authors must adopt a uniform citation system throughout the manuscript. This standardization should adhere to established academic protocols, which demand consistency in referential apparatus as a fundamental aspect of scholarly communication.

(8) Reference citation practices delineated in Table 2 manifest substantial heterogeneity in presentational format. Certain entries present full author nomenclature such as "Deuk-Hwan Lee," while others utilize surnames accompanied by temporal indicators such as "Bing Bai in 2020." This inconsistency contravenes established bibliographic principles. Standardization of citation format in accordance with the target journal's stylistic requirements is essential to maintain scholarly integrity and enhance the academic presentation of the manuscript.

(9) Table 2 incorporates the abbreviated designation "SS" without providing requisite definitional clarity or terminological explication. Scholarly convention necessitates that all abbreviations, particularly those serving as categorical identifiers within tabular presentations, receive proper elucidation. The authors must provide explicit definition of this abbreviation within the table notes to ensure unambiguous interpretation of the tabular content, thus conforming to fundamental principles of scholarly discourse and scientific communication.

Reviewer #2: The data quality in this paper is very poor, and the model has significant flaws, as it fails to consider the key triggering mechanisms of debris flows. The study lacks an in-depth analysis of critical geological and hydrological variables, making the proposed deep learning approach questionable in terms of its applicability to debris flow prediction. The model is entirely data-driven without sufficient justification for its reliability or physical relevance. Furthermore, the discussion section lacks scientific value, as it does not provide a meaningful interpretation of the prediction results. The SHAP-based analysis of variable importance also fails to offer practical disaster mitigation strategies. Overall, this paper lacks both practical and scientific value. Rejection is recommended.

**********

what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/ . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org . Please note that Supporting Information files do not need this step.

PLoS One. 2025 Jun 24;20(6):e0326587. doi: 10.1371/journal.pone.0326587.r003

Author response to Decision Letter 1


30 Apr 2025

Response to Editor and Reviewers

Dear editors and reviewers,

Thank you for your letter and for the reviewers' comments concerning our manuscript entitled “An Integrated IKOA-CNN-BiGRU-Attention Framework with SHAP Explainability for High-Precision Debris Flow Hazard Prediction in the Nujiang River Basin, China” (ID: PONE-D-25-14786). Those comments are all valuable and very helpful for revising and improving our paper, as well as providing important guiding significance to our research. We have studied comments carefully and have made corrections which we hope meet with approval. We also highlighted the changes using red fonts in the attached Revised full manuscript file to make it easier for you to check our revision. The principal revisions made within this paper and responses to the reviewers' comments are delineated as follows.

Sincerely Yours,

Dr. Tianlong Wang

Response to Reviewer 1

(Original comments and queries are in blue, while our responses are in black)

[General Comment]

This manuscript presents an integrated framework that combines IKOA-CNN-BiGRU-Attention architecture with SHAP explainability analysis for debris flow hazard prediction in the Nujiang River Basin, China. The research demonstrates innovation in the field of debris flow hazard prediction, particularly in the integration of optimization algorithms, deep learning architectures, and explainability mechanisms. However, some methodological issues should be addressed, particularly regarding the validation of prediction accuracy, reconciliation of variable importance rankings, and justification of analytical choices.

Response:

We thank Reviewer 1 for their recognition of the manuscript’s methodological innovation and for providing nine detailed and constructive comments, which have been instrumental in refining the quality and clarity of our work. In this revision, we have systematically addressed each of the reviewer’s specific points, and all corresponding modifications have been carefully incorporated and are clearly marked in red in the revised manuscript. In particular, Section 6 has been substantially expanded to provide a thorough and transparent explanation of model interpretability, SHAP-based analysis of factor importance, threshold identification, and the mechanistic basis of debris flow initiation, alongside clearer articulation of the operational implications for early warning and risk mitigation strategies. These enhancements are presented in the results and conclusions, with relevant details located in lines 14 to 30, 35 to 147, 387 to 393, 401 to 414, 462 to 632, 674 to 720. The manuscript now clarifies the integrated and complementary roles of comprehensive weighting and SHAP analysis for interpreting variable significance, and it provides a more rigorous justification for key methodological choices, including multicollinearity diagnosis and the hierarchical model architecture. Each of the reviewer’s nine comments, spanning data description, factor selection, interpretation, model comparison, and practical application, has been addressed through targeted clarifications or new analyses as detailed in our point-by-point responses and reflected in the red-marked revisions. We are confident that these comprehensive, transparent, and rigorously implemented revisions significantly enhance the scientific strength, interpretability, and practical relevance of the manuscript. We sincerely thank the reviewer for their guidance, which has been essential to these improvements.

[Comment 1] The introduction's exposition of five paradigms in debris flow prediction research requires structural refinement. The transition between the discussion of the "black box problem" (lines 87-94) and XAI methodologies (lines 95-103) lacks sufficient logical connectivity, thereby undermining the theoretical justification for adopting SHAP methodology in this investigation. I recommend incorporating appropriate transitional discourse to articulate explicitly why SHAP methodology constitutes an optimal approach for addressing the research questions, and to establish a more robust conceptual linkage between the chosen methodology and the research objectives.

Response:

Thank you for this valuable suggestion regarding the structural refinement of the introduction. We fully agree that a robust logical transition between the discussion of the “black box problem” and the adoption of XAI methodologies is fundamental to establishing a solid theoretical foundation for our study. In the revised manuscript, we have addressed this concern by introducing a new paragraph at lines 96–103, which explicitly articulates the three core research gaps in the existing literature. This addition establishes a critical conceptual connection between the inherent limitations of conventional machine learning approaches, particularly their well-documented explainability deficit, and the imperative need for adopting transparent, interpretable methodologies to effectively address persistent challenges in debris flow hazard prediction. Building on this foundation, we have further strengthened the rationale for selecting the SHAP methodology by emphasizing its rigorous mathematical basis in cooperative game theory (line 107), and by citing its demonstrated analytical power in recent earth science applications (lines 108–113). Finally, we have clarified in lines 114–121 how our methodological framework, centered on the integrated use of IKOA-CNN-BiGRU-Attention and SHAP, is purposefully designed to resolve these research gaps and to align directly with our stated research objectives.

[Comment 2] The experimental results presented in Table 3 indicate that the IKOA-CNN-BiGRU-Attention model achieves remarkably low error metrics: RMSE of 2.33×10-6, MAE of 1.51×10-6, and MAPE of 0.006%. Given the inherent complexity and stochasticity of debris flow phenomena, characterized by non-linear processes and substantial data uncertainties, such extraordinarily precise predictions warrant careful scrutiny. The authors should provide a comprehensive explanation of the validation procedures employed to ensure result reliability, discuss potential limitations of the model when deployed in real-world scenarios, and elucidate the practical significance of such high-precision forecasting capabilities for operational debris flow warning systems.

Response:

We thank the reviewer for highlighting the necessity of critically scrutinizing the remarkably low error metrics achieved by our model, taking into account the inherent complexity and uncertainty of debris flow phenomena.

To ensure the reliability of our results, we have incorporated a comprehensive suite of validation procedures. The stability of the IKOA-CNN-BiGRU-Attention model was assessed through fifty independent experimental runs, with results demonstrating exceptional reproducibility and minimal performance fluctuation, as shown by the RMSE of 2.33×10-6 with a standard deviation of 1.51×10-7 (lines 634 to 646, Table 5). Robustness was further evaluated by introducing random noise at levels of five, ten, and twenty percent into the test data, revealing only marginal increases in RMSE, which underscores the model’s resilience to the types of data uncertainty frequently encountered in real-world geomorphological and meteorological measurements (lines 647 to 660, Table 6). Applicability was established via five-fold cross-validation, which confirmed consistent predictive performance across multiple data partitions and verified the model’s generalizability to new geographical contexts (lines 661 to 672, Table 7).

The practical significance of these high-precision capabilities is substantiated in Section 6, where explainability analyses using SHAP provide both interpretable and actionable insights. We have identified quantitative thresholds for key risk factors, such as a critical inflection point for source energy at 3.00×109 J/m2 and a maximum 24-hour rainfall value of 27.01 mm, which can serve as rigorous criteria for operational warning system design (lines 462 to 577). Furthermore, the analysis of factor interactions demonstrates that rainfall only meaningfully increases risk when source energy exceeds approximately 5×10⁹ J/m², thereby supporting the development of hierarchical and context-sensitive early warning protocols. Our assessment of intervention efficacy reveals that rainfall control measures can reduce risk by 58.8 percent, whereas source energy interventions achieve a 25 percent reduction, offering objective guidance for the prioritization of disaster mitigation strategies (lines 578 to 630).

We also recognize that, while the specific threshold values determined in this study are optimized for the Nujiang River Basin, the underlying methodological framework remains transferable and can be regionally calibrated for other settings, as discussed in lines 626 to 630. This recognition of both the contextual specificity and the generalizability of our approach ensures its practical utility for operational systems. Finally, the manuscript now provides explicit recommendations for real-world implementation, including dynamic adjustment of rainfall thresholds based on source energy conditions, integration of temperature and humidity monitoring, and prioritization of interventions according to threshold proximity, all of which are intended to support effective and adaptive risk management in diverse debris flow-prone environments.

[Comment 3] A salient discrepancy exists between the SHAP global explanation analysis (lines 449-479) and the comprehensive weighting results (lines 408-425). Figure 8 identifies potential source energy (X7, 26.98%), catchment area (X8, 17.38%), and main channel length (X11, 9.65%) as the predominant influential variables, whereas Figure 10's SHAP analysis presents a divergent hierarchical ordering, with X1 (maximum 24-hour rainfall) and X2 (annual average rainfall) occupying the second and third positions, respectively. This inconsistency in factor importance ranking necessitates a thorough explication to reconcile these seemingly contradictory findings and to strengthen the credibility of the research conclusions.

Response:

We thank the reviewer for highlighting the apparent discrepancy between the comprehensive weighting results and the SHAP global explanation analysis. This difference does not represent a contradiction but rather reflects the distinct methodological objectives and analytical stages of the two approaches. The comprehensive weighting method, based on the integration of the coefficient of variation, entropy weight, and CRITIC methods using game theory principles (lines 150 to 175), is applied during the pre-modeling stage to assess the intrinsic statistical importance of each factor within the dataset. In contrast, the SHAP analysis provides a post-modeling interpretation by quantifying the marginal contribution of each variable to the model’s output, as determined by the specific nonlinear interactions learned by the neural network (lines 462 to 507). Furthermore, SHAP uniquely accounts for complex conditional feature interactions, as demonstrated in Fig 10(c) and discussed in lines 488 to 499, which the comprehensive weighting approach does not capture.

It is important to note that both methods identify potential source energy (X7) as the most influential variable, thereby providing mutual validation and supporting the dual-threshold conditional triggering mechanism highlighted in lines 703 to 710. The divergence in the ranking of secondary factors, with the comprehensive weighting method emphasizing geographical features (X8, X11) and the SHAP analysis highlighting meteorological and environmental variables (X1, X2, X3, X5, X16), reveals complementary dimensions of debris flow genesis. The comprehensive weighting results suggest that geographical variables serve as the fundamental structural determinants of hazard susceptibility at the catchment scale. In comparison, the SHAP analysis demonstrates that, following the establishment of these geographical preconditions, meteorological and environmental factors act as regulatory modulators and conditional activation triggers. This complementary perspective clarifies the hierarchical causality of debris flow initiation, with geographical features establishing baseline susceptibility, source energy accumulation acting as a necessary precondition, and meteorological conditions providing the critical activation trigger. These insights directly inform hazard management strategies, as elaborated in Section 6.3 (lines 578 to 632), and together strengthen the credibility and interpretability of our research conclusions.

[Comment 4] The SHAP local explanation analysis (lines 480-507) selectively examines only sample points 15 and 37, without providing any methodological rationale for this selection. If these particular sample points were chosen as representative case studies, their geological characteristics and typological representativeness should be explicitly delineated; alternatively, if they were randomly selected, the sampling methodology should be transparently documented to mitigate concerns regarding selection bias. Enhancement of this section will substantially augment the persuasiveness of the SHAP analytical findings.

Response:

We thank the reviewer for identifying the need for greater methodological transparency regarding the selection of sample points in our SHAP local explanation analysis. In response, we have substantially revised Section 6.3 (lines 578 to 632) to provide a clear and rigorous account of our sampling methodology and to ensure the representativeness of the selected cases.

First, the revised manuscript now explicitly states that sample points for SHAP local analysis were chosen using a stratified random sampling approach, as described in lines 598 to 606. This method ensures balanced representation across the full spectrum of predicted hazard levels and mitigates concerns regarding selection bias. By employing stratified sampling, we captured the substantial heterogeneity in localized risk factor compositions, thereby justifying the focus on diverse representative cases rather than arbitrary or isolated examples.

Second, we have clarified the geological and typological representativeness of the selected points by directly linking each sample to its specific geomorphological context. As noted in lines 607 to 625, this approach demonstrates that effective risk management requires location-specific strategies tailored to the predominant risk factors present in each geological setting.

Third, the practical implications of the local SHAP analysis have been strengthened. The manuscript now highlights that locations dominated by meteorological factors would benefit most from enhanced weather-based warning systems, while areas where topographical factors predominate are better suited to physical infrastructure interventions. These distinctions are articulated in lines 607 to 625, providing clear operational guidance informed by the local analysis.

Finally, the local explanation analysis has been fully integrated into our broader analytical framework. The revised text explicitly connects these findings to our precision debris flow risk management strategy (lines 607 to 625), demonstrating how local factor heterogeneity informs actionable interventions within the practical framework proposed. Furthermore, we have expanded the discussion of intervention strategies by recommending a hierarchical management approach, including the classification of areas by source energy potential and tailored mitigation measures for different risk zones (lines 578 to 632).

[Comment 5] The multicollinearity diagnostic procedure (lines 385-394) employs VIF<5 as a threshold criterion, rather than the more conventionally accepted standard of VIF<10 in statistical literature. The authors should provide a theoretical or empirical justification for adopting this more stringent threshold, and should include the original VIF values for the eliminated variables (X10, X14, X17, and X19) to enable readers to independently evaluate the validity of the variable selection process. Such transparency will enhance the methodological rigor and reproducibility of the rese

Attachment

Submitted filename: Response to Reviewers.docx

pone.0326587.s002.docx (22.7MB, docx)

Decision Letter 1

Linwei Li

Dear Dr. Wang,

In addition, I would like to bring to your attention that citing the papers suggested by the reviewers is not mandatory for your revised manuscript. It is entirely up to you whether or not you choose to include the suggested papers in your revised version. The reviewers have provided these suggestions to enhance the quality and credibility of your research, but ultimately, the decision is yours. You have the freedom to decline including any of the suggested papers in your revised manuscript if you feel they are not relevant or do not add value to your study.

Meanwhile, I want to thank the reviewers for their great efforts on your manuscript and you for your submission.

Please submit your revised manuscript by Jun 30 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org . When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols . Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols .

We look forward to receiving your revised manuscript.

Kind regards,

Linwei Li

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #1: All comments have been addressed

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions??>

Reviewer #1: Yes

Reviewer #3: (No Response)

**********

3. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #1: Yes

Reviewer #3: (No Response)

**********

4. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy

Reviewer #1: Yes

Reviewer #3: (No Response)

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #1: Yes

Reviewer #3: (No Response)

**********

Reviewer #1: The author has revised the manuscript according to the comments. I have no further comments and recommend it for acceptance.

Reviewer #3: This study develops an interpretable deep learning model for predicting debris flow hazards in the Nujiang River Basin, China, leveraging advanced neural network architectures and optimization techniques. The manuscript is clearly an exceptional effort by the authors and the presented model provides a scalable and explainable tool for geohazard prediction, improving risk mitigation strategies in data-scarce mountainous regions. Yet, few adjustments are needed before being considered for publication, and I would be ready to assess the revised version again after applying the following adjustments:

The manuscript is somewhat lengthy. Try reducing redundant debates and keep all ideas short and concise.

Line 65-66: You stated that “Concurrently, multi-attribute decision frameworks provided structured methodologies for integrating heterogeneous hazard factors” add a direct reference utilizing similar frameworks in risk assessment such as:

• Mohseni, U., Jat, P. K., & Siriteja, V. (2025). Multi-criteria analysis-based mapping of the cyclone-induced pluvial flooding in coastal areas of India. DYSONA-Applied Science, 6(2), 309-321. https://doi.org/10.30493/das.2025.490282

Line 91: black-box problem is not well defined, although it represents a corner stone to justify the utilization of explanatory techniques such as SHAP. Try explaining the theory behind it more.

Line 141-147: omit this section

The final part of the introduction should be reserved to summarize the motives of your research and state the aims in light of these motives.

Material and Methodology (and related sections) span 15 pages (7-22). This is an example of a section that can be reduced in size. I understand the inclusive nature of mathematical modelling that you have incorporated into this section; however, the extensive length might render it hard to follow.

Add more details on Fig 5. Such as input sizes, filter sizes, neuron numbers …etc. All the information that can assist rebuilding the (basic) model.

**********

what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy

Reviewer #1: No

Reviewer #3: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/ . PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org . Please note that Supporting Information files do not need this step.

PLoS One. 2025 Jun 24;20(6):e0326587. doi: 10.1371/journal.pone.0326587.r005

Author response to Decision Letter 2


19 May 2025

Response to Editor and Reviewers

Dear editors and reviewers,

Thank you for your letter and for the reviewers' comments concerning our manuscript entitled “An Integrated IKOA-CNN-BiGRU-Attention Framework with SHAP Explainability for High-Precision Debris Flow Hazard Prediction in the Nujiang River Basin, China” (ID: PONE-D-25-14786R1). Those comments are all valuable and very helpful for revising and improving our paper, as well as providing important guiding significance to our research. We have studied comments carefully and have made corrections which we hope meet with approval. We also highlighted the changes using red fonts in the attached Revised full manuscript file to make it easier for you to check our revision. The principal revisions made within this paper and responses to the reviewers' comments are delineated as follows.

Sincerely Yours,

Dr. Tianlong Wang

Response to Reviewer 1

(Original comments and queries are in blue, while our responses are in black)

[General Comment]

The author has revised the manuscript according to the comments. I have no further comments and recommend it for acceptance.

Response: We sincerely thank Reviewer 1 for taking the time to re-evaluate our manuscript and for acknowledging the improvements made in response to the previous round of comments.

Response to Reviewer 3

(Original comments and queries are in blue, while our responses are in black)

[General Comment] This study develops an interpretable deep learning model for predicting debris flow hazards in the Nujiang River Basin, China, leveraging advanced neural network architectures and optimization techniques. The manuscript is clearly an exceptional effort by the authors and the presented model provides a scalable and explainable tool for geohazard prediction, improving risk mitigation strategies in data-scarce mountainous regions. Yet, few adjustments are needed before being considered for publication, and I would be ready to assess the revised version again after applying the following adjustments:

Response:

We sincerely thank Reviewer 3 for the positive assessment of our manuscript and for recognizing the potential contribution of our work to improving geohazard prediction and risk mitigation in mountainous regions. We appreciate the constructive feedback and have carefully addressed each suggestion to enhance the clarity, conciseness, and technical depth of our manuscript.

[Comment 1] The manuscript is somewhat lengthy. Try reducing redundant debates and keep all ideas short and concise.

Response:

Thank you very much for pointing out this important issue. The reviewer’s suggestion regarding manuscript length and conciseness is greatly appreciated. In response to this valuable comment, the manuscript has been carefully revised with the aim of reducing redundancy and improving clarity and readability. Specifically, revisions have been implemented in several key sections as follows:

(1) In Section 1, theoretical background discussions, research motivations, and objectives have been streamlined. Redundant explanations and peripheral details have been removed, resulting in a more focused and concise presentation of the research context and aims.

(2) In Section 2, lengthy mathematical derivations and detailed algorithm descriptions have been condensed, retaining only essential equations and methodological steps required for reproducibility. This revision significantly reduces the complexity and length of the section.

(3) Section 4 has been revised to succinctly summarize data sources, data processing methods, and multicollinearity diagnostics. Unnecessary details have been removed, enhancing the clarity and conciseness of the section.

(4) Section 6 has been simplified by highlighting key findings and practical implications through concise descriptions and clear visualizations. Excessive analytical discussions have been appropriately condensed.

(5) Section 8 has been carefully edited to succinctly summarize the main contributions and implications of the study. Repetitive statements and non-essential content have been eliminated to ensure conciseness.

Through these targeted revisions, the manuscript length has been reduced from approximately 9,500 words to approximately 8,000 words (excluding references), thereby effectively addressing the reviewer’s valuable comment regarding the manuscript length. The authors sincerely appreciate the reviewer’s constructive recommendation, which has greatly improved the manuscript quality.

[Comment 2] Line 65-66: You stated that “Concurrently, multi-attribute decision frameworks provided structured methodologies for integrating heterogeneous hazard factors” add a direct reference utilizing similar frameworks in risk assessment such as:

• Mohseni, U., Jat, P. K., & Siriteja, V. (2025). Multi-criteria analysis-based mapping of the cyclone-induced pluvial flooding in coastal areas of India. DYSONA-Applied Science, 6(2), 309-321. https://doi.org/10.30493/das.2025.490282

Response:

We have incorporated the suggested citation to strengthen our discussion of multi-attribute decision frameworks in risk assessment. The revised sentence now reads (lines 65-69 in the revised manuscript): “Concurrently, multi-attribute decision frameworks provided structured methodologies for integrating heterogeneous hazard factors [8], with recent innovations applying Analytic Hierarchy Process techniques to susceptibility mapping across diverse geological contexts [9] and multi-criteria analysis approaches to flood risk assessment in coastal regions [10].”

References

8. Onaopemipo Akintola M. Enhancing disaster response and resilience through near-time GIS for flood monitoring and analysis in Niger river basin, nigeria. Int Arch Photogramm Remote Sens Spat Inf Sci. 2024;XLVIII-3–2024: 377–385. doi:10.5194/isprs-archives-XLVIII-3-2024-377-2024

9. Barman J, Das J. Assessing classification system for landslide susceptibility using frequency ratio, analytical hierarchical process and geospatial technology mapping in aizawl district, NE india. Adv Space Res. 2024;74: 1197–1224. doi:10.1016/j.asr.2024.05.007

10. Mohseni U, Jat PK, Siriteja V. Multi-criteria analysis-based mapping of the cyclone-induced pluvial flooding in coastal areas of India. DYSONA - Appl Sci. 2025;6: 309–321. doi:10.30493/das.2025.490282

[Comment 3] Line 91: black-box problem is not well defined, although it represents a corner stone to justify the utilization of explanatory techniques such as SHAP. Try explaining the theory behind it more.

Response:

The reviewer’s comment regarding the insufficient definition of the "black-box problem" is highly relevant. This concept indeed serves as a fundamental justification for employing SHAP-based interpretability approaches. In response, the explanation of the black-box problem has been substantially expanded in the revised manuscript (lines 83-103 in the revised manuscript) to provide a more comprehensive theoretical foundation. The revised text now reads:

“Despite their transformative potential, current ML applications to debris flow hazard assessment exhibit critical epistemological and methodological limitations that constrain their scientific impact and operational utility. First, model performance demonstrates an acute sensitivity to hyperparameter configurations, with suboptimal parameterization inducing overfitting phenomena that compromise generalization to novel geomorphological contexts[14]. Second, conventional optimization algorithms frequently converge prematurely to local extrema, resulting in suboptimal exploration of the high-dimensional parameter space[15]. Third, and most fundamentally, deep learning architectures manifest intrinsic opacity that creates a profound disconnect between predictive capacity and mechanistic understanding[16]. This explainability deficit, frequently characterized as the 'black-box problem,' represents a critical epistemological barrier in hazard forecasting applications. The black-box problem specifically refers to the inherent inability to trace how deep learning models transform input features into output predictions through their complex internal architectures. This opacity manifests through (1) algorithmic complexity, where numerous interconnected neurons and non-linear activation functions obscure input-output relationships; (2) latent feature representation, where models develop abstract internal representations that lack direct physical interpretation; and (3 stochastic learning behavior, where training procedures yield models whose internal configurations cannot be deterministically predicted. Contemporary neural networks, while achieving unprecedented predictive accuracy, typically obscure the contribution of specific variables to prediction outcomes, thereby inhibiting scientific elucidation of causative mechanisms and undermining the implementation of targeted mitigation strategies.”

This expanded explanation articulates the theoretical dimensions of the black-box problem more precisely, providing a clearer foundation for understanding why interpretability methods like SHAP are essential for translating high-performance prediction models into scientifically valuable and actionable insights in debris flow hazard assessment.

[Comment 4] Line 141-147: omit this section

Response:

The reviewer's suggestion to omit lines 141-147 has been implemented in the revised manuscript. This section, which previously detailed the manuscript structure, has been removed entirely. To maintain the valuable visual representation of the research framework, the reference to Fig 1 has been repositioned to the end of the paragraph discussing the methodological framework (line 135). The revised text now reads:

“To address these objectives, this study implements an IKOA-CNN-BiGRU-Attention framework that integrates physics-inspired optimization algorithms with deep learning architectures and explainability mechanisms. This framework combines convolutional networks for spatial feature extraction, bidirectional recurrent units for temporal dynamics modeling, and attention mechanisms for feature importance weighting, specifically designed to capture debris flow triggering mechanisms while providing SHAP-based scientific insights. The framework advances debris flow hazard prediction methodologically, theoretically, analytically, and practically, offering actionable insights for precision disaster risk management. The overall workflow of the study is illustrated in Fig 1.”

[Comment 5] The final part of the introduction should be reserved to summarize the motives of your research and state the aims in light of these motives.

Response:

This valuable suggestion has been implemented by restructuring the final portion of the introduction (lines 124-152) to create a more focused conclusion that explicitly connects research motives to study aims. The revised structure maintains the essential content while reorganizing paragraphs to achieve a more cohesive narrative flow. The restructured final section now reads:

“Despite their transformative potential, current ML applications to debris flow hazard assessment exhibit critical epistemological and methodological limitations that constrain their scientific impact and operational utility. First, model performance demonstrates an acute sensitivity to hyperparameter configurations, with suboptimal parameterization inducing overfitting phenomena that compromise generalization to novel geomorphological contexts[14]. Second, conventional optimization algorithms frequently converge prematurely to local extrema, resulting in suboptimal exploration of the high-dimensional parameter space[15]. Third, and most fundamentally, deep learning architectures manifest intrinsic opacity that creates a profound disconnect between predictive capacity and mechanistic understanding[16]. This explainability deficit, frequently characterized as the 'black-box problem,' represents a critical epistemological barrier in hazard forecasting applications. The black-box problem specifically refers to the inherent inability to trace how deep learning models transform input features into output predictions through their complex internal architectures. This opacity manifests through (1) algorithmic complexity, where numerous interconnected neurons and non-linear activation functions obscure input-output relationships; (2) latent feature representation, where models develop abstract internal representations that lack direct physical interpretation; and (3) stochastic learning behavior, where training procedures yield models whose internal configurations cannot be deterministically predicted. Contemporary neural networks, while achieving unprecedented predictive accuracy, typically obscure the contribution of specific variables to prediction outcomes, thereby inhibiting scientific elucidation of causative mechanisms and undermining the implementation of targeted mitigation strategies.

Three fundamental research gaps persist in contemporary literature that motivate this study: (1) the predominant prioritization of predictive accuracy over model explainability creates an artificial tension between computational performance and theoretical advancement; (2) current methodological frameworks overwhelmingly employ single-algorithm approaches that inadequately capture the complex spatiotemporal characteristics of debris flow triggering conditions; and (3) the systematic integration of advanced optimization techniques with explainable artificial intelligence frameworks remains largely unexplored in geohazard prediction contexts, particularly for multi-parameter debris flow risk assessment. Explainable Artificial Intelligence (XAI) methodologies, particularly the SHapley Additive exPlanations (SHAP) approach, offer promising solutions to these challenges by providing mathematically rigorous attribution mechanisms derived from cooperative game theory principles[17,18]. Despite preliminary applications identifying dominant controlling factors in debris flow research, comprehensive XAI integration within holistic debris flow assessment frameworks remains substantially underdeveloped[19].

Motivated by these research gaps and the potential of XAI approaches, this study aims to develop and validate an integrated deep learning framework that maximizes both predictive accuracy and scientific explainability for debris flow hazard assessment. The specific research objectives include: (1) identifying optimal architectural configurations for capturing complex spatiotemporal characteristics of debris flow triggering factors; (2) quantifying the relative contribution of environmental, geological, and meteorological variables; (3) establishing a theoretical framework connecting physical processes to prediction outcomes through explainable AI; and (4) determining quantitative thresholds governing debris flow risk to support evidence-based intervention strategies.

To address these objectives, this study implements an IKOA-CNN-BiGRU-Attention framework that integrates physics-inspired optimization algorithms with deep learning architectures and explainability mechanisms. This framework combines convolutional networks for spatial feature extraction, bidirectional recurrent units for temporal dynamics modeling, and attention mechanisms for feature importance weighting, specifically designed to capture debris flow triggering mechanisms while providing SHAP-based scientific insights. The framework advances debris flow hazard prediction methodologically, theoretically, analytically, and practically, offering actionable insights for precision disaster risk management. The overall workflow of the study is illustrated in Fig 1.” (lines 83-135)

This restructured conclusion more effectively summarizes the research motives and explicitly connects them to the study aims, creating a stronger narrative arc that guides readers from identified problems to proposed solutions, as suggested by the reviewer.

[Comment 6] Material and Methodology (and related sections) span 15 pages (7-22). This is an example of a section that can be reduced in size. I understand the inclusive nature of mathematical modelling that you have incorporated into this section; however, the extensive lengt

Decision Letter 2

Linwei Li

An Integrated IKOA-CNN-BiGRU-Attention Framework with SHAP Explainability for High-Precision Debris Flow Hazard Prediction in the Nujiang River Basin, China

PONE-D-25-14786R2

Dear Dr. Wang,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager®  and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Linwei Li

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

None

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions??>

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously? -->?>

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available??>

The PLOS Data policy

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English??>

Reviewer #3: Yes

**********

Reviewer #3: The authors have addressed all my comments. The manuscript is ready for publication

The authors have addressed all my comments. The manuscript is ready for publication

The authors have addressed all my comments. The manuscript is ready for publication

**********

what does this mean? ). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy

Reviewer #3: No

**********

Acceptance letter

Linwei Li

PONE-D-25-14786R2

PLOS ONE

Dear Dr. Wang,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Linwei Li

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0326587.s002.docx (22.7MB, docx)

    Data Availability Statement

    The complete dataset supporting all findings in this study is fully and openly accessible without any restrictions on Zenodo. All data is contained in a single downloadable Excel file (19.6 kB): Wang, T. (2025). Debris Flow Hazard-Causing Factors Dataset for the Yunnan Section of the Nujiang River Basin, China [Data set]. Zenodo. https://doi.org/10.5281/zenodo.15050116.


    Articles from PLOS One are provided here courtesy of PLOS

    RESOURCES