Skip to main content
Sensors (Basel, Switzerland) logoLink to Sensors (Basel, Switzerland)
. 2026 Jan 14;26(2):561. doi: 10.3390/s26020561

Unsupervised Learning-Based Anomaly Detection for Bridge Structural Health Monitoring: Identifying Deviations from Normal Structural Behaviour

Jabez Nesackon Abraham 1, Minh Q Tran 1, Jerusha Samuel Jayaraj 2, Jose C Matos 1, Maria Rosa Valluzzi 3, Son N Dang 1,*
Editors: Phong B Dao, Len Gelman, Liang Yu, Davide Astolfi
PMCID: PMC12846258  PMID: 41600355

Abstract

Structural Health Monitoring (SHM) of large-scale civil infrastructure is essential to ensure safety, minimise maintenance costs, and support informed decision-making. Unsupervised anomaly detection has emerged as a powerful tool for identifying deviations in structural behaviour without requiring labelled damage data. The study initially reproduces and implements a state-of-the-art methodology that combines local density estimation through the Cumulative Distance Participation Factor (CDPF) with Semi-parametric Extreme Value Theory (SEVT) for thresholding, which serves as an essential baseline reference for establishing normal structural behaviour and for benchmarking the performance of the proposed anomaly detection framework. Using modal frequencies extracted via Stochastic Subspace Identification from the Z24 bridge dataset, the baseline method effectively identifies structural anomalies caused by progressive damage scenarios. However, its performance is constrained when dealing with subtle or non-linear deviations. To address this limitation, we introduce an innovative ensemble anomaly detection framework that integrates two complementary unsupervised methods: Principal Component Analysis (PCA) and Autoencoder (AE) are dimensionality reduction methods used for anomaly detection. PCA captures linear patterns using variance, while AE learns non-linear representations through data reconstruction. By leveraging the strengths of these techniques, the ensemble achieves improved sensitivity, reliability, and interpretability in anomaly detection. A comprehensive comparison with the baseline approach demonstrates that the proposed ensemble not only captures anomalies more reliably but also provides improved stability to environmental and operational variability. These findings highlight the potential of ensemble-based unsupervised methods for advancing SHM practices.

Keywords: SHM, unsupervised learning, ensemble fusion, principal component analysis, autoencoder, anomaly detection, adaptive weighting, Z24 dataset

1. Introduction

The assessment of civil infrastructure, particularly bridges, has become increasingly vital as the world’s structural assets age and traffic demands intensify. Traditional methods, based on visual inspection, enable the assessment of the external condition of structures [1]. However, assessments of potential damage within structures are often difficult to detect [2]. In addition, traditional solutions such as visual inspection are time-consuming and resource intensive. This calls for the development of highly reliable structural health monitoring (SHM) systems. With the development of computer science and ultra-sensitive sensors, SHM systems now allow for automated, data-driven anomaly detection. SHM systems use continuous data streams from vibration or deformation sensors and analyse the responses to monitor structural health [3]. By continuously tracking the response of structures, SHM enables early detection of damage, thereby reducing maintenance costs and preventing catastrophic failures [4,5]. Over the past two decades, numerous data-driven approaches have been proposed to automate damage detection, ranging from supervised learning with labelled damage data to fully unsupervised anomaly detection frameworks [6].

However, in most real-world SHM scenarios, labelled damage data are extremely scarce or unavailable [7,8]. Structures typically remain in healthy conditions for long periods, and controlled damage experiments are costly and often infeasible. As a result, traditional supervised classifiers cannot be reliably trained [9,10]. Recent research has therefore shifted toward unsupervised anomaly detection. The requirement is only undamaged-state data for model training. In parallel, physics-based and hybrid approaches have also gained attention, integrating first-principles models and uncertainty quantification to improve damage diagnosis. For example, transmissibility-based methods combined with probabilistic distance measures have been shown to capture structural changes while accounting for model uncertainty, offering a complementary perspective to data-driven anomaly detection approaches [11].

Over the past two decades, SHM research has advanced from classical damage detection techniques to modern machine-learning and physics-informed paradigms [7,10,12] While supervised learning and transfer learning can achieve strong performance when representative labelled damage data exist [8], real-world bridge monitoring rarely provides such data: structures typically remain healthy for long periods, and controlled damage experiments are costly and often infeasible. This practical constraint has shifted increasing attention toward unsupervised anomaly detection, where models are trained using only undamaged-state data and anomalies are identified as departures from the learned normal behaviour [9,10]. In parallel, probabilistic and Bayesian perspectives have gained prominence for handling uncertainty in long-term monitoring. Bayesian approaches have been utilized for condition assessment in field applications [13,14], and recent work in modal identification has demonstrated that Bayesian spectral decomposition can estimate modal parameters with quantified uncertainty under ambient vibration, thereby improving robustness to noise and modelling errors [15]. These trends underscore the central challenge of long-term SHM: reliable anomaly detection under environmental and operational variability, with limited or no labelled damage data.

A substantial body of unsupervised SHM research relies on distance- and statistics-based deviations. Mahalanobis-distance monitoring, including ensemble formulations, has been used for anomaly detection and damage localisation under non-stationary signals [16,17]. Other pattern-recognition approaches include divergence-based novelty detection [18], dynamic time warping for ambient and non-stationary responses [19], and distance measures such as optimal sub-pattern assignment for substructural diagnosis [20]. Dimensionality-reduction and multivariate monitoring methods have also been used to create compact health indicators [21,22], while clustering-based outlier detection has been applied to long-term bridge monitoring, including spectral clustering and clustering-driven threshold estimation under environmental variability [23,24,25,26]. These approaches are often interpretable and computationally efficient, yet they can be sensitive to non-stationary behaviour and non-Gaussian distributions induced by varying temperature, traffic, and operational conditions [13,14,17,18].

To mitigate environmental variability and improve robustness, linear subspace methods—most notably PCA and its variants—have been widely adopted to characterise healthy behaviour and separate dominant environmental trends from damage-sensitive deviations [27,28]. Temperature-driven MPCA has been proposed to address thermal effects [29], PCA combined with time-series modelling has supported long-term monitoring of large structures [30], and moving-PCA strategies have been reported for highway bridges operating under changing conditions [31]. However, PCA-based models are inherently limited in capturing nonlinear interactions and manifold distortions frequently observed in real structures. This limitation has encouraged reconstruction-based learning, especially autoencoders, which can learn nonlinear representations of healthy dynamics and flag anomalies through reconstruction errors [32,33,34,35,36,37,38]. Comparative investigations have demonstrated the potential of autoencoders for structural damage identification [39], and their performance has been shown to depend on monitoring configurations such as measurement intervals in uncontrolled SHM [40,41]. For the Z24 bridge, stacked autoencoders have been employed to extract damage-sensitive features for unsupervised detection, typically relying on a single reconstruction error indicator [42]. Nevertheless, autoencoders may overlook anomalies that remain close to the learned manifold, or that induce only subtle reconstruction deviations under certain damage–environment interactions [39,41]. Recent hybrid efforts suggest combining linear and nonlinear ideas, for example, by enhancing PCA with deep learning components to better represent nonlinear residual behaviour [43] or by using variational autoencoders for representation learning in SHM contexts [44]. Yet such approaches commonly treat linear and nonlinear components sequentially or as enhancements within a single model family, rather than integrating complementary linear and nonlinear detectors within a unified unsupervised ensemble explicitly designed for robust long-term anomaly detection [43].

Ensemble-based anomaly detection has been explored as a route to improved reliability, but practical SHM ensembles often rely on static fusion rules because labelled damage data are unavailable to train complex fusion layers. Distance-metric ensembles have been reported for SHM monitoring under varying conditions [16], and the broader data-fusion literature in SHM documents a wide range of fusion strategies and their trade-offs [45]. While more dynamic directions—such as reinforcement-learning-driven ensembles for SHM [46] and unsupervised ensembles of streaming anomaly detectors [47] are emerging, there remains a gap in principled, performance-driven weighting that can be calibrated using only undamaged-state data and that adapts detector contributions according to their stability under environmental and operational variability. Threshold selection represents another persistent limitation. EVT provides a statistically grounded basis for modelling tail behaviour in streaming anomaly detection [48], and EVT-type strategies have been applied to online anomaly detection in long-term SHM [49]. However, existing EVT-based SHM implementations largely focus on modeling the tail of a single anomaly indicator and do not explicitly account for the heterogeneous, non-Gaussian tail behavior that can arise when multiple heterogeneous anomaly scores are fused. This limits the direct applicability of single-detector EVT thresholding to ensemble anomaly detection in SHM [50,51].

Motivated by these gaps, this study proposes an ensemble-based unsupervised anomaly detection framework for modal-feature-based SHM that integrates complementary linear and nonlinear perspectives and provides statistically principled decision-making under long-term variability. The framework combines PCA-based subspace monitoring and autoencoder-based nonlinear reconstruction and fuses their heterogeneous anomaly scores through an adaptive weighting mechanism calibrated from undamaged-state validation statistics, avoiding heuristic or fixed fusion rules. To enable robust alarm decisions, a Weighted Semi-parametric Extreme Value Theory (SEVT) thresholding strategy is introduced to explicitly model the tail behaviour of the fused anomaly score distribution, addressing heterogeneity induced by score fusion beyond conventional single-indicator EVT formulations. The approach is validated on the Z24 bridge dataset [52], where long-term monitoring data exhibit pronounced environmental influences and documented damage events [53]. The main contributions are:

  • (i)

    A unified unsupervised ensemble that jointly leverages PCA and autoencoders to exploit complementary linear and nonlinear sensitivities [31,39,42].

  • (ii)

    An adaptive, performance-based fusion mechanism that adjusts detector contributions using healthy-state stability without requiring labelled damage data [45,46,47].

  • (iii)

    A weighted SEVT thresholding technique that models the heterogeneous tail behaviour of fused anomaly scores to provide statistically rigorous, environment-adaptive thresholds [12,48,49,54].

The remainder of this paper is organised as follows. Section 2 presents the proposed methodology, which includes feature extraction, ensemble design, and threshold modelling. Section 3 describes the experimental setup and results. Section 4 discusses findings and implications, and Section 5 concludes the paper.

2. Methodology

2.1. Overview of the Architectural Flow

The proposed methodology, as depicted in Figure 1, begins with the acquisition of raw vibration data from the well-known Z24 Bridge, a benchmark structure extensively used in structural health monitoring studies. The dataset comprises synchronised acceleration measurements obtained from eight strategically placed sensors that continuously capture the bridge’s dynamic response to varying environmental and operational loads. Prior to analysis, the raw acceleration signals undergo pre-processing to eliminate low-frequency drift, electrical noise, and channel misalignments, ensuring signal integrity. The cleaned signals are then divided into fixed-length segments, each representing a quasi-stationary state of the structure. This segmentation facilitates the temporal tracking of dynamic behaviour across multiple loading and environmental conditions, thereby enabling a detailed assessment of structural evolution and potential damage progression over time.

Figure 1.

Figure 1

Comparative flow of the baseline CDPF–SEVT and the proposed PCA–AE Ensemble Fusion Framework.

2.2. Baseline: CDPF with SEVT Thresholding

The baseline CDPF-SEVT anomaly detection framework is formulated to quantify deviations in the dynamic response of structural systems by integrating three complementary stages: Stochastic Subspace Identification (SSI) for modal feature extraction, CDPF for deviation scoring, and SEVT for adaptive threshold calibration. This combination yields a fully unsupervised, interpretable pipeline that forms a statistically rigorous baseline for evaluating advanced non-linear and data-fusion-based damage detection models.

The CDPF component quantifies each segment’s deviation from the nominal (undamaged) structural manifold in the transformed feature space. Despite its conceptual simplicity, the CDPF score effectively captures macro-level shifts in modal energy distributions caused by physical deterioration, material degradation, or structural boundary variations [34]. To separate normal from anomalous states, an adaptive statistical threshold is established using the Peaks-Over-Threshold (POT) formulation of the Generalised Pareto Distribution (GPD) [26]. The CDPF–SEVT baseline, while statistically principled and computationally efficient, exhibits few intrinsic limitations that constrain its applicability to complex structural behaviours. Its reliance on linear SSI-derived modal features limits its sensitivity to non-linear phenomena, such as modal coupling or frequency bifurcation, which are often observed in advanced stages of structural degradation. The assumption of signal stationarity during Power Spectral Density (PSD) estimation further limits consistency under varying environmental or operational conditions, where time-varying excitations can distort modal estimates. Additionally, averaging spectral information across channels suppresses spatial mode-shape variations, masking localised damage signatures.

These limitations collectively motivate the development of a more expressive ensemble fusion framework that integrates complementary representations to overcome the inherent linearity and stationarity constraints of the baseline CDPF-SEVT. This ensemble fusion enables the joint capture of both linear and non-linear feature correlations, enhances sensitivity to subtle structural deviations, and provides greater resilience against noise and environmental variability, thereby extending the applicability of anomaly detection to more complex and heterogeneous structural health monitoring scenarios.

2.3. Ensemble Fusion Framework

The proposed Ensemble Fusion (PCA–AE) Framework, shown in Figure 1, constitutes an advanced hybrid architecture for vibration-based structural health monitoring. Its purpose is to identify abnormal structural states, such as stiffness loss, joint degradation, or boundary loosening, by integrating linear and non-linear feature learning mechanisms. To address this challenge, the Ensemble Fusion Framework introduces a multi-stage data-driven anomaly detection pipeline that synergistically combines multi-domain feature extraction, dual-path modelling (PCA and AE), and adaptive statistical thresholding within a unified learning paradigm. The approach extends the baseline CDPF with SEVT thresholding by introducing a more expressive representation and leveraging non-linear manifold learning to uncover subtle, higher-order deviations in structural dynamics that linear models often fail to capture. The first stage of the proposed framework, Modal Feature Extraction, focuses on deriving vibration-based modal parameters using the SSI method. For each sensor channel, the PSD is estimated using Welch’s averaging technique [55], from which the dominant modal peaks are identified to form a modal frequency vector that characterises the resonant dynamics of the structure. In addition to modal parameters, the framework extracts a diverse set of enriched features to capture both global and localised vibration behaviour. Frequency-domain energies, obtained by integrating spectral power over predefined frequency bands, provide robustness against environmental and operational variability. Furthermore, cross-channel covariance features, derived from the covariance matrix and its eigenvalues, represent correlated vibration patterns across multiple sensors, reflecting spatial coherence in the structural response.

The second stage, feature normalisation, is a critical pre-processing step that ensures numerical stability, efficient convergence, and balanced feature contributions across both linear and non-linear modelling branches within the PCA–AE Framework. PCA, relying on covariance decomposition, and AE, leveraging gradient-based optimisation, a dual normalisation strategy is employed to accommodate their specific sensitivities. For the PCA branch, standardisation is applied to each feature to transform it into a zero-mean, unit-variance representation. For the non-linear AE branch, robust scaling or min–max normalisation [56] is adopted, depending on the presence of outliers and noise amplitude in the vibration signals. This normalisation enhances the gradient stability of the AE’s activation functions and ensures that all input features contribute proportionally during reconstruction error minimisation. PCA identifies the dominant axes of variance that represent the healthy-state vibration dynamics of the structure through SPE anomaly indicators. The corresponding SPE index [57] is computed as

SPCA=eTe (1)

where e=xx^ is difference between data x and predicted data x^. A high SPE indicates structural anomalies, such as stiffness degradation, joint loosening, or changes in boundary conditions. The AE consists of an encoder–decoder pair. Once trained, the reconstruction error [58] for each new observation serves as its anomaly score (AE score), expressed as

SAE=||xx^||22 (2)

The mean squared error between the data x and the reconstructed data x^ Which is created by AE, gives the reconstruction error [58].

The third stage, combined with the normalised PCA and AE, provides a more comprehensive diagnostic perspective, enabling the framework to detect both subtle stiffness degradation and complex non-linear behaviours with enhanced reliability and precision across varying operational and environmental conditions. The overall fusion score is then computed as

Sfusion=wPCASPCAnorm+wAESAEnorm (3)

Subject to the constraint that the summation of weighted coefficients (wPCA,wAE) is equal to one. The weighting coefficients are adaptively determined through performance-based weighting; the weights are determined based on validation-derived F1-scores [59], following wPCA F1PCA  and wAEF1AE.

In the final stage of the ensemble fusion framework, the fused anomaly scores are analysed using an EVT–based approach to define a statistically reliable threshold [34] for damage detection. The parametric EVT approach, specifically the GPD, is tailored to model the statistical behaviour of rare, extreme deviations. In anomaly detection tasks, these rare deviations correspond to structural damage or abnormal behaviour, which manifest as tail events in the feature-score distribution. Unlike conventional methods that assume normality, EVT does not require the entire data set to be Gaussian; instead, it models tail exceedances, making it more appropriate for skewed, heavy-tailed, or otherwise non-Gaussian score distributions. The threshold is expressed as

Tfusion=u+σξ1p*ξ1 (4)

where u is the threshold onset, σ is the scale parameter, ξ is the shape parameter, and p* Represents the chosen exceedance probability. This threshold effectively separates normal and abnormal structural states based on statistical tail behaviour. To improve durability under noisy conditions and environmental variability, a Bayesian smoothing step further refines Tfusion, producing a stable and adaptive decision boundary. Samples with fused scores exceeding this smoothed threshold are labelled as damaged, ensuring reliable and data-driven structural anomaly detection.

Algorithm 1 explains the AE–PCA Ensemble Fusion Anomaly Detection procedure used in this framework, illustrating how PCA and AE models are combined to capture both linear and non-linear structural behaviours. The performance of the proposed framework was rigorously evaluated under diverse operational and environmental conditions and compared with the CDPF-SEVT baseline using standard metrics, including Area Under the Curve (AUC), precision, recall, and F1 score.

Algorithm 1. AE–PCA Ensemble Fusion Anomaly Detection
Input: Vibration segments {X_i}, i = 1, ..., N
Output: Segment labels: Normal/Damaged
  • 1.

    Feature Extraction

    For each segment X_i:

      f_i = [modal_features, time_features, frequency_features, covariance_features]

  • 2.

    Feature Scaling

    f_i_scaled = Standardize(f_i)

    f_i_PCA = PCA(f_i_scaled)

    f_i_AE = AE(f_i_scaled)

  • 3.

    Model Training

    Train PCA_model on undamaged f_i_PCA

    Train AE_model on undamaged f_i_AE

  • 4.

    Compute Anomaly Scores

    For test segment f_i_test:

      S_PCA = || f_i_test − PCA_model(f_i_test) ||_2

      S_AE = || f_i_test − AE_model(f_i_test) ||_2

  • 5.

    Normalisation and Fusion Score

      S_PCA_norm = (S_PCA − μ_PCA)/σ_PCA

      S_AE_norm = (S_AE − μ_AE)/σ_AE

      S_fusion = α × S_PCA_norm + (1 − α) × S_AE_norm, 0 ≤ α ≤ 1

  • 6.

    Threshold Estimation

      Fit GPD on undamaged S_fusion

      Determine smooth threshold T_smooth

  • 7.

    Segment Classification

      If S_fusion > T_smooth → Label = Damaged

      Else → Label = Normal

In summary, the Ensemble Fusion Framework significantly improves damage sensitivity, minimises false alarms, and ensures reliable generalisation across varying conditions. Its reliability and adaptability make it particularly well-suited for long-term monitoring of large-scale infrastructures, where damage often evolves gradually through a combination of linear and non-linear dynamic responses.

3. Case Study

3.1. Z24 Bridge

The Z24 Bridge, an exemplary post-tensioned concrete box-girder structure, was built in Switzerland between 1961 and 1963. It was a significant engineering achievement of its time, characterised by its robust design and distinctive features. Illustrated in Figure 2, the bridge’s longitudinal section, main dimensions, top view, and pier cross-section provide a comprehensive overview of its structural layout. Despite being in relatively sound condition, the Z24 Bridge was ultimately demolished to pave the way for a new bridge designed to accommodate a larger side span, showcasing advancements in modern engineering approaches. Before the bridge was fully decommissioned, an extensive SHM program was initiated. This program aimed to assess the impact of environmental variability on the bridge by collecting detailed data, including acceleration time histories from strategically placed sensors. To obtain the acceleration time histories, sixteen accelerometers were attached to the bridge deck and the concrete column, and the location of these accelerometers is shown in Figure 2b,c. The sensor numbers struck through in Figure 2b,c failed during the operational period.

Figure 2.

Figure 2

The Z24 Bridge: (a) the longitudinal section, (b) the top view, (c) the pier and girder cross-view.

Understanding that environmental factors significantly affect structural dynamics, the SHM program employed five types of sensors to monitor critical variables. These included ambient temperature variations, air humidity levels, the occurrence of rainfall (true or false), wind speed, and wind direction. Among these, the influence of temperature was particularly emphasised, as it plays a crucial role in the structural behaviour of civil engineering projects. To accurately capture thermal data, temperature sensors were installed at eight specific points along the girder, located at the midpoint of the bridge’s three spans. Additionally, soil temperature measurements were taken adjacent to each concrete column, as well as at the north, central, and south sections of the intermediate piers, bringing the total number of sensors to 12.

In a progressive effort to simulate potential damage scenarios, the research team implemented controlled experiments on the bridge over the month preceding its demolition. These realistic scenarios were designed based on a series of progressive damage tests that included various structural impairments. Among these were the deliberate lowering and lifting of pier sections, intentional tilting of the foundation, and tests simulating concrete spalling at the soffit. Other scenarios involved induced abutment landslides, failures at concrete hinges, ruptures of anchor heads, and tendon failures. These tests were invaluable for understanding the bridge’s behaviour under stress and evaluating its structural integrity. Further details regarding the Z24 Bridge and the specific progressive damage scenarios can be referenced in [53], contained in the second panel.

Figure 3 illustrates the information gathered during a sub-measurement. The acceleration values at the designated measurement points were recorded continuously over time at a sampling frequency of 655 Hz (which equates to 655 samples per second). Through the application of operational modal analysis, specifically employing SSI, the modal frequencies of the bridge were identified across four modes for normal conditions and damaged conditions. Table 1 presents the modal frequencies corresponding to these modes, with a frequency difference ranging from 14 to 18 for the normal condition and 9 to 14 for the damaged condition.

Figure 3.

Figure 3

Dynamic response of the sensor in the Z24 bridge: (a) normal condition and (b) damaged condition.

Table 1.

Modal frequencies of the Z24 bridge based on OMA for normal conditions and damaged conditions.

Mode Success Rate (%) Eigenfrequency (Hz)
Normal Condition (Hz) Damaged Condition (Hz) Max. Difference (%)
Min Max Min Max Normal Condition Damaged Condition
1 98 3.81 4.38 3.75 4.12 14 9
2 93 4.98 5.89 4.51 5.18 18 14
3 96 9.60 11.20 9.41 10.39 16 10
4 77 10.24 12.09 9.81 10.98 17 12

The dataset used in this study consists of multichannel vibration measurements from the Z24 Bridge benchmark. For each monitoring timestamp, one .aaa ASCII file is provided per accelerometer channel, each containing 65,536 samples recorded at 100 Hz. Eight accelerometer channels were selected based on data completeness and their relevance to the global structural response. A segment is defined as a complete measurement instance in which all eight corresponding .aaa files are available. These files were time-aligned and combined into a synchronised multivariate matrix of size 65,536 × 8 (samples × channels). Across the available monitoring records, 48 segments representing the undamaged state and 48 segments representing the damaged state satisfied the completeness requirements and were therefore included in the analysis. These 96 multichannel segments serve as the basis for feature extraction, model training, and damage detection.

The undamaged segments were primarily used to train the baseline models and to establish feature normalisation for anomaly detection. For the proposed ensemble fusion model integrating PCA and AE features, a small fraction (≈5%) of damaged segments was included in the training set. This step, referred to as the semi-supervised phase, is an experimental procedure designed to enhance the model’s ability to generalise across the feature space and to stabilise the fusion of PCA and AE anomaly scores. Specifically, including only a very limited number of damaged examples allows the model to adapt slightly to feature variations near the decision boundary without significantly biasing the learned representation of normal, undamaged behaviour. This semi-supervised training approach is strictly for controlled experimental evaluation. In real-world SHM applications, it is generally not feasible to include damaged data during training, as anomalies are unknown a priori. The ensemble model is therefore fully compatible with operational deployment, where it would be trained exclusively on undamaged data and detect anomalies solely by deviations from learned normal patterns.

3.2. AUC Analysis

Figure 4 presents the Receiver Operating Characteristic (ROC) curves for the baseline CDPF–SEVT, AE–EVT, and the PCA–AE models. The ROC curve illustrates the trade-off between the True Positive Rate (TPR) and the False Positive Rate (FPR) at varying classification thresholds, providing a comprehensive view of each model’s detection sensitivity and false-alarm tendency. The AUC quantifies this discriminative capability, where higher AUC values correspond to improved distinction between damaged and undamaged structural conditions.

Figure 4.

Figure 4

ROC curve comparison between baseline CDPF-SEVT, AE-EVT, and PCA-AE.

From the comparison, the baseline CDPF–SEVT model yields an AUC of 0.938, reflecting competent detection of linear modal shifts. The AE–EVT model exhibits comparable performance, leveraging non-linear feature representation to capture more complex response patterns. However, the PCA–AE framework attains the highest AUC of 0.956, signifying superior discriminative performance. This improvement results from the adaptive integration of PCA’s global variance sensitivity and the AE’s non-linear reconstruction capacity. The smooth and monotonic progression of the ROC curves indicates stable calibration and a well-balanced threshold response.

3.3. Classification Performance

The comparative performance metrics shown in Figure 5 illustrate Precision, Recall, F1-Score, and AUC for the baseline CDPF-SEVT and the PCA–AE models. These indices collectively assess the reliability, sensitivity, and overall classification accuracy, with classification used to evaluate detection performance, not the training procedure of the structural anomaly detection framework. The baseline CDPF-SEVT demonstrates balanced yet moderate performance (Precision = 0.93, Recall = 0.85, F1 = 0.89, AUC = 0.94), effectively capturing linear deviations but remaining limited in addressing non-linear response behaviours.

Figure 5.

Figure 5

Performance metrics comparison between baseline CDPF and ensemble fusion PCA-AE.

In contrast, the PCA–AE achieves consistently superior metrics (Precision = 0.95, Recall = 0.88, F1 = 0.91, AUC = 0.96), reflecting a synergistic balance between linear and non-linear feature spaces. Compared to the baseline CDPF-SEVT, the fusion approach enhances the F1-score by approximately 2.2%, indicating more reliable classification under varying structural and environmental conditions. Similarly, the AUC improvement of around 1.8% highlights the fusion model’s superior discrimination capability between healthy and damaged states. This improvement arises from the complementary fusion of PCA’s capacity to capture global modal shifts and the AE’s strength in modelling non-linear distortions. Such fusion-based approaches have been validated in recent literature, showing consistent gains in generalisation and reduced false-alarm rates for vibration-based SHM systems [60].

4. Results and Discussion

The discrimination performance between undamaged and damaged structural states, along with the adaptive weighting behaviour of the ensemble model, provides deeper insight into the dependability and interpretability of the proposed framework. The threshold-based score distributions clearly depict how the baseline CDPF-SEVT and PCA-AE models differentiate structural health conditions based on their respective anomaly indices. This complementary weighting mechanism ensures that the fusion process remains both sensitive to damage initiation and robust to environmental noise, thereby enhancing the overall reliability and interpretability of the anomaly detection framework. This adaptability is particularly important for long-term SHM, where environmental variability can easily mask damage-induced changes.

4.1. Threshold-Guided Structural State Classification

The discrimination between undamaged and damaged structural states is achieved through a statistically derived threshold-based anomaly detection framework. In this study, each model, the baseline CDPF-SEVT and the PCA–AE model, produces a continuous anomaly score that quantifies the deviation of current structural responses from their healthy-state characteristics. The threshold is parametrically determined using EVT by fitting a GPD to the tail of the undamaged score distribution, allowing adaptive decision boundaries that minimise false alarms from environmental or operational variability. For the baseline model, the anomaly scores are derived from the statistical energy of the reconstructed state-space residuals, effectively capturing the linear modal deviations. The Ensemble Fusion framework combines these linear and non-linear perspectives by integrating normalised PCA-based SPE scores and AE-based reconstruction errors through an adaptive weighted fusion mechanism.

As illustrated in Figure 6, each method’s threshold distinctly separates the undamaged and damaged data clusters. The baseline CDPF-SEVT model shown in Figure 6a demonstrates moderate separation with some overlap near the threshold boundary, indicating limited sensitivity to non-linear distortions. However, the PCA–AE model produces the most distinct separation between classes, as shown in Figure 6b, with minimal overlap and well-defined threshold boundaries. It indicates enhanced sensitivity and durability, effectively capturing both global linear patterns and local non-linear irregularities in the structural response.

Figure 6.

Figure 6

Anomaly detection based on threshold: (a) baseline CDPF-SEVT method, (b) ensemble fusion.

The clear separation observed between the undamaged and damaged clusters reflects the ensemble model’s superior discriminative capability and its potential for real-time SHM applications. By adaptively adjusting the threshold based on score distribution tails, the framework ensures reliable decision-making even under variable operating conditions.

4.2. Weighted Model Contribution Analysis

The weighted score contribution analysis elucidates the relative influence of the PCA and AE components within the ensemble fusion framework. Since the proposed model integrates both linear and non-linear damage-sensitive features, an adaptive weighting mechanism is implemented to optimise their combined discriminative power. The normalised anomaly scores from PCA based on SPE and AE based on reconstruction error are fused to produce a unified damage index. Figure 7 and Figure 8 illustrate the normalised, weighted anomaly scores assigned by each model across all samples. Each plot represents the sample-wise influence of a specific model on the final ensemble decision.

Figure 7.

Figure 7

Stacked contribution area of PCA and AE for ensemble fusion.

Figure 8.

Figure 8

Weighted score contribution: PCA vs. AE.

The PCA Contribution plot, Figure 7, depicts the localised impact based on SPE. Peaks in the plot indicate samples where the PCA assigned comparatively higher anomaly scores, implying that the density structure of those observations deviates significantly from the global distribution. These instances correspond to periods where the structural behaviour becomes distinctly sparse or isolated in the feature space conditions often associated with abrupt or non-stationary responses. The AE Weighted Contribution Figure 8 illustrates the reconstruction-based anomaly scores, scaled according to the adaptive ensemble weight. Elevated values correspond to samples exhibiting dominant during subtle, non-linear anomalies.

Figure 8 illustrates the adaptive weighting contributions of the PCA and AE components within the Ensemble Fusion framework. The weighting mechanism governs the relative influence of each subsystem in the final fused score, ensuring that both linear and non-linear structural characteristics are appropriately represented. The weights are adaptively computed based on the statistical variance of the normalised anomaly scores, thereby emphasising the model that provides greater discriminatory information in each condition.

The presented results show that the PCA and AE modules contribute nearly equal weighting factors (≈0.5 each), indicating a balanced fusion between the two modalities. This equilibrium suggests that both global linear modal variations, captured by PCA, and localised non-linear distortions, captured by AE, play complementary roles in characterising structural behaviour. The adaptive weighting prevents dominance by either subsystem, thereby enhancing generalisation and mitigating overfitting. This balanced fusion mechanism aligns with recent advancements in hybrid anomaly detection frameworks, where score-level fusion and adaptive weighting techniques enhance the interpretability and reliability of SHM models under variable environmental and operational conditions.

4.3. Environmental and Operational Variability Analysis

Although long-term continuous monitoring data spanning multiple seasons would provide the most rigorous assessment of environmental and operational effects, such datasets with well-documented structural states are not always available in benchmark studies. In the context of unsupervised SHM, a widely adopted and practically meaningful alternative is to evaluate false-alarm behaviour using undamaged monitoring data. Any detected anomaly can be attributed to environmental or operational variability rather than structural damage. Following this rationale, the present study assesses the effectiveness of the proposed ensemble framework by analysing the distribution of anomaly scores and the false-alarm rate on undamaged segments of the Z24 dataset. This approach enables the quantification of the ensemble fusion mechanism’s impact on spurious detections. Indirect yet informative evidence of stability under the environmental and operational variability represented in the available data can thus be provided.

The undamaged segments of the Z24 dataset were used to analyse and evaluate the effectiveness of the proposed ensemble fusion method under environmental and operational variability. The Empirical Cumulative Distribution Function (ECDF) of normalised anomaly scores (Figure 9) shows that the Ensemble Fusion method reduces the occurrence of extreme values compared to the baseline CDPF-SEVT method, indicating fewer spurious anomalies. The corresponding False Alarm Rate (FAR) analysis (Figure 10) confirms that Fusion maintains a low and stable FAR (~4%), whereas CDPF exhibits a higher and more variable FAR (~10%). These findings demonstrate that, within the environmental and operational variability represented in the dataset, the ensemble approach provides improved stability and reliability in anomaly detection.

Figure 9.

Figure 9

Empirical Cumulative Distribution Function of normalised anomaly scores for CDPF-SEVT and Ensemble Fusion methods.

Figure 10.

Figure 10

False Alarm Rate comparison between CDPF-SEVT and Ensemble Fusion methods.

A key limitation of the current study is the restricted environmental and operational variability represented in the Z24 dataset. The 48 undamaged and 48 damaged segments were collected over a relatively short monitoring interval, during which ambient temperatures varied only mildly, and no significant changes in operational loading (e.g., traffic intensity) were recorded. Consequently, frequency shifts due to temperature or operational effects were modest, and the dataset does not capture extreme environmental conditions such as seasonal freezing or high-traffic events. While the selected segments provide a well-defined benchmark with documented structural interventions, enabling methodological validation, future work will focus on evaluating the proposed ensemble framework on long-term continuous monitoring datasets that capture extreme conditions, including winter freezing periods, high-traffic events, and seasonal variations. This will enable a comprehensive assessment of false-alarm behaviour and robustness in real-world SHM scenarios, providing stronger validation for practical deployment.

5. Conclusions

This research presented an ensemble-based unsupervised anomaly detection framework for vibration-based Structural Health Monitoring (SHM), integrating linear and non-linear feature learning through Principal Component Analysis (PCA) and Autoencoder (AE) models and validated on a subset of the Z24 bridge benchmark dataset. Existing SHM approaches typically employ linear subspace models or nonlinear reconstruction-based techniques in isolation, relying on fixed ensemble fusion and thresholding strategies. In contrast, the proposed framework integrates complementary linear and nonlinear representations within a unified unsupervised setting. It further incorporates adaptive ensemble weighting and statistically principled threshold modelling.

The principal conclusions and quantitative findings are summarised as follows:

  • The proposed PCA–AE ensemble fusion framework achieved superior detection performance with Precision = 0.95, Recall = 0.88, F1-score = 0.91, and AUC ≈ 0.956, demonstrating improved discrimination between undamaged and damaged structural states.

  • Compared with the baseline CDPF–SEVT method (Precision = 0.93, Recall = 0.85, F1-score = 0.89, AUC = 0.94), the ensemble approach improved the F1-score by approximately 2.2% and the AUC by approximately 1.8%, indicating enhanced robustness and classification reliability.

  • The adaptive score fusion strategy produced nearly balanced contributions from PCA and AE components (weights ≈ 0.5 each), confirming the complementary role of linear modal variation detection and non-linear response modelling in characterising structural behaviour.

  • The EVT-based thresholding scheme enabled statistically principled decision boundaries without assuming score normality, resulting in improved stability of anomaly detection under the environmental and operational variability present in the dataset.

  • False-alarm analysis on undamaged data used to analyse and evaluate the effectiveness of the proposed ensemble fusion method under environmental and operational variability demonstrated that the ensemble framework maintained a low and stable FAR of approximately 4%, compared to about 10% for the baseline CDPF–SEVT method, highlighting its improved resistance to spurious detections.

The results confirm that the proposed Ensemble Fusion PCA–AE framework offers enhanced accuracy, stability, and interpretability compared to conventional single-model approaches. While the present validation is limited by the restricted environmental and operational variability and the reduced number of segments available in the Z24 dataset, future work will focus on extending the framework to long-term continuous monitoring data, incorporating deeper ensemble architectures, and developing real-time adaptive weighting strategies to support robust deployment in real-world SHM applications.

Acknowledgments

This work is financed by national funds through FCT—Foundation for Science and Technology, under grant agreement PRT/BD/154386/2023 attributed to the 1st author. Jabez Nesackon Abraham acknowledges the support of the doctoral grant reference PRT/BD/154386/2023, financed by the Portuguese Foundation for Science and Technology (FCT). under the MIT Portugal Program (2022 MPP2030-FCT). This work was also supported by FCT/MCTES under the R&D Unit Institute for Sustainability and Innovation in Structural Engineering (ISISE), under reference UID/4029/2025, and under the Associate Laboratory Advanced Production and Intelligent Systems ARISE under reference LA/P/0112/2020.

Author Contributions

Conceptualisation, J.N.A., M.Q.T., J.S.J., J.C.M., M.R.V. and S.N.D.; methodology, J.N.A., M.Q.T., J.S.J., J.C.M. and S.N.D.; software, J.N.A., M.Q.T. and J.S.J.; validation, J.N.A., J.C.M., M.R.V. and S.N.D.; formal analysis, J.C.M., M.R.V. and S.N.D.; investigation, J.C.M. and S.N.D.; resources, J.N.A., M.Q.T., J.S.J. and S.N.D.; data curation, J.N.A., M.Q.T., J.S.J. and S.N.D.; writing original draft preparation J.N.A., M.Q.T., J.S.J., J.C.M., M.R.V. and S.N.D.; writing review and editing, J.N.A., M.Q.T., J.S.J., J.C.M., M.R.V. and S.N.D.; visualisation, J.N.A., M.Q.T. and J.S.J.; supervision, J.C.M., M.R.V. and S.N.D.; project administration, J.C.M. and S.N.D.; funding acquisition, J.C.M. and S.N.D. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in this study are openly available and can be accessed from the following source: [57].

Conflicts of Interest

The authors declare no conflicts of interest.

Funding Statement

This work is financed by national funds through FCT—Foundation for Science and Technology, under grant agreement PRT/BD/154386/2023 attributed to the 1st author. Jabez Nesackon Abraham acknowledges the support of the doctoral grant reference PRT/BD/154386/2023, financed by the Portuguese Foundation for Science and Technology (FCT). under the MIT Portugal Program (2022 MPP2030-FCT). This work was also supported by FCT/MCTES under the R&D Unit Institute for Sustainability and Innovation in Structural Engineering (ISISE), under reference UID/4029/2025, and under the Associate Laboratory Advanced Production and Intelligent Systems ARISE under reference LA/P/0112/2020.

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Luo K., Kong X., Zhang J., Hu J., Li J., Tang H. Computer Vision-Based Bridge Inspection and Monitoring: A Review. Sensors. 2023;23:7863. doi: 10.3390/s23187863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Ribeiro D., Rakoczy A.M., Cabral R., Hoskere V., Narazaki Y., Santos R., Tondo G., Gonzalez L., Matos J.C., Massao Futai M., et al. Methodologies for Remote Bridge Inspection—Review. Sensors. 2025;25:5708. doi: 10.3390/s25185708. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Minh T.Q., Ngo Van T., Nguyen H.X., Nguyễn Q. Enhancing the Structural Health Monitoring (SHM) through Data Reconstruction: Integrating 1D Convolutional Neural Networks (1DCNN) with Bidirectional Long Short-Term Memory Networks (Bi-LSTM) Eng. Struct. 2025;340:120767. doi: 10.1016/j.engstruct.2025.120767. [DOI] [Google Scholar]
  • 4.Nhung N.T.C., Van Vu L., Nguyen H.Q., Huyen D.T., Nguyen D.B., Quang M.T. Development and Application of Linear Variable Differential Transformer (LVDT) Sensors for the Structural Health Monitoring of an Urban Railway Bridge in Vietnam. Eng. Technol. Appl. Sci. Res. 2023;13:11622–11627. doi: 10.48084/etasr.6192. [DOI] [Google Scholar]
  • 5.Nhung N.T.C., Bui H.N., Minh T.Q. Enhancing Recovery of Structural Health Monitoring Data Using CNN Combined with GRU. Infrastructures. 2024;9:205. doi: 10.3390/infrastructures9110205. [DOI] [Google Scholar]
  • 6.Minh T.Q., Matos J.C., Sousa H.S., Ngoc S.D., Van T.N., Nguyen H.X., Nguyễn Q. Data Reconstruction Leverages One-Dimensional Convolutional Neural Networks (1DCNN) Combined with Long Short-Term Memory (LSTM) Networks for Structural Health Monitoring (SHM) Measurement. 2025;253:117810. doi: 10.1016/j.measurement.2025.117810. [DOI] [Google Scholar]
  • 7.Mammeri S., Barros B., Conde-Carnero B., Riveiro B. From Traditional Damage Detection Methods to Physics-Informed Machine Learning in Bridges: A Review. Eng. Struct. 2025;330:119862. doi: 10.1016/j.engstruct.2025.119862. [DOI] [Google Scholar]
  • 8.Bao N., Zhang T., Huang R., Biswal S., Su J., Wang Y. A Deep Transfer Learning Network for Structural Condition Identification with Limited Real-World Training Data. Struct. Control. Heal. Monit. 2023;2023:1–18. doi: 10.1155/2023/8899806. [DOI] [Google Scholar]
  • 9.Zhang X., Li L., Ren X. Unsupervised Local Damage Diagnosis Leveraging Substructure Isolation Technology and Time Series Models. Structures. 2025;77:109122. doi: 10.1016/j.istruc.2025.109122. [DOI] [Google Scholar]
  • 10.Yu L., Dao P.B., Qiu L., Uhl T., Le M.-Q. Review of Machine-Learning Techniques Applied to Structural Health Monitoring Systems for Building and Bridge Structures. Appl. Sci. 2022;12:10754. doi: 10.3390/app122110754. [DOI] [Google Scholar]
  • 11.Ganjdoust F., Kefal A., Tessler A. A Novel Delamination Damage Detection Strategy Based on Inverse Finite Element Method for Structural Health Monitoring of Composite Structures. Mech. Syst. Signal Process. 2023;192:110202. doi: 10.1016/j.ymssp.2023.110202. [DOI] [Google Scholar]
  • 12.Bao Y., Li H. Machine Learning Paradigm for Structural Health Monitoring. Struct. Health Monit. 2021;20:1353–1372. doi: 10.1177/1475921720972416. [DOI] [Google Scholar]
  • 13.Wang J., Liu X.Z., Ni Y.Q. A Bayesian Probabilistic Approach for Acoustic Emission-Based Rail Condition Assessment. Comput. Aided Civ. Infrastruct. Eng. 2018;33:21–34. doi: 10.1111/mice.12316. [DOI] [Google Scholar]
  • 14.Ni Y.Q., Wang Y.W., Zhang C. A Bayesian Approach for Condition Assessment and Damage Alarm of Bridge Expansion Joints Using Long-Term Structural Health Monitoring Data. Eng. Struct. 2020;212:110520. doi: 10.1016/j.engstruct.2020.110520. [DOI] [Google Scholar]
  • 15.Feng Z., Zhang J., Katafygiotis L., Hua X., Chen Z. Bayesian Spectral Decomposition for Efficient Modal Identification Using Ambient Vibration. Struct. Control. Health Monit. 2024;2024:5137641. doi: 10.1155/2024/5137641. [DOI] [Google Scholar]
  • 16.Sarmadi H., Entezami A., Saeedi Razavi B., Yuen K.V. Ensemble Learning-Based Structural Health Monitoring by Mahalanobis Distance Metrics. Struct. Control. Health Monit. 2021;28:e2663. doi: 10.1002/STC.2663. [DOI] [Google Scholar]
  • 17.Sarmadi H., Entezami A., Daneshvar Khorram M. Energy-Based Damage Localization under Ambient Vibration and Non-Stationary Signals by Ensemble Empirical Mode Decomposition and Mahalanobis-Squared Distance. JVC/J. Vib. Control. 2020;26:1012–1027. doi: 10.1177/1077546319891306. [DOI] [Google Scholar]
  • 18.Entezami A., Shariatmadar H., Mariani S. Early Damage Assessment in Large-Scale Structures by Innovative Statistical Pattern Recognition Methods Based on Time Series Modeling and Novelty Detection. Adv. Eng. Softw. 2020;150:102923. doi: 10.1016/j.advengsoft.2020.102923. [DOI] [Google Scholar]
  • 19.Entezami A., Shariatmadar H. Structural Health Monitoring by a New Hybrid Feature Extraction and Dynamic Time Warping Methods under Ambient Vibration and Non-Stationary Signals. Measurement. 2019;134:548–568. doi: 10.1016/j.measurement.2018.10.095. [DOI] [Google Scholar]
  • 20.Mei L., Li H., Zhou Y., Wang W., Xing F. Substructural Damage Detection in Shear Structures via ARMAX Model and Optimal Subpattern Assignment Distance. Eng. Struct. 2019;191:625–639. doi: 10.1016/j.engstruct.2019.04.084. [DOI] [Google Scholar]
  • 21.Mujica L.E., Vehí J., Ruiz M., Verleysen M., Staszewski W., Worden K. Multivariate Statistics Process Control for Dimensionality Reduction in Structural Assessment. Mech. Syst. Signal Process. 2008;22:155–171. doi: 10.1016/j.ymssp.2007.05.001. [DOI] [Google Scholar]
  • 22.Entezami A., Sarmadi H., Behkamal B., Mariani S., Entezami C., Sarmadi A., Behkamal H., Mariani B., Health S., Villanueva G. Health Monitoring of Large-Scale Civil Structures: An Approach Based on Data Partitioning and Classical Multidimensional Scaling. Sensors. 2021;21:1646. doi: 10.3390/s21051646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Alamdari M.M., Rakotoarivelo T., Khoa N.L.D. A Spectral-Based Clustering for Structural Health Monitoring of the Sydney Harbour Bridge. Mech. Syst. Signal Process. 2017;87:384–400. doi: 10.1016/j.ymssp.2016.10.033. [DOI] [Google Scholar]
  • 24.Sarmadi H. Investigation of Machine Learning Methods for Structural Safety Assessment under Variability in Data: Comparative Studies and New Approaches. J. Perform. Constr. Facil. 2021;35:04021090. doi: 10.1061/(ASCE)CF.1943-5509.0001664. [DOI] [Google Scholar]
  • 25.Masiero A., Guarnieri A., Baiocchi V., Visintini D., Pirotti F. Machine Learning Clustering Techniques to Support Structural Monitoring of the Valgadena Bridge Viaduct (Italy) Remote Sens. 2024;16:3971. doi: 10.3390/rs16213971. [DOI] [Google Scholar]
  • 26.Sarmadi H., Entezami A., Salar M., De Michele C. Bridge Health Monitoring in Environmental Variability by New Clustering and Threshold Estimation Methods. J. Civ. Struct. Health Monit. 2021;11:629–644. doi: 10.1007/s13349-021-00472-1. [DOI] [Google Scholar]
  • 27.Baduge S.K., Thilakarathna S., Perera J.S., Arashpour M., Sharafi P., Teodosio B., Shringi A., Mendis P. Artificial Intelligence and Smart Vision for Building and Construction 4.0: Machine and Deep Learning Methods and Applications. Autom. Constr. 2022;141:104440. doi: 10.1016/j.autcon.2022.104440. [DOI] [Google Scholar]
  • 28.Maeda H., Sekimoto Y., Seto T., Kashiyama T., Omata H. Road Damage Detection and Classification Using Deep Neural Networks with Smartphone Images. Comput. Aided Civ. Infrastruct. Eng. 2018;33:1127–1141. doi: 10.1111/mice.12387. [DOI] [Google Scholar]
  • 29.Zhu Y., Ni Y.Q., Jin H., Inaudi D., Laory I. A Temperature-Driven MPCA Method for Structural Anomaly Detection. Eng. Struct. 2019;190:447–458. doi: 10.1016/j.engstruct.2019.04.004. [DOI] [Google Scholar]
  • 30.Datteo A., Lucà F., Busca G. Statistical Pattern Recognition Approach for Long-Time Monitoring of the G.Meazza Stadium by Means of AR Models and PCA. Eng. Struct. 2017;153:317–333. doi: 10.1016/j.engstruct.2017.10.022. [DOI] [Google Scholar]
  • 31.Chatterton S., Alfonso Antonino-Daviu J., Yuan Y., Zhu X., Li J. Moving-Principal-Component-Analysis-Based Structural Damage Detection for Highway Bridges in Operational Environments. Sensors. 2024;24:383. doi: 10.3390/S24020383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.El-Abbasy A.A.A. Artificial Intelligence-Driven Predictive Modeling in Civil Engineering: A Comprehensive Review. J. Umm Al-Qura Univ. Eng. Archit. 2025;16:1322–1345. doi: 10.1007/s43995-025-00166-5. [DOI] [Google Scholar]
  • 33.Malekloo A., Ozer E., AlHamaydeh M., Girolami M. Machine Learning and Structural Health Monitoring Overview with Emerging Technology and High-Dimensional Data Source Highlights. Struct. Health Monit. 2022;21:1906–1955. doi: 10.1177/14759217211036880. [DOI] [Google Scholar]
  • 34.Hassan Daneshvar M., Sarmadi H. Unsupervised Learning-Based Damage Assessment of Full-Scale Civil Structures under Long-Term and Short-Term Monitoring. Eng. Struct. 2022;256:114059. doi: 10.1016/j.engstruct.2022.114059. [DOI] [Google Scholar]
  • 35.Ting D. Adaptive Threshold Sampling and Estimation. Proc. ACM SIGMOD Int. Conf. Manag. Data. 2017:1612–1625. doi: 10.1145/3514221.3526122. [DOI] [Google Scholar]
  • 36.Li Z., Khailah E.Y., Liu X., Liang J. Exploring Purpose-Driven Methods and a Multifaceted Approach in Dam Health Monitoring Data Utilization. Buildings. 2025;15:2803. doi: 10.3390/buildings15152803. [DOI] [Google Scholar]
  • 37.Plevris V., Papazafeiropoulos G. AI in Structural Health Monitoring for Infrastructure Maintenance and Safety. Infrastructures. 2024;9:225. doi: 10.3390/infrastructures9120225. [DOI] [Google Scholar]
  • 38.Liu Q., Mu Y., Li X., Wu X., Ren X. A PCA-Based Approach for Very Early-Age Hydration Monitoring of Self-Compacting Concrete Using Embedded PZT Sensors. Sensors. 2023;23:3627. doi: 10.3390/s23073627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Finotti M., Barbosa R., Cury F., Structural A., Lui E.M., Neto M.S., Finotti R., Barbosa F., Cury A. Structural Damage Identification Using Autoencoders: A Comparative Study. Buildings. 2024;14:2014. doi: 10.3390/buildings14072014. [DOI] [Google Scholar]
  • 40.Zhang Y., Zhang Y., Zhou G., Zhang W., Li K., Mu Q., He W., Tang K. A New Ensemble Learning Method for Multiple Fusion Weighted Evidential Reasoning Rule. J. Electr. Comput. Eng. 2023;2023:8987461. doi: 10.1155/2023/8987461. [DOI] [Google Scholar]
  • 41.Yang K., Harley J.B. An Investigation of the Effect of Measurement Interval on the Autoencoder Based Damage Detection in Uncontrolled Structural Health Monitoring. Struct. Health Monit. 2023;2023:851–858. doi: 10.12783/SHM2023/36821. [DOI] [Google Scholar]
  • 42.Silva M.F., Santos A., Santos R., Figueiredo E., Costa J.C.W.A. Damage-Sensitive Feature Extraction with Stacked Autoencoders for Unsupervised Damage Detection. Struct. Control. Health Monit. 2021;28:e2714. doi: 10.1002/stc.2714. [DOI] [Google Scholar]
  • 43.Fernandez-Navamuel A., Magalhães F., Zamora-Sánchez D., Omella Á.J., Garcia-Sanchez D., Pardo D. Deep Learning Enhanced Principal Component Analysis for Structural Health Monitoring. Struct. Health Monit. 2022;21:1710–1722. doi: 10.1177/14759217211041684. [DOI] [Google Scholar]
  • 44.Di T., Magistrale L., Lomazzi L. Master’s Thesis. School of Industrial and Information Engineering, Polytechnic University of Milan; Milan, Italy: 2024. Variational Autoencoders for Structural Health Monitoring and Generative Design. [Google Scholar]
  • 45.Hassani S., Dackermann U., Mousavi M., Li J. A Systematic Review of Data Fusion Techniques for Optimized Structural Health Monitoring. Inf. Fusion. 2024;103:102136. doi: 10.1016/j.inffus.2023.102136. [DOI] [Google Scholar]
  • 46.Huang N. Anomaly Detection in Structural Health Monitoring with Ensemble Learning and Reinforcement Learning. Int. J. Adv. Comput. Sci. Appl. 2024;15:121–135. doi: 10.14569/IJACSA.2024.0150112. [DOI] [Google Scholar]
  • 47.Gaurang Shah S., Sankararaman A., Narayanaswamy B., Singh V. SEAD: Unsupervised Ensemble of Streaming Anomaly Detectors 2025; Proceedings of the 42nd International Conference on Machine Learning; Vancouver, BC, Canada. 13–19 July 2025. [Google Scholar]
  • 48.Siffer A., Fouque P.A., Termier A., Largouet C. Anomaly Detection in Streams with Extreme Value Theory; Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; Halifax, NS, Canada. 13–17 August 2017; pp. 1067–1075. [DOI] [Google Scholar]
  • 49.Lee T., Jin S.S., Kim S.T., Min J. Online Anomaly Detection for Long-Term Structural Health Monitoring of Caisson Quay Walls. Eng. Struct. 2025;323:119197. doi: 10.1016/j.engstruct.2024.119197. [DOI] [Google Scholar]
  • 50.Myllyaho L., Raatikainen M., Männistö T., Mikkonen T., Nurminen J.K. Systematic Literature Review of Validation Methods for AI Systems. J. Syst. Softw. 2021;181:111050. doi: 10.1016/j.jss.2021.111050. [DOI] [Google Scholar]
  • 51.Olawade D.B., Wada O.Z., Ige A.O., Egbewole B.I., Olojo A., Oladapo B.I. Artificial Intelligence in Environmental Monitoring: Advancements, Challenges, and Future Directions. Hyg. Environ. Health Adv. 2024;12:100114. doi: 10.1016/j.heha.2024.100114. [DOI] [Google Scholar]
  • 52.Reynders E., De Roeck G. Encyclopedia of Structural Health Monitoring. John Wiley & Sons, Inc.; Hoboken, NJ, USA: 2008. Continuous Vibration Monitoring and Progressive Damage Testing on the Z 24 Bridge. [DOI] [Google Scholar]
  • 53.Peeters B., De Roeck G. One-Year Monitoring of the Z24-Bridge: Environmental Effects versus Damage Events. Earthq. Eng. Amp; Struct. Dyn. 2001;30:149–171. doi: 10.1002/1096-9845(200102)30:2<149::AID-EQE1>3.0.CO;2-Z. [DOI] [Google Scholar]
  • 54.Orabi M., Tran K.P., Egger P., Thomassey S. Anomaly Detection in Smart Manufacturing: An Adaptive Adversarial Transformer-Based Model. J. Manuf. Syst. 2024;77:591–611. doi: 10.1016/j.jmsy.2024.09.021. [DOI] [Google Scholar]
  • 55.Jwo D.J., Chang W.Y., Wu I.H. Windowing Techniques, the Welch Method for Improvement of Power Spectrum Estimation. Comput. Mater. Contin. 2021;67:3983–4003. doi: 10.32604/cmc.2021.014752. [DOI] [Google Scholar]
  • 56.Sinsomboonthong S. Performance Comparison of New Adjusted Min-Max with Decimal Scaling and Statistical Column Normalization Methods for Artificial Neural Network Classification. Int. J. Math. Math. Sci. 2022;2022:3584406. doi: 10.1155/2022/3584406. [DOI] [Google Scholar]
  • 57.Madakyaru M., Kini K.R. A Novel Anomaly Detection Scheme for High Dimensional Systems Using Kantorovich Distance Statistic. Int. J. Inf. Technol. 2022;14:3001–3010. doi: 10.1007/s41870-022-01046-0. [DOI] [Google Scholar]
  • 58.Angiulli F., Fassetti F., Ferragina L. Reconstruction Error-Based Anomaly Detection with Few Outlying Examples. arXiv. 2023 doi: 10.48550/arXiv.2305.10464.2305.10464 [DOI] [Google Scholar]
  • 59.Hinojosa Lee M.C., Braet J., Springael J. Performance Metrics for Multilabel Emotion Classification: Comparing Micro, Macro, and Weighted F1-Scores. Appl. Sci. 2024;14:9863. doi: 10.3390/app14219863. [DOI] [Google Scholar]
  • 60.Samudra S., Barbosh M., Sadhu A. Machine Learning-Assisted Improved Anomaly Detection for Structural Health Monitoring. Sensors. 2023;23:3365. doi: 10.3390/s23073365. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data used in this study are openly available and can be accessed from the following source: [57].


Articles from Sensors (Basel, Switzerland) are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES