Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2026 Feb 5;16:7455. doi: 10.1038/s41598-026-36656-2

A high-performance training-free pipeline for robust random telegraph signal characterization via adaptive wavelet-based denoising and Bayesian digitization methods

Tonghe Bai 1,2, Ayush Kapoor 2, Na Young Kim 1,2,3,
PMCID: PMC12929601  PMID: 41644663

Abstract

Random telegraph signal (RTS) analysis is increasingly important for characterizing meaningful temporal fluctuations in physical, chemical, and biological systems. The simplest RTS arises from discrete stochastic switching events between two binary states, quantified by their transition amplitude and dwell times in each state. Quantitative analysis of RTSs provides valuable insights into microscopic processes such as charge trapping in semiconductors. However, analyzing RTS becomes considerably complex when signals exhibit multi-level structures or are corrupted by background white or pink noise. To address these challenges and support high-throughput RTS characterization, we propose a modular, training-free signal processing pipeline that integrates adaptive dual-tree complex wavelet transform (DTCWT) denoising with a lightweight Bayesian digitization strategy. The adaptive DTCWT denoiser incorporates autonomous parameter selection rules for its decomposition level and thresholds, optimizing white noise suppression without manual tuning. Complementing this stage, our Bayesian digitizer formulates RTS level assignment as a probabilistic latent-state inference problem incorporating temporal regularization without iterative optimization, effectively resolving binary trap states even under residual notorious background pink noise. Quantitative benchmarking on large synthetic datasets with known ground truth demonstrates improved RTS reconstruction accuracy, trap-state resolution, and dwell-time estimation across diverse noise regimes and multi-trap scenarios, while achieving up to 83Inline graphic speedups over classical and neural baselines. Qualitative validation on experimental RTS data when no ground truth is available illustrates practical usability and flexibility for real-time or large-scale analysis in real measurement settings. Together, the proposed framework establishes a scalable and reproducible foundation for autonomous RTS analysis and systematic benchmarking, with potential to support future extensions toward more complex and device-specific RTS studies.

Keywords: Random telegraph signal, Background noise, Signal denoising, Signal digitization, Dual-tree complex wavelet transform, Semiconductors

Subject terms: Engineering, Mathematics and computing, Physics


Random telegraph signals (RTS) are time-dependent signals that exhibit discrete, stochastic switching events between two distinct levels. Each transition is characterized by a specific amplitude, denoted as Inline graphic, and by dwell times Inline graphic and Inline graphic, which quantify how long the signal resides in the low and high levels, respectively (Fig. 1a). Such fluctuations have been reported across a wide range of systems, from ionic transport in electrochemical settings to active biomolecular processes1 and stochastic gene expression in biology2, as well as signal instabilities in diodes3, sensors4, and even quantum bit (qubit) readouts in quantum computing5,6. In the realm of electronics, RTS often arises from single-charge trapping and detrapping random processes and is observed in atomic-scale devices such as nanoscale metal-oxide-semiconductor field-effect transistors79 and complementary metal-oxide-semiconductor circuits for image sensors10. In modern semiconductor research, this phenomenon is particularly critical, as devices shrink to the nanoscale and become increasingly susceptible to single-charge events and defect-induced noise11. Across these diverse contexts, the discrete fluctuations originate from fundamental microscopic dynamics, and understanding them is essential for both device design and theoretical insight.

Fig. 1.

Fig. 1

(a) A noiseless simple 2-level RTS with the definition of Inline graphic, Inline graphic, and Inline graphic. Examples of processed synthesized RTS with noisy RTS (grey), denoised RTS (blue), and digitized RTS by our DTCWT + Bayesian method (red) on middle subplot, cropped for better visualization; kernel density estimation (KDE) plot on left subplot; time-lag plot on right subplot for the entire RTS duration. (b) The workflow of our three-stage RTS analysis pipeline. (c) 1-trap RTS with Inline graphic. (d) 2-trap RTS with Inline graphic. (e) 3-trap RTS with Inline graphic. (f) 1-trap RTS with Inline graphic. (g) 2-trap RTS with Inline graphic. (h) 3-trap RTS with Inline graphic.

The goal of RTS analysis is to extract the transition amplitude (Inline graphic) and the exponential dwell-time distributions, which reflect the memoryless nature of a Poissonian process. This is essential for identifying the physical variables that govern RTS and for developing reliable models that capture the underlying microscopic dynamics of the nanoscale systems. Yet, analyzing and modeling RTS processes poses unique challenges when other noise sources are present at comparable or higher amplitudes, as the RTS fluctuations can be masked or exacerbated under such high-noise conditions. Among the various noise sources in electronic measurements, white noise and pink noise are especially unavoidable. White noise has a flat spectral density, with power evenly distributed across all frequencies, whereas pink noise exhibits a 1/f-spectrum concentrating more power at lower frequencies12. This low-frequency dominance makes pink noise particularly problematic for RTS analysis, as it masks slow transitions and obscures the identification of individual RTS levels and switching events. To address these challenges, we aim to develop an efficient and systematic RTS analysis protocol capable of reliably handling multi-level RTSs in the presence of both white and pink noise.

As RTS signals grow more complex and involve multiple levels, sophisticated analysis protocols are in demand beyond generic signal processing methods, with a particular emphasis on denoising and digitization. Traditional denoising techniques include moving average (MA) filtering13, fast Fourier transform (FFT) with frequency filtering, empirical mode decomposition14, and various wavelet-based methods15. These techniques provide baseline denoising capabilities, especially effective for signals corrupted by white noise. Probabilistic models, such as hidden Markov models (HMMs)1618 and Gaussian mixture models (GMMs)19, have also been applied, offering strong interpretability of RTS and quantification of RTS parameters. Unfortunately, these approaches often require careful parameter initialization, limiting their scalability and robustness in complex RTS environments. When confronting pink noise and multi-trap scenarios, these classical approaches fall short.

Besides the challenges of noise masking, another important difficulty in RTS analysis is accurately assigning each discrete level to its corresponding trap state, especially in multi-trap scenarios where level overlap and noise can confound traditional digitization methods. Although considerable work has been done to quantify key RTS parameters, such as the amplitude Inline graphic and the mean dwell times Inline graphic and Inline graphic19,20, noise remains a major barrier to accuracy. For instance, density-based techniques such as kernel density estimation (KDE) can identify Inline graphic through clear multimodal peaks in amplitude statistics, but only under well-denoised conditions, while accurate estimation of dwell times likewise depends on the quality of denoising and digitization.

Recently, neural network (NN)-based methods have been proposed and applied to real RTS data from nanoscale electronic devices2123. In particular, NN-based denoising methods have gained attention due to their ability to generalize across diverse noise conditions, such as multi-trap or pink noise, without requiring manual parameter tuning. Architectures such as Recurrent neural networks21 and specialized autoencoder models like DAEUBL - a denoising autoencoder based on U-Net and bidirectional long short-term memory layers19,20 has demonstrated improved performance in digitizing clean RTS from noisy measurements, often outperforming classical filters under complex noise conditions. Self-organizing maps, as an alternative unsupervised NN approach, have also been employed for RTS pattern recognition in resistive random-access memories23. Beyond NNs, another machine learning (ML) technique, such as K-Medoids clustering, has been applied to real cryogenic transistor data for localizing complex RTS features22. Meanwhile, a more general ML framework for RTS analysis is emerging as well24. Despite these advances, NN-based methods still face critical drawbacks. Their inference speed and memory consumption become prohibitive when applied to long-duration or high-resolution signals. For example, a 100-second RTS recorded at 10 ns resolution yields 10 billion time steps, well beyond the processing capabilities of typical NN models like DAEUBL within a reasonable amount of time and memory, even on NVIDIA V100 GPUs.

While denoising has seen significant innovation with these advanced neural-based and wavelet-based models, RTS digitization, the process of mapping denoised signals to discrete levels and decomposing them into binary sequences for individual traps, has received comparatively less systematic attention. Especially, the task becomes particularly challenging when signals involve more than 2 levels. Current practice primarily relies on simple thresholding methods25 or proximity-based peak-matching heuristics19,20. Some probabilistic-based methods have been explored in the context of RTS noise analysis26,27, offering a way to model and interpret fluctuating signals in a statistically grounded manner. Bayesian approaches, particularly suited for handling uncertain and noisy data, are valuable for extracting meaningful information from RTSs, where noise often obscures state transitions26,27, but existing formulations typically rely on complex models or require extensive tuning and prior assumptions28,29.

Finally, a unified framework for evaluating RTS analysis performance remains missing from the literature, as prior studies typically assess only a subset of metrics, without a comprehensive or standardized evaluation protocol. To address this gap, we introduce a fully specified RTS signal processing pipeline that integrates denoising and digitization within a single modular framework, together with a comprehensive suite of quality and performance metrics. In particular, we define a 7-category benchmarking regime including various novel measures such as trap count error (Inline graphic error), to thoroughly evaluate the ability of each algorithm to characterize individual RTS traps in a systematic and reproducible manner.

Here we offer a comprehensive training-free and modular RTS analysis pipeline designed for robust denoising, high-fidelity digitization, and statistical characterization. At its core are two primary algorithmic contributions: an adaptive dual-tree complex wavelet transform (DTCWT) denoiser and a Bayesian digitizer. Each contribution is selected to address limitations observed in prior methods, particularly under high-noise conditions and multi-trap scenarios. High-noise conditions, we refer to settings with a wide range of noise levels, including severe background fluctuations that can obscure step transitions–a point elaborated further in the Methods section. While wavelet-based denoising30 and probabilistic inference2629 have each been explored independently in prior RTS studies, our work adopts in its adaptive formulation and unified integration into an autonomous pipeline that operates without training data or learned models. Each RTS trace is processed independently, avoiding dependence on prior signal statistics and reducing the barrier for experimental adoption.

The proposed adaptive DTCWT denoiser leverages the approximate shift invariance and better transition preservation of the traditional DTCWT by incorporating automated decomposition-level selection and thresholding rules specific to RTS transitions. The benchmarking results show that these characteristics allow for efficient background noise suppression while preserving transition fidelity under a variety of noise conditions. A probabilistic model that interprets the denoised signal in terms of discrete amplitude levels found from KDE is then used to handle RTS digitization. The digitizer determines RTS levels using a Bayesian inference process that integrates data from the observed signal amplitude and prior state likelihoods, as opposed to heuristic thresholding or proximity-based peak matching. In addition to being computationally efficient and appropriate for large-scale RTS analysis, this method offers a lightweight and comprehensible substitute for more intricate generative models. Methods provides a detailed description of the formal probabilistic formulation and inference process.

The workflow for the RTS characterization, as illustrated in Fig. 1b, involves three primary stages:(1) denoising and KDE, (2) digitization, and (3) dwell time statistics extraction. First, a raw signal is processed using a denoising algorithm to suppress background fluctuations while preserving the discrete transitions characteristic of the RTS behavior. The denoised signal is then analyzed using statistical tools such as kernel density estimation (KDE) to identify discrete amplitude levels. In the second stage, the identified levels then serve to digitize the signal into binary sequences, each representing an individual trap. Finally, from these digitized traces, we extract key trap parameters such as their characteristic dwell times Inline graphic and Inline graphic, corresponding to the average durations that the system stays in the low and high states in Fig. 1a.

Reliable characterization of each trap, such as quantifying Inline graphic, Inline graphic and Inline graphic for real data is essential, as it directly informs device reliability and can guide design decisions in manufacturing. As real experimental data lack a well-defined ‘true value’, validating our proposed pipeline through systematic performance quantification is only feasible using synthesis RTS data. Figure 1c–h shows examples of processed synthesized RTS signals. The procedure for generating these ground-truth synthesized RTS datasets is described in Methods. Motivated by the need for reliable validation using controlled datasets, we next present a systematic evaluation of our proposed RTS analysis pipeline on synthesized signals spanning diverse trap numbers, noise types, and noise levels.

Results

To benchmark the pipeline, we synthesize a dataset consisting of 1,800 RTS samples, each with length Inline graphic time steps. These samples span evenly distributed across ground truth trap counts Inline graphic, two background noise types (white noise (wn) and pink noise (pn)) and noise levels, Inline graphic. Quantitative benchmarking is performed on synthetic RTS signals, as real experimental RTS data generally lack ground-truth annotations for trap number, transition amplitudes, and dwell times, making objective quantitative evaluation infeasible without extensive device-specific physical modeling. Figure 1c–e illustrate 1-trap, 2-trap, and 3-trap RTSs (a(t)) injected with three levels of white noise Inline graphic respectively. As both the number of traps and noise level increase, KDE peaks in the left sub panel become less distinguishable, and the time-lag plots increasingly overlap colonies of points. For a 3-trap RTS in Fig. 1e, only 6 KDE peaks are identified, out of the 8 expected levels if three traps are independent and mutually exclusive. Figure 1f–h correspond to 1-trap, 2-trap, and 3-trap RTSs masked by the same pink noise level, Inline graphic in all three cases. Higher trap counts under pink noise present greater denoising challenges: residual low-frequency components can create spurious levels in the KDE. In Fig. 1g, for instance, a true 2-trap RTS may be misclassified as a 3-trap RTS due to multiple sub-peaks. Synthetic RTS signals therefore play a critical role in enabling controlled and reproducible evaluation across noise types, trap counts, and signal complexities, allowing systematic stress testing of denoising and digitization performance that is not possible with real experimental data alone. Beyond the difficulty of colored-noise masking, a further major difficulty lies in the efficient processing of very long RTS signals. In high-resolution experimental measurements, nanoscale devices can generate datasets containing millions or even billions of time steps. This scale places heavy demands on both memory and computation, making it essential to design denoising and digitization algorithms that are not only accurate but also adaptable and computationally efficient.

We emphasize that our RTS datasets are generated under more challenging conditions than prior studies19,20, with a tighter definition of background noise strength Inline graphic in Methods. Furthermore, we incorporate scaled trap amplitudes, where each additional trap has a reduced strength relative to the dominant trap, to simulate sub-dominant traps and introduce strong pink noise that can fully mask weaker transitions, thereby stressing both the denoising and digitization stages (details in Methods). Real experimental RTS examples are included in this work to demonstrate practical applicability of the proposed pipeline rather than to provide quantitative validation, as meaningful numerical error metrics on real RTS data typically require expert annotation and detailed device-specific physical interpretation, often forming the focus of a standalone experimental study. To demonstrate the advantages of our approach, we define two primary benchmark categories: quality and performance. These benchmarks quantitatively assess both the proposed and baseline methods in a controlled and reproducible manner, ensuring that both accuracy and computational efficiency are considered.

Four complete methods are compared across all evaluation metrics: MA denoising with proximity-based digitization (MA + Proximity), FFT denoising with proximity-based digitization (FFT + Proximity), DAEUBL denoising with proximity-based digitization (DAEUBL + Proximity), and our proposed DTCWT denoising combined with Bayesian digitization (DTCWT + Bayesian). The first two represent traditional, non-neural-network baselines. The third is a neural-network-based method–the only such method currently available for this task. While the first three combinations serve as baseline references, the fourth is our proposed approach, which is a modular and probabilistic alternative for robust RTS analysis. Our DTCWT + Bayesian is a coherent pipeline that pairs adaptive DTCWT denoising with a probabilistic model-based digitization scheme. It is important to clarify that digitization occurs after denoising and KDE. Thus, the digitization quality measured in our benchmarks reflects the effectiveness of the entire method pipeline, not just the digitization algorithm in isolation. Below summarizes the results of the denoising quality: signal-to-noise ratio, trap number error, and trap transition amplitude error; as well as the digitization quality as trap state error, and mean dwell times error.

Denoising quality

Signal-to-Noise Ratio (SNR)

First, we evaluate the denoising performance using SNR as Inline graphic, where L is the length of the RTS signal in steps, x(t) is the ground truth amplitude at time t, and Inline graphic is the denoised signal amplitude at the same time step. It is important to note that SNR may not fully reflect the capabilities of noise-removal methods such as DTCWT, particularly under pink noise conditions. This is because the DTCWT + Bayesian approach is designed to defer pink noise handling to the digitization stage via Bayesian inference. Figure 2 plots the SNR in decibels (dB) over varying levels of white noise (Inline graphic, a) and pink noise (Inline graphic, b), across ground truth trap numbers Inline graphic, respectively. As expected, SNR generally decreases with increasing noise levels (Inline graphic) and higher trap complexity (Inline graphic). Moreover, lower SNR is observed under pink noise compared to white noise, as the more complex, non-white background noisy patterns significantly mask the RTS signals. Among four methods, DAEUBL achieves the highest SNR in all noise levels and trap numbers, corroborating its performance reported in the original work20. The DTCWT method ranks second and yields SNR values above 20 dB in most cases, outperforming the other non-neural-network baselines (MA and FFT) across all tested conditions. Although DTCWT does not surpass the neural-network-based DAEUBL baseline in terms of SNR, this outcome is coherent with its design: the DTCWT primarily targets white noise and intentionally leaves pink noise components unfiltered for subsequent Bayesian post-processing. As a result, pink noise traces remain in the denoised signals, lowering the measured SNR. Nevertheless, DTCWT still achieves superior SNR compared to traditional MA and FFT baselines, while offering a more interpretable and efficient denoising strategy.

Fig. 2.

Fig. 2

Comparison of SNR (denoising quality) on benchmarked algorithms. RTS samples spanning ground truth Inline graphic, and Inline graphic for white (a) and pink noise (b), respectively. RTS sample lengths fixed at Inline graphic time steps.

Trap number error

At the validation step of our model, we evaluate the trap number detection accuracy by comparing Inline graphic estimated from each method against the ground truth of the synthetic data. The detected trap number Inline graphic is determined from the number of KDE levels in each denoised signal. Specifically, we compute Inline graphic, where the result is capped within the range [1, 3] to align with the ground truth Inline graphic values used in this benchmark. Figure 3 illustrates the trap number error results under white noise (Fig. 3a) and pink noise (Fig. 3b). Each subplot corresponds to a pairing of ground truth and detected Inline graphic: the three columns represent Inline graphic of 1, 2, and 3 traps, and the three rows represent Inline graphic. Each subplot presents detection frequency on the y-axis and noise level on the x-axis. In the diagonal plots of the white noise in Fig. 3a, our proposed method (adaptive DTCWT, shown in pink) outperforms all baselines 99% of the time in correctly estimating Inline graphic across Inline graphic and Inline graphic. DAEUBL ranks second in most cases, and FFT always underperforms across all configurations. In practice, overestimation of Inline graphic is generally less problematic than underestimation since extra levels from overestimation can be discarded based on prior physical knowledge of the system; however, underestimated Inline graphic may miss meaningful trap events entirely. For the pink noise case in Fig. 3b, trap number estimation becomes less stable, with stronger fluctuations in detection frequencies. Interestingly, MA performs best when Inline graphic, but its effectiveness drops sharply at Inline graphic, where it frequently underestimates. By contrast, DTCWT generally offers the most reliable performance, except at the highest Inline graphic noise levels for the Inline graphic case. Even in this challenging regime, our pipeline remains the top performer relative to other baselines. This stability reflects the ability of the DTCWT denoiser to suppress background noise while preserving transition features critical for KDE-based trap identification. DAEUBL, on the other hand, exhibits a clear tendency to overestimate the number of traps, which is particularly evident in the bottom row of subplots.

Fig. 3.

Fig. 3

Comparison of Inline graphic error (denoising quality) on benchmarked algorithms. RTS samples spanning ground truth Inline graphic, and Inline graphic for white (a) and pink noise (b), respectively. RTS sample lengths fixed at Inline graphic steps. The darkened diagonal subplots indicate correct detections, namely, Inline graphic. The values in each column of each method sum to 100% of all RTS samples at the same ground truth Inline graphic and Inline graphic level. The bottom-left subplots indicate overestimation, while the top-right subplots reflect underestimation.

Trap transition amplitude error

The RTS trap transition amplitude (Inline graphic) is one of the three key parameters characterizing each RTS trap, alongside the high-state and low-state average dwell times (Inline graphic and Inline graphic). Inline graphic is estimated during the denoising and KDE step prior to digitization. Each KDE peak corresponds to a discrete signal level, which we represent using a binary index. In this representation, each bit indicates the state of a single trap: 1 = active (trap in high state), 0 = inactive (trap in low state). Interpreting the levels as least-significant-bit (LSB)-first binary codes allows us to map each observed level to a unique combination of trap states, from which Inline graphic values for individual traps can be decoded. For example, in a system with two mutually independent traps, there can be up to four possible distinct levels: 00 corresponds to both traps being inactive, and 11 corresponds to both traps being active simultaneously. This scheme generalizes naturally: for Inline graphic, there are Inline graphic possible binary codes (000–111), each corresponding to a unique combination of trap states. The conversion is constrained by the number of traps detected (Inline graphic), ensuring that only valid binary indices are used in digitization. This decoding process effectively maps the continuous density peaks in KDE, shown on the left of Fig. 1c-h, into compact, interpretable binary representations. These signal levels allow us to compute the amplitude difference between each state transition and compare against the ground truth.

We compute a percentage error Inline graphic as Inline graphic for each RTS detected Inline graphic trap, where Inline graphic. Figure 4 summarizes Inline graphic as box plots of white noise (a) and pink noise (b) across different noise levels. Each subplot has the range of Inline graphic levels on the x-axis and Inline graphic in y-axis for the individual i-th trap. Across all trap configurations and noise levels, DTCWT consistently produces the lowest and most tightly concentrated Inline graphic errors. The accuracy and stability of DTCWT in estimating trap amplitudes reinforce its robustness, especially under high noise conditions where other methods fail. The performance degradation of MA, FFT, and even DAEUBL at high noise levels aligns with previous observations from the Inline graphic error analysis: when traps cannot be reliably identified, accurate Inline graphic estimation becomes infeasible. For the pink noise case in Fig. 4b, Inline graphic is generally higher than its white noise counterpart, with box plots exhibiting broader spreads that reflect greater variability in estimation. Unlike the white noise scenario, no single method emerges as a consistently dominant performer. Still, MA shows surprisingly strong accuracy in the Inline graphic setting, underscoring its effectiveness for simple single-trap signals even in the presence of structured pink noise.

Fig. 4.

Fig. 4

Comparison of Inline graphic error (denoising quality) on benchmarked algorithms. RTS samples spanning ground truth Inline graphic, and Inline graphic for white (a) and pink noise (b), respectively. RTS sample lengths fixed at Inline graphic steps. Error statistics of individual traps are separated into subplots. Each subplot contains results from up to 50 synthetic RTSs, provided the corresponding trap was correctly detected. The subplot layout is organized as follows: The top row, left subplot shows error distribution for the single trap in Inline graphic samples. The top row, middle and right subplots display errors for the two individual traps, labeled as the first and second traps, in Inline graphic synthetic RTSs. The bottom row shows errors for the first, second, and third traps in Inline graphic synthetic RTSs.

Digitization quality

After completing the denoising and KDE stage, we proceed to evaluate the digitization quality in terms of two error metrics: trap state error as root mean square error (RMSE) of the binary digitized signal with respect to ground truth, and error in the dwell time statistics (Inline graphic) for each trap. RMSE evaluates signal fidelity, while Inline graphic statistics quantify transition behaviors. Among all metrics, we consider these RTS-specific characterization errors to be of higher priority than general signal metrics such as SNR, as they provide deeper insights into the RTS structure. More implementation and visualization details of these error metrics are presented alongside the respective plots in subsequent sections.

Trap state error

During the digitization step, one outcome is a digitized binary sequence for each trap, and the final digitized RTS signal is reconstructed by summing these sequences, each scaled by its respective Inline graphic amplitudes. The first metric is the root mean squared error (RMSE) between the digitized binary sequences and the ground truth from the original synthetic data. Mathematically, the RMSE for the Inline graphic trap is then computed as Inline graphic, where L is the RTS signal length in time steps, and Inline graphic denotes the digitized state of the Inline graphic trap given t. Hence, this metric directly captures only state errors for each trap and is independent of amplitude deviations, unlike the RMSE computed for the full reconstructed signal. Figure 5 summarizes the RMSE results in the range of [0, 1] along the Inline graphic noise levels in (a) and (b), respectively. For the white noise results in Fig. 5a, while DAEUBL + Proximity shows the lowest RMSE in the Inline graphic and 2 cases, it often fails to detect all three traps in Inline graphic samples, resulting in missing data for some subplots. It is important to note that these plots include only the traps that were successfully digitized. Therefore, the number of valid samples will differ across methods and noise levels. Our proposed method (DTCWT denoising + Bayesian digitization) demonstrates more consistent digitization success across all traps and noise levels, even when its RMSE is not always the lowest. In the presence of pink noise (Figure 5b), our method generally yields the lowest and most concentrated RMSE values. Some exceptions appear in the Inline graphic trap of Inline graphic and 2 at very high noise levels (Inline graphic), where DAEUBL + Proximity yields slightly lower median or second quartile in RMSE. Closer inspection of DAEUBL + Proximity box plots reveals significantly wider interquartile ranges (IQR) and pronounced outliers, indicating high variance and unreliability. Although the central tendency appears favorable, the variability suggests a lack of robustness compared to the more stable performance of our DTCWT + Bayesian method.

Fig. 5.

Fig. 5

Comparison of RMSE (digitization quality) on benchmarked algorithms. RTS samples spanning ground truth Inline graphic, and Inline graphic for white (a) and pink noise (b), respectively. Each RTS sample is fixed at a length of Inline graphic time steps. Error statistics of individual traps are presented in separate subplots.

Mean dwell times error

We next evaluate the accuracy of average dwell time estimation after digitization, quantified by the percentage error in the mean high-state and low-state dwell times, denoted as Inline graphic and Inline graphic. For each individual trap in every RTS sample, Inline graphic and Inline graphic are extracted from the Poisson statistics fitting in the dwell-time histograms from the digitized signals. Note that the reference is the Inline graphic and Inline graphic from the same statistical analysis with. For the Inline graphic trap, the percentage error is calculated as Inline graphic, where Inline graphic. Figure 6 collects the error distributions of Inline graphic across noise levels for white noise (a) and pink noise (b). For Inline graphic and Inline graphic in Fig. 6a, the error grows as Inline graphic increases for all models. In the Inline graphic case, DAEUBL + Proximity continues to perform well; however, at Inline graphic, DAEUBL + Proximity and most other baselines frequently fail to detect the third trap. Only the proposed method (DTCWT + Bayesian) reliably captures the third trap under such challenging conditions. DAEUBL + Proximity can achieve lower error in favorable regimes; however, its reliability degrades more rapidly with increasing noise. By contrast, DTCWT + Bayesian maintains consistent detection and generally exhibits the smallest variation across error distributions, particularly under pink noise. A similar pattern is observed under white noise except for the first trap of Inline graphic at Inline graphic, where variability increases. Under pink noise (Fig. 6b), the overall trends are consistent with the white noise scenario. DAEUBL + Proximity often yields the lowest median error, but this comes with a substantially wider IQR, reflecting larger variability. MA + Proximity, on the other hand, performs the weakest in most multi-trap configurations, despite showing stronger results in earlier metrics. The proposed method does not always achieve the lowest Q2, but it consistently produces tighter error distributions, reflecting greater stability. Thus, DAEUBL + Proximity excels in favorable conditions, whereas DTCWT + Bayesian is more robust and reliable across noise types and trap complexities. Inline graphic also exhibits similar trends to Inline graphic; notably, only the proposed DTCWT + Bayesian method successfully captured the third trap in the Inline graphic case at Inline graphic.

Fig. 6.

Fig. 6

Comparison of Inline graphic (digitization quality) on benchmarked algorithms. RTS samples spanning ground truth Inline graphic, and Inline graphic for white (a) and pink noise (b), respectively. RTS sample lengths fixed at Inline graphic steps. Error statistics of individual traps are separated into subplots. Note that the box plots display individual points to present the cases where traps are successfully identified.

Performance

Beyond model quality, we assess the performance of denoising, digitization, and the overall processes in terms of two parameters: core execution time and peak memory usage. The datasets used for performance benchmarking consist of 240 RTS samples with varying lengths from Inline graphic to Inline graphic time steps. Although we also tested across different noise levels and numbers of traps, these factors did not show a significant impact on performance metrics. Thus, our performance benchmarks primarily focus on execution time and memory usage across different signal lengths. The following metrics are used for performance benchmarking: core execution time, capturing the runtime of each core algorithm (e.g., denoising, KDE, digitization), excluding file I/O operations. Then, peak memory usage occurs during the execution of each algorithm. All benchmarking experiments are performed under fixed hardware constraints to ensure fair and consistent comparisons. Specifically, all evaluations are conducted on a dedicated GPU node of the Compute Canada Beluga cluster, with 10 cores of Intel Gold 6148 Skylake CPU @ 2.4 GHz and 1 NVIDIA V100SXM2 GPU with 16 GB VRAM.

Core execution time

The execution time for each core component of the pipeline is recorded individually for every RTS sample as a function of RTS length L. The plots in Fig. 7 therefore collect the results with a 95% confidence interval band at each L. Figure 7a presents the execution time of denoising (left), digitization (middle), and the overall process (right) on a log-log scale. All four methods display a power-law trend as a function of L (linear in log-log scale). The simpler methods (MA, FFT) take less time than the more complex methods (DAEUBL, DTCWT). Bayesian digitization outperforms the proximity method by nearly two orders of magnitude for all L values in the middle subplot. Overall, it is evident that our proposed method, DTCWT denoising followed by Bayesian digitization is approximately 83 times faster than the other baseline approaches.

Fig. 7.

Fig. 7

Comparison of denoising and digitization performance on benchmarked algorithms. RTS samples spanning from Inline graphic to Inline graphic steps. (a) Core execution time. (b) Peak memory usage. The shaded regions around the curves indicate the 95% confidence intervals across RTS samples.

Peak memory usage

Figure 7b plots the peak memory consumption in mebibytes (MiB) of each core component in the processing pipeline for every RTS sample in the performance evaluation dataset. Since DTCWT and Bayesian require higher peak memory usage than their counterparts, our proposed method, DTCWT + Bayesian, demands up to three times more peak memory compared to any baseline approach. Given that this configuration achieves an approximately 83-times improvement in execution speed, the associated increase in memory consumption represents a reasonable and justifiable trade-off.

Discussion

Building on these benchmarking results, we now turn to a closer examination of the underlying mechanisms that drive the desirable performance of our proposed pipeline under challenging RTS conditions. First, the DTCWT denoiser exhibits the features of shift invariance, directional selectivity in the wavelet domain (i.e., distinguishing signal components by orientation in frequency rather than by time or amplitude), and strong noise suppression while preserving RTS step transitions. Unlike previous implementation requiring manual parameter tuning31, our version includes an automatic parameter selection mechanism based on raw signal statistics, making it adaptive to diverse background noise regimes. For KDE, we employ a prominence-based peak filtering step that suppresses spurious peaks caused by noise, thereby improving the Inline graphic estimation. This refinement is not the primary focus of our cross-method comparisons, since KDE is used consistently across all baselines, but it contributes to more reliable trap identification when it combines with the DTCWT denoiser. Second, the Bayesian digitizer replaces traditional heuristic-based digitization methods (e.g., thresholding or proximity rules) with a probabilistic latent-level model over KDE peaks. For each time step, we compute the full posterior distribution under a Gaussian emission model and then report the digitized label via maximum a posteriori (MAP) decoding. Along with temporal prior smoothing, this yields a principled measure of uncertainty, reduces susceptibility to noise, and naturally extends to multi-trap scenarios where interactions between levels complicate deterministic digitization. This allows for more reliable digitization and scales well in multi-trap settings. The resulting binary or multi-level sequences enable the best possible quantification of physical RTS parameters, Inline graphic, Inline graphic and Inline graphic.

With increasing demands for real-time RTS analysis in applications such as quantum random number generation (QRNG) and semiconductor quality control during fabrication, efficient signal processing is becoming a necessity. Our proposed method demonstrates practical real-time performance: 20 million steps of RTS data can be processed in just 30 seconds, which corresponds to 1 second of signal processed in 1 second if sampled at 667 kHz (1Inline graphics time bin). In contrast, the DAEUBL-based pipeline requires about 3000 seconds to process the same data, supporting only up to 6.67 kHz resolution for real-time usage, even if we exclude the training time of the DAEUBL model. Regarding memory, our evaluation focuses on peak memory usage, which measures the maximum memory consumption during the entire execution. This metric is particularly relevant for future deployment scenarios, such as implementing RTS analysis pipelines on embedded or standalone hardware where memory is often a critical constraint. Ensuring that peak memory remains within feasible limits is essential for such implementations. It is important to contextualize the observed memory footprint. DAEUBL was highly optimized using Keras, and classical denoisers such as MA and FFT were implemented with efficient libraries like NumPy and SciPy. In contrast, our DTCWT implementation has not undergone memory optimization yet. Nevertheless, the observed increase in peak memory, up to approximately three times higher than other methods, is a reasonable trade-off considering the 83 times improvement in execution speed. This suggests that future optimizations of DTCWT memory usage could further enhance its suitability for low-resource environments without compromising speed.

Beyond quantitative evaluation, the proposed approach offers several key practical advantages. Most notably, it does not require training of machine learning models, enabling it to generalize across a wide variety of RTS conditions without overfitting to specific data distributions. In contrast, neural-network-based methods like DAEUBL, although being powerful under ideal conditions, are often less reliable on unseen real RTS data and demand substantial expertise to train, validate, and deploy31. Our method is inherently modular, allowing users to plug in different stages if needed, while remaining robust with minimal intervention. We also evaluated our method on experimental RTS signals collected from a carbon nanotube (CNT) device, visualized in Figure 8. The device under test consists of a 500 nm-wide, 65 nm-thick multi-walled CNT film acting as a conduction channel, with two terminals for voltage bias. The current through the channel is recorded at a temperature of 9 K. We emphasize that these real-data examples are presented to demonstrate practical applicability rather than to provide a complete physical characterization of CNT trap dynamics, which typically requires device-specific studies such as gate-voltage or temperature dependence and is beyond the scope of this work. In Figure 8a, the time-lag plot of this sample shows mostly round-shaped intensities, indicative of white noise. The RTS signal is clean and clearly exhibits discrete transitions, making it well-characterized as a single-trap RTS. The denoiser performs smoothly and confidently classifies the step levels. The sample in Figure 8b exhibits stretched intensities in the time-lag plot, consistent with pink noise. Although it is noisier, it remains interpretable. The KDE reveals two lower peak levels near 4.258. Depending on tuning, one could retain one of these levels to model the signal as a 2-trap RTS with three levels, or discard both and interpret it as a simplified single-trap RTS. Such ambiguity reflects common experimental scenarios where expected trap behavior may depend on prior physical insight; the proposed pipeline provides a reasonable autonomous interpretation while allowing expert-guided refinement when desired. The last sample in Fig. 8c also displays mostly white noise characteristics in the time-lag plot. Although the KDE suggests a potential high-level peak, the denoised signal offers limited support for this level, particularly from Inline graphic to Inline graphic. The time-lag density confirms that this upper peak occurs much less frequently, justifying its exclusion under fully autonomous operation. However, prior literature19 has suggested that the fast switching behavior below approximately 5.20 may correspond to a separate trap; motivated by this, we include an additional tuned result in Fig. 8d, where reduced denoising strength preserves these transitions and yields a multi-trap interpretation. This comparison illustrates a key strength of the proposed framework: it enables reliable autonomous analysis across diverse noise regimes, while remaining flexible enough to incorporate expert expectations about trap behavior when such knowledge is available. In contrast, DAEUBL shows reduced robustness on this unseen real RTS example Fig. 8e, as its training and noise synthesis do not fully span the experimental conditions encountered here.

Fig. 8.

Fig. 8

Examples of processed real RTS from carbon nanotube film with noisy RTS (grey), denoised RTS (blue), and digitized RTS (red) on middle subplot, cropped for better visualization; kernel density estimation (KDE) plot on left subplot; time-lag plot on right subplot for the entire RTS duration. (a-c) RTS from three distinct samples (Samples 1–3) processed using fully autonomous DTCWT + Bayesian. (d) RTS from Sample 3 reprocessed using manually tuned DTCWT + Bayesian following expert recommendation on potential alternative trap behavior. (e) RTS from Sample 3 reprocessed using DAEUBL + Proximity.

Our solution is training-free, fast, generalizable, and easy to use for non-ML experts. While no single approach can dominate across all RTS types due to their inherent variability, our framework comes close by balancing accuracy, robustness, and usability. This leads to our future direction: building a fully adaptive RTS analysis pipeline that can automatically select optimal methods at each stage of the workflow. Rather than benchmarking all algorithm combinations on every sample, such a system would dynamically recommend the best configuration, balancing accuracy and computational cost–based on real-time signal characteristics. This will support a more scalable and intelligent RTS characterization protocol applicable to both synthetic and real-world datasets, fulfilling the long-standing need for a standardized, high-throughput RTS analysis toolkit. Development of new denoisers and digitizers remains central to this goal. An alternative direction involves direct digitization, segmenting the noisy signal into tens or even hundreds of candidate traps, where high-frequency, short-lived transitions are interpreted as noise and discarded. This approach could retain only the dominant, stable trap states, potentially preserving accurate dwell statistics without requiring explicit denoising. In addition, neural networks offer promising avenues for RTS digitization beyond classical rule-based methods. While most existing neural approaches focus on denoising with interpretable outputs, lightweight architectures could be trained to directly perform digitization, learning to distinguish genuine trap transitions from noise. Future studies will explore such strategies to further enhance flexibility and performance in RTS analysis.

Methods

Generator protocol

A synthetic signal generation pipeline is constructed for mimicking the physical switching behavior of RTS traps, which enables the quantitative evaluations of denoising methodologies, such as the number of traps Inline graphic and various noise environments, with noise level Inline graphic or Inline graphic corresponding to white noise or pink noise type. Starting from the ground truth RTS signal, we progressively corrupt it with controlled noise, enabling precise benchmarking of denoising and digitization algorithms. Each RTS trap is modeled as a two-state (high/low) Markov process with probabilistic switching governed by characteristic lifetimes Inline graphic and Inline graphic. These correspond to the average durations that the system stays in the low and high states, respectively, and are linked to physical trapping/de-trapping mechanisms in nanoscale devices. We synthesize each trap component as a binary sequence characterized by three parameters: Inline graphic, Inline graphic with each switching durations sampled from a geometric distribution, and a corresponding trap amplitude defined by Inline graphic. Multiple such components are synthesized and summed to construct a ground truth multi-trap signal, which is equivalent to the physical superposition in multi-trap RTS.

Traps are assumed to be independent in order to provide a controlled baseline for benchmarking that isolates algorithmic performance without introducing device-specific trap interactions. While correlated traps may arise in certain experimental systems, the independent-trap assumption does not trivialize the analysis task. To emulate realistic experimental conditions, controlled noise is added to the ground truth RTS signal, which can be either single-trap or multi-trap. A(t) denotes the ground truth RTS amplitude at time or signal step t, to which noise is added with intensity scaled by a factor Inline graphic. Let Inline graphic be the RTS maximum amplitude range. Upon synthesizing the white noise to the ground truth signal, we define the white noise Inline graphic where Inline graphic denotes a Gaussian distribution with zero mean and variance Inline graphic. The standard deviation is Inline graphic. The synthesized RTS with white noise is then expressed as Inline graphic. To synthesize RTS samples with pink noise as a form of low-frequency noise, we replace the previously modeled Gaussian white noise with pink noise, characterized by a power spectral density (PSD) that follows a Inline graphic trend, where f denotes frequency and Inline graphic is the power-law exponent. We assume the standard pink noise with Inline graphic. The pink noise P(t), when the given noise level is Inline graphic, is derived by applying a power-law filter to white noise W(t) where Inline graphic. The filtered output Inline graphic has a 1/f PSD characteristic. Therefore, the pink noise term is defined as Inline graphic, where Inline graphic is a pink noise sample with Inline graphic spectral properties by first generating white noise. The resulting pink noise RTS thus becomes Inline graphic. The process of adding controlled noise can be visualized in Fig. 1 from ground truth RTS (black) to noisy RTS (grey). Although the noise is additive, the inclusion of strong low-frequency (pink) noise introduces substantial temporal correlations and background fluctuations that can fully mask weaker RTS transitions, reflecting a major challenge commonly encountered in experimental RTS measurements.

All stochastic procedures, including RTS switching and noise generation, are seeded using a signal-specific random seed. Given the RTS parameters and seed, the exact signal can be regenerated. This deterministic setup ensures traceability and supports debugging and reproducible experimentation. In summary, the RTS generator synthesizes white noise or pink noise RTS based on sample length L, number of traps Inline graphic, corresponding noise level Inline graphic or Inline graphic, as well as a random seed for reproducibility. This modular pipeline provides a flexible framework for generating synthetic RTS signals suitable for evaluating denoising algorithms in a variety of noise regimes.

Denoising and KDE algorithm

The denoising module takes a raw amplitude time-series signal as an input, which typically exhibits substantial background noise that can mask the discrete switching events characteristic of RTS behavior. The resulting denoised signal closely approximates the idealized RTS form and provides a more interpretable basis for further processing stages. An example of this transformation from noisy RTS (grey) to denoised RTS (blue) is illustrated in Fig. 1c-h. The moving average (MA) filter is simple and fast32,33, with an adaptive window size W using the relation Inline graphic, where Inline graphic represents the standard deviation of the background white noise, balances noise suppression against transition preservation, though it remains sensitive to noise characteristics. FFT-based denoising operates in the frequency domain by retaining only the top 1% of spectral coefficients, effectively reducing broadband noise but risking artifacts when signal and noise spectra overlap. Finally, the DAEUBL neural model provides strong performance under pink noise and multi-trap scenarios, but at significant computational and memory cost.

Algorithm 1.

Algorithm 1

Adaptive DTCWT Denoising

Beyond adapting the DTCWT for RTS denoising, a central innovation of our approach lies in the development of two autonomous parameter selection rules that automatically determine (i) the optimal number of wavelet decomposition levels and (ii) the threshold value for wavelet coefficient pruning. These parameters are essential for achieving a balance between preserving abrupt switching behavior and effectively suppressing both white and colored background noise. Unlike prior studies31 that relied on manual tuning or trial-and-error, our method eliminates this manual burden by using data-driven heuristics derived from empirical testing across, including synthetic RTS and real RTS from quantum dot measurements, single-photon avalanche diode signals. If the user does not provide these parameters explicitly, the denoiser invokes the internal selection rules, as outlined in Fig. 1. After thresholding, the signal is reconstructed by combining retained features across all decomposition levels, resulting in a clean, denoised signal that maintains step fidelity under diverse noise conditions. These autonomous parameter selection rules serve a similar purpose to previous automation efforts in MA filtering, where the window size was adaptively adjusted based on noise-level estimates19,20.

Since the DTCWT denoising requires both a decomposition level and a threshold value, we conduct an exhaustive grid search over reasonable parameter ranges to identify optimal combinations. This procedure is performed on a small set of representative real RTS measurements for which reference estimates were available, and is not based on the synthetic benchmark data used for evaluation. For each RTS signal, the data are duplicated across the full parameter grid and processed through DTCWT denoising followed by digitization. We then compute the deviation between the extracted and reference mean dwell times (Inline graphic, Inline graphic), and identify parameter regions that jointly minimize errors in both states. From these experiments, we observe strong statistical correlations between the optimal DTCWT parameters and simple measurable properties of the raw RTS signal. Importantly, these relationships depend only on general signal statistics and not on semiconductor-specific operating conditions or device physics. Specifically, we find that the optimal decomposition level K correlates with the signal length L, while the optimal wavelet coefficient threshold T is best predicted by the spectral entropy Inline graphic of the signal. This enables us to derive the following predictive rules using linear regression fits over the grid search results:

  • Inline graphic

  • Inline graphic

Here, Inline graphic is defined as Inline graphic, where N is the total number of frequency bins obtained from Welch’s method, Inline graphic is the power spectral density coefficient at the k-th frequency bin, and Inline graphic corresponding normalized power spectral density of the signal. Spectral entropy was found to provide the most consistent correlation with the optimal threshold among the signal statistics examined. The resulting rules enable automated selection of DTCWT denoising parameters directly from the raw signal, without requiring prior knowledge of the underlying semiconductor system. The automatic selection does not fail in the sense of halting the algorithm or producing undefined outputs; rather, under extreme noise conditions, limitations manifest as increased benchmark errors, reflecting fundamental RTS reconstruction limits shared by all denoising methods. For real experimental RTS signals, where no ground truth exists, these rules provide a principled and reproducible starting point that can be further refined by expert users if desired. For clarity in this study, we refer to our fully automated DTCWT-based denoiser with adaptive selection of K and T simply as DTCWT, distinguishing it from the original transform and from basic denoising implementations that require explicit parameter specification.

Although we benchmark against the DAEUBL approach, our noise injection protocol differs significantly from that in the DAEUBL proposal20, which applies the noise ratio relative to the minimum Inline graphic across all traps, whereas our method uses the maximum Inline graphic from the amplitude span of the full multi-trap RTS signal as the reference for scaling noise. This results in a fundamental difference in how the same nominal Inline graphic or Inline graphic translates into actual noise magnitude in the signal. For instance, consider a signal with three traps where the individual trap steps are: 0.2, 0.3, and 0.4 units. DAEUBL would base its noise scaling on the smallest step (0.1), while our approach uses the full range (0.9 if all traps are on). Therefore, a Inline graphic noise in our system (Inline graphic) would correspond to a 90% noise level under DAEUBL proposal (Inline graphic)20. This discrepancy should be considered when comparing denoising performance metrics across studies, as the apparent signal quality at a given noise ratio is not directly equivalent between reported studies.

After denoising, the next step in the pipeline is to identify the stable amplitude levels present in the signal. For this purpose, we apply KDE, a non-parametric technique used to estimate the probability density function of the signal amplitudes. KDE operates by placing a smooth kernel, typically a Gaussian function, over each data point in the denoised signal, and summing the resulting distributions to form a continuous density function. This approach generates a smooth approximation of the amplitude distribution, which is particularly well-suited for noisy or fluctuation-heavy signals like RTS. Because KDE peak detection is sensitive to bandwidth selection, we use a fixed rule-of-thumb bandwidth to ensure reproducibility across all benchmark signals. Initially, KDE is applied to one-dimensional RTS amplitude samples (Inline graphic), with the signal length L corresponding to the number of samples. While Scott’s rule would prescribe a bandwidth scaling factor of Inline graphic, we deliberately replace this data-dependent factor with a fixed empirical bandwidth (bw Inline graphic) applied uniformly after denoising and amplitude normalization. We found that using a fixed bandwidth yields more stable and interpretable KDE peak structures than bandwidths that shrink with signal length L. The value 0.020 was selected empirically as a conservative smoothing scale that balances suppression of residual fluctuations with preservation of distinct RTS levels. Within a reasonable range around this value, the number, ordering, and prominence of dominant KDE peaks remained stable in our tests. By avoiding signal-length–dependent bandwidth shrinkage inherent to rule-of-thumb methods such as Scott’s rule, this choice ensures consistent peak resolution across signals of varying length L and noise level Inline graphic or Inline graphic without per-signal tuning. Peak identification in the KDE result is performed using prominence-aware peak-finding algorithms. These methods assess not only the height of peaks but also their relative separation from neighboring fluctuations, providing robustness in noisy scenarios. When analyzing known 1-trap signals, it is possible to apply a forced mode to extract exactly two dominant peaks, ensuring that the digitization process focuses on the most physically meaningful states. However, this is not used throughout the entirety of this study to ensure no intervention in executing the protocols, meaning that it is possible to recognize a 1-trap RTS as a multi-trap when too much residual noise is present after denoising. Compared to traditional histograms, which can produce misleading visualizations due to arbitrary bin widths or boundaries, KDE offers a smoother and more faithful representation of the underlying distribution. This is especially useful when identifying discrete RTS levels, as noise and sampling artifacts can otherwise obscure the presence of multiple states. In the context of RTS analysis, KDE enables clearer visualization and quantification of distinct amplitude levels–corresponding to the discrete states of traps. These appear as peaks in the KDE output. For example, a 1-trap RTS typically produces two prominent peaks, corresponding to the high and low states. These peak positions are then used in the digitization step to assign each sample in the signal to its most probable discrete state. As illustrated in Fig. 1c–h, the left subplots show the KDE results of the denoised RTS signals, where the density curves reveal the dominant peaks used to determine trap amplitude levels. The middle subplots then illustrate the progression from denoised RTS (blue) to digitized RTS (red). Unlike a histogram, which may display spurious gaps or bins that obscure these levels, the KDE plot offers a continuous and interpretable view that emphasizes the underlying state levels. This clarity is critical for downstream digitization and for accurate extraction of switching behavior. We emphasize that KDE peak detection is intended to provide an automated and reproducible starting point for RTS analysis, while KDE may not perfectly identify all RTS levels in particularly challenging cases, such as those with extreme noise. These limitations are consistent with fundamental signal reconstruction constraints. In practice, such difficult experimental RTS traces often require expert inspection or manual refinement, which remains standard in experimental workflows. Accordingly, achieving zero error is neither expected nor required for the benchmarking goals of this study, and the impact of KDE uncertainties is explicitly captured by the proposed benchmarking metrics.

Digitization algorithm

In the digitization step, each point in the denoised signal is assigned to discrete amplitude levels detected from KDE peaks, corresponding to distinct multi-trap states (i.e., combinations of individual binary traps). Each continuous-valued amplitude sample is mapped to its nearest KDE peak, effectively converting the signal into a stepwise sequence of discrete levels. These levels represent the most likely trap state configurations at each point in time. The outcomes of this process are illustrated as noisy RTS (gray), denoised RTS (blue), and digitized RTS (red) in Fig. 1c-h. The quality of both the denoising and digitization stages can be qualitatively assessed by comparing the digitized trace against the ground truth RTS (black). In the simplest case of a single trap, the digitized signal reduces to a binary time series alternating between two distinct levels, reflecting the “on” and “off” occupancy states. However, as the number of traps increases, the RTS becomes multi-level: a two-trap system, for example, yields four unique amplitude states corresponding to combinations of the two binary trap configurations (00, 01, 10, 11). In such cases, accurate digitization becomes increasingly challenging due to overlapping levels and closely spaced transitions. Several digitization strategies are employed to address this complexity, each with trade-offs in robustness and computational demand. One of the simplest approaches is proximity-based level assignment, extensively used in prior studies19,20, where each amplitude point is matched to the closest KDE peak. This method is fast and effective for low-noise, single-trap signals, where transitions are well-separated. However, it becomes error-prone in high-noise or multi-trap contexts, where peak overlap and noise artifacts can lead to misclassification, spurious transitions, and distorted dwell statistics. An HMM-based digitizer was also implemented as a variant during earlier stages of development. The main limitation lies in its reliance on well-initialized transition and emission matrices. Without a strong prior or carefully tuned initialization, the randomly generated starting parameters often lead to unstable convergence, poor accuracy, and high variance in results, especially for multi-trap RTS cases. This makes HMM-based digitization unreliable and unrepresentative of the true signal dynamics, especially when applied in a fully autonomous setting without manual tuning. For these reasons, it is excluded from the formal benchmark comparisons. For clarity, proximity-based, HMM-based, and Bayesian digitization strategies differ primarily in their sensitivity to residual noise, reliance on parameter initialization, and computational cost: proximity-based methods are fast but noise-sensitive, HMM-based methods are parameter-dependent and unstable without careful initialization, while the proposed Bayesian digitizer offers a lightweight and fully autonomous alternative suited for fluctuating denoised RTS signals.

Although the DTCWT denoiser is specifically optimized to suppress high-frequency white noise with excellent computational efficiency, it may not fully eliminate residual low-frequency components, such as pink noise, that often persist in RTS signals. To address this, and to further improve upon the capabilities of deep learning-based approaches like the DAEUBL, we introduce a novel Bayesian digitization method designed to complement the DTCWT. In this dual-stage architecture, the DTCWT handles the initial denoising to suppress white noise, while the Bayesian digitizer targets the remaining pink noise, enabling robust state classification across diverse RTS profiles, without neural networks or GPU-intensive processing. The proposed Bayesian digitizer applies probabilistic inference to assign each point in the denoised signal to one of the identified KDE peaks shown in Fig. 2.

Algorithm 2.

Algorithm 2

Bayesian Digitization

Bayesian digitization is formulated as a probabilistic inference problem. Let Inline graphic denote the denoised RTS amplitude at discrete time index Inline graphic. From the denoised signal, we extract a finite set of K discrete amplitude levels Inline graphic, obtained as KDE peak locations. We introduce a latent discrete random variable Inline graphic, which indicates which KDE level is active at time t. We consider the observation model Inline graphic, where Inline graphic represents the marginal residual noise remaining after denoising. Under this model, digitization corresponds to inferring the most likely discrete state Inline graphic given the observed amplitude a(t). We assume that the residual noise Inline graphic is approximately zero-mean Gaussian when considered marginally Inline graphic. This assumption implies that amplitudes close to Inline graphic are more likely to arise from state k, while deviations are penalized exponentially. The variance Inline graphic controls how forgiving the classifier is with respect to deviations from KDE levels. Although RTS noise may exhibit temporal correlation (e.g., pink noise), the Gaussian assumption is applied only to the marginal sample distribution. Temporal stability is handled separately through prior smoothing.

The residual noise standard deviation Inline graphic is estimated directly from the denoised signal by measuring deviations from the nearest KDE level. For each time step, we define Inline graphic, and compute the root-mean-square residual between the signal and its closest KDE level. Then Bayesian inference is performed at each time step. Let Inline graphic denote the prior probability of state k at time t. At initialization, we use a uniform prior Inline graphic, reflecting equal occupancy probability across all KDE levels. Given an observation a(t), the likelihood for each state k is computed using a Gaussian emission model Inline graphic. Applying Bayes’ rule yields the posterior distribution Inline graphic. This update is computed in closed form and requires no iterative optimization. Lastly, a hard digitized label is obtained via maximum a posteriori (MAP) decoding Inline graphic. The selected index corresponds to the digitized KDE level at time t. To suppress spurious switching caused by residual noise and enforce temporal persistence, we update the prior distribution using exponential smoothing Inline graphic, where Inline graphic. This update acts as a lightweight temporal regularizer; rather than learning a full transition matrix as in HMM, we impose a persistence bias that favors remaining in the same state unless successive observations consistently support a transition to a different dwell time in RTS. Because this update is a convex combination of valid probability distributions, it preserves normalization and non-negativity Inline graphic, Inline graphic. Additionally, an optional refinement step using Dempster-Shafer Theory34 can be incorporated, leveraging evidence from a short window of previous classifications to further stabilize predictions, especially in multi-trap signals where amplitude levels are closely spaced.

The Bayesian digitization method is computationally efficient and scalable. Because the posterior is computed in closed form at each time step, the method avoids iterative optimization as in variational Bayes or Markov chain Monte Carlo. The computational complexity scales linearly Inline graphic with signal length L and number of KDE levels K. Compared to HMM-based approaches, which jointly learn emission and transition parameters via expectation-maximization and are sensitive to initialization, our method fixes emission means using data-driven KDE peaks and enforces temporal stability through prior smoothing. Prioritizing robustness, scalability, and fully autonomous analysis of long, high-resolution RTS measurements.

A final key component of RTS analysis involves characterizing the temporal dynamics of trap activity in terms of the dwell time statistics. Each RTS signal contains periods where the trap remains in either a high or low state (Inline graphic and Inline graphic) as illustrated in Fig. 1a. These intervals vary in length and occur frequently, which poses a challenge to concise yet accurate summarization. Statistically, these dwell times follow a Poissonian process, leading to exponentially distributed interval lengths. We model the distribution of dwell times using the function Inline graphic, where x is the observed dwell time, Inline graphic is the mean dwell time (either Inline graphic or Inline graphic), and a is an offset parameter accounting for the shift in the decay (typically negligible in RTS studies). To extract these statistics, we analyze the binary time-series outputs from the digitization stage, where each binary signal represents the activity of an individual trap over time, 1 for the high state and 0 for the low state. The extractor identifies all state transitions in the binary trace and records the durations spent in each state. These durations are compiled into histograms and fitted with the exponential decay model to compute the characteristic means, Inline graphic and Inline graphic.

Four benchmarking models

For the selection of four benchmarking models: MA + Proximity, FFT + Proximity, DAEUBL + Proximity, and our proposed DTCWT + Bayesian, we do not include every possible combination of three denoising and two digitization options due to the following reasons. With scalability constraints, benchmarking all method combinations across large-scale datasets in terabytes is computationally infeasible. Secondly, empirical redundancy, preliminary tests of cross-method combinations (e.g., DAEUBL + Bayesian digitization) showed no meaningful quality gains over proximity-based digitization in those contexts. In fact, using Bayesian digitization after DAEUBL was not beneficial. While DAEUBL produces high-confidence outputs even when RTS levels are biased or distorted, the Bayesian digitizer was found less compatible; its design assumes uncertainty structures typical of DTCWT outputs, which tend to fluctuate around RTS levels. On the other hand, DTCWT + proximity digitization was also not pursued, as proximity-based digitizers inherently lack the capability to handle pink noise, a limitation they share with baseline combinations like MA + Proximity and FFT + Proximity.

For benchmarking fairness, all methods evaluated in this study are blind to the synthetic benchmark data: none of the denoisers or digitizers are optimized, trained, or tuned on the evaluated RTS signals. The adaptive rules used in the proposed DTCWT denoiser are derived independently from a separate set of real RTS measurements and rely only on general signal statistics, as detailed in Denoising and KDE Algorithm. Baseline methods such as MA and FFT employ adaptive parameters consistent with their standard usage in RTS analysis, while DAEUBL is evaluated using its available pretrained model without retraining. To ensure fair comparison, we adopt a unified and more aggressive noise-scaling scheme across all methods, avoiding the use of the original DAEUBL training data, which would otherwise introduce an unfair advantage.

Acknowledgements

We acknowledge HeeBong Yang for providing carbon nanotube film RTS data and the Digital Research Alliance of Canada for computing resources for our calculations.

Author contributions

N.Y.K. conceived the research idea and supervised the whole study. T.B. investigated the data synthesis, denoising, and digitization aspects of the analysis. T.B. wrote the code for synthesizing the datasets, DTCWT denoising, KDE steps, and data analysis. A.K. wrote the code for Bayesian digitization. T.B. performed simulations and collected simulation error statistics. T.B. prepared all figures. T.B. and N.Y.K. wrote the paper.

Funding

This work was supported by Industry Canada, the Ontario Ministry of Research & Innovation through Early Researcher Awards (RE09-068), and the Canada First Research Excellence Fund-Transformative Quantum Technologies (CFREF-TQT).

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Declarations

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Xie, T., Chen, Y., Marszalek, P. & Tsong, T. Fluctuation-driven directional flow in biochemical cycle: further study of electric activation of na, k pumps. Biophysical Journal72, 2496–2502. 10.1016/S0006-3495(97)78894-5 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hansen, M., Desai, R., Simpson, M. & Weinberger, L. Cytoplasmic amplification of transcriptional noise generates substantial cell-to-cell variability. Cell Systems10.1016/j.cels.2018.08.002 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Paschotta, R. Avalanche photodiodes. RP Photonics Encyclopedia, 10.61835/pbn. Available online at https://www.rp-hotonics.com/avalanche_photodiodes.html.
  • 4.Wang, X., Rao, P. R., Mierop, A. & Theuwissen, A. J. Random telegraph signal in cmos image sensor pixels. In 2006 International Electron Devices Meeting10.1109/IEDM.2006.346973 (2006). [Google Scholar]
  • 5.Shaw, M. D., Lutchyn, R. M., Delsing, P. & Echternach, P. M. Kinetics of nonequilibrium quasiparticle tunneling in superconducting charge qubits. Phys. Rev. B78, 024503. 10.1103/PhysRevB.78.024503 (2008). [Google Scholar]
  • 6.Cywiński, L., Lutchyn, R. M., Nave, C. P. & Das Sarma, S. How to enhance dephasing time in superconducting qubits. Phys. Rev. B77, 174509. 10.1103/PhysRevB.77.174509 (2008). [Google Scholar]
  • 7.Guo, S., Wang, R. & Mao, D. e. a. Anomalous random telegraph noise in nanoscale transistors as direct evidence of two metastable states of oxide traps. Scientific Reports 7, 6239 (2017) 10.1038/s41598-017-06467-7 . [DOI] [PMC free article] [PubMed]
  • 8.Amarasinghe, N. V., Çelik Butler, Z., Zlotnicka, A. & Wang, F. Model for random telegraph signals in sub-micron mosfets. Solid-State Electronics47, 1443–1449. 10.1016/S0038-1101(03)00100-X (2003). [Google Scholar]
  • 9.Hung, K., Ko, P., Hu, C. & Cheng, Y. Random telegraph noise of deep-submicrometer mosfets. IEEE Electron Device Letters11, 90–92. 10.1109/55.46938 (1990). [Google Scholar]
  • 10.Wang, X., Rao, P. R., Mierop, A. & Theuwissen, A. J. Random telegraph signal in CMOS image sensor pixels. In 2006 International Electron Devices Meeting1–4, 10.1109/IEDM.2006.346973 (2006).
  • 11.da Silva, R. & Wirth, G. RTS noise in semiconductor devices: time constants estimates and observation window analysis. Journal of Statistical Mechanics: Theory and Experiment2022, 043201. 10.1088/1742-5468/ac5dbf (2022). [Google Scholar]
  • 12.Fleetwood, D. M. Origins of 1/f Noise in Electronic Materials and Devices: A Historical Perspective, 1–31 (Springer International Publishing, Cham, 2020).
  • 13.Vasileiadis, N., Mavropoulis, A., Loukas, P., Sirakoulis, G. C. & Dimitrakis, P. Analysis of random telegraph noise in resistive memories: The case of unstable filaments. Micro and Nano Engineering19, 100205. 10.1016/j.mne.2023.100205 (2023). [Google Scholar]
  • 14.Moshrefi, A., Aghababa, H. & Shoaei, O. Employing the empirical mode decomposition to denoise the random telegraph noise. Engineering Transactions34, 90–96. 10.5829/ije.2021.34.01a.11 (2021). [Google Scholar]
  • 15.Mortazavi, S. H. & Shahrtash, S. M. Comparing denoising performance of dwt, wpt, swt and dt-cwt for partial discharge signals. In 2008 43rd International Universities Power Engineering Conference10.1109/UPEC.2008.4651625 (2008). [Google Scholar]
  • 16.Eddy, S. R. What is a hidden markov model?. Nature Biotechnology22, 1315–1316. 10.1038/nbt1004-1315 (2004). [DOI] [PubMed] [Google Scholar]
  • 17.Değirmenci, A. Introduction to hidden markov models (2015).
  • 18.Jurafsky, D. & Martin, J. H. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition with Language Models, 3rd edn (2024), Online manuscript released August 20, 2024.
  • 19.Robitaille, M., Yang, H., Wang, L., Deng, B. & Kim, N. Y. Deep neural network analysis models for complex random telegraph signals. Scientific Reports (2024). Institute for Quantum Computing, Department of Electrical and Computer Engineering, Waterloo Institute for Nanotechnology, Department of Chemistry, and Perimeter Institute, University of Waterloo.
  • 20.Deng, B., Yang, H. & Kim, N. Y. A denoising autoencoder based on u-net and bidirectional long short-term memory for multi-level random telegraph signal analysis. Engineering Applications of Artificial Intelligence135, 108685. 10.1016/j.engappai.2024.108685 (2024). [Google Scholar]
  • 21.Lambert, N. J., Esmail, A. A., Edwards, M., Ferguson, A. J. & Schwefel, H. G. L. Random telegraph signal analysis with a recurrent neural network. Phys. Rev. E102, 012312. 10.1103/PhysRevE.102.012312 (2020). [DOI] [PubMed] [Google Scholar]
  • 22.Li, X. et al. Machine learning method for accurate analysis of complicated low temperature random telegraph noise. In 2022 International Conference on IC Design and Technology (ICICDT), 20–23, 10.1109/ICICDT56182.2022.9933107 (2022).
  • 23.González-Cordero, G. et al. Neural network based analysis of random telegraph noise in resistive random access memories. Semiconductor Science and Technology35, 10.1088/1361-6641/ab6103 (2020).
  • 24.Varanasi, A., Degraeve, R., Roussel, P. & Merckling, C. Rtninja: a generalized machine learning framework for analyzing random telegraph noise signals in nanoelectronic devices (2025). arxiv: 2507.08424.
  • 25.Kerski, J. et al. Post-processing of real-time quantum event measurements for an optimal bandwidth. Scientific Reports 13, 1105 (2023). Faculty of Physics and CENIDE, University of Duisburg-Essen. 2nd Institute of Physics, RWTH Aachen University. Chair for Applied Solid State Physics, Ruhr-Universität Bochum. [DOI] [PMC free article] [PubMed]
  • 26.Awano, H., Tsutsui, H., Ochi, H. & Sato, T. Bayesian estimation of multi-trap rtn parameters using markov chain monte carlo method. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences E95.A, 2272–2283, https://www.jstage.jst.go.jp/article/transfun/E95.A/12/E95.A_2272/_article (2012).
  • 27.Chen, X. et al. Modeling random telegraph noise as a randomness source and its application in true random number generation. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems35, 1–1. 10.1109/TCAD.2015.2511074 (2015). [Google Scholar]
  • 28.Rajaratnam, B. & Sparks, D. Mcmc-based inference in the era of big data: A fundamental analysis of the convergence complexity of high-dimensional chains (2015). arxiv: 1508.00947.
  • 29.Craiu, R. V., Gustafson, P. & Rosenthal, J. S. Reflections on bayesian inference and markov chain monte carlo. Canadian Journal of Statistics50, 1213–1227. 10.1002/cjs.11707 (2022). [Google Scholar]
  • 30.Principato, F. & Ferrante, G. 1/f noise decomposition in random telegraph signals using the wavelet transform. Physica A: Statistical Mechanics and its Applications380, 75–97. 10.1016/j.physa.2007.02.111 (2007). [Google Scholar]
  • 31.Bai, T. Towards a Robust Framework for Analyzing Random Telegraph Signals (RTS): Application to 2-level RTS in a Semiconductor Quantum Dot. Master’s thesis, University of Waterloo, 200 University Ave. West, Waterloo, N2L 3G1, Ontario, Canada (2024). Department of Electrical and Computer Engineering, Institute for Quantum Computing.
  • 32.Vasileiadis, N., Mavropoulis, A., Loukas, P., Sirakoulis, G. C. & Dimitrakis, P. Analysis of random telegraph noise in resistive memories: The case of unstable filaments. Micro and Nano Engineering19, 100205. 10.1016/j.mne.2023.100205 (2023). [Google Scholar]
  • 33.Gauthier, O. et al. Enhanced statistical detection of random telegraph noise in frequency and time domain. Solid-State Electronics194, 108320. 10.1016/j.sse.2022.108320 (2022). [Google Scholar]
  • 34.Sentz, K. & Ferson, S. Combination of evidence in dempster-shafer theory. Tech. Rep. SAND2002-0835, Sandia National Laboratories, Albuquerque, New Mexico and Livermore, California (2002). Approved for public release; further dissemination unlimited. Prepared under Contract DE-AC04-94AL85000.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES