Abstract
Analysis of patchclamp recordings is often a challenging issue. We give practical guidance how such recordings can be analyzed using the model-free multiscale idealization methodology JSMURF, JULES, and HILDE. We provide an operational manual how to use the accompanying software available as an R-package and as a graphical user interface. This includes selection of the right approach and tuning of parameters. We also discuss advantages and disadvantages of model-free approaches in comparison to hidden Markov model approaches and explain how they complement each other.
Supplementary Information
The online version contains supplementary material available at 10.1007/s00249-021-01506-8.
Keywords: Deconvolution, Flickering, Fully automatic, Hidden Markov models, Homogeneous and heterogeneous noise, Ion channel recordings, Low-pass filtering, Open-channel noise, PorB, Subconductance states
Introduction
The patchclamp technique has been and still is a fundamental tool for the quantitative analysis of electrophysiological processes of transmembrane proteins, in particular of ion channels (Neher and Sakmann 1976; Sakmann and Neher 1995). A detailed understanding of the dynamics of transmembrane proteins and their manifold interactions with their surrounding is of high importance in medicine and biochemistry, for instance for the development of new drugs (Kass 2005; Overington et al. 2006). However, most electrophysiologists will agree that conducting patchclamp experiments, but also the analysis of their recordings is a challenging issue, and the latter is far from being a routine data analysis in general (Sivilotti and Colquhoun 2016). In this work, we provide practical guidance on how to analyze such recordings. We focus mainly on model-free multiscale idealizations (explained below), which we have developed over the last decade.
Patchclamp recordings The patchclamp technique allows one to measure the conductance of a channel (i.e., the recorded current divided by the applied voltage) over time. An example is given in Fig. 1. It shows a recording of the outer membrane porin PorB from Neisseria meningitidis, a pathogenic bacterium in the human nose and throat region (Virji 2009). PorB is a trimeric porin and the second most abundant protein in the outer membrane of Neisseria meningitidis. The added antibiotic ampicillin blocks the ion flow for short periods of time which allows one to draw conclusions about the transport of antibiotics into the cell, which is relevant for the understanding of antibiotic resistances. For further details, see (Bartsch et al. 2019, 2020). In addition, we will also use a PorB dataset without ampicillin (Fig. 6) and a Gramicidin A dataset (Fig. 4) throughout this work as illustrating examples.1
Idealization Important dynamics such as the number of conductance levels, their values, and how long each level persists can be examined provided the conductance recordings (data points) are properly idealized (Colquhoun 1987; Sakmann and Neher 1995), i.e., the conductance trace over time (the underlying signal) is accurately reconstructed (estimated, denoised).
An idealization can either be obtained model free2, i.e., without prior assumptions about the gating dynamics, or in a model-based way by assuming an underlying statistical (parametric) model with a few parameters for the gating dynamics. For the latter, most commonly hidden Markov models (HMMs) are used, see (Ball and Rice 1992) for an early reference, where parameters correspond to states, transition probabilities, and noise characteristics.
Filtering The noise before filtering is often assumed to be Gaussian white noise. However, low-pass filters are usually integrated in the hardware of the measurement device to stay in the transmission range of the amplifier. Such filtering introduces colored noise and smooths the underlying conductance, see Fig. 12 in “Review of existing model-free idealization approaches”. Ignoring filtering typically results in the detection of false positives (additional wrong events). This is illustrated in Fig. 2, where we used (Frick et al. 2014), a multiscale method that does not include filtering, but is otherwise similar in spirit to our model-free idealization approaches, to be explained later. Filtering especially affects short temporal scales (at and below the magnitude of the filter length, say) and is therefore particularly relevant to the analysis of short events, also called flickering.
Flickering and subgating Flickering typically has its own dynamics and can result from various molecular processes like conformational changes of the protein (Grosse et al. 2014) or by the passage of larger molecules blocking the ions’ pathway through the protein (Raj Singh et al. 2012). An example for the latter is the PorB analysis in Figs. 1 and 3. A second potential challenge in the analysis is subconductance states (Fox 1987), meaning that two or more conductance levels are close to each other, as illustrated in Figs. 4 and 5.
Model-free idealizations In this paper, we discuss mainly our model-free idealization methods (Hotz et al. 2013), (Pein et al. 2018), and (Pein et al. 2021), which are primarily designed as versatile tools to analyze patchclamp recordings in a multiscale fashion, for instance to deal well with subconductance states and flickering. Due to their multiscale character, they act on various temporal scales simultaneously and hence are able to idealize events of different lengths well in a single step. Moreover, all parameters, i.e., locations of conductance changes and conductance levels, are obtained by (local) deconvolution, and hence, they take into account low-pass filtering explicitly. Furthermore, all three approaches control the overestimation of the number of conductance changes. More precisely, the probability to detect at least one false positive is bounded approximately by the error level , a tuning parameter. A more-detailed review of these and further model-free idealization approaches is given in “Review of existing model-free idealization approaches”.
All three methods can be used when homogeneous noise is assumed (i.e., the error variability does not change over time, see “Models” for details), but and in addition allow for heterogeneous noise. The latter means that different parts of the data, for instance different states, may have different noise levels, as for instance caused by open-channel noise, i.e., larger noise on segments with a larger conductance. Moreover, requires that events are slightly longer, while or are able to deal with flickering (short events) at the possible expense of longer computational time. Table 1 summarizes for which datasets which one of them is most suitable and “Choosing the right method” explains those choices in full detail.
Table 1.
Homogeneous noise | Heterogeneous noise | |
---|---|---|
No relevant short events exist | JSMURF (homogeneous noise) | JSMURF (heterogeneous noise) |
Relevant short events exist | JULES or HILDE (homogeneous noise) | HILDE (heterogeneous noise) |
For more details, see “Choosing the right method”
Software , , and are implemented as R functions in the package clampSeg (Pein and Aspelmeier 2020). In “Using our software”, use of those methods is demonstrated. They can be combined with the packages readABF (Syekirin and Pein 2020) to load recordings, and lowpassFilter (Pein et al. 2020) for certain data processing steps around filtering such as computing the convolution of an idealization with the kernel of a low-pass filter.
Alternatively, a graphical user interface, available at https://github.com/FlorianPein/clampSegGUI together with detailed manuals on how to install it and on how to use it, allows access without requiring any R or other programming knowledge. The idealizations can be visualized in the interface, but also saved as csv files and hence postprocessed by any other program.
Interplay between model-free idealizations and hidden Markov models There is wide agreement that, except in few counterexamples (Fuliński et al. 1998; Mercik and Weron 2001; Goychuk et al. 2005; Shelley et al. 2010), the gating dynamics of ion and many other channels are usually Markovian. Hence, hidden Markov model (HMM)-based approaches are widely used to analyze patchclamp recordings. However, assuming an HMM is not only saying that the hidden states follow a Markov model, it also fixes a data generating process conditioned on the hidden states. From our own experience, we stress that this second step is usually the critical part of the assumption of an HMM. Standard (homogeneous) HMMs, where observations conditioned on the hidden Markov states are modeled as independent Gaussian observations with state-dependent expectations and variances, are often violated and commonly lead to invalid reconstructions. This is because of artifacts, which are, for instance, caused by the electronics, external vibrations, or small holes in the membrane, or because of additional high-frequency (violet) and long-tailed 1/f (pink) noise components, see for instance (Neher and Sakmann 1976; Venkataramanan et al. 1998; Levis and Rae 1993). Hence, HMM-based analyses often rely on intensive preprocessing or on more complicated models: for instance, (Venkataramanan et al. 1998) assumed an HMM that allows additional colored noise, and (Diehn et al. 2019) provided modifications to incorporate inhomogeneous errors. Moreover, low-pass filtering often requires further, computationally demanding extensions, see for instance (Venkataramanan et al. 1998; de Gunst et al. 2001; Diehn 2017; Almanjahie et al. 2019).
In contrast, model-free idealizations do not assume a specific (parametric) model for the gating dynamics and immediately provide an idealization without such an assumption. Moreover, they usually act rather locally on the data, i.e., at every location, the idealization is not influenced significantly by observations far away. Thus, they are typically more robust to artifacts and hence require often no or less pre-processing. Contrarily, HMM-based approaches have (potentially) a finer time resolution and provide more concise results. A more-detailed review of HMM-based analyses, their advantages and disadvantages in comparison to model-free approaches, and their interplay is given in “Hidden Markov Models”.
Model-free idealizations allow a flexible analysis of the number of conductance levels, and their values and which transitions are possible. To this end, one has to cluster the estimated conductance values, e.g., by fitting a Gaussian mixture distribution and assigning each value to the nearest mean value. The outcome will then have only a small number of conductance levels. This can be used to select and verify a Markov model and to estimate its parameters, which often requires taking into account missing of short events. Further details and tools which can be used for those steps are described in “Analysis of patchclamp recordings”. Finally, model-free idealizations can be used to assist HMM-based approaches in any of their analysis steps, e.g., they can be used to remove artifacts, to select and verify a specific Markov model, to provide starting values for iterative procedures such as the Baum–Welch algorithm, and to verify estimated parameters and the provided idealization.
All in all, model-free and HMM approaches have different strengths and weaknesses and hence should less be seen as competing approaches, but rather as tools that benefit from and complement each other. In fact, as an indication of a proper data analysis, it can be checked whether the results of model-free and HMM-based analyses are in compliance.
Organization of this work In Model-free idealizations, we give detailed instructions how to use our methods to obtain model-free idealizations. In “Models”, we provide details of the statistical models underlying the presented model-free idealization methodology. In Review of tools to analyze patchclamp recordings, we review in more detail existing approaches for the analysis of patchclamp recordings. We start in “Hidden Markov Models” with a review of HMM-based methodology, their advantages and disadvantages in comparison to model-free approaches, and the interplay of model-free idealization with them. Afterwards, in “Review of existing model-free idealization approaches”, we review some existing model-free idealization methods, with a particular focus on our approaches , , and . This is complemented by a brief summary of simulation results. Finally, in “Analysis of patchclamp recordings”, we discuss how (model-free) idealizations can be used to analyze patchclamp recordings. The paper concludes with a discussion in Discussion, in which we highlight open research questions.
Model-free idealizations
This section provides a comprehensive guide on how to use our methods , , and , which have different strengths and weaknesses depending on certain structural features of the measured data. We explain in detail how to use our software (Using our software) and provide guidance for which method is preferable in which situations (“Choosing the right method”).
Using our software
For this section, we use R code3. A tutorial similar to the one in this section is available in the supplement as a zip file. It contains R code, resulting figures, and the obtained fits. Hence, users are able to test whether they obtain the same results. We start by describing how recordings can be loaded, how the low-pass filter can be specified, and how our methods can be called. To this end, we require the R packages readABF (Syekirin and Pein 2020), lowpassFilter (Pein et al. 2020), and clampSeg (Pein and Aspelmeier 2020). All three packages are available on CRAN4. For users who are not familiar with R, we also provide a graphical user interface5, which contains detailed manuals on how to install it and on how to use it. The current guide is also available in the supplement.
Loading the data Patchclamp recordings are typically stored as abf files. The readABF package allows one to read such files in R. After the data set is loaded by calling readABF, we use as.data.frame to transform the data into a data.frame with two columns: time and conductance (the current divided by the voltage channel). We stress that this call is data set specific, since common measuring devices have a wide range of different formats and offer some freedom which channels are recorded. Additionally, users might want to work with the current instead of the conductance. Those options are described in the help file of as.data.frame.
Our idealization methods but also any other approach should not be used as a black box. We strongly recommend to start with an empirical and visual data analysis to gain understanding of the datasets and major features that can be used to direct further analysis. In alignment with the underlying multiscale philosophy of our methods, we recommend always to plot on various temporal scales. See Fig. 1 for an example of such plots on three different scales ranging from a minute to milliseconds. Moreover, histograms of the raw data (point amplitude histograms), see for instance Fig. 13a and for code the paragraph ’Interpreting, plotting and verification of the output’ below, are helpful visual cues. As detailed below, this can already help to decide whether the noise is homogeneous or heterogeneous and whether short events occur in the dataset. Moreover, we recommend to identify potential artifacts that might disturb analysis and interpretation. However, we have found that model-free idealizations are usually quite robust to artifacts. Hence, our default suggestion is first to apply the idealization methods on the unmodified dataset and to decide later whether artifacts require a more careful analysis.
Low-pass filter Our methodology requires to specify correctly the low-pass filter in the measurement device. The type, often a Bessel filter with an even number of poles, should be specified in the hardware documentation. The sampling rate and cut-off frequency can typically be varied by the user. In our example (Fig. 1), the recordings were sampled at 50,000 Hz and low-pass filtered by a 4-pole Bessel filter with normalized cut-off frequency of 0.1 (5,000 Hz cut-off frequency in time domain).
For simplification and since the error is negligible, we truncate the kernel of the low-pass filter after m data points, for sufficiently large m. As a working rule, we choose m, such that the autocorrelation function of the untruncated analogue low-pass filter is below afterwards, which leads for instance to in the example above.
This is implemented in the function lowpassFilter in the package lowpassFilter (Pein et al. 2020) (currently only Bessel filters are supported). The following code creates the filter object.
We strongly recommend to verify that the filter is correctly specified by zooming into single events and checking whether the obtained idealization convolved with the low-pass filter fits the observations well. This is detailed in paragraph ’Interpreting, verification and storing of the output’ below, where we discuss in more generality how to assess the quality of an obtained idealization. Additionally, one can compare the auto-correlation resulting from the filter, filter$acf, with the estimated auto-correlation of the recordings. To this end, one can either apply standard time-series estimators, as, for instance, offered by the acf function in R, to long segments without conductance changes or use the robust difference-based estimators of (Tecuapetla-Gómez and Munk 2017) on the raw data, available in the R-package dbacf6.
Obtaining an idealization , , and are available in the package clampSeg. All three functions can be applied when homogeneous noise is assumed, but only and allow for heterogeneous noise. The following code illustrates how to call those functions depending on whether the noise is homogeneous or heterogeneous. See “Choosing the right method” for guidance which method and which noise option should be chosen to idealize a given measurement.
The followings paragraphs discuss run time, required Monte Carlo simulations, the output of the approaches, and how to proceed with it. Furthermore, we explain a potentially occurring warning and how to choose tuning parameters, e.g., r and alpha.
Run time and Monte Carlo simulations The run time of all approaches depends on the size of the dataset, but also on the number of detected events. The primary reason are Monte Carlo simulations which are required to obtain critical values that balance the probabilities of detection of true events and of false positives. Monte–Carlo simulations depend on the number of data points and on the low-pass filter. Hence, a new Monte Carlo simulation is required when new values for those parameters occur or when more repetitions are requested. Depending on the number of data points and the total number of repetitions r, Monte Carlo simulations may take long, even up to several hours. Hence, we store and load their results, such that they have to be performed only once and the run time will be much smaller when an idealization with the same parameter is computed. They are fully automatically stored in the workspace and on the disk of the local computing machine; for more details, see the documentation of the function getCritVal in the package clampSeg (Pein and Aspelmeier 2020). To keep track of the progress of a Monte Carlo simulation, one can set the argument messages to a positive integer value m to print a message every m repetitions.
While a larger number of repetitions increases the run time of the simulations, it also reduces statistical errors in the computation of the critical values. For a final analysis, we recommend to use the default values, 10,000 for and and 1,000 for . For a quick analysis, for instance to decide whether further measurements or analyses are required, few hundreds up to 1,000 repetitions usually suffice.
Additionally, also the main computation of the idealization can take some time, usually between few seconds and few minutes, depending on the used idealization method, on the size of the dataset and on the number of detected events. Usually, the run time increases with the complexity of the idealization approach, is the fastest, and the slowest. A situation which is computationally particularly demanding is displayed in Fig. 9 (see below). detects almost no events. Due to internals in the dynamic programming algorithm, this causes a considerably long-run time, in this example of roughly half an hour. We stress that uses as an initial step and hence also is slow in such a situation, though it detects many events as it is able to resolve events on smaller temporal scales at and below the magnitude of the filter length.
Interpreting, plotting, and verification of the output All shown idealization methods return an object of the classes stepblock and localDeconvolution. We omit the exact structure of it (and refer to the man files of the called functions), but demonstrate important ways how to proceed with such an object. First of all, the idealization can be plotted using standard functions in R. Furthermore, the convolution of the idealization with the kernel of the low-pass filter can be computed using the function getConvolution in the package lowpassFilter. The following code demonstrates how to do so. It provides the lower left panel in Fig. 3.
In Fig. 3, we found that the convolution fits the recorded observations well, which is a confirmation for our idealization, but also for a correct specification of the model, in particular of the underlying low-pass filter. We always recommend such a graphical inspection to evaluate the quality of the idealization. If the idealization is not sufficiently good, one might modify tuning parameters (see the paragraph below), try a different idealization method (see “Choosing the right method”), remove artifacts, or seek to improve the quality of the recordings.
Obtaining a model-free idealization is usually only one step in a data analysis. In “Analysis of patchclamp recordings”, we discuss typical follow-up steps. The idealized conductance values and the start and end times of the segments are given in fit$values, fit$leftEnd, and fit$rightEnd, respectively. For instance, the following code creates histograms of the raw data, often called point amplitude histogram, of the idealized conductance levels, often called event histogram, and of the amplitudes, i.e., of the differences between the idealized conductance levels. Examples are given in Fig. 13. We use the half sample mode (Robertson and Cryer 1974), implemented in the R-package modeest, to determine the underlying conductance levels, see the paragraph ’Analysis of the conductance levels’ in “Analysis of patchclamp recordings” for further discussion.
Warning Users may experience a warning saying “at least one segment could not be deconvolved since two successive short segments occurred”. This is caused by the fact that the deconvolution approach incorporated in our methods can only deal with single changes or with isolated peaks (two changes in quick succession but separated by few more observations from other events). Obtaining a deconvolution for three or more changes in quick succession is complicated and time-consuming, and hence, we decided to ignore such events when applying a deconvolution, but to mark them in attr(fit, "noDeconvolution"). For a further analysis, we usually recommend to ignore such events as they might even indicate artifacts. This can be done by setting all marked values to NA.
If too many segments are marked and they appear to be important for the given dataset, we cannot recommend to use our approaches, in this situation of extreme/high flickering a better alternative might be approaches based on conductance distribution fitting; for further details, see our review in “Hidden Markov Models”.
Storing of the output To allow proceeding in a different program, one can store the idealization for instance in a csv file as demonstrated by the following code. Note that we also remove the first and last segment, since their true start and end, respectively, cannot be identified by the data.
Tuning parameters All three methods have multiple parameters which can be tuned to adapt to particular needs. Nonetheless, it is advisable to leave them unchanged unless specific reasons exists. All parameters are described in the man files of the called functions and in the referenced papers. Hence, in the following, we will only give a brief overview about the most important ones. Further details are also provided in the review of our idealization approaches in “Review of existing model-free idealization approaches”.
The choice of the number of repetitions of the Monte Carlo simulations, the argument r, was already discussed above in paragraph ’Run time and Monte Carlo simulations’. The parameters alpha, alpha1, and alpha2 are error levels that bound approximately the probability of detecting one or more false positives (under the idealized scenario that the observations follow exactly the assumed model). As a default choice, we suggest . Larger values increase the chance to detect true events, but also to detect more false positives. One may use larger values to ’screen’ if important events are difficult to detect.
For , the error level is split between the multiscale criterion of (error level ) and the local tests (error level ). As default values, we suggest and , since the focus of is typically on detecting short events primarily, while events on larger scales are often easier to detect. More weight can be put on if either short events are of less interest or if long events are difficult to detect, as well, e.g., since they have a smaller jump size than the short events, for instance because of subconductance states. requires to specify the largest scale , this value should be chosen, such that all events on larger scales are reliably detected by . If required, this can be tested by applying or by Monte Carlo simulations. In our R code, see the example code in the paragraph ’Obtaining an idealization’ above, one can specify the largest scale by setting lengths . Note that the R code offers the additional flexibility to omit some scales below . This can be used to save run time or to increase slightly the detection power on the remaining scales.
Choosing the right method
A guide which method to use is given in Table 1. The two main criteria are whether the noise is homogeneous or heterogeneous and whether short events are present and relevant. Recall that , , and are all suitable when one assumes homogeneous noise, but only and allow for heterogeneous noise. Moreover, and are designed to deal with short events, while requires that events are slightly longer. Because of run time and precision, we generally recommend to use the simplest approach that is suitable for a dataset. Unless the dataset demands otherwise, we recommend over over and a homogeneous over a heterogeneous noise setting.
Visual inspection Homogeneous noise means that the noise distribution is the same at all times and for all conductance levels; otherwise, the noise is called heterogeneous. Heterogeneous noise is often clearly visible by naked eye, as in Fig. 6 where the noise level is higher for the higher conductance level, whence one should use either or with heterogeneous noise setting. In most cases, if heterogeneous noise is not clearly visible, approaches that assume homogeneous noise are suitable.
A short event is defined by two conductance changes in quick succession, e.g., a channel opening only very briefly before closing again. should be used if it is not expected to miss relevant short events. Which events are too short depends not only on the absolute length, but also on the magnitude of the conductance change, noise levels, filtering, and tuning parameters. At least, events shorter than filter length will certainly be missed by . Figure 1 shows such an example, whence one should use either or .
Empirical comparison If visual inspection is not sufficient, we suggest the following empirical procedure. The user should apply all potentially suitable methods to a small excerpt of the data and decide which leads to the best idealization. In general, if the idealizations are similar, the simpler approach should be preferred.
To illustrate the procedure for short events, consider Fig. 3 where we see that detects a large number of short events. In comparison, we see in Fig. 9 that is not able to detect those events and hence is unsuitable for this dataset. In this case, appears to be more suitable. Contrarily, Fig. 5 demonstrates that is very suitable to idealize the Gramicidin dataset, where no short events occur, but events with small conductance changes, while struggles to detect all of them, as seen in Fig. 10, since it also searches for short events and hence has slightly less power on larger temporal scales.
To illustrate the procedure for heterogeneous noise, we idealized the observations in Fig. 6, which have visibly heterogeneous noise, with , which is designed to deal with heterogeneous noise. Results are displayed in Fig. 7. For comparison, an idealization by , assuming homogeneous noise, is displayed in Fig. 8. We see that detects many additional events in the open state, which has higher noise level, and while it is able to detect the short events, the fit is visibly worse than the fit by .
To decide whether the noise is heterogeneous, we recommend to more advanced users also the following systematic approach: if longer segments without gating events are present, one can use them to estimate the noise level. Alternatively, one can idealize the data with or with heterogeneous noise setting and use the idealization to determine noise levels as detailed in (Pein et al. 2021, Section VI-C).
Finally, if homogeneous noise is assumed and short events are relevant, we usually recommend to use instead of as it is simpler and faster. Only if events are very short, such as in Fig. 3, should be used as it detects such events more likely.
Models
In this section, we explain the statistical models underlying our methodology. For more details, see (Hotz et al. 2013; Pein et al. 2018, 2021).
We assume that the recorded data (the measured conductance at time points , equidistantly sampled at rate ) result from a conductance f perturbed by a centered Gaussian white noise process . The noise is scaled by the noise level . Furthermore, conductance and noise are convolved with an analogue low-pass filter, with (truncated) kernel . Hence, after digitization at sampling rate , we obtain:
1 |
with the convolution operator. Here, n denotes the total number of data points (typically several hundred thousands up to few millions). Hence, the resulting errors are Gaussian and centered, but correlated (colored noise).
The conductance f is assumed to be piecewise constant with potentially many different (unknown) segments of (unknown) length and size. The noise can either be homogeneous, i.e., the noise level does not vary over time, or heterogeneous. In the latter case, we assume the noise level to be an unknown piecewise constant function with potential jumps at the locations where the conductance changes, since changes of the noise level also depend on gating events7. More precisely, we model the conductance f and the noise level by:
2 |
where t denotes physical time. The (unknown) conductance levels are denoted as , the (unknown) noise levels as , the (unknown) number of gating event as K, and the (unknown) locations of the gating events as . We stress that the class of signals in (2) is very flexible as potentially any arbitrary number of gating events at arbitrary conductance levels and arbitrary noise levels can be imposed, see Fig. 3 for an example.
Review of tools to analyze patchclamp recordings
In this section, we give a review about methods for the analysis of patchclamp recordings. We start in Hidden Markov Models with a HMM-based analysis and discuss also their interplay with model-free idealizations as well as their advantages and disadvantages in comparison to model-free approaches. The analysis by and the interplay between the different approaches is also illustrated in Fig. 11. Second, we review existing model-free idealization methods in Review of existing model-free idealization approaches. Finally, we discuss in Analysis of patchclamp recordings how idealizations can be used to analyze patchclamp recordings. Given the large amount of different methodology, we are by far not able to give a full review. The following only intends to summarize major ideas to help the reader to put , , and in the right context.
Hidden Markov models
HMM-based analysis We limit our discussion mostly to homogeneous HMMs, which means that the parameters, which describe state transition properties and noise distribution, are constant in time. Inhomogeneous HMMs, see, for instance, (Diehn et al. 2019), are rarely used, as they are computationally more challenging and theoretical guarantees for parameter estimates are much harder to prove. As already discussed in the introduction, the assumption of a homogeneous Markov chain underlying the gating dynamics is almost always appropriate, but the assumption of a homogeneous error distribution to obtain a homogeneous HMM is more critical, since, e.g., because of artifacts, often intensive data cleaning or more complicated models are required. We stress that the quality of an HMM-based analysis crucially depends on the stringent modeling assumption given by a HMM.
Obtaining an idealization by an HMM proceeds in several steps (illustrated in the right-hand side of Fig. 11): First, a specific hidden Markov model has to be selected and ideally verified. This includes to find a Markov model for the gating dynamics, e.g., to fix the number of states and which transitions are possible. Note, that often multiple Markov states are required for one conductance level, e.g., to accommodate different noise levels or dwell times. Though data-driven model-selection tools are available, see, e.g., (Gassiat and Keribin 2000; Gassiat and Boucheron 2003; Celeux and Durand 2008; Chambaz et al. 2009; Lehéricy 2019) and the references therein, this is often done manually by an empirical data analysis or by repeating the steps below until results are satisfying, which can be time-consuming and introduces subjectivity.
As soon as a specific HMM is selected, parameters of the Markov model can either be estimated by the Baum–Welch algorithm, see (Venkataramanan et al. 2000; Qin et al. 2000), by Bayesian approaches, in particular MCMC sampling, see (de Gunst et al. 2001; Siekmann et al. 2011), or by approaches based on the conductance (current) distribution, see (Yellen 1984; Heinemann and Sigworth 1991; Schroeder 2015) and the references therein.
Finally, an idealization can be obtained by the Viterbi algorithm (Viterbi 1967) or by Bayesian methods, in particular particle filtering, see (Fearnhead and Künsch 2018) and the references therein. Recently, a deep neural network approach has been proposed (Celik et al. 2020), which skips the parameter estimation step and directly obtains an idealization. This approach can be seen as a hybrid method in between parametric and model-free approaches. It does not require a specific HMM to obtain an idealization, but training in advance is required, which was done by assuming classes of hidden Markov models with hyperparameters.
Once an idealization is obtained, it can be used in reverse to estimate the parameters of the Markov model. We postpone details to Analysis of patchclamp recordings, since one proceeds as for model-free idealization. Using a layered HMM on simulated filtered signals (Pein et al. 2021, Sect. IV–D) as well as in real data applications (Pein et al. 2021, Sect. V) (Bartsch et al. 2019), we observed that the thus estimated parameters were significantly better than the parameters obtained directly by a Baum–Welch algorithm, most likely because of the applied missed event correction.
Interplay As we will demonstrate in Analysis of patchclamp recordings, model-free idealizations allow a standalone analysis of patchclamp recordings. Moreover, they can assist an HMM-based analysis in various forms (illustrated in Fig. 11): model-free idealization can help to identify and remove artifacts, we used for instance in (Bartsch et al. 2019) to assists an HMM-based analysis in that way. They can be used to determine the number of conductance levels (paragraph ’Analysis of the conductance levels’ in Analysis of patchclamp recordings) and help select and verify a specific Markov model (paragraph ’Selection and verification of a Markov model’ in Analysis of patchclamp recordings). Furthermore, most HMM-based parameter estimation approaches are iterative procedures which require starting values. Those are particularly crucial when the procedure converges to a local optimum only. Such starting values can be provided by previously obtained values using model-free idealizations. Finally, model-free idealizations and the resulting parameter estimates using a missed event correction can be used to verify HMM-based idealization and parameter estimates, and vice versa. This is particularly valuable as they have different strengths and weaknesses as outlined in the following paragraph.
We also note that the local deconvolution approach used in our model-free idealization methods, see (Pein et al. 2018), can be used to improve HMM-based idealizations, obtained, for instance, by a Viterbi algorithm, as our approach not only takes into account explicitly the filtering, but is also time-continuous. It only relies on a prior fit that fixes the number of conductance changes and their rough locations. It can be called by the function deconvolveLocally in the package clampSeg.
HMM versus model-free idealization: compared and contrasted In general, HMM-based approaches achieve a higher temporal resolution of gating dynamics because of their stronger assumptions. Hence, parameter estimates might be more accurate as they rely on more detected events. Moreover, HMMs allow for immediate parameter estimation and interpretation, which is often the main goal of an analysis. And since the HMM state space is fixed in advance, the idealization immediately assigns every time point to one of the states. In contrast, model-free idealizations often have to be postprocessed, (e.g., by clustering or thresholding) to identify discrete states, because conductance levels are determined freely.
On the other hand, there are several disadvantages, some of which are closely entangled with the advantages. As discussed above, the need for often extensive preprocessing adds subjectivity and also more potential sources of data analysis errors. In contrast, in such situations, model-free methods may right away provide a reasonable idealization as they can potentially handle inhomogeneity in a more flexible way, in particular those which act locally on the dataset. The state space and a model for the noise must be fixed in advance, thereby strongly limiting the possible results. Model selection always has a subjective component and can lead to a flawed idealization, for example by inadvertently modeling two states with similar but subtly different conductance or noise levels as only one state, or by prescribing an unsuitable noise model which can lead to detection of spurious state changes. Within an HMM framework, one can only incompletely determine whether the data are compatible with the underlying model assumptions. Hence, despite the above described advantages, at least in simulations and real data examples in Pein et al. (2021); Bartsch et al. (2019), we observed that parameter estimates based on an idealization (either obtained by model-free approaches or by the Viterbi algorithm) appear to be more accurate than direct estimates by the Baum–Welch algorithm. Gating dynamics are time-continuous processes, but for simplification, many HMM approaches underlie a time-discrete Markov chain as an approximation. A time-discrete approximation is also implied by most model-free approaches as they allow gating events only at the sampling points. An exception is the local deconvolution approach used in Pein et al. (2018, 2021).
Some of the subjectivity and other problems in HMM modeling can be mitigated by conducting a model-free idealization to inform preprocessing and model selection. In summary, HMM-based and model-free approaches can (and should) be used to complement each other to verify each other’s results. Artifacts and missed events might be reasons for some differences, but otherwise results should be similar.
Review of existing model-free idealization approaches
Many analyses are still performed by visual inspection, often with manually chosen event times or in a semi-automatic way, for instance by amplitude thresholding (Colquhoun 1987; Sakmann and Neher 1995), as e.g., offered by pCLAMP 10 software (Molecular Devices), or by the semi-automatic SCAN software (Colquhoun and Sigworth 1995) which allows time-course fitting. Hence, those approaches are typically time-consuming and subjective. Moreover, approaches which are based on additional filtering (often by low-pass Gauss filters) aggravate detection of small events. A first approach for a fully automatic idealization was slope thresholding (Basseville and Benveniste 1983); for instance, (VanDongen 1996). Recently, Gnanasambandam et al. (2017) proposed idealizations based on the minimal description length (). All of them (except the semi-automatic SCAN software) ignore low-pass filtering and hence may have difficulties to idealize events correctly on small temporal scales. Furthermore, if events are present on multiple scales (recall Figs. 1, 4, 6), uniscale thresholding procedures will usually fail.
As mentioned in the introduction, (Hotz et al. 2013), (Pein et al. 2018), and (Pein et al. 2021) are multiscale procedures combined with local deconvolution and hence take into account both issues. Consequently, they provide usually more accurate results as demonstrated in simulations and real data applications. As described in the following, they mostly differ in how they take into account the filter when detecting events and hence whether they are suitable to detect short events, but also whether they incorporate the possibility to allow for heterogeneous noise. To understand the methodology better, it is illustrative to plot the convolution of a single gating event and single peaks with the kernel of a low-pass filter, see Fig. 12. We stress that for the short event displayed in Fig. 12b, the filtered signal does not reach the lower conductance level of the original signal. This is generally the case for peaks shorter than the filter length . Hence, if such short events are present, deconvolution techniques are indispensable to idealize those conductance levels correctly.
JSMURF The Jump-Segmentation by MUltiResolution Filter, , from Hotz et al. (2013), combines a multiscale criterion with rigorous error control to reliably detect events on various temporal scales simultaneously. More precisely, it takes into account all scales above the filter length, and for each of those intervals, it ignores the first m data-points. As illustrated in Fig. 12 only during these m point long transitions, the convolution is not matching the conductance f. It provides the following strict error control. The probability that the idealization contains at least one false-positive event (an event that is not contained in the true conductance f) is bounded by the error level . The original work (Hotz et al. 2013) assumed homogeneous noise, and (Pein et al. 2021) proposed an extension to heterogeneous noise.
JULES The JUmp Local dEconvolution Segmentation filter, , from Pein et al. (2018), applies a multiscale criterion to all temporal scales and combines it with a postfilter step to remove incremental steps as, for instance, occurring in Fig. 2. Finally, a local deconvolution approach is proposed to idealize short events well. The error level bounds the probability of detecting a false positive approximately. All in all, is particularly designed for homogeneous noise and short events.
HILDE Heterogeneous Idealization by Local testing and DEconvolution (Pein et al. 2021) obtains idealizations in three steps: It applies to detect events on large temporal scales; afterwards, it tests locally for additional short events. Those tests explicitly take filtering into account. The final idealization is once again obtained by local deconvolution. Local tests are performed on scales up to length . The error level is split between the multiscale criterion of (error level ) and the local tests (error level ). False positives occur again only with probability approximately .
Simulation results In the following, we give a brief qualitative summary about the simulation results in (Hotz et al. 2013; Pein 2017; Pein et al. 2018, 2021). Generally speaking, such computer simulations are a systematic but also computation intensive way to determine precisely how long an event has to be such that an idealization method is able to reliably detect it. However, we stress that all quantitative results depend on the signal-to-noise ratio, the filter, and on tuning parameters.
We found that reliably idealizes events of medium or large length (usually, an event has to be at least few times the filter length) even when the conductance change is small, confer (Hotz et al. 2013). This is essential to idealize subgating events. In comparison, and are able to reliably idealize much shorter peaks, if they are isolated. Isolated means that two events have to be separated by at least three times the filter length if homogeneous noise is assumed but at least five times the filter length if heterogeneous noise is assumed (for a filter truncated after sampling points). Moreover, for a good idealization, events have to be usually only few sampling points long, but can be shorter than the filter length. Hence, those two approaches are suitable to idealize flickering. allows events to be a bit shorter than .
is usually the fastest of our three approaches and an idealization of several hundred thousands up to a few million data points take often only seconds (when Monte Carlo simulations have already been performed). In comparison, an idealization of the same dataset with may last around a minute and with few minutes. All run times are measured on a standard laptop and increase typically linearly in the number of data points. A notable exception are situations in which detects almost no change-points, and then, the run time increases quadratically in the number of observations. For instance, the idealization in Fig. 9 took roughly half an hour. Since uses as a first step, its run time is similarly slow.
Analysis of patchclamp recordings
In this section, we provide a step-by-step guide on how to analyze patchclamp recordings using model-free idealizations. In addition, we describe their interplay with Markov model-based analyses, see also the introduction and in particular Fig. 11 for an illustration. Of course, any analysis depends on the specific datasets and its goals. Hence, the following steps should be seen as more of general guidance that has to be interpreted flexibly. We also stress that it contains time-consuming verification steps which might not be necessary in every analysis.
Analysis of the conductance levels For this step, we assume that the underlying protein attains only a finite number of conformations and hence that only a finite number of conductance levels occur. We aim to determine this number, the values of the conductance levels, and possible transitions between those levels. This can be done in various ways and we will only sketch important ideas. Event histograms (histograms of the idealized conductance levels) and amplitude histograms (histogram of the differences between consecutive segments in the idealization) should be used as a visualization of the underlying conductance levels, see Fig. 13.
The idealized conductance levels form a mixture distribution, typically a Gaussian mixture, around the true conductance levels, where randomness results from measurement and idealization errors (and hence the peaks are narrower if those are better performed). Modes correspond to the true conductance levels. An example can be found in Fig. 13. One can use simple approaches based on a Gaussian assumption to estimate modes. We obtained good results using the half sample mode (Robertson and Cryer 1974), because it is quite robust against outliers. In more difficult cases, where peaks cannot be identified that clearly, more involved statistical methodology to estimate the components of a mixture distribution has to be used; for an overview, see (McLachlan and Peel 2004) and the references therein, or the accuracy of the measurement or idealization has to be increased.
Subsequently, one often aims map idealized conductance levels to their corresponding mixture components. This can, for instance, be done by defining non-overlapping intervals around each estimated conductance level (mixture component) and assigning all events whose estimated idealized conductance level lies within an interval to the corresponding mixture component’s conductance level. It is often a good idea to remove segments that are far from any estimated conductance level from the subsequent analysis, i.e., assigning such idealized conductance level to no interval, since they typically result from artifacts. Note that idealization methods are often sensitive enough to detect baseline fluctuations and fluctuations due to pink noise as events. As a result, often several consecutive events are within the same interval and should be interpreted as one segment only. In other words, this process can also merge segments and thus remove spurious events.
Selection and verification of a Markov model As discussed before, a time-continuous Markov model is a common assumption to analyze patchclamp recordings. Since model-free idealizations are obtained without any prior assumption on the gating dynamics, they can be used to determine and verify a Markov model. To avoid statistical dependency, a careful analysis involves splitting the measurements and using the first part to select a Markov model and the second part to verify the model. Recall that a Markov model has two key properties: dwell times (how long a channel stays in one Markov state) are independent of each other and are exponentially distributed. Since it is often simpler, one might aim to verify uncorrelated, instead of independent, dwell times, though lack of correlation does not imply independence. When checking whether dwell times follow a Markov model, one has to take into account that short events might be missed. Nonetheless, at least in simple Markov models with only few states, one readily can check for uncorrelated and exponentially distributed dwell times; for an example, see (Bartsch et al. 2019, Fig. S4 in the supplement).
Parameter estimation Once a specific Markov model is assumed, one has to estimate its parameters. To this end, it is essential to take into account missed events. Missing events shorter than a certain resolution limit are widely discussed in the literature. The exact distribution is calculated by Hawkes et al. (1990), an estimator called of the Q-matrix is suggested by Qin et al. (1996) and integrated in the software package (Nicolai and Sachs 2013), the exact maximum-likelihood estimator for the Q-matrix for two conductance levels is obtained by Colquhoun et al. (1996), and recently, a Bayesian approach was proposed by Epstein et al. (2016). In Pein et al. (2018, (2021); Bartsch et al. (2019, (2020), we applied simpler approximations, which worked well, since the measurements could be modeled well by Markov models with only two or three states.
Verification using hidden Markov approaches This step was already discussed in Hidden Markov Models. The previous analysis using model-free idealizations can be an essential help to perform an analysis using HMM-based approaches. HMM-based approaches are, however, potentially able to achieve better temporal resolution. Hence, both approaches should be used to verify each other’s results, both in terms of parameter estimation and of idealizations.
Discussion
We gave detailed guidance on how to obtain model-free idealizations using , , and , and on how to use those idealizations together with HMM-based approaches to analyze patchclamp recordings. We believe that this provides a rather comprehensive toolkit for the analysis of many patchclamp recordings.
A notable exception are experiments with varying conductance. Such experiments are interesting, since not only the present value of voltage affects the channel, some channels are also affected by the present rate of voltage change. This includes channels that show no gating when the voltage is constant, but can be activated by a varying voltage. For other channels, different dynamics are observed when the voltage changes. One example is the protein channel Tim23 which tends to close when larger voltage levels are applied constantly (Denkert et al. 2017). Moreover, experiments with a constant voltage only allow to examine the gating dynamics at few voltage levels (or require large experimental effort), while with varying voltage, the dynamics can be analyzed for a whole range of voltages by a single experiment. Brief ideas were discussed in (Pein 2017, Sect. 6. Using our software) and (Diehn et al. 2019).
Though model-free approaches are in general more robust to artifacts than HMM-based approaches, confer (Pein et al. 2018, 2021) who demonstrated for and certain robustness to model violations, there is need for improved methodology (either model-free or HMM-based) with a larger focus on robustness.
Supplementary Information
Below is the link to the electronic supplementary material.
Acknowledgements
Financial support of DFG (CRC803, project Z02) over the past 12 years is gratefully acknowledge. We are grateful to our collaborators Annika Bartsch, Ulf Diederichsen, Manuel Diehn, Thomas Hotz, Ingo P. Mey, Tatjana Polupanow, Ole M. Schütte, Ivo Siekmann, Hannes Sieling, Claudia Steinem, Inder Tecuapetla-Gómez, and Laura Yineth Jula Vanegas. Our special thank goes to Florian Ebmeier, Mariyam Khan, and Stanislav Syekirin for their work on the graphical user interface. F. Pein was also supported by EPSRC EP/N031938/1 (Statscale program). A. Munk was also supported by DFG (Cluster of excellence 2067 MBExC Multiscale Bioimaging: From Molecular Machines to Networks of Excitable Cells) and the Volkswagen Foundation (FBMS). We also thank two anonymous referees for their constructive comments and suggestions which helped us to improve this paper.
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.
Footnotes
All data were measured at the Steinem lab, Institute of Organic and Biomolecular Chemistry, Georg-August-University of Göttingen.
Strictly speaking the terminology ’model-free’ is misleading as also model-free approaches require an underlying model to be valid. However, we use this terminology mainly for historical reasons. The precise (non-parametric) models underlying our methodology are reviewed in “Models”. We only assume that the underlying conductance is piecewise constant, but make no further assumptions about the gating dynamics.
Strictly speaking, this models only heteroscedasticity, one special form of heterogeneous noise that is, for instance, caused by open-channel noise. However, we expect our methods also to be robust to other forms of heterogeneous noise, confer the simulation results in Pein et al. (2021).
Special Issue: Multicomponent lipid membranes.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- Almanjahie IM, Khan RN, Milne RK, Nomura T, Martinac B. Moving average filtering with deconvolution (MAD) for hidden Markov model with filtering and correlated noise. Eur Biophys J. 2019;48(4):383–393. doi: 10.1007/s00249-019-01368-1. [DOI] [PubMed] [Google Scholar]
- Ball FG, Rice JA. Stochastic models for ion channels: introduction and bibliography. Math Biosci. 1992;112(2):189–206. doi: 10.1016/0025-5564(92)90023-p. [DOI] [PubMed] [Google Scholar]
- Bartsch A, Llabrés S, Pein F, Kattner C, Schön M, Diehn M, Tanabe M, Munk A, Zachariae U, Steinem C. High-resolution experimental and computational electrophysiology reveals weak -lactam binding events in the porin PorB. Sci Rep. 2019;9(1):1264. doi: 10.1038/s41598-018-37066-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bartsch A, Ives CM, Kattner C, Pein F, Diehn M, Tanabe M, Munk A, Zachariae U, Steinem C, Llabrés S (2020) An antibiotic-resistance conferring mutation in a neisserial porin: structure, ion flux, and ampicillin binding. bioRxiv 10.1101/2020.11.06.369579 [DOI] [PMC free article] [PubMed]
- Basseville M, Benveniste A. Design and comparative study of some sequential jump detection algorithms for digital signals. IEEE Trans Acoust. 1983;31(3):521–535. [Google Scholar]
- Celeux G, Durand JB. Selecting hidden Markov model state number with cross-validated likelihood. Comput Stat. 2008;23(4):541–564. [Google Scholar]
- Celik N, O’Brien F, Brennan S, Rainbow RD, Dart C, Zheng Y, Coenen F, Barrett-Jolley R. Deep-Channel uses deep neural networks to detect single-molecule events from patch-clamp data. Commun Biol. 2020;3(1):1–10. doi: 10.1038/s42003-019-0729-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chambaz A, Garivier A, Gassiat E. A minimum description length approach to hidden Markov models with Poisson and Gaussian emissions. Application to order identification. J Stat Plan Inference. 2009;139(3):962–977. [Google Scholar]
- Colquhoun D (1987) Practical analysis of single channel records. Microelectrode techiques. Company of Biologists, The Plymouth workshop handbook, Cambridge
- Colquhoun D, Hawkes AG, Srodzinski K. Joint distributions of apparent open and shut times of single-ion channels and maximum likelihood fitting of mechanisms. Philos Trans A Math Phys Eng Sci. 1996;354(1718):2555–2590. [Google Scholar]
- Colquhoun D, Sigworth FJ (1995) Fitting and statistical analysis of single-channel records. In: Single-channel recording, Springer, pp 483–587
- de Gunst MCM, Künsch HR, Schouten JG. Statistical analysis of ion channel data using hidden Markov models with correlated state-dependent noise and filtering. J Am Stat Assoc. 2001;96(455):805–815. [Google Scholar]
- Denkert N, Schendzielorz AB, Barbot M, Versemann L, Richter F, Rehling P, Meinecke M. Cation selectivity of the presequence translocase channel Tim23 is crucial for efficient protein import. Elife. 2017;6:e28324. doi: 10.7554/eLife.28324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Diehn M (2017) Inference in inhomogeneous hidden markov models with application to ion channel data. PhD thesis, Georg-August-Universität Göttingen, http://hdl.handle.net/11858/00-1735-0000-0023-3FB4-2
- Diehn M, Munk A, Rudolf D. Maximum likelihood estimation in hidden Markov Models with inhomogeneous noise. ESAIM: P&S. 2019;23:492–523. [Google Scholar]
- Epstein M, Calderhead B, Girolami MA, Sivilotti LG. Bayesian statistical inference in ion-channel models with exact missed event correction. Biophys J. 2016;111(2):333–348. doi: 10.1016/j.bpj.2016.04.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fearnhead P, Künsch HR. Particle filters and data assimilation. Annu Rev Stat Appl. 2018;5:421–449. [Google Scholar]
- Fox JA. Ion channel subconductance states. J Membr Biol. 1987;97(1):1–8. doi: 10.1007/BF01869609. [DOI] [PubMed] [Google Scholar]
- Frick S, Hohage T, Munk A. Asymptotic laws for change point estimation in inverse regression. Stat Sin. 2014;24(2):555–575. [Google Scholar]
- Fuliński A, Grzywna Z, Mellor I, Siwy Z, Usherwood PNR. Non-Markovian character of ionic current fluctuations in membrane channels. Phys Rev E. 1998;58(1):919–924. [Google Scholar]
- Gassiat E, Boucheron S. Optimal error exponents in hidden Markov models order estimation. IEEE Trans Inf Theory. 2003;49(4):964–980. [Google Scholar]
- Gassiat E, Keribin C. The likelihood ratio test for the number of components in a mixture with Markov regime. ESAIM-Probab Stat. 2000;4:25–52. [Google Scholar]
- Gnanasambandam R, Nielsen MS, Nicolai C, Sachs F, Hofgaard JP, Dreyer JK (2017) Unsupervised idealization of ion channel recordings by minimum description length: application to human PIEZO1-channels. Front Neuroinform 11 [DOI] [PMC free article] [PubMed]
- Goychuk I, Hänggi P, Vega JL, Miret-Artés S. Non-Markovian stochastic resonance: three-state model of ion channel gating. Phys Rev E. 2005;71(6):061906. doi: 10.1103/PhysRevE.71.061906. [DOI] [PubMed] [Google Scholar]
- Grosse W, Psakis G, Mertins B, Reiss P, Windisch D, Brademann F, Bürck J, Ulrich A, Koert U, Essen LO. Structure-based engineering of a minimal porin reveals loop-independent channel closure. Biochemistry. 2014;53(29):4826–4838. doi: 10.1021/bi500660q. [DOI] [PubMed] [Google Scholar]
- Hawkes AG, Jalali A, Colquhoun D. The distributions of the apparent open times and shut times in a single channel record when brief events cannot be detected. Philos Trans A Math Phys Eng Sci. 1990;332(1627):511–538. doi: 10.1098/rstb.1992.0116. [DOI] [PubMed] [Google Scholar]
- Heinemann SH, Sigworth FJ. Open channel noise. VI. Analysis of amplitude histograms to determine rapid kinetic parameters. Biophys J. 1991;60(3):577–587. doi: 10.1016/S0006-3495(91)82087-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hotz T, Schütte OM, Sieling H, Polupanow T, Diederichsen U, Steinem C, Munk A. Idealizing ion channel recordings by a jump segmentation multiresolution filter. IEEE Trans Nanobiosci. 2013;12(4):376–386. doi: 10.1109/TNB.2013.2284063. [DOI] [PubMed] [Google Scholar]
- Kass RS. The channelopathies: novel insights into molecular and genetic mechanisms of human disease. J Clin Invest. 2005;115(8):1986–1989. doi: 10.1172/JCI26011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lehéricy L. Consistent order estimation for nonparametric hidden Markov models. Bernoulli. 2019;25(1):464–498. [Google Scholar]
- Levis RA, Rae JL. The use of quartz patch pipettes for low noise single channel recording. Biophys J. 1993;65(4):1666–1677. doi: 10.1016/S0006-3495(93)81224-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McLachlan G, Peel D. Finite mixture models. Hoboken: Wiley; 2004. [Google Scholar]
- Mercik S, Weron K. Stochastic origins of the long-range correlations of ionic current fluctuations in membrane channels. Phys Rev E. 2001;63(5):051910. doi: 10.1103/PhysRevE.63.051910. [DOI] [PubMed] [Google Scholar]
- Neher E, Sakmann B. Single-channel currents recorded from membrane of denervated frog muscle fibers. Nature. 1976;260(5554):799–802. doi: 10.1038/260799a0. [DOI] [PubMed] [Google Scholar]
- Nicolai C, Sachs F. Solving ion channel kinetics with the QuB software. Biophys Rev Lett. 2013;8(03n04):191–211. [Google Scholar]
- Overington JP, Al-Lazikani B, Hopkins AL. How many drug targets are there? Nat Rev Drug Discov. 2006;5(12):993–996. doi: 10.1038/nrd2199. [DOI] [PubMed] [Google Scholar]
- Pein F (2017) Heterogeneous multiscale change-point inference and its application to ion channel recordings. PhD thesis, Georg-August-Universität Göttingen, http://hdl.handle.net/11858/00-1735-0000-002E-E34A-7
- Pein F, Tecuapetla-Gómez I, Schütte OM, Steinem C, Munk A. Fully-automatic multiresolution idealization for filtered ion channel recordings: flickering event detection. IEEE Trans Nanobiosci. 2018;17(3):300–320. doi: 10.1109/TNB.2018.2845126. [DOI] [PubMed] [Google Scholar]
- Pein F, Bartsch A, Steinem C, Munk A. Heterogeneous idealization of ion channel recordings—Open channel noise. IEEE Trans Nanobiosci. 2021;20(1):57–78. doi: 10.1109/TNB.2020.3031202. [DOI] [PubMed] [Google Scholar]
- Pein F, Aspelmeier T (2020) clampSeg: idealisation of patch clamp recordings. https://CRAN.R-project.org/package=clampSeg, R package version 1.1-0
- Pein F, Hotz T, Tecuapetla-Gómez I (2020) lowpassFilter: creates and maintains lowpass filters. https://CRAN.R-project.org/package=lowpassFilter, R package version 1.0-0
- Qin F, Auerbach A, Sachs F. Estimating single-channel kinetic parameters from idealized patch-clamp data containing missed events. Biophys J. 1996;70(1):264–280. doi: 10.1016/S0006-3495(96)79568-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qin F, Auerbach A, Sachs F. Hidden Markov modeling for single channel kinetics with filtering and correlated noise. Biophys J. 2000;79(4):1928–1944. doi: 10.1016/S0006-3495(00)76442-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raj Singh P, Ceccarelli M, Lovelle M, Winterhalter M, Mahendran KR. Antibiotic permeation across the OmpF channel: modulation of the affinity site in the presence of magnesium. J Phys Chem B. 2012;116(15):4433–4438. doi: 10.1021/jp2123136. [DOI] [PubMed] [Google Scholar]
- Robertson T, Cryer JD. An iterative procedure for estimating the mode. J Am Stat Assoc. 1974;69(348):1012–1016. [Google Scholar]
- Sakmann B, Neher E. Single-channel recording. 2. Berlin: Springer; 1995. [Google Scholar]
- Schroeder I. How to resolve microsecond current fluctuations in single ion channels: the power of beta distributions. Channels. 2015;9(5):262–280. doi: 10.1080/19336950.2015.1083660. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shelley C, Niu X, Geng Y, Magleby KL. Coupling and cooperativity in voltage activation of a limited-state BK channel gating in saturating Ca2+ J Gen Physiol. 2010;135(5):461–480. doi: 10.1085/jgp.200910331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siekmann I, Wagner LE, Yule D, Fox C, Bryant D, Crampin EJ, Sneyd J. MCMC estimation of Markov models for ion channels. Biophys J. 2011;100(8):1919–1929. doi: 10.1016/j.bpj.2011.02.059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sivilotti L, Colquhoun D. In praise of single channel kinetics. J Gen Physiol. 2016;148(2):79–88. doi: 10.1085/jgp.201611649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Syekirin S, Pein F (2020) readABF: Loads Axon Binary Files. R package version 1.0.2 https://cran.r-project.org/package=readABF
- Tecuapetla-Gómez I, Munk A. Autocovariance estimation in regression with a discontinuous signal and m-dependent errors: a difference-based approach. Scand J Stat. 2017;44(2):346–368. [Google Scholar]
- VanDongen AM. A new algorithm for idealizing single ion channel data containing multiple unknown conductance levels. Biophys J. 1996;70(3):1303–1315. doi: 10.1016/S0006-3495(96)79687-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Venkataramanan L, Walsh JL, Kuc R, Sigworth FJ. Identification of hidden Markov models for ion channel currents. I. Colored background noise. IEEE Trans Signal Process. 1998;46(7):1901–1915. [Google Scholar]
- Venkataramanan L, Kuc R, Sigworth FJ. Identification of hidden Markov models for ion channel currents. III. Bandlimited, sampled data. IEEE Trans Signal Process. 2000;48(2):376–385. [Google Scholar]
- Virji M. Pathogenic neisseriae: surface modulation, pathogenesis and infection control. Nat Rev Microbiol. 2009;7(4):274. doi: 10.1038/nrmicro2097. [DOI] [PubMed] [Google Scholar]
- Viterbi A. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Trans Inf Theory. 1967;13(2):260–269. [Google Scholar]
- Yellen G. Ionic permeation and blockade in Ca2+-activated K+ channels of bovine chromaffin cells. J Gen Physiol. 1984;84(2):157–186. doi: 10.1085/jgp.84.2.157. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.