Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Sep 13.
Published in final edited form as: J Neural Eng. 2023 Sep 13;20(5):10.1088/1741-2552/acf1cc. doi: 10.1088/1741-2552/acf1cc

Three Novel Methods for Determining Motor Threshold with Transcranial Magnetic Stimulation Outperform Conventional Procedures

Boshuo Wang 1, Angel V Peterchev 1,2,3,4, Stefan M Goetz 1,2,4,5,*
PMCID: PMC10516469  NIHMSID: NIHMS1931192  PMID: 37595573

Abstract

Objective.

Thresholding of neural responses is central to many applications of transcranial magnetic stimulation (TMS), but the stochastic aspect of neuronal activity and motor evoked potentials (MEPs) challenges thresholding methods. We analyzed existing methods for obtaining TMS motor threshold and their variations, introduced new methods from other fields, and compared their accuracy and speed.

Approach.

In addition to existing relative-frequency methods, such as the five-out-of-ten method, we examined adaptive methods based on a probabilistic motor threshold model using maximum-likelihood (ML) or maximum a-posteriori (MAP) estimation. To improve the performance of these adaptive estimation methods, we explored variations in the estimation procedure and inclusion of population-level prior information. We adapted a Bayesian estimation method which iteratively incorporated information of the TMS responses into the probability density function. A family of non-parametric stochastic root-finding methods with different convergence criteria and stepping rules were explored as well. The performance of the thresholding methods was evaluated with an independent stochastic MEP model.

Main Results.

The conventional relative-frequency methods required a large number of stimuli, were inherently biased on the population level, and had wide error distributions for individual subjects. The parametric estimation methods obtained the thresholds much faster and their accuracy depended on the estimation method, with performance significantly improved when population-level prior information was included. Stochastic root-finding methods were comparable to adaptive estimation methods but were much simpler to implement and did not rely on a potentially inaccurate underlying estimation model.

Significance.

Two-parameter MAP estimation, Bayesian estimation, and stochastic root-finding methods have better error convergence compared to conventional single-parameter ML estimation, and all these methods require significantly fewer TMS pulses for accurate estimation than conventional relative-frequency methods. Stochastic root-finding appears particularly attractive due to the low computational requirements, simplicity of the algorithmic implementation, and independence from potential model flaws in the parametric estimators.

Keywords: transcranial magnetic stimulation, motor threshold, threshold finding methods, parametric estimation, Bayesian estimation, stochastic root-finding

1. Introduction

1.1. Significance and challenges of thresholding in TMS

Transcranial magnetic stimulation (TMS) is a noninvasive technique to activate neurons in the brain [1], [2]. Certain pulse rhythms can further modulate neural circuits, i.e., change how they process endogenous signals [3]. A key parameter of the stimulation pulses is their strength, which influences not only how many neurons but also which neuron types are stimulated [4]-[6]. As the TMS pulse strength affects the activation of neuron types with different functions, it also influences the efficacy of neuromodulation protocols and can even reverse the direction of the effect [7]-[11]. Finally, the stimulation strength is a key parameter not only for ensuring an effective and reproducible procedure but also for safety, and strength limitations for various modulatory protocols are recommended to avoid seizures [12]-[14].

Typically, the TMS pulse strength is individualized to a specific subject’s or patient’s anatomy and physiology based on a measured response threshold. A commonly used threshold is the motor threshold based on activation of finger muscles to stimulation of their representation in the primary motor cortex, but other types of thresholds could also be used [15]-[22]. For the motor threshold, the muscle movement following stimulation in the primary motor cortex is observed visually in some cases, typically in clinical practice, whereas in other cases, typically in research, the size of motor evoked potentials (MEPs) is measured with electromyography (EMG) at the target muscle [23]-[25]. The motor threshold is routinely also used as a reference for other brain targets and its detection is recommended before any further neuromodulatory intervention, even for recurring subjects or patients [26]-[28]. Furthermore, the threshold and its changes can serve as a quantitative metric, e.g., for the diagnosis of lesions [29]. The centrality of the motor threshold to TMS procedures underscores the need for well-defined, rapid, accurate, and unbiased thresholding procedures.

However, the MEPs in response to stimulation in the primary motor cortex are highly stochastic due to the dynamic variability in cortical neurons and the corticospinal tract — a pulse administered twice with identical strength can lead to a maximum MEP response close to saturation once and no response at all in the other case [30]-[32]. Moreover, the variability is far from Gaussian and appears to involve several variability sources [32]-[35]. Consequently, accuracy of the estimation procedure is hard to guarantee and achieving it with a suboptimal estimation algorithm can be time consuming.

1.2. Existing motor threshold estimation methods

The TMS motor threshold, xth, is usually defined as the lowest stimulation pulse amplitude, x, given as a percentage of the maximum stimulator output (MSO), leading to a peripheral muscle response, r, with a 50% probability [27], [28], [36],

xth =argminx(P(r=1x)=50%). (1)

The binary response r is defined as

r={1 movement observed, or Vpp50μV0 movement absent, or Vpp<50μV, (2)

where Vpp is the peak-to-peak MEP amplitude [37]. Historically, the value of 50 μV was chosen relatively arbitrarily as a low level that is not far from the noise limit of most EMG detection systems [38]-[40]. The most common practice for motor threshold finding, or “threshold hunting”, is estimating the relative frequency of positive and negative responses by appropriate stepping of the stimulus amplitude to find the 50% probability level. Therefore, at each test pulse amplitude, a number of stimuli—usually up to ten—have to be administered, hence the name “five-out-of-ten method”. Such an approximation of the probability by a relative frequency within just ten stimuli can be rather imprecise, and a larger number of trials, e.g., twenty or more, per stimulation amplitude is required to obtain more reproducible results for clinical and research purposes [28], [41]. Given its simplicity, the relative-frequency method is widely used and often serves as a benchmark for other methods, despite concerns regarding its mathematical validity or usefulness [41]-[43].

Another well-described method based on the relative response frequency was proposed by Mills and Nithi [39]. Instead of detecting the threshold directly, it uses one point above threshold (upper threshold) and one below (lower threshold), and the threshold estimate itself is defined as the mean of these two points. Mathematically, this relates to a local linearization of the sigmoidal probability distribution of the binary neuronal response as a function of the stimulation amplitude. Similarly, both upper and lower thresholds are defined statistically—all ten responses suprathreshold and subthreshold, respectively. Both the relative-frequency and Mills–Nithi methods require a high number of stimuli [40], [44] and consequently a long duration to complete due to the necessary pause between two stimuli for preventing inter-pulse correlation [45]-[47]. Furthermore, a large number of stimuli also bears a risk of unintended neuromodulation and side effects [14], and therefore thresholding methods should ideally use fewer pulses.

Instead of determining relative frequencies, an alternative thresholding method performs a maximum-likelihood (ML) test for a predefined model describing the response probability distribution—most commonly of a cumulative Gaussian function with two parameters, the midpoint/threshold and slope/spread1. This maximum likelihood estimation (MLE) method was introduced by Awiszus [48] and was adapted from psychophysics where responses have strong stochastic characteristics as well [49], [50]. The result of such parameter estimation by sequential testing (PEST) provides the most likely parameters, which include the threshold. An essential condition is a good selection of the tested stimulus amplitudes [51]. In contrast to frequency-based methods, each amplitude is usually applied just once. Mishory et al. [44] reported that the number of required stimuli and time for determining threshold was markedly lower for ML-PEST compared to the Mills–Nithi two-threshold method, whereas both methods have similar accuracy.

1.3. Limitations of existing methods and alternative approaches

Existing motor threshold estimation methods have some limitations that can be addressed to improve the accuracy and speed of the procedure. A key feature that can notably influence performance is the stepping rule [51], which chooses the next stimulation amplitude adaptively by taking into account the information from the previous observations. For the next stimulation amplitude, the ML-PEST [48] chooses the MLE of the threshold, which is obtained from a reduced single-parameter model of the response and uses a fixed nonstochastic relationship for the spread of the probability function. This relationship is assigned based on averages from representative measurements [48], and therefore does not reflect individual variability. A limitation of this approach is that by not allowing any deviation from a fixed spread–threshold relationship, the method may place all stimuli systematically too high or too low with respect to the optimal stepping for an individual subject, which can lead to slow stepping and large threshold misestimations if the initial stimulation amplitude is poorly chosen [52], [53].

The threshold estimation can be improved by a full two-parameter MLE in which both the threshold and spread are free to vary. Incorporating additional statistical information from prior or typical results of the thresholding procedure can further improve the estimation [52], [54], which is a method adapted from the field of psychophysics [51]. Although Qi et al. called it a Bayesian method [54], this name is not adopted here since the approach incorporates a-posteriori information into a classical Fisher MLE using Bayes’ theorem. This has been known in mathematics [55] and applied fields [56] as maximum a-posteriori (MAP), a term we adopt in this paper.

While Bayes’ theorem has been used for MAP estimation to improve the TMS threshold estimate [52], [54], both ML and MAP methods are point estimators. In contrast to these point estimators, strictly Bayesian methods that are consequently and completely premised on Bayesian statistics [57]-[59] have not been used in TMS. In this text, the Bayesian style particularly refers to the updating rule for the probability function after new observations, which should therefore follow the Bayesian learning paradigm. We adapted the Bayesian estimation method from Kontsevich and Tyler [57] for TMS motor threshold estimation, using the same model as in the point estimators.

Both the relative-frequency methods and the parameter estimation methods adapted from psychophysics reduce the TMS motor responses to binary events (2). The use of EMG for measuring muscle activation by TMS provides a continuous response strength and thus richer information. The EMG motor threshold is not a binary transition between no response and response but a point on the dose–response curve close to the lowest stimulation strength that elicits an excitatory response detectable with conventional EMG recording and analysis methods [60]-[62]. Further, the motor threshold is likely not the lowest stimulation strength to evoke detectable responses, as recent studies demonstrate that the 50 μV threshold is likely closer to the middle of the log-transformed dose–response curve of MEPs when it is separated from the baseline noise [63], [64]. The smoother transition of the expected value of the responses in dependence of the stimulation strength may allow the use of the MEP amplitude to find the threshold faster. To consider this additional analog information of the response amplitude appropriately, stochastic root-finding methods [65], [66] (also known as stochastic approximation) are adapted and evaluated here.

1.4. Organization of this paper

We first describe existing TMS thresholding methods as well as the aforementioned improvements and introduce the Bayesian estimation and stochastics root-find methods, covering a total of three classes and 22 variations of methods. To evaluate and compare the thresholding methods, an independent testbed incorporating a detailed stochastic MEP model [61] is used, with 25,000 virtual subjects. The performance of the thresholding methods is analyzed based on their accuracy of the threshold estimate and speed to acquire threshold (relative-frequency methods) or convergence rate of the threshold error (other methods). The best-performing methods of each class are identified and the trade-off between different methods is discussed. The results show that the three alternative approaches—two-parameter MAP estimation, Bayesian estimation, and stochastic root-finding—outperform single-parameter ML estimator, and all these methods require significantly fewer TMS pulses for accurate motor threshold estimation than the conventional relative-frequency methods.

2. Methods

2.1. Overview of thresholding methods

The estimation of a neuronal property such as the excitation threshold can be performed in two fundamentally different ways. The model-based parameter estimators (MLE, MAP, and Bayesian methods) require a probabilistic description of the input–output black box being analyzed in dependence of a certain parameter set. An inaccurate model will very likely deteriorate the quality of the estimate. In contrast, the early approaches, namely the five-out-of-ten or ten-out-of-twenty relative-frequency method based on recommendations of the International Federation of Clinical Neurophysiology (IFCN) [27], [28] or the Mills–Nithi two-threshold method [39], do not need such a predefined model description as they refer directly to the definition of the threshold. The stochastic root-finding methods can also be counted among this class of non-parametric systems. In the following, the existing and several new or adapted algorithms (Table 1) are shortly described and compared for the application in TMS. The thresholding methods were simulated with Matlab (version 2021a, The Mathworks, Natick, MA, USA) on the Duke Compute Cluster (code available online at [67]) and data analysis and visualization were performed on a desktop computer (MATLAB, version 2023a).

Table 1.

Summary of thresholding methods

Class Methods Variation, threshold estimate xth,i and stepping xi+1, or convergence of ai
Relative-frequency based IFCN relative-frequencya Five-out-of-ten (5/10)
Ten-out-of-twenty (10/20)
Mills-Nithia Original
Modified
Parametric estimation MLE-1b,c, MLE-2b xth,i=xi+1=tMLE,i
MAP-1d, MAP-2 xth,i=xi+1=tMAP,i
Bayesian, Bayesian-NPe xth,i=E(t);xi+1=argmin E(Hi(x,r))
Bayesian-TS xth,i=xi+1=E(t)
Stochastic root-finding ACS-H, DCS-H Harmonic (i-1), non-adaptive
ACS-HA, DCS-HA Harmonic (i-1), adaptive
ACS-GA, DCS-GA Geometric (2-i/2), adaptive
ACS-GHA Geometric (2-i/2) switchingf to harmonic (i-1), adaptive
a

Each variation implemented with both 1% and 2% MSO step size.

b

Implemented using both direct/implicit maximization and explicit evaluation of the log-likelihood function.

c

Also known as adaptive threshold hunting/estimation [48], best-PEST [48], [54], MLE [68], or ML-PEST [53]. Implemented in MTAT [69] and ATH-tool [70].

d

Also known as Bayesian-PEST [54].

e

Implemented with no prior population information.

f

Transition when the maximum change of xi was expected to be less than 1.5% MSO.

2.2. Nonparametric methods

2.2.1. IFCN relative-frequency method

The procedure described by the IFCN [27], [28], [71] leaves freedom of interpretation of some details. For the analysis in this paper, the relative-frequency method was defined as follows to realistically represent the implementation in many laboratories:

  1. The starting amplitude is known from the hotspot search (see section 2.4.2 Simulation routine for further details on the starting amplitude), in which the corresponding neuron population being responsible for activating the observed muscle group (e.g., the first dorsal interosseus or the abductor pollicis brevis) is located with test stimuli on the primary motor cortex. The required amplitude adjustments in this step lead to an amplitude not too far above the desired threshold as spatial focality would be compromised with stronger stimuli. Starting at an amplitude too high above threshold further risks discomfort and other side effects, especially for subjects with high sensitivity and/or low threshold.

  2. At a given stimulation amplitude, a number of stimuli are applied up to a maximum of ten or twenty until either the count of positive responses reaches half the number (five or ten, respectively) or that of negative responses exceed half the number (six or eleven, respectively).

  3. Being suprathreshold by definition in the situation of positive responses accounting for at least half the number of stimuli, the stimulation amplitude is stepped downward, typically by 1% or 2% MSO [28], [71], and Step b) is repeated.

  4. The latter outcome in Step b) precludes 50% positive responses within the number of stimuli and the amplitude is subthreshold. The threshold itself is defined as the last level before this termination point.

Versions with the number of stimuli of ten and twenty and step size of 1% and 2% MSO were performed to show the effect of the number of stimuli per step and the step size on the accuracy and the number of stimuli required for the methods.

2.2.2. Mills–Nithi two-threshold method

The original Mills–Nithi method [39] was carried out as follows:

  1. The subject is stimulated with pulses starting at an amplitude of 20% MSO and increasing in 10% MSO steps until the first positive response is identified.

  2. Applying up to 10 pulses per stimulation amplitude, the stimulation is then decreased in 1% MSO steps when a positive response is observed. The intensity at which all 10 stimuli produce no response is designated as the lower threshold.

  3. Starting from the lowest amplitude that has not resulted in a “no response”, i.e., the last amplitude that has one stimulus with positive response during Step b), and increasing the stimulation in 1% MSO steps, the upper threshold is designated as the minimum intensity at which 10 stimuli all produce positive responses.

  4. The threshold is calculated as the mean of the lower and upper thresholds.

In our study, a modified version of the Mills–Nithi method was performed using the same starting amplitude as the IFCN relative-frequency method. The first negative response was identified by decreasing the amplitude in 10% MSO steps and the upper and lower thresholds were then similarly obtained as in the original methods but in reversed order. Step sizes of 1% and 2% MSO were tested.

2.2.3. Stochastic root-finding methods

The IFCN relative-frequency and Mills–Nithi methods described above both require binary information reduction through quantization of the response (above versus below 50 μV). In contrast to such quantization, robust numeric root-finding algorithms that are still relatively unknown in TMS can directly act on analog data, e.g., the voltage amplitude of the MEP response. Preserving valuable continuous information in the response, such root-finding approaches could enable an improvement of convergence. The sequential formulation proposed here was derived from the method of Robbins and Monro [65], [66], with the neuronal variability playing the role of a noise source. The requirements for this approach are rather weak and comprise constant monotonicity on average [65].

The next stimulation amplitude, xi+1, was defined relative to the current one using an iterative analog control sequence (ACS) that was intended to converge towards the threshold:

xi+1=xiaiΔyi=xiai(yiyth), (3)

where i1 was the step number, ai was the step size of the control sequence, xi denoted the stimulation amplitude and the current threshold estimate xth,i=xi,Δyi=yi-yth, and yi and yth were logarithms of the response yi=lgVi/[1 V] observed at step i and at the MEP threshold Vth (50 μV peak-to-peak by definition), respectively.

The stochastic root-finding sequence converged towards a stimulation amplitude generating an average/expected 50 μV MEP, which should be close to the threshold amplitude generating a median 50 μV MEP. The log-transformation accounted for the first-approximation log-normal distribution of the MEP response amplitude following an inductive stimulus in the motor cortex [46]. The resulting distribution of yi had reduced skewness and should be closer to Gaussian and therefore statistically well-behaved, e.g., with mean close to median (Figure 1Ai). Performing the whole procedure in the logarithmic space also granted the requirement of constant monotonicity. The iteration using the MEP responses thus became

xi+1=xiailg(Vi/Vth). (4)
Figure 1: TMS response model.

Figure 1:

A. An example of the TMS sigmoidal recruiting behavior that is mimicked by the MEP model. i. In the EMG space, this virtual stimulator provides analog values, which some of the algorithms can utilize. The model provides uncorrelated MEP responses (gray dots), which are independent from the tested black-box models in the parameter estimators to avoid model-determined advantages. The median MEP amplitude (black line) crosses 50 μV (black dashed line) at the stimulation threshold (blue line), whereas the mean (log-transformed) MEP amplitude (red dashed line) may not. ii. In the probability space, the threshold is determined from the relative frequency of responses by linear regression (blue line) to find the 50% probability amplitude. The starting amplitude was determined as the minimum amplitude at and above which the response probability was no less than 95%. A Gaussian response model (red line) can also be obtained by applying MLE to the stimulation-response pairs, which fitted well to the relative frequencies for the majority of subject. B. The ground truth thresholds and the slope/spread at the threshold are obtained by linear regression for the 25,000 virtual subjects. i. Joint distribution of threshold and spread. ii.-iv. Distributions of threshold, spread, and spread-to-threshold ratio. The threshold and spread distributions are close to normal or log-normal, with the fitted distributions shown by the red dashed lines. The nominal 0.07-ratio is shown by black dashed lines.

We implemented several versions of the sequence of step sizes ai to compare the convergence properties. In the basic version ACS-H, the step size followed a harmonic series

ai=a0/i,  for i1, (5)

with almost sure (a.s.) convergence2 for the stimulation amplitude towards its target.

Following the style of Delyon and Juditsky [72], adaptive methods were utilized to accelerate convergence by adjusting the control sequence at step i(i>1) only when the response changed from subthreshold to suprathreshold or vice versa from the previous step (i-1), i.e., following a sign change in Δyi:sgnΔyisgnΔyi-1. For harmonic convergence, the corresponding adaptive control sequence ACS-HA was

ai=a0/(ci+1),  for i1, (6)

where ci was the number of sign changes of Δyi up to step i,

ci={ci1sgn(Δyi)=sgn(Δyi1)ci1+1sgn(Δyi)sgn(Δyi1),  for i>1, (7)

with c10. Unlike classical mathematical problems, the TMS sequence has only a very limited number of steps. Thus, quick convergence seems more attractive here rather than any statement for the case of infinite duration. To test faster geometric convergence of 2-i/2, ACS-GA used an adaptive control sequence of

ai=a02ci/2 (8)

Further variations of these methods were designed to examine the convergence rate and performance. To take advantage of both fast convergence and the a.s. convergence properties, a hybrid method ACS-GHA was designed to switch the control sequence ai from geometric to harmonic convergence when the threshold estimate was close to the true threshold and the expected change in xiΔxi=aiΔyi was small—less than 1.5% MSO. Quantized versions, i.e., digital control sequences (DCS), were implemented for comparison as well. The analog value Δyi was reduced using a modified sign function to δyi=sgnΔyi{-1,+1} for the two different cases of below threshold and above (or equal) threshold. DCS allowed the stochastic root-finding method to be used also without EMG recordings, just “response” and “no response” inputs, in a manner compatible with the other estimation methods. We also explored three second-order analog variations of root-finding methods, which did not improve the performance [73] and were not reported here.

The definition of appropriate initial step size a0 was assisted by the iteration formulation (3), which resembled a slope [74] after rearrangement

ai1=ythyixi+1xi. (9)

For reaching the threshold quickly, a0 should accordingly be not too far from the cotangent of the line connecting the present stimulation–response xi,yi towards the threshold xth,yth. Larger values increase the speed of approach of xi towards xth, but may cause the stepping to overshoot and oscillate around the target, whereas smaller numbers promote lengthy one-sided convergence. Statistical information about typical and maximum slopes can inform the choice of a0. For the simulations in this paper, the default initial step size was chosen such that it approximated typical cotangents around the inflection point of the recruitment curve (see section 2.4.1 Response model)

a0=0.2lg(102)lg(105)0.067. (10)

Here lg⁡(10−5) and lg⁡(10−2) (i.e., 10 μV and 10 mV, respectively) were approximate response amplitudes near the EMG noise floor and saturation level at the lower and higher ends of the input–output curve, respectively, and 0.2(20% MSO) was a sufficient change in stimulus strength to cover the transition of the typical sigmoidal response curve. Initial step size from one tenth to ten times the default values evenly sampled on a logarithmic scale were also tested.

2.3. Parametric methods

2.3.1. Basic maximum-likelihood estimation method

The implementation of the basic MLE method (ML-PEST) followed substantially the suggestions and definitions by Awiszus [48]. The response probability was described by a cumulative Gaussian model with two degrees of freedom—the threshold t=xth and the spread s:

p(r=1x,t,s)=1s2πxexp((xt)22s2)dx. (11)

Here, r denoted the binary response (2) of a single stimulus x, and the slope at the threshold, (s2π)-1, was inversely related to the spread.

The likelihood function was defined for independent events

p(rx,t,s)={iri=1}p(ri=1xi,t,s){iri=0}p(ri=0xi,t,s), (12)

where x and r were the lists of stimuli and corresponding responses, respectively, and the probability for failing to evoke a response was p(r=0x,t,s)=1-p(r=1x,t,s). According to the classical Fisher approach, the maximum of this expression for the arguments t and s,tMLE,sMLE=argmax(t,s)p(rx,t,s), provided the most likely parameter set, i.e., the MLE. Due to the strict monotonicity of the logarithm, the product can be split into an equivalent sum, which formed the log-likelihood and was also numerically more robust. The MLE of t at step i provided both the threshold estimate at the current step xth,i and stimulation amplitude at the next step xi+1, i.e., xi+1=xth,i=tMLE,i.

Previous implementations of MLE (12) (e.g., MTAT and ATH-tool, [48], [69], [70]) were simplified by using only the threshold parameter t, whereas the spread parameter was fixed to the value s=0.07t. We replicated this method (MLE-1) and also performed a full MLE with both parameters (MLE-2). Two approaches were implemented and compared to find the threshold t that maximize the likelihood function. The first approach directly maximized the likelihood function over t (implicitly using MATLAB function fmincon, applied on the negative of the log-likelihood). This approach, however, could be unstable during the initial steps when there were few amplitude and response samples (r,x) and the likelihood function had a plateau, i.e., a range of t or (t,s) combinations resulted in the same maximum. The second approach explicitly evaluated the likelihood function for a dense sampling of the parameters t and s and tMLE was calculated as the mean of the argument t over the range of the maximum plateau of the likelihood function [48]. The parameter space was discretized with Δt=0.002 for t(0,1.3]; for MLE-2, 100 logarithmical-distributed samples s were chosen from [0.005, 0.5] and the mean of t was calculated for all (t,s) combinations for which the likelihood function was at its maximum (i.e., if for different s samples, the likelihood function reached maximum for the same t, these t samples were repeated in the average). The upper bound of the threshold parameter exceeded one, which is explained in Section 2.4.1 Response model.

2.3.2. Maximum-a-posterior estimation method

Modified estimation methods (MAP-1 and MAP-2) were designed based on a MAP-like approach [52], [54]. To improve the performance for common cases, the likelihood function (12) was modified to incorporate statistical information of the parameters using Bayes’ theorem per

p(θx,r)=p(θ)p(rx,θ)p(rx), (13)

in which the vector θ=(t,s) denoted the parameter set. The additional information about the distribution of typical parameters p(θ) had to be provided in advance (see section 2.4.1 Response model). When such information is not available, the MAP algorithm can be designed to collect the information itself, i.e., to estimate and progressively refine the parameter distribution from subjects on which it already performed threshold estimation. In that regard, MLE can be considered a special case of MAP estimation, where the parameters have a wide uniform distribution.

For maximizing the a-posterior likelihood (13) with respect to θ, the last factor p(rx) was a constant and therefore omitted. Similar to MLE, the calculation was performed on the logarithm of the likelihood, and the parameter tMAP at step i provided both the threshold estimate at the current step and stimulation amplitude at the next step, i.e., xi+1=xth,i=tMAP,i.

2.3.3. Kontsevich–Tyler Bayesian estimation method

The Bayesian methods here were adapted from Kontsevich and Tyler [57]. All central elements were taken as described, using the Gaussian black box (11) as in the other ML and MAP estimators for consistency. Based on this, new information was embedded into the probability density function p(rx,θ) at every step i using Bayes’ theorem to form the posterior probability

pi(θx,r)=pi(θ)p(rx,θ)+npi(θ)p(rx,θ)dnθ, (14)

where pi(θ) was the probability distribution of the parameters at step i.

The next stimulation amplitude xi+1 was determined using an information-theoretical approach which minimized the expected entropy at the next step [57]

xi+1=argmin xE(Hi(x,r)) (15)

Here, the entropy Hi(x,r) was calculated from the posterior probability as

Hi(x,r)=+npi(θx,r) log(pi(θx,r))dnθ, (16)

and the expectation E was a weighted sum of the entropy Hi(x,r)

E(Hi(x,r))=r=0,1pi(rx)Hi(x,r). (17)

The weights were the probability pi(rx) of sub- and supra-threshold responses at step i, calculated from the current parameter distribution and the Gaussian model (11)

pi(rx)=+np(rx,θ)pi(θ)dnθ, r{0,1}. (18)

Each stimulus–response pair narrowed down the posterior probability density distribution to

pi+1(θ)=pi(θxi+1,ri+1), (19)

and the expected value provided the estimates for the parameters at each step per

θi(l)=E(θ(l))=+nθ(l)pi(θ)dnθ, (20)

where l was the parameter index (i.e., l=1,2 for θ=(t,s)). Therefore, the threshold estimate was xth,i=ti=E(t). In contrast to the point estimator methods, the probably density functions (11) and (14) were explicitly computed and stored for the whole parameter space: r{0,1},x(0,1.3],t(0,1.3], and s[0.005,0.5]. For computational purposes, the parameter space was discretized with Δx=0.01 and Δt=0.002, and 100 logarithmical-distributed samples were chosen for s, corresponding to 8.5 million points for p(r=1x,θ). Such sampling density turned out sufficient as doubling the number of samples did not improve performance in earlier exploration (results not shown). The initial distribution p0(θ) was chosen as the population distribution p(θ) (see section 2.4.1 Response model).

Two variations of the Bayesian method were implemented. The first one had no prior (NP) population information—using a uniform distribution for p0(θ)—to evaluate the contribution of the prior information to convergence. To compare the entropy minimization stepping rule against the threshold stepping (TS) of the MLE and MAP methods, the second variation skipped the entropy calculation ((15)–(18)) and used the latter’s choice of the threshold estimate at step i (20) for the stimulation amplitude for the next step: xi+1=xth,i=E(t).

2.4. Testbed

2.4.1. Response model

We implemented a realistic MEP model [61] which included variability patterns such as amplifier noise, saturation, and neural variability behavior [30], [34], [35]. The model’s increased complexity compared to other numerical studies concerning threshold-estimation methods [48], [75] was intended not only to reproduce more faithfully MEP behavior but also to avoid using the very same models underlying the parameter estimation methods for the test procedure, which would risk numerical favoritism for those particular methods. The response model provided analog MEP amplitude values based on a stochastic recruitment curve (Figure 1Ai), with parameters mimicking real measurements [76]. The analog amplitude response, i.e., the peak-to-peak MEP amplitude VPP in units of volt, was defined as

VPP(x)=ϵyadd,ij(p1,j,p8,j,p9,j)+10ϵymult,ij(p6,j)10p2,j/(1+p3,j(xp5,j+ϵxadd,ij(p7,j))p4,j), (21)

where p1,j to p9,j were the individual response parameters for subject j obtained from an experimental population [34], and intra-individual variability sources ϵxadd,ij and ϵymult,ij and the additive noise ϵyadd,ij varied between subjects j and for pulses i along distributions with subject-dependent parameters. The distributions of recruitment parameters, intra-individual variability, and noise are given in the Appendix. While nominally the stimulation amplitude x should be in the range [0, 1], we extended the range of possible stimulation amplitudes up to 130% MSO in this study, which was 30% above the maximum threshold within the virtual population. This extension was performed to focus on the performance of the thresholding algorithms themselves and eliminate influence from the device and choice of stimulation target. Specifically, the MEP response model was fitted to experimental data obtained with a controllable pulse parameter TMS device using 60 μs pulse width with the maximum capacitor voltage chosen to match stimulation strength to conventional TMS devices for a specific muscle target [34], [76], [77], and stimulation amplitudes higher than the model’s nominal 100% MSO could be achieved by increasing the capacitor voltage or with a different TMS device.

Model parameters were generated for a total of 25,000 virtual subjects. Each subject’s input–output curve was characterized using 100,000 stimulation–response pairs, with a higher number of pulses applied near the amplitudes that generated a 50 μV response in the corresponding noiseless model (see Appendix). The ground truth threshold for each subject was determined by linear regression of the relative frequency of suprathreshold responses to find the amplitude with 50% response probability (Figure 1, panels Aii, Bi, and Bii). The regression also provided the slope at the threshold, which was converted to the spread for the corresponding Gaussian model (Figure 1, panels Bi and Biii). The ratio between the spread and threshold (Figure 1Biv, median, mean, and SD of 0.076, 0.077, and 0.012, respectively) was close to but on average ~10% larger than the fixed nominal value of 0.07 from a previous study [48] and also had a wide distribution. The joint probability density distribution of threshold and spread p(t,s) can be fitted with normal and lognormal distributions, respectively, with details given in the Appendix. The fitted distribution p(t,s) served as prior information p(θ) or p0(θ) for the MAP and Bayesian methods, respectively. By applying MLE to each subject, the parameters t and s for the Gaussian model were obtained and analyzed (Figure S1A). For approximately 90% of subjects, the response probability distributions estimated with Gaussian MLE matched their ground truth well (Figure S1B). For the remaining 10% of subjects, however, MLE resulted in large errors (especially for the spread parameter), showing that the Gaussian model was not a universal fit.

2.4.2. Simulation routine

The testbed comprised all threshold detection algorithms defined previously and the response model. In order to meet the real conditions of a threshold search, each algorithm was given information from the spatial hotspot search, i.e., a starting amplitude which was known to be above the threshold [52], [78], [79]. To avoid ambiguity and circular dependence between the definitions of threshold amplitude and hotspot location [80], the amplitude for the hotspot search was chosen as the amplitude that for at least 95% of the stimuli (and thus “consistently” [28]) generated a response (Figure S2). The starting amplitude was on average 8.7% MSO higher than the ground truth threshold (1.13 times threshold). For the algorithms that use specific integer amplitudes (i.e., IFCN relative-frequency and Mills–Nithi methods), the starting amplitude was rounded to 1% MSO.

The responses for subsequent stimuli were provided to the algorithms by the full MEP model. Whereas the analog versions of the stochastic root-finding methods directly processed this continuous information, all other methods quantized the response into the binary representation of above and below threshold first. To examine both their short- and long-term behaviors, the parametric estimation algorithms and stochastic root-finding methods were tested with a maximum of 200 steps each. The absolute and relative errors of each threshold estimate xth,est versus the ground truth xth,GT  were analyzed with

Δxth=xth,estxth,GT,δxth=Δxthxth,GT. (22)

For comparison with previous studies in the literature, the absolute values of the errors Δxth and δxth were also analyzed, of which δxth was previously referred to as the “relative error” [48], [54]. The results mostly present the relative errors δxth, as they provide normalized distance of the threshold estimate from the true value expressed as a percentage of the latter. The absolute error Δxth overall had similar distributions and are shown in the supplement.

3. Results

3.1. IFCN relative-frequency method and Mills–Nithi two-threshold method

As both methods have been previously established in the field, only the high-level performance is described here, with more details and full statistics provided in the supplement (Tables S1S7).

For the IFCN relative-frequency methods (Figures 2 and S3S5), the number of stimuli until the method terminated and reported a threshold value for a given subject had a wide distribution. On the lower end, the methods stopped after reducing the stimulation amplitude by two or three steps below the starting amplitude, which could be as few as 15 to 20 stimuli for the five-out-of-ten methods or between 40 and 60 for the ten-out-of-twenty methods. On the other hand, some subjects required a larger number of stimuli, reaching over 100 or even 200 depending on the method. Estimates obtained with fewer stimuli overestimated the threshold due to false early fulfillment of the termination rule, whereas those with large numbers of stimuli leaned towards underestimation due to false continuation at an amplitude with response probability less than 50%. The errors were linearly dependent on the number of stimuli with coefficients of determination r2 more than 0.7, and overall had a positive bias (Figure S5, panels Bi and Bii; e.g., median, mean, and SD of 0.91%, 1.00%, and 1.75%, respectively, for δxth of the ten-out-of-twenty method with 1% MSO stepping size). Such linear relationship and bias were inherently determined by the binomial probability distribution to either terminate or continue (Figure S4; see method for calculating the probability in the Supplement), consistent with a previous argument against relative-frequency methods [43]. The number of stimuli drastically decreased by approximately 40% and 50% when the stepping size was increased from 1% to 2% MSO and the coarser five-out-of-ten rule was used instead of the ten-out-of-twenty rule for the relative frequency approximation, respectively, although the shape of the distribution was not affected qualitatively. These changes overall decreased the threshold accuracy (i.e., increased median, mean, and standard deviation of errors) as intuitively expected. However, the errors had similar ranges for both ten-out-of-twenty or five-out-of-ten methods with either step sizes. Therefore, it appears ineffective to use smaller step size and/or a larger number of stimuli at a given amplitude to improve the accuracy, as the threshold error of individual subjects does not necessarily decrease but the required number of pulses increases significantly.

Figure 2: Relative threshold errors of the IFNC relative-frequency and Mills–Nithi methods.

Figure 2:

The relative errors δxth are plotted against the number of stimuli required for all the virtual subjects as dots. Several statistics are shown, with median as white circle, interquartile range between first and third quartiles (Q1 and Q3) as thick error bars, and ranges extending to the lower and upper adjacent values—which are 1.5 times the interquartile range below Q1 and above Q3 and cover approximately ± 2.7 standard deviations for normally-distributed data—as thin error bars. A probability density estimate based on a normal kernel function is also shown with the contour lines. Linear regressions are shown for the IFCN relative-frequency methods as red lines. A. IFNC ten-out-of-twenty method. i. 1% MSO step size. ii. 2% MSO step size. B. IFNC five-out-of-ten method. i. 1% MSO step size. ii. 2% MSO step size. C. Original Mills–Nithi methods. i. 1% MSO step size. ii. 2% MSO step size. D. Modified Mills–Nithi methods. i. 1% MSO step size. ii. 2% MSO step size. Note different horizontal scale in panel Ai versus the others.

The Mills–Nithi methods (Figures 2, S3, and S5) had narrow distributions for the number of stimuli, requiring at least 20 stimuli but rarely exceeding 100. The numbers of stimuli for all four variations were on average similar to or smaller than the IFCN five-out-of-ten method with 1% MSO step size but larger than that with 2% MSO step size. In comparison, the IFCN relative-frequency method could be much faster for any individual subject due to its false early termination and/or if the starting point was already close to the threshold (i.e., steep slope near the threshold). The modified Mills–Nithi methods exploited the additional information obtained from the hotspot search but only reduced the number of stimuli by three or four on average compared to the original method using the same step size. Nevertheless, none of the Mills–Nithi methods resulted in extremely large numbers of stimuli. Due to the averaging of the upper and lower thresholds, the errors of the Mills–Nithi methods were independent of the number of stimuli r2<0.01 and, compared to the IFCN relative-frequency methods, had tighter distributions and were centered closer to zero. Compared to the modified methods, the original Mills–Nithi methods had wider error distributions and many outliers with large errors. Increasing the step size to 2% MSO for either the original or modified method reduced the number of stimuli on average by 14 or 15 steps and narrowed its distribution. Increasing the step size also widened the error distribution slightly.

3.2. Parametric estimation

Unlike the relative-frequency methods, the parametric estimation methods generated a threshold estimate and thus an error at every step (relative error δxth  in Figure 3A and absolute error Δxth in Figure S6A). The median δxth (Figure 3B) and the interquartile range (IQR) of δxth characterized how close the errors approach zero and how wide the errors were distributed. The IQR of δxth, and therefore the range between the lower and upper adjacent values (which were 1.5×IQR below and above the first and third quartile) as well, correlated well with the median δxth  : the ratio between these two statistics after a few steps was approximately 2 and was very stable (Figure S7A). Therefore, the median δxth captured the distribution of most errors, with the remaining outliers summarized as a percentage of the total number of subjects (Figure S7B). Besides the step-by-step distributions of errors, a few metrics were used to benchmark and compare the performance (Table 2). Estimation speed was quantified by the number of steps for a method to reach the same median δxth as the IFCN five-out-of-ten method with 1% MSO step size (median of 64 steps and median δxth of 1.6%). Whereas recent IFCN guidelines [28] recognized that the relative-frequency methods require ten-out-of-twenty trials to generate reliable results, the five-out-of-ten method is still widely used [81]-[84] and therefore chosen as the reference. For accuracy, the median δxth  was compared at 20 and 30 steps, which are respectively considered to produce a sufficiently accurate estimate for all subjects using existing ML-PEST (MLE-1) [41], [48] and commonly used as the upper limit to terminate adaptive thresholding procedures [53], [69], [70].

Figure 3: Relative threshold errors of the parametric estimation methods.

Figure 3:

For MLE methods, results from implicit (im.) maximization of the likelihood function are shown. A. Statistics are shown up to the seventieth steps as boxes-and-whiskers at the first step and every ten steps starting from the tenth step, with median as marker, interquartile range as thick vertical line, upper and lower adjacent values as whiskers (thin vertical lines), and outliers as dots. The lines show the median at every step. The boxes-and-whiskers for different methods are slightly offset for clarity. The log-lin-log mixed scale is used to better visualize the errors around zero, with linear scaling in the range of [−1%, 1%] and logarithmic scaling of amplitudes outside this range. The median, interquartile range, and range of adjacent values of the IFCN five-out-of-ten method with 2% MSO step size are shown as a circle and black error bars as in Figure 2 Bii for reference. B. The median δxth on logarithmic scale, with the inset showing the details in the first 30 steps (the gray area in the main panel).

Table 2.

Error convergence of parametric estimation and root-finding methods

Method Step reaching median |δxth| of 1.6% Median |δxth| at step 20 Median |δxth| at step 30 Optimal a0 (ratio to default value)
MLE-1a 20 (21) 1.57% (1.62%) 1.25% (1.26%)
MAP-1 27 1.96% 1.48%
MLE-2a 40 (39) 2.26% (2.27%) 1.85% (1.83%)
MAP-2 18 1.51% 1.21%
Bayesian 18 1.48% 1.198%
Bayesian-NP 37 2.37% 1.81%
Bayesian-TS 18 1.50% 1.196%
ACS-Hb 24 (18) 1.69% (1.51%) 1.46% (1.31%) ×1.6
DCS-Hc 17 (16) 1.45% (1.45%) 1.17% (1.18%) ×0.79
ACS-HAc 18 (18) 1.52% (1.51%) 1.280% (1.30%) ×0.79
DCS-HAb 22 (19) 1.67% (1.54%) 1.37% (1.24%) ×0.63
ACS-GAc 20 (20) 1.59% (1.58%) 1.45% (1.47%) ×0.79
DCS-GAc 22 (21) 1.68% (1.62%) 1.43% (1.44%) ×0.63
ACS-GHAb 19 (18) 1.53% (1.514%) 1.283% (1.29%) ×0.63
a

Results obtained with direct maximization of the likelihood followed by those using explicit evaluation in parentheses.

b

Results obtained with default a0 followed by those with optimal a0 for lowest median δxth at both step 20 and 30 in parentheses.

c

Results obtained with default a0 followed by those with optimal a0 for lowest median δxth at step 20 in parentheses, with default a0 being optimal for lowest median δxth at step 30.

The basic MLE method (MLE-1, Figure 3, dark blue lines with circle markers), which used a single-parameter MLE for threshold estimation and stepping [48], reached the same error level as the reference IFCN five-out-of-ten method with only 20 steps (Table 2), and the error further decreased by about 20% at 30 steps. Evaluating the likelihood function explicitly had little effect on the threshold estimation performance (Table 2, results in parentheses, and Figure S8) and computational runtime (Figure S9) compared to implicit maximization. Introducing prior information of the parameters of the population (MAP-1, Figure 3, light blue lines with square markers) did not improve the performance for the single-parameter estimation. On the contrary, with the empirical relationship s=0.07t from another population imposed on the parameter distribution of the virtual population in this study, MAP-1 could not fully utilize the prior distribution. Its performance was considerably worse than MLE-1 as the parameters of some subjects in our model were far from such a 0.07 relationship (Figure 1 Biv).

Compared to MLE-1, the two-parameter MLE method (MLE-2, Figure 3, dark red lines with circle markers) had a slower convergence and reached the median error level of the IFCN reference with more (40) pulses. The reduced speed of convergence reflected the exploration of the full parameter space: the likelihood function initially had a large plateau region, which reduced in size as more stimulation–response pairs were recorded. Only after a sufficient number of steps did the likelihood plateau narrow to generate a clear optimal estimate. Similar to MLE-1, the method for maximizing the likelihood function had little effect on performance for MLE-2 (Table 2, results in parentheses, and Figure S8). However, the explicit evaluation with two parameters resulted in a much higher computational runtime per step that was two order-of-magnitudes slower (Figure S9). Using a two-parameter MAP estimator for the threshold (MAP-2, Figure 3, orange lines with square markers) significantly improved the performance of MLE-2, reaching the error level of the IFCN five-out-of-ten method with only 18 steps, and had lower median δxth compared to either MLE-1 or MAP-1. As the spread parameter was no longer fixed to the threshold, MAP-2 could match the wide range of behaviors of the subjects in our model and improved the threshold estimate for all subjects. The prior information significantly influenced the likelihood landscape, rendering one parameter pair (t,s) in the otherwise flat maximum plateau unique. The computational runtime of MAP-1 and MAP-2, which used implicit evaluation, were slightly faster their implicit MLE counterparts (Figure S9).

The original Kontsevich-Tyler method (Bayesian, Figure 3, yellow lines with diamond markers) had the least bias in error distribution (Figure 3A), reached the IFCN five-out-of-ten method’s median δxth quickly at step 18 and had faster convergence for small number (< 20) of stimuli (Figure 3B). The Bayesian method achieved more accurate estimation early on by considering the entire function space of the probability density functions. Whereas ML and MAP estimators evaluated the underlying likelihood function in every step anew based on the same key equation, the Bayesian approach added information into the explicitly evaluated and stored distributions from the last iteration. However, such advantages disappeared with more pulses as the point estimator methods gathered sufficient information. Removing the prior population information (Bayesian-NP, Figure 3, purple lines with diamond markers) resulted in slower convergence and performance comparable to that of MLE-2 with a similar slow down comparing MAP-2 to MLE-2, showing that the prior population information narrowed the distribution of parameters in the probability space alike for the point and distribution estimators. Utilizing the threshold estimate for stepping (Bayesian-TS, Figure 3, green lines with diamond markers) had similar performance compared to the original method of minimizing the expected entropy, demonstrating that the threshold was at or very close to the stimulation amplitude that provided the maximum information gain. This provided further validation for choosing the threshold estimate as the next stimulation amplitude for the MLE and MAP methods, despite the difference of how the estimate was obtained—for the Bayesian methods, the threshold was estimated as the expected value based on its probability distribution (20), whereas for MLE and MAP methods the estimation came from maximizing the likelihood functions (12)–(13). The Bayesian methods were the most computationally costly and slowest (Figure S9), with runtime around four seconds per step and more than two order-of-magnitudes slower than the implicitly evaluated MLE/MAP methods. This was mostly due to updating of the entire probability distribution (14), as the runtime without the entropy calculation in Bayesian-TS was only slightly reduced and still on the same order-of-magnitude. The Bayesian methods were also highly sensitive to rounding-error propagation and sampling of the parameter space, which was a disadvantage compared to MLE and MAP methods.

3.3. Stochastic root-finding methods

The parameter-free stochastic root-finding methods were based on the threshold definition only and therefore had the important advantage over the parametric estimation approaches, which depended on a potentially biased or inaccurate model. Their simple stepping rule was also much faster to compute, with runtime per step on the millisecond scale. Like the parametric methods, the root-finding methods also generated a threshold estimate and an error at every step (relative error δxth in Figures 4 and S13A, and absolute error Δxth in Figures S10 and S13B). The ratio between the IQR of δxth and the median δxth (Figures S11A and S14A) was also centered on 2 after a few steps, but showed larger variations compared to those of parametric estimation methods.

Figure 4: Relative threshold errors of stochastic root-finding methods with default initial step size a0.

Figure 4:

Similar to Figure 3.

The error distributions of all analog root-finding methods (Figure 4, lines with circle markers) became negatively biased after a certain number of steps, i.e., the median was less than zero, the absolute values of the first quartile and lower adjacent value were larger than the third quartile and upper adjacent value, respectively, and there were more negative than positive outliers (Figures 4A and S10A). The median error of ACS-H took about 15 to become negative and did not reach a steady state at step 200. The process for the adaptive methods was much quicker and their median errors reached within ten steps the same steady state (−0.34% and −0.23% MSO for relative and absolute errors, respectively), which the non-adaptive ACS-H should also converge to eventually. The bias originated from the asymmetry of the log-transformed MEP distribution at the threshold [61, Fig. 2]. The 50 μV MEP threshold (which is the median of the MEP distribution according to the probability definition of stimulation threshold) was smaller than the average of the right-skewed distribution due to the nearby EMG noise floor (around 10 μV). Therefore, subthreshold responses generated a smaller Δy to increase a subthreshold stimulation amplitude towards the threshold. In contrast, suprathreshold MEPs can be orders-of-magnitudes larger than the threshold, resulting in large Δy that decreased the stimulation amplitude much lower than the threshold, especially during the early steps. The analog methods converged on the stimulation amplitude where the mean (log-transformed) MEP response equaled 50 μV, which typically occurred below the threshold (Figure 1Ai, statistics in Figure S15) The performance of analog methods could be improved using techniques for MEP detection below the EMG noise floor [63], [64] by recovering the full MEP distribution which should be more symmetric around 50 μV. Three of the analog methods (ACS-HA, ACS-GA, and ACS-GHA) with default parameters reached the accuracy level of the IFCN five-out-of-ten method with 1% MSO step size with only 20 steps or less (Figures 4B and Table 2), whereas ACS-H needed 24 steps.

The non-adaptive ACS-H (Figure 4, green lines) had a monotonically declining median δxth for the entire range of steps. The percentage of outliers (Figures S11B and S14B), however, increased with the steps, showing that the error did not improve for some subjects, which became a larger proportion of the total population as the errors of the other subjects in the population became more tightly distributed. The adaptive ACS-HA (Figure 4, orange lines) outperformed its non-adaptive counterpart by reaching a lower median δxth faster, but with distributions of δxth being more negatively biased and having larger proportion of outliers. The ACS-GA method (Figure 4, blue lines) with higher rate of convergence had a rapid exponential decay of the control sequence and did not further head for the threshold as soon as ai was too small for a notable influence. Its median δxth reached steady state at around 40 steps, albeit with a smaller portion of outliers as its distribution did not continue to tighten. In comparison, ACS-GHA (Figure 4, purple lines) switched from fast geometric convergence that provided the rough estimate to a. s. harmonic convergence and performed very similarly compared to ACS-HA in terms of median δxth and percentage of outliers.

Besides the convergence of the control sequence ai, the root-finding methods’ performance was also determined by the initial value a0 of the sequence. The default value of a0 (10) was chosen to approximate the typical slope of the recruitment curve but its definition was inevitably empirical and potentially suboptimal. Examining the performance as a function of the initial step size indeed revealed that this was the case for some of the root-finding methods. The change from overshoot and oscillatory behavior for large a0 towards lengthy one-sided convergence for smaller numbers were clearly visible (Figure S12, panels Ai–Gi). Some methods did not have one single optimal a0, however, as the minimum of the median δxth at different steps (Figure S12, panels Aii–Gii, black dashed line and dots) may not be obtained with the same a0. Instead of evaluating asymptotic behavior, we chose the optimal a0 that minimized the error at step 20 for which the default was not optimal and the performance indeed improved (Table 2, results in parentheses, and Figure S13). The optimal a0 for most methods was close to the default value, showing that using the slope of typical input–output curves (10) indeed provided an appropriate a0. For ACS-H, the optimal a0 also resulted in a negative bias of error distribution in just a few steps, similar to the other analog versions.

When the default a0 was used, the digital versions of the root-finding methods (Figure 4, darker lines with triangle markers) could asymptotically perform either better than (DCS-H and DCS-HA) or very similar to (DCS-GA) their analog counterpart in terms of median δxth (Figure 4B). As the root-finding approached the target, the difference between the actual and target responses became very small aiΔyi1. The step towards the target Δxi was therefore smaller for the analog version aiΔyi compared to the digital version aiδyi=ai and thus approached the target slower than the latter. It should be further noticed that all digital versions had mostly positive bias for δxth that decayed to zero, with the decay being especially quick for the two adaptive methods. The lack of asymptotic bias in contrast to the analog versions was due to the sign function of the digital versions equalizing the stepping towards threshold in both directions, regardless of the asymmetry of the MEP distribution. With optimal a0, the digital versions had a slower decay of the positive bias of the error distribution compared to those for default a0  (Figure S13Ai) but had improved convergence of δxth (Figure S13Aii).

4. Discussion

We examined a variety of existing and new TMS thresholding methods and compared their performance in a virtual population of 25,000 subjects. The methods included eight based on relative frequencies using either a single threshold (IFCN relative-frequency methods) or two thresholds (Mills–Nithi methods) and either 1% MSO or 2% MSO step size, seven parametric estimation methods, of which four used different combinations of single- or two-parameter ML or MAP estimation and three were based on Bayesian estimation that updated the probability density function itself, and seven stochastic root-finding methods utilizing control sequences with different convergence and stepping rules.

Although there were some trade-offs between different methods, several threshold estimation methods can be considered to have overall better performance within their class or even across all methods (Figures 5 and S16). All relative-frequency methods had similar error performance and should be avoided due to the large number of pulses required and the presence of inherent error bias or large error outliers (see Figure S5). The two-parameter MAP estimation demonstrated the best performance among the point estimators considering the speed of convergence, median error, and computational complexity. The Kontsevich–Tyler Bayesian estimation performed even better regarding the first two aspects, but with increased computational cost, which might mostly matter if implemented on embedded microcontrollers. The best stochastic root-finding methods using analog control sequences was ACS-HA, whereas DCS-H had the strongest performance among both analog and digital versions. Unsurprisingly, the best-performing novel methods as well as the previous single-parameter MLE-1 all outperformed the reference IFCN five-out-of-ten method (median 64 steps and 1.6% median δxth), requiring much fewer steps (within 20 steps, with the novel methods a few steps faster than MLE-1) to reach the same accuracy (Figure 5B, top). Compared to MLE-1, the best-performing novel methods also demonstrated better accuracy at 20 and 30 steps (Figures 5A and S16), which are typical numbers used to terminate the single-parameter MLE methods [41], [48], for example in ATH-tool [70].

Figure 5: Summary of best performing methods.

Figure 5:

Selected statistics of the best performing methods, with full statistics shown in Figure S16. The best parametric methods are two-parameter MAP estimation (MAP-2) and Bayesian estimation, with the single-parameter MLE method (MLE-1) given for comparison. The best stochastic root-finding methods is DCS-H, with both default and optimal (for 20 steps) a0 shown. The MAP-2 and Bayesian methods utilize prior information about the threshold distribution. A. Median δxth at steps 20 and 30. The dashed and dotted lines shows for reference the IFCN five-out-of-ten and ten-out-of-twenty methods with 1% MSO step size, respectively. B. The number of steps needed for a given method to outperform the reference IFCN five-out-of-ten method with 1% MSO step size (top) and the single-parameter MLE-1 at 20 steps (bottom).

Despite their wide range of behavior and difference in performance, all methods performed remarkably accurate in terms of threshold error on the population level. Either δxth  and Δxth  (or δxth and Δxth had mean or median on the level of 1% and 1% MSO, respectively. In comparison, the pulse amplitude variability of TMS machines themselves have exactness (deviation from nominal value) and precision (variation between pulses) in comparable ranges (1% to 3%) depending on the device [85]. The errors of individual subjects, however, had a large spread and may not fall within the ±5% criteria for safety [12], [42] for either relative-frequency methods (Figure S5) or more advanced estimation methods (Figures 3A, 4A, and S13Ai). For some methods, the estimation may not improve with increasing number of TMS pulses for certain subjects (e.g., ACS-H) or the entire population (e.g., ACS-GA) after a certain number of steps.

We analyzed both the errors and their absolute values, and provided detailed visualization and descriptive statistics of the entire distribution. In contrast, many previous studies quantified the absolute values of the relative error (δxth, simply referring to it as the relative error) and/or presented mean and standard deviation only [48], [54]. The use of absolute values for errors provided the convenience to compare a positive metric that decreased with stepping and allowed the use of logarithmic scale for visualization. Whereas the median of the absolute values can capture the spread of the errors very well—our results showed that the ratio between the IQR of the errors and the median of the errors’ absolute value was quite consistently around two for the methods examined—the absolute value of errors, however, could not capture the bias of the estimated thresholds or its symmetry around zero. Similarly, the mean and standard deviation did not capture outliers and skewness of the distributions, which were all important characteristics of the thresholding methods. For example, the asymmetric and positively-biased error distribution of the IFCN relative-frequency methods, particularly the five-out-of-ten method that is widely used in clinical practice, may lead to significant variability of the stimulation amplitude and a tendency to larger amplitudes in treatment. This exacerbates the risk of exceeding safety limits due to the overestimation of thresholds using movement observation instead of EMG measurement [25], as many clinics do. Similarly, the original Mills–Nithi methods had a large number of outliers with negative error, contradicting assertions about its reliability [42].

Our implementation of the various methods all started from a suprathreshold simulation amplitude with a minimum 95% response probability. This, however, is often not the case in practice. A lower starting amplitude could be used to accelerate the threshold search for some methods, especially for subjects with shallow input–output curves, whereas a default stimulator output is sometimes used as starting point for all subjects. The stimulation amplitudes of stochastic root-finding methods are constrained by previous pulses, and different starting points change the error convergence, require a different optimal initial step size a0, and favor analog methods and/or adaptive stepping over the non-adaptive DCS-H (results not shown). In contrast, the starting point does not affect the performance of parametric estimation methods on the population level, e.g., using a fixed 50% MSO amplitude (results not shown), as their stimulation amplitudes determined by the likelihood maximization are not constrained and can freely “jump” arbitrary distances from pulse to pulse. However, both of these situations risk convergence errors when a positive response is observed for a stimulator output well below the true threshold, especially early during the procedure [52], [53].

Existing estimation methods previously were tested using virtual subjects that had the same response model underlying the parameter estimation methods [48]. Such tests of estimators against their own estimation model can be misleading due to statistical bias. We therefore used an independently derived MEP model from the literature, which was previously trained to the statistics of experimental data. The response model used here, although more sophisticated than most other approaches, may still have limitations in representing all features and statistical properties of MEP responses due to the limited data on which it was based [32], [86]-[88]. In addition, complex temporal effects, such as inter-pulse correlation caused by stimulation-induced short-term plasticity and also stimulation-independent excitability fluctuations [45] may become important, particularly for short inter-pulse intervals of only a few seconds used in some labs. Accurate knowledge of the physiologic system, however, is essential for the thresholding problem. This does not only influence the design of a realistic testbed, but also directly determines, for example, how well the Gaussian model used in all parameter estimators suits the requirements or if another model, such as logistic distribution [40], [50], is needed in order to avoid bias.

The performance of some estimation methods demonstrated substantial improvement over existing methods. For example, the two-parameter MAP and Kontsevich–Tyler Bayesian estimation methods all outperformed the single-parameter MLE method. The improvement in accuracy for the Bayesian method, however, comes at a cost of increased runtime per step and larger memory requirement. While these costs may still be challenging for many microcontrollers depending on the specific application and potential implementation as an embedded version on a TMS device, they are readily manageable on personal computers which many recent TMS machines anyway include. The Bayesian method could become more appealing with improved implementation efficiency, for example by reducing the range and/or resolution of parameter space. All parametric estimators, however, had the potential weakness that the underlying model could be incorrect. Some of the parametric methods further relied on accurate prior information of the model parameters to achieve the presented accuracy and improvement over other methods. Such prior information on the population level could require significant effort to collect or may be completely unavailable.

The newly introduced family of non-parametric stochastic root-finding methods overall had similar or even lower errors compared to the parametric estimation methods. Comparing the best performing methods between the two classes, DCS-H was superior in terms of both speed and accuracy compared to the two-parameter MAP estimation and Kontsevich–Tyler Bayesian methods, which both required prior information. Furthermore, stochastic root-finding methods did not rely on potentially incorrect models and thus could work for subjects whose MEP response curves did not follow the Gaussian model. The rule for choosing and updating the step size ai was also simple enough to be performed by hand or a calculator and would not necessarily rely on installed software (e.g., [48], [70]). Hence, the root-finding methods are a promising addition to the available threshold methods that motivate further studies, improvements, and potential generalization to non-motor TMS thresholds where a response model may be unavailable. To enable the TMS community to use this method, we implemented DCS-H as an easy-to-use web-based tool SAMT (Stochastic Approximator of Motor Threshold [89]) using HTML/JavaScript, which is accessible via web browsers on desktop computers and mobile devices.

5. Conclusion

This work analyzed three classes and a total of 22 variations of TMS thresholding methods and compared their performance, including their accuracy and speed. As pointed out in previous critiques, the relative-frequency methods, such as five-out-of-ten, were far from optimal due to their large number of required stimuli, wide distribution of errors, and inherent error bias. Their accuracy was outperformed by parametric estimators and stochastic root-finding methods with a fraction of the stimuli. As the response model underlying parametric estimation methods did not always fit well with the virtual subject data, some parametric methods performed poorly for a small subset of subjects in the population. Including all model parameters in the estimation and incorporating prior information boosted performance. Stochastic root-finding methods were comparable or better in performance than parametric estimation methods, were much simpler to implement, and did not rely on an underlying model.

Supplementary Material

jneacf1ccsupp1.pdf

Acknowledgments

Research reported in this publication was supported by the U.S.A. National Institutes of Health under Award Numbers R01NS117405 and RF1MH124943. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funding agency. The authors thank Dr. Philip Whiting and Dr. Lari M. Koponen for helpful discussions. Computational support was provided by the Duke Compute Cluster.

S. M. G. conceived, supervised, and secured funding for the study, and developed the theoretical framework and original simulation code. B. W. contributed concepts, reconstructed, improved, and optimized the computational framework, and performed the simulations, data analysis, and visualization. A. V. P. contributed concepts, supervised, and secured funding and computational resources for the study. B. W. and S. M. G. wrote the manuscript, and all authors revised, commented on, and approved the final version of the manuscript. Preliminary results of this study were presented at the 5th International Brain Stimulation Meeting [90] (February, 2023, Lisbon, Portugal).

Conflicts of interests

S. M. Goetz and A. V. Peterchev are inventors on patents and patent applications on TMS technology. S. M. Goetz has previously received research funding from Magstim as well as royalties from Rogue Research. A. V. Peterchev has equity options in and serves on the scientific advisory board of Ampa and has received research funding, travel support, patent royalties, consulting fees, equipment loans, hardware donations, and/or patent application support from Rogue Research, Magstim, MagVenture, Neuronetics, BTL Industries, Magnetic Tides, Ampa, and Soterix Medical. B. Wang declares no relevant conflict of interest.

List of key abbreviations and mathematical symbols and notations

Abbreviations

TMS

Transcranial magnetic stimulation

MEP, EMG

Motor evoked potential, electromyography

MSO

Maximum stimulator output

IFCN

International Federation of Clinical Neurophysiology

PEST

Parameter estimation by sequential testing

ML, MLE

Maximum likelihood, maximum likelihood estimation

MAP

Maximum a-posterior (estimation)

ACS, DCS

Analog control sequence, digital control sequence (of stochastic root-finding)

a.s.

Almost sure (convergence)

Mathematical symbols and notations

x,x

Stimulation amplitude, list of stimuli

r,r

Response (binary), list of responses

i

Step number of parametric estimation and stochastic root-finding methods

VPP,Vth,Vi 

Voltage of MEP: peak-to-peak value, value for determining threshold, at step i

t,s,θ

Threshold, spread, and parameter set (of response probability model)

tMLE,tMAP 

Estimate of t parameter using MLE and MAP

a0,ai 

Step size of control sequence (stochastic root-finding methods): initial value, at step i

yi

Logarithm (base 10) of MEP voltage Vi (in units of Volts)

Δyi,δyi 

Analog and digital control sequence

xth,xth,GT 

Threshold amplitude: general, ground truth of model

xth,est,xth,i 

Threshold estimate: general, at step i (parametric estimation and stochastic root-finding methods

Δxth,δxth 

Absolute and relative threshold error

Appendix

Response model parameters

The individual recruitment parameters p1,j to p5,j followed normal and lognormal distributions [61]:

p1,j~𝒩(μ=0.5818,σ2=3.5705103),
p2,j~𝒩(μ=4.5323,σ2=0.12619),
lg(p3,jp4,jp5,j)~𝒩(μ=[3.64660.426391.6665],Σ=[7.20270.569740.197460.569744.74261021.51131020.197461.51131027.1771102]).

The intra-individual variability sources ϵxadd,ij and ϵymult,ij followed normal distributions with subject-specific standard deviation at each step :

ϵxadd ,ij~𝒩(μ=0,σ2=p6,j2),
ϵymult ,ij~𝒩(μ=7,σ2=p7,j2).

in which the standard deviations for subjects j followed lognormal distributions:

lg(p6,j)~𝒩(μ=0.96453,σ2=2.2759102),
lg(p7,j)~𝒩(μ=0.68827,σ2=2.3671102).

The additive noise ϵyadd,ij followed a generalized extreme-value (GEV) distribution, with subject-specific parameters

ϵyadd ,ij~GEV(μ=10p1,j,σ=1.551061.5p8,j,k=0.142p9,j),

where p8,j and p9,j were independent normal distributions 𝒩(0,1), and the GEV distribution had a probability density function of

fGEV=1σ(1+k(xμ)σ)(1+1k)exp((1+k(xμ)σ)1k)

The noiseless model had noise parameters ϵxadd,ij and ϵymult,ij set to zero and ϵyadd,ij=10p1,j, which approached the mean of the GEV for the parameters p1,j,p8,j, and p9,j.

Distribution of threshold and spread

The threshold t and spread s (Figure 1B) was fitted by a joint distribution

(tlg(s))~𝒩(μ=[0.658471.2989],Σ=[σt2ρσtσlg(s)ρσtσlg(s)σlg(s)2]),

with σt=8.970310-2,σlg(s)=2.757610-2, and the correlation coefficient ρ=0.130. This distribution served as p(θ) for the MAP methods and p0(θ) for the Bayesian methods.

Footnotes

Code availability statement

The MATLAB code of the thresholding methods is available at [67] (DOI: 10.5281/zenodo.6483601). SAMT is available to the community online at [89]: https://tms-samt.github.io.

1

The midpoint or threshold determines the horizontal location of the distribution and the slope at threshold or spread determines its shape, i.e., horizontal stretching. When referring to the distribution’s shape parameter, slope and spread are sometimes used interchangeably, although they are inversely related to each other.

2

Almost sure convergence, xi a.s.x, is defined as convergence with probability one: P(limixi=x)=1.

References

  • [1].Goetz SM and Deng Z-D, “The development and modelling of devices and paradigms for transcranial magnetic stimulation,” Int. Rev. Psychiatry, vol. 29, no. 2, pp. 115–145, Mar. 2017, doi: 10.1080/09540261.2017.1305949. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Koponen LM and Peterchev AV, “Transcranial Magnetic Stimulation: Principles and Applications,” in Neural Engineering, He B, Ed., Cham: Springer International Publishing, 2020, pp. 245–270. doi: 10.1007/978-3-030-43395-6_7. [DOI] [Google Scholar]
  • [3].Zhong G, Yang Z, and Jiang T, “Precise Modulation Strategies for Transcranial Magnetic Stimulation: Advances and Future Directions,” Neurosci. Bull, vol. 37, no. 12, pp. 1718–1734, Dec. 2021, doi: 10.1007/s12264-021-00781-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Aberra AS, Wang B, Grill WM, and Peterchev AV, “Simulation of transcranial magnetic stimulation in head model with morphologically-realistic cortical neurons,” Brain Stimulat, vol. 13, no. 1, pp. 175–189, Jan. 2020, doi: 10.1016/j.brs.2019.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Di Lazzaro V et al. , “I-wave origin and modulation,” Brain Stimulat, vol. 5, no. 4, pp. 512–525, Oct. 2012, doi: 10.1016/j.brs.2011.07.008. [DOI] [PubMed] [Google Scholar]
  • [6].Kallioniemi E, Säisänen L, Könönen M, Awiszus F, and Julkunen P, “On the estimation of silent period thresholds in transcranial magnetic stimulation,” Clin. Neurophysiol, vol. 125, no. 11, pp. 2247–2252, Nov. 2014, doi: 10.1016/j.clinph.2014.03.012. [DOI] [PubMed] [Google Scholar]
  • [7].Halawa I, Shirota Y, Neef A, Sommer M, and Paulus W, “Neuronal tuning: Selective targeting of neuronal populations via manipulation of pulse width and directionality,” Brain Stimulat, Apr. 2019, doi: 10.1016/j.brs.2019.04.012. [DOI] [PubMed] [Google Scholar]
  • [8].Lang N et al. , “Stimulus intensity and coil characteristics influence the efficacy of rTMS to suppress cortical excitability,” Clin. Neurophysiol, vol. 117, no. 10, pp. 2292–2301, Oct. 2006, doi: 10.1016/j.clinph.2006.05.030. [DOI] [PubMed] [Google Scholar]
  • [9].Sasaki T et al. , “Modulation of motor learning by a paired associative stimulation protocol inducing LTD-like effects,” Brain Stimulat, vol. 11, no. 6, pp. 1314–1321, Nov. 2018, doi: 10.1016/j.brs.2018.07.054. [DOI] [PubMed] [Google Scholar]
  • [10].Sommer M et al. , “Half sine, monophasic and biphasic transcranial magnetic stimulation of the human motor cortex,” Clin. Neurophysiol, vol. 117, no. 4, pp. 838–844, Apr. 2006, doi: 10.1016/j.clinph.2005.10.029. [DOI] [PubMed] [Google Scholar]
  • [11].Turi Z, Lenz M, Paulus W, Mittner M, and Vlachos A, “Selecting stimulation intensity in repetitive transcranial magnetic stimulation studies: A systematic review between 1991 and 2020,” Eur. J. Neurosci, vol. 53, no. 10, pp. 3404–3415, 2021, doi: 10.1111/ejn.15195. [DOI] [PubMed] [Google Scholar]
  • [12].Rossi S, Hallett M, Rossini PM, and Pascual-Leone A, “Safety, ethical considerations, and application guidelines for the use of transcranial magnetic stimulation in clinical practice and research,” Clin. Neurophysiol, vol. 120, no. 12, pp. 2008–2039, Dec. 2009, doi: 10.1016/j.clinph.2009.08.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Rossi S et al. , “Safety and recommendations for TMS use in healthy subjects and patient populations, with updates on training, ethical and regulatory issues: Expert Guidelines,” Clin. Neurophysiol, vol. 132, pp. 269–306, Oct. 2020, doi: 10.1016/j.clinph.2020.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Wassermann EM, “Risk and safety of repetitive transcranial magnetic stimulation: report and suggested guidelines from the International Workshop on the Safety of Repetitive Transcranial Magnetic Stimulation, June 5–7, 1996,” Electroencephalogr. Clin. Neurophysiol. Potentials Sect, vol. 108, no. 1, pp. 1–16, Jan. 1998, doi: 10.1016/S0168-5597(97)00096-8. [DOI] [PubMed] [Google Scholar]
  • [15].Antal A, Nitsche MA, Kincses TZ, Lampe C, and Paulus W, “No correlation between moving phosphene and motor thresholds: a transcranial magnetic stimulation study,” Neuroreport, vol. 15, no. 2, pp. 297–302, Feb. 2004, doi: 10.1097/00001756-200402090-00017. [DOI] [PubMed] [Google Scholar]
  • [16].Deblieck C, Thompson B, Iacoboni M, and Wu AD, “Correlation between motor and phosphene thresholds: A transcranial magnetic stimulation study,” Hum. Brain Mapp, vol. 29, no. 6, pp. 662–670, Jun. 2008, doi: 10.1002/hbm.20427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Goetz SM and Kammer T, “Neuronavigation,” in The Oxford Handbook of TMS, Wassermann EM, Walsh V, Peterchev AV, Ziemann U, Lisanby S, and Siebner H, Eds., 2nd ed. Oxford: Oxford University Press, 2022. [Online]. Available: 10.1093/oxfordhb/9780198832256.013.7 [DOI] [Google Scholar]
  • [18].Noda Y et al. , “Single-Pulse Transcranial Magnetic Stimulation-Evoked Potential Amplitudes and Latencies in the Motor and Dorsolateral Prefrontal Cortex among Young, Older Healthy Participants, and Schizophrenia Patients,” J. Pers. Med, vol. 11, no. 1, Art. no. 1, Jan. 2021, doi: 10.3390/jpm11010054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Poorganji M et al. , “Differentiating transcranial magnetic stimulation cortical and auditory responses via single pulse and paired pulse protocols: A TMS-EEG study,” Clin. Neurophysiol, vol. 132, no. 8, pp. 1850–1858, Aug. 2021, doi: 10.1016/j.clinph.2021.05.009. [DOI] [PubMed] [Google Scholar]
  • [20].Stewart LM, Walsh V, and Rothwell JC, “Motor and phosphene thresholds: a transcranial magnetic stimulation correlation study,” Neuropsychologia, vol. 39, no. 4, pp. 415–419, Jan. 2001, doi: 10.1016/S0028-3932(00)00130-5. [DOI] [PubMed] [Google Scholar]
  • [21].Tremblay S et al. , “Clinical utility and prospective of TMS–EEG,” Clin. Neurophysiol, vol. 130, no. 5, pp. 802–844, May 2019, doi: 10.1016/j.clinph.2019.01.001. [DOI] [PubMed] [Google Scholar]
  • [22].Wassermann EM, “Variation in the response to transcranial magnetic brain stimulation in the general population,” Clin. Neurophysiol, vol. 113, no. 7, pp. 1165–1171, Jul. 2002, doi: 10.1016/S1388-2457(02)00144-X. [DOI] [PubMed] [Google Scholar]
  • [23].Balslev D, Braet W, McAllister C, and Miall RC, “Inter-individual variability in optimal current direction for transcranial magnetic stimulation of the motor cortex,” J. Neurosci. Methods, vol. 162, no. 1, pp. 309–313, May 2007, doi: 10.1016/j.jneumeth.2007.01.021. [DOI] [PubMed] [Google Scholar]
  • [24].Pridmore S, Filho JAF, Nahas Z, Liberatos C, and George MS, “Motor threshold in transcranial magnetic stimulation: A comparison of a neurophysiological method and a visualization of movement method,” J. ECT, vol. 14, no. 1, pp. 25–27, 1998, doi: 10.1097/00124509-199803000-00004. [DOI] [PubMed] [Google Scholar]
  • [25].Westin GG, Bassi BD, Lisanby SH, and Luber B, “Determination of motor threshold using visual observation overestimates transcranial magnetic stimulation dosage: Safety implications,” Clin. Neurophysiol, vol. 125, no. 1, pp. 142–147, Jan. 2014, doi: 10.1016/j.clinph.2013.06.187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [26].Cotovio G et al. , “Day-to-day variability in motor threshold during rTMS treatment for depression: Clinical implications,” Brain Stimulat, vol. 14, no. 5, pp. 1118–1125, Sep. 2021, doi: 10.1016/j.brs.2021.07.013. [DOI] [PubMed] [Google Scholar]
  • [27].Rossini PM et al. , “Non-invasive electrical and magnetic stimulation of the brain, spinal cord and roots: basic principles and procedures for routine clinical application. Report of an IFCN committee,” Electroencephalogr. Clin. Neurophysiol, vol. 91, no. 2, pp. 79–92, Aug. 1994, doi: 10.1016/0013-4694(94)90029-9. [DOI] [PubMed] [Google Scholar]
  • [28].Rossini PM et al. , “Non-invasive electrical and magnetic stimulation of the brain, spinal cord, roots and peripheral nerves: Basic principles and procedures for routine clinical and research application. An updated report from an I.F.C.N. Committee,” Clin. Neurophysiol, vol. 126, no. 6, pp. 1071–1107, Jun. 2015, doi: 10.1016/j.clinph.2015.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [29].Kobayashi M and Pascual-Leone A, “Transcranial magnetic stimulation in neurology,” Lancet Neurol, vol. 2, no. 3, pp. 145–156, Mar. 2003, doi: 10.1016/S1474-4422(03)00321-1. [DOI] [PubMed] [Google Scholar]
  • [30].Goetz SM and Peterchev AV, “A model of variability in brain stimulation evoked responses,” in 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Aug. 2012, pp. 6434–6437. doi: 10.1109/EMBC.2012.6347467. [DOI] [PubMed] [Google Scholar]
  • [31].Jung NH, Delvendahl I, Kuhnke NG, Hauschke D, Stolle S, and Mall V, “Navigated transcranial magnetic stimulation does not decrease the variability of motor-evoked potentials,” Brain Stimulat, vol. 3, no. 2, pp. 87–94, Apr. 2010, doi: 10.1016/j.brs.2009.10.003. [DOI] [PubMed] [Google Scholar]
  • [32].Kiers L, Cros D, Chiappa KH, and Fang J, “Variability of motor potentials evoked by transcranial magnetic stimulation,” Electroencephalogr. Clin. Neurophysiol. Potentials Sect, vol. 89, no. 6, pp. 415–423, Dec. 1993, doi: 10.1016/0168-5597(93)90115-6. [DOI] [PubMed] [Google Scholar]
  • [33].Capaday C, “On the variability of motor-evoked potentials: experimental results and mathematical model,” Exp. Brain Res, vol. 239, no. 10, pp. 2979–2995, Oct. 2021, doi: 10.1007/s00221-021-06169-7. [DOI] [PubMed] [Google Scholar]
  • [34].Goetz SM, Luber B, Lisanby SH, and Peterchev AV, “A Novel Model Incorporating Two Variability Sources for Describing Motor Evoked Potentials,” Brain Stimulat, vol. 7, no. 4, pp. 541–552, Jul. 2014, doi: 10.1016/j.brs.2014.03.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35].Goetz SM et al. , “Isolating two sources of variability of subcortical stimulation to quantify fluctuations of corticospinal tract excitability,” Clin. Neurophysiol, vol. 138, pp. 134–142, Jun. 2022, doi: 10.1016/j.clinph.2022.02.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [36].Groppa S et al. , “A practical guide to diagnostic transcranial magnetic stimulation: Report of an IFCN committee,” Clin. Neurophysiol, vol. 123, no. 5, pp. 858–882, May 2012, doi: 10.1016/j.clinph.2012.01.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [37].Badran BW et al. , “Are EMG and visual observation comparable in determining resting motor threshold? A reexamination after twenty years,” Brain Stimulat, vol. 12, no. 2, pp. 364–366, Mar. 2019, doi: 10.1016/j.brs.2018.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [38].Anderson BS and George MS, “A Review of Studies Comparing Methods for Determining Transcranial Magnetic Stimulation Motor Threshold: Observation of Movement or Electromyography Assisted,” J. Am. Psychiatr. Nurses Assoc, vol. 15, no. 5, pp. 304–313, Oct. 2009, doi: 10.1177/1078390309347372. [DOI] [PubMed] [Google Scholar]
  • [39].Mills KR and Nithi KA, “Corticomotor threshold to magnetic stimulation: Normal values and repeatability,” Muscle Nerve, vol. 20, no. 5, pp. 570–576, 1997, doi: . [DOI] [PubMed] [Google Scholar]
  • [40].Tranulis C et al. , “Motor threshold in transcranial magnetic stimulation: comparison of three estimation methods,” Neurophysiol. Clin. Neurophysiol, vol. 36, no. 1, pp. 1–7, Jan. 2006, doi: 10.1016/j.neucli.2006.01.005. [DOI] [PubMed] [Google Scholar]
  • [41].Awiszus F, “On relative frequency estimation of transcranial magnetic stimulation motor threshold,” Clin. Neurophysiol, vol. 123, no. 11, pp. 2319–2320, Nov. 2012, doi: 10.1016/j.clinph.2012.04.014. [DOI] [PubMed] [Google Scholar]
  • [42].Awiszus F, “Fast estimation of transcranial magnetic stimulation motor threshold: is it safe?,” Brain Stimulat, vol. 4, no. 1, pp. 58–59, Jan. 2011, doi: 10.1016/j.brs.2010.09.004. [DOI] [PubMed] [Google Scholar]
  • [43].Awiszus F, “Using relative frequency estimation of transcranial magnetic stimulation motor threshold does not allow to draw any conclusions about true threshold,” Clin. Neurophysiol, vol. 125, no. 6, pp. 1285–1286, Jun. 2014, doi: 10.1016/j.clinph.2013.09.046. [DOI] [PubMed] [Google Scholar]
  • [44].Mishory A et al. , “The Maximum-likelihood Strategy for Determining Transcranial Magnetic Stimulation Motor Threshold, Using Parameter Estimation by Sequential Testing Is Faster Than Conventional Methods With Similar Precision,” J. ECT, vol. 20, no. 3, pp. 160–165, Sep. 2004, doi: 10.1097/00124509-200409000-00007. [DOI] [PubMed] [Google Scholar]
  • [45].Möller C, Arai N, Lücke J, and Ziemann U, “Hysteresis effects on the input-output curve of motor evoked potentials,” Clin. Neurophysiol, vol. 120, no. 5, pp. 1003–1008, May 2009, doi: 10.1016/j.clinph.2009.03.001. [DOI] [PubMed] [Google Scholar]
  • [46].Nielsen J, “Logarithmic Distribution of Amplitudes of Compound Muscle Action Potentials Evoked by Transcranial Magnetic Stimulation,” J. Clin. Neurophysiol, vol. 13, no. 5, pp. 423–434, Sep. 1996, doi: 10.1097/00004691-199609000-00005. [DOI] [PubMed] [Google Scholar]
  • [47].Kallioniemi E, Awiszus F, Pitkänen M, and Julkunen P, “Fast acquisition of resting motor threshold with a stimulus-response curve - possibility or hazard for transcranial magnetic stimulation applications?,” Clin. Neurophysiol. Pract, Dec. 2021, doi: 10.1016/j.cnp.2021.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [48].Awiszus F, “Chapter 2 TMS and threshold hunting,” in Supplements to Clinical Neurophysiology, Paulus W, Tergau F, Nitsche MA, Rothwell JG, Ziemann U, and Hallett M, Eds., in Transcranial Magnetic Stimulation and Transcranial Direct Current Stimulation, vol. 56. Elsevier, 2003, pp. 13–23. doi: 10.1016/S1567-424X(09)70205-3. [DOI] [PubMed] [Google Scholar]
  • [49].Pentland A, “Maximum likelihood estimation: the best PEST,” Percept. Psychophys, vol. 28, pp. 377–9, 1980, doi: 10.3758/BF03204398. [DOI] [PubMed] [Google Scholar]
  • [50].Treutwein B, “Adaptive psychophysical procedures,” Vision Res, vol. 35, no. 17, pp. 2503–2522, 1995, doi: 10.1016/0042-6989(95)00016-X. [DOI] [PubMed] [Google Scholar]
  • [51].Strasburger H and Treutwein B, “Fitting the psychometric function,” Percept. Psychophys, vol. 61, no. 1, pp. 87–106, 1999, doi: 10.3758/BF03211951. [DOI] [PubMed] [Google Scholar]
  • [52].Goetz S, Whiting P, and Peterchev A, “Threshold estimation with transcranial magnetic stimulation: algorithm comparison,” Clin. Neurophysiol, vol. 122, p. S197, Jun. 2011, doi: 10.1016/S1388-2457(11)60712-8. [DOI] [Google Scholar]
  • [53].Koponen LM and Peterchev AV, “Preventing misestimation of transcranial magnetic stimulation motor threshold with MTAT 2.0,” Brain Stimul. Basic Transl. Clin. Res. Neuromodulation, vol. 15, no. 5, pp. 1073–1076, Sep. 2022, doi: 10.1016/j.brs.2022.07.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [54].Qi F, Wu AD, and Schweighofer N, “Fast estimation of transcranial magnetic stimulation motor threshold,” Brain Stimulat, vol. 4, no. 1, pp. 50–57, Jan. 2011, doi: 10.1016/j.brs.2010.06.002. [DOI] [PubMed] [Google Scholar]
  • [55].Fisher RA, “On the mathematical foundations of theoretical statistics,” Philos. Trans. R. Soc. Lond. Ser. Contain. Pap. Math. Phys. Character, vol. 222, no. 594–604, pp. 309–368, 1922, doi: 10.1098/rsta.1922.0009. [DOI] [Google Scholar]
  • [56].Proakis JG and Salehi M, Digital communications, 5th ed. New York: McGraw-Hill, 2008. [Google Scholar]
  • [57].Kontsevich LL and Tyler CW, “Bayesian adaptive estimation of psychometric slope and threshold,” Vision Res, vol. 39, no. 16, pp. 2729–2737, Aug. 1999, doi: 10.1016/S0042-6989(98)00285-5. [DOI] [PubMed] [Google Scholar]
  • [58].Paninski L, “Asymptotic Theory of Information-Theoretic Experimental Design,” Neural Comput, vol. 17, no. 7, pp. 1480–1507, Jul. 2005, doi: 10.1162/0899766053723032. [DOI] [PubMed] [Google Scholar]
  • [59].Wichmann FA and Hill NJ, “The psychometric function: I. Fitting, sampling, and goodness of fit,” Percept. Psychophys, vol. 63, no. 8, pp. 1293–1313, Nov. 2001, doi: 10.3758/BF03194544. [DOI] [PubMed] [Google Scholar]
  • [60].Bawa P and Lemon RN, “Recruitment of motor units in response to transcranial magnetic stimulation in man.,” J. Physiol, vol. 471, no. 1, pp. 445–464, 1993, doi: 10.1113/jphysiol.1993.sp019909. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [61].Goetz SM, Alavi SMM, Deng Z, and Peterchev AV, “Statistical Model of Motor-Evoked Potentials,” IEEE Trans. Neural Syst. Rehabil. Eng, vol. 27, no. 8, pp. 1539–1545, Aug. 2019, doi: 10.1109/TNSRE.2019.2926543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [62].Mathias JP, Barsi GI, van de Ruit M, and Grey MJ, “Rapid Acquisition of the Transcranial Magnetic Stimulation Stimulus Response Curve,” Brain Stimulat, vol. 7, no. 1, pp. 59–65, Jan. 2014, doi: 10.1016/j.brs.2013.08.003. [DOI] [PubMed] [Google Scholar]
  • [63].Goetz SM, Li Z, and Peterchev AV, “Noninvasive Detection of Motor-Evoked Potentials in Response to Brain Stimulation Below the Noise Floor—How Weak Can a Stimulus Be and Still Stimulate,” in 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, USA: IEEE, 2018, p. 4. doi: 10.1109/EMBC.2018.8512765. [DOI] [PubMed] [Google Scholar]
  • [64].Li Z, Peterchev AV, Rothwell JC, and Goetz SM, “Detection of motor-evoked potentials below the noise floor: rethinking the motor stimulation threshold,” J. Neural Eng, 2022, doi: 10.1088/1741-2552/ac7dfc. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [65].Lai TL, “Stochastic approximation,” Ann. Stat, vol. 31, no. 2, pp. 391–406, 2003, doi: 10.1214/aos/1051027873. [DOI] [Google Scholar]
  • [66].Robbins H and Monro S, “A Stochastic Approximation Method,” Ann. Math. Stat, vol. 22, no. 3, pp. 400–407, Sep. 1951, doi: 10.1214/aoms/1177729586. [DOI] [Google Scholar]
  • [67].Wang B, “TMS_Thresholding,” Apr. 13, 2023. 10.5281/zenodo.6483601 [DOI] [Google Scholar]
  • [68].Kallioniemi E, Awiszus F, Pitkänen M, and Julkunen P, “Fast acquisition of resting motor threshold with a stimulus–response curve – Possibility or hazard for transcranial magnetic stimulation applications?,” Clin. Neurophysiol. Pract, vol. 7, pp. 7–15, Jan. 2022, doi: 10.1016/j.cnp.2021.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [69].Awiszus F and Borckardt JJ, “TMS Motor Threshold Assessment Tool (MTAT),” 2011. https://www.clinicalresearcher.org/software.htm
  • [70].Julkunen P, “Mobile Application for Adaptive Threshold Hunting in Transcranial Magnetic Stimulation,” IEEE Trans. Neural Syst. Rehabil. Eng, vol. 27, no. 8, pp. 1504–1510, Aug. 2019, doi: 10.1109/TNSRE.2019.2925904. [DOI] [PubMed] [Google Scholar]
  • [71].Rothwell JC, Hallett M, Berardelli A, Eisen A, Rossini P, and Paulus W, “Magnetic stimulation: motor evoked potentials. The International Federation of Clinical Neurophysiology,” Electroencephalogr. Clin. Neurophysiol. Suppl, vol. 52, pp. 97–103, 1999. [PubMed] [Google Scholar]
  • [72].Delyon B and Juditsky A, “Accelerated stochastic approximation,” SIAM J. Optim, vol. 3, no. 4, pp. 868–881, 1993, doi: 10.1137/0803045. [DOI] [Google Scholar]
  • [73].Wang B, Peterchev AV, and Goetz SM, “Three Novel Methods for Determining Motor Threshold with Transcranial Magnetic Stimulation Outperform Conventional Procedures.” bioRxiv, p. 2022.06.26.495134, May 16, 2023. doi: 10.1101/2022.06.26.495134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [74].Nemirovski A, Juditsky A, Lan G, and Shapiro A, “Robust stochastic approximation approach to stochastic programming,” SIAM J. Optim, vol. 19, no. 4, pp. 1574–1609, 2009, doi: 10.1137/070704277. [DOI] [Google Scholar]
  • [75].Borckardt JJ, Nahas Z, Koola J, and George MS, “Estimating Resting Motor Thresholds in Transcranial Magnetic Stimulation Research and Practice: A computer Simulation Evaluation of Best Methods,” J. ECT, vol. 22, no. 3, pp. 169–175, Sep. 2006, doi: 10.1097/01.yct.0000235923.52741.72. [DOI] [PubMed] [Google Scholar]
  • [76].Peterchev AV, Goetz SM, Westin GG, Luber B, and Lisanby SH, “Pulse width dependence of motor threshold and input-output curve characterized with controllable pulse parameter transcranial magnetic stimulation,” Clin. Neurophysiol, vol. 124, no. 7, pp. 1364–1372, Jul. 2013, doi: 10.1016/j.clinph.2013.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [77].Peterchev AV, Jalinous R, and Lisanby SH, “A Transcranial Magnetic Stimulator Inducing Near-Rectangular Pulses With Controllable Pulse Width (cTMS),” IEEE Trans. Biomed. Eng, vol. 55, no. 1, pp. 257–266, Jan. 2008, doi: 10.1109/TBME.2007.900540. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [78].Goetz SM, “Magnetic Neurostimulation from a Physical Perspective,” Dissertation, Technical University of Munich, Munich, Germany, 2012. [Online]. Available: http://mediatum.ub.tum.de?id=1079891
  • [79].Goetz SM et al. , “Enhancement of Neuromodulation with Novel Pulse Shapes Generated by Controllable Pulse Parameter Transcranial Magnetic Stimulation,” Brain Stimulat, vol. 9, no. 1, pp. 39–47, Jan. 2016, doi: 10.1016/j.brs.2015.08.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [80].Awiszus F, “Of thresholds and ‘hot spots’. Quo vadis transcranial magnetic stimulation?,” Clin. Neurophysiol, vol. 125, no. 12, pp. 2451–2453, Dec. 2014, doi: 10.1016/j.clinph.2014.03.010. [DOI] [PubMed] [Google Scholar]
  • [81].Weise K, Numssen O, Thielscher A, Hartwigsen G, and Knösche TR, “A novel approach to localize cortical TMS effects,” NeuroImage, vol. 209, p. 116486, Apr. 2020, doi: 10.1016/j.neuroimage.2019.116486. [DOI] [PubMed] [Google Scholar]
  • [82].Schaworonkow N, Triesch J, Ziemann U, and Zrenner C, “EEG-triggered TMS reveals stronger brain state-dependent modulation of motor evoked potentials at weaker stimulation intensities,” Brain Stimulat, vol. 12, no. 1, pp. 110–118, Jan. 2019, doi: 10.1016/j.brs.2018.09.009. [DOI] [PubMed] [Google Scholar]
  • [83].Benussi A et al. , “Classification accuracy of TMS for the diagnosis of mild cognitive impairment,” Brain Stimulat, vol. 14, no. 2, pp. 241–249, Mar. 2021, doi: 10.1016/j.brs.2021.01.004. [DOI] [PubMed] [Google Scholar]
  • [84].Laakso I, Murakami T, Hirata A, and Ugawa Y, “Where and what TMS activates: Experiments and modeling,” Brain Stimulat, vol. 11, no. 1, pp. 166–174, Jan. 2018, doi: 10.1016/j.brs.2017.09.011. [DOI] [PubMed] [Google Scholar]
  • [85].Zacharias LR, Peres ASC, Souza VH, Conforto AB, and Baffa O, “Method to assess the mismatch between the measured and nominal parameters of transcranial magnetic stimulation devices,” J. Neurosci. Methods, vol. 322, pp. 83–87, Jul. 2019, doi: 10.1016/j.jneumeth.2019.03.021. [DOI] [PubMed] [Google Scholar]
  • [86].Darling WG, Wolf SL, and Butler AJ, “Variability of motor potentials evoked by transcranial magnetic stimulation depends on muscle activation,” Exp. Brain Res, vol. 174, no. 2, pp. 376–385, Sep. 2006, doi: 10.1007/s00221-006-0468-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [87].Orth M and Rothwell JC, “The cortical silent period: intrinsic variability and relation to the waveform of the transcranial magnetic stimulation pulse,” Clin. Neurophysiol, vol. 115, no. 5, pp. 1076–1082, May 2004, doi: 10.1016/j.clinph.2003.12.025. [DOI] [PubMed] [Google Scholar]
  • [88].Roy Choudhury K, Boyle L, Burke M, Lombard W, Ryan S, and McNamara B, “Intra subject variation and correlation of motor potentials evoked by transcranial magnetic stimulation,” Ir. J. Med. Sci, vol. 180, no. 4, pp. 873–880, Dec. 2011, doi: 10.1007/s11845-011-0722-4. [DOI] [PubMed] [Google Scholar]
  • [89].Wang B, Koponen LM, Goetz SM, and Peterchev AV, “SAMT: Stochastic Approximator of Motor Threshold,” 2023. https://tms-samt.github.io/ (accessed Apr. 14, 2023).
  • [90].Wang B, Peterchev A, and Goetz S, “Comparison of thresholding methods for transcranial magnetic stimulation of primary motor cortex,” Brain Stimul. Basic Transl. Clin. Res. Neuromodulation, vol. 16, no. 1, p. 219, Jan. 2023, doi: 10.1016/j.brs.2023.01.308. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

jneacf1ccsupp1.pdf

RESOURCES