Skip to main content
eLife logoLink to eLife
. 2024 Aug 27;12:RP88762. doi: 10.7554/eLife.88762

Auditory confounds can drive online effects of transcranial ultrasonic stimulation in humans

Benjamin R Kop 1,, Yazan Shamli Oghli 2,, Talyta C Grippe 2,, Tulika Nandi 3,, Judith Lefkes 1, Sjoerd W Meijer 1, Soha Farboud 1, Marwan Engels 1, Michelle Hamani 2, Melissa Null 3, Angela Radetz 3, Umair Hassan 3, Ghazaleh Darmani 2, Andrey Chetverikov 1,4, Hanneke EM den Ouden 4,, Til Ole Bergmann 3,5,, Robert Chen 2,, Lennart Verhagen 1
Editors: Jonas Obleser6, Jonathan Roiser7
PMCID: PMC11349300  PMID: 39190585

Abstract

Transcranial ultrasonic stimulation (TUS) is rapidly emerging as a promising non-invasive neuromodulation technique. TUS is already well-established in animal models, providing foundations to now optimize neuromodulatory efficacy for human applications. Across multiple studies, one promising protocol, pulsed at 1000 Hz, has consistently resulted in motor cortical inhibition in humans (Fomenko et al., 2020). At the same time, a parallel research line has highlighted the potentially confounding influence of peripheral auditory stimulation arising from TUS pulsing at audible frequencies. In this study, we disentangle direct neuromodulatory and indirect auditory contributions to motor inhibitory effects of TUS. To this end, we include tightly matched control conditions across four experiments, one preregistered, conducted independently at three institutions. We employed a combined transcranial ultrasonic and magnetic stimulation paradigm, where TMS-elicited motor-evoked potentials (MEPs) served as an index of corticospinal excitability. First, we replicated motor inhibitory effects of TUS but showed through both tight controls and manipulation of stimulation intensity, duration, and auditory masking conditions that this inhibition was driven by peripheral auditory stimulation, not direct neuromodulation. Furthermore, we consider neuromodulation beyond driving overall excitation/inhibition and show preliminary evidence of how TUS might interact with ongoing neural dynamics instead. Primarily, this study highlights the substantial shortcomings in accounting for the auditory confound in prior TUS-TMS work where only a flip-over sham and no active control was used. The field must critically reevaluate previous findings given the demonstrated impact of peripheral confounds. Furthermore, rigorous experimental design via (in)active control conditions is required to make substantiated claims in future TUS studies. Only when direct effects are disentangled from those driven by peripheral confounds can TUS fully realize its potential for research and clinical applications.

Research organism: Human

Introduction

Noninvasive neuromodulation is a powerful tool for causal inference that strengthens our understanding of the brain and holds great clinical potential (Bergmann and Hartwigsen, 2021; Bestmann and Walsh, 2017). Transcranial ultrasonic stimulation (TUS) is a particularly promising non-invasive brain stimulation technique, overcoming current limitations with high spatial resolution and depth range (Darmani et al., 2022). The efficacy of TUS is well-established in cell cultures and animal models (Menz et al., 2013; Mohammadjavadi et al., 2019; Murphy et al., 2022; Tyler et al., 2008; Tyler et al., 2018; Yoo et al., 2022), and emerging evidence for the neuromodulatory utility of TUS in humans has been reported for both cortical and subcortical structures (cortical: Butler et al., 2022; Lee et al., 2016; Liu et al., 2021; Zeng et al., 2022; subcortical: Ai et al., 2016; Cain et al., 2021; Nakajima et al., 2022). Especially now, at this foundational stage of TUS in humans, it is essential to converge on protocols that maximize the specificity and efficacy of stimulation (Folloni et al., 2019; Verhagen et al., 2019).

Motor inhibitory effects of a commonly applied 1000 Hz pulsed TUS protocol are among the most robust and replicable human findings (Fomenko et al., 2020; Legon et al., 2018b; Xia et al., 2021). Here, by concurrently applying transcranial magnetic stimulation (TMS), modulation of corticospinal excitability is indexed by motor-evoked potentials (MEPs). However, the mechanism by which TUS evokes motor inhibition has remained under debate (Xia et al., 2021).

Recent studies in both animal and human models demonstrate how electrophysiological and behavioral outcomes of TUS can be elicited by nonspecific auditory activation rather than direct neuromodulation (Airan and Butts Pauly, 2018; Braun et al., 2020; Guo et al., 2018; Sato et al., 2018). Indeed, there is longstanding knowledge of the auditory confound accompanying pulsed TUS (Gavrilov and Tsirulnikov, 2012). However, this confound has only recently garnered attention, prompted by a pair of rodent studies demonstrating indirect auditory activation induced by TUS (Guo et al., 2022; Sato et al., 2018). Similar effects have been observed in humans, where exclusively auditory effects were captured with EEG measures (Braun et al., 2020). These findings are particularly impactful given that nearly all TUS studies employ pulsed protocols, from which the pervasive auditory confound emerges (Johnstone et al., 2021).

Indirect effects of stimulation are not unique to TUS, as transcranial magnetic and electric stimulation are also associated with auditory and somatosensory confounds. Indeed, the field of non-invasive brain stimulation as a whole depends on controlling for these confounding factors when present, to unveil the specificity of the neuromodulatory effects (Conde et al., 2019; Duecker et al., 2013; Polanía et al., 2018; Siebner et al., 2022). However, prior online TUS-TMS studies, including those exploring optimal neuromodulatory parameters to inform future work, have considered some but not all necessary conditions to control for the salient auditory confound elicited by a 1000 Hz pulsed protocol (Fomenko et al., 2020; Legon et al., 2018b; Xia et al., 2021).

In this multicenter study, we quantified the impact of the auditory confound to disentangle direct neuromodulatory and indirect auditory contributions to motor inhibitory effects of TUS. To this end, we substantially improved upon prior TUS-TMS studies implementing solely flip-over sham by including both (in)active controls and multiple sound-sham conditions. Furthermore, we investigated dose-response effects through the administration of multiple stimulus durations, stimulation intensities, and individualized simulations of intracranial intensity. Additionally, we considered the possibility that online TUS might not drive a global change in the excitation/inhibition balance but instead might interact with ongoing neural dynamics by introducing state-dependent noise. Finally, we interrogated sound-driven effects through modulation of auditory confound volume, duration, pitch, and auditory masking. We show that motor inhibitory effects of TUS are spatially nonspecific and driven by sound-cued preparatory motor inhibition. However, we do find preliminary evidence that TUS might introduce dose- and state-dependent neural noise to the dynamics of corticospinal excitability. The present study highlights the importance of carefully constructed control conditions to account for confounding factors while exploring and refining TUS as a promising technique for human neuromodulation.

Results

Motor cortical inhibition is not specific to on-target TUS

We first corroborate previous reports of MEP suppression following 500ms of TUS applied over the hand motor area (Experiments I-III; Fomenko et al., 2020; Legon et al., 2018b; Xia et al., 2021). A LMM revealed significantly lower MEP amplitudes following on-target TUS as compared to baseline for Experiment I (b=–0.14, SE = 0.06, t(11) = –2.23, p=0.047), Experiment II (b=–0.18, SE = 0.04, t(26) = –4.82, p=6⋅10–5), and Experiment III (b=–0.22, SE = 0.07, t(15) = –3.08, p=0.008).

However, corticospinal inhibition from baseline was also observed following control conditions. LMMs revealed significant attenuation of MEP amplitude following active control stimulation of the right-hemispheric face motor area (Experiment I: b=–0.12, SE = 0.05, t(11) = –2.29, p=0.043; Experiment II: b=–0.22, SE = 0.04, t(26) = –5.60, p=7⋅10–6), as well as after inactive control stimulation of the white matter ventromedial to the left-hemispheric hand motor area (Experiment IV: b=–0.14; SE = 0.04; t(11) = –3.09; p=0.010). The same effect was observed following sound-only sham (Experiment I: b=–0.14; SE = 0.05; t(11) = –3.18; p=0.009; Experiment II: b=–0.22; SE = 0.04; t(26) = –5.38; p=1⋅10–5; Experiment III: 500 ms-1kHz; b=–0.24; SE = 0.08; t(15) = –2.86; p=0.012). These results suggest a spatially non-specific effect of TUS that is related to the auditory confound (Figure 1).

Figure 1. Non-specific motor inhibitory effects of transcranial ultrasonic stimulation (TUS).

Figure 1.

A significant suppression of motor-evoked potentials (MEP) amplitude relative to baseline (gray) was observed for on-target TUS (green), but also for stimulation of a control region (cyan), and presentation of a sound alone (sound-sham; blue) indicating a spatially non-specific and sound-driven effect on motor cortical excitability. There were no significant differences between on-target and control conditions. Bar plots depict condition means, error bars represent standard errors, clouds indicate the distribution over participants, and points indicate individual participants. Square-root corrected MEP amplitudes are depicted for Experiments I, II, and IV, and Relative MEP amplitude is depicted for Experiment III (see Methods). *p<0.05, **p<0.01, ***p<0.001.

No dose-response effects of TUS on corticospinal inhibition

We further tested for direct ultrasonic neuromodulation by investigating a potential dose-response effect of TUS intensity (Isppa) on motor cortical excitability. First, we applied TUS at multiple free-water stimulation intensities (Figure 2C). In Experiment I, a linear mixed model with the factor ‘intensity’ (32.5/65 W/cm2) did not reveal a significant effect of different on-target TUS intensities on motor excitability (F(1,11) = 0.47, p=0.509, ηp2 = 0.04). In Experiment II, a linear mixed model with the factors ‘stimulation site’ (on-target/active control), ‘masking’ (no mask/masked), and ‘intensity’ (6.35/19.06 W/cm2) similarly did not reveal an effect of stimulation intensity (F(1,50) = 1.29, p=0.261, ηp2 = 0.03). Importantly, there was no effect of stimulation site (F(1,168) = 1.75, p=0.188, ηp2 = 0.01), nor any significant interactions (all p-values >0.1; all ηp2 < 0.06). These results provide neither evidence for spatially specific neuromodulation when directly comparing stimulation sites, nor evidence for a dose-response relationship within the range of applied intensities.

Figure 2. No significant dose-response effects of transcranial ultrasonic stimulation (TUS).

Figure 2.

(A) Acoustic (top) and thermal (bottom) simulations for a single subject in Experiment II. The acoustic simulation depicts the estimated pulse-average intensity (Ipa) above a 0.15 W/cm2 lower bound, with the dotted line indicating the full-width half-maximum of the pressure. The thermal simulation depicts the maximum estimated temperature rise. See Appendix 5 for more information on simulations . (B) On-target TUS motor-evoked potentials (MEP) amplitude as a percentage of active control MEP amplitude against simulated intracranial intensities at the two applied free-water intensities: 6.35 W/cm2 (top) and 19.06 W/cm2 (bottom). The shaded area represents the 95% CI, points represent individual participants. No significant intracranial dose-response relationship was observed. (C) There is no significant effect of free-water stimulation intensity on MEP amplitude. Values are expressed as a percentage of baseline MEP amplitude (square root corrected). Remaining conventions are as in Figure 1. (D) Temporal autocorrelation, operationalized as the slope of the linear regression between trial t and its preceding baseline trial t-1, differed significantly as a function of stimulation site and intensity for masked trials. Individual points represent the differential autocorrelation compared to the active control site. Autocorrelation was not modulated during baseline or sound-only sham, but was significantly higher for on-target TUS at 6.35 W/cm2, and significantly lower for on-target TUS at 19.06 W/cm2 compared to active control TUS. *p<0.05, **p<0.01, ***p<0.001.

However, it is likely that the effectiveness of TUS depends primarily on realized intracranial intensity, which we estimated with individualized 3D simulations (Figure 2A). Yet, testing the relationship between estimated intracranial intensity and MEP amplitude change following on-target TUS similarly did not yield evidence for a dose-response effect (Figure 2B, Appendix 1).

Prior work has primarily focused on probing facilitatory or inhibitory effects on corticospinal excitability. Here, we also explored an alternative: how TUS might introduce noise to ongoing neural dynamics, rather than a directional modulation of excitability. Indeed, human TUS studies have often failed to show a global change in behavioral performance, instead finding TUS effects primarily around the perception threshold where noise might drive stochastic resonance (Butler et al., 2022; Legon et al., 2018a). Whether the precise principles of stochastic resonance generalize from the perceptual domain to the current study is an open question, but it is known that neural noise can be introduced by brain stimulation (van der Groen and Wenderoth, 2016). It is likely that this noise is state-dependent and might not exceed the dynamic range of the intra-subject variability (Silvanto et al., 2007). Therefore, in an exploratory analysis, we exploited the natural structure in corticospinal excitability that exhibits as a strong temporal autocorrelation in MEP amplitude. Specifically, we tested how strongly the MEP on test trial t is predicted by the previous baseline trial t-1. As such, we quantified state-dependent autocorrelation between baseline MEP amplitude and MEP amplitude following on-target TUS, active control TUS, and sound-sham conditions (Appendix 2). In brief, we found a significant interaction between the previous baseline (t-1), stimulation site (on-target/active control), and intensity (6.35/19.06 W/cm2; F(1,30) = 12.10, p=0.002, ηp2 = 0.28) during masked trials. This interaction exhibited as increased autocorrelation for on-target TUS compared to active control TUS at 6.35 W/cm2 (i.e. lower TUS-induced noise; F(1,1287)=13.43, p=3⋅10–4, ηp2 = 0.01), and reduced autocorrelation at 19.06 W/cm2 (i.e. higher noise; F(1,1282)=5.76, p=0.017, ηp2 = 4⋅10–3; Figure 2D). This effect was not only dependent upon intensity and stimulation site, but also dependent on the presence of auditory masking. As such, the effect was also observed in a four-way interaction of the previous baseline, site, intensity, and masking (Appendix 2). These preliminary results might suggest that ultrasound stimulation can interact with ongoing neural dynamics by introducing temporally specific noise, rather than biasing the overall excitation/inhibition balance beyond its natural variation, but further work specifically designed to detect such effects is required.

Audible differences between stimulation sites do not underlie nonspecific inhibition

Stimulation over two separate sites could evoke distinct perceptual experiences arising from bone-conducted sound (Braun et al., 2020). To account for possible audibility differences between stimulation of on-target and active control sites in Experiments I and II, we also tested these conditions in the presence of a time-locked masking stimulus (Figure 3). Following Experiment II, we additionally assessed the blinding efficacy of our masking stimuli, finding that the masking stimulus effectively reduced participant’s the ability to determine whether TUS was administered to approximately chance level (Appendix 3).

Figure 3. No effects of time-locked masking.

Figure 3.

There were no significant effects of time-locked masking, indicating that audible differences between stimulation sites did not obscure or explain the absence of direct neuromodulation. Conventions are as in Figures 1 and 2.

In Experiment I, a linear mixed model with factors ‘masking’ (no mask/masked) and ‘stimulation site’ (on-target/active control) did not reveal a significant effect of masking (F(1,11) = 0.01, p=0.920, ηp2 = 1⋅10–5), stimulation site (F(1,11) = 0.15, p=0.703, ηp2 = 0.01), or their interaction (F(1,11) = 1⋅10–3, p=0.971, ηp2 = 1⋅10–4). Similarly, in Experiment II, the linear mixed model described in the previous section revealed no significant main effect of masking (F(1,30) = 1.68, p=0.205, ηp2 = 0.05), nor any interactions (all p-values >0.1; all ηp2 < 0.06). These results indicate that an underlying specific neuromodulatory effect of TUS was not being obscured by audible differences between stimulation sites.

Sound-driven effects on corticospinal excitability

Duration and pitch

Prior research has shown that longer durations of TUS significantly inhibited motor cortical excitability (i.e. ≥400 ms; Fomenko et al., 2020), while shorter durations did not. In Experiment I, we applied on-target, active control, and sound-sham conditions at shorter and longer durations to probe this effect. When directly comparing these conditions at different stimulus durations (100/500 ms), no evidence for an underlying neuromodulatory effect of TUS was observed, in line with our aforementioned findings. Instead, a linear mixed model with factors ‘condition’ (on-target/active control/sound-sham) and ‘stimulus duration’ (100/500 ms) revealed only a significant main effect of (auditory) stimulus duration, where longer stimulus durations resulted in stronger MEP attenuation (F(1,11) = 10.07, p=0.009, ηp2 = 0.48). There was no significant effect of condition (F(2,11) = 1.30, p=0.311, ηp2 = 0.19), nor an interaction between stimulus duration and condition (F(2,11) = 0.65, p=0.543, ηp2 = 0.11). These results further demonstrate that the auditory confound and its timing characteristics, rather than ultrasonic neuromodulation, underlies the observed inhibition of motor cortical excitability (Figure 4A).

Figure 4. Sound-driven effects on corticospinal excitability.

Figure 4.

(A) Longer (auditory) stimulus durations resulted in lower motor-evoked potential (MEP) amplitudes, regardless of transcranial ultrasonic stimulation (TUS) administration, indicating a sound-duration-dependency of motor inhibitory outcomes (Exp. I). (B) A significant effect of auditory stimulus duration was also observed in Experiment III. (C) The pitch of auditory stimuli also affected MEPs, where lower amplitudes were observed following a 1 kHz tone compared to a 12 kHz tone. There was no significant effect of TUS. Conventions are as in Figures 1 and 2C. *p<0.05, **p<0.01.

We further tested auditory effects in Experiment III, where we administered sound-sham stimuli at four combinations of duration and pitch. A LMM with factors ‘duration’ (500/700 ms) and ‘pitch’ (1/12 kHz) revealed significantly lower MEPs following 500 ms auditory stimuli (Figure 4; duration: F(1,15) = 7.12, p=0.017, ηp2 = 0.32; pitch: F(1,15) = 0.02, p=0.878, ηp2 = 2⋅10–3; interaction: F(1,15) = 2.23, p=0.156, ηp2 = 0.13), supporting the role of auditory stimulus timing in perturbation of MEP amplitude.

Subsequently, ultrasonic stimulation was also administered alongside these four auditory stimuli. Here, an LMM with factors ‘auditory stimulus duration’ (500/700 ms), ‘pitch’ (1/12 kHz), and ‘ultrasonic stimulation’ (yes/no) revealed no significant effect of auditory stimulus duration in contrast to the first test (F(1,15) = 0.44, p=0.517, ηp2 = 0.03). However, a 1 kHz pitch resulted in significantly lower MEP amplitudes than a 12 kHz pitch (Figure 4; F(1,15) = 4.94, p=0.042, ηp2 = 0.25). Importantly, we find no evidence for ultrasonic neuromodulation, where both on-target TUS and sound-sham reduced MEP amplitude from baseline (Figure 1), and where applying on-target TUS did not significantly affect MEP amplitude as compared to sound-sham (F(1,15) = 0.42, p=0.526, ηp2 = 0.03; Figure 4). We observed a nonsignificant trend for the interaction between ‘ultrasonic stimulation’ and ‘auditory stimulus duration’ (F(1,15) = 4.22, p=0.058, ηp2 = 0.22). No trends were observed for the remaining interactions between these three factors (all ηp2 < 0.06, p>0.3). Taken together, these results do not provide evidence for direct ultrasonic neuromodulation but support the influence of auditory stimulation characteristics on motor cortical excitability.

TUS audibility and confound volume

In Experiment IV, we applied TUS to an inactive target – the white matter ventromedial to the left-hemispheric hand motor area – both with and without a continuous auditory masking stimulus. MEP amplitudes did not significantly differ in baseline conditions regardless of whether a continuous sound was being played (b=0.03, SE = 0.06, t(11) = 0.52, p=0.616), indicating that continuous auditory stimulation alone might not be sufficient to inhibit MEP amplitude.

We additionally applied stimulation at multiple intensities to isolate the effect of auditory confound volume. A linear mixed model with factors ‘masking’ (no mask/masked) and ‘intensity’ (4.34/8.69/10.52 Wcm–2) with a random intercept and slope for each factor revealed a significant interaction (F(2,4038)=3.43, p=0.033, ηp2 = 2⋅10–3) and an accompanying effect of ‘masking’ with lesser MEP attenuation when stimulation was masked (F(1,11) = 11.84, p=0.005, ηp2 = 0.52; Figure 5). Follow-up comparisons revealed significantly less attenuation for masked stimulation at 4.34 W/cm2 intensity (F(1,11) = 13.02, p=0.004, ηp2 = 0.55), and a nonsignificant trend for the higher intensities (8.69 W/cm2: F(1,11) = 3.87, p=0.077, ηp2 = 0.27; 10.52 W/cm2: F(1,11) = 3.47, p=0.089, ηp2 = 0.24). In direct comparisons to baseline, all conditions resulted in a significant inhibition of MEP amplitude (all t<–3.36, all p<0.007), with the exception of continuously masked stimulation at the lowest volume, with an intensity of 4.34 W/cm2 Isppa (b=–0.06, SE = 0.03, t(11) = –2.04, p=0.065).

Figure 5. Sound-driven effects on corticospinal excitability.

Figure 5.

Less motor-evoked potentials (MEP) attenuation was measured during continuous masking, particularly for lower stimulation intensities (i.e. auditory confound volumes), pointing towards a role of transcranial ultrasonic stimulation (TUS) audibility in MEP attenuation. *p<0.05.

The data indicate that continuous masking reduces motor inhibition, likely by minimizing the audibility of TUS, particularly when applied at a lower stimulation intensity (i.e. auditory confound volume). The remaining motor inhibition observed during masked trials likely owes to, albeit decreased, persistent audibility of TUS during masking. Indeed, MEP attenuation in the masked conditions descriptively scales with participant reports of audibility. This points towards the role of auditory confound volume in motor inhibition (Appendix 4). Nevertheless, one could instead argue that evidence for direct neuromodulation is seen here. This is unlikely for a number of reasons. First, white matter contains a lesser degree of mechanosensitive ion channel expression and there is evidence that neuromodulation of these tracts may occur primarily in the thermal domain (Guo et al., 2022; Sorum et al., 2021). Second, Experiment IV lacks sufficient inferential power in the absence of an additional control and must, therefore, be interpreted in tandem with Experiments I-III. These experiments revealed no evidence for direct neuromodulation using equivalent or higher stimulation intensities and directly targeting gray matter while also using multiple control conditions. Therefore, we propose that persistent motor inhibition during masked trials owes to the continued, though reduced, audibility of the confound (Appendix 4). However, future work including an additional control (site) is required to definitively disentangle these alternatives.

Preparatory cueing of TMS

We find that MEP attenuation results from auditory stimulation rather than direct neuromodulation. Two putative mechanisms through which sound cuing may drive motor inhibition have been proposed, positing either that explicit cueing of TMS timing results in compensatory processes that drive MEP reduction (Capozio et al., 2021; Tran et al., 2021), or suggesting the evocation of a startle response that leads to global inhibition (Fisher et al., 2004; Furubayashi et al., 2000; Ilic et al., 2011; Kühn et al., 2004; Wessel and Aron, 2013). Critically, we can dissociate between these theories by exploring the temporal dynamics of MEP attenuation. One would expect a startle response to habituate over time, where MEP attenuation would be reduced during startling initial trials, followed by a normalization throughout the course of the experiment. Alternatively, if temporally contingent sound-cueing of TMS drives inhibition, MEP amplitudes should decrease over time as the relative timing of TUS and TMS is being learned, followed by a stabilization at a decreased MEP amplitude once this relationship has been learned.

In Experiments I and II, linear mixed models with ‘trial number’ as a predictor show significant changes in MEP amplitude throughout the experiment, pointing to a learning effect. Specifically, in Experiment I, a significant reduction in MEP amplitude was observed across the first 10 trials where a 500 ms stimulus was delivered (b=–0.04, SE = 0.01, t(11) = –2.88, p=0.015), followed by a stabilization in subsequent blocks (b = –2⋅10–4, SE = 3⋅10–4, t(11) = –0.54, p=0.601). This same pattern was observed in Experiment II, with a significant reduction across the first 20 trials (b=–0.01, SE = 3⋅10–3, t(26) = –4.08, p=4⋅10–4), followed by stabilization (b=6⋅10–5, SE = 1⋅10–4, t(26) = 0.46, p=0.650; Figure 6). The data suggest that the relative timing of TUS and TMS is learned across initial trials, followed by a stabilization at a decreased MEP amplitude once this relationship has been learned. These results could reflect auditory cueing of TMS that leads to a compensatory expectation-based reduction of motor excitability.

Figure 6. Auditory cueing of transcranial magnetic stimulation (TMS).

Figure 6.

There was a significant reduction in motor-evoked potential (MEP) amplitude when participants were first presented with a 500 ms stimulus (initial trials) in Experiment I (left) and Experiment II (right), followed by a stabilization of MEP amplitude during the rest of the experiment (following trials), indicating a learning process in which TUS acts as a cue that signals the onset of TMS. The solid line depicts the loess regression fit, and the shaded area represents the 95% confidence interval.

Discussion

In this study, we show the considerable impact of auditory confounds during audibly pulsed TUS in humans. We employed improved control conditions compared to prior work across four experiments, one preregistered, at three independent institutions. Here, we disentangle direct neuromodulatory and indirect auditory contributions during ultrasonic neuromodulation of corticospinal excitability. While we corroborated motor inhibitory effects of online TUS (Fomenko et al., 2020; Legon et al., 2018b; Xia et al., 2021), we demonstrated that this inhibition also occurs with stimulation of a control region or presentation of a sound alone, suggesting that the auditory confound rather than direct ultrasonic neuromodulation drives inhibition. Furthermore, no direct neuromodulatory effects on overall excitability were observed, regardless of stimulation timing, intensity, or masking. However, we note that an exploratory investigation of temporal dynamics indicated ultrasound might introduce noise to the neural system. Importantly, we found convincing evidence that characteristics of auditory stimuli do globally affect motor excitability, where auditory cueing of TMS pulse timing can affect measures of corticospinal excitability. This underscores the importance of explicit cueing in TMS experimental design. Most importantly, our results call for a reevaluation of earlier findings following audible TUS, and highlight the importance of suitable controls in experimental design (Bergmann and Hartwigsen, 2021; Siebner et al., 2022).

No evidence for direct neuromodulation by TUS

Prior studies have highlighted sound-driven effects of TUS in behavioral and electrophysiological research (Airan and Butts Pauly, 2018; Braun et al., 2020; Guo et al., 2018; Johnstone et al., 2021; Sato et al., 2018). Here, we assessed whether the auditory confound of a conventional 1000 Hz pulsed protocol might underlie motor inhibitory effects, which are among the most robust and replicable human findings (Fomenko et al., 2020; Legon et al., 2018b; Xia et al., 2021). While we successfully replicated this inhibitory effect, we found the same inhibition following stimulation of a motor control site (contralateral, active) and stimulation of a white-matter control site (ipsilateral, inactive; Figure 1). This contrasts with a prior TUS-TMS study which found that TUS of the contralateral hand motor area did not change motor cortical excitability (Xia et al., 2021). Indeed, in all direct comparisons between on-target and control stimulation, no differences in excitability were observed, pointing towards a spatially nonspecific effect of TUS. Considering further inhibitory effects following the administration of an auditory stimulus alone, the data suggest that online TUS motor inhibition is largely driven by the salient auditory confound, rather than spatially specific and direct neuromodulation. However, an exploratory analysis that tested for effects beyond a global shift in excitation-inhibition balance revealed that TUS might interact with ongoing neural dynamics by introducing dose-dependent noise (Figure 2D).

We found no evidence of a dose-response relationship between TUS intensity (Isppa) and motor inhibition when applying stimulation at a wide range of intensities, nor when testing the relationship between simulated intracranial intensities and changes in excitability (Figure 2A–C). Similarly, the administration of a time-locked auditory masking stimulus that effectively reduced TUS detection rates did not provide evidence of direct effects being obscured by audible differences between conditions (Figure 3, Appendix 3). Taken together, this study presents no evidence for direct and spatially specific TUS inhibition of motor excitability when applying a clearly audible protocol, despite using improved control conditions, higher stimulation intensities, and a larger sample size than prior studies (Fomenko et al., 2020; Legon et al., 2018b; Xia et al., 2021). Building on these results, the current challenge is to develop efficacious neuromodulatory protocols with minimal auditory interference. Efforts in this direction are already underway (Mohammadjavadi et al., 2019; Nakajima et al., 2022; Zeng et al., 2022).

Sound-cued motor inhibition

Until now, it was unclear how TUS induced motor inhibition in humans. Here, we show that this inhibition is caused by peripheral auditory stimulation. It is well-known that MEPs are sensitive to both sensory and psychological factors (Duecker et al., 2013). For example, several studies find MEP attenuation following a startling auditory stimulus (Fisher et al., 2004; Furubayashi et al., 2000; Ilic et al., 2011; Kühn et al., 2004; Wessel and Aron, 2013), and have demonstrated the impact of stimulus duration and volume on this inhibition (Furubayashi et al., 2000). It is possible that a similar mechanism is at play for audible TUS protocols. Indeed, we observed modulation of motor cortical excitability dependent upon the characteristics of auditory stimuli, including their duration and timing (Figure 4A and B), their pitch/frequency (Figure 4C), and whether the confound was audible in general, including perceived volume (Figure 5, Appendix 4).

One possible interpretation of the observed MEP attenuation is that the auditory confound acts as a salient cue to predict the upcoming TMS pulse. Prediction-based attenuation has been reported in both sensory and motor domains (Ford et al., 2007; Tran et al., 2021). For example, MEPs are suppressed when the timing of a TMS pulse can be predicted by a warning cue (Capozio et al., 2021; Tran et al., 2021). In the current experimental setup, participants could also learn the relative timing of the auditory stimulus and the TMS pulse. Indeed, we observe MEP attenuation emerge across initial trials as participants learn when to expect TMS, until a stable (i.e. learned) state is reached (Figure 6). Moreover, no motor inhibition was observed when TUS onset was inaudible or when stimulation timing was potentially too fast to function as a predictive cue (100 ms). Taken together, a parsimonious explanation is expectation-based inhibition of TMS-induced MEPs. This inhibitory response might either reflect the inhibition of competing motor programs – a component of motor preparation – or a homeostatic process anticipating the TMS-induced excitation (Capozio et al., 2021; Tran et al., 2021).

Limitations

The precise biomolecular and neurophysiological mechanisms underlying ultrasonic neuromodulation remain under steadily progressing investigation (Weinreb and Moses, 2022; Yoo et al., 2022). A shared interpretation is that mechano-electrophysiological energy transfer is proportional to acoustic radiation force, and thus proportional to stimulation intensity. Accordingly, one could argue that the TUS dose in the present study could have been insufficient to evoke direct neuromodulation. Indeed, despite the applied intensities exceeding prior relevant human work (Fomenko et al., 2020; Legon et al., 2018b; Xia et al., 2021) the total applied neuromodulatory doses are relatively low as compared to, for example, repetitive TUS protocols (rTUS) in animal work (Folloni et al., 2019; Verhagen et al., 2019) or recent human studies (Nakajima et al., 2022).

Alternatively, insufficient neural recruitment could be attributed to stimulation parameters other than intensity. If so, the absence of direct neuromodulation across these experiments might not generalize to parameters outside the tested set. For example, while we replicated and extended prior work targeting the hand motor area at ~30 mm from the scalp (Fomenko et al., 2020; Legon et al., 2018b; Xia et al., 2021), other studies have suggested that the optimal stimulation depth to engage the hand motor area may be more superficial (Osada et al., 2022; Siebner et al., 2022).

One might further argue that the TMS hotspot provides insufficient anatomical precision to appropriately target the underlying hand muscle representation with TUS. The motor hotspot may not precisely overly the cortical representation of the assessed muscle due to the increased coil-cortex distance introduced by the TUS transducer. This distance, and the larger TMS coils required to evoke consistent MEPs, results in a broad electric field that is substantially larger than the TUS beam width (e.g. 6 mm for 250 kHz; Fomenko et al., 2020; Legon et al., 2018b). Thus, it is possible that a transducer aligned with the center of the TMS coil may not be adequate. Nevertheless, we note that previous work utilizing a similar targeting approach has effectively induced changes in corticospinal motor excitability (Zeng et al., 2022). We also note that our stimulation depth and targeting procedures were comparable to all prior TUS-TMS studies, and that our simulations confirmed targeting (Figure 2A, Appendix 5). In summary, our main finding that the auditory confound drove motor inhibition in the present study, and likely had an impact in previous studies, holds true.

Considerations and future directions

Crucially, our results do not provide evidence that TUS is globally ineffective at inducing neuromodulation. While the present study and prior research highlight the confounding role of indirect auditory stimulation during pulsed TUS, there remains strong evidence for the efficacy of ultrasonic stimulation in animal work when auditory confounds are accounted for (Mohammadjavadi et al., 2019), or in controlled in-vitro systems such as an isolated retina, brain slices, or neuronal cultures in which the auditory confound carries no influence (Menz et al., 2013; Tyler et al., 2018).

It follows that where an auditory confound could be expected, appropriate control conditions are critical. These controls could involve stimulating a control region, and/or including a matched sound-only sham. In parallel, or perhaps alternatively, the impact of this confound can be mitigated in several ways. First, we recommend that the influence of auditory components be considered in transducer design and selection. Second, masking the auditory confound can help blind participants to experimental conditions. Titrating auditory mask quality per participant to account for intra- and inter-individual differences in subjective perception of the auditory confound would be beneficial. Here, the method chosen for mask delivery must be considered. While bone-conducting headphones align with the bone conduction mechanism of the auditory confound, they might not deliver sound as clearly as in-ear headphones or speakers. Nevertheless, the latter two rely on air-conducted sound. Notably, in-ear headphones could even amplify the perceived volume of the confound by obstructing the ear canal. Importantly, even when using masking stimuli, auditory stimulation could still influence cognitive task performance, among other measures. Alternative approaches could circumvent auditory confounds by testing deaf subjects, or perhaps more practically by ramping the ultrasonic pulse to minimize or even eliminate the auditory confound. This approach still requires validation and will only be relevant for protocols with pulses of sufficient duration. Here, one can expect that the experimental control required to account for auditory confounds might also hold for alternative peripheral effects, such as somatosensory confounds. Longer pulse durations are common in offline rTUS paradigms (Zeng et al., 2022), with more opportunity for inaudible pulse shaping and the added benefit of separating the time of stimulation from that of measurement. However, appropriate control conditions remain central to making inferences on interventional specificity.

Conclusion

Transcranial ultrasonic stimulation is rapidly gaining traction as a promising neuromodulatory technique in humans. For TUS to reach its full potential we must identify robust and effective stimulation protocols. Here, we demonstrate that one of the most reliable findings in the human literature – online motor cortical inhibition during a 1000 Hz pulsed protocol – primarily stems from an auditory confound rather than direct neuromodulation. Instead of driving overall inhibition, we found preliminary evidence that TUS might introduce noise to ongoing neural dynamics. Future research must carefully account for peripheral confounding factors to isolate the direct neuromodulatory effects of TUS, thereby enabling the swift and successful implementation of this technology in both research and clinical settings.

Materials and methods

Participants

This multicenter study comprised of four experiments conducted independently across three institutions. Experiment I (n=12, 4 female, Mage = 25.9, SDage = 4.6; METC: NL76920.091.21) and Experiment II (n=27, 13 female, Mage = 24.1, SDage = 3.7; METC: NL80331.091.22) were conducted at the Donders Institute of the Radboud University (the Netherlands). Experiment III was conducted at the Krembil Research Institute (n=16, 8 female, Mage = 31.4, SDage = 7.9; Toronto University Health Network Research Ethics Board: 20–5740, Canada), and Experiment IV at the Neuroimaging Centre of the Johannes Gutenberg University Medical Centre Mainz (n=12, 11 female, Mage = 23.0, SDage = 2.7, Landesärztekammer Rheinland-Pfalz: 2021–15808_01, Germany). All participants were healthy, right-handed, without a history of psychiatric or neurological disorders, and provided informed consent. Ethical approval was obtained for each experiment.

Transcranial ultrasonic and magnetic stimulation

Ultrasonic stimulation was delivered with the NeuroFUS system (manufacturer: Sonic Concepts Inc, Bothell, WA, USA; supplier/support: Brainbox Ltd., Cardiff, UK). A radiofrequency amplifier powered a piezoelectric ultrasound transducer via a matching network. Transducers consisted of a two-element annular array. Further transducer specifications are reported in Appendix 8—table 1. Ultrasonic stimulation parameters were based on those used in prior TUS-TMS studies (Table 1, Figure 7A; Fomenko et al., 2020; Legon et al., 2018b; Xia et al., 2021). While ramping the pulses can in principle mitigate the auditory confound (Johnstone et al., 2021; Mohammadjavadi et al., 2019), doing so for such short pulse durations (≤0.3 ms) is not effective. Therefore, we used a rectangular pulse shape to match prior work.

Table 1. Ultrasonic stimulation parameters.

Exp. f(kHz) depth (mm) PD(ms) PRF(kHz) DC PTD(ms) Isppa(W/cm2) P(Mpa) MItc
I 500 35 0.3 1000 30% 100/500 32.5/65 1.02/1.44 0.99/1.40
II 250 28 0.3 1000 30% 500 6.35/19.06 0.45/0.78 0.65/1.12
III 500 30 0.1 1000 10% 500 9.26 0.54 0.53
IV 250 50* 0.3 1000 30% 400 4.34/8.69/10.52 0.37/0.53/0.58 0.53/0.76/0.83

f = fundamental frequency, depth = TPO focus setting for distance of free-water full-width half-maximum from transducer exit plane, PD = pulse duration, PRF = pulse repetition frequency, DC = duty cycle, PTD = pulse train duration, Isppa = spatial-peak pulse-average intensity in free-water, P = pressure, MItc = transcranial derated mechanical index. The ramp shape for all experiments was rectangular. For estimated intracranial indices for Experiments I & II see Appendix 5—figure 5.

*

Note: Experiments I-III targeted the hand motor area. Experiment IV targeted the corticospinal white matter.

Figure 7. Experimental procedures.

Figure 7.

(A) Ultrasonic stimulation protocol for Experiments I & II. In Experiment III a duty cycle of 10% was used. In Experiment IV a stimulus duration of 400 ms was used. (B) Transcranial ultrasonic stimulation (TUS)-transcranial magnetic stimulation (TMS) clamp (https://doi.org/10.5281/zenodo.6517599). (C) Experimental timing. Detailed experimental timing for each experiment is reported in Appendix 7—figures 14.

Single-pulse TMS was delivered with a figure-of-eight coil held at 45° from the midline to induce an approximate posterolateral to anteromedial current. The hand motor hotspot and required TMS intensity were determined using standard procedures as outlined in Appendix 6. To apply TUS and TMS concurrently, the ultrasound transducer was affixed to the center of the TMS coil using a custom-made 3D-printed clamp (Figure 7B; Experiments I, II, & IV; Experiment III: see Fomenko et al., 2020). TMS was triggered 10ms prior to the offset of TUS (Figure 7C). Muscular activity was recorded in the first dorsal interosseous (FDI; Experiments I-III) or in the abductor pollicis brevis (APB; Experiment IV) via electromyography with surface adhesive electrodes using a belly-tendon montage (Appendix 6—table 1).

In Experiments I, II, and IV, we used online neuronavigation with individual anatomical scans to support target selection and consistent TMS and TUS placement (Localite Biomedical Visualization Systems GmbH, Sankt Augustin, Germany; MRI specifications: Appendix 8—table 2). Furthermore, we recorded the position of TUS in Experiments I and II for post-hoc acoustic and thermal simulations.

Experiment I

On-target TUS was delivered to the left-hemispheric hand motor area to determine the effect of ultrasonic stimulation on corticospinal excitability. We introduced controls that improve upon the sole use of flip-over sham conditions used in prior work. First, we applied active control TUS to the right-hemispheric face motor area, allowing for the assessment of spatially specific effects while also better mimicking on-target peripheral confounds. In addition, we also included a sound-only sham condition that closely resembled the auditory confound (Figure 8). Specifically, we generated a 1000 Hz square wave tone with 0.3 ms long pulses using MATLAB. We then added white noise at a signal-to-noise ratio of 14:1. This stimulus was administered to the participant via bone-conducting headphones (AfterShockz Trekz, TX, USA). Finally, we incorporated a baseline condition consisting solely of TMS.

Figure 8. Experimental conditions.

Figure 8.

On-target transcranial ultrasonic stimulation (TUS) of the left-hemispheric hand motor area (Exp. I-III), active control TUS of the right-hemispheric face motor area (Exp. I–II), sound-only sham (Exp. I-III), and inactive control TUS of the white matter ventromedial to the hand motor area (Exp. IV). Conditions involving TUS were presented both with and without auditory masking stimuli.

Ultrasonic stimulation was delivered at two pulse train durations (100/500 ms) and at two intensities (32.5/65 W/cm2 Isppa) to probe a potential dose-response effect. Additionally, with consideration of potentially audible differences between on-target and active control stimulation sites, we applied these conditions both with and without masking stimuli identical to those used during a sound-only sham. Auditory stimuli used for sound sham and/or masking for each experiment are accessible here: 10.5281/zenodo.8374148. See Appendix 7 for an overview of conditions and experimental timing for each experiment.

Conditions were administered in a single-blind inter-subject counterbalanced blocked design while participants were seated at rest. Ultrasound gel was used to couple both transducers to the participant’s scalp (Aquasonic 100, Parker Laboratories, NJ, USA). In total, participants completed 14 blocks of 20 trials each. Each trial lasted 6±1 s. Two baseline measurements were completed, the first occurring as one of the first four blocks, and the second as one of the last four, to capture any general shift in excitability throughout the experiment. TMS was administered on every trial for a total of 280 single pulses.

Experiment II

To confirm and expand upon our findings from Experiment I we conducted a second, preregistered, experiment using the same main conditions and procedures, with a few adaptations (10.17605/OSF.IO/HS8PT). The 2 × 2 × 2 design comprised of stimulation site (on-target/active control), stimulation intensity (6.35/19.06 W/cm2), and auditory masking (no mask/masked). We applied ultrasonic stimulation exclusively at an effective 500 ms pulse train duration. In this experiment, the same 1000 Hz square wave auditory stimulus was used for sound-only sham and auditory masking conditions. This stimulus was administered to the participant over in-ear headphones (ER-3C Insert Earphones, Etymotic Research, Illinois, USA). To better capture any baseline shift in excitability during the experiment, we presented conditions in a single-blind pseudorandomized order in which each consecutive set of 10 trials included each of 10 conditions once. Participants completed 25 trials per condition, resulting in 250 trials total.

To further probe a potential dose-response effect of stimulation intensity, we ran acoustic and thermal simulations (Appendix 5). Here, we assessed the relationship between estimated intracranial intensities and perturbation of corticospinal excitability. While simulations were also run for Experiment I, its sample size was insufficient to test for intracranial dose-response effects.

Following the main experiment, we tested the efficacy of our masking stimuli with a forced-choice task wherein participants reported if they had received TUS for each condition, excluding baseline. Additionally, we investigated whether audible differences between stimulation sites were present during auditory masking (Appendix 3—figure 1).

Experiment III

We further characterized possible effects of auditory confounds on motor cortical excitability by administering varied auditory stimuli, both alongside on-target TUS and without TUS (i.e. sound-only sham). Auditory stimuli were either 500 or 700 ms in duration, the latter beginning 100ms prior to TUS (Appendix 7—figure 3). Both durations were presented at two pitches. Using a signal generator (Agilent 33220 A, Keysight Technologies), a 12 kHz sine wave tone was administered over speakers positioned to the left of the participant as in Fomenko et al., 2020. Additionally, a 1 kHz square wave tone with 0.5 ms long pulses was administered as in Experiments I, II, IV, and prior research (Braun et al., 2020) over noise-cancelling earbuds.

First, we investigated changes in corticospinal excitability from baseline following these auditory stimuli. Participants received 15 trials of baseline (i.e. TMS only) and 15 trials of each of the four sound-only sham stimuli. Conditions were presented in a blocked single-blind randomized order with participants seated at rest. An inter-trial interval of 5 s was used.

Next, we assessed whether applying on-target TUS during these auditory stimuli affected motor excitability. Here, TMS intensity was set to evoke a ~1 mV MEP separately for each of the four sound-only sham conditions (Appendix 6—figure 2). To account for different applied TMS intensities between baseline and these conditions, we calculated Relative MEP amplitude by multiplying each trial by the ratio of applied TMS intensity to baseline TMS intensity. Participants received 15 trials of each auditory stimulus, once with on-target TUS and once as a sound-only sham. Ultrasound gel (Wavelength MP Blue, Sabel Med, Oldsmar, FL) and a 1.5 mm thick gel pad (Aquaflex, Parker Laboratories, NJ, USA) were used to couple the transducer to the participants’ scalps. Conditions were presented in pairs of sound-sham and TUS for each auditory stimulus, counterbalanced between subjects. The order of the different auditory stimuli was randomized across participants.

Experiment IV

We further investigated the role of TUS audibility on motor excitability by administering stimulation to an inactive control site – the white matter ventromedial to the hand motor area. In doing so, TUS is applied over a homologous region of the scalp and skull without likely direct neuromodulation, thus allowing us to closely replicate the auditory confound while simultaneously isolating its effects.

Here, we probed whether the varying volume of the auditory confound at different stimulation intensities might itself impact motor cortical excitability. To this end, we applied stimulation at 4.34, 8.69, and 10.52 W/cm2 Isppa, or in effect, at three auditory confound volumes. We additionally applied stimulation both with and without a continuous auditory masking stimulus that sounded similar to the auditory confound. The stimulus consisted of a 1 kHz square wave with 0.3 ms long pulses. This stimulus was presented through wired bone-conducting headphones (LBYSK Wired Bone Conduction Headphones). The volume and signal-to-noise ratio of the masking stimulus were increased until the participant could no longer hear TUS, or until the volume became uncomfortable.

We administered conditions in a single-blind inter-subject randomized block design. Two blocks were measured per condition, each including 30 TUS-TMS trials and an additional 30 TMS-only trials to capture drifts in baseline excitability. These trials were applied in random order within each block with an inter-trial interval of 5±1 s. Ultrasound gel (Aquasonic 100, Parker Laboratories, NJ, USA) and a ~2–3 mm thick gel pad were used to couple the transducer to the participant’s scalp (Aquaflex, Parker Laboratories, NJ, USA). During blocks with auditory masking, the mask was played continuously throughout the block. Following each block, participants were asked whether they could hear TUS (yes/no/uncertain).

Analysis

Raw data were exported to MATLAB, where MEP peak-to-peak amplitude was calculated using a custom script and confirmed by trial-level visual inspection. The data and code are publicly available (https://doi.org/10.34973/jh6z-yh31; Kop et al., 2024). Trials where noise prevented an MEP from being sufficiently quantified were removed. Given the right-skewed nature of the raw MEP values, we performed a square root transformation to support parametric statistics. For visualization purposes, baseline corrected MEP amplitudes were also calculated.

Linear mixed-effects models (LMMs) were fitted using the lme4 package in R (Bates et al., 2015; R Development Core Team, 2021). Intercepts and condition differences (slopes) were allowed to vary across participants, including all possible random intercepts, slopes, and correlations in a maximal random effects structure (Barr et al., 2013). Statistical significance was set at two-tailed α=0.05 and was computed with t-tests using the Satterthwaite approximation of degrees of freedom. For direct comparisons to a reference level (e.g. baseline), we report the intercept (b), standard error (SE), test-statistics (t), and significance (p). For main effects and interactions, we report the F statistic, significance, and partial eta squared. LMMs included square root transformed MEP peak-to-peak amplitude as the dependent variable, with the relevant experimental conditions and their interactions as predictors. Given the large number of baseline trials in Experiment IV (50% of the total), the LMM testing effects of stimulation intensity and auditory masking instead included baseline corrected MEP amplitude as the dependent variable.

CRediT authorship contribution statement

Figure 9 visualises the contributor roles taxonomy (CRediT) author statement.

Figure 9. Contribution diagram.

Figure 9.

This figure depicts the involvement of each author using the CRediT taxonomy (Brand et al., 2015) and categorizes their contributions according to three levels represented by color: ‘none (gray)’, ‘substantial contribution (light green)’, ‘leading contribution (dark green)’.

Acknowledgements

Experiments I & II were supported by the Dutch Research Council (NWO), awarding VIDI fellowships to LV (18919) and HEMdO (452-17-016), and the Topsector program Holland High Tech (HiTMaT-38H3). We thank Brittany van Beek for assistance with data acquisition and administration, Sarmad Peymaei for contributions to technical piloting, Norbert Hermesdorf for developing the 3D-printed TUS-TMS clamp, and Gerard van Oijen and Pascal de Water for their technical support. Experiment III was funded by the Canadian Institutes of Health Research (FDN 154292, ENG 173742) and the Natural Science and Engineering Research Council of Canada (RGPIN-2020-04176, RTI-2020-0024). Experiment IV and TN were supported by a grant from the Boehringer Ingelheim Foundation to TOB.

Appendix 1

No evidence for direct intracranial dose-response effects (Exp. II)

In Experiments I & II, we tested the potential dose-response effects of TUS by applying stimulation at multiple free-water intensities. Here, no significant effect of administered intensity was observed. However, the efficacy of TUS likely depends on realized intracranial intensities. Therefore, we ran 3D acoustic simulations to estimate the intracranial intensity during on-target TUS. It was not appropriate to combine data from Experiments I and II given the different fundamental frequencies and stimulation depths applied.

Should direct and spatially specific neuromodulation take place, we would expect to see a dose-response effect where MEP amplitude changes with intracranial intensity during on-target TUS, and not during control conditions. Therefore, the critical test to provide evidence of direct neuromodulation is a comparison of the dose-response relationship between on-target TUS and control conditions. To this end, we ran simple linear models for Experiment II, which had a sufficient sample size (n = 27) to assess inter-individual variability. We found no significant difference in the TUS-MEP relationship between on-target TUS and control conditions (active control free-water 6.35 W/cm2: b = -1.02, SE = 3.66, t(24) = -0.28, p = 0.783; active control free-water 19.06 W/cm2: b = 0.04, SE = 1.66, t(24) = 0.02, p = 0.983; sound-sham free-water 6.35 W/cm2: b = 0.58, SE = 4.78, t(24) = 0.12, p = 0.90; sound-sham free-water 19.06 W/cm2: b = -0.66, SE = 1.92, t(24) = -0.35, p = 0.733). We conclude that there is no evidence for a direct neuromodulatory intracranial dose-response relationship. This interpretation is in line with our findings when testing the free-water dose-response effects of TUS.

Notably, given that for all individuals the same ultrasound intensity was applied at the source outside the skull, any intracranial differences in intensity are primarily driven by individual skull characteristics, such as skull thickness. These individual characteristics are expected to co-vary with TMS parameters, such as the TMS intensity (% MSO) required to obtain a 1mV MEP. It is conceivable that such co-varying relationships can drive the overall effects of MEP amplitude and MEP inhibition. Indeed, we observe a significant effect of intracranial intensity on the difference in MEP amplitude between on-target TUS at 6.35 W/cm2 and baseline (b = 19.46, SE = 7.91, t(24) = 2.46, p = 0.022), as well as a trend for on-target TUS at 19.06 W/cm2 (b = 5.77, SE = 3.01, t(24) = 1.92, p = 0.067). Importantly, this is also observed for the sound-sham condition (b = 19.65, SE = 7.58, t(24) = 2.59, p = 0.016), where direct neuromodulation is impossible. These observations emphasize the need for rigorous control conditions to support strong inferences of TUS neuromodulation. In summary, in this experiment, no evidence for a specific and direct neuromodulatory dose-response effect was observed.

Appendix 1—figure 1. No evidence for direct intracranial dose-response effects (Exp. II).

Appendix 1—figure 1.

(A) Hypothetical effects. A target-specific dose-response effect of transcranial ultrasonic stimulation (TUS) would be reflected by a change in motor-evoked potential (MEP) amplitude with increasing intracranial intensity only for on-target TUS (top left), whereas a nonspecific effect of TUS would show the same ‘dose-response’ for both on-target and active control conditions (top right). If there is sound-driven inhibition, a motor inhibitory effect would be observed for on-target, active control, and sound-only sham (bottom left). If sound-driven inhibition would be subject to individual differences, for example in skull morphology which correlates with intracranial intensity, there would be a correlation between MEP amplitude and intracranial intensity that exists for on-target, active control, and sound-sham conditions (bottom right). The latter corresponds with the observed effects. (B) Average square root corrected MEP amplitude, plotted separately for baseline, sham, TUS, and active control conditions, across simulated intracranial intensities. For each participant, we estimated the intracranial ultrasound intensity at the hand M1 target, represented on the x-axis, and the average MEP amplitude for all four conditions. Results for on-target and active control delivered at 6.35 W/cm2 free-water Isppa are depicted on the left, and at 19.06 W/cm2 free-water Isppa are depicted on the right. Please note that, for reference, the baseline and sham conditions are duplicated across both plots. There is no significant difference between on-target TUS and control conditions across inter-individual intracranial intensities. Points represent the average MEP amplitude per participant per condition. The shaded area represents the 95% CI.

Appendix 2

Temporal dynamics (Exp. II)

While studies often focus on quantifying the excitatory or inhibitory effects of TUS, it can be informative to additionally examine how TUS affects neural dynamics in the time domain. Here, we conduct a preliminary exploration of whether autocorrelative measures between a given trial (t) and its preceding measure of baseline motor cortical excitability (t-1) yield insight into the possible introduction of noise by TUS. To this end, we ran a linear mixed model predicting square root corrected MEP amplitude by ‘previous baseline’ amplitude, as well as ‘stimulation site’ (on-target/active control), ‘intensity’ (6.35/19.06 W/cm2), and ‘masking’ (no mask/masked). Previous baseline MEP amplitudes were mean-centered and standardized on a participant level. Random intercepts and slopes were included for ‘stimulation site,’ ‘intensity,’ ‘previous baseline,’ and their interaction, permitting model convergence. This test revealed a significant four-way interaction (F(1,5222) = 26.10, p = 3⋅10-7, ηp2 = 5⋅10-3) and an accompanying main effect of previous baseline MEP amplitude (F(1,26) = 5.70, p = 0.024, ηp2 = 0.18). Follow-up LMMs for unmasked and masked stimulation separately both revealed a significant three-way interaction between ‘stimulation site,’ ‘intensity,’ and ‘previous baseline’ (unmasked: F(1,176) = 7.32, p = 0.007, ηp2 = 0.04; masked: F(1,30) = 12.10, p = 0.002, ηp2 = 0.28). Further follow-up LMMs for each level of masking and intensity were used to test the effects of ‘stimulation site’, ‘previous baseline’, and their interaction. These LMMs revealed a significant interaction for all but unmasked 19.06 W/cm2 stimulation (no mask 6.35 W/cm2: F(1,1293) = 8.36, p = 0.004, ηp2 = 6⋅10-3; no mask 19.06 W/cm2: F(1,1283) = 1.13, p = 0.288, ηp2 = 9⋅10-4; masked 6.35 W/cm2: F(1,1287) = 13.43, p = 3⋅10-4, ηp2 = 0.01; masked 19.06 W/cm2: F(1,1282) = 5.76, p = 0.017, ηp2 = 4⋅10-3).

Appendix 2—figure 1 shows the temporal autocorrelation, as indexed by the linear regression slopes of the relationship between MEP amplitude on the previous baseline trial and the current test trial. Statistical inference is drawn at the current trial (zero-lag, indicated with the filled dots and error bars). For visualization only, the autocorrelation is calculated for a range of time lags, shifting the time series of preceding baseline excitability from 15 trials in the past to 15 trials in the future. This shows, as expected, that autocorrelation and the modulation thereof disappear at larger absolute time lags. The main manuscript shows the difference in autocorrelation between on-target and active control TUS (Figure 2D; e.g. masked 6.35 W/cm2 on-target TUS slope – masked 6.35 W/cm2 active control slope). For sound-sham and baseline, the difference score was calculated based on the mean overall active control trials.

In sum, these preliminary exploratory analyses could point towards TUS introducing temporally specific neural noise to ongoing neural dynamics in a dose-dependent manner, rather than simply shifting the overall excitation-inhibition balance. One possible explanation for the discrepancy between trials with and without auditory masking is the difference in auditory confound perception, where without masking the confound’s volume differs between intensities, while with masking this difference is minimized. Future studies might consider designing experiments such that temporal dynamics of ultrasonic neuromodulation can be captured more robustly, allowing for quantification of possible state-dependent or nondirectional perturbation effects of stimulation.

Appendix 2—figure 1. Temporal dynamics (Exp. II).

Appendix 2—figure 1.

We operationalized autocorrelation as the slope between motor-evoked potential (MEP) amplitude on a given trial (t) and its preceding baseline MEP amplitude (t-1) for each condition. Statistical inference is drawn at the current trial (lag = 0). Dots and error bars reflect the mean and standard error of the slopes across participants for each level of on-target and active control transcranial ultrasonic stimulation (TUS). Additionally, for visualization purposes, we display autocorrelation for shifts in the preceding baseline time series, with lags ranging from -15 to +15 trials. The line depicts the mean slope across participants for each lag, and the shaded area represents the standard error. *p<0.05, **p<0.01, ***p<0.001.

Appendix 3

Blinding efficacy (Exp. II)

In Experiment II, we investigated whether participants were effectively blinded to ultrasonic stimulation when a time-locked auditory masking stimulus was applied (Appendix 3—figure 1A). Participants indicated whether they believed TUS was administered across each condition, excluding baseline. Participants completed four trials per condition, resulting in 36 trials in total, presented in pseudorandomized order where each condition is presented randomly in sets of 9 trials. We used a mixed-effects logistic regression to predict participant response (stimulation yes/no) as a function of stimulation site (on-target/active control), free-water intensity (6.35 and 19.05 W/cm2), and masking (no mask/masked) with random intercepts by the participant. This test revealed significantly lower detection rates when masking was applied (b = -0.90, SE = 0.31, z = -2.87, p = 0.004), but showed no significant effect of stimulation site (b = 0.05, SE = 0.31, z = 0.16, p = 0.875) or intensity (b = 0.30, SE = 0.31, z = 0.95, p = 0.342), nor any significant interactions (all p > 0.6). Only ultrasonic stimulation applied at higher intensities (19.06 W/cm2) in the absence of an auditory masking stimulus were detected by participants at an above-chance rate (i.e. 50%; on-target: Z = 2.6, p = 0.011; active control: Z = 2.9, p = 0.004). Detection rates for lower intensities, masked stimulation, and sound-sham conditions did not differ significantly from chance, as revealed by post-hoc one-sample Wilcoxon signed rank tests (all Z < 1.7, p > 0.09). Taken together, these results show that the masking used in Experiment II successfully reduced participants’ ability to determine whether they had received stimulation.

It is possible that, while blinding to TUS versus no-TUS is successful for explicit judgments, slight audibility differences between on-target and active control stimulation persist even when stimulation is masked. To test for audible differences between stimulation sites (Appendix 3—figure 1B), participants were exposed to two consecutive masked TUS trials. These two stimuli were either the same (i.e. [on-target + on-target] or [active control + active control]) or different (i.e. [on-target + active-control/active-control + on-target]). These two trials were presented at both intensities (6.35 and 19.05 W/cm2), to account for the louder auditory confound associated with higher intensity stimulation. Following each pair of trials, participants were asked to rate their similarity on a visual analog scale (0 = very dissimilar, 100 = very similar) across 8 trials in total. We conducted a two-way repeated measures ANOVA predicting similarity rating by stimulation site (same/different) and stimulation intensity (6.35/19.05 W/cm2). Here, masked stimulation over the two different sites sounded less similar than masked stimulation over the same site (Appendix 3—figure 1B; F(1,26) = 14.00, p = 9⋅10-4) while no significant main effect of, or interaction with, intensity was observed (main effect: F(1,26) = 1.98, p = 0.171; interaction: F(1,26) = 0.26, p = 0.613). These results suggest that, while masking may effectively blind participants, thus reducing the detection of ultrasonic stimulation, slight audible differences between different stimulation sites may still persist.

Appendix 3—figure 1. Blinding efficacy (Exp. II).

Appendix 3—figure 1.

(A) An auditory masking stimulus significantly decreased participants’ ability to determine whether they had received ultrasonic stimulation. (B) However, when masked stimulation was applied over different sites, participants still rated these stimuli as sounding less similar than stimulation applied over the same site. Bars represent means across participants, error bars represent the standard error, and points represent individual participants. ***p<0.001.

Appendix 4

Subjective report of TUS audibility (Exp. IV)

We propose that the results of Experiment IV reflect the impact of auditory confound volume. Here, the volume of the confound can be regarded as the salience of a cue for the upcoming TMS pulse. When masking is applied, the salience of this cue is diminished. Correspondingly, less motor inhibition was observed when stimulation was masked, as opposed to when no masking stimulus was administered and the auditory confound was clearly audible.

During continuous masking, more motor inhibition was observed at higher auditory confound volumes (i.e. intensities). This is likely because the confound can then be perceived, albeit potentially unconsciously. Indeed, in Appendix 4—figure 1B, we also see that participants who rated masked stimulation as uncertain or audible demonstrated more inhibition. Taken together, we suggest that ‘dose-response’ effects of auditory confound volume are being observed. As the confound approaches and exceeds the boundary of (subconscious) audibility, sufficiently salient cueing of the upcoming TMS pulse takes place, thus evoking proportional motor inhibitory effects.

Appendix 4—figure 1. Subjective report of transcranial ultrasonic stimulation (TUS) audibility (Exp. IV).

Appendix 4—figure 1.

(A) Depicting the percentage of blocks for which participants reported a condition as audible, uncertain, or inaudible. When no mask was applied, TUS was audible (green) in nearly all cases. During continuous masking, TUS was experienced as inaudible (red) less often at higher stimulation intensities. Descriptively, reports of inaudibility scale with motor-evoked potential (MEP) amplitudes such that in conditions where TUS was inaudible more frequently, less motor inhibition was observed. (B) Motor inhibitory effects during masked stimulation, split by audibility. Descriptively, when masking rendered TUS inaudible, less motor inhibition was observed than when participants were uncertain or when stimulation was heard.

Appendix 5

To obtain estimates of acoustic targeting, realized intracranial dosage, and safety indices, we conducted individualized 3D simulations of acoustic and thermal effects for Experiments I and II using the validated k-Wave MATLAB toolbox, a pseudospectral time-domain solver (Treeby and Cox, 2010).

Preprocessing was performed in MATLAB 2019b. Anatomical scans (T1w and T2w) were first segmented into distinct tissues using the SimNIBS headreco tool (Thielscher et al., 2015). We then reoriented the segmented volumes such that the axial plane of the recorded transducer location was parallel to the cartesian axis of the computational grid. Subsequently, the segmented volumes were resampled to an isometric volume size of 0.5 mm and interpolated using the nearest neighbor algorithm. We then extracted the skull, scalp, and brain – including gray and white matter – volumes. To correct minor segmentation errors, the volumes were smoothed by a cubic averaging kernel using a window size of four voxels, and re-binarized. The skull layer was additionally combined with a one-voxel outer layer of the filled-in bone volume provided by SimNIBS. Any gaps between the skull and skin volumes were assigned to the skull. The skull, scalp, and brain were all included in acoustic and thermal k-Wave simulations, where all voxels not assigned to these tissues were designated as water. Segmented volumes were then cropped to include the whole skull and the transducer, including a perfectly matched layer (PML) of 10 grid points. The points per wavelength given the 0.5 mm grid spacing were 6 for Experiment I (f = 500 kHz) and 12 for Experiment II (f = 250 kHz).

The acoustic properties of sound speed, density, alpha coefficient, and alpha power, as well as the thermal properties of thermal conductivity and heat capacity, were assigned to the skull, scalp, and brain, as reported in Appendix 5—table 1 (Aubry et al., 2022; IT’IS Foundation, 2022).

We modeled on-target TUS with two-element annular arrays using their respective geometric dimensions and fundamental frequencies for Experiments I and II. The position and orientation of the simulated transducer was extrapolated based on the coordinates of the TMS coil recorded during the experiment with neuronavigation software. We calibrated the simulated source amplitude and phase in free water to reach the applied free-water stimulation intensities of 32.5 and 65 W/cm2 (Experiment I) and 6.35 and 19.06 W/cm2 (Experiment II). We calibrated the simulated transducer such that the center of the acoustic profile’s full-length half-maximum was at the applied depth of 35.1 mm (Experiment I) or 28 mm (Experiment II) from the exit plane of the transducer. The source amplitude and phase resulting in a simulated acoustic profile with the minimum error in relation to the water-tank calibrated acoustic profile provided by the manufacturer were used for acoustic and thermal simulations including the tissue volumes.

Acoustic simulations were then run for three TUS cycles after reaching the steady-state pressure distribution per applied free-water intensity per participant. The maximum pressure map computed by k-Wave was then used to calculate the realized intracranial pulse-average intensity, which was calculated as: Ipa=p22cp , where p is acoustic pressure, c is sound speed, and ρ is density. The mechanical index was calculated based on simulated intracranial pressure as: MI=pf, where p is the peak intracranial pressure, and f is the fundamental frequency. This formula was also used for Experiments III and IV, but with derated free-water pressures.

The simulated pressure map further served as input for the thermal simulations, where we quantified the maximum thermal rise at each point in the computational grid. Here, simulations were run with a 30% DC for a stimulus duration of 500 ms, corresponding to the applied TUS. To compute a conservative safety estimate of thermal rise, stimulation was split into five 100 ms steps, each with a 30 ms on-period and a 70 ms off-period (actual on-period = 0.3 ms per 1 ms).

Appendix 5—table 1. Properties for acoustic and thermal simulations (Exp. I-II).

Density (kg/m3) Sound speed (ms) Attenuation coefficient Thermal conductivity (W/m/°C) Specific heat capacity (J/kg/°C)
Water 994 1500 0.0 0.60 4178
Scalp 1090 1610 0.4 0.37 3391
Skull 1850 2800 8.0 0.32 1313
Brain 1046 1546 0.6 0.51 3630

This table describes the acoustic and thermal properties assigned to each component included in simulations.

Appendix 5—figure 1. Acoustic simulations for Experiment I.

Appendix 5—figure 1.

Individual simulations for Experiment I of acoustic wave propagation at 65 W/cm2 free-water stimulation intensity. Simulated pulse-average intensities above 0.15 W/cm2 are depicted. The FWHM of the pressure is indicated by the dashed white line.

Appendix 5—figure 2. Thermal simulations for Experiment I.

Appendix 5—figure 2.

Individual simulations for Experiment I of thermal rise at 65 W/cm2 free-water stimulation intensity.

Appendix 5—figure 3. Acoustic simulations for Experiment II.

Appendix 5—figure 3.

Individual simulations for Experiment II of acoustic wave propagation at 19.05 W/cm2 free-water stimulation intensity. Simulated pulse-average intensities above 0.15 W/cm2 are depicted. The FWHM of the pressure is indicated by the dashed white line.

Appendix 5—figure 4. Thermal simulations for Experiment II.

Appendix 5—figure 4.

Individual simulations for Experiment II of thermal rise at 19.05 W/cm2 free-water stimulation intensity.

Appendix 5—figure 5. Simulated intracranial indices (Exp. I-II).

Appendix 5—figure 5.

Simulated indices for Experiment I (top) and Experiment II (bottom) for both applied free-water stimulation intensities including spatial-peak pulse- and temporal-average intensity (Isppa, Ispta), the mechanical index (MI), and peak thermal rise (TR).

Appendix 6

TMS hotspot and intensity determination

Single-pulse TMS was delivered with a 97 mm figure-of-eight MC-B70 coil powered by a MagPro X100 + MagOption stimulator (MagVenture, Farum, Denmark) using a biphasic pulse shape. The motor hotspot for the first dorsal interosseous (FDI) was determined by positioning the TMS coil over the hand motor area as identified with neuronavigation software, and iteratively adjusting TMS intensity until a consistent MEP was observed of approximately 1 mV amplitude. After probing locations in a ~2 cm radius, the location resulting in the highest and most stable amplitude MEPs was set as the final stimulation site. Next, the percentage of maximum stimulator output (%MSO) was adjusted until an average MEP amplitude of ~1 mV was observed across approximately ten trials. This intensity was used throughout the rest of the experiment. Both TMS and TUS were externally triggered using Signal version 7.05 (CED, Cambridge, UK). The mean percentage of administered TMS maximum stimulator output was 80.4 ± 12.5% for Experiment I, and 88.0 ± 9.3% for Experiment II. These high stimulation intensities are to be expected, given the offset of TMS from the scalp owing to the ultrasound transducer and the TMS current dissipating with distance (O’Shea and Walsh, 2007).

Experiment III

Single-pulse TMS was delivered via a 70 mm figure-of-eight coil powered by a Magstim 2002 stimulator (Magstim, Whitland, Dyfed, UK). The coil was held at 45° from the midline to induce an approximate posterolateral to anteromedial current using a monophasic pulse shape. Optimal positioning was determined by placing the TMS coil over the left-hemispheric hand motor area and moving the coil in increments of 0.5 cm until a clear MEP was observed over the FDI. Following the assessment of optimal coil flatness and orientation, the location and orientation of TMS were marked on the scalp to support consistent placement throughout the experiment. At this site, the minimal intensity required to evoke an average MEP of ~1 mV across ten trials was determined. The mean %MSO was 73.7 ± 11.1% for the baseline condition (i.e. TMS only). The mean %MSO require to evoke a ~ 1mV MEP during the four sound-sham conditions were significantly higher at 77.4 ± 9.4% (1 kHz, 500 ms, t(15) = -4.64, p = 3⋅10-4), 77.6 ± 10.1% (1 kHz, 700 ms, t(15) = -4.84, p = 2⋅10-4), 76.9 ± 10.6% (12 kHz, 500 ms, t(15) = -3.76, p = 0.002), and 77.2 ± 9.8% (12 kHz, 700 ms, t(15) = -4.09, p = 9.7⋅10-4, see Appendix 6—figure 2). To administer stimulation at high temporal precision, both TUS and TMS were externally triggered using Signal 6.04 (CED, Cambridge, UK).

Experiment IV

The TMS stimulator, coil, pulse shape, and orientation were identical to Experiments I & II, excluding the final four participants in which the Cool-B35 HO coil (Magventure, Farum, Denmark) was used during preparatory measures to allow more precise estimation of the motor hotspot, while always the same TMS coil (MC-B70) was used during the combined TUS-TMS experiment.

The motor hotspot for the abductor pollicis brevis (APB) was determined by positioning the TMS coil over the hand motor area as identified with neuronavigation software, and systematically searching for the largest and most consistent MEP. An automated adaptive staircase procedure with an amplitude threshold of 0.05 mV was used to determine the resting motor threshold (rMT). We subsequently estimated the dose-response curve by applying 20 pulses each at 80, 90, 100, 110, 120, and 130% rMT. In most cases we were unable to reach the plateau of the dose-response curve due to the high motor thresholds resulting from the offset caused by the ultrasound transducer. We aimed to use 120% rMT (n = 3). However, if this intensity surpassed 100% MSO, we opted for 100% MSO instead (n = 9). The mean %MSO was 94.5 ± 10.5%. The TMS intensities required in this experiment were higher than those required in Experiment I-II using the same TMS coil, though still within approximately one standard deviation. This is likely due to the use of a gel pad, which introduces more distance between the TMS coil and the scalp, thus requiring a higher TMS intensity to evoke the same motor activity. To administer stimulation at high temporal precision, both TUS and TMS were externally triggered via a boss-device (sync2brain, Tübingen, Germany) using the BEST Toolbox (Hassan et al., 2022).

Appendix 6—figure 1. Transcranial magnetic stimulation (TMS) hotspot and intensity determination.

Appendix 6—figure 1.

Percentage of maximum stimulator output (%MSO) is determined by finding the motor hotspot and then adjusting %MSO until a stable 1 mV motor-evoked potential (MEP) is observed (Experiments I-III) or until a MEP of minimally 0.05 mV is observed on at least half of trials (Experiment IV).

Appendix 6—figure 2. Transcranial magnetic stimulation (TMS) intensities in Experiment III.

Appendix 6—figure 2.

Percentage maximum stimulator output (%MSO) required in Experiment III to evoke a 1 mV motor-evoked potential (MEP) at baseline and during the four sound-sham conditions. Each sound-sham condition required a significantly higher %MSO than the baseline to evoke a 1 mV MEP. **p<0.01, ***p<0.001.

Appendix 6—table 1. Electromyography.
Experiment Muscle Amplification Filtering Digital sampling rate Software
I & II FDI 1000 gain* 1–1000 Hz* 5 kHz Signal version 7.05
III FDI 1000 gain§ 20–2500 Hz§ 5 kHz Signal version 6.04
IV APB 10 gain <1250 Hz 5 kHz BEST Toolbox**

This table describes the EMG acquisition parameters used to measure muscular activity in the first dorsal interosseous (FDI) or abductor pollicis brevis (APB) using a belly-tendon montage.

*

D440-2, Digitimer Ltd., Hertfordshire, UK.

Micro 1401, CED, Cambridge, UK.

CED, Cambridge, UK.

§

Intronix Technologies Corporation [model: 2024F], Bolton, Canada.

Bittium NeurOne Tesla System, Bittium Biosignals Ltd., Finland.

Appendix 7

Appendix 7—figure 1. Overview of experimental conditions.

Appendix 7—figure 1.

For each experiment (row), core experimental conditions are noted. In Experiments I and II, the main conditions included on-target transcranial ultrasonic stimulation (TUS) (left-hemispheric hand motor area), active control TUS (right-hemispheric face motor area), sound-sham (auditory stimulation alone), and baseline (transcranial magnetic stimulation, TMS only). Here, TUS was applied at two stimulation intensities, and both with and without a time-locked auditory masking stimulus. We report the spatial-peak pulse-averaged intensity in free water without attenuation from biological tissue. In Experiment I, TUS was further administered at both 100 and 500 ms stimulus durations. Experiment II is a preregistered study to confirm and extend the findings of Experiment I. In Experiment III, four auditory stimuli of varying pitch and duration were administered both in isolation (sound-sham) and alongside on-target TUS. In Experiment IV, inactive control TUS was administered to the white matter ventromedial to the hand motor area at three intensities (i.e. auditory confound volumes) both with and without a continuous auditory masking stimulus.

Appendix 7—figure 2. Conditions and timing of Experiments I & II.

Appendix 7—figure 2.

(A) On-target left-hemispheric hand area stimulation (left) and active control right-hemispheric face area stimulation (middle) were applied both with and without a masking tone. The sham condition (right) consisted solely of an identical tone. (B) On-target and active control transcranial ultrasonic stimulation (TUS) were administered for 100 ms [Experiment I] and 500 ms [Experiments I & II] with transcranial magnetic stimulation (TMS) applied 10 ms prior to TUS-offset. In Experiment I, the auditory stimulus administered during sound-sham and masked conditions began ~50 earlier and was 100 ms longer than TUS, while in Experiment II the auditory stimulus was precisely timed to match TUS. Baseline measurement involved no intervention other than TMS.

Appendix 7—figure 3. Conditions and timing of Experiment III.

Appendix 7—figure 3.

(A) On-target transcranial ultrasonic stimulation (TUS) consisted of left-hemispheric hand motor area stimulation in concurrence with an auditory stimulus (left), while sound-sham consisted solely of an auditory stimulus (right). (B) Auditory stimuli were delivered at both 1 and 12 kHz for either 500 or 700 ms. TUS was delivered for 500 ms. Transcranial magnetic stimulation (TMS) was delivered 10 ms prior to TUS offset (i.e. following 490 ms of TUS), and at the same timing in the absence of TUS.

Appendix 7—figure 4. Conditions and timing of Experiment III.

Appendix 7—figure 4.

(A) Inactive control stimulation of white matter ventromedial to the left-hemispheric hand motor area. (B) White matter transcranial ultrasonic stimulation (TUS) was administered for 400 ms, with transcranial magnetic stimulation (TMS) applied 10 ms prior to TUS offset. In the masked condition, an auditory stimulus was played continuously throughout the entire block.

Appendix 8

Appendix 8—table 1. Transducer specifications.

Exp. Model number N elements Radius of curvature Aperature diameter Width Solid water coupling
I CTX500-006 2 63.2 45.0 ~16 yes
II CTX250-014 2 63.2 45.0 ~16 yes
III H246-01 2 0 (flat) 33.6 10 no
IV CTX250-005 2 63.2 45.0 16.45 yes

This table describes transducer specifications used with a Sonic Concepts TPO (Bothell, WA, USA).

Appendix 8—table 2. MRI acquisition parameters.

Experiment Scan TR TE FoV read FoV phase voxel N slices
I & II T1w* 2700 ms 3.69 ms 230 mm 128.1% 0.9 mm iso 224
T2w* 3200 ms 408 ms 230 mm 128.1% 0.9 mm iso 224
IV T1w 2700 ms 3.7 ms 260 mm 88.9% 0.9 mm iso 224

This table shows the MRI acquisition parameters used in Experiments I, II, and IV. Anatomical T1w scans were used for online neuronavigation. For Experiments I and II, both T1w and T2w scans were used for post-hoc acoustic and thermal simulations.

*

3T Siemens Skyra MRI scanner (Siemens Medical Solutions, Erlangen, Germany) with a 32-channel head coil.

3T Siemens Prisma MRI scanner with a 32-channel head coil.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Benjamin R Kop, Email: benjamin.kop@donders.ru.nl.

Jonas Obleser, University of Lübeck, Germany.

Jonathan Roiser, University College London, United Kingdom.

Funding Information

This paper was supported by the following grants:

  • Nederlandse Organisatie voor Wetenschappelijk Onderzoek 18919 to Lennart Verhagen.

  • Nederlandse Organisatie voor Wetenschappelijk Onderzoek 452-17-016 to Hanneke EM den Ouden.

  • Holland High Tech HiTMaT-38H3 to Lennart Verhagen.

  • Canadian Institutes of Health Research FDN 154292 to Robert Chen.

  • Canadian Institutes of Health Research ENG 173742 to Robert Chen.

  • Natural Sciences and Engineering Research Council of Canada RGPIN-2020-04176 to Robert Chen.

  • Natural Sciences and Engineering Research Council of Canada RTI-2020-0024 to Robert Chen.

  • Boehringer Ingelheim Stiftung to Til Ole Bergmann.

Additional information

Competing interests

No competing interests declared.

is the head of software development at sync2brain GmbH, where the bossdevice used in Experiment IV was developed.

Reviewing editor, eLife.

Author contributions

Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Validation, Investigation, Visualization, Methodology, Writing – original draft, Project administration, Writing – review and editing.

Conceptualization, Software, Formal analysis, Validation, Investigation, Methodology, Writing – review and editing.

Conceptualization, Software, Formal analysis, Investigation, Methodology, Writing – review and editing.

Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Validation, Investigation, Visualization, Methodology, Project administration, Writing – review and editing.

Conceptualization, Data curation, Software, Formal analysis, Investigation, Methodology, Writing – original draft, Writing – review and editing.

Conceptualization, Software, Methodology, Writing – review and editing.

Conceptualization, Software, Methodology, Writing – review and editing.

Data curation, Software, Formal analysis, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing.

Investigation.

Resources, Investigation.

Supervision, Investigation.

Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Validation, Investigation, Methodology, Project administration.

Conceptualization, Resources, Software, Supervision, Validation, Methodology, Project administration.

Software, Visualization.

Conceptualization, Resources, Formal analysis, Supervision, Funding acquisition, Validation, Visualization, Writing – original draft, Writing – review and editing.

Conceptualization, Resources, Supervision, Funding acquisition, Validation, Project administration, Writing – review and editing.

Conceptualization, Resources, Software, Formal analysis, Supervision, Funding acquisition, Validation, Methodology, Writing – original draft, Project administration, Writing – review and editing.

Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing – original draft, Project administration, Writing – review and editing.

Ethics

All participants provided informed consent. Ethical approval was obtained for each experiment (Experiment I: METC - NL76920.091.21; Experiment II: METC - NL80331.091.22; Experiment III: Toronto University Health Network Research Ethics Board - 20-5740; Experiment IV: Landesärztekammer Rheinland-Pfalz - 2021-15808_01).

Additional files

MDAR checklist

Data availability

Data and code are available in the Radboud Data Repository: https://doi.org/10.34973/jh6z-yh31 (CC-BY-4.0). The custom TUS-TMS clamp is open-sourced here: https://doi.org/10.5281/zenodo.6517599. The auditory stimuli used for masking and for sound-sham conditions are open-sourced here: https://doi.org/10.5281/zenodo.8374148.

The following datasets were generated:

Kop B, Shamli Oghli Y, Grippe TC, Nandi T, Lefkes J, Meijer SW, Farboud S, Engels M, Hamani M, Null M, Radetz A, Hassan U, Darmani G, Chetverikov A, Ouden HEM, Bergmann TO, Chen R, Verhagen L. 2024. Auditory confounds can drive online effects of transcranial ultrasonic stimulation in humans. Radboud Repository.

Kop B. 2022. benjamin-kop/TUS-TMS-Clamp: TUS-TMS-Clamp. Zenodo.

Kop B. 2023. benjamin-kop/auditory-masking-stimuli: Auditory masking stimuli used in 'Auditory confounds can drive online effects of transcranial ultrasonics stimulation in humans'. Zenodo.

References

  1. Ai L, Mueller JK, Grant A, Eryaman Y, Legon W. Transcranial focused ultrasound for BOLD fMRI signal modulation in humans. 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); Orlando, FL, USA. 2016. [DOI] [PubMed] [Google Scholar]
  2. Airan RD, Butts Pauly K. Hearing out ultrasound neuromodulation. Neuron. 2018;98:875–877. doi: 10.1016/j.neuron.2018.05.031. [DOI] [PubMed] [Google Scholar]
  3. Aubry JF, Bates O, Boehm C, Butts Pauly K, Christensen D, Cueto C, Gélat P, Guasch L, Jaros J, Jing Y, Jones R, Li N, Marty P, Montanaro H, Neufeld E, Pichardo S, Pinton G, Pulkkinen A, Stanziola A, Thielscher A, Treeby B, van ’t Wout E. Benchmark Problems for Transcranial Ultrasound Simulation: Intercomparison of Compressional Wave Models. arXiv. 2022 doi: 10.48550/ARXIV.2202.04552. [DOI] [PMC free article] [PubMed]
  4. Barr DJ, Levy R, Scheepers C, Tily HJ. Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language. 2013;68:255–278. doi: 10.1016/j.jml.2012.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. Journal of Statistical Software. 2015;67:i01. doi: 10.18637/jss.v067.i01. [DOI] [Google Scholar]
  6. Bergmann TO, Hartwigsen G. Inferring causality from noninvasive brain stimulation in cognitive neuroscience. Journal of Cognitive Neuroscience. 2021;33:195–225. doi: 10.1162/jocn_a_01591. [DOI] [PubMed] [Google Scholar]
  7. Bestmann S, Walsh V. Transcranial electrical stimulation. Current Biology. 2017;27:R1258–R1262. doi: 10.1016/j.cub.2017.11.001. [DOI] [PubMed] [Google Scholar]
  8. Brand A, Allen L, Altman M, Hlava M, Scott J. Beyond authorship: attribution, contribution, collaboration, and credit. Learned Publishing. 2015;28:151–155. doi: 10.1087/20150211. [DOI] [Google Scholar]
  9. Braun V, Blackmore J, Cleveland RO, Butler CR. Transcranial ultrasound stimulation in humans is associated with an auditory confound that can be effectively masked. Brain Stimulation. 2020;13:1527–1534. doi: 10.1016/j.brs.2020.08.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Butler CR, Rhodes E, Blackmore J, Cheng X, Peach RL, Veldsman M, Sheerin F, Cleveland RO. Transcranial ultrasound stimulation to human middle temporal complex improves visual motion detection and modulates electrophysiological responses. Brain Stimulation. 2022;15:1236–1245. doi: 10.1016/j.brs.2022.08.022. [DOI] [PubMed] [Google Scholar]
  11. Cain JA, Visagan S, Johnson MA, Crone J, Blades R, Spivak NM, Shattuck DW, Monti MM. Real time and delayed effects of subcortical low intensity focused ultrasound. Scientific Reports. 2021;11:6100. doi: 10.1038/s41598-021-85504-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Capozio A, Chakrabarty S, Astill S. The effect of sound and stimulus expectation on transcranial magnetic stimulation-elicited motor evoked potentials. Brain Topography. 2021;34:720–730. doi: 10.1007/s10548-021-00867-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Conde V, Tomasevic L, Akopian I, Stanek K, Saturnino GB, Thielscher A, Bergmann TO, Siebner HR. The non-transcranial TMS-evoked potential is an inherent source of ambiguity in TMS-EEG studies. NeuroImage. 2019;185:300–312. doi: 10.1016/j.neuroimage.2018.10.052. [DOI] [PubMed] [Google Scholar]
  14. Darmani G, Bergmann TO, Butts Pauly K, Caskey CF, de Lecea L, Fomenko A, Fouragnan E, Legon W, Murphy KR, Nandi T, Phipps MA, Pinton G, Ramezanpour H, Sallet J, Yaakub SN, Yoo SS, Chen R. Non-invasive transcranial ultrasound stimulation for neuromodulation. Clinical Neurophysiology. 2022;135:51–73. doi: 10.1016/j.clinph.2021.12.010. [DOI] [PubMed] [Google Scholar]
  15. Duecker F, de Graaf TA, Jacobs C, Sack AT. Time- and task-dependent non-neural effects of real and sham TMS. PLOS ONE. 2013;8:e73813. doi: 10.1371/journal.pone.0073813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Fisher RJ, Sharott A, Kühn AA, Brown P. Effects of combined cortical and acoustic stimuli on muscle activity. Experimental Brain Research. 2004;157:1–9. doi: 10.1007/s00221-003-1809-6. [DOI] [PubMed] [Google Scholar]
  17. Folloni D, Verhagen L, Mars RB, Fouragnan E, Constans C, Aubry JF, Rushworth MFS, Sallet J. Manipulation of subcortical and deep cortical activity in the primate brain using transcranial focused ultrasound stimulation. Neuron. 2019;101:1109–1116. doi: 10.1016/j.neuron.2019.01.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Fomenko A, Chen KHS, Nankoo JF, Saravanamuttu J, Wang Y, El-Baba M, Xia X, Seerala SS, Hynynen K, Lozano AM, Chen R. Systematic examination of low-intensity ultrasound parameters on human motor cortex excitability and behavior. eLife. 2020;9:e54497. doi: 10.7554/eLife.54497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Ford JM, Gray M, Faustman WO, Roach BJ, Mathalon DH. Dissecting corollary discharge dysfunction in schizophrenia. Psychophysiology. 2007;44:522–529. doi: 10.1111/j.1469-8986.2007.00533.x. [DOI] [PubMed] [Google Scholar]
  20. Furubayashi T, Ugawa Y, Terao Y, Hanajima R, Sakai K, Machii K, Mochizuki H, Shiio Y, Uesugi H, Enomoto H, Kanazawa I. The human hand motor area is transiently suppressed by an unexpected auditory stimulus. Clinical Neurophysiology. 2000;111:178–183. doi: 10.1016/s1388-2457(99)00200-x. [DOI] [PubMed] [Google Scholar]
  21. Gavrilov LR, Tsirulnikov EM. Focused ultrasound as a tool to input sensory information to humans (Review) Acoustical Physics. 2012;58:1–21. doi: 10.1134/S1063771012010083. [DOI] [Google Scholar]
  22. Guo H, Hamilton M, Offutt SJ, Gloeckner CD, Li T, Kim Y, Legon W, Alford JK, Lim HH. Ultrasound produces extensive brain activation via a cochlear pathway. Neuron. 2018;98:1020–1030. doi: 10.1016/j.neuron.2018.04.036. [DOI] [PubMed] [Google Scholar]
  23. Guo H, Offutt SJ, Hamilton Ii M, Kim Y, Gloeckner CD, Zachs DP, Alford JK, Lim HH. Ultrasound does not activate but can inhibit in vivo mammalian nerves across a wide range of parameters. Scientific Reports. 2022;12:2182. doi: 10.1038/s41598-022-05226-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hassan U, Pillen S, Zrenner C, Bergmann TO. The Brain Electrophysiological recording & STimulation (BEST) toolbox. Brain Stimulation. 2022;15:109–115. doi: 10.1016/j.brs.2021.11.017. [DOI] [PubMed] [Google Scholar]
  25. Ilic TV, Pötter-Nerger M, Holler I, Siebner HR, Ilic NV, Deuschl G, Volkmann J. Startle stimuli exert opposite effects on human cortical and spinal motor system excitability in leg muscles. Physiological Research. 2011;60:S101–S106. doi: 10.33549/physiolres.932182. [DOI] [PubMed] [Google Scholar]
  26. IT’IS Foundation 2022. Tissue properties database V4.1. IT’IS Foundation. [DOI]
  27. Johnstone A, Nandi T, Martin E, Bestmann S, Stagg C, Treeby B. A range of pulses commonly used for human transcranial ultrasound stimulation are clearly audible. Brain Stimulation. 2021;14:1353–1355. doi: 10.1016/j.brs.2021.08.015. [DOI] [PubMed] [Google Scholar]
  28. Kop B, Shamli Oghli Y, Grippe TC, Nandi T, Lefkes J, Meijer SW, Farboud S, Engels M, Hamani M, Null M, Radetz A, Hassan U, Darmani G, Chetverikov A, Ouden H, Bergmann TO, Chen R, Verhagen L. Auditory confounds can drive online effects of transcranial ultrasonic stimulation in humans. Radboud University; 2024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kühn AA, Sharott A, Trottenberg T, Kupsch A, Brown P. Motor cortex inhibition induced by acoustic stimulation. Experimental Brain Research. 2004;158:120–124. doi: 10.1007/s00221-004-1883-4. [DOI] [PubMed] [Google Scholar]
  30. Lee W, Kim HC, Jung Y, Chung YA, Song IU, Lee JH, Yoo SS. Transcranial focused ultrasound stimulation of human primary visual cortex. Scientific Reports. 2016;6:34026. 10.1038/srep34026. doi: 10.1038/srep34026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Legon W, Ai L, Bansal P, Mueller JK. Neuromodulation with single-element transcranial focused ultrasound in human thalamus. Human Brain Mapping. 2018a;39:1995–2006. doi: 10.1002/hbm.23981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Legon W, Bansal P, Tyshynsky R, Ai L, Mueller JK. Transcranial focused ultrasound neuromodulation of the human primary motor cortex. Scientific Reports. 2018b;8:10007. doi: 10.1038/s41598-018-28320-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Liu C, Yu K, Niu X, He B. Transcranial focused ultrasound enhances sensory discrimination capability through somatosensory cortical excitation. Ultrasound in Medicine & Biology. 2021;47:1356–1366. doi: 10.1016/j.ultrasmedbio.2021.01.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Menz MD, Oralkan O, Khuri-Yakub PT, Baccus SA. Precise neural stimulation in the retina using focused ultrasound. The Journal of Neuroscience. 2013;33:4550–4560. doi: 10.1523/JNEUROSCI.3521-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Mohammadjavadi M, Ye PP, Xia A, Brown J, Popelka G, Pauly KB. Elimination of peripheral auditory pathway activation does not affect motor responses from ultrasound neuromodulation. Brain Stimulation. 2019;12:901–910. doi: 10.1016/j.brs.2019.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Murphy KR, Farrell JS, Gomez JL, Stedman QG, Li N, Leung SA, Good CH, Qiu Z, Firouzi K, Butts Pauly K, Khuri-Yakub BPT, Michaelides M, Soltesz I, de Lecea L. A tool for monitoring cell type-specific focused ultrasound neuromodulation and control of chronic epilepsy. PNAS. 2022;119:e2206828119. doi: 10.1073/pnas.2206828119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Nakajima K, Osada T, Ogawa A, Tanaka M, Oka S, Kamagata K, Aoki S, Oshima Y, Tanaka S, Konishi S. A causal role of anterior prefrontal-putamen circuit for response inhibition revealed by transcranial ultrasound stimulation in humans. Cell Reports. 2022;40:111197. doi: 10.1016/j.celrep.2022.111197. [DOI] [PubMed] [Google Scholar]
  38. Osada T, Nakajima K, Ogawa A, Oka S, Kamagata K, Aoki S, Oshima Y, Tanaka S, Konishi S. Distributions of cortical depth of the index finger region in the M1: A representative depth parameter for transcranial ultrasound stimulation. Brain Stimulation. 2022;15:1348–1350. doi: 10.1016/j.brs.2022.09.012. [DOI] [PubMed] [Google Scholar]
  39. O’Shea J, Walsh V. Transcranial magnetic stimulation. Current Biology. 2007;17:R196–R199. doi: 10.1016/j.cub.2007.01.030. [DOI] [PubMed] [Google Scholar]
  40. Polanía R, Nitsche MA, Ruff CC. Studying and modifying brain function with non-invasive brain stimulation. Nature Neuroscience. 2018;21:174–187. doi: 10.1038/s41593-017-0054-4. [DOI] [PubMed] [Google Scholar]
  41. R Development Core Team . Vienna, Austria: R Foundation for Statistical Computing; 2021. https://www.R-project.org [Google Scholar]
  42. Sato T, Shapiro MG, Tsao DY. Ultrasonic neuromodulation causes widespread cortical activation via an indirect auditory mechanism. Neuron. 2018;98:1031–1041. doi: 10.1016/j.neuron.2018.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Siebner HR, Funke K, Aberra AS, Antal A, Bestmann S, Chen R, Classen J, Davare M, Di Lazzaro V, Fox PT, Hallett M, Karabanov AN, Kesselheim J, Beck MM, Koch G, Liebetanz D, Meunier S, Miniussi C, Paulus W, Peterchev AV, Popa T, Ridding MC, Thielscher A, Ziemann U, Rothwell JC, Ugawa Y. Transcranial magnetic stimulation of the brain: What is stimulated? - a consensus and critical position paper. Clinical Neurophysiology. 2022;140:59–97. doi: 10.1016/j.clinph.2022.04.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Silvanto J, Muggleton NG, Cowey A, Walsh V. Neural adaptation reveals state-dependent effects of transcranial magnetic stimulation. The European Journal of Neuroscience. 2007;25:1874–1881. doi: 10.1111/j.1460-9568.2007.05440.x. [DOI] [PubMed] [Google Scholar]
  45. Sorum B, Rietmeijer RA, Gopakumar K, Adesnik H, Brohawn SG. Ultrasound activates mechanosensitive TRAAK K+ channels through the lipid membrane. PNAS. 2021;118:e2006980118. doi: 10.1073/pnas.2006980118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Thielscher A, Antunes A, Saturnino GB. Field modeling for transcranial magnetic stimulation: A useful tool to understand the physiological effects of TMS?. 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC; Milan. 2015. pp. 222–225. [DOI] [PubMed] [Google Scholar]
  47. Tran DMD, McNair NA, Harris JA, Livesey EJ. Expected TMS excites the motor system less effectively than unexpected stimulation. NeuroImage. 2021;226:117541. doi: 10.1016/j.neuroimage.2020.117541. [DOI] [PubMed] [Google Scholar]
  48. Treeby BE, Cox BT. k-Wave: MATLAB toolbox for the simulation and reconstruction of photoacoustic wave fields. Journal of Biomedical Optics. 2010;15:021314. doi: 10.1117/1.3360308. [DOI] [PubMed] [Google Scholar]
  49. Tyler WJ, Tufail Y, Finsterwald M, Tauchmann ML, Olson EJ, Majestic C. Remote excitation of neuronal circuits using low-intensity, low-frequency ultrasound. PLOS ONE. 2008;3:e3511. doi: 10.1371/journal.pone.0003511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Tyler WJ, Lani SW, Hwang GM. Ultrasonic modulation of neural circuit activity. Current Opinion in Neurobiology. 2018;50:222–231. doi: 10.1016/j.conb.2018.04.011. [DOI] [PubMed] [Google Scholar]
  51. van der Groen O, Wenderoth N. Transcranial random noise stimulation of visual cortex: stochastic resonance enhances central mechanisms of perception. The Journal of Neuroscience. 2016;36:5289–5298. doi: 10.1523/JNEUROSCI.4519-15.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Verhagen L, Gallea C, Folloni D, Constans C, Jensen DE, Ahnine H, Roumazeilles L, Santin M, Ahmed B, Lehericy S, Klein-Flügge MC, Krug K, Mars RB, Rushworth MF, Pouget P, Aubry JF, Sallet J. Offline impact of transcranial focused ultrasound on cortical activation in primates. eLife. 2019;8:e40541. doi: 10.7554/eLife.40541. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Weinreb E, Moses E. Mechanistic insights into ultrasonic neurostimulation of disconnected neurons using single short pulses. Brain Stimulation. 2022;15:769–779. doi: 10.1016/j.brs.2022.05.004. [DOI] [PubMed] [Google Scholar]
  54. Wessel JR, Aron AR. Unexpected events induce motor slowing via a brain mechanism for action-stopping with global suppressive effects. The Journal of Neuroscience. 2013;33:18481–18491. doi: 10.1523/JNEUROSCI.3456-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Xia X, Fomenko A, Nankoo JF, Zeng K, Wang Y, Zhang J, Lozano AM, Chen R. Time course of the effects of low-intensity transcranial ultrasound on the excitability of ipsilateral and contralateral human primary motor cortex. NeuroImage. 2021;243:118557. doi: 10.1016/j.neuroimage.2021.118557. [DOI] [PubMed] [Google Scholar]
  56. Yoo S, Mittelstein DR, Hurt RC, Lacroix J, Shapiro MG. Focused ultrasound excites cortical neurons via mechanosensitive calcium accumulation and ion channel amplification. Nature Communications. 2022;13:493. doi: 10.1038/s41467-022-28040-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Zeng K, Darmani G, Fomenko A, Xia X, Tran S, Nankoo J-F, Shamli Oghli Y, Wang Y, Lozano AM, Chen R. Induction of human motor cortex plasticity by theta burst transcranial ultrasound stimulation. Annals of Neurology. 2022;91:238–252. doi: 10.1002/ana.26294. [DOI] [PubMed] [Google Scholar]

eLife assessment

Jonas Obleser 1

This important multicenter study provides convincing evidence that the auditory noise emitted during online transcranial ultrasound stimulation (TUS) protocols can pose a considerable confound and is able to explain corticospinal excitability changes as measured with Motor Evoked Potentials (MEP). The findings lay the ground for future studies optimising protocols and control conditions to leverage TUS as a meaningful experimental and clinical tool. A clear strength of the study is the multitude of control conditions (i.e., control sites, acoustic masking, acoustic stimulation). These findings will be of interest to neuroscience researchers using brain stimulation approaches.

Reviewer #1 (Public Review):

Anonymous

Summary: The authors have used transcranial magnetic stimulation (TMS) and motor evoked potentials (MEPs) to determine whether the peripheral auditory confound arising from TUS can drive motor inhibition on its own. They gathered data from three international centers in four experiments testing:

  • - Experiment 1 (n = 11), two different TUS durations and intensities under sound masking or without.

  • - Experiment 2 (n = 27) replicates Exp 1 with different intensities and a fixed TUS duration of 500ms.

  • - Experiment 3 (n = 16) studies the effect of various auditory stimuli testing different duration and pitches while applying TUS in an active site, on-target or no TUS.

  • - Experiment 4 (n = 12) uses an inactive control site to reproduce the sound without effective neuromodulation, while manipulating the volume of the auditory confound at different US intensities with and without continuous sound masking.

Strengths: This study comes from three very strong groups in noninvasive brain stimulation with long experience in neuromodulation, multimodal and electrophysiological recordings. Although complex to understand due to slightly different methodologies across centers, this study provides quantitative evidence relating to the potential auditory confound in online TUS. The results are in line with reductions seen in motor-evoked responses during online 1kHz TUS, and remarkable efforts were made to isolate peripheral confounds from actual neuromodulation factors, highlighting the confounding effect of sound itself.

Weaknesses: However, there are some points that need attention. In my view, the most important are:

1. Despite the main conclusion of the authors stating that there is no dose-response effect of TUS on corticospinal inhibition, the point estimates for change in MEP and Ipssa indicate a more complex picture. The present data and analyses cannot rule out that there is a dose-response function which cannot be fully attributed to difference in sound (since the relationship in inversed, lower intracranial Isppa leads to higher MEP decrease). These results suggest that dose-response function needs to be further studied in future studies.

2. Other methods to test or mask the auditory confound are possible (e.g., smoothed ramped US wave) which could substantially solve part of the sound issue in future studies or experiments in deaf animals etc.

Reviewer #2 (Public Review):

Anonymous

Summary:

This study aims to test auditory confounds during transcranial ultrasound stimulation (TUS) protocols that rely on audible frequencies. In several experiments, the authors show that a commonly observed suppression of motor-evoked potentials (MEP) during TUS can be explained by acoustic stimulation. For instance, not only target TUS, but also stimulation of a control site and acoustic stimulation led to suppressed MEP.

The authors have convincingly addressed all of my comments and provided useful additional details. I believe that this is a strong study that will impact the field. Thanks also for making the sound stimuli open-source.

eLife. 2024 Aug 27;12:RP88762. doi: 10.7554/eLife.88762.3.sa3

Author response

Benjamin R Kop 1, Yazan Shamli Oghli 2, Talyta C Grippe 3, Tulika Nandi 4, Judith Lefkes 5, Sjoerd W Meijer 6, Soha Farboud 7, Marwan Engels 8, Michelle Hamani 9, Melissa Null 10, Angela Radetz 11, Umair Hassan 12, Ghazaleh Darmani 13, Andrey Chetverikov 14, Hanneke EM den Ouden 15, Til Ole Bergmann 16, Robert Chen 17, Lennart Verhagen 18

We are grateful for the insightful suggestions and comments provided by the reviewers. Your constructive feedback has been valuable, and we are thankful for the opportunity to address each point.

We appreciate both reviewers’ recognition of our devotion to rigorous methodology and experimental control in this study, as evidenced by the comments: “remarkable efforts were made to isolate peripheral confounds”, “a clear strength of the study is the multitude of control conditions … that makes results very convincing”, and “thorough design of the study”. Indeed, we hope to have provided more than solid, but compelling evidence for sound-driven motor inhibitory effects of online TUS. We hope that this will be reflected in the assessment. Our conclusions are supported by multiple experiments across multiple institutions using exemplary experimental control including (in)active controls and multiple sound-sham conditions. This contrasts with the sole use of flip-over sham or no-stimulation conditions used in the majority of work to date. Indeed, the current study communicates that substantiated inferences on the efficacy of ultrasonic neuromodulation cannot be made under insufficient experimental control.

In response to the reviewers' comments, we have substantially changed our manuscript. Specifically, we have open-sourced the auditory masking stimuli and specified them in better detail in the text, we have improved the figures to reflect the data more closely, we have clarified the intracranial doseresponse relationship, we have elaborated in the introduction, and we have further discussed the possibility of direct neuromodulation. We hope that you agree these changes have helped to substantially improve the manuscript.

Public reviews

(1.1) Despite the main conclusion of the authors stating that there is no dose-response effects of TUS on corticospinal inhibition, both the comparison of Isppa and MEP decrease for Exp 1 and 2, and the linear regression between MEP decrease (relative to baseline) and the estimated Isppa are significant, arguing the opposite, that there is a dose-response function which cannot be fully attributed to difference in sound (since the relationship in inversed, lower intracranial Isppa leads to higher MEP decrease). These results suggest that doseresponse function needs to be further studied in future studies.

We thank the reviewer for bringing up this point. While we are convinced our study provides no evidence for a direct neuromodulatory dose-response relationship, we have realized that the manuscript could benefit from improved clarity on this point.

A dose-response relationship between TUS intensity and motor cortical excitability was assessed by manipulating free-water Isppa (Figure 4C). Here, no significant effect of free-water stimulation intensity was observed for Experiment I or II, thus providing no evidence for a dose-response relationship (Section 3.2). To aid in clarity, ‘N.S.’ has been added to Figure 4C in the revised manuscript.

However, it is likely that the efficacy of TUS would depend on realized intracranial intensity, which we estimated with 3D simulations for on-target stimulation. These simulations resulted in an estimated intracranial intensity for each applied free-water intensity (i.e., 6.35 and 19.06 W/cm2), for each participant. We then tested whether inter-individual differences in intracranial intensity during on-target TUS affected MEP amplitude. We have realized that the original visualization used to display these data and its explanation was unintuitive. Therefore, we have completely revised Supplementary Figure 6. Because of the substantial length of this section, we have not copied it here. Please see the Supplementary material for the implemented improvements.

In brief, we now show MEP amplitudes on the y-axis, rather than expressing values a %change. This plot depicts how individuals with higher intracranial intensities during ontarget TUS exhibit higher MEP amplitudes. However, this same relationship is observed for active control and sound-sham conditions. If there were a direct neuromodulatory doseresponse relationship of TUS, this would be reflected as the difference between on-target and control conditions changing as the estimated intracranial intensity increases. This was not the case. Further, the fact that the difference between on-target stimulation and baseline changes across intracranial intensities is notable, but this occurs to an equal degree in the control conditions. Therefore, these data cannot be interpreted as evidence for a doseresponse relationship.

We hope the changes in Supplementary Figure 6 will make it clear that there is no evidence for direct intracranial dose-response effects.

(1.2) Other methods to test or mask the auditory confound are possible (e.g., smoothed ramped US wave) which could substantially solve part of the sound issue in future studies or experiments in deaf animals etc...

We agree with the reviewer’s statement. We aimed to replicate the findings of online motor cortical inhibition reported in prior work using a 1000 Hz square wave modulation frequency. While ramping can effectively reduce the auditory confound, as noted in the discussion, this is not feasible for the short pulse durations (0.1-0.3 ms) employed in the current study (Johnstone et al., 2021). We have further clarified this point in the methods section of the revised manuscript as follows:

“While ramping the pulses can in principle mitigate the auditory confound (Johnstone et al., 2021; Mohammadjavadi et al., 2019), doing so for such short pulse durations ( <= 0.3 ms) is not effective. Therefore, we used a rectangular pulse shape to match prior work.”

Mitigation of the auditory confound by testing deaf subjects is a valid approach, and has now been added to the revised manuscript in the discussion as follows:

“Alternative approaches could circumvent auditory confounds by testing deaf subjects, or perhaps more practically by ramping the ultrasonic pulse to minimize or even eliminate the auditory confound.”

(1.3) Dose-response function is an extremely important feature for a brain stimulation technique. It was assessed in Exp II by computing the relationship between the estimated intracranial intensities and the modulation of corticospinal excitability (Fig. 3b, 3c). It is not clear why data from Experiment I could not be integrated in a global intracranial dose-response function to explore wider ranges of intracranial intensities and MEP variability.

We chose not to combine data from Experiment 1 in a global intracranial dose-response function because TUS was applied at different fundamental frequencies and focal depths (Experiment I: 500 kHz, 35 mm; Experiment II: 250 kHz, 28 mm). We have now explicitly communicated this under Supplementary Figure 6:

“It was not appropriate to combine data from Experiments I and II given the different fundamental frequencies and stimulation depths applied… we ran simple linear models for Experiment II, which had a sufficient sample size (n = 27) to assess inter-individual variability.”

(1.4) Furthermore, the dose response function as computed with the MEP change relative to baseline shows a significant effect (6.35W/cm2) or a trend (19.06 W/cm2) for a positive linear relationship. This comparison cannot disentangle the auditory confound from the pure neuromodulatory effect but given the direction of the relationship (lower Isppa associated with larger neuromodulatory effect), it is unlikely that it is driven by sound. This relationship is absent for the Active control condition or the Sound Sham condition, more or less matched for peripheral confound. This needs to be further discussed.

Please refer to point 1.1

(1.5) The clear auditory confound arises from TUS pulsing at audible frequencies, which can be highly subject to inter-individual differences. Did the authors individually titrate the auditory mask to account for this intra- and inter-individual variability in auditory perception?

In Experiments I-III, the auditory mask was identical between participants. In Experiment IV, the auditory mask volume and signal-to-noise ratio were adjusted per participant. In the discussion we recommend individualized mask titration. However, we do note that masking successfully blinded participants in Experiment II, despite using uniform masking stimuli (Supplementary Figure 5).

(1.6) How different is the masking quality when using bone-conducting headphones (e.g., Exp. 1) compared to in-ear headphones (e.g., Exp. 2)?

In our experience, bone conducting headphones produce a less clear, fuzzier, sound than in-ear headphones. However, in-ear headphones block the ear canal and likely result in the auditory confound being perceived as louder. We have included this information in the discussion of the revised manuscript:

“Titrating auditory mask quality per participant to account for intra- and inter-individual differences in subjective perception of the auditory confound would be beneficial. Here, the method chosen for mask delivery must be considered. While bone-conducting headphones align with the bone conduction mechanism of the auditory confound, they might not deliver sound as clearly as in-ear headphones or speakers. Nevertheless, the latter two rely on airconducted sound. Notably, in-ear headphones could even amplify the perceived volume of the confound by obstructing the ear canal.”

(1.7) I was not able to find any report on the blinding efficacy of Exp. 1. Do the authors have some data on this?

We do not have blinding data available for Experiment I. Following Experiment I, we decided it would be useful to include such an assessment in Experiment II.

(1.8) Was the possibility to use smoothed ramped US wave form ever tested as a control condition in this set of studies, to eventually reduce audibility? For such fast PRF, for fast PRF, the slope would still need to be steep to stimulate the same power (AUC), it might not be as efficient.

We indeed tested smoothing (ramping) the waveform. There was no perceptible impact on the auditory confound volume. Indeed, prior research has also indicated that ramping over

such short pulse durations is not effective (Johnstone et al., 2021). Taken together, we chose to continue with a square wave modulation as in prior TUS-TMS studies. We have updated the methods section of the manuscript with the following:

“While ramping the pulses can in principle mitigate the auditory confound (Johnstone et al., 2021; Mohammadjavadi et al., 2019), doing so for such short pulse durations ( <= 0.3 ms) is not effective. Therefore, we used a rectangular pulse shape to match prior work.”

Importantly, our research shows that auditory co-stimulation can confound effects on motor excitability, and this likely occurred in multiple seminal TUS studies. While some preliminary work has been done on the efficacy of ramping in humans, future work is needed to determine what ramp shapes and lengths are optimal for reducing the auditory confound.

(1.9) There are other models or experiments that need to be discussed in order to clearly disassociate the TUS effect from the auditory confound effect, for instance, testing deaf animal models or participants, or experiments with multi-region recordings (to rule out the effects of the dense structural connectivity between the auditory cortex and the motor cortex).

The suggestion to consider multi-region recording in future experiments is important. Indeed, the effects of the auditory confound are expected to vary between brain regions. In the primary motor cortex, we observe a learned inhibition, which is perhaps supported by dense structural connectivity with the auditory system. In contrast, in perceptual areas such as the occipital cortex, one might expect tuned attentional effects in response to the auditory cue. We suggest that it is likely that the impact of the auditory confound also operates on a more global network level. It is reasonable to propose that, in a cognitive task for example, the confound will affect task performance and related brain activity, ostensibly regardless of the extent of direct structural connectivity between the auditory cortex and the (stimulated) region of interest.

Regarding the testing of deaf subjects, this has been included in the revised discussion as follows:

“Alternative approaches could circumvent auditory confounds by testing deaf subjects, or perhaps more practically by ramping the ultrasonic pulse to minimize or even eliminate the auditory confound.”

(1.10) The concept of stochastic resonance is interesting but traditionally refers to a mechanism whereby a particular level of noise actually enhances the response of non-linear systems to weak sensory signals. Whether it applies to the motor system when probed with suprathreshold TMS intensities is unclear. Furthermore, whether higher intensities induce higher levels of noise is not straightforward neither considering the massive amount of work coming from other NIBS studies in particular. Noise effects are indeed a function of noise intensity, but exhibit an inverted U-shape dose-response relationship (Potok et al., 2021, eNeuro). In general SR is rather induced with low stimulation intensities in particular in perceptual domain (see Yamasaki et al., 2022, Neuropsychologia). In the same order of ideas, did the authors compare inter-trials variability across the different conditions?

We thank the reviewer for these insightful remarks. Indeed, stochastic resonance is a concept first formalized in the sensory domain. Recently, the same principles have been shown to apply in other domains as well. For example, transcranial electric noise (tRNS) exhibits similar stochastic resonance principles as sensory noise (Van Der Groen & Wenderoth, 2016). Indeed, tRNS has been applied to many cortical targets, including the motor system. In the current manuscript, we raise the question of whether TUS might engage with neuronal activity following principles similar to tRNS. One prediction of this framework would be that TUS might not modulate excitation/inhibition balance overall, but instead exhibit an inverted U-shape dose-dependent relationship with stochastic noise. Please note, we do not use the ‘suprathreshold TMS intensity’ to quantify whether noise could bring a sub-threshold input across the detection threshold, nor whether it could bring a sub-threshold output across the motor threshold. Instead, we use the MEP read-out to estimate the temporally varying excitability itself. We argue that MEP autocorrelation captures the mixture of temporal noise and temporal structure in corticospinal excitability. Building on the non-linear response of neuronal populations, low stochastic noise might strengthen weakly present excitability patterns, while high stochastic noise might override pre-existing excitability. It is therefore not the overall MEP amplitude, but the MEP timeseries that is of interest to us. Here, we observe a non-linear dose-dependent relationship, matching the predicted inverted U-shape. Importantly, we did not intend to assume stochastic resonance principles in the motor domain as a given. We have now clarified in the revised manuscript that we propose a putative framework and regard this as an open question:

“Indeed, human TUS studies have often failed to show a global change in behavioral performance, instead finding TUS effects primarily around the perception threshold where noise might drive stochastic resonance (Butler et al., 2022; Legon et al., 2018). Whether the precise principles of stochastic resonance generalize from the perceptual domain to the current study is an open question, but it is known that neural noise can be introduced by brain stimulation (Van Der Groen & Wenderoth, 2016). It is likely that this noise is statedependent and might not exceed the dynamic range of the intra-subject variability (Silvanto et al., 2007). Therefore, in an exploratory analysis, we exploited the natural structure in corticospinal excitability that exhibits as a strong temporal autocorrelation in MEP amplitude.”

Following the above reasoning, we felt it critical to estimate noise in the timeseries, operationalized as a t-1 autocorrelation, rather than capture inter-trial variability that ignores the timeseries history and requires data aggregation thereby reducing statistical power. Importantly, we would expect the latter index to capture global variability, putatively masking the temporal relationships which we were aiming to test. The reviewer raises an interesting option, inviting us to wonder if inter-trial variability might be sensitive enough, nonetheless. To this end, we compared inter-trial variability as suggested. This was achieved by first calculating the inter-trial variability for each condition, and then running a three-way repeated measures ANOVA on these values with the independent variables matching our autocorrelation analyses, namely, procedure (on-target/active control)intensity (6.35/19.06)masking (no mask/masked). This analysis did not reveal any significant interactions or main effects.

Author response table 1.

Effect DFn DFd F P
intensity 1 26 2.01136951 0.1679998
procedure 1 26 1.51481729 0.2294241
masking 1 26 0.55249790 0.4639630
intensity:procedure 1 26 0.05069027 0.8236267
intensity:masking 1 26 0.17466900 0.6794268
procedure:masking 1 26 0.27552687 0.6040957
intensity:procedure:masking 1 26 1.96443059 0.1728654

(1.11) State-dependency/Autocorrelations: These values were extracted from Exp2 which has baseline trials. Can the authors provide autocorrelation values at baseline, with and without auditory mask? Can the authors comment on the difference between the autocorrelation profiles of the active TUS condition at 6.35W/cm2 or at 19.06W/cm2. They should somehow be similar to my understanding. Besides, the finding that TUS induces noise only when sound is present and at lower intensities is not well discussed.

In the revised manuscript, we have now included baseline in the figure (Figure 4D). Regarding baseline with and without a mask, we must clarify that baseline involves only TMS (no mask), and sham involves TMS + masking stimulus (masked).

The dose-dependent relationship of TUS intensity with autocorrelation is critical. One possible observation would have been that TUS at both intensities decreased autocorrelation, with higher intensities evoking a greater reduction. Here, we would have concluded that TUS introduced noise in a linear fashion.

However, we observed that lower-intensity TUS in fact strengthened pre-existing temporal patterns in excitability (higher autocorrelation), while during higher-intensity TUS these patterns were overridden (lower autocorrelation). This non-linear relationship is not unexpected, given the non-linear responses of neurons.

If this non-linear dependency is driven by TUS, one could expect it to be present during conditions both with and without auditory masking. However, the preparatory inhibition effect of TUS likely depends on the salience of the cue, that is, the auditory confound. In trials without auditory masking, the salience of the confound in highly dependent on (transmitted) intensity, with higher intensities being perceived as louder. In contrast, when trials are masked, the difference in cue salience between lower and higher intensity stimulation in minimized. Therefore, we would expect for any nuanced dose-dependent direct TUS effect to be best detectable when the difference in dose-dependent auditory confound perception is minimized via masking. Indeed, the dose-dependent effect of TUS on autocorrelation is most prominent when the auditory confound is masked.

“In sum, these preliminary exploratory analyses could point towards TUS introducing temporally specific neural noise to ongoing neural dynamics in a dose-dependent manner, rather than simply shifting the overall excitation-inhibition balance. One possible explanation for the discrepancy between trials with and without auditory masking is the difference in auditory confound perception, where without masking the confound’s volume differs between intensities, while with masking this difference is minimized. Future studies might consider designing experiments such that temporal dynamics of ultrasonic neuromodulation can be captured more robustly, allowing for quantification of possible state-dependent or nondirectional perturbation effects of stimulation.”

(1.12) Statistical considerations. Data from Figure 2 are considered in two-by-two comparisons. Why not reporting the ANOVA results testing the main effect of TUS/Auditory conditions as done for Figure 3. Statistical tables of the LMM should be reported.

Full-factorial analyses and main effects for TUS/Auditory conditions are discussed from Section 3.2 onwards. These are the same data supporting Figure 2 (now Figure 3). We would like to note that the main purpose of Figure 2 is to demonstrate to the reader that motor inhibition was observed, thus providing evidence that we replicated motor inhibitory effects of prior studies. A secondary purpose is to visually represent the absence of direct and spatially specific neuromodulation. However, the appropriate analyses to demonstrate this are reported in following sections, from Section 3.2 onwards, and we are concerned that mentioning these analyses earlier will negatively impact comprehensibility.

Statistical tables of the LMMs are provided within the open-sourced data and code reported at the end of the paper, embedded within the output which is accessible as a pdf (i.e., analysis/analysis.pdf).

(1.13) Startle effects: The authors dissociate two mechanisms through which sound cuing can drive motor inhibition, namely some compensatory expectation-based processes or the evocation of a startle response. I find the dissociation somehow artificial. Indeed, it is known that the amplitude of the acoustic startle response habituates to repetitive stimulation. Therefore, sensitization can well explain the stabilization of the MEP amplitude observed after a few trials.

Thank you for bringing this to our attention. Indeed, an acoustic startle response would habituate over repetitive stimulation. A startle response would result in MEP amplitude being significantly altered in early trials. As the participant would habituate to the stimulus, the startle response would decrease. MEP amplitude would then return to baseline levels. However, this is not the pattern we observe. An alternative possibility is that participants learn the temporal contingency between the stimulus and TMS. Here, compensatory expectation-based change in MEP amplitude would be observed. In this scenario, there would be no change in MEP amplitude during early trials because the stimulus has not yet become informative of the TMS pulse timing. However, as participants learn how to predict TMS timing by the stimulus, MEP amplitude would decrease. This is also the pattern we observe in our data. We have clarified these alternatives in the revised manuscript as follows:

“Two putative mechanisms through which sound cuing may drive motor inhibition have been proposed, positing either that explicit cueing of TMS timing results in compensatory processes that drive MEP reduction (Capozio et al., 2021; Tran et al., 2021), or suggesting the evocation of a startle response that leads to global inhibition (Fisher et al., 2004; Furubayashi et al., 2000; Ilic et al., 2011; Kohn et al., 2004; Wessel & Aron, 2013). Critically, we can dissociate between these theories by exploring the temporal dynamics of MEP attenuation. One would expect a startle response to habituate over time, where MEP amplitude would be reduced during startling initial trials, followed by a normalization back to baseline throughout the course of the experiment as participants habituate to the starling stimulus. Alternatively, if temporally contingent sound-cueing of TMS drives inhibition, MEP amplitudes should decrease over time as the relative timing of TUS and TMS is being learned, followed by a stabilization at a decreased MEP amplitude once this relationship has been learned.”

(1.14) Can the authors further motivate the drastic change in intensities between Exp1 and 2? Is it due to the 250-500 carrier difference? It this coming from the loss power at 500kHz?

The change in intensities between Experiments I and II was not an intentional experimental manipulation. Following completion of data acquisition, our TUS system received a firmware update that differentially corrected the 250 kHz and 500 kHz stimulation intensities. In this manuscript, we report the actual free-water intensities applied during our experiments.

(1.15) Exp 3: Did 4 separate blocks of TUS-TMS and normalized for different TMS intensities used with respect to baseline. But how different was it. Why adjusting and then re adjusting intensities?

The TMS intensities required to evoke a 1 mV MEP under the four sound-sham conditions significantly differed from the intensities required for baseline. In the revised appendix, we have now included a figure depicting the TMS intensities for these conditions, as well as statistical tests demonstrating each condition required a significantly higher TMS intensity than baseline.

TMS intensities were re-adjusted to avoid floor effects when assessing the efficacy of ontarget TUS. Sound-sham conditions themselves attenuate MEP amplitude. This is also evident from the higher TMS intensities required to evoke a 1 mV MEP under these conditions. If direct neuromodulation by TUS would have further decreased MEP amplitude, the concern was that effects might not be detectible within such a small range of MEP amplitudes.

(1.16) In Exp 4, TUS targeted the ventromedial WM tract. Since direct electrical stimulation on white matter pathways within the frontal lobe can modulate motor output probably through dense communication along specific white matter pathways (e.g., Vigano et al., 2022, Brain), how did the authors ensure that this condition is really ineffective? Furthermore, the stimulation might have covered a lot more than just white matter. Acoustic and thermal simulations would be helpful here as well.

Thank you for pointing out this possibility. Ultrasonic and electrical stimulation have quite distinct mechanisms of action. Therefore, it is challenging to directly compare these two approaches. There is a small amount of evidence that ultrasonic neuromodulation of white matter tracts is possible. However, the efficacy of white matter modulation is likely much lower, given the substantially lesser degree of mechanosensitive ion channel expression in white matter as opposed to gray matter (Sorum et al., 2020, PNAS). Further, recent work has indicated that ultrasonic neuromodulation of myelinated axonal bundles occurs within the thermal domain (Guo et al., 2022, SciRep), which is not possible with the intensities administered in the current study. Nevertheless, based on Experiment IV in isolation, it cannot be definitively excluded that there TUS induced direct neuromodulatory effects in addition to confounding auditory effects. However, Experiment IV does not possess sufficient inferential power on its own and must be interpreted in tandem with Experiments I-III. Taken together with those findings, it is unlikely that a veridical neuromodulation effect is seen here, given the equivalent or lower stimulation intensities, the substantially deeper stimulation site, and the absence of an additional control condition in Experiment IV. This likelihood is further decreased by the fact that inhibitory effects under masking descriptively scale with the audibility of TUS.

Off-target effects such as unintended co-stimulation of gray matter when targeting white matter is always an important factor to consider. Unfortunately, individualized simulations for Experiment IV are not available. However, the same type of transducer and fundamental frequency was used as in Experiment II, for which we do have simulations. Given the size of the focus and the very low in-situ intensities extending beyond the main focal point, it is incredibly unlikely that effective stimulation was administered outside white matter in a meaningful number of participants. Nevertheless, the reviewer is correct that this can only be directly confirmed with simulations, which remain infeasible due to both technical and practical constraints. We have included the following in the revised manuscript:

“The remaining motor inhibition observed during masked trials likely owes to, albeit decreased, persistent audibility of TUS during masking. Indeed, MEP attenuation in the masked conditions descriptively scale with participant reports of audibility. This points towards a role of auditory confound volume in motor inhibition (Supplementary Fig. 8). Nevertheless, one could instead argue that evidence for direct neuromodulation is seen here. This unlikely for a number of reasons. First, white matter contains a lesser degree of mechanosensitive ion channel expression and there is evidence that neuromodulation of these tracts may occur primarily in the thermal domain (Guo et al., 2022; Sorum et al., 2021). Second, Experiment IV lacks sufficient inferential power in the absence of an additional control and must therefore be interpreted in tandem with Experiments I-III. These experiments revealed no evidence for direct neuromodulation using equivalent or higher stimulation intensities and directly targeting grey matter while also using multiple control conditions. Therefore, we propose that persistent motor inhibition during masked trials owes to continued, though reduced, audibility of the confound (Supplementary Fig. 8). However, future work including an additional control (site) is required to definitively disentangle these alternatives.”

(1.17) Still for Exp 4. the rational for the 100% MSO or 120% or rMT is not clear, especially with respect to Exp 1 and 2. Equipment is similar as well as raw MEPs amplitudes, therefore the different EMG gain might have artificially increased TMS intensities. Could it have impacted the measured neuromodulatory effects?

Experiment IV was conducted independently at a different institute than Experiments I-II. In contrast to Experiments I-II, a gel pad was used to couple TUS to the participant’s head. The increased TMS-to-cortex distance introduced by the gel pad necessitates higher TMS intensities to compensate for the increased offset. In fact, in 9/12 participants, the intended intensity at 120% rMT exceeded the maximum stimulator output. In those cases, we defaulted to the maximum stimulator output (i.e., 100% MSO). We have clarified in the revised supplementary material as follows:

“We aimed to use 120% rMT (n = 3). However, if this intensity surpassed 100% MSO, we opted for 100% MSO instead (n = 9). The mean %MSO was 94.5 ± 10.5%. The TMS intensities required in this experiment were higher than those required in Experiment I-II using the same TMS coil, though still within approximately one standard deviation. This is likely due to the use of a gel pad, which introduces more distance between the TMS coil and the scalp, thus requiring a higher TMS intensity to evoke the same motor activity.”

Regarding the EMG gain, this did not affect TMS intensities and did not impact the measured neuromodulatory effects. The EMG gain at acquisition is always considered during signal digitization and further analyses.

(1.18) Exp. 4. It would be interesting to provide the changes in MEP amplitudes for those subjects who rated "inaudible" in the self-rating compared to the others. That's an important part of the interpretation: inaudible conditions lead to inhibition, so there is an effect. The auditory confound is not additive to the TUS effect.

Previously, we only provided participant’s ratings of audibility, and showed that conditions that were rated as inaudible more often showed less inhibition, descriptively indicating that inaudible stimulation does not lead to inhibition. This interpretation is in line with our conclusion that the TUS auditory confound acts as a cue signaling the upcoming TMS pulse, thus leading to preparatory inhibition.

We have now included an additional plot and discussion in Supplementary Figure 8 (Subjective Report of TUS Audibility). Here, we show the change in MEP amplitude from baseline for the three continuously masked TUS intensities as in the main manuscript, but now split by participant rating of audibility. Descriptively, less audible sounds result in no marked change or a smaller change in MEP amplitude. This supports our conclusion that direct neuromodulation is not being observed here. When participants were unsure whether they could hear TUS, or when they did hear TUS, more inhibition was observed. However, this is still to a lesser degree than unmasked stimulation which was nearly always audible, and likely also more salient. This also supports our conclusion that these results indicate a role of cue salience rather than direct neuromodulation. Regarding masked conditions where participants were uncertain whether they heard TUS, the sound was likely sufficient to act as a cue, albeit potentially subliminally. After all, preparatory inhibition is not a conscious action undertaken by the participant either. We would also like to note that participants reported perceived audibility after each block, not after each trial, so selfreported audibility was not a fine-grained measurement. The data from Experiment IV suggest that the volume of the cue has an impact on motor inhibition. Taken together with the points mentioned in 1.16, it is not possible to conclude there is evidence for direct neuromodulation in Experiment IV.

(1.19) I suggest to re-order sub panels of the main figures to fit with the chronologic order of appearance in the text. (e.g Figure 1 with (A) Ultrasonic parameters, (B) 3D-printed clamp, (C) Sound-TMS coupling, (D) Experimental condition).

We have restructured the figures in the manuscript to provide more clarity and to have greater alignment with the eLife format.

(2.1) Although auditory confounds during TUS have been demonstrated before, the thorough design of the study will lead to a strong impact in the field.

We thank the reviewer for recognition of the impact of our work. They highlight that auditory confounds during TUS have been demonstrated previously. Indeed, our work builds upon a larger research line on auditory confounds. The current study extends on the confound’s presence by quantifying its impact on motor cortical excitability, but perhaps more importantly by invalidating the most robust and previously replicable findings in humans. Further, this study provides a way forward for the field, highlighting the necessity of (in)active control conditions and tightly matched sham conditions for appropriate inferences in future work. We have amended the abstract to better reflect these points:

“Primarily, this study highlights the substantial shortcomings in accounting for the auditory confound in prior TUS-TMS work where only a flip-over sham control was used. The field must critically reevaluate previous findings given the demonstrated impact of peripheral confounds. Further, rigorous experimental design via (in)active control conditions is required to make substantiated claims in future TUS studies.”

(2.2) A few minor [weaknesses] are that (1) the overview of previous related work, and how frequent audible TUS protocols are in the field, could be a bit clearer/more detailed

We have expanded on previous related work in the revised manuscript:

“Indeed, there is longstanding knowledge of the auditory confound accompanying pulsed TUS (Gavrilov & Tsirulnikov, 2012). However, this confound has only recently garnered attention, prompted by a pair of rodent studies demonstrating indirect auditory activation induced by TUS (Guo et al., 2022; Sato et al., 2018). Similar effects have been observed in humans, where exclusively auditory effects were captured with EEG measures (Braun et al., 2020). These findings are particularly impactful given that nearly all TUS studies employ pulsed protocols, from which the pervasive auditory confound emerges (Johnstone et al., 2021).”

(2.3) The acoustic control stimulus can be described in more detail

We have elaborated upon the masking stimulus for each experiment in the revised manuscript as follows:

Experiment I: “In addition, we also included a sound-only sham condition that resembled the auditory confound. Specifically, we generated a 1000 Hz square wave tone with 0.3 ms long pulses using MATLAB. We then added white noise at a signal-to-noise ratio of 14:1. This stimulus was administered to the participant via bone-conducting headphones.”

Experiment II: “In this experiment, the same 1000 Hz square wave auditory stimulus was used for sound-only sham and auditory masking conditions. This stimulus was administered to the participant over in-ear headphones.”

Experiment III: “Auditory stimuli were either 500 or 700 ms in duration, the latter beginning 100 ms prior to TUS (Supplementary Fig. 3.3). Both durations were presented at two pitches. Using a signal generator (Agilent 33220A, Keysight Technologies), a 12 kHz sine wave tone was administered over speakers positioned to the left of the participant as in Fomenko and colleagues (2020). Additionally, a 1 kHz square wave tone with 0.5 ms long pulses was administered as in Experiments I, II, IV, and prior research (Braun et al., 2020) over noisecancelling earbuds.”

Experiment IV: “We additionally applied stimulation both with and without a continuous auditory masking stimulus that sounded similar to the auditory confound. The stimulus consisted of a 1 kHz square wave with 0.3 ms long pulses. This stimulus was presented through wired bone-conducting headphones (LBYSK Wired Bone ConductionHeadphones). The volume and signal-to-noise ratio of the masking stimulus were increased until the participant could no longer hear TUS, or until the volume became uncomfortable.”

In the revised manuscript we have also open-sourced the audio files used in Experiments I, II, and IV, as well as a recording of the output of the signal generator for Experiment III:

“Auditory stimuli used for sound-sham and/or masking for each experiment are accessible here: https://doi.org/10.5281/zenodo.8374148.”

(2.4) The finding that remaining motor inhibition is observed during acoustically masked trials deserves further discussion.

We agree. Please refer to points 1.16 and 1.18.

(2.5) In several places, the authors state to have "improved" control conditions, yet remain somewhat vague on the kind of controls previous work has used (apart from one paragraph where a similar control site is described). It would be useful to include more details on this specific difference to previous work.

In the revised manuscript, we have clarified the control condition used in prior studies as follows:

Abstract:

“Primarily, this study highlights the substantial shortcomings in accounting for the auditory confound in prior TUS-TMS work where only a flip-over sham control was used.”

Introduction:

“To this end, we substantially improved upon prior TUS-TMS studies implementing solely flip-over sham by including both (in)active control and multiple sound-sham conditions.”

Methods:

“We introduced controls that improve upon the sole use of flip-over sham conditions used in prior work. First, we applied active control TUS to the right-hemispheric face motor area, allowing for the assessment of spatially specific effects while also better mimicking ontarget peripheral confounds. In addition, we also included a sound-only sham condition that closely resembled the auditory confound.”

(2.6) I also wondered how common TUS protocols are that rely on audible frequencies. If they are common, why do the authors think this confound is still relatively unexplored (this is a question out of curiosity). More details on these points might make the paper a bit more accessible to TUS-inexperienced readers.

Regarding the prevalence of the auditory confound, please refer to point 2.2.

Peripheral confounds associated with brain stimulation can have a strong impact on outcome measures, often even overshadowing the intended primary effects. This is well known from electromagnetic stimulation. For example, the click of a TMS pulse can strongly modulate reaction times (Duecker et al., 2013, PlosOne) with effect sizes far beyond that of direct neuromodulation. Unfortunately, this consideration has not yet fully been embraced by the ultrasonic neuromodulation community. This is despite long known auditory effects of TUS (Gavrilov & Tsirulnikov, 2012, Acoustical Physics). It was not until the auditory confound was shown to impact brain activity by Guo et al., and Sato et al., (2018, Neuron) that the field began to attend to this phenomenon. Mohammadjavadi et al., (2019, BrainStim) then showed that neuromodulation persisted even in deaf mice, and importantly, also demonstrated that ramping ultrasound pulses could reduce the auditory brainstem response (ABR). Braun and colleagues (2020, BrainStim) were the first bring attention to the auditory confound in humans, while also discussing masking stimuli. This was followed by a study from Johnstone and colleagues (2021, BrainStim) who did preliminary work assessing both masking and ramping in humans. Recently, Liang et al., (2023) proposed a new form of masking colourfully titled the ‘auditory Mondrian’. Further research into the peripheral confounds associated with TUS is on the way.

However, we agree that the confound remains relatively unexplored, particularly given the substantial impact it can have, as demonstrated in this paper. What is currently lacking is an assessment of the reproducibility of previous work that did not sufficiently consider the auditory confound. The current study constitutes a strong first step to addressing this issue, and indeed shows that results are not reproducible when using control conditions that are superior to flip-over sham, like (in)active control conditions and tightly matched soundsham conditions. This is particularly important given the fundamental nature of this research line, where TUS-TMS studies have played a central role in informing choices for stimulation protocols in subsequent research.

We would speculate that, with TUS opening new frontiers for neuroscientific research, there comes a rush of enthusiasm wherein laying the groundwork for a solid foundation in the field can sometimes be overlooked. Therefore, we hope that this work sends a strong message to the field regarding how strong of an impact peripheral confounds can have, also in prior work. Indeed, at the current stage of the field, we see no justification not to include proper experimental control moving forward. Only when we can dissociate peripheral effects from direct neuromodulatory effects can our enthusiasm for the potential of TUS be warranted.

(2.7) Results, Fig. 2: Why did the authors not directly contrast target TUS and control conditions?

Please refer to point 1.1.

(2.8) The authors observe no dose-response effects of TUS. Does increasing TUS intensity also increase an increase in TUS-produced sounds? If so, should this not also lead to doseresponse effects?

We thank the reviewer for this insightful question. Yes, increasing TUS intensity results in an increased volume of the auditory confound. Under certain circumstances this could lead to ‘dose-response’ effects. In the manuscript, we propose that the auditory confounds acts as a cue for the upcoming TMS pulse, thus resulting in MEP attenuation once the cue is informative (i.e., when TMS timing can be predicted by the auditory confound). In this scenario, volume can be taken as the salience of the cue. When the auditory confound is sufficiently salient, it should cue the upcoming TMS pulse and thus result in a reduction of MEP amplitude.

If we take Experiment II as an example (Figure 3B), the 19.06 W/cm2 stimulation would be louder than the 6.35 W/cm2 intensity. However, as both intensities are audible, they both cue the upcoming TMS pulse. One could speculate that the very slight (nonsignificant) further decrease for 19.06 W/cm2 stimulation could owe to a more salient cueing.

One might notice that MEP attenuation is less strong in Experiment I, even though higher intensities were applied. Directly contrasting intensities from Experiments I and II was not feasible due to differences in transducers and experimental design. From the perspective of sound cueing of the upcoming TMS pulse, the auditory confound cue was less informative in Experiment I than Experiment II, because TUS stimulus durations of both 100 and 500 ms were administered, rather than solely 500 ms durations. This could explain why descriptively less MEP attenuation was observed in Experiment I, where cueing was less consistent.

Perhaps more convincing evidence of a sound-based ‘dose-response’ effect comes from Experiment IV (Figure 4B). Here, we propose that continuous masking reduced the salience of the auditory confound (cue), and thus, less MEP attenuation was be observed. Indeed, we see less MEP change for masked stimulation. For the lowest administered volume during masked stimulation, there was no change in MEP amplitude from baseline. For higher volumes, however, there was a significant inhibition of MEP amplitude, though it was still less attenuation than unmasked stimulation. These results indicate a ‘doseresponse’ effect of volume. When the volume (intensity) of the auditory confound was low enough, it was inaudible over the continuous mask (also as reported by participants), and thus it did not act as a cue for the upcoming TMS pulse, therefore not resulting in motor inhibition. When the volume (intensity) was higher, less participants reported not being able to hear the stimulation, so the cue was to a given extent more salient, and in line with the cueing hypothesis more inhibition was observed.

In summary, because the volume of the auditory confound scales with the intensity of TUS, there may be dose-response effects of the auditory confound volume. Along the border of (in)audibility of the confound, as in masked trials of Experiment IV, we may observe dose-response effects. However, at clearly audible intensities (e.g., Experiment I & II), the size of such an effect would likely be small, as both volumes are sufficiently audible to act as a cue for the upcoming TMS pulse leading to preparatory inhibition.

(2.9) I wonder if the authors could say a bit more on the acoustic control stimulus. Some sound examples would be useful. The authors control for audibility, but does the control sound resemble the one produced by TUS?

Please refer to point 2.3.

(2.10) The authors' claim that the remaining motor inhibition observed during masked trials is due to persistent audibility of TUS relies "only" on participants' descriptions. I think this deserves a bit more discussion. Could this be evidence that there is a TUS effect in addition to the sound effect?

Please refer to points 1.16 and 1.18.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Kop B, Shamli Oghli Y, Grippe TC, Nandi T, Lefkes J, Meijer SW, Farboud S, Engels M, Hamani M, Null M, Radetz A, Hassan U, Darmani G, Chetverikov A, Ouden HEM, Bergmann TO, Chen R, Verhagen L. 2024. Auditory confounds can drive online effects of transcranial ultrasonic stimulation in humans. Radboud Repository. [DOI] [PMC free article] [PubMed]
    2. Kop B. 2022. benjamin-kop/TUS-TMS-Clamp: TUS-TMS-Clamp. Zenodo. [DOI]
    3. Kop B. 2023. benjamin-kop/auditory-masking-stimuli: Auditory masking stimuli used in 'Auditory confounds can drive online effects of transcranial ultrasonics stimulation in humans'. Zenodo. [DOI] [PMC free article] [PubMed]
    4. IT’IS Foundation 2022. Tissue properties database V4.1. IT’IS Foundation. [DOI]

    Supplementary Materials

    MDAR checklist

    Data Availability Statement

    Data and code are available in the Radboud Data Repository: https://doi.org/10.34973/jh6z-yh31 (CC-BY-4.0). The custom TUS-TMS clamp is open-sourced here: https://doi.org/10.5281/zenodo.6517599. The auditory stimuli used for masking and for sound-sham conditions are open-sourced here: https://doi.org/10.5281/zenodo.8374148.

    The following datasets were generated:

    Kop B, Shamli Oghli Y, Grippe TC, Nandi T, Lefkes J, Meijer SW, Farboud S, Engels M, Hamani M, Null M, Radetz A, Hassan U, Darmani G, Chetverikov A, Ouden HEM, Bergmann TO, Chen R, Verhagen L. 2024. Auditory confounds can drive online effects of transcranial ultrasonic stimulation in humans. Radboud Repository.

    Kop B. 2022. benjamin-kop/TUS-TMS-Clamp: TUS-TMS-Clamp. Zenodo.

    Kop B. 2023. benjamin-kop/auditory-masking-stimuli: Auditory masking stimuli used in 'Auditory confounds can drive online effects of transcranial ultrasonics stimulation in humans'. Zenodo.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES