Skip to main content
Springer logoLink to Springer
. 2025 Jun 8;230(6):89. doi: 10.1007/s00429-025-02951-3

Functional and structural plasticity induced by audiovisual associations and sensory experiences

Fazilet Zeynep Yildirim-Keles 1,2, Pinar Demirayak 3,4, Hulusi Kafaligonul 5,6,
PMCID: PMC12146228  PMID: 40483614

Abstract

Crossmodal associations and correspondences play important roles in shaping perception and guiding our actions. However, we still have a limited understanding of association-induced changes in the adult human brain. An important question to address is to what extent passive exposure to crossmodal associations leads to functional and structural changes in the brain. Utilizing an audiovisual association paradigm in which motion stimuli presented in the left visual field were paired with auditory tones, we investigated experience-dependent crossmodal plasticity by examining resting-state functional connectivity and cortical thickness. Following the association phase, we observed widespread increases in functional connectivity between the trained region (e.g., right V1, V2, and V3) and higher-order frontal, temporal, and occipital areas. In contrast, functional connectivity changes in the untrained region (e.g., left V1, V2, and V3) were markedly reduced, indicating that the observed plasticity was largely specific to the cortical representation of the stimulated visual field. Consistent with the functional results, cortical thickness analysis revealed a clear hemispheric asymmetry, with significant changes observed exclusively in the right hemisphere. Moreover, both the functional and structural assessments showed widespread changes in high-level association cortices, whereas changes in low-level sensory areas were less pronounced. The overlap between functional and structural measures suggests that enhanced connectivity aligns with structural changes due to crossmodal associations. These results illustrate that associations formed without explicit training or feedback can lead to functional and structural changes in the adult human brain, providing important implications for perception and neural plasticity in daily life situations.

Supplementary Information

The online version contains supplementary material available at 10.1007/s00429-025-02951-3.

Keywords: Crossmodal associations, Motion perception, Sensory experiences, Resting-state, Functional connectivity, Cortical thickness, Neural plasticity

Introduction

Associations play a crucial role in predicting and interpreting ambiguous sensory inputs, shaping our perception, and guiding our actions (Albright 2012). The process of associative learning involves establishing semantic connections between stimuli or behaviors based on their co-occurrence or similarity in outcomes. This type of learning can occur through classical conditioning, where a neutral stimulus becomes associated with a meaningful one, or operant conditioning, where behaviors are strengthened or weakened based on consequences (Hebb 1949; James 1890; Konorski 1967). Understanding these mechanisms is essential as they facilitate the acquisition and retention of new information, enabling organisms to effectively adapt sensory regularities and correspondences in external environment. Moreover, associations are tightly linked with neural plasticity, contributing to changes in neural connectivity and function (Albright 2012; Karim et al. 2021; McGann 2015).

The daily life experiences typically involve concurrent presentation of different sensory inputs, leading to systematic associations and correspondences between features encoded in different sensory modalities (Di Stefano and Spence 2023; Spence 2011). Consistent with this ecological observation, previous studies have shown that crossmodal associations can even be formed without explicit training via passive exposure to multisensory stimuli. For instance, Hidaka et al. (2011) showed that a rapid association (without any conditional reinforcement) between static tones and visual motion directions could be easily formed, and then the tones were able to drive perceived direction of visual motion (see also, Teramoto et al. 2010). Moreover, these audiovisual associations have been found to be selective for certain stimulus features, suggesting that they occur at the perceptual level rather than any decision level. Using motion types engaging pre-attentive visual processing, Kafaligonul and Oluk (2015) further revealed that these audiovisual associations and their effects on visual motion perception exist in absence of higher-order attentive tracking mechanisms, highlighting the automatic pre-attentive nature of sensory associations. Together with previous neurophysiological evidence on different visual features (Messinger et al. 2001; Schlack and Albright 2007), these psychophysical findings illustrate that associations are inherent nature of sensory and perceptual processing and may take place at different cortical areas. Despite these insights, we still have a limited understanding of how these audiovisual associations lead to functional and structural changes in the brain.

In this context, cortical thickness and resting-state functional connectivity measurements have been instrumental in elucidating experience-dependent structural and functional changes in the brain. Cortical thickness studies have shown that training in various tasks (Ditye et al. 2013; Draganski et al. 2004; Haier et al. 2009; Legault et al. 2019; Metzler-Baddeley et al. 2016; Wenger et al. 2012; Worschech et al. 2022) or the degree of visual experience (e.g., central vision loss, macular degeneration, congenital blindness) (Burge et al. 2016; Defenderfer et al. 2023; Park et al. 2009; Prins et al. 2016) can induce alterations in cortical thickness in the human brain. Specifically, prolonged engagement in perceptual, motor, and cognitive tasks has been linked to increases in cortical thickness in areas responsible for the processing of relevant information (e.g., Ditye et al. 2013; Draganski et al. 2004; Haier et al. 2009; Legault et al. 2019; Worschech et al. 2022). Previous studies also indicated that increased cortical thickness is positively correlated with the changes in behavioral performance (Ditye et al. 2013; Gilaie-Dotan et al. 2013; Kanai et al. 2010; for a review see Kanai and Rees 2011) and cortical thickness measures can even predict subsequent rate of perceptual learning in visual tasks (Frank et al. 2016). These studies underscore the well-established link between the brain's structural plasticity and learning across various domains. However, to date, there is no systematic investigation on structural changes elicited by exposure-based short-term audiovisual associations.

Resting-state functional connectivity has also been widely applied to track functional changes in response to experience-dependent plasticity and learning. Functional connectivity at rest provides a valuable index of the plasticity across brain regions involved in sensory processing (Guerra-Carrillo et al. 2014). Notably, this is supported by studies including motion perception (e.g., Sarabi et al. 2018; Urner et al. 2013) or using dynamic audiovisual paradigms in training (e.g., Powers et al. 2012). There is also evidence that even passive exposure to sensory stimuli can lead to changes in resting-state functional connectivity between auditory and visual cortices (Eckert et al. 2008; Zangenehpour and Zatorre 2010). Zangenehpoura and Zatorre (2010)) proposed that modulations of functional connectivity and activation shifts across sensory cortices may be driven by associations passively formed between auditory and visual stimuli. This hypothesis aligns with neuroanatomical studies revealing direct connections between primary auditory and primary visual cortices (e.g., Beer et al. 2011a, b; Clavagnier et al. 2004; Falchier et al. 2002; Garner and Keller 2022). A brief association phase to link static tones with motion direction may strengthen the crosstalk between auditory and visual cortical areas, leading to observed changes in motion perception. On the other side, previous research has also indicated that sounds can evoke imagery and activate the visual cortex through feedback from non-retinal resources (Petro et al. 2014; Vetter et al. 2014). This possibility highlights the strengthening of top-down feedback connections from multisensory association and/or non-sensory areas to visual cortices. Hence, the audiovisual associations may be mediated by changes in either feedforward crosstalk between primary sensory cortices or feedback connections between sensory and multisensory areas or higher order association areas. It is important to note that these possibilities are not mutually exclusive, and changes in both types of connections may jointly contribute to neural plasticity and the observed alterations in motion perception.

In the current study, we aimed to investigate the neuroplasticity associated with short-term audiovisual associations, focusing on both functional connectivity and cortical thickness. Building upon the established role of associations in shaping perceptual processes, our hypothesis posited that forming rapid associations between static tones and visual motion directions would lead to measurable alterations in both resting-state functional connectivity and cortical thickness within relevant brain regions. Unlike typical learning paradigms that require explicit training and feedback, we focused on association effects through passive exposure, allowing us to examine how crossmodal associations elicits plasticity without reinforcement. We utilized the random-dot motion paradigm originally developed by Hidaka et al. (2011), indicating that associations between the direction of coherent random dots and static tones can lead to strong changes in perceived direction of visual motion. In this paradigm, observers performed a perceptual task on visual motion while passively listening to sounds, with vision serving as the primary modality and audition as the secondary. The behavioral findings suggest that the information (i.e., the association acquired through passive exposure) provided by audition (secondary task-irrelevant stimulation) interferes and interacts with the motion processing which is primarily carried out by the visual system. Moreover, previous research has shown that this type of motion (i.e., random dots) and stimulation mainly engage early and mid-level levels of cortical processing (Britten et al. 1992; Ho and Giaschi 2009; Rina et al. 2022). In accordance with these findings, our primary neuroimaging focus was on occipital (V1–V3) and middle temporal (MT+/V5) areas. Specifically, we anticipated that the association established between the direction of visual motion and static tones would enhance the crosstalk between these visual areas and auditory cortices/audiovisual areas, as suggested by previous neuroanatomical and functional connectivity studies. Moreover, we expected connectivity changes to reflect enhanced neural integration and potentially increased cortical thickness in areas responsible for sensory processing. By exploring these dynamics, we aimed to contribute to a deeper understanding of how passive exposure to multisensory stimulation and crossmodal associations modulate brain structure and function, particularly in the context of audiovisual processing.

Materials and methods

Similar to previous research (Hidaka et al. 2011; Kafaligonul and Oluk 2015), our experiment consisted of different phases conducted over the course of several days (Fig. 1a). Participants first took part in a behavioral measurement, followed by an fMRI session. In the following association phase, each participant completed a behavioral association session for 2–3 days. After each association phase, a behavioral assessment was also performed. For the last association phase, participants returned to the laboratory for a final behavioral association session and assessment, followed by a second fMRI session. Participants either underwent 2 or 3 association and behavioral sessions, as some showed the association effect by the second day, and others by the third day. Further information about each step and phase are provided in the following subsections.

Fig. 1.

Fig. 1

Experimental design A Illustration of experimental procedures taking place over the course of several days. Note that the last session of association (exposure and behavioral testing) took place on the last day. B Schematic depiction of the stimulation and timeline of events during a trial of behavioral measurement/assessment. Different levels of motion coherence and audiovisual conditions were used. C Stimulus representation during an association session. Each motion direction (100% motion coherence) was paired with static sounds of distinct frequencies. In this representation, the low and high frequency tones were paired with upward and downward motion directions, respectively

Participants

Fourteen participants took part in the study. The data from one participant were excluded due to technical problems during data collection. The data from the remaining 13 participants (nine females, age range: 21–27 years) were included in data analyses. The final sample size was comparable with functional neuroimaging studies including both audiovisual perceptual learning and resting-state connectivity analysis (e.g., Powers et al. 2012). All participants had normal or corrected-to-normal vision, and no history of neurological and psychiatric disorders by self-report. Before their participation, they were informed about experimental procedures and signed a consent form. All procedures were in accordance with the international guidelines (World Medical Association, 2013) and was approved by the local ethics committee at Faculty of Medicine, Ankara University. The participants were compensated for their time with monetary payment upon completing all steps of the study.

Behavioral measurements and associations

Apparatus and stimuli

MATLAB version 7.12 (The MathWorks, Natick, MA) with PsychToolbox 3.0 extension (Brainard 1997; Pelli 1997) was used for stimulus generation, timing and behavioral data acquisition. Visual stimuli were presented on a 21"LCD computer monitor (NEC MultiSync 2190UXp; resolution: 1600 × 1200 pixels; refresh rate: 60 Hz). Auditory stimuli were introduced through a headset (Sennheiser HD 518). Participants binocularly viewed the monitor from a distance of approximately 57 cm, and were supported by a chin- and head-rest. Calibration of visual and auditory stimuli was carried out using a SpectroCAL photometer (Cambridge Research Systems, Rochester, Kent, UK) and a sound-level meter (SL-4010, Lutron Electronics, Taipei, TW). Behavioral responses were collected with a standard computer keyboard. The behavioral measurements and audiovisual association sessions were carried out in a dark and silent room.

A small red circle (0.2° diameter) at the center of the display served as a fixation target on a black background (0.27 cd/m2). Visual stimuli were random dot kinematograms (RDKs) consisting of gray dots (diameter = 0.2°, 75.80 cd/m2). The dots were presented at a fixed density of 3.5 dots/deg2, within an invisible 5° diameter circular aperture (Fig. 1b, c). The center of circular aperture was on the left visual field and 5° away from the fixation target. Visual stimuli were presented in the parafoveal and peripheral regions (2.5°–7.5° of visual angle) to stimulate motion-sensitive areas without requiring central fixation shifts. This approach targets naturally motion-sensitive regions in the parafoveal visual field (Smith et al. 1998), while also minimizing the risk of eye movements by keeping the stimuli within a fixed, stable area. Given that the stimuli were presented within a well-defined and stable visual field region (2.5° to 7.5°), and participants were instructed to maintain fixation on a centrally located target, any minor shifts in gaze would not significantly affect the results. A white noise algorithm was used to control the coherence of moving dots and determine the position of each dot in a random-dot frame (Britten et al. 1992; Pilly and Seitz 2009). According to this algorithm, the coherent motion was generated with a random selection of a percentage of dots in each random-dot frame to be replotted at a shifted location in a specific direction (upward or downward motion with a 5°/s speed). On the other hand, the remaining noise dots were shifted to a random location and hence, the noise dots had random speed and motion direction. It is important to note that the lifetime of coherent dots was probabilistic outcome of their proportion (i.e., motion coherence) in this algorithm. The lifetimes of coherent dots become short as their percentage decreases. However, the lifetimes get longer when the proportion of coherent dots increases. In behavioral test measurements, the motion coherence was varied by changing the percentage of dots (see Task and Procedure). For the association sessions, we used 100% coherent dots condition and all dots moved in a single direction and stayed on the display during the presentation of motion. A static tone burst (sampling rate: 44.1 kHz) was used as auditory stimulus. The tone burst was a rectangular windowed sinusoidal with either a low (500 Hz) or high (2 kHz) frequency, and delivered at a sound pressure level of 83 dB.

Task and procedure

The behavioral part of the study was conducted in three phases: pre-association test, association, and post-association test (Fig. 1a). In the pre- and post-association test phases, the task and procedure were exactly the same and the participants engaged in a motion direction discrimination task. On each trial, the fixation point was presented first for a variable duration (900–1000 ms). Then, the visual motion, along with auditory tones if introduced, was presented for 1 s. After the stimulus offset, participants were required to indicate whether the global dot motion was upwards or downwards by pressing up or down arrow keys on the keyboard, respectively (Fig. 1b). The participants were specifically instructed not to track any individual dot motion, but to maintain central fixation and attend to both visual motion and auditory stimuli while estimating the overall direction of global motion. For each trial, the motion coherence was pseudo-randomly selected from six different levels (5%, 15%, 30%, 45%, 60%, or 90%). The behavioral test phase included three audiovisual conditions (i.e., visual motion with low frequency tone, visual motion with high frequency tone, visual-only/no tone), six motion coherence levels and two motion directions (upward or downward). Each participant completed two blocks with 216 trials (2 blocks × 216 trials), leading to 12 trials per condition at the end of a behavioral measurement. The order of conditions was randomized within each block.

In the association phase, the timing of stimulus presentation was the same (Fig. 1c). However, instead of visual motion with varying coherence levels, only a 100% coherent motion was presented with concurrent static tones. New random dot displays were generated for each presentation. The direction of motion was paired with a tone of distinct frequency. For seven of the participants, the low-frequency tone was paired with an upward motion and the high-frequency tone with a downward motion and vice versa for the remaining six. For convenience, we refer to these audiovisual pairings according to the motion direction (e.g., upward tone: a tone paired with upward motion, downward tone: a tone paired with downward motion). Participants were instructed to attend to both visual motion and tones while fixating on the red circle at the center of screen. Each association session included 200 presentations (100 for each audiovisual pair), and the order of presentations was randomized within a session. A session lasted for a total of eight minutes. Right after an association session, a follow-up behavioral measurement was performed as described above. Participants did not receive feedback following their responses. This lack of feedback was a deliberate choice, as our aim was to evaluate the effects of audiovisual associations and sensory experiences without explicit reinforcement.

Behavioral data analysis

Behavioral data analysis was performed in R (v3.6.3) (R Core Team, 2021). Percentage of upward motion responses were calculated separately for each participant and experimental condition (i.e., pre- and post-association; visual-only, upward, and downward tones) and fitted with a cumulative normal function using the R package quickpsy (Linares and López-Moliner 2016) to compute coherence thresholds (i.e., the point of subjective equality; PSE) and slope values. The threshold and slope shifts for each test session (pre- and post-) of a participant were calculated by subtracting the threshold and slope values of the visual-only condition from those of audiovisual conditions (i.e., upward tone and downward tone). To evaluate the effect of audiovisual association training on perceived direction and discrimination, thresholds and slope shifts were separately analyzed via repeated-measures ANOVA with two main factors: test phase (pre- vs. post-association) and tone (upward vs. downward tone). Post-hoc t-tests were performed to further understand the source of main effects and interactions, and all p-values were FDR (false discovery rate) corrected (Benjamini and Hochberg 1995). Contrasts with p < 0.05 were considered as significant. Partial eta squared (ηp2) was calculated to measure the effect size, where a value of 0.01 was low, 0.04 moderate, and 0.1 high (Richardson 2011). To evaluate the effect of audiovisual association training on perceived direction and discrimination in the visual-only condition, PSE and slope values for the visual-only condition were contrasted between pre- vs post-association phases using t tests. Cohen’s d (d) was calculated to measure the effect size, where a value of 0.2 was low, 0.5 moderate, and 0.8 high.

Magnetic resonance imaging

To reveal association induced changes in the human brain, MRI recordings were carried out before and after the behavioral association phase. MRI procedures were identical for pre- and post-association phases (Fig. 1a).

Data acquisition

All images were acquired by using a 3 T Siemens Magnetom Trio scanner with a 12-channel head-coil at the National Magnetic Resonance Research Center (UMRAM) at Bilkent University. To minimize head movements, a vacuum cushion and cushioned head stabilizers that reached cheeks from the sides of the head coil, were used. MR compatible headphones were utilized to eliminate potential sound artifacts that resulted from the MRI machine throughout data acquisition. High-resolution 3D anatomical scans were obtained with the following parameters: T1-weighted single turbo FLASH, TR/TE = 7982/3.68 ms, 1 mm isotropic voxels, FOV = 256 × 256 mm2, matrix size = 256 × 256 × 176. Resting-state functional scans were acquired in total darkness, during which observers were instructed to close their eyes, relax but not to sleep. The hemodynamic responses were acquired with a T2-weighted standard EPI sequence (TR/TE = 2000/35 ms, 3 mm isotropic voxels, slice thickness: 3 mm, FOV = 240 × 131.5 mm2, matrix size = 80 × 78 × 28), resulting in 360 volumes per scan. Each resting-state functional scan took 12 min.

Data preprocessing

Anatomical data T1-weighted (T1w) images were first corrected for intensity non-uniformity using Bias Field Correction Tool in statistical parametric mapping (SPM) version 12 (RRID: SCR_007037; Bazay et al. 2022). Then, they were processed using a recon-all processing stream which includes motion correction, skull-stripping, registration, segmentation, smoothing, and parcellation mapping. Standard retinotopic atlases to the individual’s FreeSurfer cortical reconstruction (RRID: SCR_001847) were created by using Anaconda 3, version 2020.11 (Benson et al. 2014; Benson and Winawer 2018).

Functional data fMRI data were preprocessed with CONN toolbox, version 17.f (RRID: SCR_009550; Whitfield-Gabrieli and Nieto-Castanon 2012) and SPM version 12 (RRID: SCR_007037; Bazay et al. 2022) using the standard preprocessing pipeline. Data were functionally realigned and unwrapped; slice-time corrected; structurally and functionally normalized and segmented into gray matter, white matter, and CSF tissue; flagged as potential outliers if framewise displacement exceeded 0.9 mm or if global BOLD signal changes were above five standard deviations; and smoothed, using a 6 mm Gaussian kernel (Whitfield-Gabrieli and Nieto-Castanon 2012). Subject-motion threshold was set to 1 mm, and both functional and structural data were resampled with 2 mm and 1 mm voxels, respectively. Artifacts were detected and entered into a linear regression as potential confounding effects (i.e., white matter, CSF, realignment, scrubbing, and the effect of rest) in order to remove any influence on the BOLD signal. After linear de-trending was performed, images were then band-pass filtered between 0.008–0.09 Hz and motion regressed to minimize the effect of motion and noise sources, as CONN removes temporal frequencies below or above these values by default in order to focus on slow-frequency fluctuations (Nieto-Castanon 2020).

Region of interest definitions

Regions of interests for functional connectivity analysis were defined in the bilateral visual field over early visual areas (V1–V3) based on Benson atlases (Benson et al. 2014) (Fig. 2a). A circular region of interest with 2.5 degrees of visual angle radius centered at 5° on the horizontal axis, corresponding to the trained and stimulated area in the left visual field (Fig. 2b), was identified for each participant and mapped onto the cortex by using cortical mapping method (Defenderfer et al. 2021) (Fig. 2c). Similarly, a circular region of interest with 2.5 degrees of visual angle radius centered at 5° on the horizontal axis, corresponding to the untrained area in the right visual field, was identified for each participant and mapped onto the cortex by using the same cortical mapping method (Defenderfer et al. 2021) (Fig. 2c). Trained and untrained areas in the left and right visual fields were first mapped on individuals’ native space and then converted into volumetric space by using the FSL FLIRT function. Cortical correspondence of the trained and untrained areas were named as cTrained Retinal Locus and cControl Retinal Locus, respectively. To be able to compare our results at the group level, we resampled ROIs (cTrained and cControl Retinal Loci) in MNI 2 mm standard space. Additionally, the bilateral MT+/V5 areas were defined based on Jülich histological atlas (Amunts et al. 2020) (Fig. 2c). Since the retinocortical mapping method (Defenderfer et al. 2021) applies only to early visual regions (V1–V3), the entire MT+/V5 region was used. Hence, cTrained and cControl Retinal Loci and bilateral MT+/V5 areas were used as seed ROIs in the functional connectivity analysis to investigate the connectivity changes with every other voxel in the brain, and in cortical thickness analysis to investigate the structural changes between pre- and post-association sessions.

Fig. 2.

Fig. 2

Identification of region of interests and their cortical projections. A Illustration of the anatomic atlases of eccentricity, polar angle and sigma mapping in early visual areas. Eccentricity is represented across the cortex with central vision at the occipital pole. B Schematic representation of the region of interests. During the behavioral assessments and association sessions, moving random dots were presented in the left visual field. Visual stimuli had a 2.5° radius and located 5° away from the fixation dot in the horizontal axis. C Cortical mappings of the regions of interest. Based on the anatomic atlases, we defined the cortical projections of the region of interests on the right and left visual fields in early visual areas (V1–V3) which corresponds to cControl and cTrained Retinal Loci at the left and right hemispheres, respectively. Bilateral MT+/V5 area was also defined as a region of interest

Functional connectivity analysis

Functional connectivity measures were computed between a seed ROI and every other voxel in the brain (i.e., seed-to-voxel functional connectivity) by using the CONN toolbox v17.f (RRID: SCR_009550; Whitfield-Gabrieli and Nieto-Castanon 2012) to evaluate the effects of audiovisual association training. Eight major seed ROIs (cTrained Retinal Loci at V1, V2, and V3, cControl Retinal Loci at V1, V2, and V3, left MT +/V5, and right MT+/V5) were used to compute functional connectivity with every other voxel in the brain. cTrained and cControl Retinal Loci, and bilateral MT+/V5 were imported into the CONN toolbox for analysis. Based on the results obtained in the first-level analysis, a paired t-test was performed with a threshold set at p < 0.05 FDR-corrected to determine significantly different functional connections between pre-association and post-association assessments. A two-tailed t-test was used to compare post- and pre-association resting state scans to identify the effects of association training on functional connectivity. This allows for the detection of both increases and decreases in functional connectivity following training.

Cortical thickness assessment

Cortical thickness data was derived from T1-weighted MRI images. Cortical thickness was computed for 180 bilateral Glasser atlas regions (Glasser et al. 2016) and the cTrained (at V1, V2, V3) and cControl (at V1, V2, and V3) Retinal Loci using FreeSurfer 6.0.0's standard, automated cortical reconstruction pipeline on a Linux-based computing cluster. This resulted in 366 regions in total (i.e., 180 bilateral Glasser atlas regions, three cTrained Retinal Loci, and three cControl Retinal Loci). The processing steps include skull-stripping (Ségonne et al. 2004), Talairach transformation, subcortical gray and white matter segmentation (Fischl et al. 2002), intensity normalization (Sled et al. 1998), gray and white matter tessellation, topology correction (Fischl et al. 2001), and intensity gradient-based surface deformation to generate gray vs. white and gray vs. cerebrospinal fluid surface models (Dale et al. 1999; Fischl et al. 2001). The resulting surface models were then inflated and registered to a spherical surface atlas, allowing parcellation of cortical regions of interest. Finally, regional cortical thicknesses were computed by taking the mean of white-pial distance at all vertices within each parcellated region (Fischl and Dale 2000). Cortical thickness measurements within the Glasser atlas regions were extracted from anatomical scans that were acquired in both pre- and post-association assessments. Then, they were normalized by dividing to mean cortical thickness value for each participant’s pre- and post-association assessments.

Statistical analysis

Statistical analyses were performed using SPSS v29.0.2.0 (IBM SPSS Statistics, Armonk, NY, USA). Paired samples t-tests were used to compare within-subject effects of the audiovisual association training for cortical thickness and functional connectivity alterations. FDR procedure was applied to correct for false-positive inflation at multiple comparisons and FDR-corrected p values were used as a significance level to test our hypotheses. Paired sample t-test results were limited to FDR-corrected p < 0.05 values for both cortical thickness and functional connectivity comparisons.

Results

We assessed the impact of audiovisual associations by analyzing the data collected before and after the association phase. Through a within-subject experimental design, we examined changes in behavior, cortical thickness, and functional connectivity to uncover the structural and functional plasticity resulting from multisensory experience.

Participants could successfully associate visual and auditory stimuli

The effects of audiovisual associations on behavioral reports based on visual motion direction are shown in Fig. 3. The behavioral assessment on the first day was considered pre-association data, while the behavioral measurement right after the last association session of each participant were used as post-association data. The behavioral measurements on each day of the association phase are also provided in Figure S1 for each participant. Psychometric curves of visual-only, upward tone, and downward tone conditions as a function of motion coherence level are plotted for pre- and post-association test phases (Fig. 3a). Participants were not able to discriminate the direction of random dot motion when the coherence level was low. However, as the proportion of coherent dots increased (i.e., motion coherence level), participants reported motion direction in alignment with the dots'physical displacement, leading to improved performance, as depicted in Fig. 3a. In the pre-association test phase, the psychometric curves for the visual-only, upward tone, and downward tone conditions overlap, showing that static tones did not cause any change in the psychometric curves at this phase. After the paired presentation of motion directions and static tones during the association phase, however, the static tones significantly biased the reported direction of dot motion in favor of the exposed audiovisual pairing. In the post-association test phase, there was a noticeable shift in the curves: the upward and downward tone conditions shifted to the left and right, respectively, while the visual-only condition was relatively centered.

Fig. 3.

Fig. 3

The effects of audiovisual associations on behavioral performance. A Group-averaged psychometric functions for pre- and post-association test phases. B The point of subjective equality (PSE) shifts as a function of the test phase, calculated by subtracting the PSE of visual-only condition from those of motion-sound conditions (i.e., upward and downward tone). C The slope shifts as a function of the test phase, calculated by subtracting the slope of visual-only condition from those of motion-sound conditions. Error bars denote the ± SEM

The analysis on the PSE differences between the visual-only and motion-sound conditions showed that both the main effect of tone condition (F(1, 12) = 12.80, p = 0.004, ηp2 = 0.52) and the interaction between the tone and test phase (F(1, 12) = 7.65, p = 0.017, ηp2= 0.39) were significant whereas the main effect of test phase was not significant (F(1, 12) = 2.63, p = 0.13, ηp2 = 0.18) (Fig. 3b). Post hoc pairwise comparisons conducted on the interaction effect revealed a significant difference between upward and downward tone conditions for the post-test phase (MD = 0.23, SE = 0.047, t(12) = 3.44, p = 0.005). However, no significant difference was found between upward and downward tone conditions for the pre-test phase (MD = 0.02, SE = 0.051, t(12) = 0.74, p = 0.47). These results show that audiovisual association training effectively changed how participants reported the motion direction: the tone that was paired with upward motion (i.e., upward tone) led to an increase in reports of upward motion perception whereas the tone that was paired with downward motion (i.e., downward tone) led to an increase in reports of downward motion perception. The pairing of static tones with visual motion during the association phase could also potentially alter subsequent direction discrimination thresholds. Specifically, in the post-association test phase, it is also possible that static tones might significantly change direction discrimination performance compared to the visual-only condition. We explored this possibility by examining changes in slope values as depicted in Fig. 3c. However, a two-way repeated measures ANOVA (with test phase and tone condition as factors) did not yield any significant main effects (test phase: F(1, 12) = 0.13, p = 0.73, ηp2 = 0.011, tone condition: F(1, 12) = 1.12, p = 0.31, ηp2 = 0.086) or a two-way interaction (F(1, 12) = 0.092, p = 0.77, ηp2 = 0.008). Overall, our findings did not indicate consistent changes in slope values specific to tone conditions or improvements/decline in direction discrimination performance due to audiovisual associations.

To assess whether association learning or/and behavioral training related changes occurred in trials without static sounds (i.e., visual-only), PSE and slope values for the visual only condition were contrasted between pre- vs post-association phases (Figure S2). A paired t-test comparing PSE values in visual-only condition between the pre- and post-association phases found a significant decrease in PSE values from pre- to post-association, MD = 0.076, t(12) = 2.74, p = 0.018, d = 0.76, 95% CI [0.016, 0.14]. A paired t-test for slope values in visual only condition yielded no significant difference between the pre- and post-association phases, MD = 0.06, t(12) = 1.68, p = 0.12, d = 0.47, 95% CI [− 0.018, 0.14]. These findings indicate that in the visual-only condition, observers were not biased in favor of the exposed associations, and their motion discrimination sensitivity did not improve from pre- to post-association phase.

Functional connectivity changed between early visual regions and association areas

Seed-to-voxel functional connectivity analyses examined the connectivity of cortical correspondence of both trained and control seed areas in early visual areas (V1, V2, V3) with the whole brain (Table 1. and Fig. 4). The analysis revealed significant changes mostly in post > pre direction, with connectivity increasing from the pre- to post-association phase whereas post < pre changes, with connectivity decreasing from the pre- to post-association phase, were few. More importantly, functional connectivity changes were notably larger for cTrained Retinal Loci in the right hemisphere than for cControl Retinal Loci in the left hemisphere in all early visual areas (V1, V2, V3).

Table 1.

Functional connectivity changes between seed regions and whole-brain voxels following association

Seed region Cortical areas MNI coordinates Cluster size t(12) =  ± 3.05, p values
Post > pre
cTRL in V1, RH Occipital pole, RH  + 10 − 98 − 2 259 0.002
Occipital pole, LH − 30 − 92 − 18 246 0.003
Inferior frontal gyrus, pars opercularis, LH − 50 + 8 + 16 244 0.004
Superior frontal gyrus, RH  + 10 + 38 + 44 137 0.026
cCRL in V1, LH Occipital pole, RH  + 10 − 98 − 2 181 0.019
cTRL in V2, RH Middle temporal gyrus posterior division, LH − 58 + 22 + 4 1686 < 0.001
Temporal pole, RH  + 48 + 14 − 20 844 < 0.001
Lateral occipital cortex superior division, LH − 30 − 80 − 4 689 < 0.001
Lateral occipital cortex superior division, RH  + 28 − 76 + 10 604 < 0.001
Inferior frontal gyrus, LH − 42 − 46 − 16 351 0.0002
Cingulate gyrus posterior division − 2 − 20 + 28 154 0.003
Precuneus cortex, LH − 8 − 64 + 36 188 0.003
Frontal pole, LH  + 6 + 68 − 18 130 0.022
cCRL in V2, LH Lateral occipital cortex, superior division, LH − 26 − 84 + 4 239 0.002
Middle temporal gyrus, RH  + 46 − 10 − 24 154 0.019
Frontal orbital cortex, RH  + 40 + 32 − 16 146 0.019
cTRL in V3, RH Middle temporal gyrus, LH − 59 + 24 + 18 3694 < 0.001
Lateral occipital cortex RH  + 34 − 72 + 12 1552 < 0.001
Frontal pole, LH − 10 + 44 + 44 504 < 0.001
Temporal pole, RH  + 48 + 14 − 20 394 < 0.001
Frontal pole, RH  + 52 + 44 − 6 374 < 0.001
Frontal pole, LH − 10 + 64 − 18 179 0.004
Temporal occipital fusiform, RH  + 32 − 40 − 22 130 0.019
Inferior frontal gyrus, RH  + 62 + 18 + 24 125 0.019
cCRL in V3, LH Frontal orbital cortex, LH − 48 + 48 − 12 920 < 0.001
Lateral occipital cortex superior division, LH − 30 − 80 + 14 698 < 0.001
Lateral occipital cortex superior division, RH  + 36 − 66 + 18 591 < 0.001
Frontal pole, RH  + 44 + 32 − 8 520 < 0.001
Middle temporal gyrus anterior division, RH  + 62 − 6 − 16 243 0.0006
Angular gyrus, LH − 44 − 50 + 26 155 0.009
Frontal pole, LH − 8 + 44 + 48 116 0.030
MT +/V5, RH Postcentral gyrus, LH − 40 − 38 + 56 197 0.005
Postcentral Gyrus, RH  + 26 − 20 + 42 180 0.005
MT +/V5, LH Lateral occipital cortex, RH  + 38 − 74 + 6 547 < 0.001
Superior temporal gyrus, RH  + 70 − 26 + 16 142 0.015
Occipital pole, LH − 6 − 104 + 12 141 0.015
post < pre
cTRL in V3, RH Cingulate gyrus − 2 − 20 + 30 354 < 0.001
Precuneus cortex − 8 − 66 + 36 146 0.011
cCRL in V3, LH Frontal Pole, LH − 34 + 56 + 16 116 0.030

Fig. 4.

Fig. 4

Functional connectivity changes between the cortical correspondence of control (cControl Retinal Loci) and trained (cTrained Retinal Loci) seed areas in early visual areas (V1, V2, V3) and the whole brain (AF). Seed-to-voxel functional connectivity changes occurred across a wider region for cTrained Retinal Loci than cControl Retinal loci. Paired sample t-test results were limited to FDR-corrected p < 0.05. T statistics results were projected on standard space cortical surface. Dark red-bright yellow color bar scale (in panels AD) indicates t statistics between 2.3 and 5. White-bright yellow color bar scale (in panels EF) indicates t statistics between −5 and 5. Brighter colors represent higher t values. Clusters that are difficult to see have been enlarged and displayed below the main brain images for better visibility

In V1, functional connectivity changes were notably larger for cTrained Retinal Locus than for cControl Retinal Locus. Specifically, cTrained Retinal Locus in right V1 exhibited stronger and more widespread connectivity increases, involving the right and left occipital poles, the left inferior frontal gyrus (pars opercularis), and the right superior frontal gyrus (Fig. 4b). In contrast, cControl Retinal Locus in left V1 showed a more limited increase, with significant connectivity enhancement observed only with the right occipital pole with a smaller region (Fig. 4a).

In V2, functional connectivity changes were larger for cTrained Retinal Locus than for cControl Retinal Locus. cTrained Retinal Locus in right V2 displayed more widespread connectivity increases, particularly with the left middle temporal gyrus (posterior division), right temporal pole, bilateral lateral occipital cortex (superior division), left inferior frontal gyrus, posterior cingulate gyrus, left precuneus, and the left frontal pole (Fig. 4d). On the other hand, cControl Retinal Locus in left V2 exhibited more restricted connectivity changes, showing significant increases with the left lateral occipital cortex (superior division), right middle temporal gyrus, and the right frontal orbital cortex (Fig. 4c).

In V3, connectivity increases again favored the trained region, with cTrained Retinal Locus in right V3 showing widespread connectivity enhancements involving the left middle temporal gyrus, right lateral occipital cortex, bilateral frontal pole, right temporal pole, right temporal occipital fusiform gyrus, and the right inferior frontal gyrus (Fig. 4f). In contrast, cControl Retinal Locus in left V3 showed fewer and less extensive connectivity increases, with enhanced connectivity observed in the left frontal orbital cortex, bilateral lateral occipital cortex (superior division), bilateral frontal pole, right middle temporal gyrus (anterior division), and left angular gyrus (Fig. 4e). In V3, connectivity decreases occurred as well. cTrained Retinal Locus in right V3 showed reduced connectivity with cingulate gyrus and precuneus cortex (Fig. 4f) while cControl Retinal Locus in left V3 showed reduced connectivity with a cluster in the left frontal pole (Fig. 4e).

Seed-to-voxel functional connectivity analyses were also conducted between bilateral MT+/V5 areas and the whole brain (Table 1. and Fig. 5). Functional connectivity between right MT+/V5 area and bilateral postcentral gyrus was increased in post-association compared to pre-association (Fig. 5b). In addition, functional connectivity between left MT+/V5 area and right lateral occipital cortex, right superior temporal gyrus, and left occipital pole was increased in post-association compared to pre-association (Fig. 5a).

Fig. 5.

Fig. 5

Functional connectivity changes between bilateral MT+/V5 and the whole brain. A Functional connectivity increased between the seed left MT +/V5 and the right lateral occipital cortex, right superior temporal gyrus, and left occipital pole. B Functional connectivity increased between the seed right MT+/V5 and bilateral postcentral gyrus. Paired sample t-test results were limited to FDR-corrected p < 0.05. T statistics results were projected on standard space cortical surface. Dark red-bright yellow color bar scale indicates t statistics between 2.3 and 5. Brighter colors represent higher t values. Clusters that are difficult to see have been enlarged and displayed below the main brain images for better visibility

Cortical thickness alterations after association phase

Cortical thickness changes occurred exclusively in the right hemisphere, with both increases and decreases observed across multiple regions (Fig. 6). The results of FDR-corrected paired sample t-tests are summarized in Table 2.. Post-association cortical thickening was observed in the right primary visual cortex. In the temporal cortex, cortical thickness increased in the right PeriSylvian language area, right parahippocampal area 1, and both the right dorsal and ventral posterior superior temporal sulcus. A cortical thinning following association phase was found in multiple frontal and cingulate areas. In the frontal lobe, significant decreases were observed in the right ventral premotor area, area 9 m in and area 8B lateral in the right hemisphere, right posterior area 9, and right anterior area 47r. In the cingulate cortex, cortical thickness decreased in area 33 prime and area 31a in the right hemisphere. Additionally, reductions were observed in the right para-insular area and area 13 l in the right hemisphere. These results overall provide evidence that exposure-based associations and sensory experiences lead to structural plasticity in the adult brain.

Fig. 6.

Fig. 6

Whole-brain cortical thickness changes from pre- to post-association phase. Cortical thickness changes occurred exclusively in the right hemisphere. Paired sample t-test results were limited to FDR-corrected p < 0.05. Statistically significant cortical thickness results were projected on a standard space cortical surface. Red color bar scale indicates increased cortical thickness whereas blue color bar scale represents decreased cortical thickness after the association phase. Bright colors represent more significant results

Table 2.

Whole-brain cortical thickness changes from pre- to post-association phase

Cortical region Mean difference CI lower -CI upper t value (df = 12) Adjusted p value
Post > pre
Primary visual cortex, RH 0.027 0.01, 0.04 3.98 0.046
PeriSylvian language area, RH 0.095 0.06, 0.13 5.73 0.012
Parahippocampal area 1, RH 0.114 0.07, 0.16 5.23 0.015
Superior temporal sulcus dorsal posterior, RH 0.083 0.04, 0.11 4.89 0.018
Superior temporal sulcus ventral posterior, RH 0.133 0.09, 0.18 6.54 0.005
Post < pre
Ventral premotor area, RH − 0.142 − 0.20, − 0.08 − 5.03 0.018
Area 33 prime, RH − 0.118 − 0.18, − 0.06 − 4.24 0.039
Area 9 m, RH − 0.058 − 0.09, − 0.03 − 4.23 0.039
Area 8B lateral, RH − 0.089 − 0.14, − 0.04 − 3.97 0.046
Area 9 posterior, RH − 0.073 − 0.11, − 0.03 − 4.05 0.046
Area anterior 47r, RH − 0.092 − 0.13, − 0.05 − 4.77 0.020
Area 13 l, RH − 0.165 − 0.20, − 0.13 − 9.34  < 0.001
Area OP2-3 VS, RH − 0.078 − 0.12, − 0.04 − 3.99 0.046
Area 31a, RH − 0.164 − 0.23, − 0.10 − 5.50 0.012
Para insular area, RH − 0.136 − 0.20, − 0.07 − 4.84 0.018

Note. The voxel threshold was p <.05. RH: right hemisphere. CI: confidence interval. Cortical regions are labelled according to the glasser atlas

Discussion

Our findings provide compelling evidence that passive exposure to audiovisual associations induces both behavioral shifts and neural plasticity in the adult brain. Behaviorally, the reports based on coherent motion perception were significantly influenced by audiovisual associations, as evidenced by systematic shifts in psychometric functions following exposure. More importantly, resting-state fMRI revealed widespread increases in connectivity between the cortical representation of the trained region in early visual areas (V1–V3) and higher-order occipital, temporal, and frontal regions, whereas connectivity changes for the untrained region were markedly smaller. Additionally, area MT+ showed increased connectivity with occipital, temporal, and somatosensory areas, further supporting experience-dependent network reorganization. Interestingly, cortical thickness changes were observed exclusively in the right hemisphere, primarily in higher-order association regions, with only one significant change in primary visual cortex (V1). Together, these findings demonstrate that passive exposure to audiovisual pairings induces plasticity beyond early sensory areas, extending to large-scale functional networks and structural reorganization in higher-order cortical regions.

Audiovisual associations influence behavioral reports of motion direction

Our behavioral results showed that audiovisual associations can lead to significant changes in the reports on visual motion direction. Specifically, as indicated by significant changes in coherence thresholds, we observed a notable shift in psychometric curves, suggesting a strong influence of associated static tones on perceived motion direction. These findings overall confirm that passive exposure to multisensory stimuli can lead to associative influences and thus cause major changes in subsequent perceptual judgements, particularly when the input to the visual system becomes ambiguous (Albright 2012; Hidaka et al. 2011; Kafaligonul and Oluk 2015; Teramoto et al. 2010).

A potential concern regarding these findings is whether the behavioral shifts reflect decisional biases rather than perceptual changes. In our study, the absence of systematic changes in the slope values across tone conditions provides some evidence against decisional biases as the sole explanation. If participants had relied purely on auditory cues while disregarding visual motion, we would expect changes in psychometric curves (e.g., changes in slope values) in the post-association test phase, indicating altered sensitivity to visual motion. However, our findings do not support this prediction.

That said, it remains possible participants adopted response strategies based on learned audiovisual associations, leading to shifts in their judgments without directly altering sensory processing. Prior studies have shown that response biases can shift the central tendency of psychometric curves (i.e., PSE) without affecting their slope, highlighting the importance of distinguishing between perceptual and decisional influences (García-Pérez and Alcalá-Quintana 2013; Morgan et al. 2012). One approach to distinguish these effects is to assess whether the learned influence on perception decays over time, as perceptual effects typically diminish, whereas decisional biases are more persistent (Winawer et al. 2010). While our design did not include delays between association and test phases to examine this, future studies could explore this possibility.

Previous research using similar experimental designs and stimuli has shown that audiovisual associations are selectively influenced by low-level stimulus features (e.g., visual field, sound frequency) and depend on the specific motion type used during the association and test phases, suggesting perceptual changes (Kafaligonul and Oluk 2015; Kobayashi et al. 2012a, 2012b). Our analyses further support this, showing that connectivity changes were substantial in the trained visual hemifield compared to the untrained hemifield (see Fig. 4). This suggests that the effects of learned associations were mostly localized to the region where exposure occurred, implying that the changes are likely driven by alterations in perceptual processing mechanisms (Hidaka et al. 2011; Kafaligonul and Oluk 2015; Kobayashi et al. 2012a, b). These findings suggest that perceptual changes likely contribute to the observed effects, even if we cannot completely rule out the contribution of decisional biases.

Overall, our findings, along with prior research on crossmodal influences on vision (Altieri et al. 2015; Gibson and Maunsell 1997; Gonzalo et al. 2000; Shams and Seitz 2008; Tanabe et al. 2005; Zangenehpour and Zatorre 2010) highlight the flexibility of sensory processing mechanisms and their role in adaptive behavior. Specifically, the current results suggest that the brain is capable of forming coherent perceptual experiences by monitoring information across sensory modalities, even without explicit training or reinforcement. This crossmodal adaptability may be a fundamental aspect of sensory processing, supporting the integration of passively learned associations that enhance perception, particularly under ambiguous conditions (Albright 2012).

Audiovisual associations influence resting-state functional connectivity

Our seed-to-voxel analyses revealed that exposure-based audiovisual associations may lead to widespread increases in functional connectivity, with these changes being markedly more pronounced for the cortical representation of the trained region in early visual cortices compared to the untrained (control) region. Specifically, the trained seed regions—mapped to the right hemisphere’s V1, V2, and V3 (reflecting stimulation in the left visual field)—showed robust connectivity enhancements with a variety of higher-order regions including the lateral occipital cortex, middle temporal gyrus, frontal pole, temporal pole, inferior frontal gyrus, and occipital fusiform gyrus and a low-level sensory region (i.e., occipital pole). In contrast, the untrained seed regions, located in the left hemisphere, exhibited significantly smaller connectivity changes, highlighting the spatial specificity of the plasticity induced by the audiovisual associations. The seed-to-voxel analyses also revealed decreases in functional connectivity after association phase, but these changes were much less prominent, being found only at three small clusters at cingulate cortex, precunes cortex for the trained V3 and frontal pole for the control V3 region. Finally, our secondary analysis using bilateral MT+/V5 as seed regions revealed increased connectivity with primary sensory regions such as the occipital pole and postcentral gyrus, as well as with higher-order areas including the lateral occipital cortex and superior temporal gyrus.

We found that both the size and the number of clusters for functional connections were more pronounced for high level regions (e.g., middle temporal gyrus, lateral occipital cortex) than for low level regions (e.g., occipital pole) (see Table 1.). Although the current experimental design and analyses do not directly identify the direction of information flow between these regions and reveal the contributions of feedforward and feedback connections, previous multisensory and learning studies emphasize top-down mechanisms. For example, models of brain function, such as predictive coding, posit that higher-level regions generate predictions about sensory input and send these predictions back to lower-level areas to be compared with actual input (Clark 2013; de-Wit et al. 2010; Friston 2005; Rao and Ballard 1999). In our study, the increased connectivity between seed areas and high-level brain regions may represent a refinement of these feedback loops driven by audiovisual experience (Petro et al. 2017; Vetter et al. 2014). Moreover, the involvement of regions such as the frontal pole, temporal pole, and inferior frontal gyrus, which are known for their roles in higher-order cognitive functions, suggests that feedback processes are likely enhanced. These regions typically exert top-down control rather than serving purely as relay stations for feedforward sensory information (Koechlin et al. 2003; Miller and Cohen 2001). Furthermore, perceptual learning similarly involves a strengthening of top-down feedback mechanisms to refine sensory processing (Ahissar and Hochstein 2004). When high-level regions become more functionally connected to sensory areas after training, these regions are more effectively exerting control and modulation, a hallmark of top-down feedback.

We observed more substantial functional connectivity changes for cTrained and cControl seed regions in V2 (eleven clusters across eight different functional regions) and V3 (18 clusters across ten different regions), with none of these connectivity changes involving primary sensory regions. By contrast, cTrained and cControl seed regions in V1 showed connectivity changes in only five clusters, and MT+/V5 in five clusters, yet these changes included connections with primary sensory regions, such as the occipital pole and postcentral gyrus. This pattern—where connectivity changes for V2 and V3 predominantly involved higher-order association areas—underscores the significant role of these regions in audiovisual associations, suggesting that they may be particularly responsive to crossmodal interactions. Notably, this finding aligns with the reverse hierarchy theory of perceptual experiences, which proposes that practice-induced changes occur earlier and more robustly in higher cortical areas than in primary sensory areas, making these higher-level changes more detectable (Ahissar and Hochstein 2004). The connectivity patterns we observed thus provide evidence that even associations without explicit feedback can rely more on higher-level cortical areas, with changes in these regions occurring more rapidly and extensively than in lower-level sensory areas.

Our analysis on resting-state recordings did not reveal any connectivity changes (i.e., feedforward crosstalk) between primary auditory and visual cortices that are traditionally thought to be'sensory-specific'. This may suggest that passive exposure to audiovisual associations over a few days might not be sufficient to induce detectable changes in resting-state connectivity between primary sensory areas. On the other hand, prior research has demonstrated that changes in audiovisual associations can also emerge from interactions at early sensory levels, particularly when auditory and visual stimuli are both temporally and spatially congruent (Beer et al. 2011a, b; Garner and Keller 2022). A key distinction between our study and these previous studies is that our audiovisual pairings were temporally aligned but not necessarily spatially congruent, which may have influenced the degree to which early sensory interactions were engaged. Additionally, studies reporting plasticity at lower-level sensory areas typically involve prolonged or repeated training, whereas our paradigm included only a few days of passive exposure.

Interestingly, we found strengthened connections between low-level seed regions (V1 and MT+/V5) and primary sensory regions like the occipital pole and postcentral gyrus. The association phase in our design included both an association session (i.e., consecutive presentations of audiovisual pairings) and a behavioral measurement. After each association session, the participants performed a direction discrimination task in tone and no tone (visual-only) conditions to measure coherence thresholds and direction discrimination performance. It is conceivable that the connectivity increase between low-level visual areas may be driven by unisensory experience (e.g., no tone condition) and training (direction discrimination of visual motion in all conditions) during behavioral measurements. It is also possible that the sensory experience and training acquired during behavioral measurements might contribute to changes in resting-state connectivity across other regions mentioned above.

Overall, while it is possible that the connectivity increases in resting-state include components of heightened feedforward processing, the substantial involvement of high-level association areas in contrast to nonsignificant changes in connectivity between primary auditory and visual areas highlights an important role of top-down feedback mechanisms in crossmodal associations and correspondence. The enhancements of functional connectivity likely reflect the brain's adaptation to better integrate and interpret multisensory information through improved feedback control (e.g., Rohe et al. 2019).

Structural plasticity following audiovisual associations

In addition to the robust functional connectivity changes observed in the trained regions (at the right V1, V2, and V3), our cortical thickness analyses revealed a parallel pattern of structural plasticity that was exclusively localized to the right hemisphere. Specifically, post-association cortical thickening was observed in the right primary visual cortex, as well as in several temporal regions, including the right PeriSylvian language area, right parahippocampal area 1, and both the right dorsal and ventral posterior superior temporal sulcus. Conversely, significant cortical thinning was detected in multiple frontal areas—namely the right ventral premotor area, areas 9 m and 8B lateral, the right posterior area 9, and right anterior area 47r—as well as in cingulate regions (areas 33 prime and 31a) and the para-insular area/area 13 l.

This pattern of structural changes resembles the functional connectivity results, where the trained regions in right early visual cortices exhibited widespread increases in connectivity with higher-order association areas—including frontal, temporal, and occipital regions—while the untrained regions in the left hemisphere showed markedly smaller connectivity changes. Notably, most of the significant cortical thickness alterations were located in higher-order regions, with only the primary visual cortex among the low-level sensory areas showing a significant increase. This convergence suggests that passively learned audiovisual associations lead to coordinated functional and structural reorganization, potentially reflecting enhanced top-down modulation and the refinement of feedback mechanisms across the cortical hierarchy.

The increased cortical thickness in occipital (i.e., V1) and temporal regions—areas known to support multisensory integration and memory (Albright 2012; Beauchamp 2005; Simons and Spiers 2003)—parallels the enhanced functional connectivity with the low (i.e., occipital pole) and higher-level areas. This expansion in cortical thickness suggests a structural basis for enhanced integrative capacities following associative learning (May 2011). Together, these findings point to a coordinated structural and functional reorganization that may underlie more effective top-down modulation. Conversely, the cortical thinning observed in frontal and cingulate regions might reflect an optimization process, potentially involving synaptic refinement, which aligns with previous studies proposing that reduced cortical thickness in certain areas can indicate more efficient neural circuitry (Holtmaat and Svoboda 2009; Magee and Grienberger 2020; Yoshihara et al. 2009; Zatorre et al. 2012). Mechanistically, these changes may involve glia- or synaptogenesis to support enhanced information processing and integration across sensory modalities (e.g., Zatorre et al. 2012). Our findings underscore the dynamic interplay between structural plasticity and functional connectivity following exposure-based audiovisual associations. They suggest that the brain adapts at both the network level and the structural level—strengthening integrative regions while refining control and modulatory circuits to support enhanced multisensory processing.

As mentioned above, the association phase of current design also included a behavioral measurement right after the association session on each day. Therefore, unisensory experience (visual motion) and training (direction discrimination) might also contribute to the observed changes in cortical thickness. For instance, it is conceivable that the decrease in cortical thickness in low- and/or mid-level visual areas might be driven by these additional measurement sessions and unisensory experience/training. Future research is needed to identify specific contributions of unisensory and multisensory experiences.

Limitations and future directions

Our study aimed to capture the effects of exposure-based audiovisual associations on perceptual changes and neuroplasticity within a limited timeframe and without feedback, distinguishing our results from typical perceptual learning paradigms. Given this focus, our design did not include a control group, as prior literature robustly supports association-driven perceptual shifts and connectivity changes under similar conditions (e.g., Hidaka et al. 2011; Gonzalo et al. 2000). To overcome this limitation, we incorporated an untrained visual hemifield as a control region, enabling each subject to serve as their own control. Even though our behavioral findings suggest that the crossmodal associations themselves were dominant during the association phase, we also acknowledge that exposure to visual motion over the course of several days may contribute to the observed changes in functional connectivity and cortical thickness. Future studies incorporating a control group with unpaired stimuli or no association training will be informative in delineating general effects of visual experiences from crossmodal association-specific changes.

While our sample size is modest, it aligns with neuroimaging research in audiovisual perceptual learning (e.g., Gonzalo et al. 2000; Powers et al. 2012) and/or visual motion studies (Smith et al. 1998; Rina et al. 2022) where individual differences are minimized through a within-subjects design. We further applied FDR corrections to account for multiple comparisons and to reduce the risk of Type I error, ensuring our reported effects were robust. Future investigations with larger sample sizes will enable subgroup analyses to examine potential individual variability in response to crossmodal associations (e.g., correlation analyses) and provide a direct link between behavioral performance values and functional/structural alterations in specific brain regions. Specifically, with a larger sample size, it could be statistically feasible to calculate an index of audiovisual associations [e.g., (PSEdown, post − PSEup, post) − (PSEdown, pre − PSEup, pre)] and test whether participants who experienced a larger auditory influence/index in the post-training session also showed more pronounced changes in functional connectivity and cortical thickness. Given the modest sample size of current dataset, we have refrained from conducting such analysis. Nonetheless, the results of our primary statistical tests provide compelling evidence that passive (exposure-based) sensory experiences may lead to functional and structural plasticity in the adult brain.

It is also worthwhile to mention that previous research highlights the contributions of subcortical structures, such as the thalamus and superior colliculus, to audiovisual integration (for a recent review, see Gao et al. 2023). Notably, studies on humans have predominantly used spatial cueing paradigms (e.g., Fairhall and Macaluso 2009; van den Brink et al. 2014). Given that visual motion processing primarily relies on cortical areas, the paradigm used in this study may not optimally isolate subcortical contributions to audiovisual associations and sensory experiences. A more targeted approach, such as paradigms designed to elicit subcortical activation or studies involving specific patient populations (e.g., Laeng et al. 2021), would be beneficial for addressing this question.

Conclusions

The current findings illustrate how passive exposure to audiovisual associations influence perceptual reports and lead to functional and structural changes in the adult brain. We found that audiovisual associations alter the behavioral reports on ambiguous visual motion, resulting in significant changes in coherence thresholds and highlighting the flexibility of mechanisms underlying dynamic vision. More importantly, resting-state recordings revealed enhancement of functional connectivity between our seed regions at early visual areas and higher-order cortical regions, alongside differential structural plasticity. The overlap between functional and structural measures indicates that enhanced connectivity aligns with structural expansion. These complementary findings highlight the brain's capacity to reorganize itself in response to multisensory input, strengthening integrative regions while refining control and modulatory circuits. Our results emphasize the dynamic nature of adult brain plasticity and the importance of crossmodal associations and correspondence in perception and adaptive behavior.

Supplementary Information

Below is the link to the electronic supplementary material.

Acknowledgements

We would like to thank our colleagues at Bilkent University for their valuable feedback and insightful discussions. We are also grateful to the anonymous reviewers whose constructive and helpful comments significantly improved our manuscript.

Author contribution

F.Z.K.Y.: Conceptualization, methodology, formal analysis, investigation, data curation, writing—original draft, writing—review and editing, and visualization; P.D.: Methodology, formal analysis, data curation, writing—review and editing, and visualization; H.K.: Conceptualization, methodology, investigation, data curation, software, writing—review and editing, funding acquisition, resources, and supervision.

Funding

Open access funding provided by the Scientific and Technological Research Council of Türkiye (TÜBİTAK). This work was supported by the Scientific and Technological Research Council of Türkiye (Co-Funded Brain Circulation Fellowship, 112 C010) and the Turkish Academy of Sciences (TUBA-GEBIP Award).

Data availability

The pre-processed fMRI data and behavioral data that support the findings of this study are available on request from the corresponding author.

Declaration

Conflict of interests

The authors declare no competing interests.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  1. Ahissar M, Hochstein S (2004) The reverse hierarchy theory of visual perceptual learning. Trends Cogn Sci 8(10):457–464. 10.1016/j.tics.2004.08.011 [DOI] [PubMed] [Google Scholar]
  2. Albright TD (2012) On the perception of probable things: neural substrates of associative memory, imagery, and perception. Neuron 74(2):227–245. 10.1016/j.neuron.2012.04.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Altieri N, Stevenson RA, Wallace MT, Wenger MJ (2015) Learning to associate auditory and visual stimuli: behavioral and neural mechanisms. Brain Topogr 28(3):479–493. 10.1007/s10548-013-0333-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Amunts K, Mohlberg H, Bludau S, Zilles K (2020) Julich-Brain: a 3D probabilistic atlas of the human brain’s cytoarchitecture. Science 369(6506):988–992. 10.1126/science.abb4588 [DOI] [PubMed] [Google Scholar]
  5. Bazay FE-Z, Benabdallah FZ, El Maliani AD (2022) Preprocessing FMRI data in SPM12. In: 2022 9th international conference on wireless networks and mobile communications (WINCOM), pp 1–5. 10.1109/WINCOM55661.2022.9966463
  6. Beauchamp MS (2005) See me, hear me, touch me: Multisensory integration in lateral occipital-temporal cortex. Curr Opin Neurobiol 15(2):145–153. 10.1016/j.conb.2005.03.011 [DOI] [PubMed] [Google Scholar]
  7. Beer AL, Batson MA, Watanabe T (2011a) Multisensory perceptual learning reshapes both fast and slow mechanisms of crossmodal processing. Cogn Affect Behav Neurosci 11(1):1–12. 10.3758/s13415-010-0006-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Beer AL, Plank T, Greenlee MW (2011b) Diffusion tensor imaging shows white matter tracts between human auditory and visual cortex. Exp Brain Res 213(2):299–308. 10.1007/s00221-011-2715-y [DOI] [PubMed] [Google Scholar]
  9. Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc: Ser B (Methodol) 57(1):289–300. 10.1111/j.2517-6161.1995.tb02031.x [Google Scholar]
  10. Benson NC, Winawer J (2018) Bayesian analysis of retinotopic maps. Elife 7:e40224. 10.7554/eLife.40224 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Benson NC, Butt OH, Brainard DH, Aguirre GK (2014) Correction of distortion in flattened representations of the cortical surface allows prediction of V1–V3 functional organization from anatomy. PLoS Comput Biol 10(3):e1003538. 10.1371/journal.pcbi.1003538 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Brainard DH (1997) The psychophysics toolbox. Spat vis 10(4):433–436 [PubMed] [Google Scholar]
  13. Britten KH, Shadlen MN, Newsome WT, Movshon JA (1992) The analysis of visual motion: a comparison of neuronal and psychophysical performance. J Neurosci 12(12):4745–4765. 10.1523/JNEUROSCI.12-12-04745.1992 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Burge WK, Griffis JC, Nenert R, Elkhetali A, DeCarlo DK, verHoef, L. W., Ross, L. A., & Visscher, K. M. (2016) Cortical thickness in human V1 associated with central vision loss. Sci Rep 6(1):23268. 10.1038/srep23268 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Clark A (2013) Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behav Brain Sci 36(3):181–204. 10.1017/S0140525X12000477 [DOI] [PubMed] [Google Scholar]
  16. Clavagnier S, Falchier A, Kennedy H (2004) Long-distance feedback projections to area V1: Implications for multisensory integration, spatial awareness, and visual consciousness. Cogn Affect Behav Neurosci 4(2):117–126. 10.3758/CABN.4.2.117 [DOI] [PubMed] [Google Scholar]
  17. Dale AM, Fischl B, Sereno MI (1999) Cortical surface-based analysis: I segmentation and surface reconstruction. Neuroimage 9(2):179–194. 10.1006/nimg.1998.0395 [DOI] [PubMed] [Google Scholar]
  18. Defenderfer M, Demirayak P, Visscher KM (2021) A method for mapping retinal images in early visual cortical areas. Neuroimage 245:118737. 10.1016/j.neuroimage.2021.118737 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Defenderfer MK, Demirayak P, Fleming LL, DeCarlo DK, Stewart P, Visscher KM (2023) Cortical plasticity in central vision loss: cortical thickness and neurite structure. Hum Brain Mapp 44(10):4120–4135. 10.1002/hbm.26334 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. de-Wit, L., Machilsen, B., & Putzeys, T. (2010) Predictive coding and the neural response to predictable stimuli. J Neurosci 30(26):8702–8703. 10.1523/JNEUROSCI.2248-10.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Di Stefano N, Spence C (2023) Perceptual similarity: insights from crossmodal correspondences. Rev Philos Psychol. 10.1007/s13164-023-00692-y [Google Scholar]
  22. Ditye T, Kanai R, Bahrami B, Muggleton NG, Rees G, Walsh V (2013) Rapid changes in brain structure predict improvements induced by perceptual learning. Neuroimage 81:205–212. 10.1016/j.neuroimage.2013.05.058 [DOI] [PubMed] [Google Scholar]
  23. Draganski B, Gaser C, Busch V, Schuierer G, Bogdahn U, May A (2004) Changes in grey matter induced by training. Nature 427(6972):311–312. 10.1038/427311a [DOI] [PubMed] [Google Scholar]
  24. Eckert MA, Kamdar NV, Chang CE, Beckmann CF, Greicius MD, Menon V (2008) A cross-modal system linking primary auditory and visual cortices: evidence from intrinsic fMRI connectivity analysis. Hum Brain Mapp 29(7):848–857. 10.1002/hbm.20560 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Fairhall SL, Macaluso E (2009) Spatial attention can modulate audiovisual integration at multiple cortical and subcortical sites. Eur J Neurosci 29(6):1247–1257. 10.1111/j.1460-9568.2009.06688.x [DOI] [PubMed] [Google Scholar]
  26. Falchier A, Clavagnier S, Barone P, Kennedy H (2002) Anatomical evidence of multimodal integration in primate striate cortex. J Neurosci off J Soc Neurosci 22(13):5749–5759. 10.1523/JNEUROSCI.22-13-05749.2002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Fischl B, Dale AM (2000) Measuring the thickness of the human cerebral cortex from magnetic resonance images. Proc Natl Acad Sci 97(20):11050–11055. 10.1073/pnas.200033797 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Fischl B, Salat DH, Busa E, Albert M, Dieterich M, Haselgrove C, van der Kouwe A, Killiany R, Kennedy D, Klaveness S, Montillo A, Makris N, Rosen B, Dale AM (2002) Whole brain segmentation: automated labeling of neuroanatomical structures in the human brain. Neuron 33(3):341–355. 10.1016/S0896-6273(02)00569-X [DOI] [PubMed] [Google Scholar]
  29. Fischl B, Liu A, Dale AM (2001) Automated manifold surgery: Constructing geometrically accurate and topologically correct models of the human cerebral cortex. IEEE Trans Med Imaging 20(1):70–80. 10.1109/42.906426 [DOI] [PubMed] [Google Scholar]
  30. Frank SM, Reavis EA, Greenlee MW, Tse PU (2016) Pretraining Cortical Thickness Predicts Subsequent Perceptual Learning Rate in a Visual Search Task. Cereb Cortex 26(3):1211–1220. 10.1093/cercor/bhu309 [DOI] [PubMed] [Google Scholar]
  31. Friston K (2005) A theory of cortical responses. Philos Trans R Soc b: Biol Sci 360(1456):815–836. 10.1098/rstb.2005.1622 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Gao C, Green JJ, Yang X, Oh S, Kim J, Shinkareva SV (2023) Audiovisual integration in the human brain: a coordinate-based meta-analysis. Cereb Cortex 33(9):5574–5584. 10.1093/cercor/bhac443 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. García-Pérez MA, Alcalá-Quintana R (2013) Shifts of the psychometric function: distinguishing bias from perceptual effects. Q J Exp Psychol 66(2):319–337. 10.1080/17470218.2012.708761 [DOI] [PubMed] [Google Scholar]
  34. Garner AR, Keller GB (2022) A cortical circuit for audio-visual predictions. Nat Neurosci 25(1):98–105. 10.1038/s41593-021-00974-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Gibson JR, Maunsell JHR (1997) Sensory modality specificity of neural activity related to memory in visual cortex. J Neurophysiol 78(3):1263–1275. 10.1152/jn.1997.78.3.1263 [DOI] [PubMed] [Google Scholar]
  36. Gilaie-Dotan S, Kanai R, Bahrami B, Rees G, Saygin AP (2013) Neuroanatomical correlates of biological motion detection. Neuropsychologia 51(3):457–463. 10.1016/j.neuropsychologia.2012.11.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Glasser MF, Coalson TS, Robinson EC, Hacker CD, Harwell J, Yacoub E, Ugurbil K, Andersson J, Beckmann CF, Jenkinson M, Smith SM, Van Essen DC (2016) A multi-modal parcellation of human cerebral cortex. Nature 536(7615):171–178. 10.1038/nature18933 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Gonzalo D, Shallice T, Dolan R (2000) Time-dependent changes in learning audiovisual associations: a single-trial fMRI study. Neuroimage 11(3):243–255. 10.1006/nimg.2000.0540 [DOI] [PubMed] [Google Scholar]
  39. Guerra-Carrillo B, Mackey AP, Bunge SA (2014) Resting-state fMRI: a window into human brain plasticity. Neuroscientist 20(5):522–533. 10.1177/1073858414524442 [DOI] [PubMed] [Google Scholar]
  40. Haier RJ, Karama S, Leyba L, Jung RE (2009) MRI assessment of cortical thickness and functional activity changes in adolescent girls following three months of practice on a visual-spatial task. BMC Res Notes 2(1):174. 10.1186/1756-0500-2-174 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Hebb DO (1949) The organization of behavior; a neuropsychological theory. Wiley, Hoboken, pp xix–335 [Google Scholar]
  42. Hidaka S, Teramoto W, Kobayashi M, Sugita Y (2011) Sound-contingent visual motion aftereffect. BMC Neurosci 12(1):44. 10.1186/1471-2202-12-44 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Ho CS, Giaschi DE (2009) Low- and high-level first-order random-dot kinematograms: evidence from fMRI. Vision Res 49(14):1814–1824. 10.1016/j.visres.2009.04.018 [DOI] [PubMed] [Google Scholar]
  44. Holtmaat A, Svoboda K (2009) Experience-dependent structural synaptic plasticity in the mammalian brain. Nat Rev Neurosci 10(9):647–658. 10.1038/nrn2699 [DOI] [PubMed] [Google Scholar]
  45. James W (1890) The principles of psychology. Henry Holt and Co, New York. 10.1037/10538-000 [Google Scholar]
  46. Kafaligonul H, Oluk C (2015) Audiovisual associations alter the perception of low-level visual motion. Front Integr Neurosci. 10.3389/fnint.2015.00026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Kanai R, Rees G (2011) The structural basis of inter-individual differences in human behaviour and cognition. Nat Rev Neurosci 12(4):231–242. 10.1038/nrn3000 [DOI] [PubMed] [Google Scholar]
  48. Kanai R, Bahrami B, Rees G (2010) Human parietal cortex structure predicts individual differences in perceptual rivalry. Curr Biol 20(18):1626–1630. 10.1016/j.cub.2010.07.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Karim AKMR, Proulx MJ, de Sousa AA, Likova LT (2021) Neuroplasticity and crossmodal connectivity in the normal, healthy brain. Psychol Neurosci 14(3):298–334. 10.1037/pne0000258 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Kobayashi M, Teramoto W, Hidaka S, Sugita Y (2012a) Indiscriminable sounds determine the direction of visual motion. Sci Rep 2(1):365. 10.1038/srep00365 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Kobayashi M, Teramoto W, Hidaka S, Sugita Y (2012b) Sound frequency and aural selectivity in sound-contingent visual motion aftereffect. PLoS ONE 7(5):e36803. 10.1371/journal.pone.0036803 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Koechlin E, Ody C, Kouneiher F (2003) The architecture of cognitive control in the human prefrontal cortex. Science 302(5648):1181–1185. 10.1126/science.1088545 [DOI] [PubMed] [Google Scholar]
  53. Konorski, J. (1967). Integrative activity of the brain. Chicago.
  54. Laeng B, Flaaten CB, Walle KM, Hochkeppler A, Specht K (2021) “Mickey mousing” in the brain: motion-sound synesthesia and the subcortical substrate of audio-visual integration. Front Human Neurosci. 10.3389/fnhum.2021.605166 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Legault J, Grant A, Fang S-Y, Li P (2019) A longitudinal investigation of structural brain changes during second language learning. Brain Lang 197:104661. 10.1016/j.bandl.2019.104661 [DOI] [PubMed] [Google Scholar]
  56. Linare D, López-Moliner J (2016) quickpsy: An R package to fit psychometric functions for multiple groups. R J 8(1):122–131. 10.32614/RJ-2016-008 [Google Scholar]
  57. Magee JC, Grienberger C (2020) Synaptic plasticity forms and functions. Ann Rev Neuroscince 43:95–117. 10.1146/annurev-neuro-090919-022842 [DOI] [PubMed] [Google Scholar]
  58. May A (2011) Experience-dependent structural plasticity in the adult human brain. Trends Cogn Sci 15(10):475–482. 10.1016/j.tics.2011.08.002 [DOI] [PubMed] [Google Scholar]
  59. McGann JP (2015) Associative learning and sensory neuroplasticity: How does it happen and what is it good for? Learn Mem 22(11):567–576. 10.1101/lm.039636.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Messinger A, Squire LR, Zola SM, Albright TD (2001) Neuronal representations of stimulus associations develop in the temporal lobe during learning. Proc Natl Acad Sci 98(21):12239–12244. 10.1073/pnas.211431098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Metzler-Baddeley C, Caeyenberghs K, Foley S, Jones DK (2016) Task complexity and location specific changes of cortical thickness in executive and salience networks after working memory training. Neuroimage 130:48–62. 10.1016/j.neuroimage.2016.01.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Miller EK, Cohen JD (2001) An integrative theory of prefrontal cortex function. Ann Rev Neurosci 24:167–202. 10.1146/annurev.neuro.24.1.167 [DOI] [PubMed] [Google Scholar]
  63. Morgan M, Dillenburger B, Raphael S, Solomon JA (2012) Observers can voluntarily shift their psychometric functions without losing sensitivity. Atten Percept Psychophys 74(1):185–193. 10.3758/s13414-011-0222-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Nieto-Castanon A (2020) Handbook of functional connectivity magnetic resonance imaging methods in CONN. 10.56441/hilbertpress.2207.6598
  65. Park H-J, Lee JD, Kim EY, Park B, Oh M-K, Lee S, Kim J-J (2009) Morphological alterations in the congenital blind based on the analysis of cortical thickness and surface area. Neuroimage 47(1):98–106. 10.1016/j.neuroimage.2009.03.076 [DOI] [PubMed] [Google Scholar]
  66. Pelli DG (1997) The VideoToolbox software for visual psychophysics: Transforming numbers into movies. Spat vis 10(4):437–442 [PubMed] [Google Scholar]
  67. Petro LS, Paton AT, Muckli L (2017) Contextual modulation of primary visual cortex by auditory signals. Philos Trans R Soc b: Biol Sci 372(1714):20160104. 10.1098/rstb.2016.0104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Petro LS, Vizioli L, Muckli L (2014) Contributions of cortical feedback to sensory processing in primary visual cortex. Front Psychol. 10.3389/fpsyg.2014.01223 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Pilly PK, Seitz AR (2009) What a difference a parameter makes: a psychophysical comparison of random dot motion algorithms. Vision Res 49(13):1599–1612. 10.1016/j.visres.2009.03.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Powers AR, Hevey MA, Wallace MT (2012) Neural correlates of multisensory perceptual learning. J Neurosci 32(18):6263–6274. 10.1523/JNEUROSCI.6138-11.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Prins D, Plank T, Baseler HA, Gouws AD, Beer A, Morland AB, Greenlee MW, Cornelissen FW (2016) Surface-based analyses of anatomical properties of the visual cortex in macular degeneration. PLoS ONE 11(1):e0146684. 10.1371/journal.pone.0146684 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Rao RPN, Ballard DH (1999) Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat Neurosci 2(1):79–87. 10.1038/4580 [DOI] [PubMed] [Google Scholar]
  73. Richardson JTE (2011) Eta squared and partial eta squared as measures of effect size in educational research. Educ Res Rev 6(2):135–147. 10.1016/j.edurev.2010.12.001 [Google Scholar]
  74. Rina A, Papanikolaou A, Zong X, Papageorgiou DT, Keliris GA, Smirnakis SM (2022) Visual motion coherence responses in human visual cortex. Front Neurosci 16:719250. 10.3389/fnins.2022.719250 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Rohe T, Ehlis A-C, Noppeney U (2019) The neural dynamics of hierarchical Bayesian causal inference in multisensory perception. Nat Commun 10(1):1907. 10.1038/s41467-019-09664-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Sarabi MT, Aoki R, Tsumura K, Keerativittayayut R, Jimura K, Nakahara K (2018) Visual perceptual training reconfigures post-task resting-state functional connectivity with a feature-representation region. PLoS ONE 13(5):e0196866. 10.1371/journal.pone.0196866 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Schlack A, Albright TD (2007) Remembering visual motion: neural correlates of associative plasticity and motion recall in cortical area MT. Neuron 53(6):881–890. 10.1016/j.neuron.2007.02.028 [DOI] [PubMed] [Google Scholar]
  78. Ségonne F, Dale AM, Busa E, Glessner M, Salat D, Hahn HK, Fischl B (2004) A hybrid approach to the skull stripping problem in MRI. Neuroimage 22(3):1060–1075. 10.1016/j.neuroimage.2004.03.032 [DOI] [PubMed] [Google Scholar]
  79. Shams L, Seitz AR (2008) Benefits of multisensory learning. Trends Cogn Sci 12(11):411–417. 10.1016/j.tics.2008.07.006 [DOI] [PubMed] [Google Scholar]
  80. Simons JS, Spiers HJ (2003) Prefrontal and medial temporal lobe interactions in long-term memory. Nat Rev Neurosci 4(8):637–648. 10.1038/nrn1178 [DOI] [PubMed] [Google Scholar]
  81. Sled, J. G., Zijdenbos, A. P., & Evans, A. C. (1998). A nonparametric method for automatic correction of intensity nonuniformity in MRI data. IEEE Transactions on Medical Imaging, 17(1), 87–97. IEEE Transactions on Medical Imaging. 10.1109/42.668698 [DOI] [PubMed]
  82. Smith AT, Greenlee MW, Singh KD, Kraemer FM, Hennig J (1998) The processing of first- and second-order motion in human visual cortex assessed by functional magnetic resonance imaging (fMRI). J Neurosci off J Soc Neurosci 18(10):3816–3830. 10.1523/JNEUROSCI.18-10-03816.1998 [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Spence C (2011) Crossmodal correspondences: a tutorial review. Atten Percept Psychophys 73(4):971–995. 10.3758/s13414-010-0073-7 [DOI] [PubMed] [Google Scholar]
  84. Tanabe HC, Honda M, Sadato N (2005) Functionally segregated neural substrates for arbitrary audiovisual paired-association learning. J Neurosci 25(27):6409–6418. 10.1523/JNEUROSCI.0636-05.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Teramoto W, Hidaka S, Sugita Y (2010) Sounds move a static visual object. PLoS ONE 5(8):e12255. 10.1371/journal.pone.0012255 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Urner M, Schwarzkopf DS, Friston K, Rees G (2013) Early visual learning induces long-lasting connectivity changes during rest in the human brain. Neuroimage 77:148–156. 10.1016/j.neuroimage.2013.03.050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. van den Brink RL, Cohen MX, van der Burg E, Talsma D, Vissers ME, Slagter HA (2014) Subcortical, modality-specific pathways contribute to multisensory processing in humans. Cereb Cortex 24(8):2169–2177. 10.1093/cercor/bht069 [DOI] [PubMed] [Google Scholar]
  88. Vetter P, Smith FW, Muckli L (2014) Decoding sound and imagery content in early visual cortex. Curr Biol 24(11):1256–1262. 10.1016/j.cub.2014.04.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Wenger E, Schaefer S, Noack H, Kühn S, Mårtensson J, Heinze H-J, Düzel E, Bäckman L, Lindenberger U, Lövdén M (2012) Cortical thickness changes following spatial navigation training in adulthood and aging. Neuroimage 59(4):3389–3397. 10.1016/j.neuroimage.2011.11.015 [DOI] [PubMed] [Google Scholar]
  90. Whitfield-Gabrieli S, Nieto-Castanon A (2012) Conn: a functional connectivity toolbox for correlated and anticorrelated brain networks. Brain Connect 2(3):125–141. 10.1089/brain.2012.0073 [DOI] [PubMed] [Google Scholar]
  91. Winawer J, Huk AC, Boroditsky L (2010) A motion aftereffect from visual imagery of motion. Cognition 114(2):276–284. 10.1016/j.cognition.2009.09.010 [DOI] [PubMed] [Google Scholar]
  92. Worschech F, Altenmüller E, Jünemann K, Sinke C, Krüger THC, Scholz DS, Müller CAH, Kliegel M, James CE, Marie D (2022) Evidence of cortical thickness increases in bilateral auditory brain structures following piano learning in older adults. Ann N Y Acad Sci 1513(1):21–30. 10.1111/nyas.14762 [DOI] [PubMed] [Google Scholar]
  93. Yoshihara Y, De Roo M, Muller D (2009) Dendritic spine formation and stabilization. Curr Opin Neurobiol 19(2):146–153. 10.1016/j.conb.2009.05.013 [DOI] [PubMed] [Google Scholar]
  94. Zangenehpour S, Zatorre RJ (2010) Crossmodal recruitment of primary visual cortex following brief exposure to bimodal audiovisual stimuli. Neuropsychologia 48(2):591–600. 10.1016/j.neuropsychologia.2009.10.022 [DOI] [PubMed] [Google Scholar]
  95. Zatorre RJ, Fields RD, Johansen-Berg H (2012) Plasticity in gray and white: neuroimaging changes in brain structure during learning. Nat Neurosci 15(4):528–536. 10.1038/nn.3045 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The pre-processed fMRI data and behavioral data that support the findings of this study are available on request from the corresponding author.


Articles from Brain Structure & Function are provided here courtesy of Springer

RESOURCES