Abstract
Visual perception can be modulated by sounds. A drastic example of this is the sound-induced flash illusion: when a single flash is accompanied by two bleeps, it is sometimes perceived in an illusory fashion as two consecutive flashes. However, there are strong individual differences in proneness to this illusion. Some participants experience the illusion on almost every trial, whereas others almost never do. We investigated whether such individual differences in proneness to the sound-induced flash illusion were reflected in structural differences in brain regions whose activity is modulated by the illusion. We found that individual differences in proneness to the illusion were strongly and significantly correlated with local grey matter volume in early retinotopic visual cortex. Participants with smaller early visual cortices were more prone to the illusion. We propose that strength of auditory influences on visual perception is determined by individual differences in recurrent connections, cross-modal attention and/or optimal weighting of sensory channels.
Keywords: multisensory, visual cortex, individual differences, voxel-based morphometry, sound-induced flash illusion
1. Introduction
When a single flash is accompanied by a rapid series of two or more bleeps, a perceptual ‘fission’ of the flash sometimes occurs and it is incorrectly perceived as multiple flashes [1]. The illusion is a striking example of how sounds can modulate visual perception (and thus challenges older theories of visual dominance [2]). It is in line with recent theories that propose statistically optimal weighting of sensory channels [3,4]. According to these theories, the weighting of sensory channels that together form an integrated percept depends on their relative levels of signal-to-noise ratio. In case of the sound-induced flash illusion, the number of events is easier to tell for auditory beeps than for visual flashes [4]. This also fits with other findings, showing that hearing has greater temporal resolution than vision (e.g. [5]; while vision has greater spatial resolution than hearing [6]).
However, perception of multisensory stimuli varies not only with stimulus properties, but also varies across observers. The same stimulus can evoke cross-modal effects reliably in some participants, but not in others. This can be seen for the sound-induced flash illusion [7], as well as for the McGurk illusion [8]. Individual differences in proneness to the McGurk illusion are correlated with the amplitude of blood-oxygen-level-dependent (BOLD)-signal responses to cross-modal stimuli in the left superior temporal sulcus (STS) [9,10]. Moreover, individual proneness to the sound-induced flash illusion is correlated with the degree to which sounds modulate visual event-related responses [7]. However, the neural basis of this variance in proneness to audio-visual interactions is still unclear.
Here, we tested whether individual proneness to an audiovisual illusion was correlated with differences in brain structure. Variability in several aspects of visual perception is correlated with differences in local brain structure (for a recent overview, see [11]). For instance, individual differences in the surface area of early visual cortex are correlated with individual differences in proneness to illusory size perception [12]. However, individual differences in the degree of cross-modal interactions have not previously been linked with variability in brain structure. Here, we tested whether proneness to the sound-induced flash illusion was correlated with differences in regional grey matter (GM) volume as measured using voxel-based morphometry (VBM; [13]). Experience of the ‘sound-induced flash illusion’ is accompanied by enhanced activity in retinotopically defined primary visual cortex (V1), superior colliculus (SC) and STS [14,15]. Furthermore, electroencephalography source localization [7] and short latencies of event-related magnetic field responses [15] suggest a role of auditory cortex in the illusion. Therefore, we hypothesized that individual differences in susceptibility to the sound-induced flash illusion would be reflected in structural variation of these regions across individuals.
2. Methods
(a). Participants
A total of 29 subjects from the University College London (UCL) participant pool (20 females, aged 18 to 42 years; mean: 25 years, s.d.: 6 years) took part in this study. All participants completed the behavioural study outside the scanner and underwent the anatomical magnetic resonance imaging (MRI) scan on a different day.
(b). Stimuli
The visual stimulus consisted of a uniform white disk (140 cd m−2) that flashed for 24 ms (two frames at 85 Hz) on a uniform grey background (90 cd m−2) on a cathode ray tube monitor. The disk diameter was 2° visual angle, and it was placed at 5° eccentricity directly above or below a fixation cross that was displayed at the middle of the screen. The auditory stimulus consisted of a pure tone at 3.5 kHz that was played for 20 ms at 65 dBA on speakers adjacent to the monitor. All stimuli were programmed and presented in Matlab (Mathworks, Ltd) using the Cogent Graphics (http://www.vislab.ucl.ac.uk/cogent.php) and Psychophysics Toolbox 3 extensions ([16,17], http://psychtoolbox.org). In each trial, either one, or two flashes were presented, accompanied by either no, one, or two beeps, resulting in six trial types (1F0B, 2F0B, 1F1B, 2F1B, 1F2B and 2F2B were xFxB stands for the number of flashes and beeps, respectively). The onsets of flashes and beeps were synchronous. In trials with a second flash and/or second beep, the onset of the second event was time-locked to 34 ms after the offset of the first flash (see figure 1a,b).
(c). Procedure
Participants sat on a chair in front of the monitor at 65 cm distance. They were asked to indicate whether they saw one or two flashes after each trial pressing either ‘1’ or ‘2’ on a numerical keypad with the index and middle finger of their right hand (in a time-window lasting 1800 ms after the stimulus presentation). Participants were advised they could ignore the beeps. Trials were presented in blocks of 102 trials with counterbalanced number of trial types in random order per block. A block lasted about five minutes, and participants were encouraged to take breaks in between blocks. The position of the visual stimulus (above or below fixation) was consistent within each block and changed (counterbalanced) between blocks. Each participant completed four to six blocks.
To ensure that participants kept fixation throughout a block their eye movements were monitored with an eyetracker system (Cambridge Research Systems). For 21 participants, eye movement data were fed into the stimulus presentation script online. For the programme to present the next trial, participants had to keep fixation for at least 500 ms. Fixations had to be within a square region of 3 × 3 around the fixation cross. For the remaining eight subjects, eye-data were analysed offline. For those participants, trials were excluded from analysis if the eyetracker did not record eye-position. Of the remaining trials (85.13%, s.d.: 13.54%), we included trials if participants' fixation did not deviate more than 1.5° from the midpoint of the screen on the vertical axis (95.41%, s.d.: 5.01%).
(d). Analysis of behavioural data
All statistical analyses of the behavioural data were performed in Matlab (Mathworks, Ltd) and Pasw v. 18.0 (SPSS inc./IBM). To test for the sound-induced flash illusion, we compared the proportion of correct answers between conditions with a repeated measures ANOVA and post hoc t-tests. To determine proneness to the illusion, a ‘fission score’ (FiS) was calculated for each subject. It was defined as one minus the proportion of correct answers in the illusion trials (pCorr(1F2B)) and corrected for any response bias to report two flashes independent of the number of beeps:
Additionally, the FiS was calculated separately for the two stimulus positions (above or below the fixation cross). The mean, range and variance for FiS and the simple proportion of correct trials were determined. Furthermore, the correlation between FiS for trials in which the disk flashed above and below the fixation cross, respectively, was calculated.
(e). MRI data acquisition and pre-processing
T1 anatomical images of the brain were obtained with a 1.5 T Siemens Sonata MRI scanner (Siemens Medical). High-resolution anatomical images were acquired using a T1-weighted, three-dimensional, modified, driven-equilibrium Fourier transform sequence (repetition time = 12.24 ms; echo time = 3.56 ms; field of view = 256 × 256 mm; voxel size = 1 × 1 × 1 mm).
T1-weighted magnetic resonance images were first segmented for GM and white matter (WM) using the ‘New Segment’ segmentation tools in Statistical Parametric Mapping 8 (SPM8, http://www.fil.ion.ucl.ac.uk/spm). Subsequently, we performed diffeomorphic anatomical registration through exponentiated lie algebra (DARTEL) in SPM8 for intersubject registration of the GM and WM images [18]. To ensure that regional GM volume was maintained after the registration, the registered images were modulated by the Jacobian determinant of the flow fields computed by DARTEL. The registered images were smoothed with a Gaussian kernel of 8 mm full-width at half-maximum and transformed to Montreal Neurological Institute (MNI) stereotactic space using affine and nonlinear spatial normalization implemented in SPM8.
(f). Voxel-based morphometry: statistical analysis
To test for correlations between GM volume and illusion strength, multiple regression analyses were performed on the smoothed GM images. FiSs were entered as vectors of interest into the design matrix, while total GM volume, age and sex were included as regressors of no interest in the model to control for any differences in these variables.
To incorporate our a priori hypotheses concerning the brain structures that we predicted to be involved, region of interest (ROI) masks were created using the SPM anatomy toolbox (http://www.fz-juelich.de/inm/inm-1/spm_anatomy_toolbox) and MarsBaR (http://marsbar.sourceforge.net/). The first three ROIs were derived from the illusion-specific significant activations reported in [14]. This study reported significant activation of retinotopically defined V1, and of right posterior STS and the SC. Because we did not have retinotopic data for our VBM subjects and the size of V1 has relatively large inter-individual variation [12,19], we used histological maximum probability maps to combine the BA17 and BA18 regions into one mask [20]. Right posterior STS and SC were incorporated via a 10 and 4 mm radius sphere, respectively, centred on the stereotactic coordinates of the peak voxels reported in [14]: [54,−54,30] and [2,−30,0]. Primary auditory cortex was added as a ROI following [7] and as defined by histological maximum probability maps [20].
The average GM volume within the ROIs was derived with MarsBaR and correlated with FiSs (controlling for total GM volume, age and sex). We used Bonferroni correction to adjust statistical thresholds for multiple ROIs tested. To avoid violations of the assumption of normally distributed data, a logit transformation was applied to the GM volumes [13]. Normality plots and Lilliefors tests [21] were then performed to test the assumption of normality for the distribution of the logit transformed average GM volumes within the ROIs and for FiSs, (the assumption of normality could not be refuted for any of them: FiS: D = 0.10, p = 0.68, n.s.; BA1718 GM: D = 0.11, p = 0.49, n.s.; r pSTS GM: D = 0.07, p = 0.99, n.s.; SC GM: D = 0.09, p = 0.87, n.s.; primary auditory cortex GM: D = 0.15, p = 0.10, n.s.). Outside the ROIs, an additional exploratory whole-brain analysis was carried out, using a threshold of p < 0.05, corrected for multiple comparisons using the familywise error (FWE) rate.
3. Results
(a). Behavioural results
The sound-induced flash illusion was replicated [1,14]. Participants on average answered correctly on 38 per cent (s.d.: 28%) of 1F2B trials, indicating that they perceived the illusion on average in 62 per cent of trials. The inter-individual range was 2–100%. A repeated measures ANOVA indicated a significant difference between conditions (F2.50 = 45.17, p < 0.001, Greenhouse-Geisser corrected; figure 1c; table 1). The proportion of correct answers was significantly lower in the 1F2B condition, when compared with all other conditions and indicated by post hoc paired t-tests. Ninety-five per cent confidence intervals of the differences indicated that this was a strong and robust effect (table 2). Whether the disc appeared in the upper or lower visual field did not alter the strength of the sound-induced flash illusion (FiS below: 0.64 (s.d.: 0.29), FiS above: 0.62 (s.d.: 0.36); paired t-test: t28 = 0.64, p = 0.53, n.s.). The likelihood of reporting the illusion was highly correlated within individuals and across trials in which the disk flashed above versus below the fixation cross (r = 0.84, p < 10−7). The distribution of FiSs across participants is given in figure 1d.
Table 1.
min | max | range | mean | s.d. | |
---|---|---|---|---|---|
1F1B | 0.79 | 1.00 | 0.21 | 0.94 | 0.06 |
1F2B | 0 | 0.98 | 0.98 | 0.38 | 0.28 |
2F1B | 0.04 | 0.98 | 0.94 | 0.66 | 0.29 |
2F2B | 0.32 | 1.00 | 0.68 | 0.93 | 0.14 |
1F0B | 0.57 | 1.00 | 0.43 | 0.87 | 0.11 |
2F0B | 0.49 | 1.00 | 0.51 | 0.88 | 0.13 |
FiS | 0.02 | 1.20 | 1.17 | 0.63 | 0.30 |
FiS below | 0 | 1.25 | 1.25 | 0.64 | 0.29 |
FiS above | 0.04 | 1.39 | 1.35 | 0.62 | 0.36 |
Table 2.
contrast | t28 | s.d. (t) | 95% CI | p |
---|---|---|---|---|
1F2B versus 1F1B | −11.24 | 0.27 | [−0.66, −0.46] | <10−11 |
1F2B versus 2F1B | −4.86 | 0.31 | [−0.40, −0.16] | <10−4 |
1F2B versus 2F2B | −9.11 | 0.33 | [−0.67, −0.42] | <10−9 |
1F2B versus 1F0B | −10.01 | 0.26 | [−0.59, −0.39] | <10−10 |
1F2B versus 2F0B | −10.07 | 0.26 | [−0.60, −0.39] | <10−10 |
(b). MRI results
The VBM analysis revealed a strong and statistically significant negative correlation between FiS and local GM volume in the BA17&18 ROI (controlled for global GM volume, age and sex): r = −0.55, t24 = −3.27, p = 0.01 (two-tailed and Bonferroni corrected for multiple ROIs; cf. figure 2). Note that this correlation remained qualitatively unchanged and statistically significant when not controlling for age and gender (r = −0.54, t26 =−3.30, p = 0.003) and when using raw behavioural scores (1-pCorr(1F2B)) instead of FiS (r = −0.47, t26 = −2.74, p = 0.01). No significant correlation between local GM volume and FiS was found for the primary auditory cortex, posterior STS and SC ROIs (cf. figure 3). Additional exploratory whole-brain analysis yielded no further significant findings at a threshold of p < 0.05, corrected for multiple comparisons (FWE correction).
To further test whether the correlation between proneness to the fission illusion and GM volume in the BA17&18 ROI was driven by the BA17 or BA18 region (or both), we correlated GM volume in each region separately with FiS (again controlled for global GM volume, age and sex). The GM volume of both regions was significantly negatively correlated with FiS; the relationship was slightly stronger for the BA18 mask (r = −0.60, t24 = −3.68, p = 0.001, two-tailed) than for the BA17 mask (r = −0.47, t24 = −2.64, p = 0.01, two-tailed), but this difference was not significant (Z = 0.97, p = 0.33, n.s.).
4. Discussion
We found reliable inter-individual differences in proneness to the sound-induced flash illusion. While some participants experienced the illusion hardly ever, others experienced it on almost every trial. Moreover, individual proneness to the illusion proved highly reliable across blocks with different flash locations, suggesting it to be a stable, trait-like feature.
We found a strong correlation between proneness to the illusion and local GM volume in early visual cortex (cf. figure 2). Participants with low GM volume in the BA17/18 region experienced the illusion significantly more often. Given we regressed out global GM volume, this points to a systematic relationship between individual proneness to the illusion and the relative amount of total GM dedicated to early visual cortex.
Higher proneness to the sound-induced flash illusion is associated with greater multisensory modulation of visual event related potentials (ERPs) [7]. In that earlier study proneness to the illusion correlated not only with the magnitude of illusion associated ERPs but also with supra-additive multisensory ERPs in trials with two beeps and two flashes (that did not induce any illusion). This result suggests increased proneness to the sound-induced flash illusion is associated with a more general enhancement of audiovisual integration. BOLD responses in V1 were found to be enhanced in an illusion specific way [14]. That is, BOLD responses in V1 were only enhanced by a second beep in trials in which it successfully induced the illusion. Taken together, these previous studies demonstrate that proneness to the sound-induced flash illusion correlates with stronger and more frequent multisensory modulations of early visual cortex activity. In light of these previous findings, our results predict that individuals with smaller visual cortices will exhibit stronger multisensory modulation of visual responses.
One possible explanation for greater multisensory effects being observed in anatomically smaller visual cortices may be incomplete scaling of multisensory connections with early visual cortex. Because visual cortex volume was controlled for total GM volume in our study, a smaller value implies a smaller proportion of GM dedicated to early visual processing. This might imply a greater ratio of multisensory GM (e.g. in STS) versus visual GM and thus result in a greater number of multisensory synapses per visual neuron. This, in turn, would explain the higher likelihood for auditory modulation of visual perception, as indicated by proneness to the sound-induced flash illusion. However, this hypothesis should be taken with care and needs further testing. A post hoc test on our data could not confirm a significant correlation between the FiS, on the one hand, and the ratio of GM volume in the BA1718 mask versus in the spherical right posterior STS mask, on the other (r = −0.18, t25 = −0.91, p = 0.37, n.s.).
An alternate hypothesis is that the weighting of sensory channels is tuned to the availability of neural resources. Our finding suggests that the relative amount of neural resources dedicated to the visual modality in an individual brain influences the weight placed on this sensory channel. Such a mechanism would be complementary to weighting mechanisms tuned to relative levels of input noise [3,23–25] as confirmed for the sound-induced flash illusion [4]. If the brain weights sensory channels according to their relative levels of effective noise, this will reflect more than input noise. It will also take into account the relative levels of intrinsic noise of sensory channels. One (well studied) aspect of such intrinsic noise is the general suitability of a sensory channel for the stimulus dimension at hand. This is reflected in general tendencies across participants, like visual dominance for spatial judgements [6] and auditory dominance for temporal judgements [5]. That is, the signal-to-noise ratio for spatial stimulus aspects is generally higher in the visual than in the auditory channel, whereas the opposite is true for temporal stimulus aspects.
Our results suggest an additional, more subtle aspect of intrinsic noise: it may vary between subjects according to the amount of GM dedicated to the specific sensory channel. Such a hypothesis would be in line with previous results, showing that the surface area of primary visual cortex in healthy humans is correlated with the cortical magnification factor at eccentricities comparable to that of our visual stimulus [26]. That is, subjects with smaller visual cortices have a visual representation that exhibits coarser spatial tuning. Further, this relationship between visual cortex area and acuity exists on the behavioural level as well. Cortical magnification within V1 is correlated with Vernier and grating acuity thresholds—across observers and eccentricities [27]. Taken together, previous results have shown that early visual cortex size correlates with the signal-to-noise ratio of visual representations. Our current finding converges with these results to suggest that the weighting of the visual channel in multisensory integration is tuned to the amount of GM dedicated to early visual cortex.
Variability in V1 surface area is negatively correlated with proneness to contextual visual-size illusions [12]. This result is interesting in the context of our study because it parallels the relationship between small visual cortex and high illusion proneness we found. This suggests that contextual influences are generally increased in small visual cortices—both, within and across modalities. Schwarzkopf et al. [12] interpreted their finding as pointing to a greater number of lateral connections from distant visual field representations within smaller visual cortices. One might speculate that our results might point to a similar neuroanatomical phenomenon, albeit across different areas of the brain rather than within one area. Increased contextual influence on visual processing might be owing to a higher degree of neural connectedness in small visual cortices—within and across areas as well as within and across modalities.
Finally, our results might be linked to individual differences in attention mechanisms [11,28]. Recent findings link the strength of the illusion to several such mechanisms. Specifically, top-down modality-specific attention shifts can suppress processing in the distractor modality and thereby attenuate the (visuo-tactile version of the) illusion [29]. Spatial attention directed away from the audiovisual stimuli diminishes early occipito-temporal components of the illusion-specific ERP components (which have been shown to be increased for participants experiencing the illusion more often [30]). Disruption of the angular gyrus with transcranial magnetic stimulation results in less frequent perception of the illusion, which has been attributed to attenuated effects of bottom-up attention evoked by the sounds [31]. Consequently, participants who are more prone to the sound-induced flash illusion could be more susceptible to auditory attentional capture, they could allocate more attention to the spatial position of the audiovisual stimuli or they could be less able to suppress the auditory modality via top-down attention. It is also interesting to speculate whether both the effects of cross-modal attention and the early visual cortex GM volume may be linked to effects of large network oscillatory phase reset [32,33].
Future experiments will shed more light on the mechanisms behind our finding. A putative relationship between early visual cortex size and its connectedness with multisensory areas can be tested using probabilistic tractography [34]. Functional definitions of early visual cortex (using retinotopic mapping, [35]) would allow dissociation of visual cortex area, and thickness and their respective association with proneness to the sound-induced flash illusion. To test the role of subtle differences in visual processing across participants, future studies could use sensitive visual tests, such as vernier acuity or a version of our purely visual trials, modified to enhance inter-individual variance in this condition (such as adding noise masks). A potential link between proneness to the illusion and differences in cross-modal attention could be tested behaviourally and followed up by tests on the individual propensity for cross-modal oscillatory phase reset [32,33].
In conclusion, we found a strong, negative correlation between early visual cortex GM volume and proneness to the sound-induced flash illusion. We proposed a neuroanatomical and functional explanations for this finding and ways to test these explanations in further experiments.
Acknowledgements
All participants were right-handed, had normal or corrected to normal vision and reported no hearing problems. Written informed consent was obtained from each participant, and the study was approved by the UCL ethics committee.
This work was funded by the Wellcome Trust and Japan Science and Technology Agency (R.K.). We thank Dr Ged Ridgway and Dr Vincenzo Romei as well as two anonymous referees for valuable input. Neuroimaging data cannot be made freely available owing to privacy and ethical restrictions associated with the possibility of reconstructing facial identity from the data. Data enquiries for anonymised subsets of the original data can be made to the corresponding author.
References
- 1.Shams L., Kamitani Y., Shimojo S. 2000. Illusions. What you see is what you hear . Nature 408, 788. 10.1038/35048669 (doi:10.1038/35048669) [DOI] [PubMed] [Google Scholar]
- 2.Colavita F. B. 1974. Human sensory dominance. Percept. Psychophys. 16, 409–412 10.3758/BF03203962 (doi:10.3758/BF03203962) [DOI] [Google Scholar]
- 3.Ernst M. O., Banks M. S. 2002. Humans integrate visual and haptic information in a statistically optimal fashion. Nature 415, 429–433 10.1038/415429a (doi:10.1038/415429a) [DOI] [PubMed] [Google Scholar]
- 4.Shams L., Ma W. J., Beierholm U. 2005. Sound-induced flash illusion as an optimal percept. Neuroreport 16, 1923–1927 10.1097/01.wnr.0000187634.68504.bb (doi:10.1097/01.wnr.0000187634.68504.bb) [DOI] [PubMed] [Google Scholar]
- 5.Romei V., De Haas B., Mok R. M., Driver J. 2011. Auditory stimulus timing influences perceived duration of co-occurring visual stimuli. Front. Psychol. 2, 215. 10.3389/fpsyg.2011.00215 (doi:10.3389/fpsyg.2011.00215) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Alais D., Burr D. 2004. The ventriloquist effect results from near-optimal bimodal integration. Curr. Biol. 14, 257–262 10.1016/j.cub.2004.01.029 (doi:10.1016/j.cub.2004.01.029) [DOI] [PubMed] [Google Scholar]
- 7.Mishra J., Martinez A., Sejnowski T. J., Hillyard S. A. 2007. Early cross-modal interactions in auditory and visual cortex underlie a sound-induced visual illusion. J. Neurosci. 27, 4120–4131 10.1523/JNEUROSCI.4912-06.2007 (doi:10.1523/JNEUROSCI.4912-06.2007) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.McGurk H., MacDonald J. 1976. Hearing lips and seeing voices. Nature 264, 746–748 10.1038/264746a0 (doi:10.1038/264746a0) [DOI] [PubMed] [Google Scholar]
- 9.Nath A. R., Fava E. E., Beauchamp M. S. 2011. Neural correlates of interindividual differences in children's audiovisual speech perception. J. Neurosci. 31, 13 963–13 971 10.1523/JNEUROSCI.2605-11.2011 (doi:10.1523/JNEUROSCI.2605-11.2011) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Nath A. R., Beauchamp M. S. 2012. A neural basis for interindividual differences in the McGurk effect, a multisensory speech illusion. NeuroImage 59, 781–787 10.1016/j.neuroimage.2011.07.024 (doi:10.1016/j.neuroimage.2011.07.024) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kanai R., Rees G. 2011. The structural basis of inter-individual differences in human behaviour and cognition. Nat. Rev. Neurosci. 12, 231–242 10.1038/nrn3000 (doi:10.1038/nrn3000) [DOI] [PubMed] [Google Scholar]
- 12.Schwarzkopf D. S., Song C., Rees G. 2011. The surface area of human V1 predicts the subjective experience of object size. Nat. Neurosci. 14, 28–30 10.1038/nn.2706 (doi:10.1038/nn.2706) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ashburner J., Friston K. J. 2000. Voxel-based morphometry: the methods. Neuroimage 11, 805–821 10.1006/nimg.2000.0582 (doi:10.1006/nimg.2000.0582) [DOI] [PubMed] [Google Scholar]
- 14.Watkins S., Shams L., Tanaka S., Haynes J. D., Rees G. 2006. Sound alters activity in human V1 in association with illusory visual perception. Neuroimage 31, 1247–1256 10.1016/j.neuroimage.2006.01.016 (doi:10.1016/j.neuroimage.2006.01.016) [DOI] [PubMed] [Google Scholar]
- 15.Shams L., Iwaki S., Chawla A., Bhattacharya J. 2005. Early modulation of visual cortex by sound: an MEG study. Neurosci. Lett. 378, 76–81 10.1016/j.neulet.2004.12.035 (doi:10.1016/j.neulet.2004.12.035) [DOI] [PubMed] [Google Scholar]
- 16.Brainard D. H. 1997. The psychophysics toolbox. Spatial Vis. 10, 433–436 10.1163/156856897X00357 (doi:10.1163/156856897X00357) [DOI] [PubMed] [Google Scholar]
- 17.Pelli D. G. 1997. The VideoToolbox software for visual psychophysics: transforming numbers into movies. Spatial Vis. 10, 437–442 10.1163/156856897X00366 (doi:10.1163/156856897X00366) [DOI] [PubMed] [Google Scholar]
- 18.Ashburner J. 2007. A fast diffeomorphic image registration algorithm. NeuroImage 38, 95–113 10.1016/j.neuroimage.2007.07.007 (doi:10.1016/j.neuroimage.2007.07.007) [DOI] [PubMed] [Google Scholar]
- 19.Dougherty R. F., Koch V. M., Brewer A. A., Fischer B., Modersitzki J., Wandell B. A. 2003. Visual field representations and locations of visual areas V1/2/3 in human visual cortex. J. Vis. 3, 586–598 10.1167/3.10.1 (doi:10.1167/3.10.1) [DOI] [PubMed] [Google Scholar]
- 20.Eickhoff S. B., Heim S., Zilles K., Amunts K. 2006. Testing anatomically specified hypotheses in functional imaging using cytoarchitectonic maps. NeuroImage 32, 570–582 10.1016/j.neuroimage.2006.04.204 (doi:10.1016/j.neuroimage.2006.04.204) [DOI] [PubMed] [Google Scholar]
- 21.Lilliefors H. W. 1967. On the Kolmogorov–Smirnov test for normality with mean and variance unknown. J. Am. Statist. Assoc. 62, 399–402 10.1080/01621459.1967.10482916 (doi:10.1080/01621459.1967.10482916) [DOI] [Google Scholar]
- 22.Holmes C. J., Hoge R., Collins L., Woods R., Toga A. W., Evans A. C. 1998. Enhancement of MR images using registration for signal averaging. J. Comput. Assisted Tomogr. 22, 324–333 10.1097/00004728-199803000-00032 (doi:10.1097/00004728-199803000-00032) [DOI] [PubMed] [Google Scholar]
- 23.Battaglia P. W., Jacobs R. A., Aslin R. N. 2003. Bayesian integration of visual and auditory signals for spatial localization. J. Opt. Soc. Am. A Opt. Image Sci. Vis. 20, 1391–1397 10.1364/JOSAA.20.001391 (doi:10.1364/JOSAA.20.001391) [DOI] [PubMed] [Google Scholar]
- 24.Morgan M. L., Deangelis G. C., Angelaki D. E. 2008. Multisensory integration in macaque visual cortex depends on cue reliability. Neuron 59, 662–673 10.1016/j.neuron.2008.06.024 (doi:10.1016/j.neuron.2008.06.024) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Beauchamp M. S., Pasalar S., Ro T. 2010. Neural substrates of reliability-weighted visual-tactile multisensory integration. Front. Systems Neurosci. 4, 25. 10.3389/fnsys.2010.00025 (doi:10.3389/fnsys.2010.00025) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Harvey B. M., Dumoulin S. O. 2011. The relationship between cortical magnification factor and population receptive field size in human visual cortex: constancies in cortical architecture. J. Neurosci. 31, 13 604–13 612 10.1523/JNEUROSCI.2572-11.2011 (doi:10.1523/JNEUROSCI.2572-11.2011) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Duncan R. O., Boynton G. M. 2003. Cortical magnification within human primary visual cortex correlates with acuity thresholds. Neuron 38, 659–671 10.1016/S0896-6273(03)00265-4 (doi:10.1016/S0896-6273(03)00265-4) [DOI] [PubMed] [Google Scholar]
- 28.Kanai R., Dong M. Y., Bahrami B., Rees G. 2011. Distractibility in daily life is reflected in the structure and function of human parietal cortex. J. Neurosci. 31, 6620–6626 10.1523/JNEUROSCI.5864-10.2011 (doi:10.1523/JNEUROSCI.5864-10.2011) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Werkhoven P. J., van Erp J. B. F., Philippi T. G. 2009. Counting visual and tactile events: the effect of attention on multisensory integration. Atten. Percept. Psychophys. 71, 1854–1861 10.3758/APP.71.8.1854 (doi:10.3758/APP.71.8.1854) [DOI] [PubMed] [Google Scholar]
- 30.Mishra J., Martínez A., Hillyard S. A. 2010. Effect of attention on early cortical processes associated with the sound-induced extra flash illusion. J. Cogn. Neurosci. 22, 1714–1729 10.1162/jocn.2009.21295 (doi:10.1162/jocn.2009.21295) [DOI] [PubMed] [Google Scholar]
- 31.Kamke M. R., Vieth H. E., Cottrell D., Mattingley J. B. 2012. Parietal disruption alters audiovisual binding in the sound-induced flash illusion. NeuroImage 62, 1334–1341 10.1016/j.neuroimage.2012.05.063 (doi:10.1016/j.neuroimage.2012.05.063) [DOI] [PubMed] [Google Scholar]
- 32.Lakatos P., O'Connell M. N., Barczak A., Mills A., Javitt D. C., Schroeder C. E. 2009. The leading sense: supramodal control of neurophysiological context by attention. Neuron 64, 419–430 10.1016/j.neuron.2009.10.014 (doi:10.1016/j.neuron.2009.10.014) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Romei V., Gross J., Thut G. 2012. Sounds reset rhythms of visual cortex and corresponding human visual perception. Curr. Biol. 22, 807–813 10.1016/j.cub.2012.03.025 (doi:10.1016/j.cub.2012.03.025) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Beer A. L., Plank T., Greenlee M. W. 2011. Diffusion tensor imaging shows white matter tracts between human auditory and visual cortex. Exp. Brain Res. Experimentelle Hirnforschung. Expérimentation cérébrale 213, 299–308 10.1007/s00221-011-2715-y (doi:10.1007/s00221-011-2715-y) [DOI] [PubMed] [Google Scholar]
- 35.Sereno M. I., Dale A. M., Reppas J. B., Kwong K. K., Belliveau J. W., Brady T. J., Rosen B. R., Tootell R. B. 1995. Borders of multiple visual areas in humans revealed by functional magnetic resonance imaging. Science 268, 889–893 10.1126/science.7754376 (doi:10.1126/science.7754376) [DOI] [PubMed] [Google Scholar]