Skip to main content
. 2021 Jun 11;31(11):4986–5005. doi: 10.1093/cercor/bhab136

Figure 5 .


Figure 5

Predicting cocktail-party responses. Passive-listening models were tested during the cocktail-party task by predicting BOLD responses in the cocktail-party data. Since a voxel might represent information from both attended and unattended stimuli, response predictions were expressed as a convex combination of individual predictions for the attended and unattended story within each 2-speaker story. Prediction scores were computed as the combination weights (Inline graphic) were varied in [0 1] (see Materials and Methods). Prediction scores for a given model were averaged across speech-selective voxels within each ROI (Inline graphic). The normalized scores of spectral, articulatory, and semantic models are displayed in several representative ROIs (HG/HS, HG/HS, and PT). Solid and dashed lines indicate mean and 95% confidence intervals across subjects. Scores based on only the attended story (Inline graphic), based on only the unattended story (Inline graphic), and based on the optimal combination of the two (Inline graphic) are marked with circles. For the “spectral model” in left HG/HS, Inline graphic is larger than Inline graphic (P < 10−4); and the optimal combination equally weighs attended and unattended stories. For the “articulatory model” in left HG/HS, Inline graphic is larger than Inline graphic (P < 10−4), whereas Inline graphic is greater than Inline graphic (P < 10−2). Besides, the optimal combination puts slightly higher weight to attended story than unattended story. For the “semantic model” in left PT, Inline graphic is much higher than Inline graphic (P < 10−4), and the optimal combination puts much greater weight to attended story than unattended one. These representative results imply that attention may have divergent effects at various levels of speech representations across cortex.