Skip to main content
Frontiers in Neuroscience logoLink to Frontiers in Neuroscience
. 2018 Sep 20;12:666. doi: 10.3389/fnins.2018.00666

The Association Between Genetic Variation in FOXP2 and Sensorimotor Control of Speech Production

Siyun Zhang 1,, Jiangli Zhao 1,, Zhiqiang Guo 2,, Jeffery A Jones 3, Peng Liu 1, Hanjun Liu 1,4,*
PMCID: PMC6158330  PMID: 30294257

Abstract

Significant advances have been made in understanding the role of auditory feedback in sensorimotor integration for speech production. The neurogenetic basis of this feedback-based control process, however, remains largely unknown. Mutations of FOXP2 gene in humans are associated with severe deficits in speech motor behavior. The present study examined the associations between a FOXP2 common variant, rs6980093 (A/G), and the behavioral and event-related potential (ERP) responses to -50 and -200 cents pitch perturbations during vocal production in a sample of 133 Chinese adults. Behaviorally, the GG genotype was associated with significantly smaller vocal compensations for -200 cents perturbations relative to the AA and AG genotypes. Furthermore, both the AA and AG genotypes exhibited significant positive correlations between the degree of vocal compensation for -50 and -200 cents perturbations and the variability of normal voice fundamental frequency, whereas no such correlation existed for the GG genotype. At the cortical level, significantly larger P2 responses to -200 cents perturbations were associated with the GG genotype as compared to the AA and AG genotypes due to increased left-lateralized activity in the superior, middle, and inferior frontal gyrus, precentral gyrus, anterior cingulate cortex, middle temporal gyrus, and insula. The neurobehavioral responses to -50 cents perturbations, however, did not vary as a function of genotype. These findings present the first neurobehavioral evidence for an association between FOXP2 genetic variant and auditory-motor integration for vocal pitch regulation. The differential effects of FOXP2 genotypes at rs6980093 may reflect their influences on the weighting of feedback and feedforward control of speech production.

Keywords: auditory feedback, sensorimotor integration, FOXP2, genetic variant, event-related potential

Introduction

Speech production relies on the integration of auditory feedback into the vocal motor system in the brain (Hickok et al., 2011). Although compensatory adjustments of vocal output in response to altered auditory feedback have been well documented (Burnett et al., 1998; Houde and Jordan, 1998; Macdonald et al., 2010), the neural mechanisms underlying auditory-motor integration are far from being understood. Event-related potentials (ERPs) of the N1-P2 complex have been identified in the cortical processing of pitch feedback perturbations during vocal production (Behroozmand and Larson, 2011; Liu H. et al., 2011; Scheerer et al., 2013), which are thought to, respectively, reflect the early pre-attentive detection of feedback errors and later cognitive processing of auditory-motor transformations. Neuroimaging studies have revealed a complex neural network in the fronto-temporo-parietal regions to be involved in auditory feedback control of speech production (Tourville et al., 2008; Zarate and Zatorre, 2008; Parkinson et al., 2012; Chang et al., 2013; Behroozmand et al., 2015, 2016). Changes to these critical regions caused by neurological diseases, such as Parkinson’s disease (PD), Alzheimer disease (AD), and temporal lobe epilepsy (TLE), lead to disorders of speech motor control as reflected by abnormal vocal compensations and/or associated brain activity (Liu et al., 2012; Huang et al., 2016; Li et al., 2016; Mollaei et al., 2016; Ranasinghe et al., 2017).

Some individuals, however, suffer from speech disorders that are inheritable, such as developmental verbal dyspraxia (MacDermot et al., 2005), stuttering (Ambrose et al., 1997), and phonological processing disorders related to 16p11.2 deletions (Hippolyte et al., 2016). These genetic speech disorders are associated with impaired sensorimotor processing of speech production. For example, individuals with developmental verbal dyspraxia have difficulties in controlling orofacial muscles (Watkins et al., 2002a), and individuals with stuttering and 16p11.2 deletions carriers have shown atypical compensations for speech feedback perturbations (Cai et al., 2012; Demopoulos et al., 2018). Despite significant progress in the identification of risk genes associated with speech and language disorders (Graham and Fisher, 2013; Konopka and Roberts, 2016), the neurogenetic basis of speech motor control remains largely unknown.

One milestone in the exploration of the link between genetics and speech and language disorders was the discovery of FOXP2 gene mutations that disrupt the DNA-binding site of the protein in a landmark multigenerational study of the KE family (Lai et al., 2001). As a monogenic speech disorder caused by FOXP2 mutations, developmental verbal dyspraxia is characterized by deficits in the production of orofacial motor sequences necessary for fluent speech (Hurst et al., 1990; Vargha-Khadem et al., 1998) and impaired grammatical skills (Watkins et al., 2002a). There is also evidence that suggests associations between FOXP2 mutations and speech sound disorders (Zhao et al., 2010). The FOXP2 gene encodes a forkhead domain transcription factor that is expressed in the cortico-striatal network, including the lateral frontal and temporo-parietal cortices, basal ganglia, and cerebellum (Ferland et al., 2003; Lai et al., 2003; Teramitsu et al., 2004). Abnormalities in gray matter density in this network have been identified in structural imaging studies on individuals with FOXP2 mutations (Vargha-Khadem et al., 1998; Watkins et al., 2002b; Belton et al., 2003; Padovani et al., 2010; Premi et al., 2012). Functional imaging studies of overt speech production have shown decreased brain activity in Broca’s area and putamen in individuals with FOXP2 mutations (Liegeois et al., 2003) and associations between FOXP2 genotypes and variations of activation in the IFG in healthy populations (Pinel et al., 2012). Behaviorally, the impact of FOXP2 gene on learning of auditory-motor interactions is reflected by associations between individual differences in speech category learning and variation in the FOXP2 gene (Chandrasekaran et al., 2015). Also, animal studies have shown disruptions in vocal learning and vocal-motor variability in songbirds after knockdown of FoxP2 (Haesler et al., 2007; Murugan et al., 2013) and adult mice with Foxp2 mutation (Chabout et al., 2016). These studies in animals and humans have implicated a role of FOXP2 in sensorimotor processing. On the other hand, activation of brain regions within the cortico-striatal network, such as the superior temporal gyrus (STG), inferior frontal gyrus (IFG), inferior parietal lobule (IPL), putamen, and thalamus, has been identified in the auditory-motor processing of feedback errors during speech production (Tourville et al., 2008; Zarate and Zatorre, 2008; Parkinson et al., 2012; Behroozmand et al., 2015; Tang et al., 2018). These findings led us to hypothesize an association between FOXP2 and sensorimotor control of speech production. There is so far, however, no direct evidence to support this hypothesis.

To test this hypothesis, we correlated a FOXP2 single nucleotide polymorphism (SNP rs6980093) that involves the adenine (A) and guanine (G) exchange with neurobehavioral responses to feedback errors during speech production. We identified rs6980093 as target SNP because of its associations with variations of brain activity in the IFG during speech-related reading tasks (Pinel et al., 2012), individual differences in language and reading skills (Mozzi et al., 2017), and speech category learning abilities (Chandrasekaran et al., 2015). Healthy young adults were instructed to produce sustained vowel sounds while they heard pitch perturbations in their voice auditory feedback. Compensatory vocal responses and ERP responses known as the N1-P2 complex were measured and compared as a function of FOXP2 genotype. Given the important role of FOXP2 in tuning the function of cortico-striatal networks that regulate critical aspects of speech, language, and motor control (Fisher and Scharff, 2009; Lieberman, 2009; Konopka and Roberts, 2016), we hypothesized that an association would exist between the FOXP2 gene and auditory-vocal integration, as reflected by the modulation of vocal compensations and ERP responses by variation in SNP rs6980093.

Materials and Methods

Subject

One hundred and fifty college students were recruited from Sun Yat-sen University of China to participate in this experiment. All participants were right-handed and from the population of Han Chinese. The inclusion criteria were as follows: no prior history of neurological or psychiatric diseases, no speech, hearing or language disorders, non-smokers, and no history of taking neuroactive substances (e.g., alcohol, caffeine, drugs, etc.). Written informed consent was obtained from all participants, and the research protocol was approved by the Institution Review Board of The First Affiliated Hospital at Sun Yat-sen University of China in accordance with the Code of Ethics of the World Medical Association (Declaration of Helsinki).

Genotyping

All participants were genotyped for the polymorphism in FOXP2 (SNP rs6980093). DNA was extracted from saliva samples collected with the OrageneTM DNA collection kit (OG-500, from DNA Genotek) following standard instructions. The DNA of all participants was genotyped for a polymorphism in SNP rs6980093. Sequencing was performed by the dideoxy-chain termination method (ABI Applied Biosystems) using an ABI 3730XI Real-Time PCR System.

The frequency of the FOXP2 (SNP rs6980093) genotypes (n = 150; AA: n = 55; AG: n = 71; GG: n = 24) did not significantly deviate from the Hardy–Weinberg equilibrium (Chi-square test, χ2 = 0.018, p = 0.892). Of the 150 participants who eventually entered the study, 17 participants were excluded because their electroencephalography (EEG) data did not reach the criteria of inclusion (see EEG data analyses). Thus, the data from 133 young adults aged 18–29 years (mean age 21 ± 2 years) were collected in the present experiment. The participants were divided into three groups according to their genotypes: AA (n = 49; 14 males), AG (n = 63; 18 males), and GG (n = 21; 4 males). The three groups did not differ in their age [F(2,130) = 1.649, p = 0.196] and gender (χ2 = 0.333, p = 0.564).

Procedure

All participants completed a vocal production experiment using the frequency altered feedback (FAF) paradigm after the collection of their saliva. They were cued to start and stop vocalizing when a blue indicator light on a computer screen was on and off, resulting in a stable vocalization that was 3 s in length. During each vocalization, participants heard their own voice unexpectedly pitch-shifted downward 50 or 200 cents (100 cents = 1 semitone) once. Each pitch shift had a fixed duration of 200 ms and occurred 1500–2000 ms after the onset of vocalization. The two sizes of pitch perturbations were pseudo-randomly presented to all participants. Prior to initiating next vocalization, participants were required to take a break of 2–3 s to avoid vocal fatigue. They produced 200 consecutive vocalizations that led to a total of 200 trials, including 100 trials for the -50 cents perturbations and 100 trials for the -200 cents perturbations.

Apparatus

Participants were tested in a sound-treated booth. In order to partially mask their air-borne and bone-conducted feedback, the acoustical recording system was calibrated prior to data recording, ensuring that the intensity of voice feedback was 10 dB sound pressure level (SPL) higher than that of participant’s vocal output. The voice signals were transduced by a dynamic microphone (DM2200, Takstar Inc.) and sent to an Eventide Eclipse Harmonizer via a MOTU Ultralite Mk3 Firewire audio interface. A custom-developed MIDI software program (Max/MSP v.5.0 by Cycling 74) was used to control the Eventide Eclipse Harmonizer to pitch-shift the voice signals. Acoustical parameters, including the direction, magnitude, and inter-stimulus interval (ISI) of the pitch perturbations, were manipulated by this program. Also, this program was used to generate transistor-transistor logic (TTL) control pulses that marked the onset of the pitch perturbation. Finally, the pitch-shifted voice signals were amplified by an ICON NeoAmp headphone amplifier and fed back to participants through insert earphones (ER1-14A, Etymotic Research Inc.). The original and pitch-shifted voice signals as well as the TTL pulses were digitized with a sampling frequency of 10 kHz by a PowerLab A/D converter (model ML880, AD Instruments) and recorded using LabChart software (v.7.0 by AD Instruments).

The EEG signals were recorded from each participant’s scalp with a 64-electrode Geodesic Sensor Net (Electrical Geodesics Inc.). A high input-impedance Net Amps 300 amplifier (Zin ≈ 200 MΩ; Electrical Geodesics Inc.) was used to amplify the EEG signals. This amplifier accepts scalp-electrode impedances up to 40–60 kΩ, thus the impedance levels of individual sensors were carefully adjusted to be less than 50 kΩ and maintained throughout the recording (Ferree et al., 2001). All channels were referenced to the vertex (Cz) during the online recording (Ferree et al., 2001). The TTL pulses that marked the onset of the pitch perturbation were sent to the EEG recording system via a DIN cable. Finally, the EEG signals were digitized at a sampling frequency of 1000 Hz and saved onto a Mac Pro computer using NetStation software (v.4.5, Electrical Geodesics Inc.).

Vocal Responses Measurement

The magnitude and latencies of compensatory vocal responses to pitch-shifted auditory feedback were measured using IGOR PRO software (v.6.0, WaveMetrics Inc.). Voice fundamental frequency (F0) contours in Hertz were extracted from voice signals using Praat software (Boersma, 2001) and converted to the cent scale with the following formula: cents = 100 × [12 × log2(F0/reference)] [reference = 195.997 Hz (G3 note)]. We then segmented the voice F0 contours in cents into epochs using a window of -200 ms to 700 ms relative to the onset of the pitch perturbation and performed a waterfall procedure to visually inspect all individual trials. Following this procedure, bad trials that were contaminated by vocal interruptions or signal processing errors were excluded from further analyses. Those trials whose direction opposed the downward perturbations were defined as compensatory responses and entered the averaging procedure. Finally, 81% of compensatory vocal trials that contained no artifacts were normalized by subtracting the mean F0 values in the 200 ms baseline period from the F0 values after the perturbation onset and averaged to generate an overall compensatory response for each condition. The magnitude of a vocal response in cents was defined as the greatest F0 value following the response onset. The latency was measured as the peak time when the voice F0 contours reached a maximum value. Additionally, the SD of the baseline mean F0 for the averaged response was measured as an index of the variability of the participant’s voice in the absence of feedback perturbations.

EEG Data Analysis

The EEG data were submitted to NetStation software for offline analysis. First, they were band-pass filtered using a filter with cut-off frequencies of 1–20 Hz. Following a segmentation procedure, the filtered EEG signals were segmented into 500 ms post-stimulus epochs with a 200 ms pre-stimulus baseline. Next, an artifact detection procedure was performed on the segmented epochs to reject trials with segments that included voltage values that exceeded ±55 μv of the moving average over an 80-ms window, which is typically the result of excessive muscular activity, eye blinks, or eye movements. Individual electrodes were rejected if they contained artifacts in more than 20% of the epochs, and individual files were excluded from the averaging procedure if they contained more than 10 bad channels. In order to ensure appropriate rejections of those trials with artifacts, we performed an additional visual inspection of all the individual trials. On average, 81% of individual trials were retained for averaging. Finally, artifact-free trials were re-referenced to the average of the electrodes on each mastoid, averaged, and baseline-corrected to generate an overall response for each condition. The amplitudes and latencies of the N1 and P2 components were measured from 10 electrodes (FC1, FC2, FC3, FC4, FCz, C1, C2, C3, C4, and Cz) as the negative and positive peaks in the time windows of 80–160 ms and 180–280 ms after the onset of pitch perturbation, respectively, since the N1 and P2 responses to pitch-shifted voice auditory feedback are primarily pronounced in the frontal and central areas (Hawco et al., 2009; Chen et al., 2012).

In order to further evaluate the effect of genetic variation on the neural networks that support the cortical processing of voice auditory feedback, we performed source localization of N1 and P2 responses across the genotypes using standard low-resolution electromagnetic tomography (sLORETA) (Pascual-Marqui, 2002). Previous fMRI and intra-cerebral recordings studies (Mulert et al., 2004; Zumsteg et al., 2006) have validated the sources estimated by sLORETA, and this method has been successfully applied to localizing the cortical generators of N1 and P2 responses to pitch-shifted voice auditory feedback (Huang et al., 2016; Guo et al., 2017). For sLORETA, the standardized current density was calculated with a dense grid of 6239 voxels at a spatial resolution of 5 mm in the Montreal Neurological Institute (MNI)-reference brain. The voxel-based sLORETA images were calculated in a realistic standardized head model that was computed with the boundary element (BEM) approach (Fuchs et al., 2002) using the MNI152 template (Mazziotta et al., 2001). In the present study, the voxel-based sLORETA images were computed at a 5 ms time window centered at the maximal global field power peaks within the N1 and P2 time windows, and compared across the genotypes using sLORETA-built-in-voxelwise randomization tests with 10000 permutations based on statistical non-parametric mapping (SnPM) for multiple comparison employing a log-F-ratio statistic. The voxels with significant differences (corrected p < 0.05) were specified in MNI coordinates and labeled as Brodmann areas (BA) within the EEGLAB software (Delorme and Makeig, 2004).

Statistical Analysis

Repeated-measures analyses of variance (RM-ANOVAs) were conducted on the vocal and ERP responses (N1 and P2) to pitch perturbations in SPSS (v.20.0). The magnitudes and latencies of vocal compensations were subject to two-way RM-ANOVAs, including a within-subject factor of stimulus magnitude (-50 and -200 cents) and a between-subject factor of genotype (AA, AG, and GG). The magnitudes and latencies of N1 and P2 responses were analyzed using three-way RM-ANOVAs in which stimulus magnitude and electrode site were regarded as within-subject factors while genotype was regarded as a between-subject factor. Subsidiary RM-ANOVAs were conducted in the case of significant higher-order interactions between any of those variables. Post hoc analyses were performed using Bonferroni adjustment for multiple comparisons. Probability values were corrected for multiple degrees of freedom when the sphericity assumption was violated. The size of differences across the conditions was described by calculating effect size indexed by ηp2. An alpha level of 0.05 was considered significant for all statistical analyses.

Results

Vocal Responses

Figures 1A,B show the grand-averaged F0 contours in response to -50 and -200 cents perturbations, respectively. The magnitudes of vocal compensations for -200 cents perturbations appear to be influenced by the genotype, showing smaller vocal compensations for the GG genotype relative to the AA and AG genotype. A two-way RM-ANOVA conducted on the magnitudes of vocal responses revealed no significant main effects of stimulus magnitude [F(1,130) = 1.678, p = 0.198, ηp2 = 0.013] and genotype [F(2,130) = 2.222, p = 0.112, ηp2 = 0.033]. The interaction between these two variables, however, reached significance [F(2,130) = 5.488, p = 0.005, ηp2 = 0.078]. Follow-up one-way RM-ANOVAs showed that the magnitudes of vocal responses did not vary as a function of genotype for the -50 cents condition [F(2,130) = 0.998, p = 0.371, ηp2 = 0.015]. Nevertheless, there was a significant main effect of genotype for the -200 cents condition [F(2,130) = 4.484, p = 0.013, ηp2 = 0.065]. Bonferroni post hoc comparisons showed that the GG genotype was associated with significantly smaller vocal compensations relative to the AA (p = 0.044) and AG (p = 0.011) genotypes (Figure 1C), while the magnitudes of vocal responses did not differ significantly between individuals with AG and AA variants (p = 1.000). For the latencies of vocal response, the main effects of stimulus magnitude [F(1,130) = 0.065, p = 0.799, ηp2 = 0.001] and genotype [F(2,130) = 1.288, p = 0.279, ηp2 = 0.019] did not reach significance. As well, there was no significant interaction between stimulus magnitude and genotype [F(2,130) = 0.268, p = 0.765, ηp2 = 0.004].

FIGURE 1.

FIGURE 1

(A,B) Grand-averaged voice F0 contours in responses to –50 and –200 cents pitch perturbations for the AA (black), AG (blue), and GG (red) genotypes. Errors bars represent standard errors of the means (SEM). (C) T-bar plots of the magnitudes (mean + SEM) of vocal compensations across the genotypes, showing that the GG genotype exhibited significantly smaller magnitudes of vocal compensations for –200 cents perturbations relative to the AA (p = 0.044) and AG (p = 0.011) genotypes. The asterisk indicates significant differences in the magnitudes of vocal responses between the genotype groups. (D–F) Pearson correlation analyses revealed significant positive correlations between the magnitudes of vocal compensations and the SDs of the baseline F0 for the AA (r = 0.534, p < 0.001) and AG genotypes (r = 0.264, p = 0.003) but not for the GG genotypes (r = 0.107, p = 0.500).

Pearson correlation analyses were performed to investigate the relationship between the magnitude of vocal compensation and the variability of the baseline voice while hearing normal auditory feedback across the genotypes. This measure is hypothesized to reflect the degree of reliance on auditory feedback in the online monitoring of speech production (Scheerer and Jones, 2012; Huang et al., 2016; Li et al., 2016). The results revealed significant positive correlations between the magnitudes of vocal responses and the SDs of the baseline F0 for individuals with the AA (r = 0.534, p < 0.001) (see Figure 1D) and AG genotypes (r = 0.264, p = 0.003) (see Figure 1E), indicating that the variability of normal voice F0 was predicative of the degree of vocal compensation for these two groups. However, this correlation did not reach significance for individuals with the GG genotype (r = 0.107, p = 0.500) (see Figure 1F).

ERP Findings

Figures 2, 3 show the grand-averaged ERP waveforms (A) and topographical distributions of the N1 (B) and P2 (C) amplitudes in response to -50 and -200 cents perturbations, respectively. The effects of FOXP2 genotype on the cortical ERP responses appeared to be more pronounced in the case of the 200 cents condition, showing larger P2 amplitudes for the GG genotype relative to the AA and AG genotypes. A three-way RM-ANOVA revealed that the -200 cents condition elicited significantly larger N1 amplitudes than the -50 cents condition [F(1,130) = 39.040, p < 0.001, ηp2 = 0.231] (see Figure 4A). N1 amplitudes were also found to vary across the electrode sites [F(9,1170) = 7.141, p < 0.001, ηp2 = 0.052], with larger N1 amplitudes at the frontal electrodes relative to the central electrodes. The main effect of genotype, however, did not reach significance [F(1,130) = 0.204, p = 0.815, ηp2 = 0.003]. The interactions between any of these variables also did not reach significance (p > 0.05).

FIGURE 2.

FIGURE 2

Grand-averaged ERP waveforms (A) and topographical distributions of the N1 (B) and P2 (C) amplitudes in response to –50 cents pitch perturbations across the genotypes. The black, blue, and red solid lines represent cortical ERPs for the AA, AG, and GG genotypes, respectively.

FIGURE 3.

FIGURE 3

Grand-averaged ERP waveforms (A) and topographical distributions of the N1 (B) and P2 (C) amplitudes in response to –200 cents pitch perturbations across the genotypes. The black, blue, and red solid lines represent cortical ERPs for the AA, AG, and GG genotypes, respectively.

FIGURE 4.

FIGURE 4

T-bar plots of the amplitudes (mean + SEM) of N1 (A) and P2 (B) responses to –50 and –200 cents perturbations in auditory feedback for the AA (black), AG (blue), and GG (red) genotypes. The –200 cents condition elicited significantly larger N1 (p < 0.001) and P2 (p < 0.001) amplitudes than the –50 cents condition. Individuals with the GG genotype produced significantly larger P2 responses to –200 cents pitch perturbations than individuals with the AA (p = 0.023) and AG genotypes (p = 0.019). The asterisk indicates significant differences in the N1 and P2 amplitudes across the conditions.

For the N1 latencies, the -50 cents condition elicited significantly slower N1 responses than the -200 cents condition [F(1,130) = 88.820, p < 0.001, ηp2 = 0.406]. There was also a significant main effect of electrode site [F(9,1170) = 6.068, p < 0.001, ηp2 = 0.045]; electrode FC1 was associated with significantly longer N1 latencies than electrodes C1 (p = 0.004) and Cz (p = 0.017). However, neither the main effect of genotype [F(1,130) = 0.100, p = 0.905, ηp2 = 0.002] nor the interactions between any of these three variables (p > 0.05) reached significance.

A three-way RM-ANOVA conducted on the P2 amplitudes revealed a significant main effect of stimulus magnitude [F(1,130) = 194.803, p < 0.001, ηp2 = 0.600], showing significantly larger P2 amplitudes for the -200 cents condition relative to the -50 cents condition (see Figure 4B). Significantly larger P2 amplitudes at the frontal electrodes relative to the central electrodes led to a significant main effect of electrode site [F(9,1170) = 182.553, p < 0.001, ηp2 = 0.584]. There was also a significant main effect of genotype [F(2,130) = 3.596, p = 0.030, ηp2 = 0.052] as well as a significant interaction between stimulus magnitude and genotype [F(2,130) = 4.706, p = 0.011, ηp2 = 0.068]. Follow-up two-way RM-ANOVAs were conducted across the stimulus magnitudes. For the -50 cents condition, P2 amplitudes did not vary as a function of genotype [F(2,130) = 2.627, p = 0.076, ηp2 = 0.039], whereas there was a significant main effect of electrode site [F(9,1170) = 123.493, p < 0.001, ηp2 = 0.487] as well as a significant interaction between genotype and electrode site [F(18,1170) = 2.246, p = 0.026, ηp2 = 0.033]. Further analyses revealed systematic changes of P2 amplitudes as a function of genotype at electrodes FC1 [F(2,130) = 3.202, p = 0.044, ηp2 = 0.047], FC2 [F(2,130) = 3.505, p = 0.033, ηp2= 0.051], FCz [F(2,130) = 3.633, p = 0.029, ηp2= 0.053], and Cz [F(2,130) = 3.056, p = 0.050, ηp2= 0.045]. Post hoc Bonferroni comparisons revealed that the GG genotype was associated with significantly larger P2 amplitudes relative to the AA genotype (FC1: p = 0.048; FC2: p = 0.044; FCz: p = 0.027; Cz: p = 0.046) (see Figure 2A). For the -200 cents condition, there was a significant main effect of genotype on the P2 amplitudes [F(2,130) = 4.359, p = 0.015, ηp2= 0.063]. Post hoc Bonferroni comparison revealed that individuals with the GG genotype produced significantly larger P2 amplitudes than individuals with the AA (p = 0.023) and AG genotypes (p = 0.019) (see Figures 3, 4B), whereas P2 amplitudes did not differ significantly between individuals with the AA and AG genotypes (p = 1.000). There was also a significant main effect of electrode site [F(9,1170) = 186.929, p < 0.001, ηp2= 0.590], indicating significantly larger P2 amplitudes at the frontal electrodes relative to the central electrodes. The interaction between electrode site and genotype, however, did not reach significance [F(18,1170) = 1.696, p = 0.087, ηp2= 0.025].

Regarding the P2 latencies, the main effect of stimulus magnitude [F(1,130) = 127.197, p < 0.001, ηp2= 0.495] was found to be significant; significantly longer P2 latencies were elicited by the -50 cents condition as compared to the -200 cents condition. The main effect of electrode site [F(9,1170) = 2.825, p = 0.017, ηp2 = 0.021] also reached significance, indicating that electrode FC1 was associated with significantly longer P2 latencies than electrode C1 (p = 0.019). However, P2 latencies did not vary as a function of genotype [F(2,130) = 1.541, p = 0.218, ηp2= 0.023]. In addition, the interactions between these variables did not reach significance (p > 0.05).

sLORETA Findings

Table 1 and Figure 5 show coordinates and corresponding brain regions where individuals with the GG genotype produced significantly larger P2 responses to -200 cents pitch perturbations relative to individuals with the AA and AG genotypes. When compared to the AA genotype, the GG genotype was associated with enhanced P2 responses due to increased brain activity in a left-lateralized neural network, including the left IFG (BA 9, p = 0.035), middle frontal gyrus (MFG) (BA 6, p = 0.044), precentral gyrus (PrCG) (BA 6, p = 0.019; BA9, p = 0.043), anterior cingulate cortex (ACC) (BA 32, p = 0.017; BA 24, p = 0.024), middle temporal gryus (MTG) (BA 21, p = 0.041), and insula (BA 13, p = 0.007; BA 45, p = 0.040) (see Figure 5A). Similarly, enhanced P2 responses associated with the GG vs. AG genotype were the result of increased brain activity in the left MFG (BA 6, p = 0.012; BA 32, p = 0.025), superior frontal gyrus (SFG) (BA 6, p = 0.026), and ACC (BA 32, p = 0.002; BA 24, p = 0.001) (see Figure 5B).

Table 1.

sLORETA Log-F statistics for maximum activations obtained from comparisons of the GG genotype with the AA and AG genotypes in the P2 time window (MNI coordinates).

Condition Brain region BA X Y Z Log-F value
GG vs. AA Left IFG 9 -35 5 30 4.375
Left MFG 6 -45 0 45 4.295
Left PrCG 6 -35 0 30 4.626
9 -35 5 40 4.271
Left ACC 32 -15 5 40 4.652
24 -15 0 40 4.546
LFG MTG 6 -60 -25 -10 4.295
Left insula 13 -30 15 15 4.987
45 -30 25 5 4.306
GG vs. AG Left MFG 6 -20 5 55 4.978
32 -15 10 50 4.675
Left SFG 6 -20 10 55 4.653
Left ACC 32 -15 5 40 5.804
24 -15 0 40 5.831

IFG, inferior frontal gyrus; MFG, middle frontal gyrus; PrCG, precentral gyrus; ACC, anterior cingulate cortex; MTG, middle temporal gyrus; SFG, superior frontal gyrus.

FIGURE 5.

FIGURE 5

Source reconstructions of the amplitudes of P2 responses to –200 cents pitch perturbations, showing enhanced brain activity in the P2 time windows for individuals with the GG genotype when compared to individuals with the AA (A) and AG (B) genotypes.

Discussion

The present study investigated the extent to which FOXP2 variation (SNP rs6980093) was associated with change in the neurobehavioral responses to pitch perturbations during vocal production. When compared to individuals with the GG genotype, individuals with the AA and AG genotypes produced significantly larger vocal compensations for pitch perturbation of -200 cents perturbations that were positively correlated with the variability of their normal voice F0. Furthermore, individuals with the GG genotype produced significantly larger P2 responses to -200 cents perturbations relative to individuals with the AA and AG genotypes due to left-lateralized increased activity in the SFG, MFG, IFG, PrCG, ACC, MTG, and insula. However, the neurobehavioral responses to -50 cents perturbations did not vary as a function of FOXP2 genotype. These findings provide the first behavioral and neural evidence that genetic variation in FOXP2 is associated with sensorimotor integration for vocal pitch regulation.

Our behavioral results of larger vocal compensations associated with the AA and AG genotypes relative to the GG genotype are in line with previous findings of abnormally enhanced vocal compensations produced by individuals with 16p11.2 deletions (Demopoulos et al., 2018). They also parallel findings of incomplete and inaccurate vocal imitation during song learning in young zebra finches with knockdown of FoxP2 gene (Haesler et al., 2007), and impaired acquisition of motor skills and learning of auditory-motor integration in mice carrying heterozygous FoxP2 mutations (Groszer et al., 2008; Kurt et al., 2012). More interestingly, individuals with the AA and AG genotypes exhibited significant positive correlations between the degree of vocal compensations and the variability of their normal voice F0, while no such correlation existed for individuals with the GG genotype. This correlation has been hypothesized to reflect an increased reliance on auditory feedback in sensorimotor control of speech production (Scheerer and Jones, 2012; Chen et al., 2013). In light of the DIVA (directions into velocities of articulators) model (Golfinopoulos et al., 2010), successful control of speech production involves both feedback and feedforward control. Feedforward control allows speakers to correctly produce the speech targets through the internal representations of the motor programs, while feedback control is used to constantly monitor and correct feedback errors for maintaining the accuracy of the internal representations. Decreased reliance on auditory feedback results in an increased reliance on feedforward control and vice versa (Golfinopoulos et al., 2010). Professional singers, who develop a stronger reliance on feedforward control mechanisms, produce significantly smaller vocal compensations than non-musicians (Jones and Keough, 2008) and even are able to successfully ignore large pitch perturbations (Zarate and Zatorre, 2005, 2008). In contrast, significant correlations between vocal variability in patients with PD and TLE and their enhanced vocal compensations for pitch perturbations (Chen et al., 2013; Huang et al., 2016; Li et al., 2016), have been interpreted as an increased reliance on auditory feedback due to impaired feedforward control. This hypothesis has also been used to account for abnormally enhanced vocal compensations produced by individuals with 16p11.2 deletions (Demopoulos et al., 2018). Accordingly, individuals with the AA and AG genotypes may weight auditory feedback more heavily to detect feedback errors from vocal output and thus produce large vocal compensations. By contrast, individuals with the GG genotype may place an increased reliance on feedforward control and tend to “ignore” deviant auditory feedback, thereby producing less of a compensatory response.

Cortically, the GG genotype was associated with significantly larger P2 responses than the AA and AG genotypes, whereas N1 responses did not vary as a function of genotype. These findings may reflect an association between FOXP2 gene and speech motor control not at the early detection of feedback errors but at the later transformation of auditory feedback into corrective motor commands. Furthermore, enhanced P2 responses associated with the GG genotype received contributions from a left-lateralized network including the SFG, MFG, IFG, ACC, MTG, and insula (see Figure 5). Activation of these cortical regions has been identified during auditory-motor control of speech production in previous fMRI (Zarate and Zatorre, 2008; Parkinson et al., 2012; Behroozmand et al., 2015) and ERP studies (Huang et al., 2016; Guo et al., 2017). These results are consistent with the expression of the FOXP2 gene in the lateral frontal and temporo-parietal cortices (Ferland et al., 2003; Lai et al., 2003) as well as associations between the FOXP2 gene and activation in the left IFG and PrCG during overt speech production in clinical (Liegeois et al., 2003) and healthy populations (Pinel et al., 2012). Interestingly, a group of musicians with absolute pitch also produced significantly larger P2 responses in the left hemisphere than non-musicians (Behroozmand et al., 2014). Also, professional singers exhibited enhanced brain activity in the ACC, STG, and insula when exposed to vocal pitch errors as compared to non-musicians (Zarate and Zatorre, 2005, 2008). Together with these previous findings, our observation of larger P2 responses in the fronto-temporal regions in individuals with the GG genotype perhaps reflect a more pronounced shift from feedback to feedforward control of vocal production, suggesting that FOXP2 SNP at rs6980093 may influence the weighting of feedback vs. feedforward control of speech production.

Note that there is a dual-sensory reference frame including both auditory and somatosensory feedback in the DIVA model (Golfinopoulos et al., 2010). Somatosensory feedback provides critical information about speech articulators (Tremblay et al., 2003) and makes significant contributions to speech motor control (Larson et al., 2008; Golfinopoulos et al., 2011; Lametti et al., 2012; Kleber et al., 2013, 2017). The relationship between auditory and somatosensory feedback remains unclear, but there is evidence that these two types of feedback may be in opposition to each other when pitch perturbations are heard. For example, larger vocal compensations for pitch perturbations were elicited by anesthetizing the vocal folds as compared to the pre-anesthetic condition (Larson et al., 2008). Furthermore, Lametti et al. (2012) reported an inverse relationship between reliance on auditory vs. somatosensory feedback: participants who compensated more for somatosensory perturbations compensated less for auditory and vice versa. Therefore, an alternative explanation is that individuals with the GG genotype who exhibited decreased reliance on auditory feedback may weight somatosensory feedback more heavily to attenuate vocal compensations for pitch perturbations when compared to individuals with the AA and AG genotypes, suggesting that the FOXP2 gene might have an impact on a preferential reliance on sensory feedback.

Our finding that FOXP2 genetic variation is associated with change in neurobehavioral responses to perceived vocal pitch errors provides an important piece of the puzzle of individual differences in sensorimotor control of speech production. Despite the well-documented large individual variability of vocal compensations (Burnett et al., 1998; Liu P. et al., 2011; Scheerer and Jones, 2012) and cortical ERPs (N1 and P2) (Chen et al., 2012; Li et al., 2013; Behroozmand et al., 2014) elicited by feedback perturbations, much less is known about the causes of these individual differences. Recent evidence has suggested that individual differences in the neurobehavioral processing of vocal pitch errors are related to the participants’ intrinsic brain activity in the fronto-temporal regions (Guo et al., 2016) and the subregional morphology of subcortical structures (Tang et al., 2018). As well, Zhu et al. (2016) found a negative correlation between vocal compensation magnitudes and estradiol levels and an association between increased P2 amplitudes and decreased progesterone levels in young females. Our findings of the relationship between FOXP2 genetic variant and neurobehavioral responses to pitch perturbations open up a new perspective for linking the genetics to individual variability in speech motor control.

It is worthy noting that the effect of FOXP2 variation was not observed on the neurobehavioral responses to -50 cents perturbations, which may be related to the differential neural mechanisms that underlie the processing of small and large pitch perturbations in voice auditory feedback. For example, vocalization-induced suppression, as demonstrated by smaller N1 responses to pitch perturbations at vocal onset during active vocalization relative to passive listening, was significantly reduced as the size of the pitch perturbation increased (Behroozmand and Larson, 2011). Cantonese speakers produced smaller vocal and larger P2 responses to -200 and -500 cents perturbations than Mandarin speakers, whereas this group difference did not exist in the case of -50 and -100 cents perturbations (Liu et al., 2010; Chen et al., 2012). Professional singers were more capable of suppressing compensatory vocal responses to 200 cents perturbations (closer to 0 cent) than to 25 cents perturbations with increased activity in the right STG, superior temporal sulcus, left planum temporale and supramarginal gyrus (Zarate and Zatorre, 2008; Zarate et al., 2010), whereas this pattern did not exist in non-musicians (Scheerer et al., 2013). In an analogous way to musicians, individuals with the GG genotype may be better at suppressing vocal compensations for large pitch perturbations than for small pitch perturbations due to their decreased reliance on auditory feedback as compared to individuals with the AA and AG genotypes, leading to decreased vocal compensations and increased P2 responses in the condition of -200 cents perturbations.

Clearly, several inherent limitations of the present study must be acknowledged. First, the sample size obtained for analysis is relatively small, although it is consistent with a number of previous studies linking genetics to speech/language processing or disorders (Zhao et al., 2010; Pinel et al., 2012; Chandrasekaran et al., 2015; Xie et al., 2015). As well, our sample did not have an equal number of female and male across the three genotypes. These confounding factors may lead to low statistical power and high false-discovery rates. Future work should be conducted in a larger sample with a balance of sexes. Second, we examined the association between a single FOXP2 SNP (rs6980093) and sensorimotor control of speech production, but variability in neuroanatomy was not affected by this SNP (Hoogman et al., 2014). Whether and how this SNP can influence sensorimotor integration without altering brain structure needs to be further investigated. In addition, a comprehensive assessment of auditory processing and cognitive function should be performed in the future studies, since compensatory control of speech production involves many perception and production processes that demand high-level cognitive processing (Liu et al., 2015; Guo et al., 2017). Despite these limitations, our findings offer a starting point for the examination of the mechanisms of speech motor control from a genetic perspective.

Author Contributions

HL and SZ designed the experiments. SZ, JZ, and ZG performed the experiments and analyzed the data. SZ, JJ, PL, and HL interpreted the results and wrote the manuscript. All authors read and approved the final manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Footnotes

Funding. This study was funded by grants from the National Natural Science Foundation of China (Nos. 31371135, 81472154, and 81772439), Guangdong Natural Science Funds for Distinguished Young Scholar (No. S2013050014470), Guangdong Province Science and Technology Planning Project (No. 2017A050501014), and Guangzhou Science and Technology Programme (No. 201604020115).

References

  1. Ambrose N. G., Cox N. J., Yairi E. (1997). The genetic basis of persistence and recovery in stuttering. J. Speech Lang. Hear. Res. 40 567–580. 10.1044/jslhr.4003.567 [DOI] [PubMed] [Google Scholar]
  2. Behroozmand R., Ibrahim N., Korzyukov O., Robin D. A., Larson C. R. (2014). Left-hemisphere activation is associated with enhanced vocal pitch error detection in musicians with absolute pitch. Brain Cogn. 84 97–108. 10.1016/j.bandc.2013.11.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Behroozmand R., Larson C. R. (2011). Error-dependent modulation of speech-induced auditory suppression for pitch-shifted voice feedback. BMC Neurosci. 12:54. 10.1186/1471-2202-12-54 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Behroozmand R., Oya H., Nourski K. V., Kawasaki H., Larson C. R., Brugge J. F., et al. (2016). Neural correlates of vocal production and motor control in human Heschl’s gyrus. J. Neurosci. 36 2302–2315. 10.1523/JNEUROSCI.3305-14.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Behroozmand R., Shebek R., Hansen D. R., Oya H., Robin D. A., Howard M. A., III, et al. (2015). Sensory-motor networks involved in speech production and motor control: an fMRI study. Neuroimage 109 418–428. 10.1016/j.neuroimage.2015.01.040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Belton E., Salmond C. H., Watkins K. E., Vargha-Khadem F., Gadian D. G. (2003). Bilateral brain abnormalities associated with dominantly inherited verbal and orofacial dyspraxia. Hum. Brain Mapp. 18 194–200. 10.1002/hbm.10093 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Boersma P. (2001). Praat, a system for doing phonetics by computer. Glot Int. 5 341–345. [Google Scholar]
  8. Burnett T. A., Freedland M. B., Larson C. R., Hain T. C. (1998). Voice F0 responses to manipulations in pitch feedback. J. Acoust. Soc. Am. 103 3153–3161. 10.1121/1.423073 [DOI] [PubMed] [Google Scholar]
  9. Cai S., Beal D. S., Ghosh S. S., Tiede M. K., Guenther F. H., Perkell J. S. (2012). Weak responses to auditory feedback perturbation during articulation in persons who stutter: evidence for abnormal auditory-motor transformation. PLoS One 7:e41830. 10.1371/journal.pone.0041830 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chabout J., Sarkar A., Patel S. R., Radden T., Dunson D. B., Fisher S. E., et al. (2016). A Foxp2 mutation implicated in human speech deficits alters sequencing of ultrasonic vocalizations in adult male mice. Front. Behav. Neurosci. 10:197. 10.3389/fnbeh.2016.00197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Chandrasekaran B., Yi H. G., Blanco N. J., McGeary J. E., Maddox W. T. (2015). Enhanced procedural learning of speech sound categories in a genetic variant of FOXP2. J. Neurosci. 35 7808–7812. 10.1523/JNEUROSCI.4706-14.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chang E. F., Niziolek C. A., Knight R. T., Nagarajan S. S., Houde J. F. (2013). Human cortical sensorimotor network underlying feedback control of vocal pitch. Proc. Natl. Acad. Sci. U.S.A. 110 2653–2658. 10.1073/pnas.1216827110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chen X., Zhu X., Wang E. Q., Chen L., Li W., Chen Z., et al. (2013). Sensorimotor control of vocal pitch production in Parkinson’s disease. Brain Res. 1527 99–107. 10.1016/j.brainres.2013.06.030 [DOI] [PubMed] [Google Scholar]
  14. Chen Z., Liu P., Wang E. Q., Larson C. R., Huang D., Liu H. (2012). ERP correlates of language-specific processing of auditory pitch feedback during self-vocalization. Brain Lang. 121 25–34. 10.1016/j.bandl.2012.02.004 [DOI] [PubMed] [Google Scholar]
  15. Delorme A., Makeig S. (2004). EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods 134 9–21. 10.1016/j.jneumeth.2003.10.009 [DOI] [PubMed] [Google Scholar]
  16. Demopoulos C., Kothare H., Mizuiri D., Henderson-Sabes J., Fregeau B., Tjernagel J., et al. (2018). abnormal speech motor control in individuals with 16p11.2 deletions. Sci. Rep. 8:1274. 10.1038/s41598-018-19751-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Ferland R. J., Cherry T. J., Preware P. O., Morrisey E. E., Walsh C. A. (2003). Characterization of Foxp2 and Foxp1 mRNA and protein in the developing and mature brain. J. Comp. Neurol. 460 266–279. 10.1002/cne.10654 [DOI] [PubMed] [Google Scholar]
  18. Ferree T. C., Luu P., Russell G. S., Tucker D. M. (2001). Scalp electrode impedance, infection risk, and EEG data quality. Clin. Neurophysiol. 112 536–544. 10.1016/S1388-2457(00)00533-2 [DOI] [PubMed] [Google Scholar]
  19. Fisher S. E., Scharff C. (2009). FOXP2 as a molecular window into speech and language. Trends Genet. 25 166–177. 10.1016/j.tig.2009.03.002 [DOI] [PubMed] [Google Scholar]
  20. Fuchs M., Kastner J., Wagner M., Hawes S., Ebersole J. S. (2002). A standardized boundary element method volume conductor model. Clin. Neurophysiol. 113 702–712. 10.1016/S1388-2457(02)00030-5 [DOI] [PubMed] [Google Scholar]
  21. Golfinopoulos E., Tourville J. A., Bohland J. W., Ghosh S. S., Nieto-Castanon A., Guenther F. H. (2011). fMRI investigation of unexpected somatosensory feedback perturbation during speech. Neuroimage 55 1324–1338. 10.1016/j.neuroimage.2010.12.065 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Golfinopoulos E., Tourville J. A., Guenther F. H. (2010). The integration of large-scale neural network modeling and functional brain imaging in speech motor control. Neuroimage 52 862–874. 10.1016/j.neuroimage.2009.10.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Graham S. A., Fisher S. E. (2013). Decoding the genetics of speech and language. Curr. Opin. Neurobiol. 23 43–51. 10.1016/j.conb.2012.11.006 [DOI] [PubMed] [Google Scholar]
  24. Groszer M., Keays D. A., Deacon R. M., de Bono J. P., Prasad-Mulcare S., Gaub S., et al. (2008). Impaired synaptic plasticity and motor learning in mice with a point mutation implicated in human speech deficits. Curr. Biol. 18 354–362. 10.1016/j.cub.2008.01.060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Guo Z., Huang X., Wang M., Jones J. A., Dai Z., Li W., et al. (2016). Regional homogeneity of intrinsic brain activity correlates with auditory-motor processing of vocal pitch errors. Neuroimage 142 565–575. 10.1016/j.neuroimage.2016.08.005 [DOI] [PubMed] [Google Scholar]
  26. Guo Z., Wu X., Li W., Jones J. A., Yan N., Sheft S., et al. (2017). Top-down modulation of auditory-motor integration during speech production: the role of working memory. J. Neurosci. 37 10323–10333. 10.1523/JNEUROSCI.1329-17.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Haesler S., Rochefort C., Georgi B., Licznerski P., Osten P., Scharff C. (2007). Incomplete and inaccurate vocal imitation after knockdown of FoxP2 in songbird basal ganglia nucleus Area X. PLoS Biol. 5:e321. 10.1371/journal.pbio.0050321 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Hawco C. S., Jones J. A., Ferretti T. R., Keough D. (2009). ERP correlates of online monitoring of auditory feedback during vocalization. Psychophysiology 46 1216–1225. 10.1111/j.1469-8986.2009.00875.x [DOI] [PubMed] [Google Scholar]
  29. Hickok G., Houde J. F., Rong F. (2011). Sensorimotor integration in speech processing: computational basis and neural organization. Neuron 69 407–422. 10.1016/j.neuron.2011.01.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Hippolyte L., Maillard A. M., Rodriguez-Herreros B., Pain A., Martin-Brevet S., Ferrari C., et al. (2016). The number of genomic copies at the 16p11.2 locus modulates language, verbal memory, and Inhibition. Biol. Psychiatry 80 129–139. 10.1016/j.biopsych.2015.10.021 [DOI] [PubMed] [Google Scholar]
  31. Hoogman M., Guadalupe T., Zwiers M. P., Klarenbeek P., Francks C., Fisher S. E. (2014). Assessing the effects of common variation in the FOXP2 gene on human brain structure. Front. Hum. Neurosci. 8:473. 10.3389/fnhum.2014.00473 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Houde J. F., Jordan M. I. (1998). Sensorimotor adaptation in speech production. Science 279 1213–1216. 10.1126/science.279.5354.1213 [DOI] [PubMed] [Google Scholar]
  33. Huang X., Chen X., Yan N., Jones J. A., Wang E. Q., Chen L., et al. (2016). The impact of Parkinson’s disease on the cortical mechanisms that support auditory-motor integration for voice control. Hum. Brain Mapp. 37 4248–4261. 10.1002/hbm.23306 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Hurst J. A., Baraitser M., Auger E., Graham F., Norell S. (1990). An extended family with a dominantly inherited speech disorder. Dev. Med. Child Neurol. 32 352–355. 10.1111/j.1469-8749.1990.tb16948.x [DOI] [PubMed] [Google Scholar]
  35. Jones J. A., Keough D. (2008). Auditory-motor mapping for pitch control in singers and nonsingers. Exp. Brain Res. 190 279–287. 10.1007/s00221-008-1473-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Kleber B., Friberg A., Zeitouni A., Zatorre R. (2017). Experience-dependent modulation of right anterior insula and sensorimotor regions as a function of noise-masked auditory feedback in singers and nonsingers. Neuroimage 147 97–110. 10.1016/j.neuroimage.2016.11.059 [DOI] [PubMed] [Google Scholar]
  37. Kleber B., Zeitouni A. G., Friberg A., Zatorre R. J. (2013). Experience-dependent modulation of feedback integration during singing: role of the right anterior insula. J. Neurosci. 33 6070–6080. 10.1523/JNEUROSCI.4418-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Konopka G., Roberts T. F. (2016). Insights into the neural and genetic basis of vocal communication. Cell 164 1269–1276. 10.1016/j.cell.2016.02.039 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kurt S., Fisher S. E., Ehret G. (2012). Foxp2 mutations impair auditory-motor association learning. PLoS One 7:e33130. 10.1371/journal.pone.0033130 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Lai C. S., Fisher S. E., Hurst J. A., Vargha-Khadem F., Monaco A. P. (2001). A forkhead-domain gene is mutated in a severe speech and language disorder. Nature 413 519–523. 10.1038/35097076 [DOI] [PubMed] [Google Scholar]
  41. Lai C. S., Gerrelli D., Monaco A. P., Fisher S. E., Copp A. J. (2003). FOXP2 expression during brain development coincides with adult sites of pathology in a severe speech and language disorder. Brain 126 2455–2462. 10.1093/brain/awg247 [DOI] [PubMed] [Google Scholar]
  42. Lametti D. R., Nasir S. M., Ostry D. J. (2012). Sensory preference in speech production revealed by simultaneous alteration of auditory and somatosensory feedback. J. Neurosci. 32 9351–9358. 10.1523/JNEUROSCI.0404-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Larson C. R., Altman K. W., Liu H., Hain T. C. (2008). Interactions between auditory and somatosensory feedback for voice F (0) control. Exp. Brain Res. 187 613–621. 10.1007/s00221-008-1330-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Li W., Chen Z., Liu P., Zhang B., Huang D., Liu H. (2013). Neurophysiological evidence of differential mechanisms involved in producing opposing and following responses to altered auditory feedback. Clin. Neurophysiol. 124 2161–2171. 10.1016/j.clinph.2013.04.340 [DOI] [PubMed] [Google Scholar]
  45. Li W., Chen Z., Yan N., Jones J. A., Guo Z., Huang X., et al. (2016). Temporal lobe epilepsy alters auditory-motor integration for voice control. Sci. Rep. 6:28909. 10.1038/srep28909 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Lieberman P. (2009). FOXP2 and human cognition. Cell 137 800–802. 10.1016/j.cell.2009.05.013 [DOI] [PubMed] [Google Scholar]
  47. Liegeois F., Baldeweg T., Connelly A., Gadian D. G., Mishkin M., Vargha-Khadem F. (2003). Language fMRI abnormalities associated with FOXP2 gene mutation. Nat. Neurosci. 6 1230–1237. 10.1038/nn1138 [DOI] [PubMed] [Google Scholar]
  48. Liu H., Meshman M., Behroozmand R., Larson C. R. (2011). Differential effects of perturbation direction and magnitude on the neural processing of voice pitch feedback. Clin. Neurophysiol. 122 951–957. 10.1016/j.clinph.2010.08.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Liu P., Chen Z., Jones J. A., Huang D., Liu H. (2011). Auditory feedback control of vocal pitch during sustained vocalization: a cross-sectional study of adult aging. PLoS One 6:e22791. 10.1371/journal.pone.0022791 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Liu H., Wang E. Q., Chen Z., Liu P., Larson C. R., Huang D. (2010). Effect of tonal native language on voice fundamental frequency responses to pitch feedback perturbations during vocalization. J. Acoust. Soc. Am. 128 3739–3746. 10.1121/1.3500675 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Liu H., Wang E. Q., Verhagen Metman L., Larson C. R. (2012). Vocal responses to perturbations in voice auditory feedback in individuals with Parkinson’s disease. PLoS One 7:e33629. 10.1371/journal.pone.0033629 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Liu Y., Hu H., Jones J. A., Guo Z., Li W., Chen X., et al. (2015). Selective and divided attention modulates auditory-vocal integration in the processing of pitch feedback errors. Eur. J. Neurosci. 42 1895–1904. 10.1111/ejn.12949 [DOI] [PubMed] [Google Scholar]
  53. MacDermot K. D., Bonora E., Sykes N., Coupe A. M., Lai C. S., Vernes S. C., et al. (2005). Identification of FOXP2 truncation as a novel cause of developmental speech and language deficits. Am. J. Hum. Genet. 76 1074–1080. 10.1086/430841 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Macdonald E. N., Goldberg R., Munhall K. G. (2010). Compensations in response to real-time formant perturbations of different magnitudes. J. Acoust. Soc. Am. 127 1059–1068. 10.1121/1.3278606 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Mazziotta J., Toga A., Evans A., Fox P., Lancaster J., Zilles K., et al. (2001). A probabilistic atlas and reference system for the human brain: International Consortium for Brain Mapping (ICBM). Philos. Trans. R. Soc. Lond. B Biol. Sci. 356 1293–1322. 10.1098/rstb.2001.0915 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Mollaei F., Shiller D. M., Baum S. R., Gracco V. L. (2016). Sensorimotor control of vocal pitch and formant frequencies in Parkinson’s disease. Brain Res. 1646 269–277. 10.1016/j.brainres.2016.06.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Mozzi A., Riva V., Forni D., Sironi M., Marino C., Molteni M., et al. (2017). A common genetic variant in FOXP2 is associated with language-based learning (dis)abilities: evidence from two Italian independent samples. Am. J. Med. Genet. B Neuropsychiatr. Genet. 174B, 578–586. 10.1002/ajmg.b.32546 [DOI] [PubMed] [Google Scholar]
  58. Mulert C., Jager L., Schmitt R., Bussfeld P., Pogarell O., Moller H. J., et al. (2004). Integration of fMRI and simultaneous EEG: towards a comprehensive understanding of localization and time-course of brain activity in target detection. Neuroimage 22 83–94. 10.1016/j.neuroimage.2003.10.051 [DOI] [PubMed] [Google Scholar]
  59. Murugan M., Harward S., Scharff C., Mooney R. (2013). Diminished FoxP2 levels affect dopaminergic modulation of corticostriatal signaling important to song variability. Neuron 80 1464–1476. 10.1016/j.neuron.2013.09.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Padovani A., Cosseddu M., Premi E., Archetti S., Papetti A., Agosti C., et al. (2010). The speech and language FOXP2 gene modulates the phenotype of frontotemporal lobar degeneration. J. Alzheimers Dis. 22 923–931. 10.3233/JAD-2010-101206 [DOI] [PubMed] [Google Scholar]
  61. Parkinson A. L., Flagmeier S. G., Manes J. L., Larson C. R., Rogers B., Robin D. A. (2012). Understanding the neural mechanisms involved in sensory control of voice production. Neuroimage 61 314–322. 10.1016/j.neuroimage.2012.02.068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Pascual-Marqui R. D. (2002). Standardized low-resolution brain electromagnetic tomography (sLORETA): technical details. Method Find. Exp. Clin. Pharmacol. 24(Suppl. D), 5–12. [PubMed] [Google Scholar]
  63. Pinel P., Fauchereau F., Moreno A., Barbot A., Lathrop M., Zelenika D., et al. (2012). Genetic variants of FOXP2 and KIAA0319/TTRAP/THEM2 locus are associated with altered brain activation in distinct language-related regions. J. Neurosci. 32 817–825. 10.1523/JNEUROSCI.5996-10.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Premi E., Pilotto A., Alberici A., Papetti A., Archetti S., Seripa D., et al. (2012). FOXP2, APOE, and PRNP: new modulators in primary progressive aphasia. J. Alzheimers Dis. 28 941–950. 10.3233/JAD-2011-111541 [DOI] [PubMed] [Google Scholar]
  65. Ranasinghe K. G., Gill J. S., Kothare H., Beagle A. J., Mizuiri D., Honma S. M., et al. (2017). Abnormal vocal behavior predicts executive and memory deficits in Alzheimer’s disease. Neurobiol. Aging 52 71–80. 10.1016/j.neurobiolaging.2016.12.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Scheerer N. E., Behich J., Liu H., Jones J. A. (2013). ERP correlates of the magnitude of pitch errors detected in the human voice. Neuroscience 240 176–185. 10.1016/j.neuroscience.2013.02.054 [DOI] [PubMed] [Google Scholar]
  67. Scheerer N. E., Jones J. A. (2012). The relationship between vocal accuracy and variability to the level of compensation to altered auditory feedback. Neurosci. Lett. 529 128–132. 10.1016/j.neulet.2012.09.012 [DOI] [PubMed] [Google Scholar]
  68. Tang X., Chen N., Zhang S., Jones J. A., Zhang B., Li J., et al. (2018). Predicting auditory feedback control of speech production from subregional shape of subcortical structures. Hum. Brain Mapp. 39 459–471. 10.1002/hbm.23855 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Teramitsu I., Kudo L. C., London S. E., Geschwind D. H., White S. A. (2004). Parallel FoxP1 and FoxP2 expression in songbird and human brain predicts functional interaction. J. Neurosci. 24 3152–3163. 10.1523/JNEUROSCI.5589-03.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Tourville J. A., Reilly K. J., Guenther F. H. (2008). Neural mechanisms underlying auditory feedback control of speech. Neuroimage 39 1429–1443. 10.1016/j.neuroimage.2007.09.054 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Tremblay S., Shiller D. M., Ostry D. J. (2003). Somatosensory basis of speech production. Nature 423 866–869. 10.1038/nature01710 [DOI] [PubMed] [Google Scholar]
  72. Vargha-Khadem F., Watkins K. E., Price C. J., Ashburner J., Alcock K. J., Connelly A., et al. (1998). Neural basis of an inherited speech and language disorder. Proc. Natl. Acad. Sci. U.S.A. 95 12695–12700. 10.1073/pnas.95.21.12695 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Watkins K. E., Dronkers N. F., Vargha-Khadem F. (2002a). Behavioural analysis of an inherited speech and language disorder: comparison with acquired aphasia. Brain 125 452–464. [DOI] [PubMed] [Google Scholar]
  74. Watkins K. E., Vargha-Khadem F., Ashburner J., Passingham R. E., Connelly A., Friston K. J., et al. (2002b). MRI analysis of an inherited speech and language disorder: structural brain abnormalities. Brain 125 465–478. [DOI] [PubMed] [Google Scholar]
  75. Xie Z., Maddox W. T., Knopik V. S., McGeary J. E., Chandrasekaran B. (2015). Dopamine receptor D4 (DRD4) gene modulates the influence of informational masking on speech recognition. Neuropsychologia 67 121–131. 10.1016/j.neuropsychologia.2014.12.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Zarate J. M., Wood S., Zatorre R. J. (2010). Neural networks involved in voluntary and involuntary vocal pitch regulation in experienced singers. Neuropsychologia 48 607–618. 10.1016/j.neuropsychologia.2009.10.025 [DOI] [PubMed] [Google Scholar]
  77. Zarate J. M., Zatorre R. J. (2005). Neural substrates governing audiovocal integration for vocal pitch regulation in singing. Ann. N. Y. Acad. Sci. 1060 404–408. 10.1196/annals.1360.058 [DOI] [PubMed] [Google Scholar]
  78. Zarate J. M., Zatorre R. J. (2008). Experience-dependent neural substrates involved in vocal pitch regulation during singing. Neuroimage 40 1871–1887. 10.1016/j.neuroimage.2008.01.026 [DOI] [PubMed] [Google Scholar]
  79. Zhao Y., Ma H., Wang Y., Gao H., Xi C., Hua T., et al. (2010). Association between FOXP2 gene and speech sound disorder in Chinese population. Psychiatry Clin. Neurosci. 64 565–573. 10.1111/j.1440-1819.2010.02123.x [DOI] [PubMed] [Google Scholar]
  80. Zhu X., Niu Y., Li W., Zhang Z., Liu P., Chen X., et al. (2016). Menstrual cycle phase modulates auditory-motor integration for vocal pitch regulation. Front. Neurosci. 10:600. 10.3389/fnins.2016.00600 [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Zumsteg D., Lozano A. M., Wennberg R. A. (2006). Depth electrode recorded cerebral responses with deep brain stimulation of the anterior thalamus for epilepsy. Clin. Neurophysiol. 117 1602–1609. 10.1016/j.clinph.2006.04.008 [DOI] [PubMed] [Google Scholar]

Articles from Frontiers in Neuroscience are provided here courtesy of Frontiers Media SA

RESOURCES