Abstract
Deficits in fluent speech production following left hemisphere stroke are a central concern because of their impact on patients’ lives and the insight they provide about the neural organization of language processing. Fluent speech production requires the rapid coordination of phonological, semantic, and syntactic processing, so this study examined how deficits in connected speech relate to these language sub-systems. Behavioural data (N = 69 participants with aphasia following left hemisphere stroke) consisted of a diverse and comprehensive set of narrative speech production measures and measures of overall severity, semantic deficits, and phonological deficits. These measures were entered into a principal component analysis with bifactor rotation—a latent structure model where each item loads on a general factor that reflects what is common among the items, and orthogonal factors that explain variance not accounted for by the general factor. Lesion data were available for 58 of the participants, and each factor score was analysed with multivariate lesion–symptom mapping. Effects of connectivity disruption were evaluated using robust regression with tract disconnection or graph theoretic measures of connectivity as predictors. The principal component analysis produced a four-factor solution that accounted for 70.6% of the variance in the data, with a general factor corresponding to the overall severity and length and complexity of speech output (complexity factor), a lexical syntax factor, and independent factors for Semantics and Phonology. Deficits in the complexity of speech output were associated with a large temporo-parietal region, similar to overall aphasia severity. The lexical syntax factor was associated with damage in a relatively small set of fronto-parietal regions which may reflect the recruitment of control systems to support retrieval and correct usage of lexical items that primarily serve a syntactic rather than semantic function. Tract-based measures of connectivity disruption were not statistically associated with the deficit scores after controlling for overall lesion volume. Language network efficiency and average clustering coefficient within the language network were significantly associated with deficit scores after controlling for overall lesion volume. These results highlight overall severity as the critical contributor to fluent speech in post-stroke aphasia, with a dissociable factor corresponding to lexical syntax.
Keywords: aphasia, fluency, syntax, lesion–symptom mapping, connectivity
Zevgolatakou et al. report that measures of narrative speech in post-stroke aphasia correspond to (i) general impairment severity and sentence/utterance complexity, which are associated with damage in a large temporo-parietal region and (ii) lexical syntax, which is associated with damage in a relatively small set of fronto-parietal regions.
Graphical Abstract
Introduction
Aphasia is an acquired impairment of language production and comprehension that affects approximately one-third of stroke survivors and is one of the most frequent and debilitating consequences of brain injury.1,2 From the earliest research on aphasia,3,4 deficits in fluent speech production have been a central concern of both clinical and scientific importance because of their impact on patients’ lives and the insight they provide about the neural organization of language processing. In the intervening 100 + years, it has become clear that fluent speech production requires the rapid coordination of multiple cognitive and neural systems, including syntax, sentence planning and working memory, semantics, and phonology and articulatory motor control.5-8 This study used data-driven methods to investigate the relationships among fluency, overall aphasia severity, and deficits of syntax, semantics and phonology.
Contributors to fluent speech production
Although aphasia is traditionally divided into ‘fluent’ and ‘non-fluent’ subtypes, this distinction has been widely criticised for not correctly capturing symptom co-occurrence and for having a poor agreement between diagnostic instruments.9-13 A key problem is that multiple different underlying impairments can disrupt fluent speech production. These underlying impairments reflect different cognitive sub-systems with different neural bases and may require different treatment approaches.10
Deficits of planning or executing the articulatory gestures that make up speech can produce hesitations, phonological errors and distortions, which are one form of non-fluent speech. Such deficits can arise from impairments in phonological planning or in articulatory motor control (e.g. dysarthria) or their interface (apraxia of speech) and are associated with damage to precentral motor control regions, postcentral somatosensory cortex, and inferior parietal regions that support phonological–articulatory planning.7,14-16
Difficulty with retrieving the intended word can also produce hesitations during a connected speech while the speaker searches for that word. Such difficulty results from impaired lexical–semantic processes: semantic knowledge of words and selection processes to choose among candidate words.17,18 Impairments of these lexical–semantic processes are associated with damage to anterior temporal and inferior frontal regions.19-21
Syntactic processes are also critical for fluent speech production, and there is a long history of association between ‘agrammatism’ and non-fluent aphasia.22,23 Agrammatism is characterized by reduced or omitted syntactic structure, which is a common characteristic of non-fluent speech. These structural elements can be at the sentence-level (e.g. well-formedness of sentences, extent of structural embeddings) or at the lexical-level (e.g. correct use of pronouns, production of determiners and closed class words). A recent comprehensive framework integrated a large body of evidence to propose that posterior superior and middle temporal regions support hierarchical lexical–syntactic functions for production and comprehension while inferior frontal regions support morpho-syntactic sequencing primarily relevant for production.6 This view was supported by a dissociation between reduction/omission of grammatical structure (‘agrammatism’), which was associated with frontal damage, and grammatical errors that were not reductions (‘paragrammatism’), which was associated with posterior superior and middle temporal damage.24
However, the neural basis for syntactic processing is somewhat inconsistent across studies, implicating large regions of frontal, parietal, and temporal cortex, as well as the underlying white matter. Some evidence suggests that the language network shows lexical–semantic and combinatorial sensitivity with no regions selective for purely syntactic processes.25 In contrast to a syntax-specific view, some have argued that syntactic deficits are a result of relatively general cognitive resource reductions.26 At least one study7 found no regions where damage was associated with a syntactic deficit; it is likely that other such null results exist but have been suppressed by publication bias.
Thus, fluency deficits could plausibly arise from phonological/articulatory, lexical–semantic, or syntactic deficits. Connectivity within these systems also plays an important role: several studies have found that damage to frontal and peri-Sylvian white matter tracts is an important contributor to fluency deficits.27-30 More recent ‘network neuroscience’ work has quantified connectivity disruption using graph theory measures, which are meant to capture broader connectivity properties like the efficiency of information transmission and integration or segregation of neural components.31 One study found that graph theory measures outperformed connection weights as predictors of fluency deficits.32
A further complication is that recent studies have identified aphasia severity as a major dividing feature. A large-scale study of 330 participants with aphasia across three different aetiologies (post-stroke aphasia, primary progressive aphasia, and post-operative aphasia) found that severity was the primary dimension of variability, explaining 75% of the variance in aphasia battery sub-scores.33 A more focused examination of 226 participants with post-stroke aphasia found that a ‘mild aphasia’ cluster was behaviourally and neuroanatomically distinct from the other two clusters, which corresponded to the semantic deficit and phonological deficit.34
Multi-dimensional and data-driven approaches to fluency
Data-driven statistical methods like factor analysis and principal component analysis (PCA) use correlations among measures to group them into latent factors or components and calculate composite scores for those factors. Several studies have focused on measures of connected speech and used factor analysis to identify clusters of measures that suggest common underlying components5,35-37 (for recent reviews of factor analysis approaches in contemporary aphasia research see38,39). A key factor analysis study5 of spontaneous speech in 274 individuals with aphasia identified 6 latent factors that accounted for 52% of the variance, with substantial overlap between some of the factors. In particular, four of the factors were strongly correlated and primarily reflected fluency (words per minute) and syntactic structure (utterance length, use of grammatically complex structures, propositional density, verb marking and grammatical errors). The other factors reflected semantic anomalies (e.g. jargon) and utterance repairs.
Because multiple cognitive sub-systems contribute to fluent speech production, it is important to consider them simultaneously. That is, combining measures derived from connected speech elicitation tasks with other measures that capture specific sub-systems is necessary for investigating how fluency relates to those sub-systems. A study of 50 participants with primary progressive aphasia40 found that frontal regions were associated with speech sound distortions and syntactic deficits, anterior and inferior temporal regions with lexical retrieval, and posterior temporal regions with phonological errors. In post-stroke aphasia, a statistical path modelling analysis8 found that syntactic impairment had a strong and direct association with fluency, while word production, comprehension and working memory deficits were indirectly associated with fluency.
Such multi-dimensional approaches can allow investigation of how the sub-systems relate to one another in addition to how they contribute to fluency. This is particularly important for the domain of syntax deficits, which (as briefly reviewed in the previous section) have sentence-level and word-level aspects, and could result from general resource reduction (which can be reflected in overall severity) or combinatorial deficits rather than syntax-specific impairments.
Several recent studies have applied this approach and identified consistent dissociations between fluency, semantic, phonological, and executive deficits.39 In these studies, the fluency components had high loadings from composite measures of spontaneous speech production [e.g. fluency scores from aphasia assessments such as the Western Aphasia Battery (WAB) and Comprehensive Aphasia Test] and coarse measures of fluency (such as words per minute and mean length of utterance). A recent study41 used finer-grained discourse production measures and showed that connected speech production had distinct components of quantity (total number of content words and number of unique content words), quality (ratio of total words to unique words, also known as the type–token ratio), and words per minute (which they labelled ‘motor speech ability’). Of note, these studies included varied measures of fluency but very limited measures of syntax. Another recent study specifically focused on connected speech production in acute left hemisphere stroke42 and found distinct components of utterance length/complexity, syntactic accuracy, lexical retrieval, and words per minute (which they labelled ‘production fluency’). This last study suggests dissociations between phonological, semantic, syntactic, and combinatorial processes, but the semantic measures were limited, and phonological measures were virtually absent, so the connection to those broader components remains unclear.
The current study
The current study used a data-driven approach to evaluate how measures of fluency and syntax deficits in post-stroke aphasia relate to overall aphasia severity, semantic deficits, and phonological deficits, and the lesion correlates of these components. We build on prior data-driven work, making several key innovations and improvements. First, we used quantitative production analysis (QPA)23 to derive a diverse set of narrative speech production measures that include fluency and syntax at lexical, utterance and sentence levels. Like Ding et al.,42 we used a comprehensive set of QPA measures; however, we also included general measures of language impairment and measures of semantic and phonological deficits. These additional measures served as ‘anchors’ for three well-established aspects of aphasic deficits (overall severity, semantic deficit, and phonological deficit) allowing the QPA-based measures of fluency and syntax deficit to attach to those factors or to form separate factors.
Second, to explicitly capture overall severity as a core dimension of variability in aphasia, we used a bifactor PCA model. Bifactor modelling re-emerged relatively recently as a psychometric technique that hypothesizes a single general factor, which explains shared variance across all items (measures), and a series of orthogonal (uncorrelated) domain-specific factors.43,44 Because we included anchor variables and used bifactor rotation, we expected to find severity, semantic, and phonological factors; the key research question was how the QPA-based measures of fluency and syntax would distribute among those factors or form additional factors.
Third, we conducted lesion–symptom mapping (LSM) analyses using a multivariate optimization technique known as sparse canonical correlation analysis for neuroimaging (SCCAN), which more accurately identifies symptom-relevant lesion areas.45 SCCAN is particularly important for the present study because fluency and syntax are likely to rely on a distributed network of brain regions, and SCCAN is better able to detect networks of symptom-relevant brain regions.45 In addition, SCCAN uses cross-validation (CV) to optimize the sparseness of the solution and evaluates the prediction accuracy (and the statistical significance) of the overall solution. As a result, the accuracy of lesion-deficit prediction determines the size of the critical region identified by the LSM analysis. In contrast, in mass-univariate LSM, the focality of the result is based on the conservativeness of the correction for multiple comparisons, which is set based on statistical considerations.46-48 Therefore, SCCAN is better able to capture distributed networks of lesion-relevant regions and to determine whether there are (possibly multiple) small symptom-relevant regions or a single large region.
Fourth, based on the premise that connectivity disruption can undermine functional networks that support language production, we tested the contribution of white matter damage using both tract-based and network-based measures. This was motivated by prior studies that highlighted the role of white matter damage [particularly the frontal aslant tract (FAT) and the anterior portion of the arcuate fasciculus (AF)]27,49,50 in fluency and syntax deficits, although we did not find specific effects of frontal white matter damage in our prior work.7 That is, the current state of the evidence is equivocal, possibly because there is substantial variability in both the methods and results of prior work on effects of connectivity disruption on language deficits in post-stroke aphasia. Therefore, these exploratory analyses were intended to help build toward a consensus rather than to test specific hypotheses. In the current study, we evaluated the effects of damage to a larger set of language-relevant white matter tracts [left AF, superior longitudinal fasciculus, inferior fronto-occipital fasciculus (IFOF), uncinate fasciculus (UF), and FAT]. We also evaluated the effects of connectivity disruption using measures derived from graph theory (global efficiency, characteristic path length, average clustering coefficient), which are meant to capture broader communication efficacy and efficiency.
In sum, the present study provides a comprehensive behavioural and neural assessment of fluency deficits in post-stroke aphasia. A data-driven approach was used to determine how a diverse set of measures of fluency and syntax processing were associated with or dissociated from measures of aphasia severity, semantic deficits, and phonological deficits. Multivariate LSM and connectivity disruption analyses were then used to determine the neural correlates of these behavioural deficit dimensions.
Materials and methods
Participants
The initial data set consisted of behavioural data from 74 participants who were native English speakers, right-handed, and had suffered a left hemisphere stroke. These participants are the subset who completed the connected speech elicitation task from a larger study of post-stroke aphasia.51 We conducted a preliminary leave-one-out PCA stability analysis in which we iteratively removed an individual participant and tested the effect of that exclusion on the PCA results (this is a non-parametric version of standard outlier detection methods). In general, there was a very high correlation (>0.95) between PCA results with and without a participant, indicating stable results that are minimally influenced by any one participant. However, there were five high-influence participants: excluding one of these participants substantially changed the results. These five outlier participants were excluded from the analysis (these were multivariate outlier cases: their pattern of performance did not conform to the factor structure that described the rest of the participants. However, they were not outliers on any one dimension and their performance profiles were not particularly similar to one another. We speculate that these participants were using idiosyncratic strategies so their performance on some measures was substantially better than would be expected based on their performance on other measures). The analytic sample contained behavioural data from 69 participants and imaging data were available for 58 of these participants.
All but five participants were examined at the chronic stage of aphasia (at least 3 months post-onset), but the exclusion of these five participants did not change the results so the more inclusive analyses are reported here. The data are part of a larger study on language processing following left hemisphere stroke and several previous articles on language deficits in aphasia have included subsets of the participants.7,19,34,48,52,53 A summary of demographic and clinical information is shown in Table 1. The original data collection and sharing were approved by Institutional Review Boards at the Einstein Healthcare Network and University of Pennsylvania School of Medicine. The analyses of de-identified data in the current study were approved by the University of Edinburgh PPLS Research Ethics Committee.
Table 1.
Variable | Behavioural analysis sample (N = 69) | Lesion analysis sample (N = 58) |
---|---|---|
Sex (F:M) | 33:36 | 27:31 |
Age | 58.0 (50.0–68.0) [31.0–79.0] | 58.5 (51.3–68.0) [31.0–79.0] |
Months post-onset | 23.0 (7.0–75.0) [0.5–234] | 28.0 (7.0–81.0) [0.5–234] |
Education (years) | 14.0 (12.0–18.0) [10.0–21.0] | 14.0 (12.0–18.0) [10.0–21.0] |
WAB AQ | 83.1 (71.7–90.4) [47.2–99.3] | 81.3 (71.3–91.0) [47.2–99.3] |
WAB fluency | 8.0 (5.0–9.0) [2.0–10.0] | 8.0 (5.0–9.0) [2.0–10.0] |
Total words | 231.0 (202.0–264.0) [99.0–404.0] | 230.0 (201.2–263.4) [99.0–404.0] |
Words per minute | 64.5 (44.8–86.6) [14.1–193.5] | 61.6 (43.3–88.5) [14.1–193.5] |
Med Utter Len | 5.0 (4.0–6.0) [1.0–11.0] | 5.0 (4.0–6.0) [1.0–8.5] |
PNT: Correct | 0.79 (0.65–0.86) [0.23–0.97] | 0.79 (0.65–0.86) [0.23–0.97] |
PNT: Semantic Err | 0.034 (0.017–0.051) [0–0.11] | 0.034 (0.017–0.051) [0–0.11] |
PNT: Formal/NW Err | 0.068 (0.04–0.16) [0.006–0.37] | 0.068 (0.042–0.17) [0.006–0.37] |
PRT | 94.0 (87.0–97.0) [57.0–100.0] | 94.0 (86.3–96.8) [57.0–100.0] |
Camel and Cactus | 80.0 (72.0–84.0) [45.0–94.0] | 80.0 (72.0–83.8) [45–94] |
Semantic Discrimination | 90.0 (85.0–93.0) [65.0–100.0] | 89.0 (85.0–93.0) [65.0–100.0] |
Non-word repetition | 52.0 (32.0–70.0) [8.0–95.0] | 53.5 (30.5–72.3) [8.0–95.0] |
Aphasia classification | ||
Anomic | 40 | 31 |
Broca | 13 | 14 |
Conduction | 10 | 9 |
TSA | 2 | 2 |
TMA | 3 | 1 |
Wernicke | 1 | 1 |
Note: Values are number of participants or medians with IQR in parentheses and range in square brackets.
Err = error, NW = non-word, TSA = transcortical sensory aphasia, TMA = transcortical motor aphasia, Med Utter Len = median utterance length.
Image acquisition and preprocessing
Lesion location was assessed based on MRI (N = 45) or CT (N = 13) brain scans, following the same procedures as previous studies of this data set.7,19,34,48,52,53 For the MRI scans, lesions were manually segmented on each participant’s T1-weighted structural image by a technician and reviewed by an experienced neurologist for accuracy. Each participant’s brain image was registered to the Montreal Neurological Institute space Colin27 template by an automated symmetric diffeomorphic registration algorithm,54 which iteratively samples the space to find the best solution to align the intact portion of the brain (i.e. with the lesion masked out) with the template. This solution (image transformation) is then applied to the lesion mask to register it to the same template. For the CT scans, a neurologist drew the lesions directly onto the Colin27 template after rotating it (pitch only) to match the approximate slice plane of the participant’s scan.
Quantitative production analysis
Connected speech samples were primarily elicited via the ‘Cinderella story’ with the requirement that every elicited narrative had at least 150 words; if necessary, additional connected speech was elicited using other well-known stories (such as ‘The Little Red Riding Hood’) in order to reach at least 150 words. The speech samples were transcribed and coded by a speech pathologist or research assistant specifically trained to perform transcription and QPA coding following guidelines based on published QPA scoring protocol.23,55,56 The protocol is designed to characterize aphasic sentence production, focusing on syntax and more general measures of fluency, and has been adapted to accommodate fluent as well as non-fluent participants. During training, all coders achieved ∼90% agreement on their transcription and coding of utterance boundaries, utterance content and grammatical structure.
The measures selected for the analyses reflected a wide range of morphological, structural and lexical properties of speech. Based on the grouping described in the QPA manual,56 the measures correspond to lexical content (words per minute, proportion of closed class words, proportion of pronouns, proportion of verbs, determiner index, and inflection index), auxiliary verb usage (auxiliary complexity index), and structural analysis [proportion of words in sentences, proportion of well-formed sentences, embedding index, mean sentence length, median utterance length, mean verb phrase (VP) length]. See Supplementary Table 1 for an additional explanation of the QPA measures. This QPA manual grouping was used as a starting point, but the grouping and characterization of these measures differ somewhat across studies,23,55,57 and a goal of this study was to evaluate how these measures relate to aphasia severity, semantic deficits and phonological deficits.
Additional behavioural assessments
The behavioural testing of the participants consisted of a wide range of neuropsychological batteries and tests. To supplement the QPA measures, a subset of measures was selected to reflect overall severity and two important contributors to language production that are not captured by QPA measures: phonology and semantics. These measures provide are optimized for capturing phonological and semantic deficits and are independent of connected speech, so they provide a stronger assessment of whether aspects of connected speech are a result of such deficits.
WAB-Revised:58 A standard aphasia assessment battery that provides an overall measure of aphasia severity—the aphasia quotient (AQ). The measure used is WAB AQ.
Philadelphia Naming Test (PNT):59 A test of single word production in object (picture) naming that provides measures of overall word production deficit (accuracy) as well as semantic and phonological deficits (semantic and phonological errors, respectively). Measures used are as follows: proportion of correct responses, proportion of semantic errors and proportion of formal and non-word errors.
Camel and Cactus Test (CCT):60 A picture-based semantic association test. The measure used is percentage of correct responses.
Semantic discrimination: A verbal semantic judgment task derived from the semantic category probe task61 by using only list-length 1 data. The measure used is percentage of correct responses.
Philadelphia Repetition Test (PRT):62 A test of word repetition using the same words as the PNT. This test primarily measures phonological processing with lexical contributions.63,64 The measure used is percentage of correct responses.
Non-word repetition task:61 A test of non-word repetition, which primarily measures phonological production. The measure used is percentage of correct responses.
Principal component analysis
The 21 behavioural measures (13 from QPA, 8 additional measures) were entered into a principal component analysis (PCA) with a bifactor rotation. (Analogous exploratory factor analysis produced virtually identical factor loadings [r > 0.98 for each of the four factors/components] and it is more straightforward to compute individual participant scores from PCA, so PCA was used here). A bifactor model is a latent structure model where each item loads on a general factor that reflects what is common among the items, and two or more orthogonal factors that potentially explain variance not accounted for by the general factor.44 In this case, overall aphasia severity plausibly affects all measures, so using a bifactor model explicitly allows all measures to load on this general factor. The measures that load strongly and only on this factor will be ones that are very closely related to aphasia severity. The additional factors can then represent language sub-systems that may be affected by stroke independently of overall severity. In other words, the bifactor model allows directly exploring the extent to which items reflect a common trait (aphasia severity) and the extent to which they reflect statistically independent sub-systems (semantics, phonology, syntax, etc.). The resulting component scores for every participant were then used for the LSM analysis.
Lesion–symptom mapping analysis
LSM analyses were performed using SCCAN.45,65 SCCAN is an optimization algorithm that finds a set of weights that maximize the relationship between behavioural scores and voxel lesion values. Like other multivariate LSM methods, SCCAN considers all voxels together rather than individually. In addition, voxel weights are smoothed and very small clusters of voxels are set to zero to avoid the inclusion of isolated voxels in the solution. Most importantly, the SCCAN LSM result is governed by a ‘sparseness’ value that defines the proportion of voxels that are included in the solution. CV is used to empirically determine an optimal sparseness value that maximizes the predictive accuracy of the solution (i.e. maximum correlation between observed and predicted behavioural scores) while minimizing the number of voxels in the solution (i.e. sparser solutions are favoured over less sparse ones). As a result, if a behavioural score can be predicted from lesion in a small subset of voxels, SCCAN will identify this subset of voxels; if damage throughout a large territory is associated with the behavioural score, then SCCAN will find that non-sparse solution.
The goodness of the SCCAN solution is the correlation between the predicted and observed behavioural scores and the statistical significance of that correlation is used to determine whether the SCCAN solution is acceptable. If SCCAN is not able to identify a set of voxels that statistically significantly predict the behavioural score, then SCCAN has failed to identify an adequate solution. Critically, the SCCAN solution is evaluated in terms of its overall predictive accuracy, not individual voxels within it, so there is only one overall statistical test, not individual tests for voxels. This is in stark contrast to mass-univariate LSM and some multivariate LSM methods (such as LSM using support vector regression66), where statistical significance is evaluated at the level of individual voxels and requires correcting for multiple comparisons across many voxels. The conservativeness of this multiple comparisons correction has a major impact on the focality of results. Unlike these other LSM methods, the sparseness or focality of SCCAN LSM results is determined by predictive accuracy of the overall solution.
Measures of connectivity disruption
Connectivity disruption measures were derived using atlas-based metrics of lesion impact generated in MATLAB 2020b67 using the Lesion Quantification Toolkit68 and DSI Studio (version date: Oct 24, 2020; http://dsi-studio.labsolver.org). The toolkit calculates connectivity disruption by overlaying the lesion files on an atlas of intact connectivity.69 This whole-brain connectivity atlas was derived using deterministic tractography on the population-averaged HCP-842 data set with multiple turning angle thresholds to obtain 500 000 population-level streamline trajectories (i.e. estimated white matter fibre trajectories based on directional diffusion information), which were manually vetted and assigned to known white matter fibre tracts by a team of neuroanatomists.69
Participant binarized and spatially normalized lesion files were used as inputs to the toolkit, which was run with the following parameters: connection criteria = pass, spared connection threshold = 50 and smoothing kernel = 2. The primary atlas-based measures of interest were (i) disconnection severity within critical left hemisphere white matter tracts, (ii) metrics describing the whole-brain network structure, and (iii) metrics describing the regional structure of a language sub-network.
Tract-based disconnection severity was calculated by overlaying the lesion files on each tract within the HCP-842 tractography atlas69 and quantifying the number of streamlines intersecting the lesion relative to the total number of streamlines within the tract. This method of estimating tract disconnection severity has been shown to be more sensitive than traditional lesion load measures.70 Disconnection severity within the left superior longitudinal fasciculus (SLF), left AF, left UF, left FAT, and left IFOF were used for analyses.
In order to generate network metrics, a structural connectivity matrix was constructed for each participant by starting with the streamline trajectories included in the HCP-842 tractography atlas,69 removing streamlines that intersected with the participant’s lesion, and counting the number of remaining streamlines that passed through each pair of regions within the AAL3 atlas.71 The standard implementation of the Lesion Quantification Toolkit outputs disconnection severity matrices (i.e. the number of streamlines that intersect the lesion territory). To compute a connectivity matrix, the number of preserved streamlines was estimated as the number of streamlines within the HCP-842 tractography atlas that did not intersect with the lesion territory. Graph theoretical measures quantifying the network-level impact of the lesion were derived using DSI Studio’s implementation of the Brain Connectivity Toolbox31 applied to each participant’s structural connectivity matrix where each element in the matrix corresponds to the count of preserved streamline connections between a pair of regions.
Structural connectivity matrices and corresponding graph theoretical measures were also generated for a language sub-network comprised of the following left hemisphere regions within the AAL3 atlas and based on prior work using similar methods32, 72-76 middle frontal gyrus, inferior frontal gyrus (IFG) (pars opercularis, pars triangularis, and pars orbitalis), precentral gyrus, postcentral gyrus, insula, superior portion of the temporal pole, middle portion of the temporal pole, inferior temporal gyrus, middle temporal gyrus, superior temporal gyrus, supramarginal gyrus (SMG) and angular gyrus. The global, weighted network measures for the whole brain and the regional language network were used for subsequent analyses.
Three graph theoretical measures were selected for analyses (and are illustrated in Fig. 1):
Average clustering coefficient: the average of the clustering coefficients of every node within a network. Clustering coefficient is a measure of network segregation, capturing the extent to which functionally related regions are densely connected into specialized clusters. Damage within a functionally segregated, densely connected cluster can impact the efficiency of local processing77,78 but may have a limited effect on other clusters, thus allowing other functions to be preserved.
Characteristic path length: the average of the shortest paths between all pairs of nodes within the network. The characteristic path length is a measure of network integration based on how many steps are required to get from one node to another node. Higher values indicate that (on average) information must travel through more steps to get from one node to another, suggesting a less functionally integrated network.31 It is associated with reduced functional connectivity,79 degree of language impairment,32,68 and fluency in particular32 (some studies refer to path length as ‘propagation speed’ or ‘propagation steps’).
Global Efficiency: the average of each individual node’s global efficiency, which is calculated by starting with the shortest path lengths between that node and all other nodes (as for characteristic path length), taking the inverse of those path lengths, and averaging those values. Like characteristic path length, global efficiency is a measure of network integration that captures how quickly information can travel between brain regions,80 but the difference in their calculation has substantial implications. If two nodes are not connected, the path length between them is infinite and the communication efficiency is 0. When calculating the average, these values have a radically different impact (i.e. disconnected nodes very strongly increase characteristic path length, but only moderately decrease global efficiency). More generally, the characteristic path length is strongly influenced by long paths (infinite path length between disconnected nodes is an extreme case) but efficiency is more sensitive to short paths. This difference may be further exaggerated when these measures are calculated for stroke-damaged brains, which are more likely to have disconnected nodes or to rely on longer paths to communicate around the lesion territory. Johnson et al.77 found that higher global efficiency within the semantic network was associated with better response to naming therapy in chronic post-stroke aphasia.
Statistical analysis
PCA was implemented using the psych package version 1.9.12,81 with the bifactor rotation implementation from Jennrich and Bentler.82 The behavioural measures were checked for factorability with a Kaiser–Meyer–Olkin (KMO) test; conventionally, KMO > 0.7 is considered adequate for factor analysis and KMO < 0.5 is unacceptable. A scree plot, combined with a parallel analysis and Velicer's minimum average partial (MAP) test, was used to determine the number of dimensions that should be retained.
The SCCAN LSM analysis was conducted in R version 3.6 using the package LESYMAP version 0.0.0.9221 (https://github.com/dorianps/LESYMAP). SCCAN LSM was run separately for each of the factors that resulted from the PCA (and a supplementary analysis for WAB AQ), with sparseness optimized independently for each analysis using 8-fold CV (the number of folds was increased from the default 4-fold CV to improve stability of the results). The LSM solution was evaluated based on the correlation between predicted and observed behavioural scores. This evaluation applies to the overall LSM solution, not to individual voxels within the solution, so it is only one test and does not require correction for multiple comparisons.
Associations between connectivity measures and behavioural deficits were tested first using bivariate correlations, then multiple regression to assess simultaneous impact of multiple types of connectivity damage. Separate regressions were run for each behavioural deficit type and connectivity damage group (tracts, network metrics). Several of the connectivity measures had skewed or bimodal distributions, so spearman rank correlations were used for bivariate correlations and multiple regression analyses used robust standard error estimation83 implemented in the lavaan package version 0.6–8.84 Results are reported as coefficient estimates and 95% confidence intervals (95% CIs).
Results
Principal component analysis
Bivariate correlations among the behavioural measures are shown in Supplementary Figure 1. The behavioural measures had adequate overall factorability (KMO = 0.78) and no individual measure was in the ‘unacceptable’ range (all KMO > 0.5). Scree plot, parallel analysis, and MAP test all suggested the extraction of four factors.
Factor loadings from a four-factor PCA with bifactor rotation are shown in Fig. 2 (and in Supplementary Table 2). These four components accounted for 70.6% of the variance (RC1: 33.1%, RC2: 16.1%, RC4: 11.9%, RC3: 9.5%). By design, the first principal component reflected the general underlying factor resulting from the bifactor rotation. As expected, it had high positive loadings from general severity measures (WAB AQ and PNT accuracy). In addition, it contained high positive factor loadings from measures of utterance/sentence length (mean sentence length, median utterance length, mean VP length) and measures of sentence structural integrity and complexity (proportion of words in sentences, inflection index, embedding index). This suggests that the quantity (length) and complexity of speech output is closely related to overall severity, while other aspects can be more effectively separated from severity. We will refer to this factor as ‘complexity’ because that is the aspect of fluent speech production that was uniquely associated with this factor, although it also reflects general aphasia severity.
The second factor had high positive loadings of lexical syntax measures that are considered critical for detecting agrammatic speech production (proportions of closed class words, verbs and pronouns) and moderate loadings from sentence planning measures (proportion of words in sentences, proportion of well-formed sentences). We will refer to this factor as ‘lexical syntax’. The third factor had high negative loadings from tests of repetition of words (PRT) and non-words (non-word repetition task) and a high positive loading from the proportion of phonological errors in picture naming, so it was named ‘Phonology’ (note that this means that a participant’s factor scores for this factor are reversed relative to the other factors: for other factors, higher scores indicate better performance, but for Phonology, higher scores indicate a more severe phonological deficit). The last factor was labelled ‘Semantics’ because it had high negative loadings from the proportion of semantic errors from the PNT and high positive loadings on the percentage of correct responses from the CCT and the semantic discrimination task.
In addition to the overall factor structure, the data-driven PCA approach yielded non-obvious observations about individual measures, such as the inflection index, determiner index and use of pronouns. Inflections are syntactic elements, and their omission is associated with agrammatism,85 so one might have expected them to be part of a syntax factor. Inflections in English typically add phonological complexity (e.g. plural/-s/and past-tense/-d/or/-t/), so it is also possible that their omission is a result of a phonological/articulatory deficit and would cluster with other measures of phonological production deficits. Neither of these turned out to be the case in the present data: the inflection index measure clustered with general measures of aphasia severity (WAB AQ), fluency (words per minute), and sentence and utterance length. Determiners are also small phonological elements that primarily serve a syntactic function, so one might expect them to pattern with inflections. Like inflection index, determiner index did strongly load on the first factor, but it also strongly loaded on the second (lexical syntax) factor. In contrast, use of pronouns—which are short and primarily serve a structural role—loaded almost exclusively on the lexical syntax factor (with verbs and closed class words), not on the first factor or on the Phonological factor. This pattern highlights that the ‘lexical syntax’ factor appears to be specific to production of lexical elements that primarily serve a syntactic (rather than semantic) function more so than other aspects of syntax.
Sparse canonical correlation analysis
Lesion coverage was good throughout the left middle cerebral artery (MCA) territory, particularly the dorsal speech production system structures of the frontal lobe and inferior parietal lobe (Supplementary Figure 2). SCCAN LSM results are shown in Fig. 3 and Table 2.
Table 2.
Brain region | Complexity | Lexical syntax |
---|---|---|
Inferior parietal lobule | 0.301 | 0.231 |
Supramarginal gyrus | 0.660 | 0.063 |
Angular gyrus | 0.001 | 0.029 |
Postcentral gyrus | 0.454 | 0.020 |
Precentral gyrus | 0.067 | 0.124 |
Inferior frontal gyrus, pars triangularis | 0.065 | 0.195 |
Inferior frontal gyrus, pars opercularis | 0.010 | 0.035 |
Middle frontal gyrus | 0.048 | 0.052 |
Superior frontal gyrus | 0.003 | 0.012 |
Insula | 0.097 | 0.003 |
Superior parietal gyrus | 0.052 | 0.002 |
Middle occipital gyrus | 0.003 | 0.0 |
Superior temporal gyrus | 0.612 | 0.0 |
Middle temporal gyrus | 0.278 | 0.0 |
Heschl’s gyrus | 0.271 | 0.0 |
Rolandic operculum | 0.219 | 0.0 |
Caudate | 0.051 | 0.0 |
Superior temporal pole | 0.031 | 0.0 |
Putamen | 0.005 | 0.0 |
Note. All regions refer to the left hemisphere.
Deficits in complexity of speech output were associated with a large temporo-parietal region (optimal sparseness = 0.576, CV correlation = 0.603, P < 0.001): SMG and postcentral gyrus, posterior superior temporal gyrus and middle temporal gyrus, and Heschl’s gyrus (Fig. 3, top row). This result is very similar to the pattern we have previously observed for overall aphasia severity (WAB ) and replicated in this sample (see supplemental materials).
Deficits on the lexical syntax factor were associated with damage in a relatively small set of fronto-parietal regions (optimal sparseness = 0.216, CV correlation = 0.517, P < 0.001): IFG pars triangularis, precentral gyrus, and the dorsal portion of the inferior parietal lobule (Fig. 3, bottom row). SCCAN LSM analyses for the Phonology and Semantics factors did not produce statistically significant solutions as indicated by low CV correlations (Phonology CV correlation = 0.085, P = 0.524; Semantic CV correlation = 0.185, P = 0.164).
Connectome analyses
The behavioural scores tended to be more strongly correlated with lesion volume than with connectivity measures (see Supplementary Table 3). Therefore, the results reported here are based on multiple regression analyses that control for lesion volume. In multiple regression analyses with lesion volume and damage to each of the tracts as predictors, lesion volume was a significant predictor of complexity [Estimate = −0.006, 95% CI = (−0.009, −0.003), P = 0.030], lexical syntax [Estimate = −0.007, 95% CI = (−0.011, −0.002), P = 0.005] and semantics [Estimate = −0. 005, 95% CI = (−0.009, 0.001), P = 0.010] but not phonology [Estimate = 0.000, 95% CI = (−0.003, 0.004), P = 0.905]. None of the tracts had statistically significant associations; full regression results are reported in Table 3.
Table 3.
Complexity | Lexical syntax | Phonology | Semantics | |
---|---|---|---|---|
Tracts | ||||
Volume | −0.0063 (−0.0095, −0.0032)** | −0.0066 (−0.011, −0.002)** | 0.00021 (−0.0032, 0.0036) | −0.0048 (−0.0085, −0.0011)* |
Arcuate fasciculus | 0.008 (−0.0021, 0.018) | −0.0068 (−0.014, 0.00049). | 0.0097 (−0.0027, 0.022) | 0.00068 (−0.0069, 0.0083) |
Frontal aslant tract | −0.0017 (−0.0091, 0.0056) | −0.0014 (−0.0078, 0.005) | 0.0012 (−0.0069, 0.0093) | −0.0054 (−0.014, 0.0036) |
Superior longitudinal | −0.0081 (−0.019, 0.003) | 0.00041 (−0.0099, 0.011) | −0.0095 (−0.022, 0.0033) | 0.01 (−0.0015, 0.022). |
Uncinate fasciculus | 0.0044 (−0.0056, 0.014) | 0.0025 (−0.0093, 0.014) | −0.005 (−0.017, 0.0068) | 0.0053 (−0.0064, 0.017) |
Inferior fronto-occipital | 0.000039 (−0.008, 0.0081) | 0.0047 (−0.0031, 0.012) | −0.0029 (−0.013, 0.0072) | −0.0057 (−0.015, 0.0038) |
Whole-brain network | ||||
Volume | −0.0072 (−0.011, −0.0037)** | −0.0089 (−0.014, −0.0034)** | 0.00011 (−0.0042, 0.0044) | −0.0062 (−0.01, −0.002)** |
Global efficiency | −13 (−200, 170) | −150 (−380, 81) | 130 (−82, 350) | −61 (−270, 150) |
Ave clustering coefficient | 150 (−390, 700) | 340 (−300, 980) | −340 (−940, 260) | 490 (−46, 1000). |
Characteristic path length | 0.00070 (−0.0013, 0.0027) | −0.0012 (−0.0035, 0.0012) | 0.00022 (−0.0022, 0.0027) | 0.0029 (0.00051, 0.0054)* |
Language network | ||||
Volume | −0.0072 (−0.0097, −0.0046)** | −0.0078 (−0.011, −0.0045)** | −0.00011 (−0.0026, 0.0024) | −0.0025 (−0.0059, 0.00096) |
Global efficiency | 1.4 (0.084, 2.8)* | 2.8 (1.6, 4)** | −2.1 (−3.2, −0.91)** | −2 (−3.3, −0.65)** |
Ave clustering coefficient | −2.1 (−13, 8.8) | 8.8 (−0.5, 18) . | −8.1 (−16, −0.54)* | 9 (1.2, 17)* |
Characteristic path length | −0.00011 (−0.0015, 0.0013) | 0.00028 (−0.00084, 0.0014) | −0.0012 (−0.0023, −9.8e-05)* | −0.00019 (−0.0011, 0.00077) |
Note: Values show the regression coefficient estimate (95% confidence interval in brackets).
** P < 0.01, * P < 0.05, P < 0.1.
In analogous analyses of the graph theoretical measures of connectivity, the same lesion volume effects were statistically significant. In addition, language network efficiency was a significant predictor for each of the four PCA factors: Complexity [Estimate = 1.424, 95% CI = (0.084, 2.765), P = 0.037], Lexical syntax [Estimate = 2.781, 95% CI = (1.572, 3.989), P < 0.001], Phonology [Estimate = −2.064, 95% CI = (−3.215, −0.913), P < 0.001] and Semantics [Estimate = −1.986, 95% CI = (−3.326, −0.645), P = 0.004]. Language network average clustering coefficient was also a significant predictor of Phonology [Estimate = −8.100, 95% CI = (−15.662, −0.538), P = 0.036] and Semantics [Estimate = 9.036, 95% CI = (1.249, 16.823), P = 0.023], and a marginally significant predictor of Lexical syntax [Estimate = 8.777, 95% CI = (−0.501, 18.055), P = 0.064].
These were exploratory analyses, not driven by strong hypotheses, and comprising a relatively large number of comparisons, so these connectome disruption results should be interpreted with caution.
Discussion
This study examined the cognitive and neural basis of fluent speech production deficits in post-stroke aphasia, and how they relate to the effects of overall severity, semantic deficits and phonological deficits. This was done by first combining a comprehensive set of narrative speech production measures (derived from QPA) at lexical, utterance, and sentence levels, with measures of general language impairment and measures of semantic and phonological deficits. These behavioural measures were entered into a bifactor PCA, which extracts a single general factor and a series of orthogonal (uncorrelated) domain-specific factors. The bifactor PCA revealed that QPA measures of sentence/utterance length and complexity loaded primarily on the general severity factor, along with the explicit severity measure of WAB AQ. This factor accounted for 33.1% of the variance in the data. A distinct lexical syntax factor was also detected (accounting for 16.1% of the variance). This factor was comprised primarily of the usage of words that serve a structural/grammatical role rather than a semantic role (closed class words, pronouns, determiners and verbs) and some aspects of sentence planning (proportion of words in sentences, proportion of well-formed sentences). Phonology and Semantics factors had expected loadings (accounting for 11.9% and 9.5% of the variance, respectively) with virtually no contributions from any of the QPA measures, suggesting that QPA measures capture aspects of fluent speech production that are quite distinct from phonological planning and semantic processing.
Prior studies that examined the internal structure of QPA measures23,55,57 broadly agree about a lexical–morphological factor (proportions of closed class words, pronouns and verbs) and a sentence-level structural factor (proportion of words in sentences, embedding index, mean sentence and utterance lengths). They disagree about whether correct use of determiners, inflections and auxiliary verbs is morphological–lexical or sentential or a separate grammatical factor, and whether use of ‘well-formed’ sentences belongs with the sentence structure measures or the separate grammatical factor. There is also uncertainty about whether words per minute belongs with the lexical–morphological factor or is a separate factor reflecting speech rate or discourse productivity. The present results converge on the points of agreement (a lexical-level syntax factor and a sentence-level length/complexity factor), but the inclusion of severity measures (WAB AQ and naming accuracy) and prioritizing severity by using a bifactor rotation revealed that length/complexity of speech production is closely related to overall severity. Although several studies have also documented the relationship between fluency and aphasia severity,35–37,86 it has not been recognized in studies that did not include explicit measures of severity and/or tested participants in a relatively narrow range of severity.
The ambiguous measures (words per minute; use of determiners, inflections and auxiliary verbs; and well-formedness of sentences) also loaded strongly on this severity-related complexity factor, although words per minute, determiner index and proportion of well-formed sentences also substantively loaded on the lexical syntax factor (as did the proportion of words in sentences). This suggests that the structural complexity of aphasic speech output is strongly related to overall aphasia severity and the separable grammatical factor is largely lexical, corresponding to the correct use of words that primarily serve structural roles and add very little independent semantic content. Production of such semantically ‘light’ words may be particularly dependent on syntactic cues and vulnerable to syntactic deficits,88 commensurate with their primarily structural role.
The lesion correlates of these deficit dimensions (factor scores) were then evaluated using SCCAN LSM, a multivariate optimization technique that more accurately identifies symptom-relevant lesion areas, including distributed networks of brain areas. Deficits on the general complexity factor were associated with damage in a fairly large portion of the lesion territory, primarily in parietal and posterior temporal regions. These regions include the dorsal speech production stream89 and, in the context of the framework proposed by Matchin and Hickok,6 suggest that fluent production of narrative speech—as measured by length and structural complexity of utterances or sentences—is particularly reliant on temporo-parietal regions that support high-level structural planning of longer utterances and more complex sentence structures. However, there was a substantial overlap between this factor and overall aphasia severity (WAB AQ), both in the PCA results and in the SCCAN LSM results. Both the complexity factor and aphasia severity were associated with damage throughout the lesion territory, even extending dorsally beyond traditional language areas, reflecting that damage most anywhere in the left MCA territory can produce an aphasia-relevant deficit (as we have previously found48). This suggests that a broad range of different language deficits can contribute to the overall aphasia severity score and to the length and complexity of aphasic narrative speech.
Deficits on the lexical syntax factor were associated with damage to a small set of fronto-parietal regions: dorsal inferior parietal lobule (dIPL), precentral gyrus, and IFG pars triangularis. Precentral gyrus and dIPL are not regions that previous studies have consistently associated with syntactic deficits or agrammatism, which are more typically associated with posterior temporal and/or inferior frontal damage.6 IFG is associated with a broad range of language processes (including syntax) and is the subject of ongoing debate. Because the lexical syntax factor was orthogonal to the complexity factor, it is unlikely that the IFG involvement observed here reflects syntactic planning, and because the factor is also orthogonal to the Phonology factor, it is unlikely that this reflects phonological–articulatory planning. A remaining possibility is that the IFG involvement reflects the recruitment of control systems to support the difficult retrieval of semantically ‘light’ words,87 which primarily serve syntactic roles and do not have strong support from semantic representations.88
Tract-based measures of connectivity disruption were weakly (if at all) associated with the deficit scores after controlling for overall lesion volume. In part, this may be because atlas-based estimation of tract damage is not very effective.90 Even if the tract damage measures were very good, tract disconnection and behavioural deficit measures were both strongly correlated with overall lesion volume, leaving relatively little variance for an association between tract disconnection and behavioural deficit.
The graph theoretical measures of connectivity produced a few suggestive patterns. First, measures derived from a left hemisphere peri-Sylvian language network were better predictors than the same measures calculated based on the connectivity of the whole brain. This may be because connectivity within the language network is particularly important for residual language function after left hemisphere stroke (the present data do not address whether this is also true in neurologically intact individuals) or because this is the region where there was the greatest variability between participants, since this is where most of them had lesions (as is normally the case for post-stroke aphasia).
Second, within the language network, global efficiency was the most consistent predictor, being significantly associated with each of the four PCA-derived deficit measures. Global network efficiency is high when the paths between nodes are relatively short and the nodes can exchange information relatively quickly and efficiently31,80 (it is also known as propagation speed32 or propagation steps91). In this case, higher efficiency scores mean that the left hemisphere language network is more functionally integrated and the results indicate better integration of the left hemisphere language network is positively associated with length and complexity of aphasic speech output, its lexical–syntactic content, and reduced phonological deficits (better phonological ability). Surprisingly, the efficiency of the left hemisphere language network was negatively associated with semantic ability. Semantic cognition relies on a bilateral ‘hub-and-spoke’ architecture92 and the global efficiency measure tends to be less useful for larger and sparser networks,31 so these results should be interpreted with caution.
Third, the average clustering coefficient was significantly positively associated with performance on phonological and semantic measures (and marginally with lexical syntax). This suggests that phonological and semantic processing (and, somewhat less reliably, syntax) are supported by relatively specialized, densely connected clusters or sub-networks, as captured by the clustering coefficient measure.
Limitations
Several key limitations need to be considered when drawing conclusions from the present results. The participant sample was limited by the availability of narrative speech samples and lesion data. This resulted in a sample size (N = 58) that was moderate relative to other LSM studies, with most of the participants in the mild-to-moderate range of aphasia severity (median WAB AQ = 81.0, IQR = 71.3–91.0), though it did include some participants with more severe aphasia. Similarly, although the sample included participants across nearly the full range of fluency, most participants were in the upper half of the range (median WAB fluency = 8.0, IQR = 5.0–9.0, range = 2.0–10.0), which is traditionally considered ‘fluent’ aphasia, though individuals in this range nevertheless typically have some disruptions to fluency. This severity distribution is a natural consequence of requiring at least 150 narrative words for QPA scoring—very severely impaired participants are often unable to produce that many narrative words, especially if their impairment is severely non-fluent. Both PCA and LSM are most effective when the sample covers a broad range of severity, including participants with very mild (possibly sub-clinical) deficits. Therefore, the present results are informative about fluency deficits in mild-to-moderate post-stroke aphasia, but it is possible that the pattern of deficits in the severe range is somewhat different.
The behavioural data were adequate for PCA according to standard metrics (KMO > 0.7), but the PCA results need to be considered in the context of which behavioural measures were included and not included (or not available). We sought to balance detailed measures of narrative speech with key anchors (aphasia severity, semantic deficit, phonological deficit). The present study included fewer grammatical measures than prior PCA studies of QPA and fluency35,37,57 but compared with other PCA studies of aphasia,39,41 it included more grammatical measures and fewer other measures. This fills an important gap in the literature by identifying how aspects of connected speech production are related to or independent of those key anchors. Specifically, compared with PCA studies that focused on QPA, we observed fewer QPA-specific factors, largely because measures of speech length and complexity were strongly related to overall aphasia severity rather than forming separate factors. We did find a lexical syntax factor, which is common for QPA-focused studies but rare for broad PCA studies of aphasia, suggesting that this is a distinct dimension of impairment. Measures of motor speech impairment (dysarthria or apraxia of speech) and executive function deficits were not available for inclusion in the present study, so the results do not address how the observed patterns relate to those deficits.
Another key limitation is that diffusion data were not available for the participants, so connectivity disruption was estimated by overlaying each participant’s lesion map on a tractography atlas derived from the HCP-842 data. This method has been widely used, but (to our knowledge) has not been validated and we have previously expressed caution about using it.90 We found that, after controlling for overall lesion volume, tract-based connectivity disruption was weakly (if at all) associated with the deficit scores and graph theory measures of connectivity disruption produced very mixed and inconsistent results. Although unsatisfying on their own, we hope these results will encourage further development and validation of connectivity disruption measures to quantify this important aspect of brain injury.
Summary and conclusion
In sum, the present results demonstrate that length and complexity of narrative speech production is closely related to severity, both in the behavioural sense (i.e. aphasia severity as measured by WAB AQ) and in the neural sense (a large portion of the left MCA territory is associated with reduced length and complexity of narrative speech production). In contrast, lexical aspects of syntax (usage of closed class words, pronouns, verbs, and determiners) appear to be separable from the severity and associated with damage in a smaller set of fronto-parietal regions, suggesting that these deficits may be related to the impaired requirement of control systems to support retrieval and correct usage of words that primarily serve structural functions rather than adding semantic content. Finally, graph theoretic measures of language network integration (efficiency) and cohesion of sub-networks (clustering coefficient) were statistically associated with the deficit scores after controlling for lesion volume, suggesting that these measures of connectivity deserve further investigation.
Supplementary Material
Acknowledgements
We thank Adelyn Brecher, Taylor Foley and the rest of the Moss Rehabilitation Research Institute aphasia research team for QPA coding and data management. We also thank Aja Murray for helpful statistical advice and Erica Middleton, Jerzy Szaflarski, Junhua Ding and the rest of the Functional Communication project team for valuable discussions.
Abbreviations
- AF =
arcuate fasciculus
- AQ =
aphasia quotient
- CCT =
Camel and Cactus Test
- CI =
confidence interval
- CV =
cross-validation
- FAT =
frontal aslant tract
- IFG =
inferior frontal gyrus
- IFOF =
inferior fronto-occipital fasciculus
- KMO =
Kaiser–Meyer–Olkin
- LSM =
lesion–symptom mapping
- MCA =
middle cerebral artery
- PCA =
principal component analysis
- PNT =
Philadelphica Naming Test
- PRT =
Philadelphia Repetition Test
- QPA =
Quantitative Production Analysis
- SCCAN =
sparse canonical correlation analysis for neuroimaging
- SLF =
superior longitudinal fasciculus
- SMG =
supramarginal gyrus
- UF =
uncinate fasciculus
- VP =
verb phrase
- WAB =
Western Aphasia Battery
Contributor Information
Eleni Zevgolatakou, Department of Psychology, University of Edinburgh, 7 George Square, Edinburgh EH8 9JZ, UK.
Melissa Thye, Department of Psychology, University of Edinburgh, 7 George Square, Edinburgh EH8 9JZ, UK.
Daniel Mirman, Department of Psychology, University of Edinburgh, 7 George Square, Edinburgh EH8 9JZ, UK.
Funding
This research was funded by US National Institutes of Health (National Institute on Deafness and Other Communication Disorders) grant R01DC017137 (MPI: DM and Jerzy P. Szaflarski). The data were previously collected as part of a project funded by US National Institutes of Health (National Institute on Deafness and Other Communication Disorders) grant R01DC000191 (PI: Myrna F. Schwartz).
Competing interests
The authors report no competing interests.
Supplementary material
Supplementary material is available at Brain Communications online.
Data availability
Anonymized behavioural data and lesion data are available from Daniel Mirman (dan@danmirman.org) subject to approval by the Einstein Healthcare Network Institutional Review Board.
References
- 1. Engelter ST, Gostynski M, Papa S, et al. Epidemiology of aphasia attributable to first ischemic stroke. Stroke. 2006;37(6):1379–1384. [DOI] [PubMed] [Google Scholar]
- 2. Flowers HL, Skoretz SA, Silver FL, et al. Poststroke aphasia frequency, recovery, and outcomes: A systematic review and meta-analysis. Arch Phys Med Rehabil. 2016;97(12):2188–2201.e8. [DOI] [PubMed] [Google Scholar]
- 3. Broca P. Remarques sur le siège de la faculté du langage articulé, suivies d’une observation d’aphémie (perte de la parole). Bulletins de la Société d’anatomie (Paris). 1861;6:330–357. [Google Scholar]
- 4. Lichtheim L. On aphasia. Brain. 1885;7:433–484. [Google Scholar]
- 5. Gordon JK. Factor analysis of spontaneous speech in aphasia. J Speech Lang Hear Res. 2020;63(12):4127–4147. [DOI] [PubMed] [Google Scholar]
- 6. Matchin W, Hickok G. The cortical organization of syntax. Cereb Cortex. 2020;30(3):1481–1498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Mirman D, Kraft AE, Harvey DY, Brecher AR, Schwartz MF. Mapping articulatory and grammatical subcomponents of fluency deficits in post-stroke aphasia. Cogn Affect Behav Neurosci. 2019;19(5):1286–1298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Nozari N, Faroqi-Shah Y. Investigating the origin of nonfluency in aphasia: A path modeling approach to neuropsychology. Cortex. 2017;95:119–135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Schwartz MF. What the classical aphasia categories can’t do for us, and why. Brain Lang. 1984;21(1):3–8. [DOI] [PubMed] [Google Scholar]
- 10. Caplan D. Aphasic syndromes. In: Heilman KM and Valenstein E, eds. Clinical neuropsychology. 5th ed. Oxford University Press; 2012:22–41. [Google Scholar]
- 11. Tremblay P, Dick AS. Broca and wernicke are dead, or moving past the classic model of language neurobiology. Brain Lang. 2016;162:60–71. [DOI] [PubMed] [Google Scholar]
- 12. Kasselimis DS, Simos PG, Peppas C, Evdokimidis I, Potagas C. The unbridged gap between clinical diagnosis and contemporary research on aphasia: A short discussion on the validity and clinical utility of taxonomic categories. Brain Lang. 2017;164:63–67. [DOI] [PubMed] [Google Scholar]
- 13. Crary MA, Wertz RT, Deal JL. Classifying aphasias: Cluster analysis of western aphasia battery and Boston diagnostic aphasia examination results. Aphasiology. 1992;6(1):29–36. [Google Scholar]
- 14. Ripamonti E, Frustaci M, Zonca G, Aggujaro S, Molteni F, Luzzatti C. Disentangling phonological and articulatory processing: A neuroanatomical study in aphasia. Neuropsychologia. 2018;121:175–185. [DOI] [PubMed] [Google Scholar]
- 15. Schwartz MF, Faseyitan OK, Kim J, Coslett HB. The dorsal stream contribution to phonological retrieval in object naming. Brain. 2012;135(12):3799–3814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Basilakos A, Rorden C, Bonilha L, Moser D, Fridriksson J. Patterns of poststroke brain damage that predict speech production errors in apraxia of speech and aphasia dissociate. Stroke. 2015;46(6):1561–1566. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Botezatu MR, Mirman D. Impaired lexical selection and fluency in post-stroke aphasia. Aphasiology. 2019;33:667––6688.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Dell GS, Lawler EN, Harris HD, Gordon JK. Models of errors of omission in aphasic naming. Cogn Neuropsychol. 2004;21(2):125–145. [DOI] [PubMed] [Google Scholar]
- 19. Chen Q, Middleton E, Mirman D. Words fail: Lesion-symptom mapping of errors of omission in post-stroke aphasia. J Neuropsychol. 2019;13:183–197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Stark BC, Basilakos A, Hickok G, Rorden C, Bonilha L, Fridriksson J. Neural organization of speech production: A lesion-based study of error patterns in connected speech. Cortex. 2019;117:228–246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Wilmskoetter J, Fridriksson J, Gleichgerrcht E, et al. Neuroanatomical structures supporting lexical diversity, sophistication, and phonological word features during discourse. NeuroImage: Clinical. 2019;24:101961. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Damasio AR. Aphasia. N Engl J Med. 1992;326(8):531–539. [DOI] [PubMed] [Google Scholar]
- 23. Saffran EM, Berndt RS, Schwartz MF. The quantitative analysis of agrammatic production: Procedure and data. Brain Lang. 1989;37(3):440–479. [DOI] [PubMed] [Google Scholar]
- 24. Matchin W, Basilakos A, Stark BC, den Ouden DB, Fridriksson J, Hickok G. Agrammatism and paragrammatism: A cortical double dissociation revealed by lesion-symptom mapping. Neurobiology of Language. 2020;1:208––2225.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Fedorenko E, Blank IA, Siegelman M, Mineroff Z. Lack of selectivity for syntax relative to word meanings throughout the language network. Cognition. 2020;203:104348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Caplan D, Michaud J, Hufford R. Dissociations and associations of performance in syntactic comprehension in aphasia and their implications for the nature of aphasic deficits. Brain Lang. 2013;127(1):21–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Basilakos A, Fillmore PT, Rorden C, Guo D, Bonilha L, Fridriksson J. Regional white matter damage predicts speech fluency in chronic post-stroke aphasia. Front Hum Neurosci. 2014;8(October):1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Dronkers NF, Plaisant O, Iba-Zizen MT, Cabanis EA. Paul broca’s historic cases: High resolution MR imaging of the brains of leborgne and lelong. Brain. 2007;130(5):1432–1441. [DOI] [PubMed] [Google Scholar]
- 29. Gajardo-Vidal A, Lorca-Puls DL, team P, et al. Damage to broca’s area does not contribute to long-term speech production outcome after stroke. Brain. 2021;144(3):817–832. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Halai AD, Woollams AM, Lambon Ralph MA. Using principal component analysis to capture individual differences within a unified neuropsychological model of chronic post-stroke aphasia: Revealing the unique neural correlates of speech fluency, phonology and semantics. Cortex. 2017;86:275–289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Rubinov M, Sporns O. Complex network measures of brain connectivity: Uses and interpretations. NeuroImage. 2010;52(3):1059–1069. [DOI] [PubMed] [Google Scholar]
- 32. Del Gaizo J, Fridriksson J, Yourganov G, et al. Mapping language networks using the structural and dynamic brain connectomes. Eneuro. 2017;4(5):ENEURO.0204–17.2017––17.2017.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Fan JM, Gorno-Tempini ML, Dronkers NF, Miller BL, Berger MS, Chang EF. Data-Driven, visual framework for the characterization of aphasias across stroke, post-resective, and neurodegenerative disorders over time. Front Neurol. 2020;11:616764:1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Landrigan JF, Zhang F, Mirman D. A data-driven approach to post-stroke aphasia classification and lesion-based prediction. Brain. 2021;144(5):1372–1383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Vermeulen J, Bastiaanse R, Van Wageningen B. Spontaneous speech in aphasia: A correlational study. Brain Lang. 1989;36(2):252–274. [DOI] [PubMed] [Google Scholar]
- 36. Casilio M, Rising K, Beeson PM, Bunton K, Wilson SM. Auditory-Perceptual rating of connected speech in aphasia. American Journal of Speech-Language Pathology. 2019;28(2):550–568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Wagenaar E, Snow C, Prins R. Spontaneous speech of aphasic patients: A psycholinguistic analysis. Brain Lang. 1975;2:281–303. [DOI] [PubMed] [Google Scholar]
- 38. Wilson SM, Hula WD. Multivariate approaches to understanding aphasia and its neural substrates. Curr Neurol Neurosci Rep. 2019;19(53):1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Mirman D, Thye M. Uncovering the neuroanatomy of core language systems using lesion-symptom mapping. Curr Dir Psychol Sci. 2018;27(6):455–461. [Google Scholar]
- 40. Wilson SM, Henry ML, Besbris M, et al. Connected speech production in three variants of primary progressive aphasia. Brain. 2010;133(7):2069–2088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Alyahya RSW, Halai AD, Conroy P, Lambon Ralph MA. A unified model of post-stroke language deficits including discourse production and their neural correlates. Brain. 2020;143(5):1541–1554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Ding J, Martin RC, Hamilton AC, Schnur TT. Dissociation between frontal and temporal-parietal contributions to connected speech in acute stroke. Brain. 2020;143(3):862–876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Dunn KJ, McCray G. The place of the bifactor model in confirmatory factor analysis investigations into construct dimensionality in language testing. Front Psychol. 2020;11:1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Reise SP, Moore TM, Haviland MG. Bifactor models and rotations: Exploring the extent to which multidimensional data yield univocal scale scores. J Pers Assess. 2010;92(6):544–559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Pustina D, Avants B, Faseyitan OK, Medaglia JD, Coslett HB. Improved accuracy of lesion to symptom mapping with multivariate sparse canonical correlations. Neuropsychologia. 2018;115:154–166. [DOI] [PubMed] [Google Scholar]
- 46. Mirman D, Landrigan JF, Kokolis S, Verillo S, Ferrara C, Pustina D. Corrections for multiple comparisons in voxel-based lesion-symptom mapping. Neuropsychologia. 2018;115:112–123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Thye M, Szaflarski JP, Mirman D. Shared lesion correlates of semantic and letter fluency in post-stroke aphasia. J Neuropsychol. 2021;15(1):143–150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Thye M, Mirman D. Relative contributions of lesion location and lesion size to predictions of varied language deficits in post-stroke aphasia. NeuroImage: Clinical. 2018;20(October):1129–1138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Fridriksson J, Guo D, Fillmore PT, Holland A, Rorden C. Damage to the anterior arcuate fasciculus predicts non-fluent speech production in aphasia. Brain. 2013;136(11):3451–3460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Dick AS, Garic D, Graziano P, Tremblay P. The frontal aslant tract (FAT) and its role in speech, language and executive function. Cortex. 2019;111:148–163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Mirman D, Strauss TJ, Brecher AR, et al. A large, searchable, web-based database of aphasic performance on picture naming and other tests of cognitive function. Cogn Neuropsychol. 2010;27(6):495–504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Thothathiri M, Kimberg DY, Schwartz MF. The neural basis of reversible sentence comprehension: Evidence from voxel-based lesion symptom mapping in aphasia. J Cogn Neurosci. 2012;24(1):212–222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Mirman D, Chen Q, Zhang Y, et al. Neural organization of spoken language revealed by lesion-symptom mapping. Nat Commun. 2015;6(6762):1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Avants BB, Schoenemann PT, Gee JC. Lagrangian frame diffeomorphic image registration: Morphometric comparison of human and chimpanzee cortex. Med Image Anal. 2006;10:397–412. [DOI] [PubMed] [Google Scholar]
- 55. Rochon E, Saffran EM, Berndt RS, Schwartz MF. Quantitative analysis of aphasic sentence production: Further development and new data. Brain Lang. 2000;72:193–218. [DOI] [PubMed] [Google Scholar]
- 56. Berndt RS, Wayland S, Rochon E, Saffran EM, Schwartz MF. Quantitative production analysis: A training manual for the analysis of aphasic sentence production. Psychology Press; 2000. [DOI] [PubMed] [Google Scholar]
- 57. Gordon JK. A quantitative production analysis of picture description. Aphasiology. 2006;20(2/3/4):188–204. [Google Scholar]
- 58. Kertesz A. Western aphasia battery-revised (WAB-R). Pro-Ed; 2006. [Google Scholar]
- 59. Roach A, Schwartz MF, Martin N, Grewal RS, Brecher AR. The Philadelphia naming test: Scoring and rationale. Clinical Aphasiology. 1996;24:121–133. [Google Scholar]
- 60. Bozeat S, Lambon Ralph MA, Patterson KE, Garrard P, Hodges JR. Non-verbal semantic impairment in semantic dementia. Neuropsychologia. 2000;38(9):1207–1215. [DOI] [PubMed] [Google Scholar]
- 61. Freedman ML, Martin RC. Dissociable components of short-term memory and their relation to long-term learning. Cogn Neuropsychol. 2001;18(3):193–226. [DOI] [PubMed] [Google Scholar]
- 62. Dell GS, Schwartz MF, Martin N, Saffran EM, Gagnon DA. Lexical access in aphasic and nonaphasic speakers. Psychol Rev. 1997;104(4):801–838. [DOI] [PubMed] [Google Scholar]
- 63. Nozari N, Kittredge AK, Dell GS, Schwartz MF. Naming and repetition in aphasia: Steps, routes, and frequency effects. J Mem Lang. 2010;63(4):541–559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Nozari N, Dell GS. How damaged brains repeat words: A computational approach. Brain Lang. 2013;126(3):327–337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Avants BB, Libon DJ, Rascovsky K, et al. Sparse canonical correlation analysis relates network-level atrophy to multivariate cognitive measures in a neurodegenerative population. NeuroImage. 2014;84:698–711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Zhang Y, Kimberg DY, Coslett HB, Schwartz MF, Wang Z. Multivariate lesion-symptom mapping using support vector regression. Hum Brain Mapp. 2014;35(12):5861–5876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. MATLAB [Computer software] . Version 9.9.0 (R2020b). Natick, MA, USA: MathWorks; 2020.
- 68. Griffis JC, Metcalf NV, Corbetta M, Shulman GL. Lesion quantification toolkit: A MATLAB software tool for estimating grey matter damage and white matter disconnections in patients with focal brain lesions. NeuroImage: Clinical. 2021;30:102639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Yeh FC, Panesar S, Fernandes D, et al. Population-averaged atlas of the macroscale human structural connectome and its network topology. NeuroImage. 2018;178:57–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Griffis JC, Metcalf NV, Corbetta M, Shulman GL. Structural disconnections explain brain network dysfunction after stroke. Cell Rep. 2019;28(10):2527–2540.e9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Rolls ET, Huang CC, Lin CP, Feng J, Joliot M. Automated anatomical labelling atlas 3. Neuroimage. 2020;206:116189. [DOI] [PubMed] [Google Scholar]
- 72. den Ouden DB, Malyutina S, Basilakos A, et al. Cortical and structural-connectivity damage correlated with impaired syntactic processing in aphasia. Hum Brain Mapp. 2019;40(7):2153–2173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Fridriksson J, den Ouden DB, Hillis AE, et al. Anatomy of aphasia revisited. Brain. 2018;141(3):848–862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Gleichgerrcht E, Fridriksson J, Rorden C, Bonilha L. Connectome-based lesion-symptom mapping (CLSM): A novel approach to map neurological function. NeuroImage: Clinical. 2017;16(July):461–467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Hula WD, Panesar S, Gravier ML, et al. Structural white matter connectometry of word production in aphasia: An observational study. Brain. 2020;143(8):2532–2544. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Marebwa BK, Fridriksson J, Yourganov G, Feenaughty L, Rorden C, Bonilha L. Chronic post-stroke aphasia severity is determined by fragmentation of residual white matter networks. Sci Rep. 2017;7(1):8188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77. Johnson JP, Meier EL, Pan Y, Kiran S. Pre-treatment graph measures of a functional semantic network are associated with naming therapy outcomes in chronic aphasia. Brain Lang. 2020;207:104809. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Kiran S, Meier EL, Johnson JP. Neuroplasticity in aphasia: A proposed framework of language recovery. J Speech Lang Hear Res. 2019;62(11):3973–3985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Griffis JC, Metcalf NV, Corbetta M, Shulman GL. Damage to the shortest structural paths between brain regions is associated with disruptions of resting-state functional connectivity after stroke. NeuroImage. 2020;210:116589. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80. Latora V, Marchiori M. Efficient behavior of small-world networks. Phys Rev Lett. 2001;87(19):198701. [DOI] [PubMed] [Google Scholar]
- 81. Revelle W. Psych: Procedures for Psychological, Psychometric, and Personality Research. 2009. https://CRAN.R-project.org/package = psych.
- 82. Jennrich RI, Bentler PM. Exploratory bi-factor analysis. Psychometrika. 2011;76(4):537–549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83. Savalei V. Understanding robust corrections in structural equation modeling. Structural Equation Modeling: A Multidisciplinary Journal. 2014;21(1):149–160. [Google Scholar]
- 84. Rosseel Y. Lavaan: An R package for structural equation modeling. J Stat Softw. 2012;; 48((1):1–36. [Google Scholar]
- 85. Thompson CK, Bastiaanse R. Introduction to agrammatism. In: Perspectives on agrammatism. Psychology Press; 2012:1–16. [Google Scholar]
- 86. Gordon JK, Clough S. How fluent? Part B. Underlying contributors to continuous measures of fluency in aphasia. Aphasiology. 2020;34(5):643–663. [Google Scholar]
- 87. Breedin SD, Saffran EM, Schwartz MF. Semantic factors in verb retrieval: An effect of complexity. Brain Lang. 1998;63(1):1–31. [DOI] [PubMed] [Google Scholar]
- 88. Gordon JK, Dell GS. Learning to divide the labor: An account of deficits in light and heavy verb production. Cogn Sci. 2003;27(1):1–40. [PubMed] [Google Scholar]
- 89. Hickok GS. Computational neuroanatomy of speech production. Nature Reviews Neuroscience. 2012;13(2):135–145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90. Geller J, Thye M, Mirman D. Estimating effects of graded white matter damage and binary tract disconnection on post-stroke language impairment. NeuroImage. 2019;189:248–257. [DOI] [PubMed] [Google Scholar]
- 91. Wilmskoetter J, Fridriksson J, Basilakos A, et al. Indirect white matter pathways are associated with treated naming improvement in aphasia. Neurorehabil Neural Repair. 2021;35(4):346–355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92. Lambon Ralph MA, Jefferies E, Patterson KE, Rogers TT. The neural and computational bases of semantic cognition. Nature Reviews Neuroscience. 2017;18(1):42–55. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Anonymized behavioural data and lesion data are available from Daniel Mirman (dan@danmirman.org) subject to approval by the Einstein Healthcare Network Institutional Review Board.