Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Jul 6.
Published in final edited form as: Hum Brain Mapp. 2018 Mar 12;39(7):2777–2785. doi: 10.1002/hbm.24039

Presurgical language fMRI: Clinical practices and patient outcomes in epilepsy surgical planning

Christopher FA Benjamin A,ϕ, Alexa X Li B, Hal Blumenfeld A, R Todd Constable A, Rafeed Alkawadri A, Stephan Bickel C, Christoph Helmstaedter D, Stefano Meletti E, Richard Bronen A, Simon K Warfield F, Jurriaan M Peters F, David Reutens G, Monika Połczyńska H, Dennis D Spencer A, Lawrence J Hirsch A
PMCID: PMC6033659  NIHMSID: NIHMS950689  PMID: 29528160

Abstract

The goal of this study was to document current clinical practice and report patient outcomes in presurgical language functional MRI (fMRI) for epilepsy surgery. Epilepsy surgical programs worldwide were surveyed as to the utility, implementation, and efficacy of language fMRI in the clinic; 82 programs responded. Respondents were predominantly from the US (61%) academic programs (85%), and evaluated adults (44%), adults and children (40%), or children only (16%). Nearly all (96%) reported using language fMRI. Surprisingly, fMRI is used to guide surgical margins (44% of programs) as well as language lateralization (100%). Sites using fMRI for localization most often use a distance margin around activation of 10mm. While considered useful, 56% of programs reported at least one instance of disagreement with other measures. Direct brain stimulation typically confirmed fMRI findings (74%) when guiding margins, but instances of unpredicted decline were reported by 17% of programs and unexpected preservation of function by 54%. Programs reporting unexpected decline did not clearly differ from those which did not. Clinicians using fMRI to guide surgical margins do not typically map known language-critical areas beyond Broca’s and Wernicke’s. This initial data shows many clinical teams are confident using fMRI not only for language lateralization but also to guide surgical margins. Reported cases of unexpected language preservation when fMRI activation is resected, and cases of language decline when it is not, emphasize a critical need for further validation. Comprehensive studies comparing commonly-used fMRI paradigms to predict stimulation mapping and post-surgical language decline remain of high importance.

Keywords: Epilepsy, presurgical, fMRI, language

1.1 Introduction

Neurosurgery is an effective and potentially curative treatment for temporal lobe epilepsy [Wiebe et al., 2001]. Surgical risk to language and memory can exclude a patient from treatment. As 34%–41% of left temporal patients undergoing focal resections experience a decline in naming [Busch et al., 2016; Sherman et al., 2011], determining the surgical risk to language remains essential.

While the Intracarotid Amobarbital Test ("Wada" testing) has been the gold standard for determining the language dominant hemisphere, functional magnetic resonance imaging (fMRI), which combines cognitive assessment with MRI, is non-invasive and can be accurate [Sabsevitz et al., 2003] and less costly [Medina et al., 2004]. The evidence supporting fMRI's validity was recently outlined [Szaflarski et al., 2017], with the conclusion that language fMRI is a valid alternative to Wada testing in most patients. One approach to incorporating fMRI in clinical decision making developed by Swanson and colleagues is shown in Figure 1 [Swanson et al., 2015]. In short, if language fMRI shows left hemisphere dominance and a patient has right hemisphere pathology, Wada testing may be deemed unnecessary for language lateralization. Given the significance of a right hemisphere language finding and the increased probability of discordance with Wada [Janecek et al., 2013], an argument can be made for repeating language fMRI for right dominance findings.

Figure 1. An algorithm for determining when to conduct fMRI or Wada testing.

Figure 1

This heuristic may be modified, for example, based on baseline memory function, and is based on a particular language task (Semantic Decision Making task; Binder et al., 1995). Reprinted from Swanson, Binder, Raghavan & Euler, "Handbook on the Neuropsychology of Epilepsy," 2015, p169, copyright, with permission from Springer Verlag Deutsch.

While different language fMRI tasks will yield differing maps and accuracy in predicting post-operative decline [Binder et al., 2008], good estimates of fMRI's validity in a tertiary epilepsy setting are available for a semantic decision making task. With this protocol, a key study found decline of more than two standard deviations in naming skill (relative to controls) could be predicted with 100% sensitivity and 73% specificity, with prediction superior to that using Wada (92% and 45%, respectively) [Sabsevitz et al., 2003]. Overall, 41% of variance in post-operative language skill was predicted by fMRI and the positive predictive value was 81% for fMRI and 67% for Wada. In 229 epilepsy patients, using a slightly different analysis, 80% were classified as left dominant using fMRI, of which 92% were also Wada left dominant (167/182; 15 bilateral) [Janecek et al., 2013]. fMRI bilateral cases (n=28) were typically left (46%) or bilateral (36%) on Wada, though occasionally (18%) right. fMRI right cases were most often also right on Wada (53%), though could be markedly discrepant (21% [n=4] left Wada; all right handed, right seizure foci). Explicitly examining cases of fMRI–Wada disagreement, this protocol has been shown to more accurately predict naming decline than Wada testing [Sabsevitz et al., 2003]. BOLD fMRI language maps are not validated for the routine drawing of boundaries around indispensable cortex (the removal of which is associated with language decline), however.

Importantly, fMRI is not validated for comprehensive language mapping to tailor surgical margins and avoid post-surgical language decline [Giussani et al., 2010; Szaflarski et al., 2017]. Clinical guidelines for fMRI use are available within radiology [American College of Radiology, 2014] and neuropsychology [Bobholz et al., 2004]. Recent surveys from the European Union's E-PILEPSY project showed 82% of European epilepsy programs use language fMRI [Mouthaan et al., 2016], with Wada being used more judiciously. The questions clinical teams ask of language fMRI, how the results are interpreted, and whether they improve clinical care, however, remain unclear. Whether current clinical practices are in keeping with current best evidence is also unknown. The goals of this study were to characterize how epilepsy programs use language fMRI in surgical planning and aggregate reports of patient outcomes that are otherwise likely to go unreported, to help inform future validation and study of clinical language fMRI.

2.1 Materials and methods

This study was approved and overseen by Yale Medical School's Institutional Review Board. All participants provided informed consent (Appendix A).

2.2 Survey

A survey focused on clinicians’ use of and experience with presurgical language fMRI was developed (Appendix A). Questions centered on identifying the dominant hemisphere ("lateralization"); identifying language areas to guide surgical margins ("localization"); how programs use fMRI; their confidence in results; and patient outcomes. A past survey of extraoperative mapping was used as a reference in design [Hamberger et al., 2014]. Clinicians from neurology, radiology and neurosurgery provided feedback on clarity and length. The research design was reviewed by Yale's Center for Analytic Sciences. A second, related survey of those acquiring fMRI data will be reported separately. The survey was presented via www.qualtrics.com. Questions were organized hierarchically with all respondents answering key questions which could elicit related questions. If a question was not answered a warning appeared, but the respondent could elect to continue without answering.

2.3 Procedure

2.3.1 Site identification

Within the US the major body formally accrediting epilepsy programs, the National Association of Epilepsy Centers (NAEC), provided details of all epilepsy programs. We invited all programs completing surgery (levels 3 or 4) and Wada or fMRI. Worldwide, prominent epilepsy organizations (e.g., ILAE members) and researchers were contacted. We asked individuals to invite other programs they knew, to increase sample size, using a modified "snowball sampling" approach [Goodman, 1961].

2.3.2 Data collection

The survey was open 07.17.2015–01.15.2016. We emailed invitations to all NAEC sites and followed up by telephone. We also contacted researchers, epilepsy organizations, and ILAE member committees worldwide requesting they forward an email invitation to relevant contacts. In 11.2015 we sent reminders and emailed the American Epilepsy Society (AES) listserv.

2.3.3 Final sample

82 surveys were received from respondents involved in selecting patients for epilepsy surgery. The number of responses per question is indicated in square brackets; e.g., [n=X]. Responses vary due to the study's hierarchical design; some specific responses elicited additional relevant questions. Data were cleaned; of note, two respondents answered only initial questions on their confidence in methods for language lateralization, and three reported no patients received fMRI. Detailed review of the 79% of responses for which an institution was identifiable showed one (1.5%) could reflect a duplicate response from a program. As it may have reflected a unique response and survey instructions emphasized provision of one clinical response (Supplement A), the data were not removed. A second pair was identified, but it was apparent that the respondents work in different programs, but with fMRI executed by the same individual. Respondent's country was estimated from stated location (per the survey or email) or IP address. Descriptive statistics are presented and compared using t and Fisher’s Exact tests.

Programs were typically university-affiliated (85%), evaluating a mean of 106 (SD 67; 10–300) patients annually and surgically treating a mean of 34 (SD 21, 0–100) [n=80 respondents]. Overall 84% evaluated adults and 56% children; specifically – 44% evaluated predominantly adults; 40% predominantly both adults and children; and 16% predominantly children (<18 years) [n=80]. Respondents who identified their background included neurologists (89%), neurosurgeons (6%) and neuropsychologists (5%) [n=66]. They were typically clinician-researchers (52%) or clinicians (45%) [n=65], and 93% reported direct involvement in deciding whether patients are offered surgery [n=68]. Just under half (46%) were surgical program directors [n=68]. Nine respondents reported they also collect, analyze and interpret the fMRI data.

Almost two thirds of respondents were from the USA (50); with further responses from Australia (6); Canada (3); France (3); Italy (3); Turkey (3); England (2); Germany (2); Denmark (1); Egypt (1); Georgia (1); Japan (1); Netherlands (1); Norway (1); Portugal (1); Scotland (1); Sweden (1); and Switzerland (1).

3.1 Results

Through the following results, the number of responses available for each question is indicated in square brackets at the end of the relevant sentence [e.g., n=X]. See 2.3.3 for discussion of hierarchical survey structure, data cleaning, and variable response numbers.

3.2 Methods used clinically for language lateralization

Nearly all respondents reported using fMRI (96% of programs), neuropsychological assessment (99%) and extraoperative stimulation mapping (93%) in evaluating language preoperatively [total respondents: n=80] (Figure 2). The majority of programs also use intraoperative stimulation mapping (ISM) (83%), with other methods each used by fewer than half of programs. Patients are most likely to receive neuropsychological assessment (93% of patients), fMRI (58%) or Wada testing (43%) [n=80] at programs using these methods. The three programs not using fMRI (two US, one worldwide) reported they did not have fMRI capabilities, or felt it was not necessary given (e.g.) availability of Wada. Clinicians reported high confidence that, with all methods at their disposal, they can identify a patient’s language dominant hemisphere (mean 92%, SD 8; 60–100) and localize language regions to guide surgical margins (mean 84%, SD 14, 30–100) [n=81].

Figure 2. Language mapping methods: Proportion of programs using (blue) and patients receiving (orange) each method (overall and by geographic region).

Figure 2

The proportion of patients receiving a measure is estimated based only on programs using that method–i.e., at the 83% of sites using ISM, on average 23% of patients receive ISM. Neuropsychological assessment (Npsych), functional MRI (fMRI), extraoperative stimulation mapping (ESM), intraoperative stimulation mapping (ISM), Wada testing (Wada), Magnetoencephalography (MEG), Cortico-cortical evoked potentials (CCEPS), transcranial magnetic stimulation (TMS), Gamma-activation mapping (Gamma). Question: “Please estimate the proportion of surgical candidates at your center who receive the following prior to surgery to clarify language organization.”

3.3 Clinicians’ opinions of language fMRI

Programs using fMRI reported confidence (73%, SD 18, range 1–100) in their program's ability to identify a patient’s language dominant hemisphere using fMRI alone [n=78]. They reported low confidence (45%, SD 27, 0–92) in their ability to use fMRI to identify specific language areas to guide surgical boundaries (i.e., localization) [n=77]. Accordingly, when considering the technique of fMRI generally they considered it reliable for language lateralization (81% confidence, SD 15 19–100) but less so for localization (i.e., identifying language areas to guide surgical boundaries; 48%, SD 25 0–93; t(74)=11.471 p<0.001) [n=76, 78]. When asked if language fMRI completed at different centers yields equivalent results, their response was neutral (score of 0 SD 2.5; confidence scale: strongly disagree to strongly agree [−5 to +5]).

3.4 Language lateralization with fMRI

All programs administering fMRI use it to identify the language dominant hemisphere [n=71]. Just under half of programs (46%) reported fMRI has never disagreed with other measures of laterality. A third of programs (34%) reported at least one instance of disagreement with Wada; a quarter (24%) with stimulation mapping; and 15% with other methods (e.g., MEG, semiology, TMS, neuropsychological testing) [n=74]. Multiple programs noted fMRI yields bilateral or equivocal findings more often than other methods, particularly in children.

In cases of disagreement fMRI was most often judged to have been incorrect (55%); a third reported cases where true lateralization remained unknown (29%); and a fifth (18%) reported cases where fMRI had been correct and the other modality was not [n=38]. Five programs reported cases of Wada-fMRI disagreement where fMRI was judged correct. None had been published, though one was included “in general in wider publications.” Of all programs reporting disagreement between fMRI and another measure, two (5%) had published these cases [n=39].

3.5 Guiding surgical margins (localization) with fMRI

Forty four percent of programs reported using fMRI to localize language [n=71]. Most centers using fMRI to localize language (guide surgical margins and preserve language cortex) seek to map known language areas (86%) [n=29]. These include Broca’s (86%) and Wernicke’s areas (76%). Other areas which can be language critical, "basal temporal language area" (BTLA; 17%) and other regions (21%), are mapped infrequently.

Programs that do not use fMRI to localize language (i.e., programs using fMRI only for language lateralization) less often seek to map known language regions (58%) (Fisher’s exact p<0.001) [n=40]. Specifically, these programs map Broca’s (43%; p<0.001) and Wernicke’s Areas (38%; p=0.003) less frequently than programs using fMRI for localization, and also rarely map BTLA (5%, p=0.122) or other regions (5%; p=0.062).

Clinicians using fMRI to guide surgical margins report success in mapping Broca’s 75% of the time (SD 15%, 39–93%) [n=25]; in mapping Wernicke’s 71% of the time (SD 16%, 30–92%) [n=22]; BTLA 62% of the time (SD 28%, 19–90%) [n=5] and other regions 52% of the time (SD 31%, 3–80%) [n=6]. The few who use fMRI for localization, but do not seek to identify specific language regions [n=4], find fMRI is rarely successful (27% of cases) in guiding margins.

Direct electrical stimulation at fMRI-positive cortex typically confirms fMRI findings (74% of the time; SD 15%, 30–95%), with most of these respondents (86%) reporting they specifically seek to map certain language regions [n=28]. Instances of post-operative language decline were reported with equivalent frequency (26%) by both programs using fMRI for lateralization only [n=27] and those using it for both lateralization and localization [n=23].

3.6 Cognitive outcomes when fMRI activation is preserved

When no fMRI-positive language cortex is resected, half of responding programs (49%) reported no cases of persisting language decline three months post-surgery [n=75]. Seventeen percent reported at least one otherwise unexplained instance of language decline following temporal [67%], frontal (50%) or less often parietal (33%) or occipital (8%) surgery [n=12]. Of note, most patients (70%) receive postoperative neuropsychological testing at most (96%) programs [n=79]. None of these cases of decline had been published. A third of programs did not know if cases of decline had occurred.

Programs noting cases of decline most often related these to surgical variables (100% reported at least one such case) and 23% reported instances where fMRI-related factors were considered responsible. Surgically, resection of white matter language tracts was most often noted (85%). Resection too close to fMRI-positive areas was also often reported (38%), as were surgical complications (23%); multiple subpial transections over eloquent cortex (15%); and surgical injury outside the planned resection area (15%). Resection in language association cortex and post-surgical inflammation was also noted, and 31% reported cases where the cause was unknown. With respect to fMRI-related factors, different programs attributed at least one instance of unanticipated deficits to imaging (e.g., artifact, insufficient resolution) (n=1); analysis (n=1); and patient-related factors (e.g., seizures, movement) (n=1).

In exploratory analysis, programs reporting cases of cognitive decline in spite of fMRI-active areas being preserved did not differ from those which did not on a range of variables (Appendix C). While a nonsignificant difference, in this sample programs reporting unexpected decline more often predominantly evaluated children (31% v 11%); were less likely to review the actual fMRI images at conference (69% v 89%); and less often had the individual involved in fMRI data analysis interpret data at surgical conference (38% v 59%).

3.7 Cognitive outcomes when fMRI activation is resected

Half of the programs using fMRI to guide surgical margins (54%) reported cases where patients maintained function after resection of language-positive sites [n=26]. These followed frontal (62%), temporal (54%), or parietal (23%) resections. Select sites noted surgery in the left operculum (bilaterally active), posterotemporal cortex (unspecified); right posterotemporal cortex (bilateral activation) and middle frontal gyrus. None had been published.

In exploratory analysis, these programs were less likely to use the Wada (in at least some instances) to evaluate language organization (Appendix C), while use of language fMRI was equivalent (61% (SD 25) vs 64% (SD 24) [n=14,12]). Other nonsignificant differences in this sample included programs reporting unexpected preservation more often loading images into an intraoperative system (64% vs 33%); using fMRI to localize language and guide surgical margins (57% vs 33%); and completing more surgeries annually (42 (SD 21) vs 34 (SD 16)).

Language outcomes at sites using (76%) and not using (24%) Wada testing were also compared post-hoc [n=80]. There was no difference in the proportion of sites reporting cases of persistent (>3 month) post-operative decline in function when fMRI-positive language sites were preserved (sites not using Wada: 20%, sites using Wada 29%; Fisher’s exact p=0.728) [n=15,35]. There was also no difference in cases of maintained pre-operative ability when fMRI-positive cortex was resected, though there was a trend towards sites not using the Wada to more often report preserved post-operative function (not using Wada: 86%, using Wada 42%; Fisher’s exact p=0.081) [n=7,19].

3.8 Geographic variation in methods

Based on previous data showing geographic differences in Wada use within and outside the United States, we completed post-hoc analyses evaluating geographic differences in key study findings. Significantly fewer respondents from non-US programs reported using the Wada (non-US 50%, US 92%; Fisher's exact p<0.000) [n=32, n=48], MEG (non-US 28%, US 63%, p=0.003), and Gamma mapping (non-US 3%, US 25%, p=0.012) to clarify language organization (Figure 2). Programs outside the US were significantly more likely to use fMRI for language localization (61%) as compared with those within the US (33%; χ2=5.465, p<0.05) [n=28, n=43]. There were no such differences in the proportion of sites reporting fMRI had disagreed with other methods of determining language laterality (non-US 48%, US 58%; χ2=0.641, p=0.423) [n=29, n=45]; in reports of patients experiencing persistent post-operative language decline when all fMRI-positive language sites were preserved (non-US 33%, US 19%; χ2=1.29, p=0.256) [n=24, n=26]; in unexpected language preservation when fMRI-positive language cortex was resected (non-US 64%, US 42%; Fisher's exact p=0.431) [n=14, n=12]; or in the number of programs identifying any language areas beyond Broca's and Wernicke's cortex (non-US 24%; US 25%; Fisher's exact p=1.0) [n=17, n=16].

3.9 How clinicians interpret fMRI maps

Teams who routinely use fMRI to guide surgical margins most often constrain resections to a given distance from fMRI activation (59% of programs) [n=29]. The distance threshold used by programs varied from 3–50mm (average 15mm, SD 12). The most commonly-used margin was 10mm (42% of programs). Programs often (28%) will not operate in the same gyrus as activation or will use other criteria (28%), most often routine investigation of activation with direct stimulation. These programs most often report they would resect fMRI-positive cortex in at least some circumstances (79%), for instance if cleared by direct stimulation (62%); if activation was not anatomically consistent with a language area (38%); if deficits would likely be temporary (e.g., uncomplicated unilateral SMA resection) (38%); or if the patient was willing to accept post-surgical deficits (24%) [n=29]. One site noted they would resect any fMRI-positive activation “not [in] a primary language site (Broca/Wernicke)” (this site reported both unpredicted decline and unexpected preservation), while one site was uncertain, and 17% reported they would never resect cortex that was fMRI language-positive.

To interpret fMRI, teams most often review the images (visually) at surgical conference (86% of programs) and the team or referrer reviews a written report (71%) (n=79). The individual involved in analysis often interprets the data for the team (63%). Laterality indices are rarely used (whole brain, 28%; specific region, 18%) and a third of programs load data into an intraoperative mapping system (35%).

Respondents expressed confidence that fMRI’s ability to lateralize language will improve with further technical advances (mean=89%, SD 12) though were less confident about future advances improving fMRI’s utility for localization (mean=64%, SD 26%; n=77) (t(75)=8.831, p<0.001).

4.1 Discussion

These results confirm fMRI is well established for language lateralization, and in contrast to expectations is already used in many epilepsy programs to guide surgical margins. Consistent with the research literature,6 most clinical programs using language fMRI (54%) reported instances where fMRI language lateralization had conflicted with other methods including the Wada (34%). In these situations it is rarely clear which method is “correct,” though when they could do so respondents judged fMRI incorrect more often (55%). These findings may suggest the fMRI tasks used by different programs vary in their ability to predict language outcome; reflect a decline in language not captured by visual naming tasks [Hamberger and Seidel, 2003]; or represent recall bias and clinicians' uncertainty regarding fMRI.

While fMRI is not validated for language localization, programs have already begun to cautiously use it for this purpose (44%), typically removing fMRI-positive cortex only after confirmation via direct stimulation, counseling patients on possible decline, or never removing fMRI-positive cortex. This is consistent with clinicians' reports in the literature that they consider language fMRI appropriate "to guide surgical margins and preserve language cortex" and that "fMRI… can be used with the same results as awake craniotomy” [Ganslandt et al., 2016]. This emphasizes the importance of further validating highly standardized and easily replicable forms of fMRI [Mouthaan et al., 2016]. Cases of unexpected preservation and unexpected decline are equally concerning; the former may increase post-surgical cognitive impairment while the latter may increase surgical failure rates [Jakola et al., 2011]. Until validated, any criteria used to "preserve" functional cortex relative to fMRI map boundaries in surgery are arbitrary. Excluding patients based on fMRI results regardless of direct cortical stimulation findings, as reported by 17% of programs, may unnecessarily restrict resections and decrease the probability that patients achieve seizure freedom.

Unlike researchers, clinicians interpreting fMRI often appear unaware that rather than showing the immutable boundaries of language-critical cortex, fMRI language maps reveal the probable location of language areas. They are likely also unaware that language maps reflect assumptions made by the individuals who selected the cognitive task and imaging sequences, and analyzed and reported the data. These assumptions can, of course, dramatically change map boundaries. Examples are shown in Figure 3. Researchers presenting findings to clinical teams may improve patient care by helping their colleagues understand that the apparent boundaries of language areas will vary with a patient's language skill (Figure 3A); analysis parameters (3B); signal loss not evident in the final map (3C); the task and control conditions used (3D); and the areas the analyst is asked (or not asked) to identify (3E). Just as importantly, the fact that language fMRI represents task-correlated changes in blood oxygenation (rather than neural activation) and may show non-critical regions may be emphasized [Polczynska et al., 2014].

Figure 3. Language maps will vary in clinically meaningful ways due to multiple variables.

Figure 3

Surgical teams can manage these factors by using experts in both imaging (e.g., radiology) and cognition (e.g., neuropsychology) in clinical fMRI design, analysis and interpretation.

A: Language skill. A patient's language ability will change their activation map. Maps using the same tasks in Farsi and English from a patient who reported fluency in, and made medical decisions in English.

B: Data analysis. Each analysis step changes the map. Data "smoothing" removes noise. Whether it is appropriate, and to what degree, is debated. The degree of smoothing in commercial software may be unspecified. Identical analysis without (left) and with (right) smoothing (8mm kernel).

C: Data quality. A statistical map (left) does not show where raw data are missing (right, asterisks). These areas will not be active even if they are language critical. This map (left) was presented to a surgical team for surgical planning without caveat.

D. Cognitive task. Different language tasks give different maps. Subtle changes in task instructions, patient motivation and cognitive strategy change language maps. (D1) Visual object, (D2) text reading and (D3) auditory tasks are shown as well as (D4) the intersection of these.

E: Analyst expectations. The analyst's perceived goal will change the activation map. Two overlaid maps (red; yellow) generated independently by two clinicians for the same patient (see Benjamin et al., 2017).[Benjamin et al., 2017] Analysts were blind to case details. One prioritized frontal (red) and the other temporal (yellow) regions, as when mapping frontal tumor vs temporal lobectomy cases. Overlap in orange.

3E is reprinted from Benjamin et al., "Presurgical language fMRI..," Human Brain Mapping (2017); Creative Commons Attribution-Non-Commercial License.

The role of the clinician in defining the results is of particular importance given clinicians' current focus on Broca's and Wernicke's but not other known language-critical regions (e.g., BTLA, Exner's area, SMA) [Anderson et al., 1990; Benjamin et al., 2017; Krainik et al., 2003; Krauss et al., 1996; Roux et al., 2009]. More sophisticated models of language have been suggested for well over a century in the clinical literature [e.g., see Benjamin et al., 2017 for a review] and have been a recent focus in the cognitive literature [e.g. Tremblay & Dick, 2016]. Of particular relevance to clinicians is a compelling recent reanalysis and reinterpretation of the nature of Wernicke's area [Binder, 2015], which suggests discrete bilateral and unilateral components in typically organized individuals [see also Price, 2012], as and the suggestion that more sophisticated tasks (e.g., evaluating grammar) may better map language regions such as the angular gyrus [Polczynska et al., 2017].

In a clinical setting, many of these critical factors are obscured when tasks and analysis packages from commercial MR vendors are used. US clinicians are not mandated to use FDA approved software, and delegating these considerations is risky when perhaps the best validated language protocols and analysis software to date are open-source and freely available (Appendix B). The clinician is ultimately responsible for their results. The skills required for clinical fMRI (selecting sequences and cognitive paradigms; analyzing and interpreting data) do not fit within one existing discipline, and this represents an opportunity for research neuroscientists to improve clinical care. When a "gold standard" for language fMRI develops, it will likely approximate the interdisciplinary Wada protocol with a team of at least two qualified professionals from a selection of radiology, neuropsychology, neuroscience, and/or engineering. Expertise in the clinical, imaging and cognitive skills required for fMRI, rather than an individual's discipline, will likely best predict quality of care. This is already acknowledged in US billing codes for fMRI, that allow for psychologists and medical doctors to complete and bill fMRI.

The above discussion fits with the observation that, in our sample, clinical programs that reported unexpected decline were less likely to review the data in surgical conference (69% v 89%) and less likely to have the individual who completed analysis interpret data at conference (38% v 59%; though note that neither reached statistical significance). The greater use of the Wada for language lateralization by programs not reporting unexpected decline, despite their using language fMRI as regularly as other programs, may reflect a more cautious supplementation of fMRI with Wada when results are equivocal or uncertain.

4.2 Limitations

A limitation of this report is our inability to relate these findings to the specific language tasks used. We attempted to make this link by asking respondents to have those acquiring and analyzing fMRI at their program complete a paired survey (see Appendix A). Respondents rarely did so, potentially due to the time burden of this survey (often 15–20 minutes). Thus this project provides a valuable broad overview of how clinicians are using fMRI, and future large-scale research with primary data and multiple paradigms remains critical to inform the relationship between different tasks, language outcomes, and the results of direct cortical stimulation mapping. Of note, variation in protocols is likely to influence overall network lateralization less than regional localization. It is possible respondents' recall and reports are imperfect, and it is not possible for us to judge what reports of unexpected decline or preservation are based on. Of particular note, in considering patient outcomes some respondents may have used formal outcome data while others will have relied on their memory, so that some responses may variably reflect objective outcomes or clinicians' beliefs. These data better represent American (61%) and academic (85%) epilepsy programs. To increase our response rate, collaborators forwarded the survey to numerous colleagues to maximize our sample size. This meant that we were not able to identify specifically how many sites were offered the opportunity to participate, however, obscuring the true response rate. As an estimate, we contacted 221 US NAEC programs and received 50 US responses, suggesting a 23% response rate (using this approach Hamberger et al. [2014] received 39 US responses). Note also that our prioritization of respondents' anonymity makes it difficult to identify duplicate responses, though fewer than 2% of identifiable responses were duplicates, and any duplicates will likely both be from neurologists (89% of respondents). A major limitation of this work, however, is that the results here include data from only one primarily Asian site (Japan), and do not include data from other Asian programs. Future work might specifically target programs in these regions to ensure their representation. The survey length will also have decreased responses. Regardless, with these flaws in mind, our respondents’ data provides valuable, detailed data for the first time on how fMRI is applied in the clinic and moves that can be taken to improve its use.

4.3 Conclusions

Clinical fMRI is widely used to predict language laterality and post-surgical language change. In contrast to research evidence, it is also used for guiding surgical margins, which may lead to either unnecessary caution or over-confidence in surgery. The caveats documented in the literature–e.g., occasional disagreement with Wada–are seen in the clinic. Outcomes can likely be improved through use of existing, well validated tasks. Our data, from centers using a range of tasks and methods, emphasize that cautious use of language fMRI for lateralization is warranted and that fMRI maps cannot simply be treated as representing language-critical cortex. Without standardization and explicit validation, any criteria for preserving language-critical cortex relative to fMRI map boundaries are arbitrary. We suggest an initial minimum for clinical care might involve ensuring those who analyze a program's language fMRI, and understand the task's cognitive design, interpret the data in 3D in discussion with the team at conference. They might begin by reviewing the task itself (expected activation), the best estimates of its sensitivity and specificity, and the limitations of the specific results (patient factors, areas of signal loss). This will reduce opportunity for misinterpretation and likely improve patient care.

Supplementary Material

Supplement A

Supplement A: The survey questions completed by respondents.

Supplement B

Supplement B: The semantic decision-making language task discussed in text is available directly from Jeff Binder. A version of the task, using the same stimuli within "Presentation" software, is also freely available at cogneuro.net/hbm2018. The analysis software noted includes (is not limited to) SPM (www.fil.ion.ucl.ac.uk/spm/) and FSL (fsl.fmrib.ox.ac.uk/fsl) are excellent, open, and well documented software packages for fMRI analysis that are widely used that are continually refined and improved. Note on FDA approval and fMRI software: In the US, for clinical care, the clinician completing analysis is responsible for determining which software they use, and how they use it. Their choice is not constrained to US Food and Drug Agency (FDA) approved products; the FDA constrains what the marketers of software state the software should be used for. That is, if software is FDA approved for a given purpose, the FDA has judged it can be used for this purpose.

Supplement C

Supplement C: Characteristics of programs reporting unexpected language decline and unexpected language preservation. Exploratory analysis. Note that sample size varies as a function of the number of respondents answering all required questions. Unexpected decline: All respondents were asked “So far as you know, have any patients experienced a persistent (>3 month) postoperative language decline in spite of your preserving all fMRI-positive language sites?” (Q86). Unexpected preservation: The question “as far as you know, have any patients maintained pre-operative language ability despite resected fMRI-positive language cortex?” (Q90) was shown to all respondents who did not report they would never remove fMRI-identified, language positive sites (Q85). DCS = Direct Cortical Stimulation mapping.

Acknowledgments

This article is dedicated to our respondents; thank you for donating the time to make this work possible.

Study funding

This work was supported by Yale CTSA [UL1TR000142] from the National Center for Advancing Translational Science (NCATS), National Institutes of Health USA; and the Swebilius Foundation.

Footnotes

None of the authors has any conflict of interest to disclose.

References

  1. American College of Radiology. ACR–ASNR–SPR Practice Parameter for the Performance of Functional Magnetic Resonance Imaging (fMRI) of the Brain. Amended 2014 (Resolution 39) 2014 [Google Scholar]
  2. Anderson SW, Damasio AR, Damasio H. Troubled letters but not numbers. Brain. 1990;113:749–766. doi: 10.1093/brain/113.3.749. [DOI] [PubMed] [Google Scholar]
  3. Benjamin CF, Walshaw PD, Hale K, Gaillard WD, Baxter LC, et al. Presurgical language fMRI: Mapping of six critical regions. Human Brain Mapping. 2017;38:4239. doi: 10.1002/hbm.23661. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Binder Jeffrey R. The Wernicke area Modern evidence and a reinterpretation. Neurology. 2015;85.24:2170–2175. doi: 10.1212/WNL.0000000000002219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Binder JR, Swanson SJ, Hammeke TA, Sabsevitz DS. A comparison of five fMRI protocols for mapping speech comprehension systems. Epilepsia. 2008;49:1980–1997. doi: 10.1111/j.1528-1167.2008.01683.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bobholz J, Bilder B, Bookheimer S, Cole M, Mirsky A, Pliskin N, Rao S, Ricker J, Saykin A, Sweeney J, et al. Official position of the division of clinical neuropsychology (APA division 40) on the role of neuropsychologists in clinical use of fMRI. Clin Neuropsychol. 2004;18:349–351. doi: 10.1080/1385404049088718. [DOI] [PubMed] [Google Scholar]
  7. Busch RM, Floden DP, Prayson B, Chapin JS, Kim KH, Ferguson L, Bingaman W, Najm IM. Estimating risk of word-finding problems in adults undergoing epilepsy surgery. Neurology. 2016;87:2363–2369. doi: 10.1212/WNL.0000000000003378. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Ganslandt O, Nimsky C, Buchfelder M, Grummich P. fMRI in Neurosurgery. In: Filippi M, editor. fMRI Techniques and Protocols. 2. New york: Springer; 2016. pp. 801–815. [Google Scholar]
  9. Giussani C, Roux FE, Ojemann J, Sganzerla E Pietro, Pirillo D, Papagno C. Is preoperative functional magnetic resonance imaging reliable for language areas mapping in brain tumor surgery? Review of language functional magnetic resonance imaging and direct cortical stimulation correlation studies. Neurosurgery. 2010;66:113–120. doi: 10.1227/01.NEU.0000360392.15450.C9. [DOI] [PubMed] [Google Scholar]
  10. Goodman LA. Snowball Sampling. Ann Math Stat. 1961;32:148–170. [Google Scholar]
  11. Hamberger MJ, Williams AC, Schevon Ca. Extraoperative neurostimulation mapping: Results from an international survey of epilepsy surgery programs. Epilepsia. 2014;55:933–939. doi: 10.1111/epi.12644. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Hamberger MJ, Seidel WT. Auditory and visual naming tests: normative and patient data for accuracy, response time, and tip-of-the-tongue. J Int Neuropsychol Soc. 2003;9:479–489. doi: 10.1017/s135561770393013x. [DOI] [PubMed] [Google Scholar]
  13. Jakola AS, Unsgård G, Solheim O. Quality of life in patients with intracranial gliomas: the impact of modern image-guided surgery. J Neurosurg. 2011;114:1622–1630. doi: 10.3171/2011.1.JNS101657. [DOI] [PubMed] [Google Scholar]
  14. Janecek J, Swanson SJ, Sabsevitz DS, Hammeke TA, Raghavan M, Rozman M, Binder JR. Language lateralization by fMRI and Wada testing in 229 epilepsy patients: Rates and predictors of discordance. Epilepsia. 2013;54:314–22. doi: 10.1111/epi.12068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Krainik A, Lehéricy S, Duffau H, Capelle L, Chainay H, et al. Postoperative speech disorder after medial frontal surgery: role of the supplementary motor area. Neurology. 2003;60:587–94. doi: 10.1212/01.wnl.0000048206.07837.59. [DOI] [PubMed] [Google Scholar]
  16. Krauss GL, Fisher R, Plate C, Hart J, Uematsu S, Gordon B, Lesser RP. Cognitive effects of resecting basal temporal language areas. Epilepsia. 1996;37:476–483. doi: 10.1111/j.1528-1157.1996.tb00594.x. [DOI] [PubMed] [Google Scholar]
  17. Medina LS, Aguirre E, Bernal B, Altman NR. Functional MR imaging versus Wada test for evaluation of language lateralization: cost analysis. Radiology. 2004;230:49–54. doi: 10.1148/radiol.2301021122. http://www.ncbi.nlm.nih.gov/pubmed/14695386. [DOI] [PubMed] [Google Scholar]
  18. Mouthaan BE, Rados M, Barsi P, Boon P, Carmichael DW, et al. Current use of imaging and electromagnetic source localization procedures in epilepsy surgery centers across Europe. Epilepsia. 2016;57:770–776. doi: 10.1111/epi.13347. [DOI] [PubMed] [Google Scholar]
  19. Polczynska M, Curtiss S, Walshaw P, Siddarth P, Benjamin C, Moseley BD, Vigil C, Jones M, Eliashiv D, Bookheimer S. Grammar tests increase the ability to lateralize language function in the Wada test. Epilepsy Res. 2014;108:1864–1873. doi: 10.1016/j.eplepsyres.2014.09.014. [DOI] [PubMed] [Google Scholar]
  20. Połczyńska M, Japardi K, Curtiss S, Moody T, Benjamin C, Cho A, Vigil C, Kuhn T, Jones M, Bookheimer S. Improving language mapping in clinical fMRI through assessment of grammar. Neuroimage: Clinical. 2017a;15:415–427. doi: 10.1016/j.nicl.2017.05.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Price Cathy J. A review and synthesis of the first 20years of PET and fMRI studies of heard speech, spoken language and reading. Neuroimage. 2012;62.2:816–847. doi: 10.1016/j.neuroimage.2012.04.062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Roux FE, Dufor O, Giussani C, Wamain Y, Draper L, Longcamp M, Démonet JF. The graphemic/motor frontal area Exner’s area revisited. Ann Neurol. 2009;66:537–545. doi: 10.1002/ana.21804. [DOI] [PubMed] [Google Scholar]
  23. Sabsevitz DS, Swanson SJ, Hammeke Ta, Spanaki MV, Possing ET, Morris GL, Mueller WM, Binder JR. Use of preoperative functional neuroimaging to predict language deficits from epilepsy surgery. Neurology. 2003;60:1788–92. doi: 10.1212/01.wnl.0000068022.05644.01. [DOI] [PubMed] [Google Scholar]
  24. Sherman EMS, Wiebe S, Fay-Mcclymont TB, Tellez-Zenteno J, Metcalfe A, Hernandez-Ronquillo L, Hader WJ, Jetté N. Neuropsychological outcomes after epilepsy surgery: Systematic review and pooled estimates. Epilepsia. 2011;52:857–869. doi: 10.1111/j.1528-1167.2011.03022.x. [DOI] [PubMed] [Google Scholar]
  25. Swanson SJ, Binder JR, Raghavan M, Euler M. Functional MRI in the presurgical epilepsy evaluation. In: Barr W, Morrison C, editors. Handbook on the Neuropsychology of Epilepsy. New York: Springer; 2015. pp. 169–194. [Google Scholar]
  26. Szaflarski JP, Gloss D, Binder JR, Gaillard WD, Golby AJ, et al. Practice guideline summary: Use of fMRI in the presurgical evaluation of patients with epilepsy. Neurology. 2017;88(4):395–402. doi: 10.1212/WNL.0000000000003532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Tremblay P, Dick AS. Broca and Wernicke are dead, or moving past the classic model of language neurobiology. Brain and language. 2016;162:60–71. doi: 10.1016/j.bandl.2016.08.004. [DOI] [PubMed] [Google Scholar]
  28. Wiebe S, Blume WT, Girvin JP, Eliasziw M. A randomized, controlled trial of surgery for temporal-lobe epilepsy. N Engl J Med. 2001;345:311–318. doi: 10.1056/NEJM200108023450501. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement A

Supplement A: The survey questions completed by respondents.

Supplement B

Supplement B: The semantic decision-making language task discussed in text is available directly from Jeff Binder. A version of the task, using the same stimuli within "Presentation" software, is also freely available at cogneuro.net/hbm2018. The analysis software noted includes (is not limited to) SPM (www.fil.ion.ucl.ac.uk/spm/) and FSL (fsl.fmrib.ox.ac.uk/fsl) are excellent, open, and well documented software packages for fMRI analysis that are widely used that are continually refined and improved. Note on FDA approval and fMRI software: In the US, for clinical care, the clinician completing analysis is responsible for determining which software they use, and how they use it. Their choice is not constrained to US Food and Drug Agency (FDA) approved products; the FDA constrains what the marketers of software state the software should be used for. That is, if software is FDA approved for a given purpose, the FDA has judged it can be used for this purpose.

Supplement C

Supplement C: Characteristics of programs reporting unexpected language decline and unexpected language preservation. Exploratory analysis. Note that sample size varies as a function of the number of respondents answering all required questions. Unexpected decline: All respondents were asked “So far as you know, have any patients experienced a persistent (>3 month) postoperative language decline in spite of your preserving all fMRI-positive language sites?” (Q86). Unexpected preservation: The question “as far as you know, have any patients maintained pre-operative language ability despite resected fMRI-positive language cortex?” (Q90) was shown to all respondents who did not report they would never remove fMRI-identified, language positive sites (Q85). DCS = Direct Cortical Stimulation mapping.

RESOURCES