Abstract
The classic mapping of distinct aspects of working memory (WM) to mutually exclusive brain areas is at odds with the distributed processing mechanisms proposed by contemporary network science theory. Here, we use machine-learning to determine how aspects of WM are dynamically coded in the human brain. Using cross-validation across independent fMRI studies, we demonstrate that stimulus domains (spatial, number and fractal) and WM processes (encode, maintain, probe) are classifiable with high accuracy from the patterns of network activity and connectivity that they evoke. This is the case even when focusing on ‘multiple demands’ brain regions, which are active across all WM conditions. Contrary to early neuropsychological perspectives, these aspects of WM do not map exclusively to brain areas or processing streams; however, the mappings from that literature form salient features within the corresponding multivariate connectivity patterns. Furthermore, connectivity patterns provide the most precise basis for classification and become fine-tuned as maintenance load increases. These results accord with a network-coding mechanism, where the same brain regions support diverse WM demands by adopting different connectivity states.
Early neuropsychological studies suggested that different aspects of working memory (WM) are exclusively associated with specific brain areas. Here, the authors show, using machine-learning analysis of fMRI, how WM processes are dynamically coded by large-scale overlapping networks in the human brain.
Introduction
It is well-established that human working memory (WM) is behaviourally complex, comprising dissociable systems for temporarily maintaining different types of information in mind1,2 and mechanisms for selectively attending and transforming that information3,4. A major aim of early functional imaging research was to map these distinct aspects of WM onto the brain. Initially, a localist framework was applied, which sought to understand WM in terms of discrete brain circuits, where each neuroanatomical component was ascribed a specific function5,6. WM stages, including encoding7, maintenance8 and probe9, WM demands, including maintenance load10–13 and mental manipulation7,14, and WM domains, including spatial, object or numeric5,15,16, were attributed to dedicated brain regions and processing streams17.
More recently, advances in connectivity methods have motivated a paradigm shift towards a network-coding perspective, where cognitive processes are considered emergent properties of interactions that occur across different widespread coalitions of brain regions18–24. From this perspective, the localist framework is too simplistic because WM functions map to underlying brain systems in a many-to-many as opposed to one-to-one manner25,26. This notion of network coding has gained rapid support26,27; however, it remains unclear how aspects of WM that are known to be behaviourally distinct1–4 map within the multivariate network space. Furthermore, there is a need to reconcile the network perspective with the robust evidence of mutually exclusive localist mappings of WM functions, as provided by the early-brain imagining literature5–17.
We addressed these issues by developing a multivariate machine-learning pipeline to examine how patterns of brain activity and connectivity dynamically changed during the performance of two novel fMRI tasks that were designed to probe distinct aspects of WM function9,28–31. We tested key predictions of the network perspective while seeking to understand how evidence of classic localist mappings can coexist with it. To avoid bias all analyses were data driven and to ensure reproducibility the results were robustly confirmed across datasets from independent studies. We predicted that: (1) No brain region or connection would be involved exclusively in one or other aspect of WM. (2) WM stages (encode, maintain, or probe) and domains (spatial, number or fractal) would be reliably classifiable from the multivariate patterns of brain activity and connectivity that they evoked. (3) Classification accuracies for these aspects of WM would be significant even when focusing on ‘Multiple Demand Cortex’, i.e., the set of brain regions that is most consistently active across cognitive conditions32,33. (4) Classic localist mappings such as dorsolateral and left lateralised frontal cortex involvement in spatial and number WM would be evident as non-exclusive features within these multivariate patterns. (e) Conditions that affect WM difficulty, e.g., maintenance load, would relate most closely to network dynamics11–13 as opposed to regional brain activity34,35. Our results strongly support the network-coding perspective, provide new insights into the underlying mechanisms of WM and demonstrate how models of the early localist literature may be reconciled.
Results
Task design and behavioural results
The behavioural task designs are detailed in Fig. 1. In brief, during Study 1, participants were presented with arrays containing numbers and fractals displayed at a subset of spatial locations within a 4 × 4 grid. They had 10 s to encode the items from one of these three-dimensions and then maintained that information for a further 10 s delay. Subsequently, a probe array was displayed that differed with respect to one number, one fractal-pattern and one location. Participants identified the non-matching item from the maintained domain. A 10 s inter-stimulus-interval (ITI) separated the next trials. The number of items from each dimension (WM load) was 3, 5 or 7.
Accuracies (Supplementary Tables 1, 2 and Supplementary Figure 1) were examined in a 3 × 3 (load × domain) repeated-measures ANOVA. Mean accuracy was 91.13% ± 6.31 SDs. There was a significant main effect of load (F(2,36) = 11.85 p < 0.001) with lower accuracy at high load (high < low t = 4.859, p < 0.001; high < medium t = −3.03, p < 0.018). The main effect of domain and load × domain interaction was non-significant. Reaction times (RT) (Supplementary Tables 3, 4 and Supplementary Figure 2) were examined in a model of the same structure. Median reaction times (RT) (correct trials only) were 3.637 s ± 1.54 SDs. There were significant main effects of load and domain (load F(2,36) = 168.59, p < 0.0001; domain F(2,36) = 24.5, p < 0.0001), and a significant load × domain interaction (F(4,72) = 4.508, p < 0.005). RTs increased with load (t low < med = −10.7, p < 0.0001; t low < high = −16.3, p < 0.0001). Fractal and number trials were slower than location trials (t location < number = −5.38, p < 0.001; t location < fractal = −5.88, p < 0.001).
Study 2 used the same task design, except that (1) only numbers were displayed on the grid (2) there were two levels of WM load (3 & 6) and (3) a retro-cue indicating either ‘maintain’ or ‘manipulate’ preceded the maintenance period. In the manipulate condition, participants transformed the information by spatially rotating the positions 90 degrees clockwise or incrementing each number by 3. The retro cue was pseudo-randomised to ensure that encoding demands matched across the maintain and manipulate trials.
Accuracies (Supplementary Tables 5, 6 and Supplementary Figure 3) were examined in a 2 × 2 × 2 (manipulation × domain × load) repeated-measures ANOVA. Mean accuracy was 84.24% ± 8.5 SDs. There was a significant main effect of load (F(1,15) = 43.2, p < 0.0001) and a significant load × domain interaction (F(1,15) = 5.29, p < 0.04) with accuracy for number trials lower during high-load. RTs (Supplementary Tables 7, 8 and Supplementary Figure 4) were examined in a model of the same structure. Median reaction times (RT) (correct trials only) were 3.937 s ± 1.26 SDs. There was a significant main effect of load and a significant load × domain interaction (F(1,15) = 74.26, p < 0.0001; F(1,15) = 17.05, p < 0.001). RTs increased as a function of load (t = 8.60, p < 0.0001), and RTs for number and location trials under low load were slower relative to high load (t = 9.17, p < 0.001; t = 4.7, p < 0.001). There were no significant effects of manipulation and no other significant interactions. Together, these results show that participants were able to perform both tasks with high accuracy and that there were the expected costs of WM load on performance.
Mapping multiple demand brain regions
Single subject general linear models capturing voxel-wise changes in activity were constructed using a mini-block design (Fig. 1e), in which each stage (e.g., cue, encode, maintain, probe) of each trial was captured by a separate predictor, alongside nuisance covariates. Conjunction analyses36 (i.e. logical AND, Fig. 2a, d) were conducted to identify regions that were consistently active across different WM conditions (i.e. stimuli domains and processing stages). To identity brain areas that were jointly active for all domains (Domain General—DG) we performed a conjunction between the number, fractal and location trials using parameter estimates averaged across WM stages and loads. The resulting activation pattern resembled the brain volume commonly referred to as ‘Multiple Demand Cortex’32,33 (Fig. 2b). To identify the subset of DG regions where activity was sustained during all three stages (e.g., encode, maintain and probe) a second conjunction was performed using parameter estimates averaged across WM domains and loads. We refer to the resultant map as Stage General (SG) (Fig. 2c). This division of DG into SG and non-SG produced a distinct separation between areas that were active for specific processing stages, including primary, secondary, associative visual cortex, motor areas and bilateral thalamus, and areas that were active for all stages, which included, lateral frontal and parietal cortices, anterior insula and thalamus. Finally, we confirmed the reproducibility of the DG and SG conjunctions across the two studies by calculating dice coefficients, which provide a binary estimate of similarity between the patterns of activation. High correspondences were evident: DG = 0.80 and SG = 0.76.
Regions of interest analysis
To reduce computational load voxels within the brain were combined as regions of interest. This data-reduction step is important for network connectivity analyses which produce a high number of features per node37. Key predictions pertained to Multiple Demand Cortex; therefore, we used a custom three-dimensional variant on the watershed algorithm38 to parcellate the SG and DG-conjunction maps from Study 1 into discrete activation clusters comprising 20 and 33 ROIs respectively (Supplementary Tables 9, 10). For completeness, we also conducted analyses with an established whole-brain (WB) parcellation39 covering the entire brain with 268 ROIs (Supplementary Table 11).
Unsupervised clustering based on ROI activity
Hierarchical clustering40 was used to simultaneously group in a data-driven manner the DG ROIs according to similarity of their activation profiles across WM conditions, and the WM conditions according to similarity of their associated patterns of activation across the DG ROIs. This analysis demonstrated high dissociability of WM stages (Fig. 2e, Total Purity = 94.01% see methods) relative to WM domains. There was an interpretable fractionation of the DG ROIs into five distinct clusters corresponding to (1) frontoparietal, (2) somatomotor, (3) associative visual, (4) primary visual and (5) insula, putamen and thalamus (Fig. 2f). Activity was generally lower during maintenance relative to encoding or probe, and this effect was particularly pronounced for the visual clusters41. To examine the dominant natural structure of the data more holistically, t-Distributed Stochastic Neighbour Embedding42 was used to group all mini-blocks in a data driven manner based on the patterns of activation across the WB ROIs. This analysis again demonstrated that the mini-blocks clustered most strongly in both studies according to processing stage (Fig. 2g). This dominance of WM stage is expected given the task design, because the stages differ according to the visual and motor demands of the task.
Maintenance evokes a low-activity high-connectivity state
To further investigate the lower activity observed during maintenance, ROI activations and functional connectivity (FC) estimates from generalised psychophysiological interaction models28,30 (PPI) were averaged across regions and connections (Fig. 3a) for each ROI cluster during the encoding and maintenance stages of the task. The frontoparietal ROIs were consistently active during both encoding and maintenance relative to the inter-trial interval (ITI); whereas, the visual ROIs were more active during encoding than the ITI but not during maintenance (Fig. 3b & Supplementary Table 12). In contrast, intra-cluster connectivity (i.e. averaged for all connections within each cluster), as well as inter-cluster connectivity (i.e. averaged for all connections between clusters), were significantly increased for all clusters (p < 0.001) during both encoding and maintenance relative to the ITI; this was the case for the frontoparietal and the visual clusters (Fig. 3c & Supplementary Tables 13, 14). Differences in the strength of intra-cluster connectivity between encoding and maintenance were small/negligible in the frontoparietal and the visual areas, although the latter were somewhat variable across studies (Supplementary Table 15). Inter-cluster connectivity was substantially stronger during encoding than maintenance, although it was significantly greater than resting baseline for both (Supplementary Table 16). Thus, WM maintenance is characterised by heightened frontoparietal activity and a stable connectivity state throughout the broader WM network.
Load-related effects during maintenance
If the maintenance of WM information related to frontoparietal and visual functional connectivity, then higher maintenance load should have primarily affected inter- and intra-cluster FC during the maintenance period. We tested this by comparing changes in mean activity, as well as intra- and inter-cluster FC during the maintenance stage relative to the ITI. In accordance with our prediction, mean activity across the DG ROIs was insensitive to higher WM load during the maintenance period (F2,3,6 = 0.75, p > 0.43). Analysis of individual ROIs (Bonferroni-Holm corrected) showed load-related increases (p < 0.0001) during maintenance amongst a limited subset of brain regions. These included the superior and inferior parietal, precentral, insula and supramarginal cortex ROIs (Supplementary Table 17). In contrast, there was a robust global increase at higher load in intra-FC (F2,3,6 = 22.55, p < 0.0001) (Fig. 3e, Supplementary Tables 18–26). Study 2 showed the same effect (t15 = 2.71, p < 0.0161).
To determine whether a specific subset of connections was upregulated at higher load regardless of domain, a mass univariate analysis was conducted, in which a general linear model comprising domain and load main effects, and their interaction was fitted to each connection. When analysing entire trials (Fig. 4a), significant domain main effects (Fig. 4b) were evident focused on the intra-connections of the frontoparietal ROI cluster along with connections to associative visual and motor ROIs. Load effects were primarily evident for the connections between visual ROIs and their connections to the frontoparietal and insula ROIs. Connections showing a significant interaction between load and domain were more evenly distributed throughout the network. Critically though, when examining data from the maintenance (Fig. 4c) stage, i.e., when there were no differences in the visual stimuli on screen, no main effects of load were evident; instead, there were main effects of domain, and domain by load interactions. These were focused on connections within the frontoparietal ROIs and intra-connections of frontoparietal and visual ROIs. These results accord with the expected prominent role for frontoparietal connections in WM maintenance. They also demonstrate that increased maintenance load leads to selective upregulation of different combinations of connections, i.e., dependent on the type of information that is being maintained.
Decoding processing stages
As WM stages showed strong separation in the hierarchical cluster analysis, we expected that multivariate pattern analysis would categorise WM stages with high-accuracy based on either activity or connectivity. A three-way classifier was trained on data from Study 1. The accuracy of the classifier model was determined by applying it to a cross-validation holdout subset from Study 1 and data from Study 2, to which it was naïve. This process was repeated for different ROI-sets (i.e. a whole-brain parcellation of another group (WB)39, or focused on the DG and SG ROIs). Classification of processing stages based on multivariate activation (AccWB = 90.89;AccDG = 91.37;AccSG = 75.89) (Fig. 4e) and connectivity (AccWB = 81.18;AccDG = 87.58;AccSG = 76.13) (Fig. 4f) patterns was of extreme high accuracy compared to the null models. This was the case for all ROI sets. Comparing models for different ROI-sets showed that the SG ROIs provided a less accurate classification than either the WB or DG sets (p < 0.001). This suggests, that while the connectivity within the SG-volume is sufficient to decode stages accurately, there is significant additional information within other brain areas (e.g., visual and motor) of the DG volume (Supplementary Tables 27–30).
Decoding visual domains
The hierarchical clustering showed only weak dissociations between the WM domains; however, the mass univariate analysis showed domain sensitivity of selected connections, which could indicate multivariate coding of domains. We tested whether the domains could be decoded based on the activity or connectivity patterns from any of the ROI sets (WB, DG and SG) during maintenance, i.e., where visual and motor demands were carefully controlled in the task design. The same process was applied as per analysis of the processing stages, with classifiers trained on data from Study 1, then applied to holdout data from Study 1 and data from Study 2, to which they were naive. Remarkably, the classification accuracies for WM domains were comparable to those for the WM stages for ROI activation patterns (AccWB = 91.75;F1DG = 70.6;F1SG = 75.2)(Fig. 4e) and connectivity patterns (AccWB = 87.9;F1DG = 85.8;F1SG = 83.6) (Fig. 4f). Post-hoc comparisons showed that added information contained in the WB ROI-set significantly (p < 0.001) improved classification compared to both DG and SG sets (Supplementary Tables 31, 32).
Increased accuracy of decoding at higher WM load
A key prediction of the network perspective was that the connectivity state of the WM network would become more synchronous at higher load. To test whether this led to more discriminable domains, the classifiers were applied to data stacks comprised of single trial connectivity and activity measures, and accuracy was compared across levels of load during the maintenance stage. The results showed that even when applied to the single trial data, regardless of ROI set or metric, the visual domains could be classified with significant accuracy relative to the permutation null distributions (mean accuracy = 68.2%). More importantly, classification was substantially more accurate for both medium and high load compared to low load, regardless of measure (activity or connectivity) and for both studies. Therefore, as maintenance load increased, domain-related activation (Fig. 4g), and to a greater degree connectivity patterns (Fig. 4h), became more discriminable (both p < 0.001, Supplementary Tables 33–40). These results accorded with the observation of domain*load interactions in the above mass-univariate analysis.
Stage-general and stage-specific coding of WM domains
We investigated the degree to which features of the domain-specific patterns of activation and FC were sustained across processing stages. We separated events by stage and estimated the domain classification performance distribution for a global model trained on a subset of data taken from all stages and tested against the replication set. The results showed that the regional activity and dFC patterns that encode WM domains classification generalised across stages, i.e. across stages a similar pattern differentiates domains (mean global accuracy = 76.34%, Fig. 4i. p < 0.0001, see Supplementary Tables 41, 42). We then examined whether domain states were ‘stage specific’, i.e., there are states that expressed predominantly during a specific processing stage and coded for the different domains. To test this, we trained domain classification models on a subset of data from each individual stage (i.e. Encode, Maintain and Probe) and tested these models on events from all stages from the held-out and replications sets (Fig. 4j).
Each stage-specific model has events that match the stage it was trained on as well as mismatch events. Therefore, we examined (for each set and metric) the averaged performance for match and mismatch events. In general match events exhibited significantly (Supplementary Table 43) higher accuracy (Accmatch = 74.16%) than the mismatch counterparts (Accmismatch = 58.74%). Interestingly, activation-based models the differences between match and mismatch (Accmatch = 70.1%, Accmismatch = 61.76%) were less pronounced than for models trained using connectivity (Accmatch = 78.2%, Accmismatch = 58.75). Importantly, in both the DG and SG sets, stage-specific accuracy significantly out-performed global accuracy (tDG(18) = 6.14, tSG(18) = 7.02, p < 0.0001). These results demonstrate that domain × stage interactions are coded in network connectivity as opposed to activity. It also indicates a particularly prominent role for frontoparietal-visual connections in the coding of domain × stage conjunctions.
Contrasting manipulation vs. maintenance
Several studies from the classic literature reported a dissociation between lateral frontal lobe areas involved in the maintenance and manipulation of WM items2,14. We tested the classifiability of maintenance and manipulation events from Study 2 using a liberal within-study cross-validation pipeline (i.e. leave three subjects out 10 times and estimate classification accuracy based on the mean distribution of this subset). Classification accuracy was not significantly higher than the permutation null distribution for models trained on activation patterns or FC patterns for the DG, SG or WB ROI sets (Supplementary Tables 44, 45). Notably, repeating the same within-study pipeline to WM stages and WM visual domains generated highly significant results. This null-finding was unexpected; therefore, we conducted a timecourse analyses focusing at finer temporal grain on the activation timecourses of four 5 mm radius spherical ROIs placed at the peak dorsolateral frontoparietal coordinates from a classic study of WM manipulation43. Analysing the difference in activity at the end of the manipulation vs. maintenance periods (where the haemodynamic response should be close to peak) showed the expected manipulation-related increase in activation (t = 3.93 p = 0.001) when averaged across all four ROIs (Fig. 5a, b). Repeating this analysis for individual ROIs showed the same effect in all cases (left parietal t = 2.75 p = 0.015; right parietal t = 2.63 p = 0.019; left frontal t = 3.15 p = 0.007; right frontal t = 2.17 p = 0.047). In contrast, there were no significant effects at the end of the encoding period (all p > 0.3), i.e., just prior to the retro-cue. Plotting the full FIR timecourse for each condition showed that these effects of manipulation were overshadowed by larger effects of load and domain (Fig. 5c–e). This result accords with the view that the brain regions that are most closely associated with manipulation demands also have broader roles in WM43.
Characterising spatial topologies with sparse models
We inspected the features that contributed most prominently to classification of WM domains to compare them with predictions from the classic localist literature. Regularised sparse models were trained44 with whole-trial data from the DG and WB ROIs. This approach estimates the limited set of connections required for accurate classification of each domain along with a weighting matrix for those connections and a bias term (Fig. 6a, d). Notably, in both ROI sets the sparse models performed as well as the dense models. Back projecting the feature weightings for number WM rendered a left lateralised network, including connections spanning visual, parietal, temporal and posterior frontal ROIs. Spatial WM, included a dorsal network spanning inferior parietal cortex, motor cortex and posterior/dorsal frontal cortex ROIs. Fractal WM included right lateralised connections between the ventral visual association areas and projecting between visual, parietal and more ventral frontal cortex ROIs. The overall topographies of these projections were consistent across ROI sets albeit comprised of different connections and critically, they were comparable to those expected based on the classic literature. Nonetheless, further inspection (Fig. 6c, f) of the effect of WM maintenance relative to the ITI showed that none of the connections were exclusively active for just one domain. Instead, they were generally active for all domains relative to the resting ISI, but most strongly for different domains. Thus, WM domains mapped to densely overlapping patterns of connections as opposed to mutually exclusive networks.
Discussion
Our results provide converging evidence to support the network-coding perspective on human WM function (Fig. 7). Behaviourally distinct aspects of WM including stimulus domains and processing stages evoked strongly dissociable multivariate patterns of activity and connectivity across heavily overlapping brain networks. These aspects of WM were classifiable with remarkably high accuracy even when analysing the SG ROIs, which had the broadest roles as they were active across all conditions. Furthermore, the finer grained domain by stage conjunctions were coded by connectivity but not activity patterns. Moreover, the effects of maintenance load on network connectivity were characterised by heightened classifiability of stimulus domains, and at the mass-univariate level load by domain interactions as opposed to load main effects. This accords with a fine-tuning of the network towards a domain-optimised state when the difficulty of the WM task increased. Critically, these task-evoked brain states were consistent across participants and independent studies. Therefore, they reflect on a fundamental level how human brain networks are organised to flexibly support diverse WM demands.
On the surface, the notion of a dynamic network coding mechanism can seem incompatible with early localist models of the brain-imaging and neuropsychological fields. This is problematic because much of the neuroimaging field is dominated by localist mappings; indeed, in some cases they have been replicated across multiple studies and groups16. The central aim of our study was to determine whether the localist and network frameworks could be reconciled. Based on our results, we argue that this is clearly achievable. More specifically, different conclusions have been drawn from our and previous studies because of two main factors. First, previous studies have typically used WM tasks that are designed to contrast limited numbers of WM conditions. For example, contrasting WM for object vs. spatial stimuli45, encoding vs. maintenance46, maintenance vs. manipulation43, or low vs. high load47. Such designs when applied in unconstrained whole-brain analyses give the illusion of one-to-many functional-anatomical mappings, i.e., where discrete circuits or processing streams are dedicated to the WM function that is being studied. When coupled with hypothesis driven analyses that focus on limited volumes of the brain, for example, the lateral frontal cortices, this problem is exacerbated because it leads to the illusion of one-to-one mappings, where a single functional anatomical module is exclusively ascribed the function of interest (see refs. 20,25,26 for reviews).
Here, we applied a more holistic analysis approach, designing multifactorial tasks that enabled a broad range of behavioural conditions to be contrasted25,48, and using data-driven analyses that were not biased by assumptions regarding how those conditions map onto the brain. Notably, in our dataset, conducting simple subtractive contrasts between any given pair of conditions in isolation would have produced the illusion of mutually exclusive or modular brain systems; conversely, in the context of the broader findings these mappings were shown to be many-to-many as opposed to mutually exclusive. Given the broad cognitive scope and lack of spatial constraints in our study, we find it reassuring that the findings reported in the localist literature were not exposed as incorrect per se; instead, in all cases they were recast as salient features of the densely distributed heavily overlapping multivariate activation and connectivity patterns that characterise different aspects of WM.
Prominent incongruences between localist models were also reconciled when recast within the network framework. For example, a debate of the early literature regarded whether dorsal and ventral modules within the lateral frontal cortices had specialised roles in visual and spatial WM6, an extension of the putative ‘what vs. where’ visual processing streams49, or instead were responsible for applying-specific processes to information of any type, e.g., maintaining/monitoring vs. manipulating information in WM2,3,14,43. When taking a multivariate view, our analyses show evidence for both mappings during the performance of the same task. Trials where spatial locations were encoded and maintained could be classified based on increased connectivity through a dorsal network dominated by visual, parietal and posterior frontal areas. Fractal trials were classified based on increased connectivity throughout a ventral network spanning visual processing streams and projecting to more inferior frontal areas. Critically though, these connections were not exclusively sensitive to either stimulus domain. Furthermore, the set of frontal and parietal brain regions associated with manipulation demands was confirmed to be more active during manipulation vs. maintenance trials, yet, all four of those ROIs were also sensitive to stimulus domain and the level of maintenance WM load. Therefore, as opposed to there being discrete systems for these WM demands, the same network of brain regions dynamically reconfigures its detailed pattern of activity and connectivity dependent on them.
One might argue that this general involvement of brain regions in diverse WM demands favours globalist perspectives. which based on the observation of common activation patterns across diverse cognitive tasks proposed the existence of a central and highly flexible cognitive resource in the brain, i.e., Multiple Demand Cortex32,33. Again, the modular and globalist perspectives can appear incompatible; however, they were reconciled within the network-coding framework; the domain general perspective was correct insofar as brain regions corresponding to Multiple Demand Cortex were commonly recruited across all WM domains and stages. However, this volume was not comprised of functionally homogeneous brain regions or even a simple binary sub-division into sub-networks48. Instead, the ROIs had diverse profiles in terms of the combination of WM conditions under which they were most active; this conforms to a many-to-many mapping between function and anatomy. Indeed, all the above aspects of WM were classifiable from the multivariate pattern of activation and connectivity even within the most broadly active subset of brain region (SG). From this, we can infer that Multiple Demand Cortex dynamically codes for different task demands in a multivariate manner not only at the neuronal50 and voxelwise level as previously reported51,52, but also at the macroscopic level through transient changes in its internal network configuration53.
In summary, the network coding perspective provides an overarching framework that can reconcile insights from diverse studies of WM. We believe that the distributed coding of different aspects of WM that is evident across macroscopic brain networks is analogous to that which is observed in the coding mechanisms of local populations of neurons54. Such coding likely is a scale-free property of how networks efficiently and robustly support diverse information types and processes50. Here, using multivariate analysis methods that align with these distributed coding mechanisms provided remarkably high classification accuracies with models that generalised across individuals and independent studies. Tentatively, we posit that the capacity to transiently express brain states that are optimal for performing-specific tasks may underlie population differences in cognitive ability. Given the reliability of our classification approach, we believe that univariate measures summarising the strengths of expression of the connectivity patterns for stimulus domains and processes should be explored in studies of group and individual differences in WM and as outcome measures in trials that seek to enhance WM functions. The latter application is particularly promising given recent studies showing that the functional connectivity of the WM network is amenable to modulation by non-invasive brain stimulation55 and augmentation by focused cognitive training56,57. Future work should also focus on analysing directed information flow using effective connectivity methods to determine how transitions between these task-evoked processing states are orchestrated within the human brain29,58,59.
Methods
Experimental design and data collection
Participants— 19 right-handed healthy adults (6 female, mean age 22.368, range 18–28 years of age) participated in Study 1 and 17 right-handed healthy adults (11 female, mean age 23.938, range 19–41 years of age) participated in Study 2. All participants had normal hearing and corrected to normal vision. Before commencing the study they read instructions regarding the task and protocol, agreed to experimental procedures and underwent a short training session to ensure that they could perform the task. The training session consisted of approximately 15–20 min practicing the task on a laptop. They then entered the MRI scanner and undertook the task. One participants in Study 2, was excluded due to a technical error that resulted in corrupted behavioural recordings. Sample sizes were selected to align with studies of WM in prior literature. The studies were approved by the University of Western Ontario ethics committee.
Procedure— In both studies participants performed three runs of similar cognitive experiments designed to manipulate WM domains and processing stages. Each run was composed of pseudo-randomly allocated blocks, where each block included a single trial with distinct encoding, maintenance and probe stages (Fig. 1).
Study 1— Participants were required to encode and maintain a set of features from an array of compound stimuli, composed of a pseudo-randomly selected numbers and fractals placed at random spatial positions within a 4 × 4 grid. Each trial began with a pre-encoding cue directing participants to focus on features from one of these three stimulus domains (number, fractal or spatial). Then, three, five or seven compound stimuli were presented within the 4 × 4 grid. After 10 s of encoding, stimuli were removed, and participants were required to maintain the features from the cued domain for 10 s. Subsequently, participants were presented with a probe array where one each of the numbers, fractals and locations had been shuffled. They were required to indicate within a 10 s timeframe the location of the shuffled item that was within the currently maintained domain. The trial terminated at the point in time when the participant responded and the next trial began after a 10-s inter-trial-interval (ITI), during which a fixation cross was displayed and they were instructed to rest. In the imaging analysis, this (ITI) provided the baseline for comparing all other events, i.e., where no overt processing of the WM task was required.
Study 2— Participants were again required to encode and maintain a set of features from an array of compound stimuli in a similar to the first in design; except for three major differences. (1) A second retro-cue was added to cue the participants to either manipulate (add 3 or mentally rotate 90 clockwise the spatial pattern) or maintain the encoded information. (2) Stimulus domains included numbers and spatial locations only. This reduced complexity in the factorial design to compensate for the addition of the manipulation condition. (3) Similarly, the load condition were limited to two levels, these being three or six items per stimulus domain.
Behavioural data collection— In both experiments, data were collected in three runs of scanning acquisition. In the first experiment, each run contained 18 trials, two each from nine possible combinations of stimulus domains (number, fractal and spatial) and WM load (3, 5 and 7). In the second experiment, each run contained 16 trials, two each from eight possible combinations of cognitive processes (Maintenance and Manipulation), stimulus domains (number and spatial) and WM load (3 and 6). Stimuli were presented on a back-projection screen visible from the bore of the MRI scanner via a mirror mounted to the head-coil. Responses were taken with a custom MRI-compatible trackball mouse. Both WM paradigms were programmed using Adobe Flash Builder 4.5 and embedded in a scanner interface programmed in Visual Basic.
MRI data collection— Brain images were collected using a 3 Tesla Trio TIM SIEMENS Scanner. A T2 weighted echo planar image depicting blood oxygenation level dependent (BOLD) contrast was acquired every 2000 ms. The first ten images were discarded to account for equilibrium effects. Images consisted of 36 3 mm slices, with an 80 × 80 matrix, 240 × 240 mm field of view, 30 ms TE, 2 s TR, 45° flip angle, 2.65 ms echo spacing. A 1 mm resolution MPRAGE structural scan was also collected for each with a 256 × 240 × 384 matrix, 900 ms TI, 2.98 ms TE and 9° flip angle. Acquisition in Study 2 was almost identical except that the flip angle was 60°.
Imaging Quality control— Raw bold signal to noise ratio (SNR) metrics were extracted across studies using an in-house implementation of the metrics proposed by Friedman60. A multivariate outlier detection analysis was performed to identify any low SNR values. No scans were omitted at this stage.
Exploratory analysis
Pre-processing— Before analysis, data were preprocessed using a custom pipeline that included functions from SPM12 (Statistical Parametric Mapping Welcome Department of Imaging Neuroscience), FSL (FMRIB Software Library v5.0) and MATLAB 2017b. Specifically, data were slice-timing and motion corrected, spatially warped onto the standard Montreal Neurological Institute template using a custom DARTEL template generated from all the structural scans (for each study independently), and spatially smoothed with an 8 mm3 full width at half maximum Gaussian kernel. The data were high-pass filtered (cutoff period of half of the scan length—approx. 300–400 s) to remove low-frequency drifts in the MRI signal.
Whole-brain BOLD activity estimate— Nuisance experimental matrices were formed using motion estimates (in an extended 24-parameter model) and motion outlier (spikes) events based on Yan et al. recommendations61. Then a single subject design matrix was constructed using the HRF convolved experimental onsets and nuisance matrix. This was applied to the fMRI stack to estimate beta coefficient fits using the classic mass-univariate GLM in SPM12. Finally, whole-brain maps depicting statistical parametric estimates were generated using predefined contrasts of interest. In the voxelwise analysis, this was followed by 2nd level group contrasts against the null distribution, focusing on the following experimental dimensions across conditions: (1) visual domains (numbers, fractal and spatial), (2) WM stages (encode, maintain, probe), (3) load (high, medium, low) & (4) and manipulation (on, off). In Study 1 there was no ‘manipulation’ condition. In Study 2, the visual domains comprised only numbers and spatial stimuli and there was no medium load.
Data structures— Single subject general linear models capturing regional changes in activity were constructed using a mini-block design, in which each stage (e.g., cue, encode, maintain, probe) of each individual trial was captured by a separate predictor, alongside nuisance covariates. The resultant whole-brain maps depicting parameter estimates were taken to the group level, creating stacks composed of 19 × 3 × 3 × 3 × 4 × 2 = 4104 events (i.e. subjects × runs × domains × loads × stages × repeats) for Study 1 and 16 × 3 × 2 × 2 × 2 × 2 = 3072 events (i.e. subjects × runs × manipulation × domains × loads × stages × repeats) for Study 2.
Conjunction analyses— We used minimum statistic conjunction-null tests36,62 to examine intersections across experimental domains as per SPM12’s suggested flow. Specifically, we used the minimum T-statistic over n orthogonal contrasts. Inference was based on individual cluster corrected statistical volumes per contrasts, which were later combined using a logical AND. We derived minimal cluster size by performing uncorrected analysis with a relaxed threshold (p < 0.01), then applying the minimal false-discovery-rate (FDR) cluster (PFDR < 0.05) to generate a cluster corrected map for each of the contrasts in the specific conjunction. This enabled us to infer a conjunction effect at significant voxels. The above procedure was performed in an identical manner across both studies and the following comparisons were performed: (1) Domain General (DG) a 3-way conjunction over the three stimulus domains in Study 1 and independently on the two visual domains in Study 2; and (2) Stage General (SG) intersecting the three processing stages in both studies (i.e. encode, maintain and probe).
Similarity calculation— Minimum statistic conjunction-null tests were performed separately for each study36 in SPM12 with voxelwise thresholding at p < 0.01 followed by false-discovery-rate (FDR) cluster correction across the whole-brain mass at p < 0.05. The quotient of similarity (QS) summarized by dice coefficients was calculated across the two studies to identify clusters that were reproducibly identified for the DG and SG conjunctions across both studies. Specifically, binary masks were created from the cluster thresholded maps and the similarities calculated using the following formula:
1 |
Data driven parcellation— Our analyses focused on ROI’s produced from the above conjunction maps and a whole-brain parcellation made available by another group63. These ROI’s enabled both dimension reduction of the voxelwise space and calculation of task-evoked connectivity matrices. Thus, they form a basis for connectomics54.
Data driven parcellation— We used a “watershed transform”64, to segment the inverse of the minimal conjunction maps into ROIs. This common segmentation procedure treats the three-dimensional statistical volume as a multi-dimensional surface where high and low intensities represent elevations. The algorithm iteratively’filled’ independent catchment basins (CB) with unique labels by flooding the various-independent local minima in the statistical volume and their surroundings. At the end of the segmentation, we filtered out small ROIs, by applying a >50-voxel exclusion criteria. The conjunction maps from Study 1 were parcellated into ROIs using our in-house 3D implementation of the watershed algorithm. Using the ROI set a database was formed containing the mean parameter estimates for each ROI and each WM event within the stack (as defined above).
Hierarchical clustering— Focusing on the (more inclusive) DG-ROI set we collapsed events to compare visual domains and processing stages (i.e. averaged across runs and loads). This formed an m × n matrix (one for each experiment), where m are events averaged for each individual (subjects × visual domains × processing stages) and n are the ROI’s, e.g., from DG-conjunction. Using Euclidean distance and the Ward method65 (which minimizes the total within-cluster variance) we clustered the ROIs and WM conditions along both axes. We used agglomerative hierarchical clustering40 (HCL) to uncover structure within the hierarchies of the activation data stack defined above. Dendrogram Inconsistency was used to define the natural data-driven cutoff across the HCL hierarchy. This threshold was measured as the difference between height of the current link and the mean height of its sub-graph (depth = 5) divided by the standard deviation of the sub-graph.
Cluster purity measure— Purity is a common external evaluation criterion of cluster quality. As the clustering is performed in an unsupervised manner, and we should not expect correspondence between number of classes nor class assignments due to chance alone, we matched each cluster to the class that was most frequent and measured the intersection between them. Accuracy was measured as the percent of total events that matched this initial pairing. Where N was the number of total events, k the number of clusters, the set of clusters and the set of classes.
2 |
Transient functional connectivity estimate— While there exist various methods to estimate connectivity between two regions66 dynamic or model-based connectivity measures pose a challenge67. Here we used psycho-physiological interaction models28,30 (PPI), which estimate the task-evoked functional connectivity between each pair of brain regions. We used this approach primarily for its simplicity and computational efficacy. Notably, the classic method is limited to a single-PPI contrast. Recently a generalised form of PPI was suggested to assess simultaneously multiple dimensions of the experimental space. We used a custom MATLAB implementation of the following model:
3 |
where X was, the matrix containing psychological timecourses (i.e. timecourses for encode, maintain and probe events) and H(X) was the HRF convolution of that matrix. YT was the target time series and YS the source time series. E was the nuisance regressor matrix defined previously in the pre-processing stage. βG included weights of no-interest and βj the weights for the PPI predictors, which were the target of further analysis. β0 was the intercept and e the residual error. This model was estimated for all pairs of connections to form a connectivity matrix and upper lower triangles were averaged to form an undirected weighted connectivity matrix for each condition in the design matrix.
Machine learning
Multi-class classification— Multivariate classification based on fMRI measures has been successfully applied extensively in the past, e.g., to explore multivariate discriminant neuronal coding relating to stimuli, task rules, emotions and more52,68,69. Although it has mostly been used in the context of local voxel patterns, combining this approach with global patterns extracted using some high level parcellation scheme allows in-depth inspection of patterns from both activity and connectivity across the brain. In this paper, we used dense and sparse multi-class classification models in a ’one-vs-all’ scheme which means that a model was trained to differentiate between a class and all events that are not of that class. The classification accuracy was an estimation of the information content in a dataset70. To achieve this, a stack of models per class was constructed. For the dense models we used an ’Error Correcting Output Codes’71 (ECOC) ensemble approach to solve the multi-class problem. We used the MATLAB built in function with linear support vector machine binary classification as the actual learning algorithm to discriminate between groups. For the sparse models we use multinomial logistic regression by Friedman et al.44. Using the GLMnet MATLAB toolbox https://web.stanford.edu/~hastie/glmnet/glmnet_alpha.html. This identifies the minimal set of mutually exclusive features for each class along with weighting matrices and class biases.
Null-model generation— To estimate significance relative to the null distribution, we randomly divided events from Study 1 into two unequal parts (75/25%) based on the participant’s id. The smaller sample was our within-study validation set, and the bigger was the training set. We also included all the data from Study 2 as an independent test set. We trained two classification models one using the actual labels from the training set and another model (the null model) using shuffled labels (i.e. destroying any possible connection between dependent and independent measures). We then estimated the model’s performance within study (using the validation set), and the models robustness across studies (using the test set). This step was repeated a 1000 times to form two distributions, our actual performance that quantifies the relevance of the neuronal information to the classification task and the null distribution that quantifies the classification of events by chance.
Feature selection flow— Neuroimaging data can range from highly selective hypothesis driven features (i.e., specific regions of interest) to whole-brain voxelwise inspection (several to tens of thousands of features). Connectivity data provides an almost squared increase in dimensionality complexity relative to regional activity (from tens to billions of features). This raises two complementary problems, multicollinearity, where multiple features share similar relevant information for classification and the “curse of dimensionality”, where the number of features far surpasses the number of observations. Dealing with the latter is possible by using sparse logistic regression instead of SVM.
Lambda selection— We trained our sparse models with a range of regularization values using an exponential decay function. Optimal lambda was selected from this range by finding the maximum absolute second derivative over the sum minimal cross entropy values derived from the training set (also known as finding the knee in scree plots).
Performance measures— In the context of this work, accuracy was defined as the balance between the precision and recall (see detailed definitions Sokolova et al.72), commonly known as F1-score, which is considered a more appropriate measure for multi-class classification problems.
Statistical significance testing
To determine statistical significance, we used the methods detailed below. All testing was performed using MATLAB 2017b statistical toolbox and costume MATLAB functions (see code availability).
Behavioural significance testing— We used repeated-measures ANOVA to examine behavioural effects for both accuracy and response time. In the latter case we used log transformed response times to better comply with the normality assumption of the test. When applicable, this was followed by a post hoc multi-comparison pairwise tests to identify the basis and direction of the significant effect.
Cluster averaged significance testing— We also used repeated-measures ANOVA to examine global effects in the domain general ROI set for clusters (defined in the clustering section) averaged activation and connectivity. In the later case, we distinguished between inter and intra connectivity. Mauchly’s test for sphericity was performed to assess whether ANOVA assumptions were violated and if so Greenhouse Geisser corrected p-values were used. When applicable, this was followed by a post hoc multi-comparison pairwise test to identify the basis and direction of significant effects. All post hoc tests used Bonferroni correction.
Classification accuracy significance testing— Following40,73–75 to estimate classification accuracy significance we used a conservative repeated cross-validation approach to form paired performance estimations as defined in performance estimation section. We then estimated the empirical probability as , where b was the number of events where Accperm > Accobs, i.e. the number of events where the permuted null model out-performed the model trained by real data, and m was the number of random sampling pairs (10 partitioning × 100 bootstraps). Effect size was then calculated using the Mann–Whitney U-test to compare between the permuted and observed performance (i.e., taking the conservative-independent replication set values).
Measure comparison significance testing— To compare between activation and connectivity accuracies we averaged performance within each of the ten-stratified partitioning (selecting ten different sets of training and testing based on unique subject’s id’s see above) and used t-tests to compare distributions above and beyond the different sets.
Match/mismatch significance testing— To address the extent to which features in the models for classification of domains generalised vs. were specific to WM stages, we trained stage specific domain models using events from each processing stage (encode, maintain & probe) independently. These were then applied to data to which they were naive from all three domains and all three stages.
Domain by load thresholded network analysis— To uncover-specific connections that were upregulated as a function of both load and domain (i.e. interaction effects) we applied linear mixed effects models (using MATLAB fitglme function) independently for each connection. Specifically, the upper triangle of the connectivity matrix was extracted for all trial events, and for each connection the following model was fitted ‘Y = 1 + study + domain × load + (1|subject)’. ANOVA was then used on the model to assess main and interaction effects. P-values were FDR corrected for multiple comparisons. This procedure was performed first for whole trials (i.e. averaging across processing stages) and then for maintenance only.
Code availability
All custom MATLAB routines and data used to generate the analysis and figures of this paper will be committed upon acceptance for review to the following GitHub: https://github.com/esoreq/WM.
Supplementary information
Acknowledgements
E.S. undertook this work as part of his Ph.D., funded by Imperial College London and completed it while employed on MRC project grant MR/R005370/1 awarded to R.L. and A.H.
Author contributions
All three authors conceived the study. A.H. designed and programmed both tasks, contributed to the design of the analysis pipeline and helped write the article. R.L. contributed to the design of the analysis pipeline and helped write the article. E.S. designed and developed all components of the analysis pipeline, performed all analyses, created the figures and authored the article.
Data availability
Neural and behavioural datasets for both studies have been made available online at OpenNeuro Study one; 10.18112/openneuro.ds001634.v1.0.1. Study two; 10.18112/openneuro.ds001635.v1.0.1.
Competing interests
The authors declare no competing interests.
Footnotes
Journal peer review information: Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary Information accompanies this paper at 10.1038/s41467-019-08840-8.
References
- 1.Baddeley A. Working memory: theories, models, and controversies. Annu. Rev. Psychol. 2012;63:1–29. doi: 10.1146/annurev-psych-120710-100422. [DOI] [PubMed] [Google Scholar]
- 2.Owen AM, Evans AC, Petrides M. Evidence for a two-stage model of spatial working memory processing within the lateral frontal cortex: a positron emission tomography study. Cereb. Cortex. 1996;6:31–38. doi: 10.1093/cercor/6.1.31. [DOI] [PubMed] [Google Scholar]
- 3.Postle BR. The cognitive neuroscience of visual short-term memory. Curr. Opin. Behav. Sci. 2015;1:40–46. doi: 10.1016/j.cobeha.2014.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Stokes MG. ‘Activity-silent’working memory in prefrontal cortex: a dynamic coding framework. Trends Cogn. Sci. 2015;19:394–405. doi: 10.1016/j.tics.2015.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Fletcher PC, Henson RNA. Frontal lobes and human memory: insights from functional neuroimaging. Brain. 2001;124:849–881. doi: 10.1093/brain/124.5.849. [DOI] [PubMed] [Google Scholar]
- 6.Goldman-Rakic PS. Architecture of the prefrontal cortex and the central executive. Ann. N. Y. Acad. Sci. 1995;769:71–84. doi: 10.1111/j.1749-6632.1995.tb38132.x. [DOI] [PubMed] [Google Scholar]
- 7.D’Esposito, M., Postle, B. R. & Rypma, B. In Executive control and the frontal lobe: Current issues 3–11 (Springer, Berlin, 2000).
- 8.Jonides J, et al. The role of parietal cortex in verbal working memory. J. Neurosci. 1998;18:5026–5034. doi: 10.1523/JNEUROSCI.18-13-05026.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Gazzaley A, Rissman J, D’esposito M. Functional connectivity during working memory maintenance. Cogn. Affect. Behav. Neurosci. 2004;4:580–599. doi: 10.3758/CABN.4.4.580. [DOI] [PubMed] [Google Scholar]
- 10.Cowan, N. Attention and memory: An integrated framework. (Oxford University Press, New York, 1998).
- 11.Majerus S, Péters F, Bouffier M, Cowan N, Phillips C. The dorsal attention network reflects both encoding load and top–down control during working memory. J. Cogn. Neurosci. 2018;30:144–159. doi: 10.1162/jocn_a_01195. [DOI] [PubMed] [Google Scholar]
- 12.Schon K, Newmark RE, Ross RS, Stern CE. A working memory buffer in parahippocampal regions: evidence from a load effect during the delay period. Cereb. Cortex. 2015;26:1965–1974. doi: 10.1093/cercor/bhv013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ikkai A, McCollough AW, Vogel EK. Contralateral delay activity provides a neural measure of the number of representations in visual working memory. J. Neurophysiol. 2010;103:1963–1968. doi: 10.1152/jn.00978.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Owen AM, et al. Redefining the functional organization of working memory processes within human lateral prefrontal cortex. Eur. J. Neurosci. 1999;11:567–574. doi: 10.1046/j.1460-9568.1999.00449.x. [DOI] [PubMed] [Google Scholar]
- 15.Wager TD, Smith EE. Neuroimaging studies of working memory. Cogn., Affect. Behav. Neurosci. 2003;3:255–274. doi: 10.3758/CABN.3.4.255. [DOI] [PubMed] [Google Scholar]
- 16.Spitzer, B. & Roelfsema, P. R. The distributed nature of working memory. Trends Cogn. Sci. 21 (2017). [DOI] [PubMed]
- 17.Desimone R, Duncan J. Neural mechanisms of selective visual attention. Annu. Rev. Neurosci. 1995;18:193–222. doi: 10.1146/annurev.ne.18.030195.001205. [DOI] [PubMed] [Google Scholar]
- 18.Cole MW, et al. Multi-task connectivity reveals flexible hubs for adaptive task control. Nat. Neurosci. 2013;16:1348. doi: 10.1038/nn.3470. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Fox MD, et al. The human brain is intrinsically organized into dynamic, anticorrelated functional networks. Proc. Natl Acad. Sci. USA. 2005;102:9673–9678. doi: 10.1073/pnas.0504136102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hampshire A, Sharp DJ. Contrasting network and modular perspectives on inhibitory control. Trends Cogn. Sci. 2015;19:445–452. doi: 10.1016/j.tics.2015.06.006. [DOI] [PubMed] [Google Scholar]
- 21.Mišić B, Sporns O. From regions to connections and networks: new bridges between brain and behavior. Curr. Opin. Neurobiol. 2016;40:1–7. doi: 10.1016/j.conb.2016.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Preti MG, Bolton TA, Van De Ville D. The dynamic functional connectome: state-of-the-art and perspectives. Neuroimage. 2017;160:41–54. doi: 10.1016/j.neuroimage.2016.12.061. [DOI] [PubMed] [Google Scholar]
- 23.Rissman J, Gazzaley A, D’Esposito M. Measuring functional connectivity during distinct stages of a cognitive task. Neuroimage. 2004;23:752–763. doi: 10.1016/j.neuroimage.2004.06.035. [DOI] [PubMed] [Google Scholar]
- 24.Sadaghiani S, Poline JB, Kleinschmidt A, D’Esposito M. Ongoing dynamics in large-scale functional connectivity predict perception. Proc. Natl Acad. Sci. USA. 2015;112:8463–8468. doi: 10.1073/pnas.1420687112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Lorenz R, et al. Dissociating frontoparietal brain networks with neuroadaptive Bayesian optimization. Nat. Commun. 2018;9:1227. doi: 10.1038/s41467-018-03657-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Matthews PM, Hampshire A. Clinical concepts emerging from fMRI functional connectomics. Neuron. 2016;91:511–528. doi: 10.1016/j.neuron.2016.07.031. [DOI] [PubMed] [Google Scholar]
- 27.Medaglia JD, Lynall ME, Bassett DS. Cognitive network neuroscience. J. Cogn. Neurosci. 2015;27:1471–1491. doi: 10.1162/jocn_a_00810. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Friston KJ, et al. Psychophysiological and modulatory interactions in neuroimaging. Neuroimage. 1997;6:218–229. doi: 10.1006/nimg.1997.0291. [DOI] [PubMed] [Google Scholar]
- 29.Hampshire A, et al. Network mechanisms of intentional learning. Neuroimage. 2016;127:123–134. doi: 10.1016/j.neuroimage.2015.11.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.McLaren DG, Ries ML, Xu G, Johnson SC. A generalized form of context-dependent psychophysiological interactions (gPPI): a comparison to standard approaches. Neuroimage. 2012;61:1277–1286. doi: 10.1016/j.neuroimage.2012.03.068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Erika-Florence M, Leech R, Hampshire A. A functional network perspective on response inhibition and attentional control. Nat. Commun. 2014;5:4073. doi: 10.1038/ncomms5073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Duncan J, Owen AM. Common regions of the human frontal lobe recruited by diverse cognitive demands. Trends Neurosci. 2000;23:475–483. doi: 10.1016/S0166-2236(00)01633-7. [DOI] [PubMed] [Google Scholar]
- 33.Fedorenko E, Duncan J, Kanwisher N. Broad domain generality in focal regions of frontal and parietal cortex. Proc. Natl Acad. Sci. USA. 2013;110:16616–16621. doi: 10.1073/pnas.1315235110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Liang X, Zou Q, He Y, Yang Y. Topologically reorganized connectivity architecture of default-mode, executive-control, and salience networks across working memory task loads. Cereb. Cortex. 2015;26:1501–1511. doi: 10.1093/cercor/bhu316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Cohen JR, D’Esposito M. The segregation and integration of distinct brain networks and their relationship to cognition. J. Neurosci. 2016;36:12083–12094. doi: 10.1523/JNEUROSCI.2965-15.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Nichols T, Brett M, Andersson J, Wager T, Poline JB. Valid conjunction inference with the minimum statistic. Neuroimage. 2005;25:653–660. doi: 10.1016/j.neuroimage.2004.12.005. [DOI] [PubMed] [Google Scholar]
- 37.Poldrack RA. Region of interest analysis for fMRI. Soc. Cogn. Affect. Neurosci. 2007;2:67–70. doi: 10.1093/scan/nsm006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Fedorenko E, Hsieh PJ, Nieto-Castañón A, Whitfield-Gabrieli S, Kanwisher N. New method for fMRI investigations of language: defining ROIs functionally in individual subjects. J. Neurophysiol. 2010;104:1177–1194. doi: 10.1152/jn.00032.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Finn ES, et al. Functional connectome fingerprinting: identifying individuals using patterns of brain connectivity. Nat. Neurosci. 2015;18:1664. doi: 10.1038/nn.4135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Hastie, T., Tibshirani, R. & Friedman, J. Hierarchical clustering in The elements of statistical learning pp. 101–135 (Springer-Verlag, New York, 2009).
- 41.Courtney SM, Ungerleider LG, Keil K, Haxby JV. Transient and sustained activity in a distributed neural system for human working memory. Nature. 1997;386:608. doi: 10.1038/386608a0. [DOI] [PubMed] [Google Scholar]
- 42.Maaten Lvd, Hinton G. Visualizing data using t-SNE. J. Mach. Learn. Res. 2008;9:2579–2605. [Google Scholar]
- 43.Veltman DJ, Rombouts SA, Dolan RJ. Maintenance versus manipulation in verbal working memory revisited: an fMRI study. Neuroimage. 2003;18:247–256. doi: 10.1016/S1053-8119(02)00049-6. [DOI] [PubMed] [Google Scholar]
- 44.Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 2010;33:1. doi: 10.18637/jss.v033.i01. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Courtney SM, Ungerleider LG, Keil K, Haxby JV. Object and spatial visual working memory activate separate neural systems in human cortex. Cereb. Cortex. 1996;6:39–49. doi: 10.1093/cercor/6.1.39. [DOI] [PubMed] [Google Scholar]
- 46.Ranganath C, DeGutis J, D’Esposito M. Category-specific modulation of inferior temporal activity during working memory encoding and maintenance. Cogn. Brain Res. 2004;20:37–45. doi: 10.1016/j.cogbrainres.2003.11.017. [DOI] [PubMed] [Google Scholar]
- 47.Xu Y, Chun MM. Dissociable neural mechanisms supporting visual short-term memory for objects. Nature. 2006;440:91. doi: 10.1038/nature04262. [DOI] [PubMed] [Google Scholar]
- 48.Hampshire A, Highfield RR, Parkin BL, Owen AM. Fractionating human intelligence. Neuron. 2012;76:1225–1237. doi: 10.1016/j.neuron.2012.06.022. [DOI] [PubMed] [Google Scholar]
- 49.Haxby JV, et al. The functional organization of human extrastriate cortex: a PET-rCBF study of selective attention to faces and locations. J. Neurosci. 1994;14:6336–6353. doi: 10.1523/JNEUROSCI.14-11-06336.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Stokes MG, et al. Dynamic coding for cognitive control in prefrontal cortex. Neuron. 2013;78:364–375. doi: 10.1016/j.neuron.2013.01.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Woolgar A, Hampshire A, Thompson R, Duncan J. Adaptive coding of task-relevant information in human frontoparietal cortex. J. Neurosci. 2011;31:14592–14599. doi: 10.1523/JNEUROSCI.2616-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Woolgar A, Jackson J, Duncan J. Coding of visual, auditory, rule, and response information in the brain: 10 years of multivoxel pattern analysis. J. Cogn. Neurosci. 2016;28:1433–1454. doi: 10.1162/jocn_a_00981. [DOI] [PubMed] [Google Scholar]
- 53.Cole MW, et al. Multi-task connectivity reveals flexible hubs for adaptive task control. Nat. Neurosci. 2013;16:1348–1355. doi: 10.1038/nn.3470. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Sporns O, Tononi G, Kötter R. The human connectome: a structural description of the human brain. PLoS Comput. Biol. 2005;1:e42. doi: 10.1371/journal.pcbi.0010042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Violante IR, et al. Externally induced frontoparietal synchronization modulates network dynamics and enhances working memory performance. eLife. 2017;6:e22001. doi: 10.7554/eLife.22001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Covey TJ, Shucard JL, Benedict RH, Weinstock-Guttman B, Shucard DW. Improved cognitive performance and event-related potential changes following working memory training in patients with multiple sclerosis. Mult. Scler. J. Transl. Clin. 2018;4:2055217317747626. doi: 10.1177/2055217317747626. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Huntley J, Hampshire A, Bor D, Owen A, Howard R. Adaptive working memory strategy training in early Alzheimer’s disease: randomised controlled trial. Br. J. Psychiatry. 2017;210:61–66. doi: 10.1192/bjp.bp.116.182048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Parkin BL, Hellyer PJ, Leech R, Hampshire A. Dynamic network mechanisms of relational integration. J. Neurosci. 2015;35:7660–7673. doi: 10.1523/JNEUROSCI.4956-14.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Friston KJ, Harrison L, Penny W. Dynamic causal modelling. Neuroimage. 2003;19:1273–1302. doi: 10.1016/S1053-8119(03)00202-7. [DOI] [PubMed] [Google Scholar]
- 60.Friedman L, Glover GH. Report on a multicenter fMRI quality assurance protocol. J. Magn. Reson. Imaging. 2006;23:827–839. doi: 10.1002/jmri.20583. [DOI] [PubMed] [Google Scholar]
- 61.Yan CG, et al. A comprehensive assessment of regional variation in the impact of head micromovements on functional connectomics. Neuroimage. 2013;76:183–201. doi: 10.1016/j.neuroimage.2013.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Friston KJ, Holmes AP, Price C, Büchel C, Worsley K. Multisubject fMRI studies and conjunction analyses. Neuroimage. 1999;10:385–396. doi: 10.1006/nimg.1999.0484. [DOI] [PubMed] [Google Scholar]
- 63.Finn ES, et al. Functional connectome fingerprinting: identifying individuals using patterns of brain connectivity. Nat. Neurosci. 2015;18:1664–1671. doi: 10.1038/nn.4135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Meyer F, Beucher S. Morphological segmentation. J. Vis. Commun. Image Represent. 1990;1:21–46. doi: 10.1016/1047-3203(90)90014-M. [DOI] [Google Scholar]
- 65.Murtagh F. A survey of recent advances in hierarchical clustering algorithms. Comput. J. 1983;26:354–359. doi: 10.1093/comjnl/26.4.354. [DOI] [Google Scholar]
- 66.Smith SM, et al. Network modelling methods for FMRI. Neuroimage. 2011;54:875–891. doi: 10.1016/j.neuroimage.2010.08.063. [DOI] [PubMed] [Google Scholar]
- 67.Hutchison RM, et al. Dynamic functional connectivity: promise, issues, and interpretations. Neuroimage. 2013;80:360–378. doi: 10.1016/j.neuroimage.2013.05.079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Haynes JD, Rees G. Neuroimaging: decoding mental states from brain activity in humans. Nat. Rev. Neurosci. 2006;7:523. doi: 10.1038/nrn1931. [DOI] [PubMed] [Google Scholar]
- 69.Kay KN, Naselaris T, Prenger RJ, Gallant JL. Identifying natural images from human brain activity. Nature. 2008;452:352. doi: 10.1038/nature06713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Cox DD, Savoy RL. Functional magnetic resonance imaging (fMRI) “brain reading”: detecting and classifying distributed patterns of fMRI activity in human visual cortex. Neuroimage. 2003;19:261–270. doi: 10.1016/S1053-8119(03)00049-1. [DOI] [PubMed] [Google Scholar]
- 71.Escalera S, Pujol O, Radeva P. On the decoding process in ternary error-correcting output codes. IEEE Trans. Pattern Anal. Mach. Intell. 2010;32:120–134. doi: 10.1109/TPAMI.2008.266. [DOI] [PubMed] [Google Scholar]
- 72.Sokolova M, Lapalme G. A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 2009;45:427–437. doi: 10.1016/j.ipm.2009.03.002. [DOI] [Google Scholar]
- 73.Demšar J. Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 2006;7:1–30. [Google Scholar]
- 74.Edgington, E. & Onghena, P. Randomization tests. (Chapman and Hall/CRC, New York, 2007).
- 75.Phipson B, Smyth GK. Permutation P-values should never be zero: calculating exact P-values when permutations are randomly drawn. Stat. Appl. Genet. Mol. Biol. 2010;9:39. doi: 10.2202/1544-6115.1585. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Neural and behavioural datasets for both studies have been made available online at OpenNeuro Study one; 10.18112/openneuro.ds001634.v1.0.1. Study two; 10.18112/openneuro.ds001635.v1.0.1.