Abstract
Interpreting the neural response elicited during task functional magnetic resonance imaging (fMRI) remains a challenge in neurodevelopmental research. The monetary incentive delay (MID) task is an fMRI reward processing task that is extensively used in the literature. However, modern psychometric tools have not been used to evaluate measurement properties of the MID task fMRI data. The current study uses data for a similar task design across three adolescent samples (N = 346 [Agemean 12.0; 44 % Female]; N = 97 [19.3; 58 %]; N = 112 [20.2; 38 %]) to evaluate multiple measurement properties of fMRI responses on the MID task. Confirmatory factor analysis (CFA) is used to evaluate an a priori theoretical model for the task and its measurement invariance across three samples. Exploratory factor analysis (EFA) is used to identify the data-driven measurement structure across the samples. CFA results suggest that the a priori model is a poor representation of these MID task fMRI data. Across the samples, the data-driven EFA models consistently identify a six-to-seven factor structure with run and bilateral brain region factors. This factor structure is moderately-to-highly congruent across the samples. Altogether, these findings demonstrate a need to evaluate theoretical frameworks for popular fMRI task designs to improve our understanding and interpretation of brain-behavior associations.
Keywords: Neurodevelopment, Monetary incentive delay task, Reward processing, Measurement invariance, Adolescence, Functional magnetic resonance imaging
1. Introduction
The way in which phenomena are numerically represented in a study plays a critical role in the interpretations that a researcher can make. For over half a century, psychology has grappled with measurement issues to improve the reliability and interpretation of results (Association American Educational Research et al., 2014, Borsboom, 2006, Cronbach and Meehl, 1955, Flake and Fried, 2020, Lilienfeld and Strother, 2020). While not a core concern in the initial work using functional magnetic resonance imaging (fMRI), measurement issues such as reliability (i.e., precision of measurement) have become an increasing concern in the neuroimaging community (Bennett and Miller, 2010, Elliott et al., 2020, Nikolaidis et al., 2022, Noble et al., 2019). However, the validity of measurements (i.e., the degree to which the measurement accurately reflects the target construct) has largely remained at the periphery.
Since the inception of the blood oxygen level dependent (BOLD) contrast (Kwong et al., 1992, Ogawa et al., 1990), neurodevelopmental researchers have adopted several experimental tasks to measure the neural processes associated with latent constructs of cognitive functioning. This includes the measurement of reward processing (Richards et al., 2013, Silverman et al., 2015) and cognitive control (Luna, 2009, Luna et al., 2015) to better link changes in brain and behavior across the developmental period from childhood to adulthood (i.e., adolescence [Sawyer et al., 2018]). To date, neurodevelopmental work has contributed to progress in understanding the developing brain (Bethlehem et al., 2022, Mills et al., 2016) and has defined several neurodevelopmental frameworks (Casey et al., 2019, Ernst, 2014, Steinberg, 2010). Despite this progress, the generalizability of neurodevelopmental findings (Sherman et al., 2018), predictive ability of the frameworks (Ernst et al., 2022) and reliability issues (Elliott et al., 2020, Noble et al., 2019) in brain-behavior studies using fMRI remain a barrier to its ability to inform theory, clinical practice and public policy.
To confidently interpret a brain-behavior association, it is crucial to understand how the neural response elicited during fMRI tasks reflects the specific mental process the researcher intended to measure (Francken et al., 2022). The monetary incentive delay (MID) task is a popular fMRI reward processing task that is extensively used in the adult and adolescent literature. Despite its popularity in brain-behavior research, the measurement properties of the task remain unclear. The purpose of the present study is to provide clarity about the measurement properties of the MID task by evaluating the degree to which a nearly identical MID task design elicits a neural response that can be and is measured in the same way across multiple adolescent samples. Holding preprocessing and analytic pipelines constant, both confirmatory (restricted, hypothesis-driven) and exploratory (unrestricted, data-driven) factor analyses are used to determine whether a range of definitions of the reward processing constructs (i.e., approach and avoidance) are related and interchangeable across samples. Results from these analyses will be valuable in understanding the generalizability of neural findings within and across MID task contrasts and adolescent samples.
1.1. Task FMRI in correlational research: methodological and conceptual challenges
A primary goal of contemporary neurodevelopmental frameworks is to explain brain-behavior associations across samples and conditions (Ernst et al., 2022, Rosenberg et al., 2018). The range of operationalizations of a given construct, such as reward, in fMRI pose methodological and conceptual challenges, as different operationalizations of variables (e.g., contrasts from task fMRI) may not always map onto postulated construct(s).
One methodological challenge is designing fMRI tasks that can truly isolate specific mental processes (Poldrack and Yarkoni, 2016) such as sensitivity towards rewards (i.e., researcher’s construct of interest). In the simplest form, BOLD signals obtained during two experimentally manipulated conditions (e.g., low-probability/high-reward vs. high-probability/low-reward) of a task are compared to obtain scores that measure between-person differences in how a presumed phenomenon, such as motivation to approach rewarding stimuli, is manifested. The assumption is that subtraction of the estimated signals between trial types is a valid numerical representation of a specific interindividual difference in the construct of interest: arousal and allure of high reward stimuli that drives approach behaviors. However, this assumption is often violated because a contrast score does not always produce a measurement that accurately captures the latent mental process (Hanson, 2022, Poldrack, 2010).
One conceptual challenge is the under-discussed gap between the experimental and correlational psychology frameworks (Cronbach, 1957). Whereas experimental psychologists aim to control their conditions to find ordered structure in the behavior of their subjects/participants through experimental designs (Skinner, 1956), correlational psychologists take advantage of naturally occurring individual differences between participants to infer associations, or networks of associations, among variables. In the case of neurodevelopmental research that leverages both frameworks, ensuring the consistency of the interpretation(s) across brain-behavior studies requires confidence that (a) the fMRI task is a valid measure to isolate the mental process of interest1 and (b) there is converging evidence across different measures of a given construct reflecting the direction2 of associations that are consistent with the theoretical framework. While a mental process from an experimental task can be demonstrated to occur across administrations (Caplan, 2007, Henson, 2005), it is crucial for neurodevelopmental researchers using or developing tasks for correlation brain-behavior studies to validate that the structure and function of the construct aligns with the theory (Cronbach and Meehl, 1955, Flake et al., 2022, Lilienfeld and Strother, 2020). Failing to undertake this challenging work may lead to misinterpretation of the resulting brain-behavior findings across studies. Engaging in these efforts may also help standardize the verbal descriptions and language of the measured, complex mental processes.
In the neurodevelopmental literature, consistency of measurement is often established by citing work for a particular construct, such as reward (as author MD here has done in [Demidenko et al., 2020]) or motivational processing during the MID task (as in [Bjork et al., 2010]), through reference to studies that use a range of fMRI task designs and contrasts. In principle, this is a reasonable approach to justify and form a testable hypothesis about the association among constructs based on coherence across the literature. There are indeed a wide range of tasks (Richards et al., 2013, Sherman et al., 2018) and contrasts (Tervo-Clemmens et al., 2020) being used in the neurodevelopmental literature. The diversity in tasks and contrasts is both a strength and limitation. Measurement diversity is a strength because it provides the potential for convergent evidence across different reward types, such as instrumental-reward tasks (e.g., MID task) or decision-making tasks (e.g., Wheel of Fortune) (Richards et al., 2013), contributing inferences about the overarching construct. Measurement diversity is a limitation because each study typically measures the construct of interest using a single task or set of contrasts, so the relations among the different operational measures of what is assumed to be the same construct are often unclear. In practice, the postulated mental process reflected in the specific study is accepted, ‘loosely connected’ (Haeffel, 2022, p. 2) and rarely confirmed empirically, which is problematic when testing theories (Hull, 1952, Uttal, 2003).
1.2. Tasks and contrasts: uncertainty about the measured process
Motivational processing tasks are common tools used to study brain-behavior associations in the neurodevelopmental literature. One hypothesis is that an individuals’ motivational processing manifests as activation of distinct brain regions comprising approach and avoidance systems. Approach systems are thought to manifest in striatal and orbitofrontal cortex regions, whereby the function of these systems is to track the appetitiveness, incentive salience and motivation to approach stimuli. Avoidance systems are thought to manifest in amygdalar, hippocampal and insular regions, whereby the function of these systems is to track aversiveness of stimuli, negative valence, and motivation to avoid stimuli (Richards et al., 2013). Both systems are developmentally important in that to they are hypothesized to have implications for when and how substance use and risk-taking may emerge and progress over time (Doremus-Fitzwater et al., 2010, Ernst, 2014, Ernst and Luciana, 2015, Luciana et al., 2012). While there are distinct questions that can be formed based on the motivational processing literature, there are several challenges in interpreting and comparing functional activation across the neurodevelopmental studies.
Take for instance, the proposition that scores obtained from reward tasks used in fMRI can numerically represent a component of motivational processing (Schultz, 2015) that is important in neurodevelopmental research, such as reward anticipation (Silverman et al., 2015). Studies evaluating brain-behavior associations using the MID task have used a diverse set of contrasts to measure motivational processing and test their brain-behavior hypotheses in the context of reward anticipation. For example, in a late-adolescent sample, individual differences in activity in the nucleus accumbens (NAcc) during combined Large and Small Gain versus Neutral contrast were related to individual differences in marijuana use (Martz et al., 2016) whereas NAcc activity during the Large Gain versus Neutral stimuli contrast was not related to differences in a broad measure of substance use (Demidenko et al., 2020). In mid-adolescents, activity in the NAcc during the Large Gain versus Small Gain contrast was related to differences in alcohol use only among high sensation seekers in one study (Büchel et al., 2017) and only tobacco use in another study (Karoly et al., 2015). Some may argue that the same construct – approach motivation – was measured in all these studies, however, without empirical tests of measurement structure of the fMRI task these differences in brain-behavior relations becomes difficult to interpret.
While meta-analyses of reward tasks have provided comprehensive reviews of brain regions involved in different components of reward processing (Bartra et al., 2013, Chen et al., 2022, Clithero and Rangel, 2014, Diekhof et al., 2012, Dugré et al., 2018, Flannery et al., 2020, Lutz and Widmer, 2014, Oldham et al., 2018, Silverman et al., 2015, Wilson et al., 2018), there are few studies to date that evaluate within- and between-study measurement properties of brain-brain associations in these tasks. This, in part, is because most of the studies using reward fMRI tasks to measure motivational processing have obtained data from small samples using a wide variety of study designs, making the assessment of measurement properties difficult. However, some tasks, such as the MID, have a long history in the reward literature and have been administered in numerous studies, making them a good target for investigation of measurement properties.
The MID task (Knutson et al., 2000) was designed to elicit robust activation of reward regions during reward anticipation and reward consumption (Knutson and Heinz, 2015). The task has been cited as, ‘the most widely used probe of human incentive motivational processing by nondrug rewards’ (Bjork, 2020, p. 2). The task has been reported to relate to affective ratings in adult and adolescent samples (Bjork et al., 2004, Knutson et al., 2001, Wu et al., 2014) and has been utilized extensively throughout the neurodevelopmental and substance use literature (Balodis and Potenza, 2015, Bjork, 2020). Different variants of the task are currently used in large consortium (Casey et al., 2018, Schumann et al., 2010) and longitudinal studies (Murray et al., 2020, Yau et al., 2012). In the context of neurodevelopmental research, the MID task is used to evaluate changes in reward-related activity during adolescence and how these changes in neural activity are associated with outcomes including risk-taking and substance use (Casey et al., 2018, Bjork, 2020).
One hypothesized theoretical framework of the MID task that has been proposed decomposes the anticipatory phase into two distinct constructs. In short, cues in the task elicit approach and avoidance signals (Knutson et al., 2014, Knutson and Greer, 2008), whereby levels of approach are presumed to manifest as activity in the NAcc whereas levels of avoidance are presumed to manifest as activity in the anterior insula (AI). During the anticipation phase, anticipatory gain and loss cues often differentially activate the NAcc, AI, medial premotor and dorsal striatal regions (Bjork et al., 2004, Knutson et al., 2003). Meta-analyses, based mostly on adult samples, have provided some evidence for the proposed hypothesis: activity in the bilateral AI is higher during anticipatory loss versus gain conditions and the activity in the right AI and bilateral NAcc are higher for anticipatory gain cues than for neutral cues (Knutson and Greer, 2008). However, this theoretical framework has not been empirically tested using the tools of modern psychometrics. In fact, it remains unclear whether these are discernible components of reward observed in the between-subject neural activity during the anticipatory phase and how consistent this measurement property is across adolescent samples.
Understanding the measurement properties of the MID task is critical to making valid interpretations about neural changes and their association with behavior across adolescence. In human and animal models, there are significant biological and behavioral changes with the onset of pubertal development during adolescence (Dahl et al., 2018, Forbes and Dahl, 2010, Spear, 2004). In particular, puberty is marked by changes in dopamine receptors (Spear, 2000) in reward-relevant regions that are important to motivational processing and risk-taking behaviors (Luciana et al., 2012, Pfeifer and Berkman, 2018, Silverman et al., 2015). While the MID task has repeatedly been shown to activate brain regions hypothesized to be meaningful to approach and avoidance systems (Chen et al., 2022, Oldham et al., 2018), there is little empirical evidence confirming that the hypothesized approach and avoidance systems are observed in the MID task consistently across adolescent samples.
Given the popularity of the MID task (Bjork, 2020), evaluating psychometric properties of the task would be valuable to researchers asking developmental and brain-behavior questions. If the between-subject covariation between manifest variables from the MID task is not consistent across adolescent and adult groups, the differences observed between different age groups may be attributed to a construct (i.e., motivational processing) when in fact the difference may be due to differential item or test functioning (Association American Educational Research et al., 2014). Second, if the between-subject covariation between the manifest variables from the MID task demonstrates a different number of factors and different structure across the samples, this may indicate that the studies are measuring meaningfully different constructs. In both cases, the bias in the measure may be inadvertently attributed to differences observed between groups, samples or brain-behavior associations (Borsboom, 2006).
2. Current study
To date, meta-analyses and systematic reviews of reward processing have spanned diverse sets of tasks, task contrasts (Sherman et al., 2018, Silverman et al., 2015, Tervo-Clemmens et al., 2020) and different software, preprocessing and analytic pipelines have the potential to impact results in adolescent and adult samples (Botvinik-Nezer et al., 2020, Carp, 2012). The neurodevelopmental cognitive neuroscience literature would benefit from understanding the extent to which measurement properties generalize across multiple adolescent samples that collected data using the same reward task. The Adolescent Brain Cognitive Development (ABCD; [Casey et al., 2018]), Michigan Longitudinal Study (MLS; [Yau et al., 2012]) and Adolescent Health Risk Behavior (AHRB; [Demidenko et al., 2020]) studies collected data using a nearly identical MID task. These MID task data allow for an empirical test of the measurement properties of the task and its invariance across studies. This would be an important first step in understanding how task results can and can’t be interpreted within and between studies using this version of the design. Here, the theoretical framework proposed in Knutson et al. (2014) is put to a severe test and data-driven measurement properties are evaluated for the MID task across three adolescent fMRI samples.
First, the degree to which a primary theoretical framework of the MID task is empirically supported across samples is evaluated using a restrictive and semi-restrictive model. Aim 1a evaluates the degree to which task contrasts reflect the hypothesized latent factors, approach and avoidance. This is achieved by fitting a confirmatory (restricted) factor (CFA) model to six anticipation task contrast measures for four brain regions of interest (ROIs). The model articulates the hypothesis that contrasts at these hypothesized regions (e.g., NAcc and AI) load onto two factors, approach and avoidance, and the hypothesis that the same structure is present in each adolescent sample. The hypothesized structure is based on the theoretical framework proposed in Knutson et al. (2014) and is represented by the diagram in Fig. 1B. In Aim 1b, given the likelihood of non-zero cross-loadings across brain regions a less restrictive CFA is fit using the exploratory structural equation model (ESEM). Like CFA, ESEM fits an a priori factor model. Unlike the specific CFA model in Aim 1a, ESEM allows for empirically driven non-zero cross-loadings, providing a semi-restrictive test. Like Aim 1a, Aim 1b fits the ESEM model to examine invariance across the three samples of participants. The hypothesized two factor model is consistent with Fig. 1B, except for adding data-driven non-zero loadings.
Fig. 1.
Brain Regions (A), Conceptual Figure for Confirmatory Factor Analysis (CFA) and Exploratory Structural Equation Model (B). Harvard-Oxford Bilateral Nucleus Accumbens (NAcc; Yellow) and Anterior Insula (Red) regions imposed on MNI152 2 mm brain. R = Right; L = Left; A = Anterior; P = Posterior.
Second, because the hypothesized factor structure in Aim 1 may be an incorrect representation of the estimated neural activity, a data-driven factor structure of MID task contrasts is evaluated and compared across samples. In Aim 2, exploratory (unrestricted) factor (EFA) is used to evaluate the structure of the measures using a data-driven approach across the three adolescent samples. This allows for the flexibility in the number of factors and the item loadings. Due to the exploratory nature, no specific hypotheses are made but EFA is run on each study separately and compared qualitatively and quantitatively.
Pubertal development coincides with changes in motivational processing during adolescence (Forbes and Dahl, 2010). Therefore, in Aim 3, the effects of pubertal development are considered. Local SEM (sliding window) is used to examine how the factor loadings change across the self-reported pubertal developmental scale in the ABCD study, to evaluate whether measurement properties from Aim 2 stabilizes as participants’ self-reported pubertal maturity increases. Due to the unclear evidence of how pubertal development may impact the measurement properties of the MID task, no hypotheses are specified.
The introduction, hypotheses, methods, and analytic code (fMRI analyses using python, and CFA, ESEM, EFA and LSEM models using R) were pre-registered and are openly available online (https://osf.io/a6t8k; Demidenko et al., 2023).
3. Methods
3.1. Participants
3.1.1. Michigan Longitudinal Study (MLS)
The MLS is a longitudinal community study that is designed to characterize neural differences between participants from families with (2/3) and without a parental history of alcohol use disorder. The MLS study includes two cohorts, a Child Risk and Neuropsychological Risk cohorts. The focus here is on the first available session of neuroimaging for the Neuropsychological Risk cohort (Martz et al., 2016). For a full description of the study purpose see Zucker et al. (2000). MLS had a total of 177 participants (Age 18 - 23 years old) in the Neuropsychological Risk cohort that completed the imaging protocol. Participants who have the necessary structural, functional, and behavioral data and pass quality control steps are retained for the final analyses.
3.1.2. Adolescent Health Risk Behavior (AHRB) study
The AHRB study is a longitudinal multi-phase study that is designed to characterize neural differences of risk-taking behaviors (Demidenko et al., 2020). The focus here is on the neuroimaging phase (Phase 2). For Phase 2 Wave 1, participants were recruited based on Phase 1 Wave 1 self-reported risk behaviors. AHRB recruited N = 115 high and average/low risk-taking adolescents (Age 17 - 21 years old) to complete the neuroimaging protocol. Of the 115 participants eligible for inclusion, seven participants could not undergo magnetic resonance imaging (MRI) so 108 completed the imaging protocol. Participants who have the necessary structural, functional, and behavioral data and pass quality control steps are retained for the final analyses.
3.1.3. Adolescent Brain Cognitive Development (ABCD) study
The ABCD study is a longitudinal national study that is designed to study the ways socioeconomics and ethnicity affect substance use, mental health, and health outcomes across development (Volkow et al., 2018). The ABCD study recruited 11,878 adolescents across 21 sites (Casey et al., 2018). The focus here is on the 4.0 brain imaging data that is released by the ABCD-BIDS Community Collection (ABCC; Feczko [ et al., 2021]), which as of January 2023 (following stage 1 in principal acceptance) contains year 2 imaging data from approximately 7000 participants (Age 11 - 13 years old). Given the reported effects of motion on reliability of task fMRI measures in the younger ABCD sample (Kennedy et al., 2022) participants that have a reported mean framewise displacement value for the MID that is > .90 are excluded from our analysis. The ABCD sample (N = 11,878) is much larger than the MLS and AHRB samples, which poses an imbalance issue when testing for differences in factor structure across samples. To minimize this, a subsample of 1000 participants are randomly selected across sites (to alleviate site effects) and low motion subjects are excluded. Independent of the first 1000 randomly selected participants, another subsample of 1000 participants are randomly selected to use as a held-out sample to evaluate the performance of the EFA and LSEM in the ABCD data. Thus, a total of 2000 adolescents are subsampled from the full ABCD sample data and participants who have the necessary structural, functional, and behavioral data and pass quality control steps are retained for the final analyses.
3.2. FMRI task & regions of interest
3.2.1. Monetary Incentive Delay (MID) task
Across the ABCD, AHRB and MLS studies, reward processing is measured using comparable versions of the modified Monetary Incentive Delay (MID) task. The task and their nuanced differences are described below, and schematics provided in Fig. 2.
Fig. 2.
Motivational Processing: Schematics of the Monetary Incentive Delay (MID) task from the ABCD, AHRB and MLS samples.
The MID task (Knutson et al., 2003) is used to model neural signatures of the anticipation and receipt of monetary gains or losses. The modified MID task from ABCD and AHRB study (Casey et al., 2018) are identical in design. Apart from some minor differences in fixation/probe/feedback durations, the version of the MID task in the MLS (Yau et al., 2012) is nearly identical to the ABCD and AHRB studies. Consistent across the ABCD, AHRB and MLS studies, during the MID task each trial starts with a cue type and consists of three phases: anticipation, probe, and outcome (that is, feedback).
Details about the tasks across the three studies are summarized in supplemental Table S1. In the ABCD (Casey et al., 2018) and AHRB (Demidenko et al., 2020) versions of the modified MID task, the cues are differentiated by color (“Win” cue = Pink; “Dont Lose” cue = Yellow; “No Money at Stake” cue = Teal). There are twelve trial orders of the task, consisting of 50 contiguous trials and 10 trial types per run (5:42 min long). In the MLS version of the MID task (Martz et al., 2016, Yau et al., 2012), the cues are differentiated by color in grayscale (“Win” cue = Red; “Dont Lose” cue = Blue; “No Money at Stake” cue = White). Trials are presented in pseudorandom order, there are 50 contiguous trials and 100 trials total (lasting approximately 5 min long). In all three studies the task difficulty is individualized by first obtaining a mean response time during a practice trial of the task outside of the scanner. The task is individualized to reach a 60% accuracy rate by adjusting the difficulty (i.e., probe duration). For example, in the ABCD/AHRB study, the difficulty is adjusted every third incentivized trial based on the overall accuracy for the last six trials.
In the ABCD, AHRB and MLS study, participants are compensated with a fixed amount of money to participate in the study and receive additional money in accordance with their performance during the MID task. The average during the baseline data in the ABCD sample is $21.43 (Casey et al., 2018) and the maximum in the AHRB sample is $30 (Demidenko et al., 2020), specific details for average or maximum compensation in the MLS sample have not been reported. Compared to other approaches, such as rewarding “points” that can be exchanged for candy (Cao et al., 2019), a monetary incentive is vital in motivational processing during the task if measuring arousal and valence. Several contrasts are traditionally modeled in the MID task, for more details see Oldham et al. (2018).
3.2.2. Regions of interest (ROI)
The focus of this study is to evaluate the static mapping (Knutson et al., 2014) of the MID task. Following the pre-specified theoretical structure (Fig. 1), we limit the problem space to two bilateral brain regions: the left and right nucleus accumbens (NAcc) and the left and right anterior insula (AI). The regions are defined using the Harvard-Oxford Cortical and Subcortical atlas (50thr). In the Harvard-Oxford Cortical atlas, the insula is not constrained to the AI. Therefore, the term ‘Anterior Insula’ is used to identify the probability map on neurosynth.org on October 8th, 2022. The associated AI map (.nii.gz) is downloaded, thresholded at ∼50 % (> 8) and used to mask the left and right ROI from the Harvard-Oxford Cortical atlas to the anterior portions. The resulting left/right NAcc and left/right AI ROIs are used to extract the mean signal intensity values (from the contrast beta maps described below) for the subsequent analyses.
The NAcc and AI regions selected here are consistent with brain regions commonly described in the reward circuitry (Bissonette et al., 2014, Gentry et al., 2019, Haber, 2017), those indicated as meaningful to the anticipatory phase of the MID task (Knutson et al., 2014, Oldham et al., 2018) and the regions associated with the terms ‘reward anticipation’, ‘approach behaviors’ and ‘avoidance behaviors’ using the NeuroQuery tool (Dockès et al., 2021).
3.3. MRI acquisition, preprocessing and analysis
3.3.1. Acquisition
The acquisition details for the ABCD, AHRB and MLS datasets are summarized in supplemental Table S2.
3.3.2. Preprocessing anatomical and functional data
Structural and functional MRI preprocessing is performed using fMRIPrep 23.1.0 (Esteban et al., 2019, Esteban et al., 2022; RRID:SCR_016216), which is based on Nipype 1.8.3 ((Gorgolewski et al., 2011, Gorgolewski et al., 2018); RRID:SCR_002502). Preprocessing between the ABCD, AHRB and MLS datasets are held constant except for two differences. The MLS study does not collect fieldmaps and uses a spin-echo acquisition. Therefore, fMRIPrep’s fieldmap-less distortion correction is not possible to perform on the MLS BOLD data. For all ABCD and AHRB data, fieldmap distortion correction is used when the participants do have fieldmap data. In addition, the time-repetition (TR) in MLS is 2000 ms and ABCD/AHRB is 800 ms, due to the longer repetition-time in the MLS sample slice-time correction is applied in MLS and but not in ABCD/AHRB. Outside of these exceptions, the preprocessing of the BIDS data use comparable pipelines.
Preprocessing of anatomical data. T1-weighted images are corrected for intensity non-uniformity (INU) with N4BiasFieldCorrection (Tustison et al., 2010), distributed with ANTs 2.3.3 (Avants et al., 2008, RRID:SCR_004757) and used as T1w-reference throughout the fMRIPrep workflow. The T1w-reference is then skull-stripped with a Nipype implementation of the antsBrainExtraction.sh workflow (from ANTs), using OASIS30ANTs as target template. Brain tissue segmentation of cerebrospinal fluid (CSF), white-matter (WM) and gray-matter (GM) is performed on the brain-extracted T1w using fast (FSL 6.0.5.1:57b01774, RRID:SCR_002823, Zhang, [ et al., 2001]). Brain surfaces are reconstructed using recon-all (FreeSurfer 7.2.0, RRID:SCR_001847, Dale, ] et al., 1999]), and the brain mask estimated previously is refined with a custom variation of the method to reconcile ANTs-derived and FreeSurfer-derived segmentations of the cortical gray-matter of Mindboggle (RRID:SCR_002438, Klein, [ et al., 2017]). Volume-based spatial normalization to one standard space (MNI152NLin2009cAsym) is performed through nonlinear registration with antsRegistration (ANTs 2.3.3), using brain-extracted versions of both T1w reference and the T1w template. The following template is selected for spatial normalization: ICBM 152 Nonlinear Asymmetrical template version 2009c (Fonov [ et al., 2009], RRID:SCR_008796; TemplateFlow ID: MNI152NLin2009cAsym).
Preprocessing of functional data. For each of the 2 BOLD functional runs, the following preprocessing is performed. First, a reference volume and its skull-stripped version are generated using a custom methodology of fMRIPrep. Head-motion parameters with respect to the BOLD reference (transformation matrices, and six corresponding rotation and translation parameters) are estimated before any spatiotemporal filtering using mcflirt (FSL 6.0.5.1:57b01774, Jenkinson, [ et al., 2002]). The estimated fieldmap [in ABCD and AHRB] is then aligned with rigid-registration to the target EPI (echo-planar imaging) reference run. The field coefficients are mapped on to the reference EPI using the transform. In the MLS data, BOLD runs are slice-time corrected to 0.966 s (0.5 of slice acquisition range 0 s-1.93 s) using 3dTshift from AFNI (Cox and Hyde [, 1997], RRID:SCR_005927). The BOLD time-series (including slice-timing correction when applied) are resampled onto their original, native space by applying the transforms to correct for head-motion. The BOLD reference is then co-registered to the T1w reference using bbregister (FreeSurfer) which implements boundary-based registration (Greve and Fischl, 2009). Co-registration is configured with six degrees of freedom. Framewise displacement (FD) is computed using two formulations following Power (absolute sum of relative motions, Power [ et al., 2014]) and Jenkinson (relative root mean square displacement between affines, Jenkinson [ et al., 2002]). FD and DVARS are calculated for each functional run, both using their implementations in Nipype (following the definitions by Power [ et al., 2014]). Principal components are estimated after high-pass filtering the preprocessed BOLD time-series (using a discrete cosine filter with 128 s cut-off) for anatomical CompCor (aCompCor). For aCompCor, three probabilistic masks (CSF, WM and combined CSF+WM) are generated in anatomical space. The head motion (translations and rotations) estimates are expanded with the inclusion of temporal derivatives and quadratic terms for each (Satterthwaite et al., 2013) and are included with the aCompCor components and cosine estimates as nuisance regressors in the first level design matrix. The BOLD time-series are resampled into standard space, generating a preprocessed BOLD run in MNI152NLin2009cAsym space.
3.3.3. Analyses
First level, second level and region of interest analyses are performed using Python 3.9.7 and Nilearn version 0.9.2 (Abraham et al., 2014).
In the first level analysis, a design matrix is specified for each participant. The design matrix includes the 15 task-relevant regressors (see task schematic in Fig. 2) and a number of nuisance regressors that are calculated by fMRIPrep. The task relevant regressors include the onset and duration of the five anticipatory cue types (e.g., Large Gain or No Money at Stake) and ten feedback cue types (e.g., You won $5 or You did not win $5), convolved with glover hemodynamic function (HRF). The nuisance regressors include estimated translation and rotation (+ derivatives & squared derivatives) of head motion, the first eight aCompCor noise components and the corresponding cosine regressors for high pass filtering (with a cutoff of 128 s) that are calculated by fMRIPrep (see preprocessing of functional data). Individual BOLD timeseries are prewhitened using an ‘ar1’ noise model and spatially smoothed using a 5 mm full-width half-maximum Gaussian kernel. A GLM is fit to estimate the brain masked BOLD timeseries by task-relevant conditions in the design matrix. For each individual subject and run, six contrasts are computed using the estimated beta (supplemental Fig. S1):
-
1.
Anticipation Big Gain versus Neutral
-
2.
Anticipation Big Gain & Small Gain versus Neutral
-
3.
Anticipation Big Gain versus Big Loss
-
4.
Anticipation Big Loss versus Neutral
-
5.
Anticipation Big & Small Loss versus Neutral
-
6.
Anticipation Big Loss versus Big Gain
For each of the six contrasts and two runs, four ROI mean signal intensity values are estimated. For each of the four ROIs, voxels within the ROI mask for a given MNI contrast beta map are extracted and averaged. If a participant’s beta map contains < 50 % of the voxels in an ROI, the participant is excluded from the analyses. Across the three samples, this produces a dataset of Six Contrasts x Four ROIs for each run across N participants.
For each study (i.e., ABCD, AHRB, MLS), a second level model is used to estimate whole brain activation maps. For each participant, contrast estimates from the two MIDs runs are averaged using a weighted fixed effects model and a group level mixed effect model averaged the within-subject estimates across subjects. These maps are used as measures of average activation patterns during the MID tasks in each of the studies across contrast types. The Big Gain versus Neutral contrast for each study is presented in-text, the remaining contrast types are provided in supplemental materials and publicly on NeuroVault.org. Due to the influence of sample size on the t-statistic, each whole brain contrast map across samples is converted to a Cohen’s d effect size using the formula: . Cohen’s d, t-/z-statistical maps are shared on NeuroVault. Since no inferences are meant to be drawn from these maps, and the data are primarily used for qualitative comparison of the task activation across the samples, no cluster correction is applied to resulting statistical maps.
3.4. Data analysis
3.4.1. Restricted, semi-restricted and unrestricted factor analysis
All subsequent Confirmatory Factor Analysis (CFA, Restricted), Exploratory Structural Equation Modeling (ESEM, Semi-restricted), Exploratory Factor Analysis (EFA, Unrestricted), and Local Structural Equation Modeling (LSEM) are conducted in R version 4.2.1 (R R Core Team, 2022). These analyses are performed on the extracted mean signal intensity values for the four ROIs across six contrast types (in each of the two runs) for participants from the ABCD, AHRB and MLS samples for data that pass quality control. Before running the structural models, the associations among all ROIs across contrast types are evaluated using product-moment correlations. To reduce the number of redundant parameters in the models, left and right ROIs whose correlations exceed r = .85 are constrained to be equal in the model.
CFA and multi-group CFA are performed as implemented in the lavaan package (Hirschfeld and Brachel, 2014, Rosseel, 2012, Rosseel et al., 2021) and used to describe how task contrast variables reflect the hypothesized factors and to test the invariance of that description across the three adolescent samples (Aim 1a). An a priori model is specified using lavaan syntax depicting the item level indicators across contrasts, runs and ROIs (see Fig. 1 for specified model) as indicators of two exogenous latent variables, approach, and avoidance, that may be correlated. Models were identified by fixing one of the factor loadings for each latent factor to = ‘1’ and estimated using the maximum likelihood estimator (ML). The CFA model comprising all three samples specified the same factor structure across the combined three samples and so is used as the strict invariance model comparison (i.e., all constraints are equal). In the second step, configural invariance (factor structure) is invoked by first including ‘group’ (i.e., ABCD, AHRB, MLS) in the CFA model to evaluate structural invariance across the three samples. If the models show adequate fit, in the third step, metric (item loadings) invariance is invoked by adding a group equality constraint of ‘loadings’ (e.g., weak invariance), with invariance of the factor loadings across groups tested via nested model comparison of model fits, using χ2 difference. Given the lack of measurement models of fMRI ROI mean signal intensity data, we do not plan to test strong invariance, such as scalar invariances (D’Urso et al., 2022, Putnick and Bornstein, 2016). Individual and multigroup model parameter estimates are reported and a table with the χ2 statistic, degrees of freedom, p-value, root mean squared error of approximation (RMSEA), comparative fit index (CFI), Tucker-Lewis index (TLI) and standardized root mean square residual (SRMR) is provided. While traditional dynamic fit index cutoffs are debatable (McNeish and Wolf, 2021), model fit cuts-offs are used to determine whether a model has ‘excellent fit’ (i.e., RMSEA ≤.05, CFI ≥.95, TLI ≥.95, and SRMR ≤.05). These cut-offs are used in the fMRI connectivity literature (Beltz and Gates, 2017) and are recommended cut-offs for CFA (Brown, 2006). Models exhibiting poor fit (i.e., RMSEA ≥.10, CFI/TLI ≤.90, and SRMR ≥.10) will not be interpreted. Change in model fit indices in CFI (≤.005), RMSEA (≥.01) or SRMR (≥.025), and AIC/BIC differences via model comparisons (i.e., anova) are used as indicators whether models are worse fits to the data (Chen, 2007; Meade et al., 2008). These indicators are used to provide converging evidence about improvements in model fit and disagreements are contextualized in text.
ESEM is performed as implemented in the esemComp package (Silvestrin and de Beer, 2022). ESEM is used to test a less restrictive model (Aim 1b). As previously described (Marsh et al., 2014, Zyl and Klooster, 2022), measurement error and method covariance may contribute to non-zero cross-loadings that may negatively impact model fit. ESEM complements EFA with target rotation towards the CFA model, but allowing for cross-loadings of items that increase fit to the data. Like the CFA model, a two-factor ESEM model is fit to all the item level indicators across contrasts, runs and ROIs, with target rotation specified as the hypothesized structure in Fig. 1. The model is fit to the data using a ‘ML’ estimator with the variances of the latent variables fixed to ‘1’ for identification.
EFA is performed as implemented in the factanal R package. This data-driven approach is used to estimate an unrestricted (data-driven) factor structure in the data for each study separately (Aim 2). First, a scree-plot, Horn’s parallel analysis (Dinno, 2009) as implemented using the paran package (Dinno, 2018) and BIC estimate are used to converge on the number of factors in the data that is most parsimonious. Using an oblique, ‘promax’ rotation (not imposing orthogonalization on the data), item loadings are estimated for the specified number of factors. The EFA is performed on the individual samples (i.e., ABCD, AHRB and MLS). The results are reported using heatmaps to indicate item loadings onto the exogenous variables. The EFA models are quantitatively compared between datasets using the Tucker index of factor congruence (Lorenzo-Seva and ten Berge, 2006) as implemented using the factor.congruence() function in the psych package.
3.4.2. Effect of pubertal developmental and sex at birth on EFA results
The ABCD sample is composed of youth in the early-to-middle pubertal developmental stages. This may impact the factor structure of neural mechanisms derived from reward contrasts. Using the held-out sample (N = 1000), the EFA results from the above are invoked as a CFA structure for the ABCD data to evaluate potential impact of pubertal development on the factor structure (Olaru et al., 2020). To evaluate how the factor structure differs across pubertal development (Aim 3), local SEM is performed as implemented in the sirt package (Robitzsch, 2022). The pubertal developmental scale (range 1 – 5) is introduced as a moderator into the CFA to examine if and how the factor loadings differ/change across the levels of pubertal development. Given that there may be differences between child and parent reports of pubertal development in the ABCD sample (Demidenko, Kelly et al., 2022), a separate model is run for the youth self-reported pubertal development and parent-reported pubertal development measures. To provide supplemental information to the local SEM results, we evaluate measurement invariance of the CFA structure across the self-reported sex at birth, male and female, in the ABCD data.
3.4.3. Sensitivity analysis: site and sampling effect
Sensitivity analyses are performed to evaluate the effect of ABCD sampling and the effect of University of Michigan as a site on the CFA and EFA models.
First, the AHRB and MLS studies are both collected on 3T GE scanners at the University of Michigan by the MRI Laboratory technicians. One of the 21 ABCD sites includes the University of Michigan (‘site_id’ = 13). To check the extent to which scanner site contributes to differences in factor structure, the CFA and EFA are rerun using only the data from the University of Michigan sites (i.e., holding scanner type and location constant). These results are reported in the supplemental materials.
Second, the CFA a priori (Fig. 1) model is bootstrapped (N = 300) using the ABCD data and model fit statistics are extracted (as described above). Then, the EFA model is bootstrapped (N = 300) using the ABCD data to estimate the variability in the number of factors that best explain the data. The extracted statistics (i.e., model fit and number of factors) are then reported in supplemental materials to visualize stability in these CFA/EFA estimates.
4. Results
4.1. Deviations from the stage 1 registered report
The analyses below are consistent with the analyses proposed in the Stage 1 Registered Report with three minor and one major exception (https://osf.io/a6t8k). First, in the Stage 1 submission the fMRIPrep preprocessing pipeline proposed to perform slice-time correction on all three samples. However, the ABCC data does not contain the necessary slice-timing information in the associated json files. The fast temporal sampling enabled by the multi-band sequence in the ABCD and AHRB fMRI data reduces the importance of slice-timing corrections in the short TR data. Thus, slice-timing correction is used in the MLS preprocessing pipeline but is not used in ABCD and AHRB preprocessing pipelines. Second, fieldmap-less distortion correction could not be applied on the MLS data because it is collected using spiral acquisition. As the AHRB data do not contain missing fieldmaps and the ABCC data selects a single fieldmap within a session to apply on all the functional runs, fieldmap-less distortion correction is not used in the preprocessing pipeline. Third, in the Stage 1 submission we proposed using the maximum-likelihood robust (MLR) estimator when estimating the lavaan models. However, when applied to the data, this estimator had a high rate of singularity issues as compared to the standard maximum-likelihood estimator, specifically for the ESEM. Since the proposed EFA analyses use the maximum-likelihood (ML) estimator, the ML estimator is also used in the lavaan models. Finally, one major protocol deviation is the exclusion of GE scanner sites in the ABCC data. As of October 2023, the GE data contain fieldmap errors, which made the functional data not usable based on the exclusion criteria. Hence, all ABCD data from GE scanner sites are excluded and post hoc sensitivity analyses are performed to evaluate the effect of this deviation across different intervals of N in the non-GE ABCD data. The final Stage 2 code & analyses are available on Github3.
4.2. Sample selection: ABCD, AHRB and MLS
For the ABCD sample, the NDA contains N = 10,414 Year 2 follow-up subjects (NDA Data Package #1209276). From these data, two subsamples of N = 48 subjects for each of the 21 study sites are subsampled (First: N = 1008, Second: N = 1008). From these three subsamples, subjects are selected if they contain an a) anatomy, b) field map and c) MID data via ABCC as of January 2023. This results in N = 599 (Primary Sample), N = 595 (Held-out sample) and N = 157 (site-13) that have usable MRI data.
From the initial ABCD, AHRB and MLS samples, participants are excluded based on five metrics to arrive at the final analytic sample. These metrics are described in greater detail in supplemental Section 2.1.2. For the ABCD sample, Primary N = 253, Held-out N = 248 and Site-13 N = 157 are excluded resulting in the final three ABCD samples of N = 346, N = 347 & N = 0, respectively. For the AHRB sample, out of the N = 108 participants that have MRI data, 11 participants are excluded resulting in a final sample of N = 97. For the MLS sample, out of the N = 177 participants that have MRI data, 65 participants are excluded resulting in a final sample of N = 112. For more specific details for exclusion reasons, see supplemental Table S3.
4.3. Descriptive statistics and main effects
The demographic information for each study is reported in Section 2.1.3, Table S4. The ABCD, AHRB and MLS samples differ in their probe hit response accuracy (%) and probe response times (ms). The average probe hit accuracy during the MID task in the ABCD, AHRB and MLS are 57 % (SD = .04), 57 % (SD = .04) and 71 % (SD = .13), respectively. The average probe mean response times during the MID task in the ABCD, AHRB and MLS are 269 ms (SD = 44 ms), 298 ms (SD = 21 ms) and 205 ms (SD = 28 ms), respectively. For more details about the performance and meaningful differences on the MID task across the three samples, see supplemental Section 2.1.4, Fig. S4.
For each of the three samples, the Cohen’s d effect size maps for the Big Win > Neutral anticipatory contrast are reported in Fig. 3. The activation in the ABCD, AHRB and MLS samples is consistent with reported activation likelihood estimates in meta-analyses for MID task contrasts (Oldham et al., 2018). Group-level activity is observed in the bilateral dorsal and ventral striatum, AI, supplementary motor cortex and occipital regions. The magnitude of the effect in the dorsal and ventral striatum is largest in the MLS sample, followed by the ABCD and AHRB samples.
Fig. 3.
Group Cohen’s d activation maps for (A) ABCD, (B) AHRB and (C) MLS sample. Threshold: d min = .20 & max = 1.5. Map overlaid on MNI152NLin2009cAsym brain. Cross placed on MNI coordinate associated with Right NAcc [12, 8, –10]. Note: The ‘Win’ cue type in the ABCD/AHRB sample is Pink and in the MLS it is Red.
Cohen's d maps for the other five contrasts (ABCD, AHRB, MLS), and each of the ABCD scanners and sites are provided in supplemental Section 2.1.5, Fig. S5, Fig. S6 and Fig. S7. The associated t-/z-statistic and Cohen’s d maps are available at the NeuroVault Collection #15642 (https://identifiers.org/neurovault.collection:15649).
4.4. Aim 1: evaluating the a priori CFA model of the MID task
The goal of Aim 1 is to evaluate whether the proposed theoretical framework (Fig. 1B) for the MID task is empirically supported by task fMRI MID data using a restrictive (Aim 1a) and semi-restrictive model (Aim 1b). The proposed hypothesis is that the measurement structure, as visually represented in Fig. 1B, is empirically represented in the data and consistent across the three samples. The product-moment correlation matrix between the 28 manifest variables across the three samples is reported in Fig. 4.
Fig. 4.
Product-moment correlations between the mean signal intensity for the [L]eft/[R]ight Nucleus Accumbens and Anterior Insula across six contrast types for Run 1 and Run 2 in ABCD, AHRB and MLS samples. Note: Manifest variable [1–28] labels specified in Fig. 5.
4.4.1. Aim 1a: evaluating the confirmatory factor analysis model across the ABCD, AHRB and MLS samples
In Aim 1a, an a priori model is specified depicting the item-level indicators across runs, ROIs and contrasts representing the two exogenous latent variables, approach and avoidance. Using model fit statistics, the CFA (restrictive) model is used to evaluate whether the model is empirically supported and consistent across the three samples.
The strict model (as specified in Fig. 1B) is fit to the data, by fitting the a priori model to the combined mean signal intensity data (ABCD, AHRB and MLS). Based on the registered cut-off criteria (i.e., RMSEA ≥.10, CFI/TLI ≤ = 0.90, and SRMR ≥.10), the model fit statistics, χ2(354) = 23,971.2 p < .001; CFI = 0.12, TLI = .06, RMSEA = 0.35, SRMR = 0.20, indicate that the a priori strict model is a poor fit to these data. Next, configural structural invariance is evaluated across the three samples (or group structure) by only imposing the constraint that the factor structure is equivalent across the three samples (i.e., item loadings, intercepts and residual variances are permitted to vary across samples). The model fit statistics for the configural invariance model, χ2(1072) = 24,733.9, p < .001; CFI = 0.12, TLI = .07, RMSEA = 0.35, SRMR = 0.21, indicate that the configural invariance model is also a poor fit to these data (Table 1). The fits of the strict (BIC: –7595) and the configural invariance models (BIC: –7132) are not significantly different (χ2[718] = 762.8, p = .12). As the configural invariance model had poor fit, metric invariance with further equivalence constraints on item loadings is simply included as a reference in Table 1 which contains the complete model fit statistics.
Table 1.
Model Fits Statistics Across Confirmatory Factor Analysis (CFA) and Exploratory Structural Equation Modeling (ESEM).
χ2 | DF | p-value | RMSEA | SRMR | TLI | CFI | AIC | BIC | |
---|---|---|---|---|---|---|---|---|---|
Strict CFA(Reference) | 23,971.2 | 354 | <.001 | 0.35 | 0.2 | 0.06 | 0.12 | –7940 | –7595 |
Config MG-CFA | 24,733.9 | 1072 | <.001 | 0.35 | 0.21 | 0.07 | 0.12 | –8125 | –7132 |
Metric MG-CFA | 24,856.7 | 1104 | <.001 | 0.34 | 0.21 | 0.1 | 0.12 | –8066 | –7211 |
Overall ESEM | 23,358.4 | 323 | <.001 | 0.36 | 0.18 | 0 | 0.14 | –8491 | –8011 |
Config = Configural; MG = Multi-group; CFA = Confirmatory Factor Analysis; DF = Degrees of Freedom; RMSEA = root mean squared error of approximation; CFI = comparative fit index, SRMR = standardized root mean squared residual.
Based on the parameters in the strict and configural invariance models, as specified in Fig. 1B, the manifest variables do not form the latent constructs of approach or avoidance. For the strict CFA model (supplemental Section 2.2.1, Table S5 and Fig. S8A), 9 of 16 (56 %) manifest variables that are hypothesized to form the latent approach construct have βs ≤ .30. The pattern of low loadings is similar for the latent Avoidance construct, also in the strict model. These results indicate that the covariance structure in these data does not align with the a priori theoretical structure for an approach and avoidance measurement model.
4.4.2. Aim 1b: evaluating the exploratory structural equation model across the ABCD, AHRB and MLS samples
In Aim 1b, ESEM is used to fit an a priori factor model. Unlike the CFA model in Aim 1a, the ESEM allows for empirically driven non-zero cross-loadings. This is a less-restrictive model than the CFA, and so, the cross-loadings may improve the model misspecifications in this semi-restrictive framework. Like Aim 1a, model fit statistics are used to evaluate whether the model was empirically supported and consistent across the three samples. In the combined mean signal intensity data, as indicated by the fit statistics, χ2(323) = 23,358.4, p < .001, CFI = 0.14, TLI = 0.00, RMSEA = 0.36, SRMR = 0.18, the strict ESEM is a poor fit to the data. Also, the ESEM (BIC = –8011) is a poorer fit to the data than the strict CFA model (BIC = –7595), χ2(31) = 612.8, p < .001. Taken together, the CFA and ESEM provide empirical evidence that the a priori model, as specified in Fig. 1B, is a poor representation of the covariance structure for the twenty-eight manifest variables from the MID task fMRI data across the three samples. Thus, the hypothesis that the mean signal intensities from bilateral NAcc and AI from the two BOLD runs across the contrasts would load onto two factors, approach and avoidance, is not supported.
4.5. Aim 2: evaluating the data-driven specified structure of the MID task
Aim 2 uses an EFA (unrestricted) model to estimate the number of factors and the item loadings in a measurement model for the contrasts and brain regions obtained from the MID task across the ABCD, AHRB and MLS samples.
The heatmap in Fig. 4 represents the product-moment correlations among the mean signal intensity values for the bilateral NAcc and AI from the two BOLD runs across the six contrast types in the ABCD, AHRB and MLS samples. A visually comparable structure across the three samples can be observed. Based on parallel analyses and scree plots, a parsimonious six factors best represent the covariance matrices for the twenty-eight manifest variables in the ABCD and AHRB data, and seven factors best represent the covariance matrix in the MLS data. Results from the EFA are shown in Fig. 5; factor structure and loadings in Fig. 5A and the Φ factor intercorrelation matrix in Fig. 5B. (for example of structure and loadings diagram for ABCD, see supplemental Fig. S8B). Across the three samples, separate factors are indicated by the Run 1 and Run 2 measures of the contrasts for the bilateral NAcc. Similarly, the Run 1 and Run 2 measures of the contrasts for the bilateral AI are indicators of different factors.
Fig. 5.
(A) Exploratory Factor Analysis loadings for the ABCD, AHRB and MLS samples, loadings < .20 are removed from the EFA results; and (B) Factor Intercorrelation matrix (Φ). AD = ABCD; AB = AHRB; MS = MLS; R1 = Run 1, R2 = Run 2. L/R-Nacc = Left/Right Nucleus Accumbens; L/R-Ins = Left/Right Anterior Insula.
The quantitative similarity of the factors obtained from the EFA models in Aim 2 are reported in Fig. 6. Fig. 6A consists of factor labels as output by the data-driven process implemented in fa(). Fig. 6B, is expanded on in detail here, consists of the relabeled factors based on qualitative/quantitative similarity. The data-driven factors identified for the ABCD and AHRB data are highly similar, as indicated by the high congruence of the pattern of factor loadings. For example, both ABCD and AHRB contain factors that are highly congruent (see Fig. 6B for heatmap and supplemental Section 2.2.2, Table S7 for Congruence matrix) which are indicated by Run 1 NAcc Reward (AD4 ∼ AB4, r = .95), Run 2 NAcc Reward (AD3∼AB3, r = .96), Run 1 AI Reward versus Neutral (AD2 ∼ AB2, r = .96), Run 2 AI Reward versus Neutral (AD1 ∼ AB1, r = .97), and negative/positive loadings of Reward and Loss combinations for Run 1 (AD5 ∼ AB5, r = .94) and Run 2 (AD6 ∼ AB6, r = .95). Likewise, there is factor congruence between the MLS data and ABCD/AHRB factors as well. Specifically, Run 2 NAcc Reward (AD3 ∼ MS3, r = .98; AB3 ∼ MS3, r = .97), Run 2 AI Reward versus Neutral (AD1 ∼ MS1, r = .92; AB1 ∼ MS1, r = .91). However, the structure and loadings for the MLS results do differ from ABCD/AHRB. For example, the simple, single factor indicated by the Run 1 AI Reward in ABCD/AHRB (AD2, AB3), is distributed across two factors in the MLS data MS2 and MS5 (AD2 ∼ MS2, r = .79; AD2 ∼ MS5, r = .51; AB2 ∼ MS2, r = .90; AB2 ∼ MS5, r = .45). Consistent with findings from Aim 1, across the three samples, the EFA results suggest that the factor structure underlying the multidimensional associations among NAcc and AI regions across the six contrasts for the two runs in these data is not the a priori theoretical structure that is hypothesized in Fig. 1B. Rather, mean signal intensity in these regions and contrasts appear to be differentiated more by ROI (NAcc versus AI) and run (Run 1 versus Run 2) than approach and avoidance latent constructs.
Fig. 6.
Factor Congruence across (A) data-driven factors for ABCD (AD), AHRB (AB) and MLS (MS) samples and (B) relabeled data-driven factors based on multidimensional and qualitatively similarity.
4.6. Aim 3: effects of pubertal development on the ABCD EFA results
The ABCD sample contains participants that are at different stages of pubertal development. Since pubertal development coincides with changes in motivational processing during adolescence (Forbes and Dahl, 2010), LSEM is used to evaluate whether the factor loadings change across the self-reported pubertal development in the ABCD sample.
The ABCD EFA structure and loadings results from Aim 2 are used to inform the a priori model for the held-out sample in the LSEM analyses. A parsimonious four factor model is used that reflects an (1) Run 1 Avoidance AI, (2) Run 2 Avoidance AI, (3) Run 1 Approach NAcc and (4) Run 2 Approach NAcc. Based on the highly correlated residual variance within runs for several contrasts for the EFA results from Aim 2, in four cases for the NAcc and two cases of the AI, the a priori model included correlated residuals. While not having an ’excellent’ model fit based on the cut-offs, the held-out sample model fit statistics are meaningfully improved over the Aim 1 models, χ2(240) = 2117.1, p < .001, CFI = 0.77, TLI = 0.73, RMSEA = 0.15, SRMR = 0.10 (∆CFI = 0.65, ∆TLI = 0.67, ∆RMSEA = –0.20, ∆SRMR = –0.10). As we did not pre-register additional model fit improvement procedures, extra steps are not taken to prevent overfitting of the data in a sample size that may be insufficient to estimate all parameters (e.g., loadings, variance and residual covariances).
Based on the LSEM model, there is little consistent evidence that the parent or youth reported pubertal development moderates the factor loadings. In the parent LSEM permutation model (bootstraps = 1000), although the loading of the Right Big Win versus Big Lose Insula for Run 2 onto the NAcc Run 2 factor decreases across levels of PDS (p = .04), all other estimates are not significant (p > .05). Then, in the youth LSEM permutation model (bootstraps = 1000), while Right Big Win versus Big Lose Insula parameter loading does not significantly decrease across levels of PDS, the loading of Left/Right All Lose versus Neutral Insula for Run 2 onto the Insula Run 2 factor decreases significantly across levels of the PDS, all other estimates are not significant (p > .05). The global fit statistics are reported in supplemental Section 2.2.3, Fig. S9.
4.7. Sensitivity analyses
Several sensitivity analyses were pre-registered. First, to complement the LSEM analyses of whether pubertal stage moderated the factor loadings, a sensitivity analysis is performed to check for measurement invariance of ABCD EFA structure across sex (Aim 3). Second, with multiple samples being tested at the University of Michigan site, the CFA and EFA results were re-run for only the University of Michigan site. Third, the effect of sampling on the CFA fit statistics and the EFA number of factors in the ABCD data is evaluated.
4.7.1. Measurement invariance in ABCD for males/females
Overall, the factor structure is comparable across males and females. Fit statistics indicated comparable model fit to the data in the configural invariance model across sex, χ2(484) = 2444.7, p < .001, CFI = 0.76, TLI = 0.73, RMSEA = 0.15, SRMR = 0.11, to the strict CFA model for the held-out data, χ2(240) = 2133.0, p < .001, CFI = 0.77, TLI = 0.73, RMSEA = 0.15, SRMR = 0.10. Based on ANOVA comparisons, the sex configural model (BIC = –.7253.4) fits the data better than the strict held-out model (BIC = 7603.4), χ2(244) = 311.6, p = .002.
4.7.2. Effect of University of Michigan site on CFA and EFA results
The pattern of findings are consistent when only analyzing AHRB/MLS data from the University of Michigan (UM) site. The fit of the a priori model to the empirical data for the CFA model is similarly poor. For the CFA model, the strict and configural CFA models are a poor fit to the data (Strict: χ2(354) = 8843.5, p < .001, CFI = 0.13, TLI = 0.07, RMSEA = 0.34, SRMR = 0.20; Configural: χ2(713) = 9229.9, p < .001, CFI = 0.13, TLI = 0.08, RMSEA = 0.34, SRMR = 0.21). For the EFA analysis, the ABCC data did not have usable data for University of Michigan site (site 13), so the EFA analyses were not re-ran for ABCD UM only site. Thus, the comparison between UM sites is the same as is obtained for AHRB and MLS samples (see Fig. 4 and 5), in that the EFA identified the same 6-factor structure as found with the larger data.
4.7.3. Effect of sampling on CFA model fit and EFA factors
Sampling within the held-out ABCD sample, there is little effect on the CFA model fit statistics or the recommended factors for the EFA analysis. Specifically, across the bootstrapped datasets (N = 300) the RMSEA (M =.36, SD =.01) & SRMR (M =.21, SD =.01) had minimal variability. There is wider variability in the AIC (M = –4473.9, SD = 352.9) and BIC statistics (M = –4165.9, SD = 352.9). Finally, across the bootstrapped datasets the minimum and maximum recommended number of factors for the EFA are 6 and 7 (Mean: 6.2), respectively. The distribution of these estimates are reported in supplemental Section 2.3, Fig. S10.
4.8. Post hoc analyses
In the Aim 1 and Aim 2 results, the factor structure and covariance matrices indicate that the a priori ROI and contrast variables are more congruent within runs than across runs. To better understand this finding, run specific (i.e., Run 1 and Run 2) mean-level group maps for the Big Win > Neutral anticipation contrast are estimated for each of the three samples using the run level estimates in a group level mixed effect model. The difference [Run 1 - Run 2] is calculated between the resulting maps. Fig. 7A-C reports the difference between the group-level run maps for each sample. Across the three samples, the magnitude of the activity decreases from Run 1 and Run 2. To evaluate the voxelwise similarity of Cohen's d estimates between Run 1 and Run 2, pairwise_similarity() is used as implemented in the PyReliMRI package (Demidenko and Poldrack, 2023). The Spearman’s rank (rho) correlation coefficient between the run maps is reasonably high (Fig. 7D), ABCD Run 1 ∼ Run 2 rho = .82, AHRB Run 1 ∼ Run 2 rho = .72, and MLS Run 1 ∼ Run 2 rho = .80. This demonstrates that, on average, the group-level activation is relatively stable across samples and runs. However, the change in between-subject scores across runs may have substantially reduced the association between factors for each run.
Fig. 7.
Differences in estimated Cohen’s d group activity between Runs [Run 1 - Run 2] for (A) ABCD, (B) AHRB and (C) MLS samples, and (D) pairwise similarity (Spearman’s rho) between run Cohen’s d maps.
4.8.1. Protocol deviation for GE scanners
The Stage 2 analyses did not include the ABCD GE scanner data. The ABCC data contained an error that resulted in faulty fieldmap data for all GE scanners. As of October 2023, the fix is not implemented and so the data are not usable. To ensure the difference in sampling and the sample size in ABCD data does not impact the above CFA and EFA results, a sensitivity analysis is performed. The primary (N = 346) and held-out (N = 347) ABCD data were combined into a larger dataset (N = 693). Then, the combined ABCD data were randomly sampled from N = 50 to N = 1000 (intervals of N = 10). CFA fit statistics and the recommended number of factors were calculated at each sample size. Besides the AIC/BIC fit statistics which linearly scale with sample size, all other fit statistics did not meaningfully differ and the recommended number of factors in the ABCD data are stable at six across the range of sample sizes (see supplemental Section 2.3.2, Fig. S11).
5. Discussion
The theoretical dimensions of the approach and avoidance systems in the MID task (Knutson et al., 2001) have been extensively discussed (Knutson et al., 2014, Knutson and Greer, 2008, Knutson and Srirangarajan, 2019) and characterized in meta-analyses of group-level maps (Balodis and Potenza, 2015, Chen et al., 2022, Oldham et al., 2018). However, it is not yet understood whether the between-subject covariance structure supports this framework. The current empirical study uses a measurement approach to evaluate and understand the structure of the MID task across three independent adolescent samples. The [three] key findings are: (1) during the anticipation of the probe for gain and loss trials, the MID task consistently activates the bilateral AI and dorsal/ventral striatal regions in group-level maps; (2) across the a priori ROIs and contrasts the model fit statistics from CFA do not support the multidimensional structure for approach and avoidance system in the data; however, (3) there is a consistent structure in the data that is run and region specific. The results and interpretations are largely consistent in the three independent adolescent samples with comparable fMRI tasks across multi-band (ABCD & AHRB) and single-band (MLS) MRI acquisition protocols.
The MID task activates regions that are consistent with the prior literature (Chen et al., 2022, Oldham et al., 2018) but effects are attenuated in multi-band data. Specifically, across the ABCD, AHRB and MLS samples, activation is observed in the bilateral ventral striatum, AI, supplementary motor cortex and occipital regions during the Big Win versus Neutral contrast. However, Cohen's d estimates for neural activity are larger in the MLS (single-band) than the ABCD/ABCD samples (multi-band). While the observed effect size for neural activity for the Big Win > Neutral contrasts is largest in the MLS sample, the decrease in magnitude of the effect from Run 1 to Run 2 is also largest in the MLS data. The higher effect size in single- versus multi-band data is consistent with prior work reporting that increased multi-band factors may reduce the effect size in subcortical regions due to increased noise (see Fig. 2 in Risk et al., 2021, p. 7) and this may impact the neural activity during the MID task (Srirangarajan et al., 2021). To an extent, our results confirm that the estimated neural effects in subcortical regions may be lower in multi-band acquisitions. Nevertheless, the proportion of the difference attributable to the multi-band factor is unclear in these data. Specifically, the MLS sample (single-band) had larger inherent smoothness in the data, a different trial randomized scheme that holds constant across runs (as opposed to trials randomized for each run as in ABCD/AHRB) and higher probe accuracy/faster mean response times during the performance of the MID task. As a result, higher smoothness in the data may produce a larger effect as a result of the increased signal-to-noise ratio (Mikl et al., 2008) but come with lower spatial specificity (Blazejewska et al., 2019). It is unclear, however, how task randomization and performance may increase or decrease group-level activity across runs in the MID task design. This highlights the nuanced differences in group-level estimates of neural activity across samples that needs to be better understood as a function of MRI acquisition and task administration.
When evaluating whether the between-subject covariation in the neural data supports the approach and avoidance systems multidimensional structure as proposed in Knutson et al. (2014), the models demonstrate a poor fit to these data. The measurement model includes the a priori bilateral NAcc and AI brain regions from six contrasts across two separate MID task runs (as specified in the diagram in Fig. 1) from the ABCD, AHRB and MLS sampled BOLD data. Neither the strict model nor the configural model (only constraining the factor structure) fit the data well. Sensitivity analyses evaluating the model solely at the University of Michigan site (to keep facility/scanners constant) also demonstrate that neither the strict nor the configural model fit the data well. While the group-level neural activity is consistent with reported meta-analytic results for the MID task (Oldham et al., 2018), the covariance structure of the data combined from three samples is inconsistent with the theoretical framework (Knutson et al., 2014). This suggests that the pre-specified multidimensional structure of the approach and avoidance manifest variables is not an adequate explanation of the phenomenon evoked during this modified version of the MID task in these three adolescent samples.
The lack of evidence for the multidimensional structure of the approach and avoidance framework may be due to both methodological and conceptual challenges. Methodologically, the modified version of the MID task in the current analyses cannot delineate anticipatory and action preparation/probe preparation processes (Richards et al., 2013). This is a limitation observed in many MID tasks that use short (e.g., 250 ms; Cao et al., 2019; Knutson et al., 2003) or long cue designs (e.g., 2000 ms (Casey et al., 2018, Yau et al., 2012). In other words, after observing the trial type, participants either immediately or after a brief delay prepare to respond. While the current study models ‘anticipation’ as the entirety of the Cue and Fixation period, there may be modeling decisions that may improve the delineation of the incentive salience and action preparation. Conceptually, conjunction analyses of published BOLD data have reported that there is meaningful overlap between positive and negative rewards in the AI and striatal regions (Bartra et al., 2013). Overlap in the AI may be due to its role in executive control pertaining to attention and cognitive control (Molnar-Szakacs and Uddin, 2022) and overlap in the NAcc may reflect the fact that there are context specific responses in appetitive and avoidant processes (Berridge, 2019). These challenges may reduce the precision with which small effects of approach and avoidance processes may be disentangled in the data.
Despite the lack of support for theoretical framework in the data, the EFA demonstrates consistent structure that is modestly-to-extremely congruent across the samples. Across the three adolescent samples, the covariance structure of the twenty-eight variables of the mean signal intensity values is best represented by six (in ABCD & AHRB) or seven factors (in MLS). These data-driven factors delineate runs and brain regions. Specifically, the Left and Right NAcc or AI mean signal intensity values load onto independent factors for Run 1 and Run 2. This finding is consistent across the ABCD, AHRB and MLS samples. In fact, factor congruence demonstrates that the run and region-specific factors have a high degree of structural similarity between the ABCD and AHRB samples (r = .94 –.97) and moderate-to-high degree of structural similarity between the ABCD/AHRB and MLS samples (r = .51 –.98). However, across each of the three samples there is little structural similarity between manifest variables for Run 1 and Run 2. It is unsurprising that the within run data are highly correlated as there is a high degree of overlap in stimuli used in the a priori contrasts (i.e., Big Win) and evidence from the multivariate pattern analysis literature reports that there is a high degree of similarity between stimuli within runs (Mumford et al., 2014). The weak covariance between runs is surprising but consistent with the literature reporting low test-retest reliability in univariate multi-session and multi-run task fMRI data (Elliott et al., 2020). The preserved structure but lack of run congruence poses two conceptual questions: How does averaging weakly related run variables modify the covariance structure of the data; and are there analytic decisions that may increase reliability of between-person differences across runs while preserving the factor structure?
The factor structures identified in the current study are consistent across pubertal development, site effects and sampling variability. First, the loadings in the run and region-specific factors are largely consistent across the levels of pubertal development, for both child and parent reports, in the held-out ABCD data. Second, as briefly mentioned above, when the CFA/EFA and EFA analyses are constrained to only the University of Michigan site for the AHRB and MLS samples, the pattern of results are consistent. Finally, when randomly subsampling the ABCD data, the fit statistics for the CFA and the recommended number of factors for the EFA are largely consistent. This demonstrates that the CFA and EFA results for the current data are insensitive to these parameters. However, the analyses are based on data collected in adolescence. Complimentary work should also evaluate whether and how the factor structure manifests in adulthood and other phases across the lifespan.
The lack of between-subjects covariation between the bilateral NAcc and AI mean signal intensity values is surprising given the consistent activation in these brain regions at the group-level. One explanation may be that the increased noise in the multi-band data (Risk et al., 2021) may have modified the structure of covariances among cortical and subcortical regions. However, the covariance pattern in the a priori brain regions is consistent in the single-band (MLS) and multi-band (ABCD/AHRB) MID task fMRI data. A second explanation for the lack of support for theoretical framework may be that the covariation may be present at a more granular level, specifically, at the voxel or ROI time-series data. However, in an independent analysis of the AHRB MID data (Demidenko, Huntley et al., 2022), no evidence is reported for group or subgroup paths in combined and run specific time-series that demonstrate model specified (group iterative multiple model estimation) covariation between the bilateral insula and ventral striatum ROIs.
Evidence from CFA and EFA model results obtained here do not provide sufficient support to accept the approach and avoidance framework defined in Knutson et al. (2014) to explain the between-subject differences in univariate neural activity. While this does not undermine the experimental effect of the anticipatory phase in the MID task across the adolescent samples, the results do demonstrate that the measurements diverge from the theoretical framework. As discussed in the introduction, the scores obtained from a univariate analysis during the MID task are a manifestation of between-person differences in what is a presumed phenomenon of motivation to approach rewarding stimuli. While subjecting biological process measures to modern psychometric procedures may not always be well-reasoned, if brain-behavior researchers elect to use neural activity elicited during the MID task in correlational research it is prudent to provide a sufficient description of the measured mental process (Francken et al., 2022) and leverage tools from the psychological testing literature to determine how the proxy score for ‘motivational processing’ derived from neural activity during the MID task supports an interpretation for different populations and testing conditions (Association American Educational Research et al., 2014, Slaney, 2017). Without a well delineated framework, the approaches used to explain why activation in one group and a given contrast may differ from another group may lack theoretical grounding. Despite the lack of support for the theoretical framework, there is evidence of a relatively consistent and converging structure from the MID contrast data. Refining the interpretability of the underlying BOLD signal and the resulting theoretical framework for the MID task may be helpful as it continues to be used in large studies focused on individual differences research (e.g., IMAGEN [Schumann et al., 2010] and ABCD study [Casey et al., 2018]). Future research may explore what phenotypes these features are consistently related to and how they explain pre- and/or post-scan affective ratings across developmental samples. Further probing the intercorrelations among the data-driven factors, self-report and behavioral data would help elaborate how these factors contribute to between-subject differences.
5.1. Study limitations
The analyses here are based on pre-specified data preprocessing and analytic decisions. It is well documented that there are a wide-range of pipeline decisions which can impact the resulting estimates (Botvinik-Nezer et al., 2020, Bowring et al., 2022, Churchill et al., 2015, Li et al., 2021). As a result, decisions at the preprocessing and/or model fitting approach may have meaningfully impacted some of the results and interpretations. However, a strength of our study is that it leverages registered reports (Chambers et al., 2014), whereby the conceptual and analytic framework is reviewed prior to the access to the data and subsequent analyses. Nevertheless, future research that uses alternative preprocessing and modeling decisions would help confirm the reproducibility of these results and interpretations.
The modified version of the MID task used in the current study maintains limitations from earlier designs (Knutson et al., 2003) that do not allow for temporally separating the BOLD response between trials. Specifically, the MID task is a rapid event-related design whereby the interstimulus interval is shorter than the hemodynamic response function (Soares et al., 2016). This is confounded by the lack of an intertrial interval between the current trials (t) feedback cue (i.e., participants read the feedback of whether they are successful in responding to the probe and the amount they did/didn’t win/lose) and the onset of the subsequent trial (t + 1). Given the slow properties of the BOLD signal (Buxton, 2012), this poses challenges in deconvolving the signal in traditional GLM models. As a result, this may have contaminated the properties of the “anticipatory” signal and thus the validity of the interference.
Third, a subset of the MID contrasts and specific selection of ROIs from Harvard-Oxford subcortical brain atlas are used in the current analyses. The MID task includes a range of contrasts and modeling decisions. Likewise, the use of the Harvard-Oxford subcortical brain atlas to extract the bilateral NAcc and AI alternative to a different atlas may have a meaningful impact on the resulting estimates (Bryce et al., 2021). While the typical contrasts and a standard deterministic atlas are used in these analyses, perhaps the data may be represented differently in an EFA model if these decisions were varied and/or the variable expanded. However, expanding the number of manifest variables for EFA may increase the uncertainty around the estimates at small samples.
Finally, GE scanner data are not included for these analyses which may impact some of the estimates. While CFA fit statistics and EFA numbers of factors are stable across sample sizes, it is feasible that the structure of loadings may slightly differ in the GE data as these data may include different types of participants, sites and noise structure. Furthermore, the LSEM results may also be impacted because of the reduced sample size. While this may not impact the interpretations/conclusions in the current manuscript, the estimates should be considered in the context of this limitation. Nevertheless, this major deviation is fully documented, and sensitivity analyses were performed to evaluate the robustness of the conclusions (Chambers and Tzavella, 2022).
6. Conclusion
Modern psychometric tools have not been used to evaluate measurement properties of the fMRI signals obtained during participants’ performance of the MID task. The current study uses a comparable MID task design across three adolescent samples to evaluate multiple theoretical and data-driven measurement properties of fMRI for twenty-eight fMRI-based manifest variables. Across the three samples, group-level neural activity during the MID anticipatory phase activated expected regions: bilateral NAcc and AI. Evaluation of an a priori multidimensional approach and avoidance systems framework using CFA provided no support for the a priori framework based on twenty-eight manifest variables across three adolescent samples. However, data-driven EFA models identified a six/seven factor structure of between-person covariance that is congruent across samples, where the twenty-eight manifest variables are indicators of run and bilateral brain region specific factors. Future research should evaluate the phenotypes that best characterize the covariance structure in the MID task during fMRI to enhance the interpretability of brain-behavioral findings within and between adolescent samples.
Data & code availability statement
The ABCD-BIDS input data can be accessed through the ABCD-BIDS Community Collection with an established Data Use Agreement (see https://abcdstudy.org/). The BIDS version of the MLS and AHRB data can be obtained with the approval of Dr. Mary Heitzeg and/or Dr. Daniel Keating, respectively.
R code
The .html file containing the description of the function, analyses, and associated code is named ‘ABCD_Analysis_CFA-ESEM-EFA-LSEM.html’ located within the Stage 2 Github folder (associated .rmd is also provided). If using R version > 4.1 the script should run from start to finish.
Python code
The .py script to run the first level analysis with Nilearn’s implementation on the MID data is named ‘mid_firstlevel.py’ and is located in the Stage 2 Github folder. The .py script to (1) extract region of interest (ROI) mean signal intensity estimates from the specified contrast beta maps and (2) calculate the voxel overlap between the NAcc ROI and participants' brain mask is named ‘betamap_roi.py’ and is located in the Stage 2 Github folder. (Note: steps for acquiring ROI parcellations is not included in the script but a text file named ‘ROI_create.txt’ is provided in the Stage 1 Github folder). The extracted values for a given subject, run, contrast and ROI, as implemented using Nilearn’s functions, were compared for 3 random files using fslmeants -i {beta_map.nii.gz} -m {roi.nii.gz} to confirm values were identical. The Second Level fixed effect model (estimates across runs) and group level model (estimates across subjects) is implemented using Nilearn’s functions in the script 'mid_fixedeff.py' and ‘mid_sgroup.py’, respectively, and is located in the Stage 2 Github folder.
CRediT authorship contribution statement
Mumford Jeanette A.: Methodology, Writing – original draft, Writing – review & editing. Demidenko Michael: Conceptualization, Formal analysis, Methodology, Project administration, Visualization, Writing – original draft, Writing – review & editing. Ram Nilam: Methodology, Writing – original draft, Writing – review & editing. Poldrack Russell A.: Conceptualization, Methodology, Supervision, Writing – original draft, Writing – review & editing.
Declaration of Competing Interest
The authors declare that they have no conflicts of interest.
Acknowledgements
MID is funded by the Ruth L. Kirschstein Postdoctoral Individual National Research Service Award through the National Institute on Drug Abuse (F32DA055334-01A1) RAP is supported by the National Institute of Mental Health (R01MH117772 and R01MH130898). Thanks to Dr. Daniel Keating for agreeing to share the Adolescent Health Risk Behavior (AHRB; R01HD075806) study data and to Dr. Mary Heitzeg for agreeing to share the Michigan Longitudinal Study (MLS; R01AA012217) data for this project. The authors would also like to thank the research participants and staff involved in data collection of the Adolescent Brain Cognitive Development (ABCD) Study data. The ABCD Study is a multisite, longitudinal study designed to recruit more than 10,000 children ages 9 and 10 and follow them over 10 years into early adulthood. The ABCD Study is supported by the National Institutes of Health (NIH) and additional federal partners under award numbers U01DA041048, U01DA050989, U01DA051016, U01DA041022, U01DA051018, U01DA051037, U01DA050987, U01DA041174, U01DA041106, U01DA041117, U01DA041028, U01DA041134, U01DA050988, U01DA051039, U01DA041156, U01DA041025, U01DA041120, U01DA051038, U01DA041148, U01DA041093, U01DA041089, U24DA041123, and U24DA041147. The list of supporters is available at https://abcdstudy.org/federal-partners.html. The list of participating sites and study investigators is available at https://abcdstudy.org/study-sites/.
Thanks to members of the Developmental Cognition and Neuroimaging lab at University of Minnesota, specifically Eric Feczko, Audrey Houghton, Rae McCollum, and Anders Perrone, for assisting and providing access to the ABCD-BIDS Community Collection (ABCC) data. The ABCC data repository grows and changes over time (https://collection3165.readthedocs.io/). Thanks to Krisanne Litinas at the University of Michigan for providing expert advice and scripts to convert the AHRB data into BIDS format. Thanks to Mary Soules and Ryan Klaus (with assistance from Krisanne Litinas) at the University of Michigan for working to convert the MLS data to BIDS format. Thanks to Dr. Brian Knutson for reviewing and commenting on the a priori hypotheses proposed for the Monetary Incentive Delay task.
Supplementary data associated with this article can be found in the online version at doi:10.1016/j.dcn.2023.101337.
Here, intentionally refer to the fMRI task and not a specific contrast for a given task. A researcher should theoretically and empirically demonstrate that the task relates to the specified mental process in the context of contrasts based on traditional univariate, multivariate or connectivity analytic approaches.
If a specific effect size is hypothesized it is also related to magnitude. See ‘good enough’ belt as discussed in Serlin & Lapsley (1985)
Zenodo Demidenko et al. (2023)
Appendix A. Supplementary material
Supplementary material.
.
References
- Abraham A., Pedregosa F., Eickenberg M., Gervais P., Mueller A., Kossaifi J., Gramfort A., Thirion B., Varoquaux G. Machine learning for neuroimaging with scikit-learn. Front. Neuroinformatics. 2014;8 doi: 10.3389/fninf.2014.00014. 〈https://www.frontiersin.org/articles/10.3389/fninf.2014.00014〉 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Association American Educational Research, Association American Psychological, National Council on Measurement in Education, editors. Standards for Educational and Psychological Testing. American Educational Research Association; 2014. [Google Scholar]
- Avants B.B., Epstein C.L., Grossman M., Gee J.C. Symmetric diffeomorphic image registration with cross-correlation: Evaluating automated labeling of elderly and neurodegenerative brain. Med. Image Anal. 2008;12(1):26–41. doi: 10.1016/j.media.2007.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Balodis I.M., Potenza M.N. Anticipatory Reward Processing in Addicted Populations: A Focus on the Monetary Incentive Delay Task. Biol. Psychiatry. 2015;77(5):434–444. doi: 10.1016/j.biopsych.2014.08.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bartra O., McGuire J.T., Kable J.W. The valuation system: A coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. NeuroImage. 2013;76:412–427. doi: 10.1016/j.neuroimage.2013.02.063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beltz A.M., Gates K.M. Network Mapping with GIMME. Multivar. Behav. Res. 2017;52(6):789–804. doi: 10.1080/00273171.2017.1373014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bennett C.M., Miller M.B. How reliable are the results from functional magnetic resonance imaging? Ann. N. Y. Acad. Sci. 2010;1191(1):133–155. doi: 10.1111/j.1749-6632.2010.05446.x. [DOI] [PubMed] [Google Scholar]
- Berridge K.C. Affective valence in the brain: Modules or modes? Nat. Rev. Neurosci. 2019;20(4):225–234. doi: 10.1038/s41583-019-0122-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bethlehem R. a I., Seidlitz J., White S.R., Vogel J.W., Anderson K.M., Adamson C., Adler S., Alexopoulos G.S., Anagnostou E., Areces-Gonzalez A., Astle D.E., Auyeung B., Ayub M., Bae J., Ball G., Baron-Cohen S., Beare R., Bedford S.A., Benegal V.…Alexander-Bloch A.F. Brain charts for the human lifespan. Nature. 2022:1–11. doi: 10.1038/s41586-022-04554-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bissonette G., Gentry R., Padmala S., Pessoa L., Roesch M. Impact of appetitive and aversive outcomes on brain responses: Linking the animal and human literatures. Front. Syst. Neurosci. 2014;8 doi: 10.3389/fnsys.2014.00024. 〈https://www.frontiersin.org/articles/10.3389/fnsys.2014.00024〉 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bjork J.M. The Ups and Downs of Relating Nondrug Reward Activation to Substance Use Risk in Adolescents. Curr. Addict. Rep. 2020 doi: 10.1007/s40429-020-00327-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bjork J.M., Knutson B., Fong G.W., Caggiano D.M., Bennett S.M., Hommer D.W. Incentive-elicited brain activation in adolescents: Similarities and differences from young adults. J. Neurosci.: Off. J. Soc. Neurosci. 2004;24(8):1793–1802. doi: 10.1523/JNEUROSCI.4862-03.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bjork J.M., Smith A.R., Chen G., Hommer D.W. Adolescents, Adults and Rewards: Comparing Motivational Neurocircuitry Recruitment Using fMRI. PLOS ONE. 2010;5(7) doi: 10.1371/journal.pone.0011440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blazejewska A.I., Fischl B., Wald L.L., Polimeni J.R. Intracortical smoothing of small-voxel fMRI data can provide increased detection power without spatial resolution losses compared to conventional large-voxel fMRI data. NeuroImage. 2019;189:601–614. doi: 10.1016/j.neuroimage.2019.01.054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Borsboom D. The attack of the psychometricians. Psychometrika. 2006;71(3):425–440. doi: 10.1007/s11336-006-1447-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Botvinik-Nezer R., Holzmeister F., Camerer C.F., Dreber A., Huber J., Johannesson M., Kirchler M., Iwanir R., Mumford J.A., Adcock R.A., Avesani P., Baczkowski B.M., Bajracharya A., Bakst L., Ball S., Barilari M., Bault N., Beaton D., Beitner J.…Schonberg T. Variability in the analysis of a single neuroimaging dataset by many teams. Nature. 2020;582(7810):84–88. doi: 10.1038/s41586-020-2314-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bowring A., Nichols T.E., Maumet C. Isolating the sources of pipeline-variability in group-level task-fMRI results. Hum. Brain Mapp. 2022;43(3):1112–1128. doi: 10.1002/hbm.25713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown T.A. Confirmatory Factor Analysis for Applied Research. First edition., The Guilford Press,; 2006. (First edition) [Google Scholar]
- Bryce N.V., Flournoy J.C., Guassi Moreira J.F., Rosen M.L., Sambook K.A., Mair P., McLaughlin K.A. Brain parcellation selection: An overlooked decision point with meaningful effects on individual differences in resting-state functional connectivity. NeuroImage. 2021;243 doi: 10.1016/j.neuroimage.2021.118487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Büchel C., Peters J., Banaschewski T., Bokde A.L.W., Bromberg U., Conrod P.J., Flor H., Papadopoulos D., Garavan H., Gowland P., Heinz A., Walter H., Ittermann B., Mann K., Martinot J.-L., Paillère-Martinot M.-L., Nees F., Paus T., Pausova Z.…Knutson B. Blunted ventral striatal responses to anticipated rewards foreshadow problematic drug use in novelty-seeking adolescents. Nat. Commun. 2017;8(1):1–11. doi: 10.1038/ncomms14140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buxton R.B. Dynamic Models of BOLD Contrast. NeuroImage. 2012;62(2):953–961. doi: 10.1016/j.neuroimage.2012.01.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cao Z., Bennett M., Orr C., Icke I., Banaschewski T., Barker G.J., Bokde A.L.W., Bromberg U., Büchel C., Quinlan E.B., Desrivières S., Flor H., Frouin V., Garavan H., Gowland P., Heinz A., Ittermann B., Martinot J.-L., Nees F., Whelan R. Mapping adolescent reward anticipation, receipt, and prediction error during the monetary incentive delay task. Hum. Brain Mapp. 2019;40(1):262–283. doi: 10.1002/hbm.24370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caplan D. Experimental design and interpretation of functional neuroimaging studies of cognitive processes. Hum. Brain Mapp. 2007;30(1):59–77. doi: 10.1002/hbm.20489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carp J. On the Plurality of (Methodological) Worlds: Estimating the Analytic Flexibility of fMRI Experiments. Front. Neurosci. 2012;6 doi: 10.3389/fnins.2012.00149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Casey B.J., Cannonier T., Conley M.I., Cohen A.O., Barch D.M., Heitzeg M.M., Soules M.E., Teslovich T., Dellarco D.V., Garavan H., Orr C.A., Wager T.D., Banich M.T., Speer N.K., Sutherland M.T., Riedel M.C., Dick A.S., Bjork J.M., Thomas K.M., ABCD Imaging Acquisition Workgroup The Adolescent Brain Cognitive Development (ABCD) study: Imaging acquisition across 21 sites. Dev. Cogn. Neurosci. 2018;32:43–54. doi: 10.1016/j.dcn.2018.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Casey B.J., Heller A.S., Gee D.G., Cohen A.O. Development of the emotional brain. Neurosci. Lett. 2019;693:29–34. doi: 10.1016/j.neulet.2017.11.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chambers C.D., Tzavella L. The past, present and future of registered reports. Nat. Hum. Behav. 2022;6(1) doi: 10.1038/s41562-021-01193-7. Article 1. [DOI] [PubMed] [Google Scholar]
- Chambers C.D., Feredoes E., Muthukumaraswamy S.D., Etchells P. Instead of “playing the game” it is time to change the rules: registered reports at AIMS neuroscience and beyond. AIMS Neurosci. 2014;1(1) doi: 10.3934/Neuroscience.2014.1.4. Article 1. [DOI] [Google Scholar]
- Chen F.F. Sensitivity of goodness of fit indexes to lack of measurement invariance. Struct. Equ. Model. 2007;14(3):464–504. doi: 10.1080/10705510701301834. [DOI] [Google Scholar]
- Chen Y., Chaudhary S., Li C.-S.R. Shared and distinct neural activity during anticipation and outcome of win and loss: A meta-analysis of the monetary incentive delay task. NeuroImage. 2022;264 doi: 10.1016/j.neuroimage.2022.119764. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Churchill N.W., Spring R., Afshin-Pour B., Dong F., Strother S.C. An automated, adaptive framework for optimizing preprocessing pipelines in task-based functional MRI. PLOS ONE. 2015;10(7) doi: 10.1371/journal.pone.0131520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Clithero J.A., Rangel A. Informatic parcellation of the network involved in the computation of subjective value. Soc. Cogn. Affect. Neurosci. 2014;9(9):1289–1302. doi: 10.1093/scan/nst106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cox R.W., Hyde J.S. Software tools for analysis and visualization of fMRI data. NMR Biomed. 1997;10(4–5):171–178. doi: 10.1002/(SICI)1099-1492(199706/08)10:4/5<171::AID-NBM453>3.0.CO;2-L. [DOI] [PubMed] [Google Scholar]
- Cronbach L.J. The two disciplines of scientific psychology. Am. Psychol. 1957;12(11):671–684. doi: 10.1037/h0043943. [DOI] [Google Scholar]
- Cronbach L.J., Meehl P.E. Construct validity in psychological tests. Psychol. Bull. 1955;52(4):281–302. doi: 10.1037/h0040957. [DOI] [PubMed] [Google Scholar]
- D’Urso, E.D., Maassen, E., Assen, M.A.L.M. van, Nuijten, M.B., Roover, K.D., & Wicherts, J. (2022). The Dire Disregard of Measurement Invariance Testing in Psychological Science. PsyArXiv. 10.31234/osf.io/n3f5u. [DOI] [PubMed]
- Dahl R.E., Allen N.B., Wilbrecht L., Suleiman A.B. Importance of investing in adolescence from a developmental science perspective. Nature. 2018;554(7693):441–450. doi: 10.1038/nature25770. [DOI] [PubMed] [Google Scholar]
- Dale A.M., Fischl B., Sereno M.I. Cortical surface-based analysis: I. Segmentation and surface reconstruction. NeuroImage. 1999;9(2):179–194. doi: 10.1006/nimg.1998.0395. [DOI] [PubMed] [Google Scholar]
- Demidenko, M.I., & Poldrack, R.A. (2023). PyReliMRI: An Open-source Python tool for Estimates of Reliability in MRI Data [Computer software]. 〈https://zenodo.org/record/8387971〉.
- Demidenko M.I., Huntley E.D., Jahn A., Thomason M.E., Monk C.S., Keating D.P. Cortical and subcortical response to the anticipation of reward in high and average/low risk-taking adolescents. Dev. Cogn. Neurosci. 2020;44 doi: 10.1016/j.dcn.2020.100798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Demidenko M.I., Kelly D.P., Hardi F.A., Ip K.I., Lee S., Becker H., Hong S., Thijssen S., Luciana M., Keating D.P. Mediating effect of pubertal stages on the family environment and neurodevelopment: An open-data replication and multiverse analysis of an ABCD Study®. Neuroimage: Rep. 2022;2(4) doi: 10.1016/j.ynirp.2022.100133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Demidenko M.I., Huntley E.D., Weigard A.S., Keating D.P., Beltz A.M. Neural heterogeneity underlying late adolescent motivational processing is linked to individual differences in behavioral sensation seeking. J. Neurosci. Res. 2022;100(3):762–779. doi: 10.1002/jnr.25005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Demidenko, M.I., Mumford, J.A., Ram, N., & Poldrack, R.A. (2023). Code for a Multi-Sample Evaluation of the Measurement Structure and Function of the Modified Monetary Incentive Delay Task in Adolescents [Computer software]. 〈https://zenodo.org/records/10211538〉. [DOI] [PMC free article] [PubMed]
- Diekhof E.K., Kaps L., Falkai P., Gruber O. The role of the human ventral striatum and the medial orbitofrontal cortex in the representation of reward magnitude—an activation likelihood estimation meta-analysis of neuroimaging studies of passive reward expectancy and outcome processing. Neuropsychologia. 2012;50(7):1252–1266. doi: 10.1016/j.neuropsychologia.2012.02.007. [DOI] [PubMed] [Google Scholar]
- Dinno A. Exploring the sensitivity of Horn’s parallel analysis to the distributional form of random data. Multivar. Behav. Res. 2009;44(3):362–388. doi: 10.1080/00273170902938969. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dinno, A. (2018). paran: Horn’s Test of Principal Components/Factors (1.5.2) [Computer software]. 〈https://CRAN.R-project.org/package=paran〉.
- Dockès J., Varoquaux G., Poline J.-B. Preventing dataset shift from breaking machine-learning biomarkers. GigaScience. 2021 doi: 10.1093/gigascience/giab055. 〈https://hal.archives-ouvertes.fr/hal-03293375〉 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doremus-Fitzwater T.L., Varlinskaya E.I., Spear L.P. Motivational systems in adolescence: Possible implications for age differences in substance abuse and other risk-taking behaviors. Brain Cogn. 2010;72(1):114–123. doi: 10.1016/j.bandc.2009.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dugré J.R., Dumais A., Bitar N., Potvin S. Loss anticipation and outcome during the monetary incentive delay task: a neuroimaging systematic review and meta-analysis. PeerJ. 2018;6 doi: 10.7717/peerj.4749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Elliott M.L., Knodt A.R., Ireland D., Morris M.L., Poulton R., Ramrakha S., Sison M.L., Moffitt T.E., Caspi A., Hariri A.R. What is the test-retest reliability of common task-functional MRI measures? New empirical evidence and a meta-analysis. Psychol. Sci. 2020;31(7):792–806. doi: 10.1177/0956797620916786. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ernst M. The triadic model perspective for the study of adolescent motivated behavior. Brain Cogn. 2014;89:104–111. doi: 10.1016/j.bandc.2014.01.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ernst M., Luciana M. Neuroimaging of the dopamine/reward system in adolescent drug use. CNS Spectr. 2015;20(4):427–441. doi: 10.1017/S1092852915000395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ernst M., Gowin J., Gaillard C. In: The Cambridge Handbook of Cognitive Development. Borst G., Houdé O., editors. Cambridge University Press; 2022. The triadic neural systems model through a machine-learning mill; pp. 516–534. [DOI] [Google Scholar]
- Esteban O., Markiewicz C.J., Blair R.W., Moodie C.A., Isik A.I., Erramuzpe A., Kent J.D., Goncalves M., DuPre E., Snyder M., Oya H., Ghosh S.S., Wright J., Durnez J., Poldrack R.A., Gorgolewski K.J. fMRIPrep: a robust preprocessing pipeline for functional MRI. Nat. Methods. 2019;16(1):111–116. doi: 10.1038/s41592-018-0235-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Esteban O., Markiewicz C.J., Goncalves M., Provins C., Kent J.D., DuPre E., Salo T., Ciric R., Pinsard B., Blair R.W., Poldrack R.A., Gorgolewski K.J. fMRIPrep: a robust preprocessing pipeline for functional MRI [Computer software] Zenodo. 2022 doi: 10.5281/zenodo.7117719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Feczko, E., Conan, G., Marek, S., Tervo-Clemmens, B., Cordova, M., Doyle, O., Earl, E., Perrone, A., Sturgeon, D., Klein, R., Harman, G., Kilamovich, D., Hermosillo, R., Miranda-Dominguez, O., Adebimpe, A., Bertolero, M., Cieslak, M., Covitz, S., Hendrickson, T., … Fair, D.A. (2021). Adolescent Brain Cognitive Development (ABCD) Community MRI Collection and Utilities (p. 2021.07.09.451638). bioRxiv. 10.1101/2021.07.09.451638. [DOI]
- Flake J.K., Fried E.I. Measurement schmeasurement: questionable measurement practices and how to avoid them. Adv. Methods Pract. Psychol. Sci. 2020;3(4):456–465. doi: 10.1177/2515245920952393. [DOI] [Google Scholar]
- Flake J.K., Luong R., Shaw M. Addressing a crisis of generalizability with large-scale construct validation. Behav. Brain Sci. 2022;45 doi: 10.1017/S0140525X21000376. [DOI] [PubMed] [Google Scholar]
- Flannery J.S., Riedel M.C., Bottenhorn K.L., Poudel R., Salo T., Hill-Bowen L.D., Laird A.R., Sutherland M.T. Meta-analytic clustering dissociates brain activity and behavior profiles across reward processing paradigms. Cogn., Affect., Behav. Neurosci. 2020;20(2):215–235. doi: 10.3758/s13415-019-00763-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fonov V., Evans A., McKinstry R., Almli C., Collins D. Unbiased nonlinear average age-appropriate brain templates from birth to adulthood. NeuroImage. 2009;47:S102. doi: 10.1016/S1053-8119(09)70884-5. [DOI] [Google Scholar]
- Forbes E.E., Dahl R.E. Pubertal development and behavior: hormonal activation of social and motivational tendencies. Brain Cogn. 2010;72(1):66–72. doi: 10.1016/j.bandc.2009.10.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Francken J.C., Slors M., Craver C.F. Cognitive ontology and the search for neural mechanisms: Three foundational problems. Synthese. 2022;200(5) doi: 10.1007/s11229-022-03701-2. [DOI] [Google Scholar]
- Gentry R.N., Schuweiler D.R., Roesch M.R. Dopamine signals related to appetitive and aversive events in paradigms that manipulate reward and avoidability. Brain Res. 2019;1713:80–90. doi: 10.1016/j.brainres.2018.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gorgolewski K.J., Burns C., Madison C., Clark D., Halchenko Y., Waskom M., Ghosh S. Nipype: a flexible, lightweight and extensible neuroimaging data processing framework in python. Front. Neuroinformatics. 2011;5 doi: 10.3389/fninf.2011.00013. 〈https://www.frontiersin.org/articles/10.3389/fninf.2011.00013〉 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gorgolewski K.J., Esteban O., Markiewicz C.J., Ziegler E., Ellis D.G., Jarecka D., Notter M.P., Johnson H., Burns C., Manhães-Savio A., Hamalainen C., Yvernault B., Salo T., Goncalves M., Jordan K., Waskom M., Wong J., Modat M., Loney F., Ghosh S. nipy/nipype: 1. 1. 5 [Comput. Softw. ]. Zenodo. 2018 doi: 10.5281/zenodo.1480713. [DOI] [Google Scholar]
- Greve D.N., Fischl B. Accurate and robust brain image alignment using boundary-based registration. NeuroImage. 2009;48(1):63–72. doi: 10.1016/j.neuroimage.2009.06.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haber S.N. In: Decision Neuroscience. Dreher J.-C., Tremblay L., editors. Academic Press; 2017. Chapter 1—anatomy and connectivity of the reward circuit; pp. 3–19. [DOI] [Google Scholar]
- Haeffel G.J. Psychology needs to get tired of winning. R. Soc. Open Sci. 2022;9(6) doi: 10.1098/rsos.220099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hanson S.J. The failure of blobology: fMRI misinterpretation, maleficience and muddle. Front. Hum. Neurosci. 2022;16 doi: 10.3389/fnhum.2022.870091. 〈https://www.frontiersin.org/article/10.3389/fnhum.2022.870091〉 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henson R. What can functional neuroimaging tell the experimental psychologist? Q. J. Exp. Psychol. Sect. A. 2005;58(2):193–233. doi: 10.1080/02724980443000502. [DOI] [PubMed] [Google Scholar]
- Hirschfeld G., Brachel R. von. Improving multiple-group confirmatory factor analysis in R – a tutorial in measurement invariance with continuous and ordinal indicators. Pract. Assess., Res., Eval. 2014;19(1) doi: 10.7275/qazy-2946. [DOI] [Google Scholar]
- Hull C.L. Yale University Press,; 1952. A behavior system; an introduction to behavior theory concerning the individual organism. [Google Scholar]
- Jenkinson M., Bannister P., Brady M., Smith S. Improved optimization for the robust and accurate linear registration and motion correction of brain images. NeuroImage. 2002;17(2):825–841. doi: 10.1016/s1053-8119(02)91132-8. [DOI] [PubMed] [Google Scholar]
- Karoly H.C., Bryan A.D., Weiland B.J., Mayer A., Dodd A., Feldstein Ewing S.W. Does incentive-elicited nucleus accumbens activation differ by substance of abuse? An examination with adolescents. Dev. Cogn. Neurosci. 2015;16:5–15. doi: 10.1016/j.dcn.2015.05.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kennedy J.T., Harms M.P., Korucuoglu O., Astafiev S.V., Barch D.M., Thompson W.K., Bjork J.M., Anokhin A.P. Reliability and stability challenges in ABCD task fMRI data. NeuroImage. 2022;252 doi: 10.1016/j.neuroimage.2022.119046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klein A., Ghosh S.S., Bao F.S., Giard J., Häme Y., Stavsky E., Lee N., Rossa B., Reuter M., Neto E.C., Keshavan A. Mindboggling morphometry of human brains. PLOS Comput. Biol. 2017;13(2) doi: 10.1371/journal.pcbi.1005350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Knutson B., Greer S. Anticipatory affect: neural correlates and consequences for choice. Philos. Trans. R. Soc. B: Biol. Sci. 2008;363(1511):3771–3786. doi: 10.1098/rstb.2008.0155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Knutson B., Heinz A. Probing psychiatric symptoms with the monetary incentive delay task. Biol. Psychiatry. 2015;77(5):418–420. doi: 10.1016/j.biopsych.2014.12.022. [DOI] [PubMed] [Google Scholar]
- Knutson B., Srirangarajan T. In: Emotion in the Mind and Body. Neta M., Haas I.J., editors. Springer International Publishing; 2019. Toward a deep science of affect and motivation; pp. 193–220. [DOI] [Google Scholar]
- Knutson B., Westdorp A., Kaiser E., Hommer D. FMRI visualization of brain activity during a monetary incentive delay task. NeuroImage. 2000;12(1):20–27. doi: 10.1006/nimg.2000.0593. [DOI] [PubMed] [Google Scholar]
- Knutson B., Adams C.M., Fong G.W., Hommer D. Anticipation of increasing monetary reward selectively recruits nucleus accumbens. J. Neurosci. 2001;21(16) doi: 10.1523/JNEUROSCI.21-16-j0002.2001. RC159–RC159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Knutson B., Fong G.W., Bennett S.M., Adams C.M., Hommer D. A region of mesial prefrontal cortex tracks monetarily rewarding outcomes: characterization with rapid event-related fMRI. NeuroImage. 2003;18(2):263–272. doi: 10.1016/S1053-8119(02)00057-5. [DOI] [PubMed] [Google Scholar]
- Knutson B., Katovich K., Suri G. Inferring affect from fMRI data. Trends Cogn. Sci. 2014;18(8):422–428. doi: 10.1016/j.tics.2014.04.006. [DOI] [PubMed] [Google Scholar]
- Kwong K.K., Belliveau J.W., Chesler D.A., Goldberg I.E., Weisskoff R.M., Poncelet B.P., Kennedy D.N., Hoppel B.E., Cohen M.S., Turner R. Dynamic magnetic resonance imaging of human brain activity during primary sensory stimulation. Proc. Natl. Acad. Sci. 1992;89(12):5675–5679. doi: 10.1073/pnas.89.12.5675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li, X., Ai, L., Giavasis, S., Jin, H., Feczko, E., Xu, T., Clucas, J., Franco, A., Heinsfeld, A.S., Adebimpe, A., Vogelstein, J.T., Yan, C.-G., Esteban, O., Poldrack, R.A., Craddock, C., Fair, D., Satterthwaite, T., Kiar, G., & Milham, M.P. (2021). Moving Beyond Processing and Analysis-Related Variation in Neuroscience (p. 2021.12.01.470790). https://doi.org/10.1101/2021.12.01.470790.
- Lilienfeld S.O., Strother A.N. Psychological measurement and the replication crisis: four sacred cows. Can. Psychol. 2020;61(4):281–288. doi: 10.1037/cap0000236. [DOI] [Google Scholar]
- Lorenzo-Seva U., ten Berge J.M.F. Tucker’s congruence coefficient as a meaningful index of factor similarity. Methodol.: Eur. J. Res. Methods Behav. Soc. Sci. 2006;2:57–64. doi: 10.1027/1614-2241.2.2.57. [DOI] [Google Scholar]
- Luciana M., Wahlstrom D., Porter J.N., Collins P.F. Dopaminergic modulation of incentive motivation in adolescence: age-related changes in signaling, individual differences, and implications for the development of self-regulation. Dev. Psychol. 2012;48(3):844–861. doi: 10.1037/a0027432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luna B. Developmental changes in cognitive control through adolescence. Adv. Child Dev. Behav. 2009;37:233–278. doi: 10.1016/s0065-2407(09)03706-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luna B., Marek S., Larsen B., Tervo-Clemmens B., Chahal R. An integrative model of the maturation of cognitive control. Annu. Rev. Neurosci. 2015;38:151–170. doi: 10.1146/annurev-neuro-071714-034054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lutz K., Widmer M. What can the monetary incentive delay task tell us about the neural processing of reward and punishment? Neurosci. Neuroeconomics. 2014;3:33–45. doi: 10.2147/NAN.S38864. [DOI] [Google Scholar]
- Marsh H.W., Morin A.J.S., Parker P.D., Kaur G. Exploratory structural equation modeling: an integration of the best features of exploratory and confirmatory factor analysis. Annu. Rev. Clin. Psychol. 2014;10:85–110. doi: 10.1146/annurev-clinpsy-032813-153700. [DOI] [PubMed] [Google Scholar]
- Martz M.E., Trucco E.M., Cope L.M., Hardee J.E., Jester J.M., Zucker R.A., Heitzeg M.M. Association of marijuana use with blunted nucleus accumbens response to reward anticipation. JAMA Psychiatry. 2016;73(8):838–844. doi: 10.1001/jamapsychiatry.2016.1161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McNeish D., Wolf M.G. Dynamic fit index cutoffs for confirmatory factor analysis models. Psychol. Methods. 2021 doi: 10.1037/met0000425. (No Pagination Specified-No Pagination Specified) [DOI] [PubMed] [Google Scholar]
- Meade A.W., Johnson E.C., Braddy P.W. Power and sensitivity of alternative fit indices in tests of measurement invariance. J. Appl. Psychol. 2008;93(3):568–592. doi: 10.1037/0021-9010.93.3.568. [DOI] [PubMed] [Google Scholar]
- Mikl M., Mareček R., Hluštík P., Pavlicová M., Drastich A., Chlebus P., Brázdil M., Krupa P. Effects of spatial smoothing on fMRI group inferences. Magn. Reson. Imaging. 2008;26(4):490–503. doi: 10.1016/j.mri.2007.08.006. [DOI] [PubMed] [Google Scholar]
- Mills K.L., Goddings A.L., Herting M.M., Meuwese R., Blakemore S.J., Crone E.A., Dahl R.E., Güroğlu B., Raznahan A., Sowell E.R., Tamnes C.K. Structural brain development between childhood and adulthood: convergence across four longitudinal samples. NeuroImage. 2016;141:273–281. doi: 10.1016/j.neuroimage.2016.07.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Molnar-Szakacs I., Uddin L.Q. Anterior insula as a gatekeeper of executive control. Neurosci. Biobehav. Rev. 2022 doi: 10.1016/j.neubiorev.2022.104736. [DOI] [PubMed] [Google Scholar]
- Mumford J.A., Davis T., Poldrack R.A. The impact of study design on pattern estimation for single-trial multivariate pattern analysis. NeuroImage. 2014;103:130–138. doi: 10.1016/j.neuroimage.2014.09.026. [DOI] [PubMed] [Google Scholar]
- Murray L., Lopez-Duran N.L., Mitchell C., Monk C.S., Hyde L.W. Neural mechanisms of reward and loss processing in a low-income sample of at-risk adolescents. Soc. Cogn. Affect. Neurosci. 2020 doi: 10.1093/scan/nsaa157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nikolaidis, A., Chen, A.A., He, X., Shinohara, R., Vogelstein, J., Milham, M., & Shou, H. (2022). Suboptimal phenotypic reliability impedes reproducible human neuroscience (p. 2022.07.22.501193). bioRxiv. 10.1101/2022.07.22.501193. [DOI]
- Noble S., Scheinost D., Constable R.T. A decade of test-retest reliability of functional connectivity: a systematic review and meta-analysis. NeuroImage. 2019;203 doi: 10.1016/j.neuroimage.2019.116157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ogawa S., Lee T.M., Kay A.R., Tank D.W. Brain magnetic resonance imaging with contrast dependent on blood oxygenation. Proc. Natl. Acad. Sci. U. S. A. 1990;87(24):9868–9872. doi: 10.1073/pnas.87.24.9868. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Olaru, G., Robitzsch, A., Hildebrandt, A., & Schroeders, U. (2020). Local Structural Equation Modeling for Longitudinal Data. PsyArXiv. 10.31234/osf.io/q79c5. [DOI]
- Oldham S., Murawski C., Fornito A., Youssef G., Yücel M., Lorenzetti V. The anticipation and outcome phases of reward and loss processing: a neuroimaging meta-analysis of the monetary incentive delay task. Hum. Brain Mapp. 2018;39(8):3398–3418. doi: 10.1002/hbm.24184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pfeifer J.H., Berkman E.T. The development of self and identity in adolescence: neural evidence and implications for a value-based choice perspective on motivated behavior. Child Dev. Perspect. 2018;12(3):158–164. doi: 10.1111/cdep.12279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poldrack R.A. Mapping mental function to brain structure: How can cognitive neuroimaging succeed? Perspect. Psychol. Sci.: A J. Assoc. Psychol. Sci. 2010;5(5):753–761. doi: 10.1177/1745691610388777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poldrack R.A., Yarkoni T. From brain maps to cognitive ontologies: informatics and the search for mental structure. Annu. Rev. Psychol. 2016;67(1):587–612. doi: 10.1146/annurev-psych-122414-033729. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Power J.D., Mitra A., Laumann T.O., Snyder A.Z., Schlaggar B.L., Petersen S.E. Methods to detect, characterize, and remove motion artifact in resting state fMRI. NeuroImage. 2014;84:320–341. doi: 10.1016/j.neuroimage.2013.08.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Putnick D.L., Bornstein M.H. Measurement invariance conventions and reporting: the state of the art and future directions for psychological research. Dev. Rev.: DR. 2016;41:71–90. doi: 10.1016/j.dr.2016.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- R Core Team . R Foundation for Statistical Computing,; 2022. R: A language and environment for statistical computing. (4.2.1) [Computer software]〈https://www.R-project.org/〉 [Google Scholar]
- Richards J.M., Plate R.C., Ernst M. A systematic review of fMRI reward paradigms used in studies of adolescents vs. adults: the impact of task design and implications for understanding neurodevelopment. Neurosci. Biobehav. Rev. 2013;37(5):976–991. doi: 10.1016/j.neubiorev.2013.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Risk B.B., Murden R.J., Wu J., Nebel M.B., Venkataraman A., Zhang Z., Qiu D. Which multiband factor should you choose for your resting-state fMRI study? NeuroImage. 2021;234 doi: 10.1016/j.neuroimage.2021.117965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robitzsch, A. (2022). sirt: Supplementary Item Response Theory Models (3.12–66) [Computer software]. 〈https://CRAN.R-project.org/package=sirt〉.
- Rosenberg M.D., Casey B.J., Holmes A.J. Prediction complements explanation in understanding the developing brain. Nat. Commun. 2018;9(1) doi: 10.1038/s41467-018-02887-9. Article 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rosseel Y. lavaan: an R package for structural equation modeling. J. Stat. Softw. 2012;48(1) doi: 10.18637/jss.v048.i02. Article 1. [DOI] [Google Scholar]
- Rosseel, Y., Jorgensen, T.D., Rockwood, N., Oberski, D., Byrnes, J., Vanbrabant, L., Savalei, V., Merkle, E., Hallquist, M., Rhemtulla, M., Katsikatsou, M., Barendse, M., Scharf, F., & Du, H. (2021). lavaan: Latent Variable Analysis (0.6–9) [Computer software]. 〈https://CRAN.R-project.org/package=lavaan〉.
- Satterthwaite T.D., Elliott M.A., Gerraty R.T., Ruparel K., Loughead J., Calkins M.E., Eickhoff S.B., Hakonarson H., Gur R.C., Gur R.E., Wolf D.H. An improved framework for confound regression and filtering for control of motion artifact in the preprocessing of resting-state functional connectivity data. NeuroImage. 2013;64:240–256. doi: 10.1016/j.neuroimage.2012.08.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sawyer S.M., Azzopardi P.S., Wickremarathne D., Patton G.C. The age of adolescence. Lancet Child Adolesc. Health. 2018;2(3):223–228. doi: 10.1016/S2352-4642(18)30022-1. [DOI] [PubMed] [Google Scholar]
- Schultz W. Neuronal reward and decision signals: from theories to data. Physiol. Rev. 2015;95(3):853–951. doi: 10.1152/physrev.00023.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schumann G., Loth E., Banaschewski T., Barbot A., Barker G., Büchel C., Conrod P.J., Dalley J.W., Flor H., Gallinat J., Garavan H., Heinz A., Itterman B., Lathrop M., Mallik C., Mann K., Martinot J.-L., Paus T., Poline J.-B., IMAGEN consortium The IMAGEN study: reinforcement-related behaviour in normal brain function and psychopathology. Mol. Psychiatry. 2010;15(12):1128–1139. doi: 10.1038/mp.2010.4. [DOI] [PubMed] [Google Scholar]
- Sherman L., Steinberg L., Chein J. Connecting brain responsivity and real-world risk taking: strengths and limitations of current methodological approaches. Dev. Cogn. Neurosci. 2018;33:27–41. doi: 10.1016/j.dcn.2017.05.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Silverman M.H., Jedd K., Luciana M. Neural networks involved in adolescent reward processing: an activation likelihood estimation meta-analysis of functional neuroimaging studies. NeuroImage. 2015;122:427–439. doi: 10.1016/j.neuroimage.2015.07.083. (https://doi.org/v) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Silvestrin, M., & de Beer, L.T. (2022). esemComp: ESEM-within-CFA syntax composer (R package version 0.2) [Computer software]. 〈https://mateuspsi.github.io/esemComp〉.
- Skinner B.F. A case history in scientific method. Am. Psychol. 1956;11(5):221–233. doi: 10.1037/h0047662. [DOI] [Google Scholar]
- Slaney K. Validating Psychological Constructs: Historical, Philosophical, and Practical Dimensions. 1st ed. 2017 edition., Palgrave Macmillan,; 2017. [Google Scholar]
- Soares J.M., Magalhães R., Moreira P.S., Sousa A., Ganz E., Sampaio A., Alves V., Marques P., Sousa N. A Hitchhiker’s guide to functional magnetic resonance imaging. Front. Neurosci. 2016;10:515. doi: 10.3389/fnins.2016.00515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spear L.P. The adolescent brain and age-related behavioral manifestations. Neurosci. Biobehav. Rev. 2000;24(4):417–463. doi: 10.1016/s0149-7634(00)00014-2. [DOI] [PubMed] [Google Scholar]
- Spear L.P. Adolescent brain development and animal models. Ann. N. Y. Acad. Sci. 2004;1021(1):23–26. doi: 10.1196/annals.1308.002. [DOI] [PubMed] [Google Scholar]
- Srirangarajan T., Mortazavi L., Bortolini T., Moll J., Knutson B. Multi-band FMRI compromises detection of mesolimbic reward responses. NeuroImage. 2021;244 doi: 10.1016/j.neuroimage.2021.118617. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steinberg L. A dual systems model of adolescent risk-taking. Dev. Psychobiol. 2010;52(3):216–224. doi: 10.1002/dev.20445. [DOI] [PubMed] [Google Scholar]
- Tervo-Clemmens B., Quach A., Calabro F.J., Foran W., Luna B. Meta-analysis and review of functional neuroimaging differences underlying adolescent vulnerability to substance use. NeuroImage. 2020;209 doi: 10.1016/j.neuroimage.2019.116476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tustison N.J., Avants B.B., Cook P.A., Zheng Y., Egan A., Yushkevich P.A., Gee J.C. N4ITK: improved N3 bias correction. IEEE Trans. Med. Imaging. 2010;29(6):1310–1320. doi: 10.1109/TMI.2010.2046908. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Uttal W.R. Bradford Books,; 2003. The New Phrenology: The Limits of Localizing Cognitive Processes in the Brain. [Google Scholar]
- Volkow N.D., Koob G.F., Croyle R.T., Bianchi D.W., Gordon J.A., Koroshetz W.J., Pérez-Stable E.J., Riley W.T., Bloch M.H., Conway K., Deeds B.G., Dowling G.J., Grant S., Howlett K.D., Matochik J.A., Morgan G.D., Murray M.M., Noronha A., Spong C.Y.…Weiss S.R.B. The conception of the ABCD study: from substance use to a broad NIH collaboration. Dev. Cogn. Neurosci. 2018;32:4–7. doi: 10.1016/j.dcn.2017.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilson R.P., Colizzi M., Bossong M.G., Allen P., Kempton M., Abe N., Barros-Loscertales A.R., Bayer J., Beck A., Bjork J., Boecker R., Bustamante J.C., Choi J.S., Delmonte S., Dillon D., Figee M., Garavan H., Hagele C., Hermans E.J., MTAC The neural substrate of reward anticipation in health: a meta-analysis of fMRI findings in the monetary incentive delay task. Neuropsychol. Rev. 2018;28(4):496–506. doi: 10.1007/s11065-018-9385-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu C.C., Samanez-Larkin G.R., Katovich K., Knutson B. Affective traits link to reliable neural markers of incentive anticipation. NeuroImage. 2014;84:279–289. doi: 10.1016/j.neuroimage.2013.08.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yau W.-Y.W., Zubieta J.-K., Weiland B.J., Samudra P.G., Zucker R.A., Heitzeg M.M. Nucleus accumbens response to incentive stimuli anticipation in children of alcoholics: relationships with precursive behavioral risk and lifetime alcohol use. J. Neurosci. 2012;32(7):2544–2551. doi: 10.1523/JNEUROSCI.1390-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Y., Brady M., Smith S. Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm. IEEE Trans. Med. Imaging. 2001;20(1):45–57. doi: 10.1109/42.906424. [DOI] [PubMed] [Google Scholar]
- Zucker R.A., Fitzgerald H.E., Refior S.K., Puttler L.I., Pallas D.M., Ellis D.A. Children of Addiction. Routledge,; 2000. The Clinical and Social Ecology of Childhood for Children of Alcoholics: Description of a Study and Implications for a Differentiated: Description of a Study and Implications for a Differentiated Social Policy. [Google Scholar]
- Zyl L. van, Klooster P.M. ten. Exploratory structural equation modeling: practical guidelines and tutorial with a convenient online tool for Mplus. Front. Psychiatry. 2022;12 doi: 10.3389/fpsyt.2021.795672. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary material.