Abstract
The amygdala and its connections with medial prefrontal cortex (mPFC) play central roles in the development of emotional processes. While several studies have suggested that this circuitry exhibits functional changes across the first two decades of life, findings have been mixed ‐ perhaps resulting from differences in analytic choices across studies. Here we used multiverse analyses to examine the robustness of task‐based amygdala—mPFC function findings to analytic choices within the context of an accelerated longitudinal design (4–22 years‐old; N = 98; 183 scans; 1–3 scans/participant). Participants recruited from the greater Los Angeles area completed an event‐related emotional face (fear, neutral) task. Parallel analyses varying in preprocessing and modeling choices found that age‐related change estimates for amygdala reactivity were more robust than task‐evoked amygdala—mPFC functional connectivity to varied analytical choices. Specification curves indicated evidence for age‐related decreases in amygdala reactivity to faces, though within‐participant changes in amygdala reactivity could not be differentiated from between‐participant differences. In contrast, amygdala—mPFC functional connectivity results varied across methods much more, and evidence for age‐related change in amygdala—mPFC connectivity was not consistent. Generalized psychophysiological interaction (gPPI) measurements of connectivity were especially sensitive to whether a deconvolution step was applied. Our findings demonstrate the importance of assessing the robustness of findings to analysis choices, although the age‐related changes in our current work cannot be overinterpreted given low test–retest reliability. Together, these findings highlight both the challenges in estimating developmental change in longitudinal cohorts and the value of multiverse approaches in developmental neuroimaging for assessing robustness of results.
Keywords: amygdala, development, longitudinal, multiverse, prefrontal cortex, robustness
The current study used multiverse analyses to examine age‐related changes in task‐evoked amygdala–mPFC circuitry in an accelerated longitudinal cohort ages 4–22 years. We found evidence that in the context of this task, age‐related change estimates for amygdala reactivity to faces were more robust to analytic decisions than were estimates of age‐related change in amygdala—mPFC functional connectivity.
1. INTRODUCTION
Rodent models and human neuroimaging have provided converging evidence for the importance of the amygdala and medial prefrontal cortex (mPFC) in the development of threat processing (Adolphs, 2008; Forbes, Phillips, Silk, Ryan, & Dahl, 2011), emotion regulation (Pozzi, Vijayakumar, Rakesh, & Whittle, 2020; Silvers, Shu, Hubbard, Weber, & Ochsner, 2015; Sullivan & Perry, 2015), and affective learning (Moriceau & Sullivan, 2006; Pattwell et al., 2016). Characterizing growth trajectories of these regions may provide insight into neural constructions underlying emotional development (Meyer & Lee, 2019). To probe amygdala—mPFC circuitry across development, face stimuli are frequently employed because they effectively engage this circuitry while being child‐appropriate (Hariri, Tessitore, Mattay, Fera, & Weinberger, 2002). Though a number of studies have examined age‐related changes from childhood to young adulthood in amygdala responses and amygdala—mPFC functional connectivity (FC) associated with emotional face stimuli, findings have varied widely (likely due in part to differences in sample composition and task design; see Table S1 for details). Several studies have found age‐related change in amygdala reactivity, including decreases as a function of age in response to emotional faces (Gee et al., 2013; Guyer et al., 2008; Killgore, Oki, & Yurgelun‐Todd, 2001; Passarotti, Sweeney, & Pavuluri, 2009; Swartz, Carrasco, Wiggins, Thomason, & Monk, 2014; Telzer et al., 2015) as well as other images (Decety, Michalska, & Kinzler, 2012; Silvers et al., 2017b; Vink, Derks, Hoogendam, Hillegers, & Kahn, 2014), increases in amygdala reactivity with age (Joseph et al., 2015; Todd, Evans, Morris, Lewis, & Taylor, 2011), developmental sex differences (Xu et al., 2021) or peaks during adolescence (Hare et al., 2008; Vijayakumar, Pfeifer, Flournoy, Hernandez, & Dapretto, 2019). Others have observed no age‐related changes (Kujawa et al., 2016; Pfeifer et al., 2011; Pine et al., 2001; Wu et al., 2016; Yurgelun‐Todd & Killgore, 2006; Zhang, Padmanabhan, Gross, & Menon, 2019).
With task‐evoked amygdala—mPFC FC, several studies have found age‐related decreases from childhood to young adulthood (Gee et al., 2013; Kujawa et al., 2016; Silvers et al., 2017a; Wu et al., 2016), while others have found increases (Decety et al., 2012; Perlman & Pelphrey, 2011; Vink et al., 2014), developmental sex differences (Xu et al., 2021), or little age‐related change (Zhang et al., 2019). While some investigations have found differing age‐related change for faces displaying different emotions (Killgore & Yurgelun‐Todd, 2007; Swartz et al., 2014; Vijayakumar et al., 2019), even investigations of fearful faces specifically have varied in their developmental findings for both amygdala reactivity and amygdala—mPFC FC (Forbes et al., 2011; Gee et al., 2013; Killgore et al., 2001; Wu et al., 2016; Zhang et al., 2019).
While the small sample sizes examined in many studies on amygdala—mPFC development likely contribute to differences in findings (Marek et al., 2020), especially given low reliability of many amygdala—mPFC measures (Elliott et al., 2020; Herting, Gautam, Chen, Mezher, & Vetter, 2017; Sauder, Hajcak, Angstadt, & Phan, 2013), important methodological differences also exist across studies. Differences in age range or sample demographics, stimuli, task (e.g., passive viewing vs. emotion labeling or matching; Lieberman et al., 2007), task design (blocked vs. event‐related; Sergerie, Chochol, & Armony, 2008), or contrast used (faces > shapes vs. faces > baseline) may also contribute to discrepancies (see Table S1). The brain regions under investigation also differ across studies; for example, prefrontal clusters with which amygdala connectivity was assessed. Interpreting discrepancies across studies without appreciation for these methodological differences would be inappropriate, and in fact, incorrect. Yet, such differences do not account for all discrepancies in findings across studies. Variation in processing pipelines is another source of differences across studies, as varying analytic decisions can produce qualitatively different findings, even between putatively identical analyses of the same dataset (Botvinik‐Nezer et al., 2020). Choices including software package (Bowring, Maumet, & Nichols, 2019), spatial smoothing (Jo et al., 2007), treatment of head motion (Achterberg & van der Meulen, 2019), parcellation (Bryce et al., 2021), and functional connectivity approach (Di, Zhang, & Biswal, 2020) can also impact results and qualitatively change findings (Cisler, Bush, & Steele, 2014). Additionally, the majority of developmental investigations of amygdala—mPFC function have studied cross‐sectional samples. Because cross‐sectional studies are susceptible to cohort effects and cannot measure within‐participant change, longitudinal work has been recommended for better charting of developmental trajectories (Crone & Elzinga, 2015; Madhyastha et al., 2018).
Here, we used multiverse analyses to examine the robustness of developmental changes to varied analytical decisions. We focused on task‐related amygdala—mPFC functional development in an accelerated longitudinal sample ranging from ages 4 to 22 years. We selected a task that was designed to be appropriate for young ages to characterize developmental change in amygdala—mPFC responses to fear and neutral faces across childhood and adolescence, and we asked whether findings were robust to analytical choices. This accelerated longitudinal design is an extension of the sample reported on by Gee et al. (2013). We preregistered two hypotheses (https://osf.io/8nyj7/) predicting that both amygdala reactivity (1) and amygdala—mPFC connectivity (2) as measured with generalized psychophysiological interaction models (gPPI), would decrease as a function of age during viewing of fearful faces relative to baseline (see Table 1 Aims 1a and 2a).
TABLE 1.
Summary of main aims, hypotheses, methods, and findings
Aim | Preregistered hypothesis | Analysis methodology | Key findings |
---|---|---|---|
1a. Age‐related change in amygdala reactivity to fear faces |
Amygdala reactivity to fearful faces will decrease with age, such that younger children will have more positive amygdala reactivity (higher BOLD response to fear faces relative to implicit baseline) than older youth. |
Multiverse amygdala ROI (anatomically‐defined) analysis using multilevel linear regression at the group level. Multiverse decision points: Preprocessing software, GLM software, GLM nuisance regressors, amygdala ROI definition, contrast estimate type (t‐stat vs. beta estimate), HRF shape, group‐level model covariates, and exclusion of previously analyzed scans |
|
1b. Age‐related change in patterns of amygdala responses across task trials | None |
Multiverse analysis of slopes of amygdala reactivity across trials, and amygdala reactivity in each half of trials using multilevel linear regression at the group level, single trial models Multiverse decision points: Global signal subtraction, amygdala ROI definition, and group‐level model covariates |
|
2a. Age‐related change in amygdala—mPFC functional connectivity to fear faces, as measured by generalized psychophysiological interaction (gPPI) | Amygdala–mPFC FC will decrease as a function of age such that as age increases, the valence of FC will shift from positive to negative. |
Multiverse gPPI analysis with anatomically defined bilateral amygdala seed and mPFC target ROIs using multilevel linear regression at the group level. Multiverse decision points: Deconvolution step, mPFC ROI definition, contrast estimate type (t‐stat vs. beta estimate), and group‐level model covariates |
|
2b. Age‐related change in amygdala—mPFC functional connectivity to fear faces, as measured by beta‐series correlation (BSC) | None for BSC specifically |
Multiverse BSC analysis between amygdala and mPFC using multilevel linear regression at the group level. Multiverse decision points: Global signal subtraction, amygdala ROI definition, mPFC ROI definition, and group‐level model covariates |
|
3. Associations of amygdala reactivity, change in amygdala reactivity across trials, or amygdala—mPFC FC with separation anxiety | None |
Multiverse multilevel linear regressions with brain measures as predictors for separation anxiety behaviors, controlling for age Multiverse decision points: Separation anxiety measure, FC measure, mPFC ROI (FC only), amygdala ROI, contrast, and deconvolution step (gPPI only) |
|
Although we did not preregister further hypotheses, we also investigated age‐related changes in within‐scan differences in amygdala responses across trials and FC using beta series correlations. As previous work identified associations between amygdala—mPFC FC and separation anxiety (Carpenter et al., 2015; Gee et al., 2013), we asked whether any amygdala—mPFC measures were associated longitudinally with separation anxiety behaviors (see Table 1 Aim 3). We used “multiverse” analyses and specification curves to examine the impact of analytical decisions on results. We also investigated test–retest reliability of all brain measurements across longitudinal study visits, given the importance of such reliability for interpreting individual differences or developmental change (Herting et al., 2017). Our multiverse approach allows us to thoroughly explore the robustness of different findings to analytical choices, highlighting the importance of considering both robustness and reliability in developmental research.
2. METHODS
Before completing analyses, we preregistered methods for the current study through the Open Science Framework at https://osf.io/8nyj7/. Only analyses for age‐related changes in amygdala reactivity and amygdala–mPFC gPPI were preregistered in detail (see Table 1 Aims 1a and 2a), and we did not preregister multiverse analyses. Methods detailed below include both information included in the preregistration and additional information and analyses not preregistered. Analysis code and documentation can be found at https://github.com/pab2163/amygdala_mpfc_multiverse.
2.1. Participants
Participants were recruited as part of a larger study examining brain development as a function of early life caregiving experiences. The current sample (N = 98; 55 female, 43 male) included typically developing children, adolescents, and young adults covering the ages 4–22 years‐old (M = 11.9 years old) who enrolled to participate in a study on emotional development. All participants were reported to be physically and psychiatrically healthy (no medical or psychiatric disorders), as indicated by a telephone screening before participation, and free of MRI contraindications. All except four participants fell below clinical cutoffs (see Figure S2) on the Child Behavior Checklist (CBCL) Total Problems, Internalizing Problems, and Externalizing Problems scales (Achenbach, 1991). The larger study also included youths with a history of institutional and/or foster care outside of the United States, who are not included here. Participants from the greater Los Angeles area were recruited through flyers, state birth records, community events, online advertising, lab website and newsletters, psychologists' offices, psychology courses at a local university (participants ages 18–22 years‐old only), and word‐of‐mouth. Each participant completed up to three MRI scans spaced at an average interval of 18 months between visits. Parents provided written consent, children 7+ years old gave written assent, and children under 7 years old gave verbal assent. Study protocols were approved by the local university institutional review board. These data were collected between 2009 and 2015.
An accelerated longitudinal design was used such that participants' starting ages at scan 1 comprised the entire range of sample ages (4–22 years old), and coverage was approximately balanced across the entire age range (see Figure 1b). The design was structured into three study “waves” corresponding with recruitment efforts and visit protocols. Participants were overenrolled at Wave 1 to account for anticipated attrition (e.g., braces, relocation, etc.) to achieve the desired sample size across the three waves. While most participants were recruited such that their first scan occurred at Wave 1, a smaller set of participants were recruited at Wave 2, such that some participants completed their first scan at Wave 2 (see Figure 1). For these participants, only two scans were planned.
FIGURE 1.
(a) Schematic showing study inclusion criteria. (b) Included scans at each study wave, with each dot representing one scan, and horizontal lines connecting participants across study waves
Of the 191 participants participating in the longitudinal study, 140 completed at least one MRI scan. After exclusions for incomplete task runs (including falling asleep), computer errors resulting in missing stimulus timing files, high head motion, and failed visual QA (scanner/motion artifacts), a final sample of 98 participants (N = 183 total scans) was included for analysis (see Figure 1). Exclusion criteria were preregistered after conducting preliminary preprocessing, but before construction of group‐level models and multiverse analysis plans. This sample included 40 participants with 1 scan (including 10 participants ages 18‐22 not enrolled for additional scan sessions), 31 with 2 scans, and 27 with 3 scans (one more participant than preregistered due to an initial coding error). Wave 1 data from 42 of these participants were reported on by Gee et al. (2013).
The median annual household income for participating families was $85,001–$100,000 (for reference, median annual household income in Los Angeles County from 2015 to 2019 was $68,044; US Census Bureau, 2021). Epidemiological methods were not used to recruit a sample representative of the Los Angeles or United States populations (Heeringa et al., 2004), and Hispanic or Latinx participants were particularly underrepresented. Further sample demographics can be found in the Supporting Information (see Tables S2 and S3, Figures S1 and S2).
2.2. Separation anxiety
For each participant (except for 10 adults 18–22 years), a parent completed both the Revised Children's Anxiety and Depression Scale (RCADS‐P) and the Screen for Child Anxiety Related Emotional Disorders (SCARED‐P) to assess the frequency of symptoms of anxiety and low mood (Birmaher et al., 1999; Chorpita, Yim, Moffitt, Umemoto, & Francis, 2000). Following prior work suggesting associations between task‐evoked amygdala—mPFC FC and separation anxiety (Carpenter et al., 2015; Gee et al., 2013), we used the separation anxiety subscales from both the SCARED‐P and RCADS‐P as measures of anxiety‐related behaviors in asking whether such FC may be linked to anxiety levels during childhood and adolescence. For 11 participants who had missing items on the SCARED‐P, indicating parents had skipped or forgotten to answer a question, we imputed responses using 5‐Nearest Neighbor imputation using only the other items included in the SCARED‐P separation anxiety subscale (Beretta & Santaniello, 2016). As expected, raw separation anxiety scores on both measures decreased as a function of age, while standardized scores (which are normed based on gender and grade level) were consistent across development with few children at or near clinical threshold (see Figure 6).
FIGURE 6.
Multiverse analyses of associations between amygdala—mPFC circuitry and separation anxiety. (a). Age‐related change in SCARED and RCADS raw and t‐scores for parent‐reported separation anxiety subscales. The red dotted line in the middle panel represents the clinical threshold for the standardized RCADS measure (because this t‐score measure is standardized based on age and gender, no age‐related change is expected). (b). Separate specification curves for associations of amygdala reactivity (left), amygdala—mPFC connectivity (both gPPI and BSC; center two panels), and amygdala reactivity slopes across trials (right) with the three separation anxiety outcomes shown in a. Points represent estimated associations between brain measures and separation anxiety (controlling for mean FD and age) and lines are corresponding 95% posterior intervals. Models are ordered by beta estimates, and the dotted line represents the median estimate across all specifications. Color indicates sign of beta estimates and whether respective posterior intervals include 0 (red = negative excluding 0, blue = negative including 0, green = positive including 0). Scores on each separation anxiety outcome were z‐scored for comparison. (c). Example model predictions for associations between fear > baseline amygdala—mPFC gPPI and each separation anxiety measure. Predictions and 95% posterior intervals are plotted for each separation anxiety measure separately for each mPFC region, and for gPPI pipelines with and without a deconvolution step. Pipelines shown use robust regression, have random slopes, no covariates for task block or scanner, and no quadratic age term
2.3. Emotion discrimination task
Participants completed either two (at Wave 3) or three (at Waves 1 and 2) runs of a modified “go/no‐go” task with emotional faces during fMRI scanning. Runs varied by emotional expression (fear, happy, sad), and within each run participants viewed emotional faces interspersed with neutral faces. To ensure that participants were paying attention, they were asked to press a button whenever they saw a neutral face (no response was required for any other face expression). The order of the runs was counterbalanced across participants; the stimuli within each run were pseudorandomized (Wager & Nichols, 2003) to allow for event‐related estimates of the hemodynamic response, and fixed across participants. For the present analysis, only the fear run of the task was used. The other two runs, which used happy and sad faces in place of fear, are not included in the present analysis as these conditions were not present at all waves of data collection. As 50% of trials were “go” trials under this paradigm, we refer to the task as an emotion discrimination task, rather than a true “go/no‐go” paradigm since there was no strong prepotent motor response. Stimuli within each run were presented with a jittered ITI (3–10 s, median = 4.93 s) according to a genetic algorithm with a fixation cross on the screen (Wager & Nichols, 2003). Face images were adult White female faces from the Karolinska Directed Emotional Faces database (Calvo & Lundqvist, 2008), and the same face stimuli were used across longitudinal study visits (Vijayakumar et al., 2019). Each run (130 TRs, duration of 4:20) consisted of 48 trials (24 neutral faces, 24 fearful faces), each presented for 350 ms. All fMRI analyses of this task used event‐related designs.
2.4. MRI acquisition
Participants under 18‐years‐old completed a mock scanning session before the MRI scan to acclimate to the scanner environment and practice lying still for data collection. Waves 1 and 2 were collected on a Siemens 3T TIM Trio MRI scanner using a standard radiofrequency head coil. A two‐dimensional spin echo image (TR, 4,000 ms; TE, 40 ms; matrix size, 256 × 256; 4 mm thick; 0 mm gap) was acquired in the oblique plane to guide slice configuration in both structural and functional scans. A whole‐brain high‐resolution T1‐weighted anatomical scan (MPRAGE; 256 × 256 in‐plane resolution; 256 mm FOV; 192 × 1 mm sagittal slices) was acquired for each participant for registration of functional data. The task was presented through MR‐compatible goggles during scanning. T2*‐weighted echoplanar images (interleaved slice acquisition) were collected at an oblique angle of ~30° (130 volumes/run; TR = 2,000 ms; TE = 30 ms; flip angle = 90°; matrix size = 64 × 64; FOV = 192 mm; 34 slices; 4 mm slice thickness; skip = 0 mm; voxel size = 3x3x4 mm). Wave 3 was collected on a Siemens 3T TIM Trio MRI scanner at a different location using identical acquisition parameters.
2.5. Behavioral analyses
We used multilevel logistic regression models to estimate age‐related changes in several task performance metrics. We fit separate models for the d′ performance metric, overall accuracy (probability of a correct response on any trial), hit rate (on neutral face trials), and false alarm rate (on fear face trials) as the respective outcomes, and included nested random effects for task sessions within participants (models were not nested for d′ as this analysis used only 1 metric per session rather than trial‐wise outcomes, but still included random effects for participants). Additionally, to model age‐related change in reaction times during correct hit trials, we fit linear, quadratic, cubic, and inverse age (1/age; Luna, Garver, Urban, Lazar, & Sweeney, 2004; Luna, Tervo‐Clemmens, & Calabro, 2021) regressions with identical random effects structures. Model equations and results for all behavioral analyses can be found in the Supporting Information (see Supporting Information methods pp. 12–14, Figures S3 and S4).
2.6. Preregistered fMRI pipeline
3dskullstrip from the Analysis of Functional NeuroImages (AFNI, v20.1.16) software package (Cox, 1996) was first run on all MPRAGE scans. Next, experimenters checked the quality of the skull stripping. If there were outstanding issues with a particular scan run (areas of brain tissue cut off, or significant areas of skull left in, 30/195 scans), FSL's brain extraction tool (BET; Jenkinson, Beckmann, Behrens, Woolrich, & Smith, 2012) was used instead. We used robust brain center estimation, and modified the fractional intensity values between 0.5 and 0.7 to optimize quality. Slice‐time correction was not used. Timeseries of the 6 head motion parameters were calculated and subsequent spatial realignment of BOLD volumes was completed using MCFLIRT in FSL (Jenkinson, Bannister, Brady, & Smith, 2002). Scans over a threshold of >40 volumes with >0.9 mm framewise displacement (FD; calculated as the sum of absolute frame‐to‐frame differences between head realignment estimates; Power, Barnes, Snyder, Schlaggar, & Petersen, 2012) were excluded from analysis (12 out of an initial 195, or 6.2%). After this exclusion, an average of 96.7% (range = [70.1%–100%]) of stimulus‐coincident volumes in each scan were below the 0.9 mm FD threshold. The mean age of participants with excluded scans was 7.16 years and 8/12 were male. Registration matrices were calculated for registration of functional images to high‐resolution structural T1 images using FSL's FLIRT with boundary‐based registration. Registration matrices for standard MNI space were also calculated using both FLIRT (linear registration) and FNIRT (nonlinear registration) with 12 DOF and a warp resolution of 10 mm. Data were high‐pass filtered at 0.01 Hz and smoothed with an isotropic Gaussian kernel with FWHM of 6 mm before running general linear models (GLMs), and four‐dimensional volumes were grand mean scaled such that the average intensity value was 10,000.
Following preprocessing, we ran scan‐level GLMs using FSL's FEAT (v6.00). We included event‐related regressors for fear and neutral faces (convolved with a double‐gamma HRF), their temporal derivatives (Pernet, 2014), and 24 head motion nuisance regressors (the 6 head realignment parameters, their temporal derivatives, and their squares (Power et al., 2012). Volumes with FD >0.9 mm were downweighed to 0 in the GLM. Pre‐whitening was used to estimate and remove temporal autocorrelation (Woolrich, Ripley, Brady, & Smith, 2001). For each scan, we calculated fear > baseline, neutral > baseline, and fear > neutral contrasts. We used native‐space bilateral amygdala masks generated using Freesurfer (v6.0; Fischl, 2012) by VanTieghem et al. (2021).
2.7. Multiverse analyses and specification curves
In addition to the preregistered pipelines, we conducted multiverse analyses to address all aims in Table 1 and constructed sets of separate specification curves for each aim (see Table 2). In general, multiverse analyses aim to probe the consistency of results across all “reasonable” possible combinations of analysis decisions (i.e., simultaneously taking all possible “forking paths”; Steegen, Tuerlinckx, Gelman, & Vanpaemel, 2016). Because analyzing fMRI data using all reasonable specifications was infeasible (i.e., possibilities are virtually infinite), we took the approach of “sampling” from the many reasonable or commonly‐used analysis choices for each multiverse. Despite not being completely comprehensive, this approach still allowed for thorough investigation into the robustness of results. For all multiverse analyses, we constructed specification curves by ranking models by their beta estimates (ascending) for parameters of interest for interpretation and visualization (Cosme & Lopez, 2020; Klapwijk, van den Bos, Tamnes, Mills, & Raschle, 2019; Orben & Przybylski, 2019; Simonsohn, Simmons, & Nelson, 2015, 2020). Because specification choices were not preregistered, we did not conduct formal null hypothesis testing of specification curves. Instead, as continuous measures of evidence, we report the proportion of specifications resulting in an estimate of the same sign, as well as the proportion of specifications resulting in 95% posterior intervals excluding 0 in the same direction. In addition, to analyze in more detail the impact of specific choices, we submitted point estimates for parameters of interest across all specifications to multiple regression models. From these models, we examined the conditional effects of each analysis decision point on the parameter of interest (see Supporting Information methods pp. 30–31, Figures S11–S13, S41–S43, and S55–S57).
TABLE 2.
Summary of forking pipelines used in analyses for each aim a
Aim/analysis | Decision point | Choices |
---|---|---|
1a. Age‐related change in amygdala reactivity to fear faces > baseline |
Preprocessing software | FSL FEAT , C‐PAC |
GLM software | FSL FEAT , AFNI 3dDeconvolve | |
Hemodynamic response function | Double gamma , single gamma | |
Nuisance regressors | 24 motion regressors , 6 motion regressors, 18 motion regressors + WM + CSF | |
Low‐frequency artifact removal | High‐pass filter (0.01 Hz) , quadratic drift regressor | |
First‐level GLM estimates | Beta estimates , T‐statistics | |
Native versus standard MNI space | Native space (Freesurfer) , Harvard‐Oxford Atlas in MNI | |
Amygdala ROI | Bilateral , left, right, high signal, low signal | |
Inclusion of 45 previously analyzed scans | Include , exclude | |
Outlier treatment | Exclude ±3SD from mean , exclude ±3SD from mean + robust regression | |
Group‐level model covariates | Mean FD , mean FD + run, mean FD + scanner, mean FD + run + scanner | |
Group‐level model quadratic term | Yes, no | |
Group‐level model random slopes | Yes , no | |
1b. Age‐related change in patterns of amygdala responses across task trials FSL preproc and GLM, high‐pass filter, 24 motion regressors, 2G HRF, beta estimates, included previously analyzed scans, and robust group‐level regression |
Method of quantifying within‐scan change | Slopes across trials, trials split into halves, single‐trial models |
Global signal subtraction | Yes, no | |
Amygdala ROI (all MNI space) | Bilateral, left, right | |
Group‐level model covariates | Mean FD, mean FD + run, mean FD + scanner, mean FD + run + scanner | |
Group‐level model quadratic term | Yes, no | |
Group‐level model random slopes | Yes, no | |
2a. Age‐related change in amygdala–mPFC functional connectivity (FC) to fear faces > baseline, as measured by (gPPI) FSL preproc and GLM, high‐pass filter, 24 motion regressors, 2G HRF, and bilateral amygdala ROI in MNI space |
Deconvolution step | Yes, no |
mPFC ROI (all MNI space) | Three different 5 mm spheres , large vmPFC mask | |
Outlier treatment | Exclude ±3SD from mean , exclude ±3SD from mean + robust regression | |
Inclusion of 45 previously analyzed scans | Include , exclude | |
Group‐level model covariates | Mean FD , mean FD + run, mean FD + scanner, mean FD + run + scanner | |
Group‐level model quadratic term | Yes, no | |
Group‐level model random slopes | Yes , no | |
2b. Age‐related change in amygdala–mPFC functional connectivity to fear faces > baseline, as measured by (BSC) FSL preproc and GLM, high‐pass filter, 24 motion regressors, 2G HRF, beta estimates, robust group‐level regression, and included previously analyzed scans |
Amygdala ROI (all MNI space) | Bilateral, left, right |
mPFC ROI (all MNI space) | Three different 5 mm spheres, large vmPFC mask | |
Global signal subtraction | Yes, no | |
Group‐level model covariates | Mean FD, mean FD + run, mean FD + scanner, mean FD + run + scanner | |
Group‐level model quadratic term | Yes, no | |
Group‐level model random slopes | Yes, no | |
3. Associations of amygdala reactivity, change in amygdala reactivity across trials, or amygdala–mPFC FC with separation anxiety See Supporting Information methods p. 29 for details on included pipelines |
Brain measure | Amygdala reactivity, amygdala reactivity slopes, amygdala–mPFC gPPI, amygdala–mPFC BSC |
Global signal subtraction (amygdala reactivity slopes and BSC only) | Yes, no | |
Deconvolution step (gPPI only) | Yes, no | |
mPFC ROI (all MNI space, gPPI, and BSC only) | Three different 5 mm spheres, large vmPFC mask | |
Separation anxiety outcome variable | RCADS, SCARED raw scores, SCARED t‐scores |
Bolded & italicized choices indicate those most closely matching preregistered pipelines.
2.8. Multiverse amygdala reactivity analyses
For amygdala reactivity analyses, we examined the robustness of age‐related change estimates to a variety of analytical decisions. In addition to the preregistered FSL‐based pipeline, we preprocessed data using C‐PAC software (v1.4.1; Craddock et al., 2013). We used C‐PAC to take advantage of features supporting running multiple pipeline “forks” in parallel (for example multiple nuisance regression forks using the same registration). No spatial smoothing was used in C‐PAC pipelines (see Supporting Information methods p. 14). Following C‐PAC and FSL preprocessing, we examined the impact of different sets of commonly‐used analysis methods on age‐related change in amygdala reactivity. We varied analysis choices of GLM software, hemodynamic response function, nuisance regressors, first‐level GLM estimates, amygdala ROI, exclusion criteria (exclude vs. include scans analyzed by Gee et al., 2013), group‐level model outlier treatment, and group‐level model covariates (see Table 2 and Supporting Information methods pp. 14–17). Multiverse analyses of amygdala reactivity included a total of 2,808 model specifications (156 ways of defining participant‐level amygdala reactivity × 18 group‐level model specifications) for each contrast. We analyzed all specifications in parallel. In addition, we examined nonlinear age‐related changes using quadratic and inverse age models (see Figures S14–S17) and ran a smaller set of analyses (Figure S19) to ask whether we could differentiate within‐participant change over time from between‐participant differences through alternative model parametrization (see Supporting Information methods p. 19).
For all specifications, individual‐level amygdala reactivity estimates were submitted to a group‐level multilevel regression model for estimation of age‐related changes. All models allowed intercepts to vary by participant, and some specifications also allowed for varying slopes (see Supporting Information methods p. 15 for model syntax). All models also included a scan‐level covariate for head motion (mean FD; Power et al., 2012; Satterthwaite et al., 2012, 2013). Consistent with prior work, head motion was higher on average in younger children, and decreased with age (see Figure S5), though head motion was not associated with amygdala reactivity estimates for most specifications (see Figure S26). Age‐related change findings examined for the preregistered pipeline also remained consistent under more stringent exclusion thresholds based on mean FD (see Figure S27). Across preprocessing specifications, we also examined within‐scan similarity of amygdala and whole‐brain voxelwise reactivity patterns (see Figures S20 and S21) and between‐scan correlations of average amygdala reactivity estimates (Figures S22–S24).
2.9. Change in amygdala reactivity across trials
To probe whether amygdala reactivity exhibited within‐scan change in an age‐dependent manner, we modeled reactivity to each face trial using a least squares separate method (LSS; Abdulrahman & Henson, 2016). After preprocessing, we used FEAT to fit 48 separate GLMs corresponding to each trial in each scan. A given trial was modeled with its own regressor and the remaining 47 trials were modeled with a single regressor. Each GLM also included 24 head motion nuisance regressors and had TRs with FD >0.9 mm downweighted to 0. BOLD data were high‐pass filtered at 0.01 Hz before the GLM. From each of the 48 GLMs, we extracted the mean amygdala beta estimates corresponding to a contrast for each single trial > baseline.
We constructed separate multiverse analyses using three different methods for measuring change in amygdala reactivity across trials. For Method 1 (slopes), we measured rank‐order correlations between trial number and trial‐wise amygdala betas. For Method 2 (trial halves), we split trials into the first half (Trials 1–12) and second half (Trials 13–24), and modeled age‐related change in each half. For Method 3 (single‐trial models), we constructed larger multilevel models with individual trials as the unit of observation. We conducted several analysis specifications for each method (see Table 2 and Supporting Information methods pp. 21–23), and generated corresponding specification curves.
2.10. Multiverse amygdala—mPFC FC analyses
We applied multiverse analysis techniques toward examining age‐related changes in amygdala—mPFC FC using gPPI and beta‐series correlation (BSC) methods. Briefly, gPPI estimates FC by constructing an interaction term between the timecourse in a seed region of interest and a stimulus (task) regressor. Voxels whose activity are well fit by this interaction term (a psychological‐physiological interaction, or PPI) are assumed to be “functionally coupled” with the seed region in a way that depends on the behavioral task (McLaren, Ries, Xu, & Johnson, 2012; O'Reilly, Woolrich, Behrens, Smith, & Johansen‐Berg, 2012). BSC offers a different way of estimating functioning connectivity, by constructing “timeseries” of beta values (i.e., a beta series) in a condition of interest for two regions of interest, and calculating the product–moment correlation between those beta series.
We constructed separate specification curves for age‐related change in gPPI and BSC for each contrast. Across gPPI specifications, we varied whether to use a deconvolution step in creating interaction regressors (Di & Biswal, 2017; Gitelman, Penny, Ashburner, & Friston, 2003), as well as several other analysis decision points (see Table 2 and Supporting Information methods pp. 24–25). The deconvolution step applies to the preprocessed BOLD data from the seed timecourse: these data are first deconvolved to estimate the “underlying neural activity” that produced the BOLD signal (Gitelman et al., 2003), then these deconvolved signals are multiplied with the task regressor (e.g., for fear faces). Finally, this new interaction term is convolved with a hemodynamic response function to produce the BOLD FC regressor of interest. Given recent work indicating that centering the task regressor before creation of the interaction term can mitigate spurious effects (Di, Reynolds, & Biswal, 2017), we also compared pipelines in which we centered the task regressor before deconvolution (pipelines including deconvolution in main analyses did not include this step; see Figure S44).
We preregistered constructing an mPFC ROI containing 120 voxels centered at the peak coordinates reported by Gee et al. (2013) for age‐related change in fear > baseline gPPI (Talairach 2,32,8; or MNI 3,35,8). However, after preregistration we discovered that these peak coordinates were not at the center of the ROI reported by Gee et al. (2013), and were quite close to the corpus callosum. The 120‐voxel ROI we created that was centered at this peak coordinate would have contained a high proportion of white matter voxels relative to cortical voxels (though this was not true for the mPFC ROI identified by Gee et al. (2013)). To address this issue, we instead constructed three spherical ROIs with 5 mm radii; the first centered at the above peak coordinates, the second shifted slightly anterior, and the third shifted slightly ventral relative to the second (see Figure 4). Lastly, to examine amygdala FC with a more broadly‐defined mPFC, we also used a “large vmPFC” mask encompassing many of the areas within the ventromedial prefrontal cortex derived from Mackey and Petrides (2014).
FIGURE 4.
Multiverse analyses of age‐related change in amygdala—mPFC connectivity using gPPI methods. (a). MNI space mPFC ROIs used in connectivity analyses. (b). Example participant‐level data and model predictions for age‐related related change in amygdala—mPFC gPPI for analysis pipelines with a deconvolution step (red), or without (blue) for each of the four regions shown in a. Although deconvolution changed the sign of age‐related change estimates, the estimates are not “statistically significant” for each pipeline alone, except for mPFC ROIs 1 and 2 without deconvolution. (c). Specification curve of age‐related change in fear > baseline amygdala—mPFC gPPI. Points represent estimated linear age‐related change and lines are corresponding 95% posterior intervals. Models are ordered by age‐related change estimates, and the dotted line represents the median estimate across all specifications. Color indicates sign of beta estimates and whether respective posterior intervals include 0 (blue = negative including 0, green = positive including 0, purple = positive excluding 0, black = median across all specifications). Black points with error bars represent the median and IQR ranks of specifications making the choice indicated on the corresponding line. (d). Model specification information corresponding to each model in c. Variables on the y‐axis represent analysis choices, corresponding color‐coded marks indicate that a choice was made, and blank space indicates that the choice was not made in a given analysis. Within each category (Group‐Level Model, mPFC ROI, and Participant‐Level Model), respectively, decision points are ordered from top to bottom by the median model rank when the corresponding choice is made (i.e., choices at the top of each panel tend to have more negative age‐related change estimates). See https://pbloom.shinyapps.io/amygdala_mpfc_multiverse/ for interactive visualizations
For BSC analyses, we used beta estimates from the LSS GLMs described above for analyses of within‐scan change in amygdala reactivity. Across BSC specifications we varied analyses across several decision points (see Table 2 and Supporting Information methods p. 26), including whether to include a correction for global signal (post hoc distribution centering [Fox, Zhang, Snyder, & Raichle, 2009]). We extracted mean beta estimates for amygdala and mPFC ROIs for each trial, then calculated product–moment correlations between the timeseries of betas across trials (neutral and fear separately) for both regions (Di et al., 2020). These correlation coefficients were transformed to z‐scores, then submitted to group‐level models.
Age‐related changes in gPPI and BSC were estimated using multilevel regression models as described for the amygdala reactivity analyses. We focused primarily on linear age‐related change, but also examined quadratic and inverse age associations (see Figures S45–S48 and S58–S61). We separately examined group mean gPPI and BSC for each contrast (see Figures S38 and S52), as well as associations between mean FD and both FC measures across specifications (see Figures S49 and S62). Additionally, we examined mean estimates and age‐related change in “task‐independent” FC as measured by beta weight of the “physio” term from the seed amygdala timeseries within the gPPI model (representing baseline amygdala—mPFC FC controlling for task‐induced variance; Figures S50 and S51).
2.11. Multiverse analyses of associations between amygdala and mPFC circuitry and separation anxiety behaviors
We used further multiverse analyses to ask whether amygdala reactivity, change in amygdala reactivity over the course of the task, or amygdala—mPFC FC were associated with separation anxiety behaviors. Separate specification curves were created for each brain measure type (amygdala reactivity, amygdala reactivity change across trials, and amygdala—mPFC FC). All analyses used multilevel regression models with covariates for age, and specification curves included both RCADS‐P and SCARED‐P separation anxiety subscales as outcomes (see Table 2 and Supporting Information methods pp. 29–30). Because we did not have parent‐reported RCADS‐P or SCARED‐P scores for 10 adult participants, these analyses had an N = 173.
2.12. Reliability analyses
To better understand the proportion of variance in each measure explained by the grouping of observations within repeated measurements of the same participants over time, we computed Bayesian intraclass correlation coefficient (ICC) estimates through variance decomposition of the posterior predictive distributions of the multilevel regression models previously described. We implemented these through the performance R package (Lüdecke, Ben‐Shachar, Patil, Waggoner, & Makowski, 2021; Nakagawa, Johnson, & Schielzeth, 2017). Negative ICC estimates under this method are possible, and indicate that the posterior predictive distribution has higher variance when not conditioning on random effects than when conditioning on them (likely indicating the posterior predictive variance is large, and random effects explain very little of this variance).
2.13. Model‐fitting
All statistical models fit at the group level were run in the R (v 3.6.1) computing environment. In order to most accurately model age‐related changes in each of our measures, we attempted to take into account both between‐participants information and repeated measurements within participants over time. Unless otherwise indicated, models were estimated using Hamiltonian Markov chain Monte Carlo (MCMC) sampling as implemented in the Stan programming language through the brms package in R (Bürkner, 2019; Gelman, Lee, & Guo, 2015). Unless otherwise indicated, all models used package default weakly‐informative priors (student‐t distributions with mean 0, scale parameter of 10 standardized units, and 3 degrees of freedom for all fixed effects), and were run with four chains of 2,000 sampling iterations (1,000 warmup) each (see Supporting Information methods pp. 18–19 and p. 30 for syntax).
2.14. Interactive visualizations
Because static plots visualizing the model predictions for all models in each multiverse would require far more page space than available, we created web‐based interactive visualization tools for exploring different model specifications and viewing the corresponding raw (participant‐level) data and fitted model predictions using R and Shiny (Beeley, 2013). These visualizations can be found at https://pbloom.shinyapps.io/amygdala_mpfc_multiverse/
2.15. Deviations from preregistration
Although we largely completed the preregistered analyses, the current study includes many analyses beyond those proposed in the initial preregistration. Because the additional analyses (i.e., all multiverses) conducted here give us substantial analytical flexibility over that initially indicated by preregistration, we consider all results here to be at least in part exploratory (rather than completely confirmatory), despite the preregistered hypotheses. Additionally, we note that BSC analyses, analyses of change in amygdala reactivity across trials, and analyses of associations between all brain measures and separation anxiety were exploratory, and conducted after we had seen the results of the preregistered reactivity and gPPI analyses. In addition, to avoid possible selection bias introduced by the analytical flexibility inherent in running many parallel analyses, we consider all analysis specifications simultaneously, emphasizing that without further methodological work, we consider all such choices in tandem as providing equal evidential value. While reliability analyses were not preregistered, they too provide key information for interpreting the current analyses.
3. RESULTS
3.1. Age‐related change in amygdala reactivity
We used multilevel regression models and specification curve analyses to examine age‐related changes in amygdala reactivity to faces in an accelerated longitudinal sample ranging from ages 4 to 22 years (Figure 2). Across specifications, we found relatively consistent evidence for negative age‐related change in anatomically‐defined (Harvard‐Oxford atlas and Freesurfer‐defined) amygdala reactivity to fear faces > baseline, such that the vast majority of analysis specifications (99.6%) estimated linear slopes at the group level that were negative in sign, and the majority (60.0%) of 95% posterior intervals about these slopes excluded 0 (Figure 2a; interactive version at https://pbloom.shinyapps.io/amygdala_mpfc_multiverse/). Thus, over half of models indicated that on average, increases in age were associated with decreases in amygdala reactivity to fear faces > baseline. Because the Wave 1 data in the current study included the 42 scans used by Gee et al. (2013) to identify age‐related changes in amygdala—mPFC circuitry for the fear > baseline contrast, results including these scans may have been more likely to find similar change (particularly for fear > baseline, see Figures S11–S13 and S25). Estimated age‐related change was on average weaker, though still largely negative (98.1% negative, 25.3% of posterior intervals excluding 0) when 42 previously analyzed scans (ages 4–17 years) were excluded to provide stricter independence from previously analyzed data (see Figure S11, Gee et al., 2013). Estimated average age‐related change for the fear > baseline contrast was somewhat stronger when using a right amygdala ROI compared to the left amygdala, and when using t‐stats extracted from scan‐level GLMs rather than beta estimates for group‐level models (see Figure S11).
FIGURE 2.
Multiverse analyses of age‐related change in amygdala reactivity. (a). Specification curve of age‐related change in fear > baseline amygdala reactivity. Points represent estimated linear age‐related change and lines are corresponding 95% posterior intervals (PIs). Models are ordered by age‐related change estimates, with the dotted line representing the median estimate across all specifications. Color indicates sign of beta estimates and whether respective posterior intervals include 0 (red = negative excluding 0; blue = negative including 0, green = positive including 0, black = median across all specifications). (b). Model specification information corresponding to each model in A. Variables on the y‐axis represent analysis choices, corresponding color‐coded marks indicate that a choice was made, and blank space indicates that the choice was not made in a given analysis. Within each category panel (amygdala ROI, Group‐Level Model, and Participant‐Level Model), decision points are ordered from top to bottom by the median model rank when the corresponding choice is made (i.e., choices at the top of each panel tend to have more negative age‐related change estimates). Black points with error bars represent the median and IQR ranks of specifications making the choice indicated on the corresponding line. (c). Example participant‐level data and model predictions for age‐related related change in amygdala reactivity for both the fear > baseline (green) and neutral‐baseline (orange) contrasts. Data are shown for a preregistered pipeline using a native space bilateral amygdala mask, 24 motion regressors, t‐statistics, high‐pass filtering, and participant‐level GLMs in FSL. Points represent participant‐level estimates, light lines connect estimates from participants with multiple study visits, and dark lines with shaded area represent model predictions and 95% posterior intervals. (d). Specification curves for a subset of models separately parametrizing within‐participant (right) vs. between‐participant (left) age‐related change for both the fear > baseline (green) and neutral > baseline (orange) contrasts, as well as the median across specifications (black). See https://pbloom.shinyapps.io/amygdala_mpfc_multiverse/ for interactive visualizations
Parallel multiverse analyses found similarly consistent age‐related decreases in neutral faces > baseline amygdala reactivity (see Figure 2c for an example pipeline and Figure S9 for specification curve), but no consistent evidence for age‐related change for the fear > neutral contrast (see Figure S10). However, there was consistent evidence for higher reactivity for fear faces > neutral on average as well as each emotion compared to baseline (Figures S6–S8), indicating that while the amygdala responses were robust and generally stronger for fear faces compared to neutral, such fear > neutral differences did not change with age. Across contrasts, varying the inclusion of block order or scanner covariates, inclusion of random intercepts, and use of robust regression models had little impact on age‐related change estimates (see Figures 2b and S6–S8).
While group‐level estimates of average age‐related change were relatively consistent across specifications, the estimated age terms in these models could be influenced by both within‐participant change and between‐participant differences (King et al., 2018; Madhyastha et al., 2018). A smaller separate specification curve indicated that when models were parametrized to differentiate within‐participant change and between‐participant differences, average within‐participant change was not consistent across specifications and could not be estimated with precision (Figure 2d). In contrast, estimates of between‐participant differences largely indicated negative age‐related differences in concurrence with our initial model parametrization. At the same time, within‐participant versus between‐participant terms were not reliably different from one another, indicating that models could not distinguish them despite higher precision for estimating between‐participant differences (see Figure S19). We did not find consistent evidence for quadratic age‐related changes in amygdala reactivity (see Figures S14–S17). Inverse age models (i.e., amygdala reactivity modeled as a function of 1/age) indicated results similar to those of linear and quadratic models with most specifications for the fear > baseline and neutral > baseline (though less consistent) contrasts indicating age‐related decreases (see Figure S18).
3.2. Age‐related differences in within‐scan amygdala reactivity change
To ask whether age‐related changes in amygdala reactivity could be due to developmental changes in patterns of amygdala reactivity across face trials (within a run), we examined whether within‐scan change in amygdala reactivity varied with age (see Table 1 Aim 1b). Analyses included 42 specifications (3 amygdala regions of interest [ROIs] × 2 global signal correction options × 7 group‐level models). Across both fear and neutral trials, linear slopes of amygdala reactivity were negative on average, indicating higher amygdala reactivity at the beginning of the run (Figure 3a and Figure S30). Across specifications, for both fear (100% of estimates had the same sign, 95.2% of posterior intervals excluding 0 in the same direction) and neutral trials (100% of estimates in the same direction, 38.1% of posterior intervals excluding 0), there was evidence that these within‐scan slopes were steeper (i.e., more negative) at younger ages, though evidence was relatively weaker for neutral trials (Figure 3d,e). Specifications with a global signal subtraction step also tended to find stronger age‐related change.
FIGURE 3.
Age‐related change in amygdala reactivity across trials. (a). An example model of estimated age‐related change in slopes of beta estimates across both fear (green) and neutral (orange) trials. Negative slopes represent higher amygdala activity in earlier trials relative to later trials. (b). Example models of estimated age‐related change in amygdala reactivity for the fear > baseline (left) and neutral > baseline (right) contrasts for both the first (red) and second (blue) halves of trials. In both a and b, points represent participant‐level estimates, light lines connect estimates from participants with multiple study visits, and dark lines with shaded area represent model predictions and 95% posterior intervals. (c). Example single‐trial model predictions of estimated amygdala reactivity for fear (left) and neutral (right) faces as a function of age and trial number. Age was modeled as a continuous variable, and average predictions for participants of age 6 (red), 12 (green), and 18 (blue) years are shown for visualization purposes. All estimates in a–c shown are from an example analysis pipeline using bilateral amygdala estimates and without global signal correction. (d). Specification curve for age‐related change in slopes across fear trials (i.e., many parallel analyses for the fear trials in subplot b). (e). Specification curve for age‐related change in slopes across neutral trials (i.e., neutral trials in plot b). GSS = global signal correction using post hoc mean centering. For both d and e, color indicates sign of beta estimates and whether respective posterior intervals include 0 (green = positive including 0, purple = positive excluding 0, and black = median across all specifications), and horizontal dotted lines represent median estimates across all analysis decisions. Variables on the y‐axis represent analysis choices, corresponding color‐coded marks indicate that a choice was made, and blank space indicates that the choice was not made in a given analysis
Similarly, when splitting trials into the first half (Trials 1–12) versus second half (Trials 13–24), there was consistent evidence (100% of estimates had the same sign, 69.2% with posterior interval excluding 0) for an interaction between age and trial half, such that average reactivity to fear faces > baseline in the first half of trials decreased as a function of age more so than did average reactivity during the second half of trials (see Figures 3b and Figure S32). This interaction was in the same direction for neutral trials across most specifications (88.5% of estimates), but was typically not as strong (3.8% of posterior intervals excluding 0). Single‐trial models indicated similar age‐related change in within‐scan amygdala dynamics (see Figure 3c and S33 and S34). Mean group‐level amygdala reactivity was higher for the first half of trials for fear faces > baseline across several specifications, though there were not consistent differences between trial halves for mean amygdala reactivity to neutral faces (Figure S31).
3.3. Age‐related change in task‐evoked amygdala—mPFC functional connectivity
We used multilevel regression modeling and specification curve analyses to examine age‐related change in task‐evoked amygdala—mPFC FC within the accelerated longitudinal cohort (see Table 1 Aims 2a,b). For the fear > baseline contrast, a specification curve with 288 total specifications (4 definitions of participant‐level gPPI estimates × 4 mPFC ROIs × 18 group‐level models) of amygdala—mPFC gPPI did not find consistent evidence of age‐related change: while 59.0% of models found point estimates in the positive direction, only 23% of posterior intervals excluded 0 (Figure 4c,d, interactive version at https://pbloom.shinyapps.io/amygdala_mpfc_multiverse/). Specification curve analyses found that the sign of the estimated age‐related change depended almost entirely on deconvolution, such that most specifications including a deconvolution step resulted in negative age‐related change estimates never distinguishable from 0 (78.5% of point estimates negative, 0% of posterior intervals excluding 0), and most specifications not including a deconvolution step resulted in positive age‐related change estimates (96.5% of point estimates positive, 47.9% of posterior intervals excluding 0). A visualization of the effects of the deconvolution step on amygdala FC with each of four mPFC ROIs is presented in Figure 4b. While mPFC ROI definition and other analysis decision points also influenced estimates of age‐related change in gPPI (Figure 4d), follow‐up regression models indicated that the effect of including the deconvolution step was several times larger for the fear > baseline contrast (see Figures S41–S43).
Through equivalent multiverse analyses we also found no evidence of consistent linear age‐related change in amygdala—mPFC gPPI for the neutral > baseline and fear > neutral contrasts (see Figures S39 and S40), or nonlinear change for any contrast (see Figures S45–S48). In addition, we did not see consistent evidence for group average amygdala–mPFC gPPI for any contrast, though such results often differed as a function of whether a deconvolution step was included (see Figure S38). Though we included gPPI analysis specifications excluding the 42 scans at timepoint 1 studied by Gee et al. (2013), exclusion of these scans had little impact on age‐related change results (see Figure 4d).
In addition to gPPI analyses, we used BSC analyses to examine age‐related changes in task‐evoked amygdala—mPFC connectivity (see Table 1 Aim 2b). As with gPPI, multiverse analyses of amygdala—mPFC BSC (168 total specifications; 3 amygdala ROI definitions × 4 mPFC ROI definitions × 2 global signal options × 7 group‐level models) for fear trials (vs. baseline) did not yield strong evidence of age‐related change across pipelines (84.5% of point estimates in the same direction, 24.4% of posterior intervals excluding 0; Figure 5a, interactive version at https://pbloom.shinyapps.io/amygdala_mpfc_multiverse). Unlike gPPI analyses, however, choice of mPFC ROI (as well as amygdala ROI, though this was not examined for gPPI) most impacted age‐related change in BSC estimates, rather than preprocessing or modeling analytical choices (Figures 5b and S55–S57). Accordingly, while global signal subtraction resulted in weaker amygdala—mPFC BSC on average (see Figure S52), inclusion of this step did not consistently affect age‐related change estimates (Figure 4c). We did not find consistent evidence for age‐related change in amygdala–mPFC BSC for neutral trials (vs. baseline), or for fear > neutral trials (Figures S53 and S54). We did not find consistent evidence for nonlinear age‐related change for any contrast (Figures S58–S61).
FIGURE 5.
Multiverse analyses of age‐related change in amygdala—mPFC connectivity using beta‐series correlation (BSC) methods. (a). Specification curve of age‐related change in amygdala—mPFC BSC for fear trials. Points represent estimated linear age‐related change and lines are corresponding 95% posterior intervals. Models are ordered by age‐related change estimates, and the dotted line represents the median estimate across all specifications. Color indicates sign of beta estimates and whether respective posterior intervals include 0 (blue = negative including 0, green = positive including 0, purple = positive excluding 0, and black = median across all specifications). (b). Model specification information corresponding to each model in a. Variables on the y‐axis represent analysis choices, corresponding color‐coded marks indicate that a choice was made, and blank space indicates that the choice was not made in a given analysis. Within each category (amygdala ROI, group‐level model, global signal subtraction, and mPFC ROI) respectively, decision points are ordered from top to bottom by the median model rank when the corresponding choice is made (i.e., choices at the top of each panel tend to have more negative age‐related change estimates). Black points with error bars represent the median and IQR ranks of specifications making the choice indicated on the corresponding line. GSS = global signal correction using post hoc mean centering. (c). Example model predictions for age‐related change in amygdala—mPFC BSC for fear trials for analysis pipelines with a global signal subtraction (GSS, post hoc mean centering) step (red), or without (blue) for each of the four mPFC regions (see Figure 4a) with the left and right amygdala. Pipelines shown have random slopes, no covariates for task block or scanner, and no quadratic age term. (d). Between‐scan rank‐order correlations between amygdala—mPFC connectivity measures. All gPPI measures are for the fear > baseline contrast, and BSC measures are for fear trials. See https://pbloom.shinyapps.io/amygdala_mpfc_multiverse/ for interactive visualizations
Additionally, we constructed a correlation matrix using rank‐order correlations of scan‐level BSC and gPPI estimates for the fear (vs. baseline) condition. Across scans, there was little evidence of correspondence between BSC and gPPI metrics for amygdala—mPFC connectivity (Figures 5d and S63–S66). Further, FC estimates tended to be positively correlated within a method type (BSC, gPPI) across mPFC ROIs, though less strongly for gPPI estimates with versus without a deconvolution step.
In addition to gPPI and BSC methods for FC, we also explored between‐scan associations between amygdala reactivity and mPFC reactivity (Figures S28 and S29). Multilevel models indicated that amygdala reactivity for fear faces > baseline was positively associated with mPFC reactivity for fear faces > baseline for all mPFC ROIs, though we did not find consistent evidence for age‐related changes in associations between amygdala and mPFC reactivity to fear faces > baseline (see Figure S29).
3.4. Amygdala—mPFC measures and separation anxiety
We conducted multiverse analyses of associations between several amygdala—mPFC measures (amygdala reactivity, amygdala—mPFC FC, within‐scan changes in amygdala reactivity) and separation anxiety behaviors (see Table 1 Aim 3). Separation anxiety behaviors on average decreased with age, as indicated by the RCADS‐P and SCARED‐P raw scores (Figure 6a–c). Neither specification curves for amygdala reactivity (18 total specifications, 56% of point estimates in the same direction as median estimate, 0% of posterior intervals excluding 0), amygdala—mPFC gPPI FC (90 total specifications, 72% of point estimates in the same direction as median estimate, 1% of posterior intervals excluding 0), amygdala—mPFC BSC FC (18 total specifications, 83% of point estimates in the same direction as median estimate, 0% of posterior intervals excluding 0), nor slope of amygdala responses across trials (12 total specifications, 75% of point estimates in the same direction as median estimate, 17% of posterior intervals excluding 0), found consistent evidence for associations between brain measures and separation anxiety. Similar specification curves found little consistent evidence for associations between brain measures and generalized anxiety, social anxiety, or total anxiety behaviors (see Figure S67). All specifications controlled for age (see Supporting Information methods p. 30).
To more specifically follow up on previous work reporting associations between separation anxiety behaviors and amygdala—mPFC gPPI for fear > baseline specifically (Gee et al., 2013), we plotted model predictions for such models from the above multiverse analysis for each of the four mPFC ROIs, across all three separation anxiety outcome measures, and both with and without a deconvolution step (Figure 6e). We did not find consistent evidence for associations with separation anxiety, and results showed high sensitivity to the deconvolution step, mPFC ROI, and outcome measure used.
3.5. Reliability
To examine test–retest reliability estimates of amygdala—mPFC measures across longitudinal visits, we computed Bayesian ICC estimates using a variance decomposition method (Lüdecke et al., 2021). Because such models can accommodate missing data, all observations (98 participants, 183 total scans) were used, including participants with only 1 visit. All amygdala reactivity (Figure 7a) and amygdala—mPFC FC (Figure 7c) measures, as well as slopes of amygdala reactivity estimates across trials (Figure 7b), demonstrated poor reliability (ICC <.4; Cicchetti & Sparrow, 1981; Elliott et al., 2020). Separation anxiety measures demonstrated somewhat higher, though still largely poor reliability (point estimates ~0.4, 95% CIs included values below 0.4; Figure 7d). Head motion in the scanner (mean FD) showed the highest reliability (ICC = 0.52, 95% CI [0.29, 0.68]).
FIGURE 7.
Longitudinal test–retest Bayesian ICC estimates. ICC values are shown for amygdala reactivity (a), slopes of amygdala reactivity betas across trials (b), amygdala—mPFC functional connectivity using both gPPI and BSC methods (c), and separation anxiety and in‐scanner head motion measurements (d). Shaded background colors depict whether ICC estimates are categorized as poor (<.4), fair (.4–.6), or good (.6–.75) reliability. No ICC estimates met the threshold for excellent reliability (>.75). Bayesian ICC estimates were calculated through a variance decomposition based on posterior predictive distributions. Negative values indicate higher posterior predictive variances not conditioned on random effect terms than conditioned on random effects terms
4. DISCUSSION
Measures that are both robust to researcher decisions and reliable across measurement instances are critical for studies of the human brain (Botvinik‐Nezer et al., 2020; Bowring et al., 2019; Elliott et al., 2020; Xu et al., 2022). The accelerated longitudinal design and multiverse analysis approach used in the current study allowed a rare opportunity to examine both reliability and robustness of amygdala—mPFC measures using a rapid event‐related face task from early childhood through young adulthood. Overall, estimates for age‐related change in amygdala reactivity were relatively robust to a variety of analytical decision points, while age‐related change estimates for amygdala—mPFC connectivity were more sensitive to researcher choices. gPPI analyses were particularly sensitive to whether a deconvolution step was applied. Yet, in concurrence with previous work (Elliott et al., 2020; Haller et al., 2022; Herting et al., 2017; Infantolino, Luking, Sauder, Curtin, & Hajcak, 2018; Kennedy et al., 2021; Nord, Gray, Charpentier, Robinson, & Roiser, 2017; Sauder et al., 2013), amygdala—mPFC measures displayed consistently poor test–retest reliability across many analytical specifications. While low reliability estimates in the present study may be due in part to the long (~18 months) test–retest interval (Elliott et al., 2020) and potential true developmental change (Herting et al., 2017), low reliability nevertheless imposes a major caveat toward interpretation of the current developmental findings.
The present findings are valuable from a methodological standpoint in evaluating the robustness of analytical tools used. A measurement can have high test–retest reliability yet low robustness (high sensitivity) to analytical decisions, or vice versa (Li et al., 2021). Because neither robustness nor reliability guarantee the other, current findings on the impacts of analytic choices will likely be informative in guiding future studies. Thus, we discuss each of the main analyses below, with particular emphasis on how findings are impacted by analytic choices.
4.1. Amygdala reactivity
While there were differences across model specifications, the majority of pipelines supported our hypothesis that amygdala reactivity to fearful faces decreases with age from early childhood through early adulthood (see Table 1 Aim 1a). Across specifications, we found relatively robust evidence for age‐related decreases in amygdala reactivity to both fearful and neutral faces (Figure 2a). Yet, findings also varied considerably across specifications. For example, only 60% of pipelines produced results that would be individually labeled as “significant” (under α = .05), indicating that multiple investigations of this dataset could likely lead to qualitatively different conclusions. While over half of analyses found evidence consistent with studies indicating greater amygdala reactivity to fear faces > baseline in younger children (Forbes et al., 2011; Gee et al., 2013; Guyer et al., 2008; Swartz et al., 2014), the other 40% of specifications would have been consistent with investigations that found little age‐related change (NB: there were also differences in samples, age ranges, task parameters, and behavioral demands across these studies; Kujawa et al., 2016; Wu et al., 2016; Zhang et al., 2019). We also found that different specifications resulted in somewhat different nonlinear trajectories (see Figures S14–S18). Not only did inverse age and quadratic age models find different trajectories (as would be expected), but quadratic trajectories themselves also displayed considerable analytic variability, with some specifications finding “convex” and others finding “concave” fits (see Figure S17). Although estimating nonlinear age‐related change was not a primary goal of the present study, future work should use model comparisons for better differentiating nonlinear patterns (Curran, Obeidat, & Losardo, 2010; Luna et al., 2021).
Models also found evidence for between‐participant differences, but could neither identify within‐participant change (Figure 2d) nor differentiate between‐participant from within‐participant estimates. As such, interpretation of the age‐related change reported here is subject to many of the same limitations that apply to cross‐sectional designs (Glenn, 2003), where age‐related changes may not necessarily indicate “true” developmental growth. High uncertainty in estimating average within‐participant change could be driven by several factors, including true heterogeneity in individual trajectories, low measurement reliability, scanner differences across longitudinal timepoints, or unmodeled variables impacting amygdala reactivity. Additionally, the within‐participants terms represent a smaller age range (a maximum of 4 years for any given participant), relative to the broader age range assessed by the between‐participants terms (18 years), which may have placed additional limits on identifying reliable within‐participant change.
Age‐related change in amygdala responses to fear faces over baseline seemed largely the result of earlier trials in the task (see Figures S32–S34). While differences in task design and contrast across studies have been highlighted as potential sources of discrepant findings on the development of amygdala function (Killgore & Yurgelun‐Todd, 2007; Lieberman et al., 2007; Swartz et al., 2014), this result indicates that attention to trial structure and task duration may also be necessary in comparing studies. Because the paradigm used in the current study involved a task requiring participants to press for one face (“neutral”) and not press for “fear” faces, findings specific to fear faces over baseline under the current paradigm may also be driven by behavioral task demands.
4.2. Amygdala—mPFC functional connectivity
We did not find evidence for our second hypothesis, as neither gPPI nor BSC analyses indicated consistent evidence of age‐related change in amygdala—mPFC FC (see Table 1 Aims 2a,b, Figures 4 and 5). Thus, the age‐related changes in task‐evoked amygdala—mPFC connectivity identified in prior work (Gee et al., 2013; Kujawa et al., 2016; Wu et al., 2016) were not identified here, consistent with (Zhang et al., 2019). Crucially, however, our specification curves did not find strong evidence against such age‐related change, as we did not observe precise and consistent “null” estimates across specifications. Additionally, quadratic and inverse age models did not find consistent evidence for nonlinear age‐related change (see Figures S45–S48 and S58–S61).
gPPI results were sensitive to whether a deconvolution step had been included in the preprocessing pipeline, such that we mostly found age‐related decreases in amygdala—mPFC connectivity with a deconvolution step included, and age‐related increases without it (although most pipelines would not have been “statistically significant” on their own, see Figure 4b). While deconvolution has been argued to be a necessary step for event‐related PPI analyses (Gitelman et al., 2003), recent work has shifted guidelines on its use, and it may not be recommended for block designs (Di et al., 2020; Di & Biswal, 2017). Because the true “neuronal” signal underlying the BOLD timeseries within a given ROI cannot be directly measured, deconvolution algorithms are difficult to validate. Further, deconvolution may cause PPI results to be driven by baseline connectivity if task regressors are not centered (Di et al., 2017), although such centering did not have a major influence on age‐related change results in the present analyses (see Figure S44). Within the current study, small tweaks to AFNI's 3dTfitter algorithm for deconvolution resulted in vastly different regressors (see Figure S36), suggesting the potential for high analytic variability even between gPPI analyses ostensibly using deconvolution. While the present study does not provide evidence that can inform whether or not deconvolution is recommended, further work is needed to optimize and validate applications of gPPI methods and selection of appropriate task designs. gPPI may be better equipped for block‐designs and particularly ill‐posed for rapid event‐related tasks due to both difficulties in resolving which times within the BOLD timeseries reflect FC evoked by rapid (350 ms) events and low statistical power in estimating such task‐evoked connectivity (see Figures S35–S37; O'Reilly et al., 2012). Concurrent with previous work, BSC analyses may have higher statistical power for identifying task‐related connectivity signal than gPPI within event‐related designs more generally (Cisler et al., 2014).
Age‐related change estimates for amygdala—mPFC BSC showed somewhat higher robustness to analytic decisions compared to gPPI. For BSC analyses, choice of mPFC ROI contributed most to variability in age‐related change estimates (see Figures 5b and S55–S57). While a global signal correction (post hoc distribution centering) greatly decreased average amygdala—mPFC BSC connectivity (see Figures 5d and S52) for both fear and neutral faces, this analytical step did not impact age‐related change estimates as heavily (Figures S55–S57). The fact that global signal correction so dramatically decreased average estimated amygdala—mPFC BSC may indicate that, like with resting‐state fMRI analyses, positive FC values are due in part to motion and physiology‐related confounds (Gratton et al., 2020; Power et al., 2019). Supporting this, BSC estimates were correlated with mean FD across scans for the fear > baseline and neutral > baseline contrasts only when a global signal correction was not applied (see Figure S62). In addition, while test–retest reliability for all BSC measures was poor, BSC estimates from pipelines including a global signal correction step mostly demonstrated somewhat higher ICC (Figure 6). While these results are consistent with prior work indicating that correcting for the global signal can mitigate artifacts (Ciric et al., 2017; Satterthwaite et al., 2012), other work indicates that such corrections also remove meaningful biological signals (Belloy et al., 2018; Glasser et al., 2018; Yousefi, Shin, Schumacher, & Keilholz, 2018).
4.3. Amygdala–mPFC circuitry and separation anxiety
We did not find associations between any task‐related amygdala—mPFC measures (reactivity or FC) and separation anxiety behaviors (see Table 1 Aim 3; Figure 6). This finding stands in contrast to associations between amygdala—mPFC connectivity and anxiety identified in previous developmental work (Gee et al., 2013; Jalbrzikowski et al., 2017; Kujawa et al., 2016; Qin et al., 2014). However, given that analyses of brain‐behavior associations may require imaging cohorts much larger than the current sample (especially considering the low reliability of the measures used; Grady, Rieck, Nichol, Rodrigue, & Kennedy, 2020; Marek et al., 2020), the absence of relationships here may not be strong evidence against the existence of potential associations between amygdala—mPFC circuitry and developing anxiety‐related behaviors.
4.4. Advantages and pitfalls of the multiverse approach
Our findings contribute to a body of work demonstrating that preprocessing and modeling choices can meaningfully influence results (Botvinik‐Nezer et al., 2020). Indeed, most studies involving many analytical decision points could benefit from multiverse analyses. Such specification curves can help to examine the stability of findings in both exploratory and confirmatory research (Flournoy et al., 2020). Particularly when methodological “gold standards” have not been determined, specification curves may be informative for examining the impacts of potential analysis decisions (Bridgeford et al., 2020; Dafflon et al., 2020). Further, wider use of specification curves might help to resolve discrepancies between study findings stemming from different analysis pipelines.
While specification curve analyses may benefit much future research, we also note that multiverses are only as comprehensive as the included specifications (Steegen et al., 2016), and such analyses alone do not solve problems related to unmodeled confounds, design flaws, inadequate statistical power, circular analyses, or nonrepresentative sampling. Further, unless all specifications are decided a priori, analyses are vulnerable to problems of analytic flexibility (Gelman & Loken, 2014), and inclusion of less justified specifications can bias results (Del Giudice & Gangestad, 2021). Because specification curves can include hundreds or thousands of individual analyses, rigorous evaluation of individual models can be difficult. To this end, we created interactive visualizations for visual exploration of individual analysis specifications.
Computational resources are a relevant concern when conducting multiverse analyses as well. In the current study, preprocessing (registration in particular) was the most computationally intensive step, taking an estimated 4 hours of compute time per scan per pipeline using 4 cores on a Linux‐based institutional research computing cluster. However, specification curve analyses themselves were relatively less intensive, with all group‐level models of amygdala reactivity completing in a total of 48 hr using 4 cores on a laboratory Linux‐based server. Specification curves using maximum likelihood models (lme4 in R; Bates, Maechler, & Bolker, 2011) were even more efficient, with thousands of models running within minutes using a 2019 MacBook Pro (2.8 GHz Intel Core i7).
4.5. Limitations
The present study is subject to several limitations that may be addressed in future investigations. Perhaps most crucially, our conclusions (along with those of many developmental fMRI studies) are limited by the poor test–retest reliability of the fMRI data. Because amygdala—mPFC measures showed low reliability across study visits, the statistical power of our analyses of age‐related changes is likely low (Elliott et al., 2020; Zuo, Xu, & Milham, 2019). Low‐powered studies can yield increased rates of both false positive and false negative results (as well as errors of the sign and magnitude of estimates; Button et al., 2013; Gelman & Carlin, 2014); therefore, we caution against interpretation of our developmental findings (and brain‐behavior associations) beyond the cohort studied in the present investigation. In particular, the low statistical power of our rapid event‐related task design may be a major contributor to the low test–retest reliability and variance in outcomes across analysis specifications. That being said, achieving high‐powered studies presents a challenge for studying populations that cannot tolerate lengthy fMRI sessions. Both findings that were more robust to analytical decisions (amygdala reactivity) and findings that were less so (amygdala—mPFC connectivity, associations with separation anxiety) may be most valuable in meta‐analytic contexts where greater aggregate statistical power can be achieved. In particular, future work on amygdala—mPFC development will benefit from optimization of measures both on robustness to analytic variability (Li et al., 2021) and reliability (Kragel, Han, Kraynak, Gianaros, & Wager, 2021).
Present findings are also limited by the number of participants studied (Bossier et al., 2020; Marek et al., 2020), the number of longitudinal study sessions per participant (King et al., 2018), and the duration of the task (Nee, 2019). Work with larger sample sizes, more study sessions per participant, and more task data collected per session will be necessary for charting functional amygdala—mPFC development and examining heterogeneity across individuals (although collecting task‐based fMRI will continue to be challenging for studies including younger children). The generalizability of the current findings may also be limited by the fact that this cohort was skewed toward high incomes and not racially or ethnically representative of the Los Angeles or United States population.
Findings are also somewhat limited by the fact that the present study is not wholly confirmatory, despite preregistration. Because our multiverse analysis approaches expanded significantly beyond the methods we preregistered, most of the present analyses, while hypothesis‐driven, must be considered exploratory (Flournoy et al., 2020). The fact that some specifications used data included in previous similar analyses of the same cohort (Gee et al., 2013) also limits the confirmatory power of the present study (Kriegeskorte, Simmons, Bellgowan, & Baker, 2009). This may be especially true because longitudinal models could not identify within‐person change as distinct from between‐participant differences (see Figure 2d), indicating that our age‐related change estimates may be influenced by cross‐sectional information similar to that investigated by Gee et al. (2013).
Though the current study aimed to estimate longitudinal age‐related changes in amygdala—mPFC functional circuitry evoked by fear and neutral faces, the current findings may not be specific to these stimuli (Hariri et al., 2002). Because our task did not include non‐face foils or probe specific emotion‐related processes, results may be driven by attention, learning, or visual processing, rather than affective or face processing. In particular, because participants were instructed to press a button for neutral faces and withhold a button press for fear faces, observed amygdala—mPFC responses may in part reflect response inhibition (for fear faces; Menon, Adleman, White, Glover, & Reiss, 2001) and target detection processes (for neutral faces; Jonkman, Lansbergen, & Stauder, 2003). Findings for the fear > baseline and neutral > baseline contrasts also may not be valence‐specific in the absence of a different emotional face as part of the contrast. Further, because all faces were adult White women, the current results may not generalize to faces more broadly (Richeson, Todd, Trawalter, & Baird, 2008; Telzer, Humphreys, Shapiro, & Tottenham, 2012). Additionally, because face stimuli were the same across study visits, exposure effects across sessions may confound longitudinal findings (although exposure effects may be possible any time a task is repeated, even if stimuli are unique), particularly age‐related decreases in amygdala responses (Telzer et al., 2018). While within‐session amygdala habituation effects have been shown across several paradigms (Geissberger et al., 2020; Hare et al., 2008; Hein et al., 2018), between‐session habituation effects are unlikely beyond 2–3 weeks (Geissberger et al., 2020; Johnstone et al., 2005; Plichta et al., 2014; Spohrs et al., 2018).
Finally, our findings on age‐related change in amygdala and mPFC function may be biased or confounded by age‐related differences in head motion (Ciric et al., 2017), anatomical image quality and alignment (Gilmore, Buser, & Hanson, 2020; Rorden, Bonilha, Fridriksson, Bender, & Karnath, 2012), signal dropout, and physiological artifacts (Boubela et al., 2015; Fair et al., 2020; Gratton et al., 2020). While our multiverse analyses included preprocessing and group‐level modeling specifications designed to minimize some of such potential issues, future work is still needed to optimize discrimination of developmental changes of interest from such potential confounds.
Despite these limitations, the present study concords with prior investigations in demonstrating the value of multiverse approaches to quantify sensitivity to researcher decisions. The results highlight key analytic considerations for future studies of age‐related changes in amygdala—mPFC function, as well as for studies of human brain development more broadly.
CONFLICT OF INTEREST
The authors declare no potential conflict of interest.
AUTHOR CONTRIBUTIONS
Paul Alexander Bloom: Conceptualization, methodology, formal analysis, writing ‐ original draft, visualization. Michelle VanTieghem: Methodology, writing – review and editing. Laurel Gabard‐Durnam: Investigation, methodology, writing – review and editing. Dylan G. Gee: Investigation, methodology, writing – review and editing. Jessica Flannery: Investigation, writing – review and editing. Christina Caldera: Investigation, writing – review and editing. Bonnie Goff: Investigation, writing – review and editing. Eva H. Telzer: Investigation, writing – review and editing. Kathryn L. Humphreys: Investigation, writing – review and editing. Dominic S. Fareri: Investigation, writing – review and editing. Mor Shapiro: Investigation, writing – review and editing. Sameah Algharazi: Validation, writing – review and editing. Niall Bolger: Methodology, formal analysis, writing – review and editing. Mariam Aly: Methodology, formal analysis, supervision, writing – review and editing. Nim Tottenham: Conceptualization, methodology, investigation, resources, data curation, writing ‐ original draft, supervision, funding acquisition.
Supporting information
Appendix S1: Supporting Information
Bloom, P. A. , VanTieghem, M. , Gabard‐Durnam, L. , Gee, D. G. , Flannery, J. , Caldera, C. , Goff, B. , Telzer, E. H. , Humphreys, K. L. , Fareri, D. S. , Shapiro, M. , Algharazi, S. , Bolger, N. , Aly, M. , & Tottenham, N. (2022). Age‐related change in task‐evoked amygdala—prefrontal circuitry: A multiverse approach with an accelerated longitudinal cohort aged 4–22 years. Human Brain Mapping, 43(10), 3221–3244. 10.1002/hbm.25847
Funding information Dana Foundation; National Institute of Mental Health, Grant/Award Number: 2R01MH091864; National Science Foundation, Grant/Award Number: DGE 1644869
Contributor Information
Paul Alexander Bloom, Email: paul.bloom@columbia.edu.
Nim Tottenham, Email: nlt7@columbia.edu.
DATA AVAILABILITY STATEMENT
Because participants and their parents did not consent to data sharing at the time of participation, we cannot make data from the current study publicly available. Code for preprocessing, analysis, and data visualizations for this manuscript is available at https://github.com/pab2163/amygdala_mpfc_multiverse. While unfortunately this code cannot be run as written without data, we have attempted to document analysis steps clearly. In addition, we provide publicly available simulated data structured similarly to the study data on amygdala reactivity, such that interested readers can view multiverse analysis walkthroughs (https://pab2163.github.io/amygdala_mpfc_multiverse) and experiment with analysis code. Additional materials, including MNI space masks and preregistration documentation, are available at https://osf.io/hvdmx/.
REFERENCES
- Abdulrahman, H. , & Henson, R. N. (2016). Effect of trial‐to‐trial variability on optimal event‐related fMRI design: Implications for beta‐series correlation and multi‐voxel pattern analysis. NeuroImage, 125, 756–766. 10.1016/j.neuroimage.2015.11.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Achenbach, T. M. (1991). Integrative guide for the 1991 CBCL/4‐18, Ysr, and Trf profiles (1st US‐1st Printing edition). University of Vermont/Department of Psychiatry.
- Achterberg, M. , & van der Meulen, M. (2019). Genetic and environmental influences on MRI scan quantity and quality. Developmental Cognitive Neuroscience, 38, 100667. 10.1016/j.dcn.2019.100667 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Adolphs, R. (2008). Fear, faces, and the human amygdala. Current Opinion in Neurobiology, 18(2), 166–172. 10.1016/j.conb.2008.06.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bates, D. , Maechler, M. , & Bolker, B . (2011). lme4: Linear mixed‐effects models using S4 classes (0.999375‐39) [Computer software]. http://www.idg.pl/mirrors/CRAN/web/packages/lme4/
- Beeley, C. (2013). Web application development with R using shiny. Packt Publishing Ltd.
- Belloy, M. E. , Naeyaert, M. , Abbas, A. , Shah, D. , Vanreusel, V. , van Audekerke, J. , … Verhoye, M. (2018). Dynamic resting state fMRI analysis in mice reveals a set of quasi‐periodic patterns and illustrates their relationship with the global signal. NeuroImage, 180, (Pt B), 463–484. 10.1016/j.neuroimage.2018.01.075 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beretta, L. , & Santaniello, A. (2016). Nearest neighbor imputation algorithms: A critical evaluation. BMC Medical Informatics and Decision Making, 16(Suppl 3), 74. 10.1186/s12911-016-0318-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Birmaher, B. , Brent, D. A. , Chiappetta, L. , Bridge, J. , Monga, S. , & Baugher, M. (1999). Psychometric properties of the screen for child anxiety related emotional disorders (SCARED): A replication study. Journal of the American Academy of Child & Adolescent Psychiatry, 38(10), 1230–1236. 10.1097/00004583-199910000-00011 [DOI] [PubMed] [Google Scholar]
- Bossier, H. , Roels, S. P. , Seurinck, R. , Banaschewski, T. , Barker, G. J. , Bokde, A. L. W. , … Moerkerke, B. (2020). The empirical replicability of task‐based fMRI as a function of sample size. NeuroImage, 212, 116601. 10.1016/j.neuroimage.2020.116601 [DOI] [PubMed] [Google Scholar]
- Botvinik‐Nezer, R. , Holzmeister, F. , Camerer, C. F. , Dreber, A. , Huber, J. , Johannesson, M. , … Schonberg, T. (2020). Variability in the analysis of a single neuroimaging dataset by many teams. Nature, 582(7810), 84–88. 10.1038/s41586-020-2314-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boubela, R. N. , Kalcher, K. , Huf, W. , Seidel, E.‐M. , Derntl, B. , Pezawas, L. , … Moser, E. (2015). FMRI measurements of amygdala activation are confounded by stimulus correlated signal fluctuation in nearby veins draining distant brain regions. Scientific Reports, 5(1), 10499. 10.1038/srep10499 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bowring, A. , Maumet, C. , & Nichols, T. E. (2019). Exploring the impact of analysis software on task fMRI results. Human Brain Mapping, 40(11), 3362–3384. 10.1002/hbm.24603 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bridgeford, E. W. , Wang, S. , Yang, Z. , Wang, Z. , Xu, T. , Craddock, C. , …, Vogelstein, J. T . (2020). Big data reproducibility: Applications in brain imaging. bioRxiv, 802629. 10.1101/802629 [DOI]
- Bryce, N. V. , Flournoy, J. C. , Guassi Moreira, J. F. , Rosen, M. L. , Sambook, K. A. , Mair, P. , & McLaughlin, K. A. (2021). Brain parcellation selection: An overlooked decision point with meaningful effects on individual differences in resting‐state functional connectivity. NeuroImage, 243, 118487. 10.1016/j.neuroimage.2021.118487 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bürkner, P.‐C. (2019). brms: Bayesian regression models using “Stan” (2.10.0) [Computer software]. https://CRAN.R‐project.org/package=brms
- Button, K. S. , Ioannidis, J. P. A. , Mokrysz, C. , Nosek, B. A. , Flint, J. , Robinson, E. S. J. , & Munafò, M. R. (2013). Power failure: Why small sample size undermines the reliability of neuroscience. Nature Reviews Neuroscience, 14(5), 365–376. 10.1038/nrn3475 [DOI] [PubMed] [Google Scholar]
- Calvo, M. G. , & Lundqvist, D. (2008). Facial expressions of emotion (KDEF): Identification under different display‐duration conditions. Behavior Research Methods, 40(1), 109–115. 10.3758/BRM.40.1.109 [DOI] [PubMed] [Google Scholar]
- Carpenter, K. L. H. , Angold, A. , Chen, N.‐K. , Copeland, W. E. , Gaur, P. , Pelphrey, K. , … Egger, H. L. (2015). Preschool anxiety disorders predict different patterns of amygdala‐prefrontal connectivity at school‐age. PLoS One, 10(1), e0116854. 10.1371/journal.pone.0116854 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chorpita, B. F. , Yim, L. , Moffitt, C. , Umemoto, L. A. , & Francis, S. E. (2000). Assessment of symptoms of DSM‐IV anxiety and depression in children: A revised child anxiety and depression scale. Behaviour Research and Therapy, 38(8), 835–855. 10.1016/S0005-7967(99)00130-8 [DOI] [PubMed] [Google Scholar]
- Cicchetti, D. V. , & Sparrow, S. A. (1981). Developing criteria for establishing interrater reliability of specific items: Applications to assessment of adaptive behavior. American Journal of Mental Deficiency, 86(2), 127–137. [PubMed] [Google Scholar]
- Ciric, R. , Wolf, D. H. , Power, J. D. , Roalf, D. R. , Baum, G. L. , Ruparel, K. , … Satterthwaite, T. D. (2017). Benchmarking of participant‐level confound regression strategies for the control of motion artifact in studies of functional connectivity. NeuroImage, 154, 174–187. 10.1016/j.neuroimage.2017.03.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cisler, J. M. , Bush, K. , & Steele, J. S. (2014). A comparison of statistical methods for detecting context‐modulated functional connectivity in fMRI. NeuroImage, 84, 1042–1052. 10.1016/j.neuroimage.2013.09.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cosme, D. , & Lopez, R. (2020). Neural indicators of food cue reactivity, regulation, and valuation and their associations with body composition and daily eating behavior. PsyArXiv . 10.31234/osf.io/23mu5 [DOI] [PMC free article] [PubMed]
- Cox, R. W. (1996). AFNI: Software for analysis and visualization of functional magnetic resonance Neuroimages. Computers and Biomedical Research, 29(3), 162–173. [DOI] [PubMed] [Google Scholar]
- Craddock, C. , Sikka, S. , Cheung, B. , Khanuja, R. , Ghosh, S. S. , Yan, C.‐G. , …, Milham, M. (2013). Towards automated analysis of connectomes: The configurable pipeline for the analysis of connectomes (C‐PAC). Neuroinformatics. Stockholm, Sweden. 10.3389/conf.fninf.2013.09.00042 [DOI]
- Crone, E. A. , & Elzinga, B. M. (2015). Changing brains: How longitudinal functional magnetic resonance imaging studies can inform us about cognitive and social‐affective growth trajectories. WIREs Cognitive Science, 6(1), 53–63. 10.1002/wcs.1327 [DOI] [PubMed] [Google Scholar]
- Curran, P. J. , Obeidat, K. , & Losardo, D. (2010). Twelve frequently asked questions about growth curve modeling. Journal of Cognition and Development, 11(2), 121–136. 10.1080/15248371003699969 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dafflon, J. , Costa, P. F. D. , Váša, F. , Monti, R. P. , Bzdok, D. , Hellyer, P. J. , …, Leech, R. (2020). Neuroimaging: Into the multiverse. bioRxiv, 359778. 10.1101/2020.10.29.359778 [DOI]
- Decety, J. , Michalska, K. J. , & Kinzler, K. D. (2012). The contribution of emotion and cognition to moral sensitivity: A neurodevelopmental study. Cerebral Cortex, 22(1), 209–220. 10.1093/cercor/bhr111 [DOI] [PubMed] [Google Scholar]
- Del Giudice, M. , & Gangestad, S. W. (2021). A traveler's guide to the multiverse: Promises, pitfalls, and a framework for the evaluation of analytic decisions. Advances in Methods and Practices in Psychological Science, 4(1), 2515245920954925. 10.1177/2515245920954925 [DOI] [Google Scholar]
- Di, X. , & Biswal, B. B. (2017). Psychophysiological interactions in a visual checkerboard task: Reproducibility, reliability, and the effects of deconvolution. Frontiers in Neuroscience, 11, 573. 10.3389/fnins.2017.00573 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Di, X. , Reynolds, R. C. , & Biswal, B. B. (2017). Imperfect (de)convolution may introduce spurious psychophysiological interactions and how to avoid it. Human Brain Mapping, 38(4), 1723–1740. 10.1002/hbm.23413 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Di, X. , Zhang, Z. , & Biswal, B. B. (2020). Understanding psychophysiological interaction and its relations to beta series correlation. bioRxiv, 322073. 10.1101/322073 [DOI] [PMC free article] [PubMed]
- Elliott, M. L. , Knodt, A. R. , Ireland, D. , Morris, M. L. , Poulton, R. , Ramrakha, S. , … Hariri, A. R. (2020). What is the test‐retest reliability of common task‐functional MRI measures? New empirical evidence and a meta‐analysis. Psychological Science, 31(7), 792–806. 10.1177/0956797620916786 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fair, D. A. , Miranda‐Dominguez, O. , Snyder, A. Z. , Perrone, A. , Earl, E. A. , Van, A. N. , … Dosenbach, N. U. F. (2020). Correction of respiratory artifacts in MRI head motion estimates. NeuroImage, 208, 116400. 10.1016/j.neuroimage.2019.116400 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fischl, B. (2012). FreeSurfer. NeuroImage, 62(2), 774–781. 10.1016/j.neuroimage.2012.01.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flournoy, J. C. , Vijayakumar, N. , Cheng, T. W. , Cosme, D. , Flannery, J. E. , & Pfeifer, J. H. (2020). Improving practices and inferences in developmental cognitive neuroscience. Developmental Cognitive Neuroscience, 45, 100807. 10.1016/j.dcn.2020.100807 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Forbes, E. E. , Phillips, M. L. , Silk, J. S. , Ryan, N. D. , & Dahl, R. E. (2011). Neural systems of threat processing in adolescents: Role of pubertal maturation and relation to measures of negative affect. Developmental Neuropsychology, 36(4), 429–452. 10.1080/87565641.2010.550178 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fox, M. D. , Zhang, D. , Snyder, A. Z. , & Raichle, M. E. (2009). The global signal and observed anticorrelated resting state brain networks. Journal of Neurophysiology, 101(6), 3270–3283. 10.1152/jn.90777.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gee, D. G. , Humphreys, K. L. , Flannery, J. , Goff, B. , Telzer, E. H. , Shapiro, M. , … Tottenham, N. (2013). A developmental shift from positive to negative connectivity in human amygdala–prefrontal circuitry. Journal of Neuroscience, 33(10), 4584–4593. 10.1523/JNEUROSCI.3446-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Geissberger, N. , Tik, M. , Sladky, R. , Woletz, M. , Schuler, A.‐L. , Willinger, D. , & Windischberger, C. (2020). Reproducibility of amygdala activation in facial emotion processing at 7T. NeuroImage, 211, 116585. 10.1016/j.neuroimage.2020.116585 [DOI] [PubMed] [Google Scholar]
- Gelman, A. , & Carlin, J. (2014). Beyond power calculations: Assessing type S (sign) and type M (magnitude) errors. Perspectives on Psychological Science, 9(6), 641–651. 10.1177/1745691614551642 [DOI] [PubMed] [Google Scholar]
- Gelman, A. , Lee, D. , & Guo, J. (2015). Stan: A probabilistic programming language for Bayesian inference and optimization. Journal of Educational and Behavioral Statistics, 40(5), 530–543. 10.3102/1076998615606113 [DOI] [Google Scholar]
- Gelman, A. , & Loken, E. (2014). The statistical crisis in science: Data‐dependent analysis—A ‘garden of forking paths’—Explains why many statistically significant comparisons don't hold up. American Scientist, 102(6), 460–466. [Google Scholar]
- Gilmore, A. , Buser, N. , & Hanson, J. L. (2020). Variations in structural MRI quality significantly impact commonly‐used measures of brain anatomy. bioRxiv, 581876. 10.1101/581876 [DOI] [PMC free article] [PubMed]
- Gitelman, D. R. , Penny, W. D. , Ashburner, J. , & Friston, K. J. (2003). Modeling regional and psychophysiologic interactions in fMRI: The importance of hemodynamic deconvolution. NeuroImage, 19(1), 200–207. 10.1016/S1053-8119(03)00058-2 [DOI] [PubMed] [Google Scholar]
- Glasser, M. F. , Coalson, T. S. , Bijsterbosch, J. D. , Harrison, S. J. , Harms, M. P. , Anticevic, A. , … Smith, S. M. (2018). Using temporal ICA to selectively remove global noise while preserving global signal in functional MRI data. NeuroImage, 181, 692–717. 10.1016/j.neuroimage.2018.04.076 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Glenn, N. D. (2003). Distinguishing age, period, and cohort effects. In Mortimer J. T. & Shanahan M. J. (Eds.), Handbook of the life course (pp. 465–476). Boston, MA: Springer. 10.1007/978-0-306-48247-2_21 [DOI] [Google Scholar]
- Grady, C. L. , Rieck, J. R. , Nichol, D. , Rodrigue, K. M. , & Kennedy, K. M. (2020). Influence of sample size and analytic approach on stability and interpretation of brain‐behavior correlations in task‐related fMRI data. Human Brain Mapping, 42(1), 204–219. 10.1002/hbm.25217 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gratton, C. , Dworetsky, A. , Coalson, R. S. , Adeyemo, B. , Laumann, T. O. , Wig, G. S. , …, Campbell, M. C. (2020). Removal of high frequency contamination from motion estimates in single‐band fMRI saves data without biasing functional connectivity. bioRxiv, 837161. 10.1101/837161 [DOI] [PMC free article] [PubMed]
- Guyer, A. E. , Monk, C. S. , McClure‐Tone, E. B. , Nelson, E. E. , Roberson‐Nay, R. , Adler, A. D. , … Ernst, M. (2008). A developmental examination of amygdala response to facial expressions. Journal of Cognitive Neuroscience, 20(9), 1565–1582. 10.1162/jocn.2008.20114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haller, S. P. , Chen, G. , Kitt, E. R. , Smith, A. R. , Stoddard, J. , Abend, R. , … Pagliaccio, D. (2022). Reliability of task‐evoked neural activation during face‐emotion paradigms: Effects of scanner and psychological processes. Human Brain Mapping. Advance online publication. 10.1002/hbm.25723 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hare, T. A. , Tottenham, N. , Galvan, A. , Voss, H. U. , Glover, G. H. , & Casey, B. J. (2008). Biological substrates of emotional reactivity and regulation in adolescence during an emotional Go‐NoGo task. Biological Psychiatry, 63(10), 927–934. 10.1016/j.biopsych.2008.03.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hariri, A. R. , Tessitore, A. , Mattay, V. S. , Fera, F. , & Weinberger, D. R. (2002). The amygdala response to emotional stimuli: A comparison of faces and scenes. NeuroImage, 17(1), 317–323. 10.1006/nimg.2002.1179 [DOI] [PubMed] [Google Scholar]
- Heeringa, S. G. , Wagner, J. , Torres, M. , Duan, N. , Adams, T. , & Berglund, P. (2004). Sample designs and sampling methods for the collaborative psychiatric epidemiology studies (CPES). International Journal of Methods in Psychiatric Research, 13(4), 221–240. 10.1002/mpr.179 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hein, T. C. , Mattson, W. I. , Dotterer, H. L. , Mitchell, C. , Lopez‐Duran, N. , Thomason, M. E. , … Monk, C. S. (2018). Amygdala habituation and uncinate fasciculus connectivity in adolescence: A multi‐modal approach. NeuroImage, 183, 617–626. 10.1016/j.neuroimage.2018.08.058 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Herting, M. M. , Gautam, P. , Chen, Z. , Mezher, A. , & Vetter, N. C. (2017). Test‐retest reliability of longitudinal task‐based fMRI: Implications for developmental studies. Developmental Cognitive Neuroscience, 33, 17–26. 10.1016/j.dcn.2017.07.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Infantolino, Z. P. , Luking, K. R. , Sauder, C. L. , Curtin, J. J. , & Hajcak, G. (2018). Robust is not necessarily reliable: From within‐subjects fMRI contrasts to between‐subjects comparisons. NeuroImage, 173, 146–152. 10.1016/j.neuroimage.2018.02.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jalbrzikowski, M. , Larsen, B. , Hallquist, M. N. , Foran, W. , Calabro, F. , & Luna, B. (2017). Development of White matter microstructure and intrinsic functional connectivity between the amygdala and ventromedial prefrontal cortex: Associations with anxiety and depression. Biological Psychiatry, 82(7), 511–521. 10.1016/j.biopsych.2017.01.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jenkinson, M. , Bannister, P. , Brady, M. , & Smith, S. (2002). Improved optimization for the robust and accurate linear registration and motion correction of brain images. NeuroImage, 17(2), 825–841. 10.1006/nimg.2002.1132 [DOI] [PubMed] [Google Scholar]
- Jenkinson, M. , Beckmann, C. F. , Behrens, T. E. J. , Woolrich, M. W. , & Smith, S. M. (2012). FSL. NeuroImage, 62(2), 782–790. 10.1016/j.neuroimage.2011.09.015 [DOI] [PubMed] [Google Scholar]
- Jo, H. J. , Lee, J.‐M. , Kim, J.‐H. , Shin, Y.‐W. , Kim, I.‐Y. , Kwon, J. S. , & Kim, S. I. (2007). Spatial accuracy of fMRI activation influenced by volume‐ and surface‐based spatial smoothing techniques. NeuroImage, 34(2), 550–564. 10.1016/j.neuroimage.2006.09.047 [DOI] [PubMed] [Google Scholar]
- Johnstone, T. , Somerville, L. H. , Alexander, A. L. , Oakes, T. R. , Davidson, R. J. , Kalin, N. H. , & Whalen, P. J. (2005). Stability of amygdala BOLD response to fearful faces over multiple scan sessions. NeuroImage, 25(4), 1112–1123. 10.1016/j.neuroimage.2004.12.016 [DOI] [PubMed] [Google Scholar]
- Jonkman, L. M. , Lansbergen, M. , & Stauder, J. E. A. (2003). Developmental differences in behavioral and event‐related brain responses associated with response preparation and inhibition in a Go/NoGo task. Psychophysiology, 40(5), 752–761. 10.1111/1469-8986.00075 [DOI] [PubMed] [Google Scholar]
- Joseph, J. E. , Zhu, X. , Gundran, A. , Davies, F. , Clark, J. D. , Ruble, L. , … Bhatt, R. S. (2015). Typical and atypical neurodevelopment for face specialization: An fMRI study. Journal of Autism and Developmental Disorders, 45(6), 1725–1741. 10.1007/s10803-014-2330-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kennedy, J. T. , Harms, M. P. , Korucuoglu, O. , Astafiev, S. V. , Barch, D. M. , Thompson, W. K. , …, Anokhin, A. P. (2021). Reliability and stability challenges in ABCD task fMRI data. (p. 463750). 10.1101/2021.10.08.463750 [DOI] [PMC free article] [PubMed]
- Killgore, W. D. S. , Oki, M. , & Yurgelun‐Todd, D. A. (2001). Sex‐specific developmental changes in amygdala responses to affective faces. Neuroreport, 12(2), 427. [DOI] [PubMed] [Google Scholar]
- Killgore, W. D. S. , & Yurgelun‐Todd, D. A. (2007). Unconscious processing of facial affect in children and adolescents. Social Neuroscience, 2(1), 28–47. 10.1080/17470910701214186 [DOI] [PubMed] [Google Scholar]
- King, K. M. , Littlefield, A. K. , McCabe, C. J. , Mills, K. L. , Flournoy, J. , & Chassin, L. (2018). Longitudinal modeling in developmental neuroimaging research: Common challenges, and solutions from developmental psychology. Developmental Cognitive Neuroscience, 33, 54–72. 10.1016/j.dcn.2017.11.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klapwijk, E. , van den Bos, W. , Tamnes, C. K. , Mills, K. L. , & Raschle, N. (2019). Opportunities for increased reproducibility and replicability of developmental cognitive neuroscience [preprint]. PsyArXiv . 10.31234/osf.io/fxjzt [DOI] [PMC free article] [PubMed]
- Kragel, P. A. , Han, X. , Kraynak, T. E. , Gianaros, P. J. , & Wager, T. D. (2021). Functional MRI can be highly reliable, but it depends on what you measure: A commentary on Elliott et al. (2020). Psychological Science, 32(4), 622–626. 10.1177/0956797621989730 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kriegeskorte, N. , Simmons, W. K. , Bellgowan, P. S. F. , & Baker, C. I. (2009). Circular analysis in systems neuroscience: The dangers of double dipping. Nature Neuroscience, 12(5), 535–540. 10.1038/nn.2303 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kujawa, A. , Wu, M. , Klumpp, H. , Pine, D. S. , Swain, J. E. , Fitzgerald, K. D. , … Phan, K. L. (2016). Altered development of amygdala‐anterior cingulate cortex connectivity in anxious youth and Young adults. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 1(4), 345–352. 10.1016/j.bpsc.2016.01.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li, X. , Ai, L. , Giavasis, S. , Jin, H. , Feczko, E. , Xu, T. , …, Milham, M. P. (2021). Moving beyond processing and analysis‐related variation in neuroscience (p. 470790). bioRxiv . 10.1101/2021.12.01.470790 [DOI]
- Lieberman, M. D. , Eisenberger, N. I. , Crockett, M. J. , Tom, S. M. , Pfeifer, J. H. , & Way, B. M. (2007). Putting feelings into words: Affect labeling disrupts amygdala activity in response to affective stimuli. Psychological Science, 18(5), 421–428. 10.1111/j.1467-9280.2007.01916.x [DOI] [PubMed] [Google Scholar]
- Lüdecke, D. , Ben‐Shachar, M. S. , Patil, I. , Waggoner, P. , & Makowski, D. (2021). Performance: An R package for assessment, comparison and testing of statistical models. Journal of Open Source Software, 6(60), 3139. 10.21105/joss.03139 [DOI] [Google Scholar]
- Luna, B. , Garver, K. E. , Urban, T. A. , Lazar, N. A. , & Sweeney, J. A. (2004). Maturation of cognitive processes from late childhood to adulthood. Child Development, 75(5), 1357–1372. 10.1111/j.1467-8624.2004.00745.x [DOI] [PubMed] [Google Scholar]
- Luna, B. , Tervo‐Clemmens, B. , & Calabro, F. J. (2021). Considerations when characterizing adolescent neurocognitive development. Biological Psychiatry, 89(2), 96–98. 10.1016/j.biopsych.2020.04.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mackey, S. , & Petrides, M. (2014). Architecture and morphology of the human ventromedial prefrontal cortex. The European Journal of Neuroscience, 40(5), 2777–2796. 10.1111/ejn.12654 [DOI] [PubMed] [Google Scholar]
- Madhyastha, T. , Peverill, M. , Koh, N. , McCabe, C. , Flournoy, J. , Mills, K. , … McLaughlin, K. A. (2018). Current methods and limitations for longitudinal fMRI analysis across development. Developmental Cognitive Neuroscience, 33, 118–128. 10.1016/j.dcn.2017.11.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marek, S. , Tervo‐Clemmens, B. , Calabro, F. J. , Montez, D. F. , Kay, B. P. , Hatoum, A. S. , …, Dosenbach, N. U. F. (2020). Towards reproducible brain‐wide association studies. bioRxiv. 257758. 10.1101/2020.08.21.257758 [DOI]
- McLaren, D. G. , Ries, M. L. , Xu, G. , & Johnson, S. C. (2012). A generalized form of context‐dependent psychophysiological interactions (gPPI): A comparison to standard approaches. NeuroImage, 61(4), 1277–1286. 10.1016/j.neuroimage.2012.03.068 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Menon, V. , Adleman, N. E. , White, G. H. , Glover, G. H. , & Reiss, A. L. (2001). Error‐related brain activation during a Go/NoGo response inhibition task. Human Brain Mapping, 12(3), 131–143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meyer, H. C. , & Lee, F. S. (2019). Translating developmental neuroscience to understand risk for psychiatric disorders. American Journal of Psychiatry, 176(3), 179–185. 10.1176/appi.ajp.2019.19010091 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moriceau, S. , & Sullivan, R. M. (2006). Maternal presence serves as a switch between learning fear and attraction in infancy. Nature Neuroscience, 9(8), 1004–1006. 10.1038/nn1733 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nakagawa, S. , Johnson, P. C. D. , & Schielzeth, H. (2017). The coefficient of determination and intra‐class correlation coefficient from generalized linear mixed‐effects models revisited and expanded. Journal of the Royal Society Interface, 14(134), 20170213. 10.1098/rsif.2017.0213 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nee, D. E. (2019). FMRI replicability depends upon sufficient individual‐level data. Communications Biology, 2(1), 1–4. 10.1038/s42003-019-0378-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nord, C. L. , Gray, A. , Charpentier, C. J. , Robinson, O. J. , & Roiser, J. P. (2017). Unreliability of putative fMRI biomarkers during emotional face processing. NeuroImage, 156, 119–127. 10.1016/j.neuroimage.2017.05.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
- O'Reilly, J. X. , Woolrich, M. W. , Behrens, T. E. J. , Smith, S. M. , & Johansen‐Berg, H. (2012). Tools of the trade: Psychophysiological interactions and functional connectivity. Social Cognitive and Affective Neuroscience, 7(5), 604–609. 10.1093/scan/nss055 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Orben, A. , & Przybylski, A. K. (2019). The association between adolescent well‐being and digital technology use. Nature Human Behaviour, 3(2), 173–182. 10.1038/s41562-018-0506-1 [DOI] [PubMed] [Google Scholar]
- Passarotti, A. M. , Sweeney, J. A. , & Pavuluri, M. N. (2009). Neural correlates of incidental and directed facial emotion processing in adolescents and adults. Social Cognitive and Affective Neuroscience, 4(4), 387–398. 10.1093/scan/nsp029 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pattwell, S. S. , Liston, C. , Jing, D. , Ninan, I. , Yang, R. R. , Witztum, J. , … Lee, F. S. (2016). Dynamic changes in neural circuitry during adolescence are associated with persistent attenuation of fear memories. Nature Communications, 7, 11475. 10.1038/ncomms11475 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perlman, S. B. , & Pelphrey, K. A. (2011). Developing connections for affective regulation: Age‐related changes in emotional brain connectivity. Journal of Experimental Child Psychology, 108(3), 607–620. 10.1016/j.jecp.2010.08.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pernet, C. R. (2014). Misconceptions in the use of the general linear model applied to functional MRI: A tutorial for junior neuro‐imagers. Frontiers in Neuroscience, 8, 1. 10.3389/fnins.2014.00001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pfeifer, J. H. , Masten, C. L. , Moore, W. E. , Oswald, T. M. , Mazziotta, J. C. , Iacoboni, M. , & Dapretto, M. (2011). Entering adolescence: Resistance to peer influence, risky behavior, and neural changes in emotion reactivity. Neuron, 69(5), 1029–1036. 10.1016/j.neuron.2011.02.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pine, D. S. , Grun, J. , Zarahn, E. , Fyer, A. , Koda, V. , Li, W. , … Bilder, R. M. (2001). Cortical brain regions engaged by masked emotional faces in adolescents and adults: An fMRI study. Emotion, 1(2), 137–147. 10.1037/1528-3542.1.2.137 [DOI] [PubMed] [Google Scholar]
- Plichta, M. M. , Grimm, O. , Morgen, K. , Mier, D. , Sauer, C. , Haddad, L. , … Meyer‐Lindenberg, A. (2014). Amygdala habituation: A reliable fMRI phenotype. NeuroImage, 103, 383–390. 10.1016/j.neuroimage.2014.09.059 [DOI] [PubMed] [Google Scholar]
- Power, J. D. , Barnes, K. A. , Snyder, A. Z. , Schlaggar, B. L. , & Petersen, S. E. (2012). Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. NeuroImage, 59(3), 2142–2154. 10.1016/j.neuroimage.2011.10.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Power, J. D. , Lynch, C. J. , Silver, B. M. , Dubin, M. J. , Martin, A. , & Jones, R. M. (2019). Distinctions among real and apparent respiratory motions in human fMRI data. NeuroImage, 201, 116041. 10.1016/j.neuroimage.2019.116041 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pozzi, E. , Vijayakumar, N. , Rakesh, D. , & Whittle, S. (2020). Neural correlates of emotion regulation in adolescents and emerging adults: A meta‐analytic study. Biological Psychiatry. 89(2), 194–204. 10.1016/j.biopsych.2020.08.006 [DOI] [PubMed] [Google Scholar]
- Qin, S. , Young, C. B. , Duan, X. , Chen, T. , Supekar, K. , & Menon, V. (2014). Amygdala subregional structure and intrinsic functional connectivity predicts individual differences in anxiety during early childhood. Biological Psychiatry, 75(11), 892–900. 10.1016/j.biopsych.2013.10.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Richeson, J. A. , Todd, A. R. , Trawalter, S. , & Baird, A. A. (2008). Eye‐gaze direction modulates race‐related amygdala activity. Group Processes & Intergroup Relations, 11(2), 233–246. 10.1177/1368430207088040 [DOI] [Google Scholar]
- Rorden, C. , Bonilha, L. , Fridriksson, J. , Bender, B. , & Karnath, H.‐O. (2012). Age‐specific CT and MRI templates for spatial normalization. NeuroImage, 61(4), 957–965. 10.1016/j.neuroimage.2012.03.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Satterthwaite, T. D. , Wolf, D. H. , Loughead, J. , Ruparel, K. , Elliott, M. A. , Hakonarson, H. , … Gur, R. E. (2012). Impact of in‐scanner head motion on multiple measures of functional connectivity: Relevance for studies of neurodevelopment in youth. NeuroImage, 60(1), 623–632. 10.1016/j.neuroimage.2011.12.063 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Satterthwaite, T. D. , Wolf, D. H. , Ruparel, K. , Erus, G. , Elliott, M. A. , Eickhoff, S. B. , … Gur, R. C. (2013). Heterogeneous impact of motion on fundamental patterns of developmental changes in functional connectivity during youth. NeuroImage, 83, 45–57. 10.1016/j.neuroimage.2013.06.045 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sauder, C. L. , Hajcak, G. , Angstadt, M. , & Phan, K. L. (2013). Test‐retest reliability of amygdala response to emotional faces. Psychophysiology, 50(11), 1147–1156. 10.1111/psyp.12129 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sergerie, K. , Chochol, C. , & Armony, J. L. (2008). The role of the amygdala in emotional processing: A quantitative meta‐analysis of functional neuroimaging studies. Neuroscience & Biobehavioral Reviews, 32(4), 811–830. 10.1016/j.neubiorev.2007.12.002 [DOI] [PubMed] [Google Scholar]
- Silvers, J. A. , Insel, C. , Powers, A. , Franz, P. , Helion, C. , Martin, R. , … Ochsner, K. N. (2017b). The transition from childhood to adolescence is marked by a general decrease in amygdala reactivity and an affect‐specific ventral‐to‐dorsal shift in medial prefrontal recruitment. Developmental Cognitive Neuroscience, 25, 128–137. 10.1016/j.dcn.2016.06.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Silvers, J. A. , Insel, C. , Powers, A. , Franz, P. , Helion, C. , Martin, R. E. , … Ochsner, K. N. (2017a). VlPFC–vmPFC–amygdala interactions underlie age‐related differences in cognitive regulation of emotion. Cerebral Cortex (New York, NY), 27(7), 3502–3514. 10.1093/cercor/bhw073 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Silvers, J. A. , Shu, J. , Hubbard, A. D. , Weber, J. , & Ochsner, K. N. (2015). Concurrent and lasting effects of emotion regulation on amygdala response in adolescence and Young adulthood. Developmental Science, 18(5), 771–784. 10.1111/desc.12260 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simonsohn, U. , Simmons, J. , & Nelson, L. (2015). Specification curve: Descriptive and inferential statistics on all reasonable specifications. Marketing Papers . 10.2139/ssrn.2694998 [DOI]
- Simonsohn, U. , Simmons, J. P. , & Nelson, L. D. (2020). Specification curve analysis. Nature Human Behaviour, 4(11), 1208–1214. 10.1038/s41562-020-0912-z [DOI] [PubMed] [Google Scholar]
- Spohrs, J. , Bosch, J. E. , Dommes, L. , Beschoner, P. , Stingl, J. C. , Geiser, F. , … Viviani, R. (2018). Repeated fMRI in measuring the activation of the amygdala without habituation when viewing faces displaying negative emotions. PLoS One, 13(6), e0198244. 10.1371/journal.pone.0198244 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steegen, S. , Tuerlinckx, F. , Gelman, A. , & Vanpaemel, W. (2016). Increasing transparency through a multiverse analysis. Perspectives on Psychological Science, 11(5), 702–712. 10.1177/1745691616658637 [DOI] [PubMed] [Google Scholar]
- Sullivan, R. M. , & Perry, R. E. (2015). Mechanisms and functional implications of social buffering in infants: Lessons from animal models. Social Neuroscience, 10(5), 500–511. 10.1080/17470919.2015.1087425 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Swartz, J. R. , Carrasco, M. , Wiggins, J. L. , Thomason, M. E. , & Monk, C. S. (2014). Age‐related changes in the structure and function of prefrontal cortex–amygdala circuitry in children and adolescents: A multi‐modal imaging approach. NeuroImage, 86, 212–220. 10.1016/j.neuroimage.2013.08.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Telzer, E. H. , Flannery, J. , Humphreys, K. L. , Goff, B. , Gabard‐Durman, L. , Gee, D. G. , & Tottenham, N. (2015). “The cooties effect”: Amygdala reactivity to opposite‐versus same‐sex faces declines from childhood to adolescence. Journal of Cognitive Neuroscience, 27(9), 1685–1696. 10.1162/jocn_a_00813 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Telzer, E. H. , Humphreys, K. L. , Shapiro, M. , & Tottenham, N. (2012). Amygdala sensitivity to race is not present in childhood but emerges over adolescence. Journal of Cognitive Neuroscience, 25(2), 234–244. 10.1162/jocn_a_00311 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Telzer, E. H. , McCormick, E. M. , Peters, S. , Cosme, D. , Pfeifer, J. H. , & van Duijvenvoorde, A. C. K. (2018). Methodological considerations for developmental longitudinal fMRI research. Developmental Cognitive Neuroscience, 33, 149–160. 10.1016/j.dcn.2018.02.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Todd, R. M. , Evans, J. W. , Morris, D. , Lewis, M. D. , & Taylor, M. J. (2011). The changing face of emotion: Age‐related patterns of amygdala activation to salient faces. Social Cognitive and Affective Neuroscience, 6(1), 12–23. 10.1093/scan/nsq007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- US Census Bureau . (2021). U.S. Census Bureau QuickFacts: Los Angeles County, California . https://www.census.gov/quickfacts/losangelescountycalifornia
- VanTieghem, M. , Korom, M. , Flannery, J. , Choy, T. , Caldera, C. , Humphreys, K. L. , … Tottenham, N. (2021). Longitudinal changes in amygdala, hippocampus and cortisol development following early caregiving adversity. Developmental Cognitive Neuroscience, 48, 100916. 10.1016/j.dcn.2021.100916 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vijayakumar, N. , Pfeifer, J. H. , Flournoy, J. C. , Hernandez, L. M. , & Dapretto, M. (2019). Affective reactivity during adolescence: Associations with age, puberty and testosterone. Cortex, 117, 336–350. 10.1016/j.cortex.2019.04.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vink, M. , Derks, J. M. , Hoogendam, J. M. , Hillegers, M. , & Kahn, R. S. (2014). Functional differences in emotion processing during adolescence and early adulthood. NeuroImage, 91, 70–76. 10.1016/j.neuroimage.2014.01.035 [DOI] [PubMed] [Google Scholar]
- Wager, T. D. , & Nichols, T. E. (2003). Optimization of experimental design in fMRI: A general framework using a genetic algorithm. NeuroImage, 18(2), 293–309. [DOI] [PubMed] [Google Scholar]
- Woolrich, M. W. , Ripley, B. D. , Brady, M. , & Smith, S. M. (2001). Temporal autocorrelation in univariate linear modeling of FMRI data. NeuroImage, 14(6), 1370–1386. 10.1006/nimg.2001.0931 [DOI] [PubMed] [Google Scholar]
- Wu, M. , Kujawa, A. , Lu, L. H. , Fitzgerald, D. A. , Klumpp, H. , Fitzgerald, K. D. , … Phan, K. L. (2016). Age‐related changes in amygdala–frontal connectivity during emotional face processing from childhood into young adulthood. Human Brain Mapping, 37(5), 1684–1695. 10.1002/hbm.23129 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu, J. , Hao, L. , Chen, M. , He, Y. , Jiang, M. , Tian, T. , … Qin, S. (2021). Developmental sex differences in negative emotion decision‐making dynamics: Computational evidence and amygdala‐prefrontal pathways. Cerebral Cortex, bhab359. 10.1093/cercor/bhab359 [DOI] [PubMed] [Google Scholar]
- Xu, T. , Cho, J. W. , Kiar, G. , Bridgeford, E. W. , Vogelstein, J. T. , & Milham, M. P. (2022). A guide for quantifying and optimizing measurement reliability for the study of individual differences. bioRxiv, 478100. 10.1101/2022.01.27.478100 [DOI] [PubMed]
- Yousefi, B. , Shin, J. , Schumacher, E. H. , & Keilholz, S. D. (2018). Quasi‐periodic patterns of intrinsic brain activity in individuals and their relationship to global signal. NeuroImage, 167, 297–308. 10.1016/j.neuroimage.2017.11.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yurgelun‐Todd, D. A. , & Killgore, W. D. S. (2006). Fear‐related activity in the prefrontal cortex increases with age during adolescence: A preliminary fMRI study. Neuroscience Letters, 406(3), 194–199. 10.1016/j.neulet.2006.07.046 [DOI] [PubMed] [Google Scholar]
- Zhang, Y. , Padmanabhan, A. , Gross, J. J. , & Menon, V. (2019). Development of human emotion circuits investigated using a big‐data analytic approach: Stability, reliability, and robustness. Journal of Neuroscience, 39(36), 7155–7172. 10.1523/JNEUROSCI.0220-19.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zuo, X.‐N. , Xu, T. , & Milham, M. P. (2019). Harnessing reliability for neuroscience research. Nature Human Behaviour, 3(8), 768–771. 10.1038/s41562-019-0655-x [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Appendix S1: Supporting Information
Data Availability Statement
Because participants and their parents did not consent to data sharing at the time of participation, we cannot make data from the current study publicly available. Code for preprocessing, analysis, and data visualizations for this manuscript is available at https://github.com/pab2163/amygdala_mpfc_multiverse. While unfortunately this code cannot be run as written without data, we have attempted to document analysis steps clearly. In addition, we provide publicly available simulated data structured similarly to the study data on amygdala reactivity, such that interested readers can view multiverse analysis walkthroughs (https://pab2163.github.io/amygdala_mpfc_multiverse) and experiment with analysis code. Additional materials, including MNI space masks and preregistration documentation, are available at https://osf.io/hvdmx/.