Psychometric Properties of the NIH Toolbox Cognition and Emotion Batteries Among Children and Adolescents with Congenital Heart Defects

Julia Wallace; Rafael Ceschin; Vince K Lee; Nancy H Beluk; Cheryl Burns; Sue Beers; Cecilia Lo; Ashok Panigrahy; Daryaneh Badaly

doi:10.1101/2023.02.11.23285800

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2023 Feb 14:2023.02.11.23285800. [Version 1] doi: 10.1101/2023.02.11.23285800

Psychometric Properties of the NIH Toolbox Cognition and Emotion Batteries Among Children and Adolescents with Congenital Heart Defects

Julia Wallace ¹, Rafael Ceschin ^1,², Vince K Lee ^1,³, Nancy H Beluk ¹, Cheryl Burns ⁴, Sue Beers ⁵, Cecilia Lo ⁶, Ashok Panigrahy ^1,², Daryaneh Badaly ⁷

PMCID: PMC9949204 PMID: 36824875

Abstract

Objective:

The NIH Toolbox offers brief, computerized measures of cognitive and psychosocial functioning. However, its psychometric properties were established among typically developing children and adolescents. The current study provides the first comprehensive assessment of its psychometric properties among young patients with congenital heart defects (CHD).

Study Design:

We prospectively recruited 58 patients with CHD and 80 healthy controls between the ages of 6 and 17. Participants completed the NIH Toolbox Cognition and Emotion Batteries, a battery of clinician-administered neuropsychological tests, and ratings of their quality of life. Their parents also completed ratings of their functioning.

Results:

On the Cognition Battery, we found expectable group differences and developmentally expected gains across ages. For the most part, composites and subtests were significantly correlated with neuropsychological measures of similar constructs. Higher scores were generally associated with ratings of better day-to-day functioning among children with CHD. On the Emotion Battery, we found no significant group differences, echoing prior research. For the most part, scales showed acceptable internal consistency among both groups. There was adequate construct coherence for most of questionnaires among healthy control but not participants with CHD. Correlations with a comparison tool were largely within expectable directions.

Conclusion:

The NIH Toolbox may provide a valid and useful assessment of cognitive functioning among children and adolescents with CHD. While it may offer reliable and valid scales of psychosocial functioning, further research is needed to understand the meaningfulness of the scales for participants with CHD.

Children and adolescents with congenital heart defects (CHD), particularly those with more severe forms of cardiac anomalies, those who have undergone surgical correction in the first year of life, and those with medical comorbidities, are at risk for a myriad of cognitive and psychosocial challenges (Phillips & Longoria, 2020). As a result, guidelines and recommendations have been set forth for the screening, assessment, and intervention of neurodevelopmental concerns among young individuals with CHD (Marino et al., 2012; Ilardi et al., 2020). With this in mind, the development of a uniform set of brief and easily administered assessment tools may facilitate screening, tracking progress over time, and comparing data across centers. The cognitive and psychosocial assessments from the NIH Toolbox (NIHTB) may offer such a set of tools. However, their psychometric properties have yet to be explored among children and adolescents with CHD.

The NIHTB offers a series of computerized assessments of different domains of functioning (www.nihtoolbox.org; Gershon et al., 2013). In particular, the NIHTB Cognition Battery (NIHTB-CB) and Emotion Battery (NIHTB-EB) assess cognitive and psychosocial functioning. The NIHTB was designed to provide a set of common data elements among disparate centers using standard methodology to minimize the likelihood that result differences would be attributable to the test instruments used. It was normed for ages 3 to 85 providing a set of tools to study outcomes across the lifespan, and has been translated into a number of languages to allow for global comparisons. Furthermore, the NIHTB is both user- and participant-friendly, as it provides accessible and flexible training options, can be completed in relatively brief amount of time by participants via in person or virtual administration, and offers automatic calculation of scores.

The psychometric properties of the NIHTB were established among typically developing individuals (Bauer & Zelazo, 2013; Mungas et al., 2013; Salsman et al., 2013). However, its authors highlighted the importance of validation among clinical populations (Weintraub et al., 2013). Recently, researchers have begun to use the NIHTB among individuals with CHD, a population with an increased risk of deficits across multiple cognitive and socioemotional domains (Bellinger & Newburger, 2013). That being said, the psychometric properties of the tool within this population have largely remained unexplored. As a result, the current study examined the psychometric properties NIHTB-CB and NIHTB-EB among children and adolescents with CHD, as compared to healthy controls. We examined whether the tools showed: a) expectable group similarities and differences among patients with CHD and healthy controls, b) typical developmental trends (for cognitive tasks only), c) coherent structures using confirmatory factor analysis and indices of internal consistency (for psychosocial questionnaires only), d) correlations with measures of similar constructs (i.e., convergent validity), and d) and anticipated correlations with measures of day-to-day functioning (i.e., concurrent validity) (for cognitive tasks only).

Methods

Participants

We prospectively recruited 97 children and adolescents with an array of heart defects, including those with single ventricle physiology, aortic arch anomalies, stenosis, and other malformations, as well as 88 healthy controls from a single academic medical center. We recruited English-speaking children and adolescents between 6 to 17 years of age. Individuals diagnosed with chromosomal anomalies or with history of intensive treatment for any diagnosis were excluded from the control group. Of the 185 recruited participants for whom parents provided consent, 43 individuals were not assessed due to the following reasons: scheduling conflicts (26), decided to longer participate or withdrew (10), behavioral noncompliance (4), or other reasons (3). Additionally, 3 healthy participants with significant medical histories and 1 participant with CHD who underwent a heart transplant after consent were withdrawn by the principal investigators. The final sample of 138 participants included 58 subjects with CHD and 80 healthy controls. Details of the demographics and diagnoses of the participants are summarized in Supplemental Table 1.

Children were assented to the project, and their parent or legal guardian provided consent on their behalf. The project was approved by the University of Pittsburgh Institutional Review Board and completed in accordance with the ethical principles of the Helsinki Declaration.

Assessment Instruments

Fifty-eight participants with CHD and 74 comparison controls completed the NIHTB-CB using both desktop and iPad versions of the battery; 52 children and adolescents with CHD and 68 healthy controls responded to self-report questionnaires from the NIHTB-EB. A clinician-issued pencil-and-paper battery was administered to a subset of 44 participants with CHD and 70 controls; parents and participants also completed behavioral ratings and quality of life inventories.

NIHTB Cognitive Battery

Participants completed subtests of NIHTB-CB, generating composite scores for Crystallized Cognition, Fluid Cognition, and Total Cognition. The subtests for Crystallized Cognition include the Oral Reading Recognition Test (ORRT) and the Picture Vocabulary Test (PVT). On the ORRT, a test of letter identification and word reading, participants were shown letters (for younger children) or words (for older participants) and asked to read them aloud (Gershon et al., 2013). On the PVT, which assesses receptive vocabulary, participants selected images matching descriptive audio cues. The test uses computer adaptive testing methods (Gershon et al., 2013).

The subtests for Fluid Cognition include the List Sorting Working Memory Test (LSWMT), the Pattern Comparison Processing Speed Test (PCPST), the Flanker Inhibitory Control and Attention Test (FIC+AT), the Dimensional Change Card Sort Test – Executive Function (DCCST), and the Picture Sequence Memory Test (PSMT). On the LSWMT, a test of working memory, participants ages 7 and older were presented with a series of items presented visually and auditorily and were then asked to repeat the presented items, ordering them based on particular criteria (e.g., item size). The test requires participants to hold a set amount of information in mind, mentally organize the list of items based on a given criteria, and then verbally recall the information (Tulsky et al., 2013). On the PCPST, a test of processing speed, participants were presented with two images on the screen and had to quickly indicate if the two images matched (smiley and frowny face buttons for those ages 6 and younger, yes and no buttons for those ages 7 and older) (Carlozzi et al., 2013). On the FIC+AT, participants were instructed to focus on a central image (fish for those ages 7 and younger; arrows for those ages 8 and older) flanked by additional stimuli, which may or may not point in the same direction. Participants then had to identify the direction of the middle stimulus. The test assesses the ability to selectively pay attention to stimuli while inhibiting focus on irrelevant information (Zelazo et al., 2013). On the DCCST, which requires inhibitory control and mental flexibility, participants were first instructed to focus on the color or shape of a forthcoming image and then shown a reference image. Depending on the prompt, participants had to quickly match one of two test images, varying in both color and shape, to the reference image (Zelazo et al., 2013). On the PSMT, participants were presented with a series of pictorial scenes presented in a particular order, and then had to identify the order in which the scenes were presented. The test evaluates learning and immediate retrieval of information (Bauer et al., 2013). For each subtest of the NIHTB-CB, we derived age-corrected standard scores (M = 100, SD = 15).

NIHTB Emotion Battery

The NIHTB-EB includes measures of negative affect, psychosocial well-being, stress and self-efficacy, and social relationships (Salsman et al., 2013). Specifically, there are three scales assessing negative affect or experiences of unpleasant or distressing emotions (i.e., Anger, Fear, and Sadness), two scales assessing psychological well-being or feelings of pleasure and contentment (i.e., Positive Affect and General Life Satisfaction), two scales assessing stress and self-efficacy or one’s perception of everyday experiences and ability to respond to challenging events (i.e., Perceived Stress and Self-Efficacy), and five scales assessing social relationships, with a focus on one’s perception of the availability and quality of those relationships (i.e., Emotional Support, Loneliness, Friendship, Perceived Hostility, and Perceived Rejection). Each of the scales of the NIHTB-EB is completed on a 5-point Likert scale. While some of the scales assess 8- to 17-year-olds, others have distinct forms for 8- to 12-year-olds and 13- to 17-year-olds, with developmentally appropriate wording and item banks. All of the scales for 8- to 12- year-olds and the majority for 13- to 17-year-olds used a fixed number of items; three scales for 13- to 17-year-olds (i.e., Positive Affect, General Life Satisfaction, and Self-Efficacy) used computer-adaptive testing (CAT) methods, in which items are selected from a bank based on participants’ progressive responses. For each scale, we derived fully-corrected t-scores (M = 50, SD = 10).

Clinician-Issued Battery and Rating Scales

For analyses examining convergent and concurrent validity with the NIHTB-CB, participants completed a clinician-issued battery of neuropsychological tests as part of a larger study. They completed the Wechsler Abbreviated Scale of Intelligence, 2^nd edition (WASI-II), a brief assessment of intellectual functioning comprised of four subtests, which provides composite scores of Verbal Reasoning, Fluid Reasoning, and Full Scale IQ. The Vocabulary subtest of the WASI-II assesses crystallized knowledge of vocabulary terms, and can be used in a similar manner as word reading tests to estimate functioning (Bright & van der Linde, 2020). To assess receptive language, participants up to age 16 completed the Comprehension of Instructions subtest from the Neuropsychology Assessment, 2^nd edition (NEPSY-2), in which they were asked to listen to oral instructions of increasing syntactic complexity and point to appropriate stimuli provided. Participants up to age 16 furthermore completed subtests from the Working Memory Index (Digit Span and Letter-Number Sequencing) and Processing Speed Index (Coding and Symbol Search) of the Wechsler Intelligence Scale for Children, 4^th edition (WISC-IV). Three tests of executive functioning from the Delis-Kaplan Executive Function Scale (D-KEFS) were administered to participants above age 8. Specifically, participants completed: the Color-Word Interference Test (CWIT), a variation of the classic Stroop test that requires inhibition of automatic responses when naming stimuli color or word; the Trail Making Test (TMT), which includes a trial of mental flexibility asking individuals to quickly switch between connecting circles containing numbers and letters in order; and, the Verbal Fluency Test (VFT), which includes a trial of mental flexibility asking individuals quickly switch between saying words in different categories. Lastly, participants completed the Design Memory subtest of the Wide Range Assessment of Memory and Learning, 2^nd edition (WRAML-2), a measure of visual learning and memory.

Participants and their parents filled out the Pediatric Quality of Life Inventory (PedsQL), rating scales measuring children’s overall adjustment as well as their adjustment in physical, emotional, school and social domains (with the later three also summarized to index psychosocial adjustment). Parents also completed ratings scales assessing children’s day-to-day adjustment across multiple domains. Parents completed the Adaptive Behavior Assessment System, 2^nd edition (ABAS-II), a questionnaire designed to provide estimates of adaptive behavior, including a measure of overall adaptive skill as well as performance in conceptual, social, and practical domains. Parent-ratings were also collected on the Behavior Assessment System for Children, 2^nd edition (BASC-2), which assesses multiple areas of emotional, behavioral, and adaptive functioning.

Analysis Plan

We examined group differences between patients with CHD and healthy controls using independent sample t-tests for continuous variables and χ²-tests for frequencies within categorical variables. We considered differences in demographic variables, scores on the NIHTB, and scores on the clinician-issued battery and rating scales. We considered developmental trends for the NIHTB by analyzing the correlations between raw scores and age. We assessed whether measures on the NIHTB demonstrated adequate convergent validity by analyzing the correlations between test scores on the NIHTB and those on measures of similar abilities. We examined whether measures on the NIHTB demonstrated adequate concurrent validity by analyzing the correlations between test scores on the NIHTB and measures of day-to-day functioning. To understand if developmental trends or validity indices differed between participants with CHD and healthy controls, we used Fisher (1915^;¹⁹²¹)’s r-to-z transformation. To understand the construct coherence of questionnaires, we conducted confirmatory factor analyses (CFA). Because measures that were administered as computer adaptive tests did not have data for each of the questionnaire items, data were imputed using multiple imputation by chained equations (MICE) (Bulut & Kim, 2021). MICE was completed using an algorithm provided by the scikit-learn multiple imputation library (IterativeImputer) with a Bayesian ridge estimator and 500 iterations. Each participant’s response variables were used to estimate the missing data. A second analysis included both the response variables and group to estimate the missing data. Because the pattern of findings was similar with both imputations, only the latter findings are reported. We examined multiple measures of fit for our CFA models, including the standardized root mean square residual (SRMR), the root mean square error of approximation (RMSEA), and the Bentler comparative fit index (CFI). Adequate model fit was indicated by SRMR ≤ 0.12, RMSEA ≤ 0.12, and CFI ≥ 0.88, using guidelines from Taasoobshirazi & Wang, 2016. To explore the internal consistency of questionnaires, Cronbach’s alpha was calculated for measures that were administered as fixed forms, and marginal reliability coefficients were estimated with graded item response theory (IRT) models for measures that were administered as computer adaptive tests.

Results

Participant demographics are detailed in Supplemental Table 1. There were larger percentages of individuals identified as White and as male among participants with CHD as compared to healthy controls. Otherwise, the two groups of participants did not differ on demographic variables.

Cognitive Battery

Group Differences

Participants with CHD performed significantly poorer than healthy peers on the NIHTB-CB (ps ≤ 0.045), with exception of the LSWMT (Table 1). Still, children and adolescents with CHD generally performed within normal limits. Similarly, on the clinician-issued battery of cognitive tests, participants with CHD tended to perform more poorly than healthy controls but also tended to perform within normal limits (for the comparison data, see Supplemental Table 2). Of note, the overall cognitive level of healthy controls differed from the population mean based on both the NIHTB-CB Total Cognition (t = 5.78, p < 0.001) and the WASI-II Full Scale IQ (t = 6.26, p < 0.001). Similarly, the overall cognitive level of youths with CHD was higher than has been reported in past work (Feldman et al. 2021), suggesting that we captured typical group differences even if participants in both groups were slightly higher functioning than would be expected.

Table 1.

Overview of Group Differences: NIHTB-CB and NIHTB-EB

Test / Composite / Scale	Control		CHD		t	p

	n	μ (SD)	n	μ (SD)
NIHTB-CB

List Sorting Working Memory	74	105.88 (13.934)	57	102.90 (14.292)	1.202	0.232
Pattern Comparison Processing Speed	74	103.97 (24.103)	57	92.05 (22.542)	2.887	0.005
Flanker Inhibitory Control and Attention	74	101.00(14.979)	58	95.77 (14.388)	2.025	0.045
Dimensional Card Change Sort	74	102.9 (16.687)	58	94.17 (16.600)	3.002	0.003
Picture Vocabulary	73	108.67 (13.378)	58	102.15 (12.235	2.877	0.005
Picture Sequence Memory	74	106.60(16.765)	57	97.90 (14.544)	3.115	0.002
Oral Reading Recognition	73	116.93 (18.316)	55	104.88 (17.251)	3.776	<0.001
Fluid Cognition	74	106.87 (20.145)	56	94.92 (16.657)	3.602	<0.001
Crystallized Cognition	73	115.06 (15.596)	55	104.91 (12.963)	4.016	<0.001
Total Cognition	73	113.35 (19.737)	54	99.58 (15.791)	4.223	<0.001

NIHTB-EB

Anger^*	68	49.17 (8.383)	46	50.61 (8.083)	−0.909	0.365
Emotional Support	68	48.42 (9.419)	46	50.07 (8.832)	−0.943	0.347
Fear^*	67	49.28 (9.196)	46	50.77 (11.053)	−0.783	0.435
Friendship	67	48.67 (10.734)	46	49.59 (12.518)	−0.418	0.677
Loneliness^*	67	49.32 (11.033)	46	51.51 (8.488)	−1.191	0.236
Perceived Hostility^*	68	47.30 (9.816)	46	48.93 (10.842)	−0.839	0.404
Perceived Rejection^*	68	49.36 (10.299)	45	51.34 (9.606)	−1.026	0.307
Sadness^*	68	45.91 (10.310)	45	49.07 (11.203)	−1.539	0.127
Perceived Stress^* – Ages 13-17	34	47.85 (8.831)	24	51.78 (9.554)	−1.612	0.113
Life Satisfaction – Ages 8-17	67	53.68 (10.685)	46	53.42 (11.022)	0.125	0.901
Ages 8-12	33	55.06 (12.365)	22	53.92 (14.448)	0.313	0.756
Ages 13-17	34	52.35 (8.736)	24	52.97 (6.829)	−0.291	0.772
Positive Affect – Ages 8-17	60	48.14 (9.015)	33	45.42 (9.165)	1.386	0.169
Ages 8-12	28	48.10 (9.682)	18	45.49 (8.608)	0.931	0.357
Ages 13-17	32	48.19 (8.545)	15	45.34 (10.101)	1.004	0.321
Self-Efficacy – Ages 8-17	68	51.29 (10.928)	45	48.40 (8.889)	1.475	0.143
Ages 8-12	33	54.43 (12.205)	21	48.63 (10.240)	1.806	0.077
Ages 13-17	35	48.33 (8.749)	24	48.20 (7.738)	0.056	0.956

Open in a new tab

Note.

indicates measures for which higher scores indicate poorer outcomes.

Developmental Trends

Age was positively associated with improved performance on all NIHTB-CB subtests and composite scores, both among participants with CHD (rs ≥ 0.547, ps < 0.001) and healthy controls (rs > 0.400, ps < 0.001). Age effects did not significant differ between participants with CHD and healthy controls, based on Fisher’s r-to-z transformations (zs < |1.71|, ps > 0.08).

Convergent Validity

As shown in Table 2, composite scores on the NIHTB-CB were moderately correlated with comparable composites from the WASI-II for both participants with CHD and typically developing peers, with no significant group differences. For the most part, subtests from the NIHTB-CB were significantly correlated with neuropsychological measures assessing similar constructs across groups, with small-to-medium effect sizes. That being said, the correlations between the NIHTB-CB DCCST and both the D-KEFS VFT Switching and the D-KEFS TMT Number Letter were not significant for either group, likely reflecting that the comparison measures do not sufficiently assess the same construct as the NIHTB-CB DCCST. Moreover, the positive correlation between the NIHTB-CB FIC+AT and the D-KEFS CWIT Inhibition was only significant among participants with CHD; however, the difference between the groups was not statistically significant. The positive correlation between the NIHTB-CB Oral Reading Test and WASI-II Vocabulary was only significant among healthy controls, and this represented the only statistically significant difference between participants with CHD and healthy controls.

Table 2.

Convergent Validity Estimates Comparing the NIHTB-CB to Clinician-Issued Tests of Similar Domains

Test	Control			CHD			Group Differences

	n	r	p	n	r	p	r-to-z	p
List Sorting Working Memory Test
WISC-IV Working Memory Index	61	0.318	0.013	43	0.434	0.004	−0.66	0.509

Pattern Comparison Processing Speed Test
WISC-IV Processing Speed Index	60	0.496	<0.001	41	0.318	0.042	1.02	0.308

Flanker Inhibitory Control and Attention Test
D-KEFS CWIT Inhibition	54	0.241	0.080	27	0.415	0.031	−0.79	0.430

Dimensional Card Change Sort Test
D-KEFS CWIT Inhibition	54	0.290	0.033	27	0.520	0.005	−1.12	0.263
D-KEFS VFT Switching	57	0.112	0.407	38	0.307	0.060	−0.94	0.347
D-KEFS TMT Number Letter	56	0.093	0.498	35	0.258	0.135	−0.76	0.447

Picture Vocabulary Test
NEPSY-II Comprehension of Instructions	62	0.480	<0.001	44	0.376	0.012	0.63	0.529

Oral Reading Recognition Test
WASI-II Vocabulary	60	0.595	<0.001	42	0.189	0.230	2.38	0.017

Picture Sequence Memory Test
WRAML-2 Design Memory	64	0.422	0.001	42	0.399	0.009	0.13	0.897

Fluid Cognition Composite
WASI-II Perceptual Reasoning Index	61	0.538	<0.001	41	0.410	0.008	0.79	0.430

Crystallized Cognition Composite
WASI-II Verbal Comprehension Index	60	0.597	<0.001	42	0.410	0.007	1.22	0.223

Total Cognition Composite
WASI-II Full Scale IQ	60	0.625	<0.001	40	0.534	<0.001	0.65	0.516

Open in a new tab

Concurrent Validity

When considering whether scores from the NIHTB-CB were associated with measures of day-to-day functioning in a predictable fashion (i.e., assessing concurrent validity), we limited our analyses to composite scores of the NIHTB-CB. As shown in Table 3, for children and adolescents with CHD, better fluid cognition on the NIHTB-CB was associated with fewer externalizing and behavior problems, as rated by parents on the BASC-2. Better crystalized cognition on the NIHTB-CB was associated with better adaptive functioning across composite scores from the ABAS-II and BASC-2 as well as fewer behavior problems as indexed on the BASC-2. Better overall cognition on the NIHTB-CB was associated with better global and practice adaptive functioning on the ABAS-II and BASC-2 as well as fewer internalizing, externalizing, and behavior problems on the BASC-2. Although correlations were not significant for healthy control, there were no significant differences between participants with CHD and healthy controls for any of the analyses.

Table 3.

Concurrent Validity Estimates Comparing the NIHTB-CB to Clinician-Issued Ratings of Functioning

	Control			CHD			Group Differences

	n	r	p	n	r	p	r-to-z	p
Fluid Cognition Composite
ABAS-II Global Adaptive Composite	60	0.017	0.895	42	0.125	0.429	−0.52	0.603
ABAS-II Conceptual Composite	62	0.089	0.493	43	0.063	0.689	0.13	0.897
ABAS-II Social Composite	62	−0.001	0.993	43	0.071	0.649	−0.35	0.726
ABAS-II Practical Composite	60	−0.149	0.255	42	0.166	0.294	−1.53	0.126
BASC-2 Externalizing Composite	64	−0.162	0.200	42	−0.324	0.037	0.84	0.401
BASC-2 Internalizing Composite	64	−0.044	0.733	43	−0.254	0.100	1.06	0.289
BASC-2 Behavior Symptoms Index	64	−0.097	0.448	43	−0.347	0.023	1.30	0.194
BASC-2 Adaptive Composite	64	0.094	0.458	43	0.142	0.364	−0.24	0.810

Crystallized Cognition Composite
ABAS-II Global Adaptive Composite	59	0.147	0.268	41	0.434	0.005	−1.51	0.131
ABAS-II Conceptual Composite	61	0.210	0.104	42	0.372	0.015	−0.86	0.390
ABAS-II Social Composite	61	0.126	0.334	42	0.299	0.054	−0.88	0.379
ABAS-II Practical Composite	59	0.069	0.603	41	0.407	0.008	−1.73	0.084
BASC-2 Externalizing Composite	63	−0.107	0.405	41	−0.225	0.157	0.59	0.555
BASC-2 Internalizing Composite	63	0.027	0.831	42	−0.212	0.178	1.18	0.238
BASC-2 Behavior Symptoms Index	63	−0.150	0.240	42	−0.332	0.032	0.94	0.347
BASC-2 Adaptive Composite	63	0.232	0.068	42	0.416	0.006	−1.00	0.317

Total Cognition Composite
ABAS-II Global Adaptive Composite	59	0.107	0.420	41	0.321	0.041	−1.07	0.285
ABAS-II Conceptual Composite	61	0.187	0.149	42	0.242	0.123	−0.28	0.780
ABAS-II Social Composite	61	0.084	0.522	42	0.204	0.195	−0.59	0.555
ABAS-II Practical Composite	59	−0.055	0.677	41	0.339	0.030	−1.94	0.052
BASC-2 Externalizing Composite	63	−0.185	0.147	41	−0.332	0.034	0.76	0.447
BASC-2 Internalizing Composite	63	−0.008	0.951	42	−0.279	0.073	1.35	0.177
BASC-2 Behavior Symptoms Index	63	−0.155	0.226	42	−0.409	0.007	1.35	0.177
BASC-2 Adaptive Composite	63	0.206	0.105	42	0.311	0.045	−0.55	0.582

Open in a new tab

Emotion Battery

Group Differences

As depicted in Table 1, there were no significant differences between participants with CHD and typically developing peers for any of the scales from the NIHTH-EB. Overall, both groups of children and adolescents endorsed generally healthy socioemotional functioning.

Confirmatory Factor Analysis

As detailed in Table 4, there was adequate construct coherence for questionnaires from the NIHTB-EB among healthy control, based on the SRMR and CFI from CFA models, echoing studies on the development of the tools (Salsman et al., 2013). Still, the SRMR and CFI only trended towards guidelines of acceptable fit for certain scales (i.e., Sadness, Positive Affect for Ages 8-12, and Perceived Stress for Ages 13-17) and suggested poor fit for one scale (i.e., Positive Affect for Ages 13-17). By comparison, for participants with CHD, the majority of scales did not yield CFA models with SRMR and CFI within guidelines. Even if fit indices often trended towards acceptable ranges, they did not for Positive Affect for Ages 8-12 and for Ages 13-17 and Self-Efficacy for Ages 8-12 and for Ages 13-17. Of note, CFA models across the groups typically yielded RMSEA suggestive of poor fit. However, our small sample sizes likely inflated the RMSEA, making it a less useful measures in evaluating model fit (Kenny, Kaniskan, & McCoach, 2014).

Table 4.

Indices of Internal Consistency and Confirmatory Factor Analysis for the NIHTB-EB

Scale	Control					CHD

Fixed Forms	n	α	SRMR	RMSEA	CFI	n	α	SRMR	RMSEA	CFI
Anger	68	0.876	0.041	0.141	0.959	52	0.770	0.063	0.169	0.901
Emotional Support	68	0.913	0.047	0.145	0.939	52	0.874	0.107	0.234	0.796
Fear	68^a	0.921	0.074	0.185	0.902	52 ^a	0.955	0.065	0.216	0.873
Friendship	68	0.822	0.075	0.202	0.895	52	0.883	0.043	0.140	0.963
Loneliness	68	0.895	0.048	0.151	0.923	52	0.841	0.138	0.258	0.728
Perceived Hostility	68	0.873	0.061	0.202	0.920	51	0.869	0.072	0.280	0.862
Perceived Rejection	68	0.896	0.049	0.216	0.937	51	0.833	0.057	0.147	0.944
Sadness	68 ^a	0.881	0.073	0.262	0.824	50 ^a	0.924	0.111	0.248	0.811
Life Satisfaction 8-12	33	0.771	0.080	0.139	0.930	25	0.779	0.113	0.256	0.785
Positive Affect 8-12	28	0.817	0.122	0.203	0.660	17	0.713	0.146	0.204	0.429
Perceived Stress 13-17	34	0.606	0.174	0.229	0.577	26	0.797	0.100	0.023	0.992
CAT Forms	n	Marginal Reliability	SRMR	RMSEA	CFI	n	Marginal Reliability	SRMR	RMSEA	CFI

Self-Efficacy 8-12	29	0.918	0.072	0.089	0.951	22	0.876	0.210	0.243	0.366
Life Satisfaction 13-17	35	0.921	0.101	0.165	0.792	27	0.869	0.106	0.148	0.752
Self-Efficacy 13-17	35	0.927	0.080	0.153	0.869	27	0.842	0.130	0.167	0.673
Positive Affect 13-17	35	--	0.243	0.403	0.445	27	--	0.147	0.433	0.453

Open in a new tab

Notes.

denotes that a smaller n was used to calculate Cronbach’s α due to differences in items completed across participants. The marginal reliability for Positive Affect 13-17 could not be computed in a reliable fashion either within or across groups due to the number of items that were completed by a small number of participants.

Internal Consistency

As shown in Table 4, analyses using Cronbach’s α revealed acceptable internal consistency between fixed form items among participants with CHD (α > 0.713), similar to healthy controls (α > 0.771), with one exception. Among healthy controls, but not those with CHD, Perceived Stress for Ages 13-17 demonstrated questionable internal consistency (α = 0.606). Similarly, for CAT forms, marginal reliability coefficients from IRT models were suggestive of acceptable internal consistency among both participants with CHD and healthy controls (≥ 0.842).

Convergent Validity

When considering the convergent validity of the self-completed questionnaires from the NIHTB-EB, we focused our analyses on self-reports from the PedsQL. Although research suggests that parents can view their children’s functioning in a divergent manner than children do (Jackson et al., 2015), we also considered parent-reports from the PedsQL, given that we were limited in the number of participants who completed the rating scale. We conducted exploratory analyses using relevant subscales from the parent-completed ABAS-II and BASC-2. As a similar pattern of findings emerged as with the parent-completed PedsQL, we limit our discussion to the PedsQL.

As shown in Table 5, correlations between the scales of the NIHTB-EB and scores from the PedsQL were largely within expectable directions. It should be noted, though, that not all effects, even those with medium effect sizes were significant, as a function of sample size. Importantly, there were no significant group differences, with one exception. Healthy controls who endorsed greater perceived rejection on the NIHTB-EB were rated by their parents as having poorer social functioning on the PedsQL, but the same was not true for children and adolescents with CHD.

Table 5.

Convergent Validity Estimates Comparing the NIHTB-EB to Rating Scales of Similar Domains

Scale / Test	Control			CHD			Group Differences

	n	r	p	n	r	p	r-to-z	p
Anger – Ages 8-17
PedsQL Parent Emotional Functioning	58	−0.146	0.274	34	0.010	0.957	−0.70	0.484
PedsQL Child Emotional Functioning	39	−0.738	<0.001	14	−0.628	0.016	−0.60	0.549

Emotional Support – Ages 8-17
PedsQL Parent Social Functioning	58	0.331	0.011	34	−0.086	0.630	1.92	0.055
PedsQL Child Social Functioning	39	0.159	0.334	14	0.209	0.473	−0.15	0.881

Fear – Ages 8-17
PedsQL Parent Emotional Functioning	57	−0.328	0.013	34	−0.087	0.625	−1.12	0.263
PedsQL Child Emotional Functioning	38	−0.765	<0.001	14	−0.806	<0.001	0.32	0.749

Friendship – Ages 8-17
PedsQL Parent Social Functioning	57	0.358	0.006	34	0.146	0.409	1.01	0.313
PedsQL Child Social Functioning	38	0.173	0.298	13	0.243	0.402	−0.20	0.842

Loneliness – Ages 8-17
PedsQL Parent Social Functioning	57	−0.432	0.001	34	−0.302	0.083	−0.67	0503
PedsQL Child Social Functioning	38	−0.286	0.082	14	−0.444	0.111	0.53	0.596

Perceived Hostility – Ages 8-17
PedsQL Parent Social Functioning	58	−0.237	0.074	34	−0.031	0.861	−0.94	0.347
PedsQL Child Social Functioning	39	−0.417	0.008	14	−0.412	0.144	−0.02	0.984

Perceived Rejection – Ages 8-17
PedsQL Parent Social Functioning	58	−0.430	0.001	33	0.069	0.705	−2.33	0.020
PedsQL Child Social Functioning	39	−0.396	0.013	14	−0.334	0.243	−0.21	0.834

Sadness – Ages 8-17
PedsQL Parent Emotional Functioning	58	−0.347	0.008	33	−0.069	0.702	−1.29	0.197
PedsQL Child Emotional Functioning	39	−0.602	<0.001	14	−0.514	0.060	−0.37	0.711

Perceived Stress – Ages 13-17
PedsQL Parent Emotional Functioning	28	−0.318	0.099	14	−0.292	0.311	−0.08	0.936
PedsQL Child Emotional Functioning	20	−0.560	0.010	5	−0.878	0.050	0.98	0.327

Life Satisfaction – Ages 8-17
PedsQL Parent Total Functioning	57	0.307	0.020	34	0.361	0.036	−0.27	0.787
PedsQL Child Total Functioning	38	0.352	0.030	14	0.399	0.158	−0.16	0.873

Positive Affect – Ages 8-17
PedsQL Parent Emotional Functioning	51	0.014	0.924	31	0.186	0.318	−0.73	0.465
PedsQL Child Emotional Functioning	39	0.107	0.518	14	0.447	0.109	−1.08	0.280

Self-Efficacy – Ages 8-17
PedsQL Parent Psychosocial Functioning	58	0.091	0.496	33	0.119	0.511	−0.12	0.905
PedsQL Child Psychosocial Functioning	39	0.337	0.036	14	0.032	0.914	0.93	0.352

Open in a new tab

Discussion

The current study examined the psychometric properties NIHTB-CB and NIHTB-EB among children and adolescents with CHD, as compared to healthy controls. Prior research on the development of the NIHTB-CB composite scores and underlying subtests has demonstrated adequate factor structure (CFI ≥ 0.725, TLI ≥ 0.689, RSMEA ≤ 0.059, SRMR ≤ 0.039), robust developmental effects across childhood (r ≥ 0.77), and expectable correlations with measures of similar constructs (r ≥ 0.34) among children and adolescents (Bauer & Zelazo, 2013; Mungas et al., 2013). Similarly, pediatric self-reports from the NIHTB-EB have been found to assess subdomains of psychosocial functioning in a coherent manner (using confirmatory factor analysis, CFI ≥ 0.913; RSMEA ≥ 0.057), include items that reliably measure the same constructs (Cronbach’s α > 0.86), and relate to comparable measures in a predictable way (|r| ≥ 0.28) (Salsman et al., 2013), with one exception (i.e., Perceived Rejection for Ages 13–17).

It is important to note, though, that the psychometric properties of the NIHTB were established among typically developing individuals but may differ in clinical populations. Recently, researchers have begun to use the NIHTB among individuals with CHD, a population with an increased risk of deficits across multiple cognitive and socioemotional domains (Bellinger & Newburger, 2013). For example, the NIHTB-CB has been used to assess the efficacy of cognitive training interventions among children with hypoplastic left heart syndrome and multiple forms of critical CHD (Calderon et al., 2019; Siciliano et al., 2020). Meanwhile, researchers have used a virtual administration of NIHTB-EB to assess the stress felt by patients with CHD during the COVID-19 pandemic (Cousino et al., 2020). Although the NIHTB has already begun to be administered patients with CHD, the psychometric properties of the tool within this population have largely remained unexplored. Siciliano and colleagues (2020) did note that children with hypoplastic left heart syndrome (HLHS) performed more poorly than healthy controls on the fluid cognition composite score from the NIHTB-CB, and the fluid cognition composite score was positively associated, with medium to large effect sizes, with the fluid reasoning, working memory, and processing speed indices of the Wechsler Intelligence Scale for Children, 5^th edition (WISC-V). That being said, the study, limited to those with HLHS, did not consider associations of the individual subtests, the crystalized cognition composite score, or the overall composite.

The results of the current study provide the first comprehensive assessment of the psychometric properties of the NIHTB-CB and NIHTB-EB among children with CHD. As has been found in prior work with typically developing children, we found developmentally expected gains in performance across cognitive tasks for those with CHD. Although subjects with CHD largely performed within expectation for their age on cognitive tasks, there were inefficiencies among children and adolescents with CHD as compared to healthy controls on all cognitive tasks with exception of the LSWMT. Such a pattern of findings is consistent with prior research showing subtle cognitive vulnerabilities among children with CHD when considered as a whole (Bellinger & Newburger, 2013; Marino et al., 2012). For the most part, composites and subtests from the NIHTB-CB were significantly correlated with neuropsychological measures assessing similar constructs across groups, with small-to-medium effect sizes. Moreover, higher scores on the NIHTB-CB were also generally associated with ratings of better day-to-day functioning among children with CHD. As such, the brief and user-friendly NIHTB-CB appears to provide a valid and useful assessment of cognitive functioning among children and adolescents with CHD.

On the NIHTB-EB, we found that both children and adolescents with CHD and their typically developing peers endorsed generally healthy socioemotional functioning, and there were no significant differences between the groups for any of the scales. Prior research has similarly found that, although parents and other informants report socioemotional vulnerabilities, children with CHD often do not (Jackson et al., 2015). For the most part, scales from the NIHTB-EB showed acceptable internal consistency among both participants with CHD and healthy controls. Interestingly, there was adequate construct coherence for most of questionnaires from the NIHTB-EB among healthy control, echoing studies on the development of the tools (Salsman et al., 2013). However, for participants with CHD, the majority of scales did not yield models with adequate fit.

Correlations between the scales of the NIHTB-EB and a comparison tool were largely within expectable directions, although analyses were limited by our sample size. Overall, while the NIHTB-EB may be a reliable and valid tool among patients with CHD, further research is needed to understand the meaningfulness of the scales for participants with CHD.

Similar to our work with children and adolescents with CHD, there are emerging efforts to assess the feasibility and properties of the NIHTB among diverse groups. Among children and adolescents, the literature has, for example, supported the use of the cognitive battery among those with intellectual disability, traumatic and other acquired brain injuries, and epilepsy (Chadwick et al., 2021; Shields et al., 2020; Thompson et al., 2020; Watson et al., 2020). Still, research has found that the cognitive battery from the NIHTB may not be appropriate or sensitive to differences in functioning among those with a high degree of impairment or agitation. It has been noted that young children with intellectual disability and older children with very low functioning may require test adaptations (Shields et al., 2020). In fact, children and adolescents with a high degree of cognitive impairment may not be able to complete the NIHTB-CB, as noted in a study with pediatric patients with treatment-resistant epilepsy (Thompson et al., 2020). Similar findings have been seen with adult samples. Although the NIHTB-CB has been found to be a useful battery when considering Alzheimer’s disease, the tests of memory may be too difficult and insufficiently sensitive for those at the lower end of memory function (Hackett et al., 2018; Ma et al., 2021). It has also been noted that children and adolescents with a high degree agitation (e.g., inhibition, emotional lability, and aggression) may not be able to complete the NIHTB-CB, as noted in a study with pediatric patients with acquired brain injuries (Watson et al., 2020). Such findings across clinical populations point to limitations of the NIHTB-CB that may also be applicable to patients with CHD. Although, on average, those with CHD display intellectual functioning in the average range and no more than mild behavioral dysregulation, there are subtests of patients with cognitive impairments and agitation who may not be well-served by the NIHTB-CB. Indeed, in our study, there were several individuals who could not complete testing due to behavioral noncompliance.

Emerging work with other clinical populations has also found that the cognitive tests of the NIHTB may uniquely capture the subtle deficits seen in some groups. For instance, Chadwick and colleagues (2021) noted that the NIHTB-CB may be more advantageous for detecting cognitive deficits after mild TBI in pediatric patients compared to traditional tests given its focus on reaction time as well as accuracy when assessing attention and executive functioning. Yet, research has suggested that the brief cognitive battery from the NIHTB may be less sensitive in capturing subtle cognitive differences in other populations. For example, Meredith and colleagues (2020) did not find differences in cognition between adults with alcohol use disorder and healthy controls using the NIHTB-CB, as found in studies with comprehensive neuropsychological batteries. Data from the present study suggested that the NIHTB-CB is generally sensitive to differences in cognition between patients with CHD and healthy controls. Still, its measure of working memory did not pick up on the differences between the groups that were seen within traditional measures.

It is important to acknowledge, that while our study is the first to provide a comprehensive view of the psychometric properties of the NIHTB-CB and NIHTB-EB among children and adolescents with CHD, there are limitations to our findings. First, longitudinal data were not available in order to explore test-retest reliability, the ability to detect meaningful change, and predictive relationships. However, data among typically developing children and adolescents suggests that further longitudinal studies may be beneficial. Although prior work has found strong test-retest reliability at short intervals for the NIHTB-CB (ICC ≥ 0.76), moderate associations have been seen at multi-year intervals (ICC = 0.31–0.76) (Bauer & Zelazo, 2013; Taylor et al., 2020). Second, we were limited to the tools used within our larger study when considering how traditional measures compared to the NIHTB. Third, we acknowledge that we had a small sample size for certain analyses. As a result of the available tools and sample size, we were restricted in exploring the validity of the NIHTB-CB and, particularly, the NIHTB-EB, and it would be helpful to continue exploring this area in future research. Future studies may also investigate the effect of CHD lesion type and severity, social economic status, and other medical and demographic factors on the psychometric properties of the NIHTB (Loccoh et al., 2018; Marino et al., 2012; Naef at al., 2017).

Overall, the NIH Toolbox offers developmentally sensitive, reliable, and valid assessments of cognitive abilities and psychosocial functioning among children and adolescents with CHD. As such, the easy-to-administer, time-efficient, and cost-effective tool may facilitate clinical and empirical endeavors requiring brief and repeatable assessments. Still, the NIH Toolbox may not provide the breadth, level of detail, and sensitivity needed for all contexts. As such, clinicians and researchers may continue to need complementary measures for comprehensive assessments.

Supplementary Material

NIHPP2023.02.11.23285800v1-supplement-1.pdf^{(33.3KB, pdf)}

Acknowledgments

This work was supported by the Department of Defense (W81XWH-16-1-0613), the National Heart, Lung and Blood Institute (R01 HL152740-1 and R01 HL128818-05), the National Heart, Lung and Blood Institute with National Institute of Aging (R01 HL128818-05 S1), the National Institute of Neurological Disorders and Stroke (K23 063371), the National Library of Medicine (5T15LM007059-27), the Pennsylvania Department of Health, the Mario Lemieux Foundation, and the Twenty Five Club Fund of Magee Women’s Hospital.

Abbreviations

ABAS-II: Adaptive Behavior Assessment System, 2^nd edition
BASC-2: Behavior Assessment System for Children, 2^nd edition
CAT: computer-adaptive testing
CFA: confirmatory factor analyses
CFI: comparative fit index
CHD: congenital heart defects
CWIT: Color-Word Interference Test
DCCST: Dimensional Change Card Sort Test
D-KEFS: Delis-Kaplan Executive Function Scale
FIC+AT: Flanker Inhibitory Control and Attention Test
HLHS: hypoplastic left heart syndrome
IRT: item response theory
MICE: multiple imputation by chained equations
NEPSY-2: Neuropsychology Assessment, 2^nd edition
NIHTB: National Institutes of Health Toolbox
NIHTB-CB: NIHTB Cognition Battery
NIHTB-EB: NIHTB Emotion Battery
LSWMT: List Sorting Working Memory Test
ORRT: Oral Reading Recognition Test
PCPST: Pattern Comparison Processing Speed Test
PedsQL: Pediatric Quality of Life Inventory
PSMT: Picture Sequence Memory Test
PVT: Picture Vocabulary Test
RMSEA: root mean square error of approximation
SRMR: standardized root mean square residual
TMT: Trail Making Test
VFT: Verbal Fluency Test
WASI-II: Wechsler Abbreviated Scale of Intelligence, 2^nd edition
WISC-IV: Wechsler Intelligence Scale for Children, 4^th edition
WISC-V: Wechsler Intelligence Scale for Children, 5^th edition
WRAML-2: Wide Range Assessment of Memory and Learning, 2^nd edition

Footnotes

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Data Sharing: Individual participant data that underlie the results reported in this article, after deidentification (text, tables, figures, and appendices), Study Protocol, and Statistical Analysis Plan, are available upon formal request. Requests should be submitted to the corresponding author.

References

1.Phillips J. M., & Longoria J. M. (2020). Addressing the neurodevelopmental needs of children and adolescents with congenital heart disease: A review of the existing intervention literature. Child Neuropsychology, 26(4), 433–459. [DOI] [PubMed] [Google Scholar]
2.Marino B. S., Lipkin P. H., Newburger J. W., Peacock G., Gerdes M., Gaynor J. W., Mussatto K. A., Uzark K., Goldberg C. S., Johnson W. H. Jr, Li J., Smith S. E., Bellinger D. C., Mahle W. T., & American Heart Association Congenital Heart Defects Committee, Council on Cardiovascular Disease in the Young, Council on Cardiovascular Nursing, and Stroke Council (2012). Neurodevelopmental outcomes in children with congenital heart disease: evaluation and management: a scientific statement from the American Heart Association. Circulation, 126(9), 1143–1172. [DOI] [PubMed] [Google Scholar]
3.Ilardi D., Sanz J., Cassidy A., Sananes R., Rollins C., Ullman Shade C., & Bellinger D. (2020). Neurodevelopmental evaluation for school-age children with congenital heart disease: Recommendations from the cardiac neurodevelopmental outcome collaborative. Cardiology in the Young, 30(11), 1623–1636. [DOI] [PubMed] [Google Scholar]
4.Gershon R. C., Slotkin J., Manly J. J., Blitz D. L., Beaumont J. L., Schnipke D., Wallner-Allen K., Golinkoff R. M., Gleason J. B., Hirsh-Pasek K., Adams M. J., & Weintraub S. (2013). IV. NIH Toolbox Cognition Battery (CB): measuring language (vocabulary comprehension and reading decoding). Monographs of the Society for Research in Child Development, 78(4), 49–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Bauer P. J., & Zelazo P. D. (2013). IX. NIH Toolbox Cognition Battery (CB): summary, conclusions, and implications for cognitive development. Monographs of the Society for Research in Child Development, 78(4), 133–146. [DOI] [PubMed] [Google Scholar]
6.Mungas D., Widaman K., Zelazo P. D., Tulsky D., Heaton R. K., Slotkin J., Blitz D. L., & Gershon R. C. (2013). VII. NIH Toolbox Cognition Battery (CB): factor structure for 3 to 15 year olds. Monographs of the Society for Research in Child Development, 78(4), 103–118. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Salsman J. M., Butt Z., Pilkonis P. A., Cyranowski J. M., Zill N., Hendrie H. C., Kupst M. J., Kelly M. A., Bode R. K., Choi S. W., Lai J. S., Griffith J. W., Stoney C. M., Brouwers P., Knox S. S., & Cella D. (2013). Emotion assessment using the NIH Toolbox. Neurology, 80(11 Suppl 3), S76–S86. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Weintraub S., Dikmen S. S., Heaton R. K., Tulsky D. S., Zelazo P. D., Bauer P. J., Carlozzi N. E., Slotkin J., Blitz D., Wallner-Allen K., Fox N. A., Beaumont J. L., Mungas D., Nowinski C. J., Richler J., Deocampo J. A., Anderson J. E., Manly J. J., Borosh B., Havlik R., … Gershon R. C. (2013). Cognition assessment using the NIH Toolbox. Neurology, 80(11 Suppl 3), S54–S64. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Bellinger D. C., & Newburger J. W. (2013). Late neurodevelopmental outcomes in children with congenital heart disease. In Baron I. S. & Rey-Casserly C. (Eds.), Pediatric Neuropsychology: Medical advances and lifespan outcomes (p. 99–112). Oxford University Press. [Google Scholar]
10.Tulsky D. S., Carlozzi N. E., Chevalier N., Espy K. A., Beaumont J. L., & Mungas D. (2013). V. NIH Toolbox Cognition Battery (CB): measuring working memory. Monographs of the Society for Research in Child Development, 78(4), 70–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Carlozzi N. E., Tulsky D. S., Kail R. V., & Beaumont J. L. (2013). VI. NIH Toolbox Cognition Battery (CB): measuring processing speed. Monographs of the Society for Research in Child Development, 78(4), 88–102. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Zelazo P. D., Anderson J. E., Richler J., Wallner-Allen K., Beaumont J. L., Weintraub S. (2013). II. NIH Toolbox Cognition Battery (CB): measuring executive function and attention. Monographs of the Society for Research in Child Development, 78(4), 16–33. [DOI] [PubMed] [Google Scholar]
13.Bauer P. J., Dikmen S. S., Heaton R. K., Mungas D., Slotkin J., & Beaumont J. L. (2013). III. NIH Toolbox Cognition Battery (CB): measuring episodic memory. Monographs of the Society for Research in Child Development, 78(4), 34–48. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Bright P., & van der Linde I. (2020). Comparison of methods for estimating premorbid intelligence. Neuropsychological Rehabilitation, 30(1), 1–14. [DOI] [PubMed] [Google Scholar]
15.Fisher R. A. (1915). Frequency distribution of the values of the correlation coefficient in samples from an indefinitely large population. Biometrika, 10, 507–521. [Google Scholar]
16.Fisher R. A. (1921). On the “probable error” of a coefficient of correlation deduced from a small sample. Metron, 1, 3–32. [Google Scholar]
17.Bulut O., & Kim D. (2021). The Use of Data Imputation when Investigating Dimensionality in Sparse Data from Computerized Adaptive Tests. Journal of Applied Testing Technology. 22(2), 1–13. [Google Scholar]
18.Taasoobshirazi G., & Wang S. (2016). The Performance of the SRMR, RMSEA, CFI, and TLI: An Examination of Sample Size, Path Size, and Degrees of Freedom. Journal of Applied Quantitative Methods 11(3), 31–39. [Google Scholar]
19.Feldmann M., Bataillard C., Ehrler M., Ullrich C., Knirsch W., Gosteli-Peter M. A., … & Latal B. (2021). Cognitive and executive function in congenital heart disease: A meta-analysis. Pediatrics, 148(4). [DOI] [PubMed] [Google Scholar]
20.Kenny D. A., Kaniskan B., & McCoach D. B. (2015). The performance of RMSEA in models with small degrees of freedom. Sociological Methods & Research, 44(3), 486–507. [Google Scholar]
21.Jackson J. L., Misiti B., Bridge J. A., Daniels C. J., & Vannatta K. (2015). Emotional Functioning of Adolescents and Adults with Congenital Heart Disease: a Meta-Analysis. Congenital Heart Disease, 10(1), 2–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Calderon J., Bellinger D. C., Hartigan C., Lord A., Stopp C., Wypij D., & Newburger J. W. (2019). Improving neurodevelopmental outcomes in children with congenital heart disease: protocol for a randomised controlled trial of working memory training. BMJ Open, 9(2), e023304. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Cousino M. K., Pasquali S. K., Romano J. C., Norris M. D., Yu S., Reichle G., Lowery R., Viers S., & Schumacher K. R. (2020). Impact of the COVID-19 Pandemic on Congenital Heart Disease Care and Emotional Wellbeing. Cardiology in the Young, 1–21. Advance online publication. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Siciliano R. E., Murphy L. K., Prussien K. V., Henry L. M., Watson K. H., Patel N. J., Lee C. A., McNally C. M., Markham L. W., Compas B. E., & Jordan L. C. (2020). Cognitive and Attentional Function in Children with Hypoplastic Left Heart Syndrome: A Pilot Study. Journal of Clinical Psychology in Medical Settings, 28(3), 619–626. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Chadwick L., Roth E., Minich N. M., Taylor H. G., Bigler E. D., Cohen D. M., Bacevice A., Mihalov L. K., Bangert B. A., Zumberge N. A., & Yeates K. O. (2021). Cognitive Outcomes in Children with Mild Traumatic Brain Injury: An Examination Using the National Institutes of Health Toolbox Cognition Battery. Journal of Neurotrauma, 38(18), 2590–2599. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Shields R. H., Kaat A. J., McKenzie F. J., Drayton A., Sansone S. M., Coleman J., Michalak C., Riley K., Berry-Kravis E., Gershon R. C., Widaman K. F., & Hessl D. (2020). Validation of the NIH Toolbox Cognitive Battery in intellectual disability. Neurology, 94(12), e1229–e1240. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Thompson M. D., Martin R. C., Grayson L. P., Ampah S. B., Cutter G., Szaflarski J. P., & Bebin E. M. (2020). Cognitive function and adaptive skills after a one-year trial of cannabidiol (CBD) in a pediatric sample with treatment-resistant epilepsy. Epilepsy & Behavior, 111, 107299. [DOI] [PubMed] [Google Scholar]
28.Hackett K., Krikorian R., Giovannetti T., Melendez-Cabrero J., Rahman A., Caesar E. E., Chen J. L., Hristov H., Seifan A., Mosconi L., & Isaacson R. S. (2018). Utility of the NIH Toolbox for Assessment of Prodromal Alzheimer’s Disease and Dementia. Alzheimer’s & Dementia (Amsterdam, Netherlands), 10, 764–772. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Ma Y., Carlsson C. M., Wahoske M. L., Blazel H. M., Chappell R. J., Johnson S. C., Asthana S., & Gleason C. E. (2021). Latent Factor Structure and Measurement Invariance of the NIH Toolbox Cognition Battery in an Alzheimer’s Disease Research Sample. Journal of the International Neuropsychological Society, 27(5), 412–425. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Watson W., Pedowitz A., Nowak S., Neumayer C., Kaplan E., & Shah S. (2020). Feasibility of National Institutes of Health Toolbox Cognition Battery in Pediatric Brain Injury Rehabilitation Settings. Rehabilitation Psychology, 65(1), 22–30. [DOI] [PubMed] [Google Scholar]
31.Meredith L. R., Lim A. C., & Ray L. A. (2020). Neurocognitive performance in alcohol use disorder using the NIH toolbox: Role of severity and sex differences. Drug and Alcohol Dependence, 216, 108269. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Taylor B. K., Frenzel M. R., Eastman J. A., Wiesman A. I., Wang Y. P., Calhoun V. D., Stephen J. M., & Wilson T. W. (2020). Reliability of the NIH toolbox cognitive battery in children and adolescents: a 3-year longitudinal examination. Psychological medicine, 1–10. Advance online publication. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Loccoh E. C., Yu S., Donohue J., Lowery R., Butcher J., Pasquali S. K., Goldberg C. S., & Uzark K. (2018). Prevalence and risk factors associated with non-attendance in neurodevelopmental follow-up clinic among infants with CHD. Cardiology in the young, 28(4), 554–560. [DOI] [PubMed] [Google Scholar]
34.Naef N., Liamlahi R., Beck I., Bernet V., Dave H., Knirsch W., & Latal B. (2017). Neurodevelopmental Profiles of Children with Congenital Heart Disease at School Age. The Journal of pediatrics, 188, 75–81. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

NIHPP2023.02.11.23285800v1-supplement-1.pdf^{(33.3KB, pdf)}

[R1] 1.Phillips J. M., & Longoria J. M. (2020). Addressing the neurodevelopmental needs of children and adolescents with congenital heart disease: A review of the existing intervention literature. Child Neuropsychology, 26(4), 433–459. [DOI] [PubMed] [Google Scholar]

[R2] 2.Marino B. S., Lipkin P. H., Newburger J. W., Peacock G., Gerdes M., Gaynor J. W., Mussatto K. A., Uzark K., Goldberg C. S., Johnson W. H. Jr, Li J., Smith S. E., Bellinger D. C., Mahle W. T., & American Heart Association Congenital Heart Defects Committee, Council on Cardiovascular Disease in the Young, Council on Cardiovascular Nursing, and Stroke Council (2012). Neurodevelopmental outcomes in children with congenital heart disease: evaluation and management: a scientific statement from the American Heart Association. Circulation, 126(9), 1143–1172. [DOI] [PubMed] [Google Scholar]

[R3] 3.Ilardi D., Sanz J., Cassidy A., Sananes R., Rollins C., Ullman Shade C., & Bellinger D. (2020). Neurodevelopmental evaluation for school-age children with congenital heart disease: Recommendations from the cardiac neurodevelopmental outcome collaborative. Cardiology in the Young, 30(11), 1623–1636. [DOI] [PubMed] [Google Scholar]

[R4] 4.Gershon R. C., Slotkin J., Manly J. J., Blitz D. L., Beaumont J. L., Schnipke D., Wallner-Allen K., Golinkoff R. M., Gleason J. B., Hirsh-Pasek K., Adams M. J., & Weintraub S. (2013). IV. NIH Toolbox Cognition Battery (CB): measuring language (vocabulary comprehension and reading decoding). Monographs of the Society for Research in Child Development, 78(4), 49–69. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] 5.Bauer P. J., & Zelazo P. D. (2013). IX. NIH Toolbox Cognition Battery (CB): summary, conclusions, and implications for cognitive development. Monographs of the Society for Research in Child Development, 78(4), 133–146. [DOI] [PubMed] [Google Scholar]

[R6] 6.Mungas D., Widaman K., Zelazo P. D., Tulsky D., Heaton R. K., Slotkin J., Blitz D. L., & Gershon R. C. (2013). VII. NIH Toolbox Cognition Battery (CB): factor structure for 3 to 15 year olds. Monographs of the Society for Research in Child Development, 78(4), 103–118. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Salsman J. M., Butt Z., Pilkonis P. A., Cyranowski J. M., Zill N., Hendrie H. C., Kupst M. J., Kelly M. A., Bode R. K., Choi S. W., Lai J. S., Griffith J. W., Stoney C. M., Brouwers P., Knox S. S., & Cella D. (2013). Emotion assessment using the NIH Toolbox. Neurology, 80(11 Suppl 3), S76–S86. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Weintraub S., Dikmen S. S., Heaton R. K., Tulsky D. S., Zelazo P. D., Bauer P. J., Carlozzi N. E., Slotkin J., Blitz D., Wallner-Allen K., Fox N. A., Beaumont J. L., Mungas D., Nowinski C. J., Richler J., Deocampo J. A., Anderson J. E., Manly J. J., Borosh B., Havlik R., … Gershon R. C. (2013). Cognition assessment using the NIH Toolbox. Neurology, 80(11 Suppl 3), S54–S64. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Bellinger D. C., & Newburger J. W. (2013). Late neurodevelopmental outcomes in children with congenital heart disease. In Baron I. S. & Rey-Casserly C. (Eds.), Pediatric Neuropsychology: Medical advances and lifespan outcomes (p. 99–112). Oxford University Press. [Google Scholar]

[R10] 10.Tulsky D. S., Carlozzi N. E., Chevalier N., Espy K. A., Beaumont J. L., & Mungas D. (2013). V. NIH Toolbox Cognition Battery (CB): measuring working memory. Monographs of the Society for Research in Child Development, 78(4), 70–87. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] 11.Carlozzi N. E., Tulsky D. S., Kail R. V., & Beaumont J. L. (2013). VI. NIH Toolbox Cognition Battery (CB): measuring processing speed. Monographs of the Society for Research in Child Development, 78(4), 88–102. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Zelazo P. D., Anderson J. E., Richler J., Wallner-Allen K., Beaumont J. L., Weintraub S. (2013). II. NIH Toolbox Cognition Battery (CB): measuring executive function and attention. Monographs of the Society for Research in Child Development, 78(4), 16–33. [DOI] [PubMed] [Google Scholar]

[R13] 13.Bauer P. J., Dikmen S. S., Heaton R. K., Mungas D., Slotkin J., & Beaumont J. L. (2013). III. NIH Toolbox Cognition Battery (CB): measuring episodic memory. Monographs of the Society for Research in Child Development, 78(4), 34–48. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] 14.Bright P., & van der Linde I. (2020). Comparison of methods for estimating premorbid intelligence. Neuropsychological Rehabilitation, 30(1), 1–14. [DOI] [PubMed] [Google Scholar]

[R15] 15.Fisher R. A. (1915). Frequency distribution of the values of the correlation coefficient in samples from an indefinitely large population. Biometrika, 10, 507–521. [Google Scholar]

[R16] 16.Fisher R. A. (1921). On the “probable error” of a coefficient of correlation deduced from a small sample. Metron, 1, 3–32. [Google Scholar]

[R17] 17.Bulut O., & Kim D. (2021). The Use of Data Imputation when Investigating Dimensionality in Sparse Data from Computerized Adaptive Tests. Journal of Applied Testing Technology. 22(2), 1–13. [Google Scholar]

[R18] 18.Taasoobshirazi G., & Wang S. (2016). The Performance of the SRMR, RMSEA, CFI, and TLI: An Examination of Sample Size, Path Size, and Degrees of Freedom. Journal of Applied Quantitative Methods 11(3), 31–39. [Google Scholar]

[R19] 19.Feldmann M., Bataillard C., Ehrler M., Ullrich C., Knirsch W., Gosteli-Peter M. A., … & Latal B. (2021). Cognitive and executive function in congenital heart disease: A meta-analysis. Pediatrics, 148(4). [DOI] [PubMed] [Google Scholar]

[R20] 20.Kenny D. A., Kaniskan B., & McCoach D. B. (2015). The performance of RMSEA in models with small degrees of freedom. Sociological Methods & Research, 44(3), 486–507. [Google Scholar]

[R21] 21.Jackson J. L., Misiti B., Bridge J. A., Daniels C. J., & Vannatta K. (2015). Emotional Functioning of Adolescents and Adults with Congenital Heart Disease: a Meta-Analysis. Congenital Heart Disease, 10(1), 2–12. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] 22.Calderon J., Bellinger D. C., Hartigan C., Lord A., Stopp C., Wypij D., & Newburger J. W. (2019). Improving neurodevelopmental outcomes in children with congenital heart disease: protocol for a randomised controlled trial of working memory training. BMJ Open, 9(2), e023304. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] 23.Cousino M. K., Pasquali S. K., Romano J. C., Norris M. D., Yu S., Reichle G., Lowery R., Viers S., & Schumacher K. R. (2020). Impact of the COVID-19 Pandemic on Congenital Heart Disease Care and Emotional Wellbeing. Cardiology in the Young, 1–21. Advance online publication. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] 24.Siciliano R. E., Murphy L. K., Prussien K. V., Henry L. M., Watson K. H., Patel N. J., Lee C. A., McNally C. M., Markham L. W., Compas B. E., & Jordan L. C. (2020). Cognitive and Attentional Function in Children with Hypoplastic Left Heart Syndrome: A Pilot Study. Journal of Clinical Psychology in Medical Settings, 28(3), 619–626. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R25] 25.Chadwick L., Roth E., Minich N. M., Taylor H. G., Bigler E. D., Cohen D. M., Bacevice A., Mihalov L. K., Bangert B. A., Zumberge N. A., & Yeates K. O. (2021). Cognitive Outcomes in Children with Mild Traumatic Brain Injury: An Examination Using the National Institutes of Health Toolbox Cognition Battery. Journal of Neurotrauma, 38(18), 2590–2599. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] 26.Shields R. H., Kaat A. J., McKenzie F. J., Drayton A., Sansone S. M., Coleman J., Michalak C., Riley K., Berry-Kravis E., Gershon R. C., Widaman K. F., & Hessl D. (2020). Validation of the NIH Toolbox Cognitive Battery in intellectual disability. Neurology, 94(12), e1229–e1240. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] 27.Thompson M. D., Martin R. C., Grayson L. P., Ampah S. B., Cutter G., Szaflarski J. P., & Bebin E. M. (2020). Cognitive function and adaptive skills after a one-year trial of cannabidiol (CBD) in a pediatric sample with treatment-resistant epilepsy. Epilepsy & Behavior, 111, 107299. [DOI] [PubMed] [Google Scholar]

[R28] 28.Hackett K., Krikorian R., Giovannetti T., Melendez-Cabrero J., Rahman A., Caesar E. E., Chen J. L., Hristov H., Seifan A., Mosconi L., & Isaacson R. S. (2018). Utility of the NIH Toolbox for Assessment of Prodromal Alzheimer’s Disease and Dementia. Alzheimer’s & Dementia (Amsterdam, Netherlands), 10, 764–772. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] 29.Ma Y., Carlsson C. M., Wahoske M. L., Blazel H. M., Chappell R. J., Johnson S. C., Asthana S., & Gleason C. E. (2021). Latent Factor Structure and Measurement Invariance of the NIH Toolbox Cognition Battery in an Alzheimer’s Disease Research Sample. Journal of the International Neuropsychological Society, 27(5), 412–425. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] 30.Watson W., Pedowitz A., Nowak S., Neumayer C., Kaplan E., & Shah S. (2020). Feasibility of National Institutes of Health Toolbox Cognition Battery in Pediatric Brain Injury Rehabilitation Settings. Rehabilitation Psychology, 65(1), 22–30. [DOI] [PubMed] [Google Scholar]

[R31] 31.Meredith L. R., Lim A. C., & Ray L. A. (2020). Neurocognitive performance in alcohol use disorder using the NIH toolbox: Role of severity and sex differences. Drug and Alcohol Dependence, 216, 108269. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] 32.Taylor B. K., Frenzel M. R., Eastman J. A., Wiesman A. I., Wang Y. P., Calhoun V. D., Stephen J. M., & Wilson T. W. (2020). Reliability of the NIH toolbox cognitive battery in children and adolescents: a 3-year longitudinal examination. Psychological medicine, 1–10. Advance online publication. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] 33.Loccoh E. C., Yu S., Donohue J., Lowery R., Butcher J., Pasquali S. K., Goldberg C. S., & Uzark K. (2018). Prevalence and risk factors associated with non-attendance in neurodevelopmental follow-up clinic among infants with CHD. Cardiology in the young, 28(4), 554–560. [DOI] [PubMed] [Google Scholar]

[R34] 34.Naef N., Liamlahi R., Beck I., Bernet V., Dave H., Knirsch W., & Latal B. (2017). Neurodevelopmental Profiles of Children with Congenital Heart Disease at School Age. The Journal of pediatrics, 188, 75–81. [DOI] [PubMed] [Google Scholar]

PERMALINK

This is a preprint.

Psychometric Properties of the NIH Toolbox Cognition and Emotion Batteries Among Children and Adolescents with Congenital Heart Defects

Julia Wallace

Rafael Ceschin

Vince K Lee

Nancy H Beluk

Cheryl Burns

Sue Beers

Cecilia Lo

Ashok Panigrahy

Daryaneh Badaly

Abstract

Objective:

Study Design:

Results:

Conclusion:

Methods

Participants

Assessment Instruments

NIHTB Cognitive Battery

NIHTB Emotion Battery

Clinician-Issued Battery and Rating Scales

Analysis Plan

Results

Cognitive Battery

Group Differences

Table 1.

Developmental Trends

Convergent Validity

Table 2.

Concurrent Validity

Table 3.

Emotion Battery

Group Differences

Confirmatory Factor Analysis

Table 4.

Internal Consistency

Convergent Validity

Table 5.

Discussion

Supplementary Material

Acknowledgments

Abbreviations

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases