Abstract
Background
Early childhood self-regulation (SR) is key for many health- and education-related outcomes across the life span. Kindergarten age is a crucial period for SR development, and within this developmental window, potential SR difficulties can still be compensated for (e.g., through interventions). However, brief, comprehensive, and easy-to-use instruments that efficiently measure SR and identify SR difficulties are scarce. To address this need, we used items of an internationally applied kindergarten teacher questionnaire, the Early Development Instrument (EDI), to develop and validate a specific SR measurement scale.
Methods
The psychometric evaluation and validation of the selected SR items were performed on data collected with the German version of the EDI (GEDI) in two independent data sets: (a) the development dataset, with 191 children, and (b) the validation dataset, with 184 children. Both included three- to six-year-old children and contained retest and interrater reliability data. First, three independent raters selected, based on theory, items eligible to form an SR scale from the two SR-relevant GEDI domains "social competence" and "emotional maturity". Second, exploratory and confirmatory factor analysis using structural equation modeling examined the item structure across both data sets. This resulted in a defined SR scale, for which we assessed internal consistency, test–retest and interrater reliability, cross-validation, and concurrent validity, the latter using correlations and descriptive agreement (Bland–Altman (BA) plots) with an existing validated SR instrument (the Kindergarten Behavioral Scales).
Results
Confirmatory factor analysis across both data sets yielded the best fit indices with 13 of the 20 GEDI items initially deemed eligible for SR measurement and a three-factor structure: a) behavioral response inhibition, b) cognitive inhibition, c) selective or focused attention (RMSEA: 0.019, CFI: 0.998). Psychometric evaluation of the resulting 13-item GEDI-SR scale revealed good internal consistency (0.92) and test–retest and interrater reliability (0.85 and 0.71, respectively). Validity testing yielded stability across populations and good concurrent validity with the Kindergarten Behavioral Scales (Pearson correlation coefficient: mean 0.72, range 0.61 to 0.84).
Conclusions
The GEDI contains 13 items suitable to assess SR, either as part of regular EDI developmental monitoring or as a valid stand-alone scale. This short 13-item (G)EDI-SR scale may allow early detection of children with SR difficulties in the kindergarten setting in the future and could serve as the basis for public health intervention planning. To attain this goal, future research should establish appropriate reference values using a representative standardization sample.
Keywords: Self-regulation, Child preschool [MeSH], Child development [MeSH], Early Development Instrument, Germany, Environment and public health [MeSH], Monitoring
Introduction
Self-regulation (SR) is a fundamental developmental skill impacting a child’s performance and health across the lifespan [1, 2]. It describes the ability to adapt one's thoughts, feelings, and behavior to the demands of a particular situation in order to optimally pursue personal goals [3]. Moreover, SR refers to processes that enable us to maintain optimal levels of emotional, motivational, and cognitive arousal. It […] overlaps substantially with inhibitory control, a core dimension of executive functions [4].
From a medical, psychological and pedagogical perspective, good SR skills are considered a protective factor regarding mental [5–7] and physical health [8] and have been found to longitudinally predict health, success in professional and private life, satisfaction with life and social equity in adulthood [1].
Accumulating evidence from the last two decades suggests that more and more children from school age to adolescence have difficulties in regulating their behavior [9]. For example, the prevalence of behavioral and psychological problems related to SR in kindergarten and primary school has been steadily increasing [2, 10–12]. This not only presents challenges for the daily work of teachers [13–15]; studies also suggest that these problems have a 50% chance of persisting into adolescence [16], resulting in a high societal burden and possible medical costs [17, 18].
With the window for promoting children’s SR skills opening years before school entry, early identification of children with SR difficulties combined with early intervention, e.g. in kindergarten, seems key from a public health perspective. As SR development depends on environmental factors and experiences [19–21] (besides biological maturity), interventions that change the environment and experiences have the potential to effectively support child SR development [22–24]. Current systematic reviews have shown the effectiveness of different SR-promoting interventions in early childhood education and care environments (ECECs) [23, 24]. Other studies showed that supportive environmental factors such as high-quality teacher–child interaction [25] are positively associated with SR development in children. This suggests that a public health approach combining the efficient early identification of children with SR difficulties with the implementation of effective interventions in the kindergarten setting has high potential.
To identify vulnerable children, valid measurement of SR in kindergartens is necessary. As SR skills are part of psychological and social-emotional child development, questionnaires that are used to assess the latter might be promising. These include the Behavioral and Emotional Rating Scale (BERS, 26 items, domains: behavioral self-control, emotional self-control) [26], the Child Behavior Checklist (CBCL, 33 items, domains: emotionally reactive, attention problems, aggressive behavior) [27], the Child Behavior Questionnaire (CBQ, 12 items, domains: attentional focusing, inhibitory control) [28], the Child Behavior Rating Scale (CBRS, 17 items, domains: self-regulation, social/interpersonal skills) [29], Conners' rating scale – teacher form (CTRS, 28 items, domains: conduct problems, day-dreaming inattention, anxious fearful, hyperactivity) [30], the Devereux Early Childhood Assessment (DECA, 8 items, domain: self-control) [31], the Social Competence and Behavior Evaluation – Preschool Edition (SCBE, 20 items, domains: anger-aggression, social competence) [32], the Social Competence Scale (SCS, 13 items, domains: prosocial behavior, emotion regulation) [33], the Strengths and Difficulties Questionnaire (SDQ, 25 items, domains: emotional symptoms, conduct problems, hyperactivity/inattention, peer relationship problems, prosocial behavior) [34, 35], and the Behavior Rating Inventory of Executive Function – Preschool Version (BRIEF-P, 63 items, domains: inhibition, attention shift, emotional control, working memory, planning/organizing) [36]. Although many instruments might be available to measure SR skills, the most important ones were suggested to be the CBQ, BRIEF, CBCL and SDQ [37]. However, from a public health perspective, all of these are too comprehensive and long (e.g. number of items for SR measurement = 12, 26, 23, 25, respectively) for screening purposes, and do not feature SR as a separate construct.
Several of these questionnaires also exist in German, e.g. the SDQ or the BRIEF-P [38]. Furthermore, additional questionnaires exist that were developed in the German context and are primarily used in Germany, such as the Kindergarten Behavioral Scales (VSK, 49 items, domains: anxiety, hyperactivity and inattention, aggressive behavior, emotional dysregulation, social competence, emotional knowledge/empathy, self-regulation) [39], the Organizing Education in Kindergarten screening (BIKO, 33 items, six domains: willingness to cooperate with educational staff, integration into the group, problem behavior towards peers, prosocial behavior towards peers, play and task behavior, regulation of emotions) [40, 41], the Dortmund Developmental Screening for Kindergarten (DESK 3–6 R, 45 to 50 items depending on age, domains: fine motor skills, gross motor skills, social competence, social behavior, social interaction, attention and concentration, cognition and language, cognition, basic competence literacy, basic competence numeracy, language and communication) [41] or the questionnaire Competencies and Interests of Children (KOMPIK, 158 items across 11 domains: motor skills, social and emotional behavior, motivation, language and early literacy, maths, science, music, design, health, well-being, and social relationships) [42].
While these instruments meet scientific standards, they are all long and quite time-consuming (minimum 40 items; the DESK even contains performance tasks over and above questionnaire items, which requires even more time and a suitable physical environment in kindergartens). In addition, most of them do not feature SR as a separate construct and are far too comprehensive (e.g. they measure development or behavioral issues in general), which reduces their suitability as efficient SR screening tools in the kindergarten environment and might also explain why they have failed to gain wide use in Germany.
To move the field of developmental monitoring and public health intervention planning in kindergartens in Germany forward, we previously adapted the internationally widely used Canadian Early Development Instrument (EDI) [43] to the German context and published the German version of the EDI (GEDI) [44]. The EDI is a valid and reliable 103-item teacher questionnaire assessing a child’s ability to meet age-appropriate developmental expectations in five domains (see below), developed by Magdalena Janus and colleagues at the Offord Center for Child Studies at McMaster University, Ontario. The instrument was designed as a screening and developmental monitoring tool [45–49]. It serves to collect data on the development of 3- to 6-year-old children in all relevant developmental domains [50]. In Canada and other countries, the EDI is integrated into a public health monitoring and intervention planning approach, which results in a tailored implementation of interventions in kindergartens to support child health and development.
Based on the features described above, the EDI could provide an optimal basis for developing a brief but psychometrically sound and fully questionnaire-based screening instrument to detect SR difficulties in kindergarten children. In addition, the worldwide use of the EDI would allow SR to be assessed as part of regular EDI monitoring in kindergartens in many countries.
Therefore, this study assesses whether it is possible to develop a valid scale measuring SR by recombining items of the theoretically relevant EDI domains "social competence" and "emotional maturity". The following research questions guide our study:
a) Can existing items from the (G)EDI be selected based on solid theoretical and conceptual considerations and recombined to form a valid (stand-alone) SR scale?
b) Does the resulting (G)EDI-SR scale have adequate psychometric properties and validity?
Methods
Recruitment, data collection and sample description
The present study collected data with the (G)EDI teacher questionnaire [43, 44] in two independent data sets: (a) the development dataset, with 191 children, collected in June 2016 in three different towns to pilot the EDI in Germany (more details on recruitment and psychometric features are published elsewhere [44]), and (b) the validation dataset, with 184 children, collected in fall 2021 in kindergartens in a small town in the south-west of Germany (population approx. 15,000) that intended to use the GEDI as the starting point for a community-based early childhood prevention strategy. In both data collections, teachers completed the full GEDI and the VSK-SR subscale for all participating children. Teachers were eligible to fill out the GEDI if they had known the children for at least one month, had sufficient command of the German language, and had taken part in a training session prior to the assessment. This training ensured that all teachers had the same level of knowledge about the instrument, its purpose and its completion.
All data were collected electronically. Teachers assigned each child an individual pseudonym so that first and second surveys could be matched to the same child with 100% accuracy.
Eligibility criteria for the children to whom the GEDI was administered comprised age 3 to 6 years, the presence of written informed parental consent and the absence of special needs. Table 1 displays descriptive characteristics for both samples and provides the number of eligible and finally participating children and teachers. Ethical approvals for both data collections were granted by the Ethics committee of the Medical Faculty Mannheim, Heidelberg University (development sample: 2015-640N-MA; validation sample: 2016-588N-MA). The teachers’ participation was taken as an implicit consent to participate in our study.
Table 1.
Characteristic | Subgroup | N development sample (%) | N validation sample (%) |
---|---|---|---|
Eligible (invited) | Children | 444 | 385 |
Eligible (invited) | Kindergartens | 9 | 6 |
Eligible (invited) | Teachers | 60 | 75 |
Participating | Children with parental consent | 225 (51) | 209 (54) |
Participating | Kindergartens | 9 (100) | 6 (100) |
Participating | Teachers | 60 (100) | 33 (44) |
Cases excluded upon reasons | | 34^a (15) | 25^b (12) |
Cases in dataset | | 191 (43) | 184 (48) |
Mean age (range; SD) | | 4.27 (3 to 6; 1.05) | 4.25 (3 to 6; 0.94) |
n 3 years | | 58 (30) | 46 (25) |
n 4 years | | 60 (31) | 65 (35) |
n 5 years | | 43 (23) | 55 (30) |
n 6 years | | 30 (16) | 18 (10) |
Gender (female) | | 49% | 51% |
German second language | | 18% | 7% |
SES low/middle/high | | 2.6/49.2/40.3% | - |
^a n = 5 with missing data or a “don’t know” response to the special needs assignation variable; n = 28 with special needs assignation; n = 1 under the age of three
^b n = 22 due to an affirmative answer to the special needs question; n = 3 under the age of three
SES = socioeconomic status
Study design – overview
In a first step, GEDI items that theoretically map to SR were selected, which resulted in eligible GEDI-SR items. To assess the construct and dimensions of the eligible GEDI-SR items (see below), we used the development dataset, resulting in a first GEDI-SR scale. The GEDI data from the two independent samples were then used to cross-validate the item and factor structure of the GEDI-SR scale from the development data set to the validation data set. In a next step, using the validation data set, the GEDI-SR scale was compared with the VSK-SR items to assess concurrent validity of the GEDI-SR scale. Moreover, our reliability analyses used data from repeated retests of the GEDI within the validation sample. In the following, measurements and related statistical analyses for the different steps of the study design are presented in more detail.
Measurements
The GEDI as basis for SR scale development
The GEDI, like the original EDI, is a kindergarten teacher questionnaire to assess early childhood development in the following domains: "physical health and well-being" (13 items), "social competence" (26 items), "emotional maturity" (30 items), "language and cognitive development" (25 items), and "communication and general knowledge" (8 items). Ratings are based on accumulated teacher impression and observation (and not on performance tasks). As a public health tool, the (G)EDI can be helpful in several ways: e.g. for teachers to create optimal learning opportunities tailored to individual child developmental profiles, for school boards and ministries to plan resource allocations to kindergartens (e.g. child-teacher relation), and to describe specific intervention needs in kindergartens, which could be used for public health monitoring and planning (including to convince funders of intervention projects) [51].
The validation of the GEDI in the German context across the original five main domains demonstrated excellent internal consistency (0.73 < α < 0.99), moderate to good test–retest and interrater reliability (0.50 to 0.81 and 0.48 to 0.71, respectively [p-value < 0.05]), and good concurrent validity with other developmental instruments (range: 0.32 to 0.67) (for details see [44]).
However, focus groups with teachers after the first data collection in Germany revealed a need to provide age-specific ratings (the original instrument is applied to 5-year-old children in their preschool year, while kindergartens in Germany serve children from the age of 3 to 6). Using item response analyses, we examined the appropriateness of the age-related information content and removed redundancies (e.g. items from the original 103 that did not provide additional information for specific age groups), which led to an overall shortening of the GEDI as compared to the EDI. The age-specific, shorter GEDI contains different numbers of items depending on the age group: n = 69 for 3- to 4-year-olds, n = 65 for 5-year-olds, and n = 61 for 6-year-olds. In the present study, only the items of the SR-relevant domains of the GEDI, "social competence" (n = 15 and 16 items for 3- to 4-year-olds and 5- to 6-year-olds, respectively) and "emotional maturity" (n = 21 items for all age groups), were considered and analysed.
The VSK as measure to assess concurrent validity
Besides the GEDI, we applied the SR subscale of the German Kindergarten Behavioral Scales (Verhaltensskala für den Kindergarten, VSK-SR) [39] to assess concurrent validity. The VSK comprises 49 items in seven domains: anxiety, hyperactivity and inattention, aggressive behavior, emotional dysregulation, social competence, emotional knowledge/empathy, and self-regulation. The VSK-SR scale comprises five items, with an internal consistency of 0.79: waits for his or her turn, performs activities he or she does not like, wants things immediately, considers the consequences of his or her own actions, finishes tasks. The concurrent validity of the VSK-SR subscale was assessed against the SDQ [35] and proved to be moderate (-0.67, p-value < 0.001) and thus acceptable [52].
Selection of items: Assessing eligibility and selecting SR-mapping GEDI items
We used a theory-based approach to identify items that might be relevant for the development of an SR scale. As a theoretical basis, we used a widely accepted categorization system of SR [4]. It considers SR as a multidimensional latent construct comprising three closely related sub-dimensions: a) cognitive inhibition, i.e. the inhibition of thoughts and memories, b) selective or focused attention, and c) response inhibition, i.e. self-control/discipline. With these definitions in mind, three independent raters who were professionally familiar with early childhood development (childhood education, occupational therapy, developmental psychology) assessed all items within the GEDI domains of "social competence" and "emotional maturity", which were deemed relevant as these skills are closely related to SR skills [53]. Each item was labeled as either 0 (not mapping to SR) or 1 (mapping to SR). Subsequently, the raters assigned the items mapping to SR to the three sub-dimensions of SR. Interrater agreement was assessed using kappa statistics. Inconsistencies were resolved through discussion including a third independent rater until consensus was reached. This process resulted in items eligible to form the new GEDI-SR scale.
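The agreement step can be illustrated with a minimal sketch. The text above only states that "kappa statistics" were used; the choice of Fleiss' kappa (a common option for more than two raters) is an assumption here, and the function name and the example ratings are hypothetical.

```python
from collections import Counter

def fleiss_kappa(ratings):
    """Fleiss' kappa; `ratings` holds one list of labels per item (one label per rater).

    Assumes every item was rated by the same number of raters and that
    more than one category occurs overall (otherwise 1 - p_e is zero).
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    categories = sorted({label for item in ratings for label in item})
    # How many raters chose each category, per item.
    counts = [Counter(item) for item in ratings]
    # Observed agreement per item, then averaged over items.
    p_items = [
        (sum(c[cat] ** 2 for cat in categories) - n_raters)
        / (n_raters * (n_raters - 1))
        for c in counts
    ]
    p_bar = sum(p_items) / n_items
    # Agreement expected by chance from the overall category proportions.
    p_cat = [sum(c[cat] for c in counts) / (n_items * n_raters) for cat in categories]
    p_e = sum(p ** 2 for p in p_cat)
    return (p_bar - p_e) / (1 - p_e)

# Hypothetical 0/1 labels (not mapping / mapping to SR) from three raters.
example = [[1, 1, 0], [0, 0, 1], [1, 1, 1], [0, 0, 0]]
kappa = fleiss_kappa(example)
```

For pairwise comparisons of two raters, Cohen's kappa would be the analogous statistic.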
Statistical analyses
Operationalization and categorization of responses in the GEDI-SR scale
As in the original EDI, we retained three-point Likert scales for the GEDI (coding: often/very true = 10, sometimes/somewhat true = 5, and never/not true = 0) [43]. Higher mean scores indicate better development. Children were excluded from analyses in a domain if ≥ 30% of values were missing [20]. In the absence of a normative German sample to establish valid cut-offs, and in line with the original EDI procedures, children who scored below the 10th percentile on the ensuing GEDI-SR scale were preliminarily classified as “vulnerable” in terms of SR [54].
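The scoring rules above can be sketched as follows. This is an illustration under stated assumptions, not the authors' Stata code: the function names are hypothetical, the nearest-rank percentile is only one of several conventions, and any reverse-coding of negatively worded items is not covered because this section does not describe it.

```python
# Response coding as stated in the text.
CODES = {"often/very true": 10, "sometimes/somewhat true": 5, "never/not true": 0}

def scale_score(responses, max_missing_pct=30):
    """Mean item score; None if the share of missing answers is >= max_missing_pct.

    `responses` holds one answer per item, with None for a missing answer.
    """
    n_total = len(responses)
    values = [CODES[r] for r in responses if r is not None]
    n_missing = n_total - len(values)
    # Integer comparison avoids floating-point edge cases at exactly 30%.
    if n_missing * 100 >= max_missing_pct * n_total:
        return None
    return sum(values) / len(values)

def percentile_cutoff(scores, pct=10):
    """Nearest-rank percentile of the non-missing scores (assumed convention)."""
    ordered = sorted(s for s in scores if s is not None)
    rank = max(1, -(-pct * len(ordered) // 100))  # ceiling division
    return ordered[rank - 1]

def flag_vulnerable(scores, pct=10):
    """True for children scoring below the percentile cut-off."""
    cutoff = percentile_cutoff(scores, pct)
    return [s is not None and s < cutoff for s in scores]
```

Children without a valid score are never flagged in this sketch; how such cases were handled beyond exclusion is not specified in the text.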
Descriptive analysis of the two data sets
We initially compared descriptive statistics of the development and validation datasets (sample size, mean age, distribution, and scores at the 10th, 25th, 50th and 75th percentiles) and used kernel density plots to reveal differences that might help to explain potential inconsistencies in structural equation modeling (SEM).
Assessment of construct and dimensions of the eligible GEDI-SR items: Psychometric evaluation
We first performed an exploratory and confirmatory factor analysis. Using the development dataset, we applied the measure of sampling adequacy (MSA; < 0.5 unsuitable, ≥ 0.6 usable, > 0.8 good [55]). To test the hypothesis regarding the factor structure among the eligible GEDI-SR items, we conducted an exploratory factor analysis using structural equation modeling (maximum-likelihood method). The comparative fit index (CFI, > 0.95) [56] and the root mean square error of approximation (RMSEA, < 0.05) [57–60] served as goodness-of-fit indicators of the model. To avoid overfitting, we tested the model fitted with the development dataset by recalculating the same model using the validation dataset, aiming to replicate the main structural equation modeling composition of the model (confirmatory factor analysis). Since we were still in the exploration stage, we adjusted correlations among items in the validation dataset where necessary in favor of a better model fit.
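For readers less familiar with the two fit indices, the following sketch shows how they are commonly derived from the chi-square statistics of the fitted model and of the baseline (independence) model. The chi-square values and degrees of freedom below are hypothetical, since they are not reported in this section, and software packages differ in whether N or N − 1 is used in the RMSEA denominator.

```python
import math

def rmsea(chi2, df, n):
    """Root mean square error of approximation (N - 1 convention assumed)."""
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))

def cfi(chi2_model, df_model, chi2_base, df_base):
    """Comparative fit index relative to the baseline (independence) model."""
    d_model = max(chi2_model - df_model, 0.0)
    d_base = max(chi2_base - df_base, 0.0)
    denom = max(d_model, d_base)
    return 1.0 if denom == 0 else 1.0 - d_model / denom

# Hypothetical chi-square values for a sample of 191 children.
fit_rmsea = rmsea(chi2=60.0, df=50, n=191)
fit_cfi = cfi(chi2_model=60.0, df_model=50, chi2_base=1500.0, df_base=78)

# Thresholds named in the text: RMSEA < 0.05 and CFI > 0.95.
acceptable = fit_rmsea < 0.05 and fit_cfi > 0.95
```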
Reliability testing of the GEDI-SR scale
We assessed internal consistency (Cronbach’s alpha) of the GEDI-SR scale resulting from the confirmatory factor analysis and used intraclass correlation coefficients (ICC) to assess test–retest and interrater reliability (< 0.5 = poor, 0.5 to 0.75 = moderate, 0.75 to 0.9 = good, and > 0.9 = excellent [61]). We asked teachers to repeat the GEDI for a randomly selected subset of children (n = 72; 3 children per age group) after two weeks. ICCs indicate the strength of the correlation of the GEDI-SR scores between the two measurement time points. The higher the ICC value, the better the correlation between T1 and T2 and the better the corresponding reliability. Additional plausibility checks using invariant demographic variables (birth quarter, gender) ensured the accuracy of the matching between T1 and T2 data.
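As an illustration of the internal consistency measure, Cronbach's alpha can be computed from item variances and the variance of the sum score, and an ICC can be mapped to the interpretive bands listed above. The sketch assumes complete cases; the example rows are hypothetical, and the assignment of the exact boundary values 0.5, 0.75 and 0.9 to a band is an assumption because the bands in the text share their endpoints.

```python
from statistics import variance

def cronbach_alpha(rows):
    """Cronbach's alpha; `rows` holds one list of item scores per child."""
    k = len(rows[0])
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

def icc_band(icc):
    """Interpretive band for an ICC, following the thresholds in the text."""
    if icc < 0.5:
        return "poor"
    if icc < 0.75:
        return "moderate"
    if icc <= 0.9:
        return "good"
    return "excellent"

# Hypothetical scores of five children on three items coded 0/5/10.
rows = [[10, 10, 5], [5, 5, 5], [0, 5, 0], [10, 5, 10], [5, 0, 5]]
alpha = cronbach_alpha(rows)
```

Applied to the coefficients reported in the abstract, `icc_band(0.85)` falls in the "good" band and `icc_band(0.71)` in the "moderate" band.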
Concurrent validity testing of the GEDI-SR scale
We assessed concurrent validity by means of Pearson correlation coefficients and by plotting differences between the mean GEDI-SR and VSK-SR scores using Bland–Altman (BA) plots for each age group. BA plots are graphical representations that can be used to compare two measurement methods by analyzing the agreement between them: a difference plot combined with calculation of the two (upper and lower) limits of the differences between the methods (the so-called 95% limits of agreement). The x axis shows the mean of the results of the two methods ([A + B]/2), whereas the y axis represents the absolute difference between the two methods ([B − A]) [62, 63]. The closer the points in the plot are aligned around the line of mean difference (line centered at zero of the y-axis), the better the agreement. A good agreement is to be interpreted as good concurrent validity.
To meet the requirement for normality [64, 65], we used the Stata commands gladder and qladder and selected the transformation closest to a normal distribution. To enable cross-measure comparisons in BA plots, GEDI-SR and VSK-SR scores were transformed into z-scores. BA plots were generated using the Stata command concord [66]. The association between the two measures was examined by (i) considering the mean difference and (ii) the scattering of dots around the mean difference line in relation to the latent trait continuum on the x-axis.
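The quantities behind a BA plot can be sketched in a few lines. This is not a re-implementation of the Stata command concord; the function names and the example scores are hypothetical, and the limits of agreement use the common mean difference ± 1.96 SD convention.

```python
from statistics import mean, stdev

def z_scores(values):
    """Standardize values to mean 0 and SD 1 (sample SD)."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]

def bland_altman(a, b):
    """Per-pair means and differences (B - A), mean difference, 95% limits of agreement."""
    means = [(x + y) / 2 for x, y in zip(a, b)]
    diffs = [y - x for x, y in zip(a, b)]
    md, sd = mean(diffs), stdev(diffs)
    return means, diffs, md, md - 1.96 * sd, md + 1.96 * sd

# Hypothetical mean scores of five children on the two scales.
gedi_sr = [2.0, 5.0, 7.5, 10.0, 6.0]
vsk_sr = [1.0, 2.0, 3.5, 4.0, 3.0]

means, diffs, md, lower, upper = bland_altman(z_scores(gedi_sr), z_scores(vsk_sr))
```

Because both measures are z-standardized first, the mean difference is zero by construction, so agreement is judged from the width of the limits and from how the points scatter along the x-axis.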
All analyses were conducted using Stata (StataCorp. 2015. Stata Statistical Software: Release 15. College Station, TX: StataCorp LP.).
Results
Results of the item selection process
The theory-based item selection resulted in a list of 20 eligible GEDI-SR items (Table 2). In the selection process, agreement between the three raters was moderate (kappa = 0.5).
Table 2.
Original GEDI domain | Item | Would you say that this child… |
---|---|---|
Social competence | qc2 | Has the ability to get along with peers |
Social competence | qc5 | Follows rules and instructions |
Social competence | qc7 | Demonstrates self-control |
Social competence | qc9 | Demonstrates respect for adults |
Social competence | qc10 | Demonstrates respect for children |
Social competence | qc11 | Accepts responsibility for actions |
Social competence | qc12 | Listens attentively |
Social competence | qc14 | Completes work on time |
Social competence | qc15 | Works independently |
Social competence | qc16 | Takes care of school materials |
Social competence | qc17 | Works neatly and carefully |
Social competence | qc24 | Is able to follow class routines without reminders |
Emotional maturity | qc37 | Gets into physical fights |
Emotional maturity | qc42 | Can’t sit still, is restless |
Emotional maturity | qc43 | Is distractible, has trouble sticking to any activity |
Emotional maturity | qc44 | Fidgets |
Emotional maturity | qc46 | Has temper tantrums |
Emotional maturity | qc47 | Is impulsive, acts without thinking |
Emotional maturity | qc48 | Has difficulty awaiting turn in games or groups |
Emotional maturity | qc50 | Is inattentive |
Assessment of construct and dimensions of the eligible GEDI-SR items: Psychometric evaluation
The measure of sampling adequacy amounted to MSA = 0.9. Exploratory factor analysis with the development sample revealed three highly significant (p-value < 0.001) interrelated factors (Table 3). The explanations in the right column of this table show that the loadings and allocations of the eligible items to the factors are theory-based and comprehensible. The contents of all items with loadings higher than or equal to 0.4 could be transparently assigned to the corresponding factors. Four items with loadings below 0.4 were worded too generally, and their content did not necessarily refer to the ability to self-regulate. They were therefore removed from consideration, leaving 16 of the initial 20 eligible items. Based on the theoretical background, the ensuing three factors were labeled as: 1) behavioral response inhibition; 2) cognitive inhibition; 3) selective or focused attention.
Table 3.
Variable | Would you say that this child… | Factor1 | Factor2 | Factor3 | Uniqueness | Theory based explanation |
---|---|---|---|---|---|---|
qc10 | Demonstrates respect for children | 0.7440 | | | 0.4707 | Requires to inhibit emotions and behavior |
qc9 | Demonstrates respect for adults | 0.6693 | | | 0.4933 | Requires to inhibit emotions and behavior |
qc37 | Gets into physical fights | 0.6625 | | | 0.6026 | Requires to regulate emotions and needs a certain motivation to regulate behavior |
**qc47** | Is impulsive, acts without thinking | 0.6237 | | | 0.5022 | Impulsivity is the inability to regulate emotions and behavior. If someone is planned, then he can regulate his emotions and act in a self-controlled manner |
**qc5** | Follows rules and instructions | 0.5344 | | | 0.6048 | Requires the ability to motivate oneself to adapt and to inhibit "rebellious" emotions and behave accordingly |
**qc11** | Accepts responsibility for actions | 0.4134 | | | 0.4878 | Requires the ability to stand up for one's own mistakes and to resist the impulse to be offended and "run away". This requires to regulate emotions and behavior by being honest and not offended |
qc7^a | Demonstrates self-control | < 0.4 | | | 0.6445 | Can mean anything and does not separate well. The item is not worded accurately enough |
qc46^a | Has temper tantrums | < 0.4 | | | 0.8090 | You can throw tantrums for very different reasons. However, this does not necessarily mean that one has a bad SR |
qc15 | Works independently | | 0.7598 | | 0.4631 | To be able to work independently, I have to be able to remember things and stay on task |
qc17 | Works neatly and carefully | | 0.7359 | | 0.3816 | To be neat and careful, I need to be able to structure myself and my thoughts |
**qc14** | Completes work on time | | 0.7344 | | 0.4662 | To stay on schedule, I also need to be able to stay on task and focus my thoughts on what I'm doing |
qc24 | Is able to follow class routines without reminders | | 0.5504 | | 0.6140 | Requires the ability to remember things and also be able to recall it again |
qc12 | Listens attentively | | 0.5315 | | 0.4751 | Requires the ability to block out disturbing thoughts and memories |
qc16 | Takes care of school materials | | 0.5002 | | 0.4667 | Requires to be careful and not destroy anything on purpose. Requires the ability to suppress the impulse to destroy, which is sometimes perceptible, and behave appropriately and in a controlled manner |
qc2^a | Has the ability to get along with peers | | < 0.4 | | 0.7658 | Too many things in one item. Doesn't have to be SR ability if someone can get along with another kid |
qc44 | Fidgets | | | 0.7379 | 0.4575 | Fidgeting and being restless and physically active doesn't necessarily mean that a child is not able to concentrate, but certainly often goes hand in hand with it |
**qc43** | Is distractible, has trouble sticking to any activity | | | 0.7372 | 0.4048 | Requires the ability to concentrate and focus attention |
qc42 | Can’t sit still, is restless | | | 0.7171 | 0.4387 | These children have difficulties to focus their attention |
qc50 | Is inattentive | | | 0.6705 | 0.4803 | These children can't concentrate and selectively focus their attention |
**qc48**^a | Has difficulty awaiting turn in games or groups | | | < 0.4 | 0.6657 | Awaiting turn requires patience and waiting is different from attention and concentration |
Note: ^a Item excluded from subsequent analysis (structural equation modeling and BA plots)
Item numbers in bold: Items corresponding with VSK-SR items: Waits for his or her turn (qc5), Performs activities he or she does not like (qc11), Wants things immediately (qc48), Considers the consequences of his or her own actions (qc11, qc47), Finishes tasks (qc14, qc43)
Confirmatory factor analysis with the development dataset using structural equation modeling revealed highly significant correlations at the factor and item level. Three items loaded below 0.6 and were therefore excluded from the final model (Table 4), leaving 13 of the initial 20 eligible items. The model fit was good (RMSEA: 0.029, CFI: 0.993; Table 4), resulting in a 13-item SR scale to be tested further.
Table 4.
Dev = development dataset (N = 191); Val = replication with validation dataset (N = 184)
Factor | Item | Item text | Dev: coefficient (subdomain level) | Dev: coefficient (item level) | Dev: SE | Dev: 95% CI lb | Dev: 95% CI ub | Dev: correlations with other items | Val: coefficient (subdomain level) | Val: coefficient (item level) | Val: SE | Val: 95% CI lb | Val: 95% CI ub | Val: correlations with other items |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
sd16 discipl | qc10 | Demonstrates respect for children | sd17 0.76***; sd18 0.62*** | 0.64*** | 0.05 | 0.54 | 0.75 | qc9 0.26**; qc15 -0.14 (ns); qc14 -0.19*; qc16 0.13 (ns) | sd17 0.66***; sd18 0.77*** | 0.56*** | 0.06 | 0.45 | 0.68 | qc9 0.52***; qc16 0.39*** |
sd16 discipl | qc9 | Demonstrates respect for adults | | 0.67*** | 0.05 | 0.56 | 0.77 | | | 0.49*** | 0.06 | 0.37 | 0.61 | qc16 0.29***; qc42 0.19***; qc50 -0.15* |
sd16 discipl | qc37 | Gets into physical fights | | < 0.6 | | | | | | excluded | | | | |
sd16 discipl | qc47 | Is impulsive, acts without thinking | | < 0.6 | | | | | | excluded | | | | |
sd16 discipl | qc5 | Follows rules and instructions | | 0.68*** | 0.05 | 0.58 | 0.78 | qc15 -0.19* | | 0.97*** | 0.06 | 0.86 | 1.08 | qc11 -1.67 (ns); qc16 0.58 (ns) |
sd16 discipl | qc11 | Accepts responsibility for actions | | 0.72*** | 0.05 | 0.63 | 0.81 | | | 0.86*** | 0.06 | 0.74 | 0.98 | qc15 0.43***; qc17 0.38**; qc16 0.63*** |
sd17 cog | qc15 | Works independently | sd18 0.57*** | 0.5 | 0.05 | 0.49 | 0.71 | qc17 0.13 (ns); qc14 0.225**; qc44 -0.15* | sd18 0.89*** | 0.69*** | 0.05 | 0.59 | 0.79 | qc17 0.23**; qc14 0.13 (ns); qc12 -0.28*; qc42 -0.35*** |
sd17 cog | qc17 | Works neatly and carefully | | 0.6*** | 0.04 | 0.76 | 0.90 | qc12 -0.35** | | 0.71*** | 0.05 | 0.62 | 0.80 | qc12 -0.49***; qc42 -0.18*; qc50 0.14 (ns) |
sd17 cog | qc14 | Completes work on time | | 0.68*** | 0.05 | 0.59 | 0.77 | qc16 -0.19* | | 0.69*** | 0.04 | 0.61 | 0.77 | |
sd17 cog | qc24 | Is able to follow class routines without reminders | | < 0.6 | | | | | | excluded | | | | |
sd17 cog | qc12 | Listens attentively | | 0.74*** | 0.04 | 0.65 | 0.82 | qc43 0.28** | | 0.86*** | 0.03 | 0.81 | 0.92 | |
sd17 cog | qc16 | Takes care of school materials | | 0.76*** | 0.04 | 0.68 | 0.83 | | | 0.67*** | 0.04 | 0.58 | 0.75 | qc42 -0.14* |
sd18 att | qc44 | Fidgets | | 0.64*** | 0.05 | 0.54 | 0.75 | qc42 0.29*** | | 0.73*** | 0.04 | 0.65 | 0.81 | qc42 0.45*** |
sd18 att | qc43 | Is distractible, has trouble sticking to any activity | | 0.81*** | 0.04 | 0.73 | 0.89 | | | 0.87*** | 0.02 | 0.82 | 0.92 | |
sd18 att | qc42 | Can't sit still, is restless | | 0.71*** | 0.05 | 0.61 | 0.80 | | | 0.76*** | 0.04 | 0.69 | 0.83 | |
sd18 att | qc50 | Is inattentive | | 0.71*** | 0.05 | 0.62 | 0.80 | | | 0.77*** | 0.03 | 0.70 | 0.84 | |
RMSEA / CFI | | | | 0.029 / 0.993 | | | | | | 0.019 / 0.998 | | | | |
sd Subdomain, SE Standard error, CI Confidence interval, RMSEA Root mean square error of approximation, CFI Comparative fit index
sd16 = Behavioral response inhibition
sd17 = Cognitive inhibition
sd18 = Selective or focused attention
Note: *p < 0.05, ** p < 0.01, ***p = 0.000
Cross-validation: confirmatory analysis using the validation dataset
We then attempted to replicate the GEDI-SR scale model using the validation dataset. This cross-validation yielded a similarly good fit (RMSEA: 0.019, CFI: 0.998; Table 4), confirming the 13-item scale within a three-factor model structure.
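For readers less familiar with the two fit indices reported here, the following minimal sketch shows how RMSEA and CFI are commonly derived from model and baseline chi-square statistics. The function names and all input values are hypothetical and chosen for illustration only; they are not taken from the study. Some software packages use N instead of N − 1 in the RMSEA denominator or apply estimator-specific corrections, so values from a given program may differ slightly.

```python
import math


def rmsea(chi2: float, df: int, n: int) -> float:
    """Point estimate of RMSEA from model chi-square, degrees of freedom, and sample size."""
    # Misfit beyond what is expected by chance, scaled by df and sample size.
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))


def cfi(chi2_model: float, df_model: int, chi2_base: float, df_base: int) -> float:
    """Comparative fit index relative to a baseline (independence) model."""
    num = max(chi2_model - df_model, 0.0)
    den = max(chi2_model - df_model, chi2_base - df_base, 0.0)
    return 1.0 if den == 0 else 1.0 - num / den


# Hypothetical inputs (NOT the study's chi-square values).
example_rmsea = rmsea(chi2=70.0, df=62, n=191)
example_cfi = cfi(chi2_model=70.0, df_model=62, chi2_base=1500.0, df_base=78)
print(round(example_rmsea, 3), round(example_cfi, 3))
```

Under commonly cited conventions, RMSEA values close to or below 0.05 and CFI values close to or above 0.95 are read as good fit, which is the sense in which the fit of both models in Table 4 is described as good.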
Comparison of the 13-item GEDI-SR scale’s descriptive data across the datasets
Overall, descriptive statistics and age-specific kernel density plots for the development and validation samples (Table 5, Fig. 1) illustrate the underlying distribution of the data. The 10% cut-off (10th percentile) for the overall sample was 5.00 in the development dataset and 5.42 in the validation dataset. The plots show similarly skewed distributions in both datasets, except for 3- and 4-year-old children, whose percentile values differ between the datasets by up to 1.4 points.
Table 5.
Participant information | GEDI-SR scale scores | |||||||
---|---|---|---|---|---|---|---|---|
Age (years) | N | Mean | SD | Min | Max | 10th percentile | 25th percentile | 75th percentile |
development sample | ||||||||
3 | 58 | 7.60 | 1.62 | 3.08 | 10 | 5.00 | 6.92 | 8.85 |
4 | 60 | 7.52 | 2.02 | 3.08 | 10 | 4.42 | 6.54 | 9.23 |
5 | 43 | 8.28 | 1.53 | 4.62 | 10 | 5.77 | 7.31 | 9.62 |
6 | 30 | 8.68 | 1.60 | 2.69 | 10 | 6.92 | 8.08 | 9.62 |
overall | 191 | 7.90 | 1.78 | 2.69 | 10 | 5.00 | 6.92 | 9.23 |
validation sample | ||||||||
3 | 46 | 7.26 | 2.31 | 2.08 | 10 | 3.75 | 5.83 | 9.58 |
4 | 65 | 8.26 | 1.84 | 2.08 | 10 | 5.83 | 7.50 | 9.58 |
5 | 55 | 8.44 | 1.98 | 2.08 | 10 | 5.42 | 7.50 | 10 |
6 | 18 | 8.96 | 1.36 | 5.42 | 10 | 7.50 | 7.92 | 10 |
overall | 184 | 8.13 | 2.03 | 2.08 | 10 | 5.42 | 7.08 | 10 |
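The age-specific percentile values in Table 5 can be reproduced in principle with a few lines of code. The sketch below is an illustration under assumptions: the helper names and the example records are hypothetical, and the percentile definition (linear interpolation between order statistics) is one common choice that may differ from the method used by the authors' statistical software.

```python
from collections import defaultdict


def percentile(values, q):
    """q-th percentile (0 to 100) using linear interpolation between order statistics."""
    xs = sorted(values)
    if not xs:
        raise ValueError("percentile() needs at least one value")
    pos = (len(xs) - 1) * q / 100
    lo = int(pos)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (pos - lo)


def cutoffs_by_age(records, q=10):
    """Group (age, score) records by age and return the q-th percentile per age group."""
    groups = defaultdict(list)
    for age, score in records:
        groups[age].append(score)
    return {age: percentile(scores, q) for age, scores in sorted(groups.items())}


# Hypothetical (age in years, scale score) records, NOT study data.
records = [(3, 2.0), (3, 4.0), (3, 6.0), (3, 8.0), (3, 10.0),
           (4, 5.0), (4, 7.0), (4, 9.0)]
print(cutoffs_by_age(records, q=10))
```

With small age groups, such as the 6-year-olds in both samples, a 10th percentile rests on very few observations, which is one reason why the percentile values in Table 5 are descriptive rather than standardized cut-offs.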
Internal consistency, test–retest and interrater reliability results
Internal consistency (0.89 ≤ α ≤ 0.92), overall test–retest ICC (0.85, 95%-CI: 0.71 to 0.93), and overall interrater ICC (0.71, 95%-CI: 0.43 to 0.89) of the 13-item GEDI-SR scale were good (Table 6). For test–retest and interrater reliability, we obtained 27 (38%) retest pairs and 26 (36%) interrater pairs (children at least 3 years old, without special needs). The interval between T1 and T2 ranged from 6 to 30 days for retest pairs and from 9 to 22 days for interrater pairs. To balance including as many pairs as possible against keeping the interval between T1 and T2 as close to 14 days as possible, we only included pairs with an interval of 13 to 15 days (n = 25 retest pairs and n = 17 interrater pairs). Because of large score differences between T1 and T2 in some pairs, retest ICCs could not be calculated for 6-year-olds, and interrater ICCs could only be calculated for 3-year-olds. Therefore, we only report the overall ICCs in Table 6.
Table 6.
Measure | Age | N | Cronbach's alpha |
---|---|---|---|
Internal consistency | 3 y | 46 | 0.92 |
Internal consistency | 4 y | 65 | 0.90 |
Internal consistency | 5 y | 55 | 0.92 |
Internal consistency | 6 y | 18 | 0.89 |
Internal consistency | overall | 184 | 0.92 |
Measure | Age | N (pairs) | ICC (95%-CI) |
Test–retest reliability | across age groups | 25 | 0.85 (0.71 to 0.93) |
Interrater reliability | across age groups | 17 | 0.71 (0.43 to 0.89) |
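The two kinds of reliability coefficients in Table 6 can be sketched as follows. This is an illustration under assumptions, not the authors' analysis code: the function names and example data are hypothetical, and the ICC form shown (two-way random effects, absolute agreement, single measurement, often written ICC(2,1)) is an assumed choice because this section does not state which ICC model was used.

```python
from statistics import mean, variance


def cronbach_alpha(rows):
    """Cronbach's alpha; rows holds one list of k item scores per child."""
    k = len(rows[0])
    item_vars = [variance([r[j] for r in rows]) for j in range(k)]
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def icc_2_1(ratings):
    """ICC(2,1); ratings holds one list per child with one score per rater or occasion."""
    n, k = len(ratings), len(ratings[0])
    grand = mean(x for row in ratings for x in row)
    row_means = [mean(row) for row in ratings]
    col_means = [mean(row[j] for row in ratings) for j in range(k)]
    # Two-way ANOVA sums of squares: children (rows), raters (columns), residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_err = ss_total - ss_rows - ss_cols
    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Hypothetical data (NOT study data): 4 children x 3 items, and 4 children x 2 raters.
items = [[1, 2, 3], [2, 3, 4], [3, 4, 5], [4, 5, 6]]
pairs = [[1, 2], [2, 3], [3, 4], [4, 5]]
print(cronbach_alpha(items), icc_2_1(pairs))
```

In the second example, the second rater scores every child exactly one point higher than the first. A Pearson correlation would be 1, whereas the absolute-agreement ICC is lower because it penalizes the systematic offset between raters.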
Concurrent validity
Table 7 shows the results of the concurrent validity assessment. With one exception, correlation coefficients indicate strong, statistically significant positive linear correlations in all age groups (range: 0.61 to 0.84). Limits of agreement are furthest apart for 6-year-olds and closest for 5-year-olds (Table 8). Figures 2A to 2E illustrate the extent to which the paired variables match. The more dispersed scatter of points around the mid-section in Figures 2A, 2B, 2C, and 2E reveals that agreement is poorest for children with average SR skills. Children with below-average SR skills (scores < −1 on the x-axis) and those with above-average SR skills (scores > 1 on the x-axis) tend to be underestimated with the GEDI-SR scale compared to the VSK-SR scale. In Figure 2D (5-year-olds), dots are clustered more tightly around the line of mean difference in the mid-section of the x-axis, indicating good agreement between the GEDI-SR and VSK-SR scales in the section of the latent trait where the vast majority of children scored. For children with extreme values around −3, the plot shows a larger measurement error, in that the GEDI-SR scale underestimates children in the lower latent trait range.
Table 7.
VSK subdomain SR (rows) by GEDI-SR scale (columns) | 3 years | 4 years | 5 years | 6 years | Overall |
---|---|---|---|---|---|
3 years | 0.72*** | | | | |
4 years | | 0.70*** | | | |
5 years | | | 0.84*** | | |
6 years | | | | 0.61** | |
overall | | | | | 0.75*** |
Note: VSK "Kindergarten Behavioral Scales", GEDI German version of the Early Development Instrument, SR Self-regulation, *** p < 0.001
Table 8.
Age group | N | Mean difference | SD | 95%-Limits of agreement | Concordance correlation coefficient: Pearson's r (95%-CI) |
---|---|---|---|---|---|
3 years | 46 | -0.000a | 0.75 | -1.47 to 1.47 | 0.72*** (0.54 to 0.83) |
4 years | 65 | -0.000a | 0.77 | -1.52 to 1.52 | 0.70*** (0.55 to 0.81) |
5 years | 55 | 0.000 | 0.56 | -1.11 to 1.11 | 0.84*** (0.74 to 0.90) |
6 years | 18 | -0.000a | 0.92 | -1.80 to 1.80 | 0.58** (0.17 to 0.82) |
overall | 184 | 0.000 | 0.71 | -1.38 to 1.38 | 0.75*** (0.68 to 0.81) |
Note: SD Standard deviation, CI Confidence interval, *** = p < 0.001; a values have a slightly negative tendency, which only becomes apparent beyond the fourth decimal place
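The Bland–Altman quantities in Table 8 (mean difference, SD of the differences, and 95% limits of agreement as mean difference ± 1.96 SD) can be sketched as follows. The sketch assumes that both scales were z-standardized before comparison; this is suggested by the mean differences of about zero and by the axis values mentioned in the text, but it is not stated explicitly in this section. All names and data are hypothetical and are not study data.

```python
import math
from statistics import mean, stdev


def zscores(values):
    """Standardize values to mean 0 and sample SD 1."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]


def pearson_r(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = math.sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))
    return num / den


def bland_altman(x, y):
    """Return mean difference, SD of differences, and 95% limits of agreement."""
    diffs = [a - b for a, b in zip(x, y)]
    md, sd = mean(diffs), stdev(diffs)
    return md, sd, (md - 1.96 * sd, md + 1.96 * sd)


# Hypothetical raw scores of five children on two scales (NOT study data).
gedi_raw = [1.0, 2.0, 3.0, 4.0, 5.0]
vsk_raw = [2.0, 1.0, 4.0, 3.0, 5.0]

gedi_z, vsk_z = zscores(gedi_raw), zscores(vsk_raw)
r = pearson_r(gedi_z, vsk_z)
md, sd, (lower, upper) = bland_altman(gedi_z, vsk_z)
print(r, md, sd, lower, upper)
```

Under this standardization assumption, the SD of the differences is tied to the correlation by SD = sqrt(2(1 − r)), so the limits of agreement narrow as r increases. For example, r = 0.84 gives an SD of about 0.57 and limits of about ±1.11, in line with the 5-year-old row of Table 8.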
Discussion
The aim of the study was to identify items eligible for SR measurement within the (G)EDI domains "social competence" and "emotional maturity" through a theory-based selection process, to develop a GEDI-SR scale from these items, and to assess its dimensions, psychometric properties, and validity.
We identified 20 original (G)EDI items eligible for measuring SR. Starting with these 20 items, we used exploratory factor analysis to assess constructs and dimensions using the development dataset. Cross-validation with both datasets using confirmatory factor analysis was successful and resulted in a 13-item, three-factor GEDI-SR scale model with excellent goodness-of-fit indices for measuring SR in kindergarten children. The GEDI-SR scale’s internal consistency, test–retest and interrater reliability, stability across populations, and concurrent validity with the VSK-SR scale were in the good to excellent range, which qualifies the scale for screening or monitoring purposes. Since all items of this SR scale are inherent to the (G)EDI, SR can now be measured efficiently when administering the (G)EDI, without the need for an additional SR assessment instrument. Alternatively, given its high reliability and validity, the newly developed, short GEDI-SR scale could also be administered as a stand-alone scale.
Development of the GEDI-SR scale and its constructs and dimensions
The sequence of a theory-based selection process followed by a quantitative analysis of the constructs and dimensions of the eligible SR items across two independent datasets successfully reduced the initial 20 items to a short scale of 13 items that measures SR in a valid way. The internal consistency of this scale was high (α ≈ 0.90).
The 13 items of the resulting SR scale showed large correlations at the factor and item level, which indicates a multicomponent latent construct. The three empirically derived factors of the GEDI-SR scale correspond to the theoretical basis of Diamond's conceptual model of SR [33], which supports the scale’s validity. This model consists of the “core” components of SR: 1) behavioral response inhibition; 2) cognitive inhibition; and 3) selective or focused attention (Diamond 2013). A child scoring high on these domains will find it easier to a) meet teachers' expectations, as teachers expect children to behave appropriately with regard to their school readiness and to show SR by treating people and things well and by being able to sit still and listen when needed [67]. Such children will also show b) responsible behavior, i.e., following rules, taking responsibility for their actions, and being mindful of the materials and furniture at the kindergarten; and c) concentration, i.e., being able to conduct activities independently and calmly (e.g., completing painting and handicrafts carefully and on time) and having an appropriate attention span. Finally, children with high levels of SR may be expected to show d) conscientiousness, for example being careful with play materials.
The exploratory factor analysis led to the omission of four items from the eligible SR-item selection: “demonstrates self-control”, “has temper tantrums”, “has the ability to get along with peers”, and “has difficulty awaiting turn in games or groups”. Based on face validity, these items might actually relate to the concept of SR, so it is not fully clear why the exploratory factor analysis suggested their omission. The most probable hypothesis is that these items capture other behavioral domains distinct from the 13 items representing SR. Likewise, the structural equation modeling failed to support the inclusion of the items “gets into physical fights”, “is impulsive, acts without thinking”, and “is able to follow class routines without reminders”, although all three investigators initially considered them appropriate and relevant items to measure SR. This, however, does not seem unusual: other studies on the development of theory- or literature-based questionnaires have also shown that theoretically relevant items are dropped after factor analytic steps [68, 69]. Authors have argued that this might be due to the wording of some items not being appropriate to reflect the latent construct for which they were actually included.
Reliability assessment
The 13-item GEDI-SR scale showed favorable reliability with respect to internal consistency, the results from structural equation modeling, and the retest analyses. Yet, we must acknowledge some limitations regarding test–retest and interrater reliability. Due to the COVID-19 pandemic and difficult organizational conditions in kindergartens, we received considerably fewer pairs of data than intended. With only three retest pairs for 6-year-olds, retest ICCs could not be calculated for this age group; likewise, interrater ICCs could not be calculated for 4- to 6-year-olds. We therefore only present overall values and recommend age-specific reliability analyses in a future study.
Concurrent validity
We assessed concurrent validity by comparison to the VSK-SR scale. The VSK-SR scale tends to focus on behavioral inhibition, namely patience, adaptability, and perseverance skills, whereas the GEDI-SR scale reflects cognitive inhibition and selective/focused attention with slightly different dimensions (concentration, diligence, and adherence to rules). Given this difference, the degree of agreement in terms of Pearson’s correlation coefficient was good. However, despite good overall concurrent validity results, the additional Bland–Altman analysis revealed that the two scales ((G)EDI-SR versus VSK-SR) differed for extreme values of SR. It thus remains uncertain whether the VSK-SR overestimates the extremes or the GEDI-SR underestimates deviations from the mean. Therefore, a future study might re-investigate the agreement between the GEDI-SR scale and another instrument available in German, such as the SDQ.
Comparison of reliability and validity results with those of other SR instruments
Regarding its psychometric properties and validity, the GEDI-SR scale shows values comparable (or even superior) to those of other instruments used to measure SR in the international and national context, as exemplified and quantified in Table 9. For example, the GEDI-SR scale compared to the other instruments shows very good internal consistency. Test–retest reliability seems even better than that of the CBQ or SDQ.
Table 9.
Psychometric properties | GEDI-SR | Other SR measurements | | | | |
---|---|---|---|---|---|---|
Instrument | GEDI-SR | CBCL 1.5–5 | CBQa | SDQ | BRIEF-P Canadian sample | BRIEF-P German sample |
Source | Current study | Achenbach (2000) | Putnam (2006) | Goodman (2001) | Sherman (2010) | Daseking (2013) |
Reliability | | | | | | |
Internal consistency | α ≈ 0.90 | α ≥ 0.86 | α = 0.67 to 0.71 | α = 0.73 | 0.90 ≤ α ≤ 0.97 | 0.82 ≤ α ≤ 0.94 |
Test–retest reliability | ICC = 0.85 (95%-CI: 0.71 to 0.93) | Pearson’s r = 0.72 to 0.89 | r = 0.61 to 0.70 | r = 0.73 (after 4 to 6 months) | r ≥ 0.90 | X |
Interrater reliability | ICC = 0.71 (95%-CI: 0.43 to 0.89) | r = 0.52 to 0.78 | r = 0.47 | r = 0.80 (sample of 5- to 15-year-olds) | X | r = 0.56 |
Validity | | | | | | |
Concurrent validity | r = 0.75 with VSK-SR | r = 0.56 to 0.77 with the Richman Behavior Checklist | X | OR = 13.5 (95%-CI: 11.1 to 16.3) with DSM-IV diagnosis | X | r = 0.70 with BASCb |
Note: a Values for parents as respondents; b BASC Behavior Assessment System for Children (Reynolds & Kamphaus 2004); X Information not available; ICC Intraclass correlation coefficient
Moreover, our results confirm the good psychometric properties of the original (G)EDI and show that the "Social Competence" and "Emotional Maturity" scales of the EDI have been developed very well with regard to the selection and formulation of items. Building on this excellent work of the Canadian developers, we were now able to develop a reliable and valid SR scale that is inherent to the (G)EDI and thus does not require additional time for SR-assessment.
Public health implication
Given the good psychometric characteristics, validity, and reliability of the (G)EDI-SR scale, our work provides the precondition for a public health monitoring process in which the GEDI-SR, as part of the (G)EDI or as a stand-alone scale, serves as a starting point for implementing interventions at both the individual child and the population level. The newly developed GEDI-SR might be particularly relevant to countries that already monitor child development in kindergartens using the EDI at scale (e.g., Australia [45]). However, before it can be used as a public health screening instrument, age-specific standardized cut-offs should be established in a representative sample (standardization sample) [70]. Once valid cut-off values are established, each country using the EDI for developmental monitoring could efficiently screen for SR difficulties at this early age and use the results for tailored implementation of SR-promoting interventions in kindergartens at a public health scale.
Strengths and limitations
To the best of our knowledge, this is the first study to define and validate a short SR scale within the widely used EDI. Although other short SR subscales exist (e.g., in the VSK or the CBRS) and might be theoretically usable, our scale might be very efficient from a public health perspective because its items are already part of the EDI or GEDI administration. In addition, the costly purchase of, e.g., the VSK (which is not open access) and the necessary separate scoring methodology make the use of a separate SR scale potentially challenging for teachers and public health researchers, especially compared to the (G)EDI, which allows developmental and SR assessment at once and is available free of charge.
In terms of item selection for the GEDI-SR scale, we only achieved moderate agreement between raters, which underscores the difficulty of distinguishing SR from other constructs such as social competence or emotional maturity. Despite the agreement and consensus regarding the theoretical basis, the only moderate agreement might also be explained by the raters’ different professional perspectives and backgrounds (psychology, occupational therapy, pedagogy), which may bring about different preferences for wording and deviating operationalizations. Reassuringly, however, the results of our exploratory and confirmatory factor analyses and structural equation modeling suggest that the selected items represent the latent construct SR.
Although we were able to include two independent datasets, we are aware that both might be affected by selection bias related to their geographic location (e.g., potentially containing lower numbers of children from families with low socioeconomic status). As we did not collect the SES of the children's families, we cannot assess the representativeness of the samples. Hence, our data cannot readily be generalized to specific subgroups of interest, for example children of parents with a recent migration background and lower socioeconomic or educational status. Moreover, 6-year-old children are underrepresented in both datasets. We found differing percentile values for the lower age groups, but we attribute these to a higher inter- and intra-individual variability of developmental maturity [71].
In addition, we did not establish reference values in a representative dataset. However, given the successful replication of the structural equation modeling with the validation dataset, we were at least able to demonstrate the stability of the model across populations. Last, at this stage and without a standardization sample, we are unable to determine the predictive validity of the GEDI-SR scale.
Conclusion
Thirteen items in the (G)EDI can be combined into a reliable and valid (G)EDI-SR scale, which can be used either as a stand-alone scale or as part of regular developmental monitoring using the EDI or GEDI in kindergartens. By using the SR scale as part of (G)EDI kindergarten monitoring, kindergartens with higher percentages of children with SR difficulties could be identified and interventions implemented in a tailored way. Future research collecting data with the GEDI-SR in a representative sample could provide appropriate age- and domain-specific standardized cut-offs that would enable an adequate evaluation of area-wide population-based data.
Acknowledgements
We thank the children, their parents and families, the kindergartens, and the regular kindergarten teachers for their cooperation and the time they donated, as well as the kindergarten teachers who participated in the expert interviews. Moreover, we thank the health manager of the community where data were collected, Franziska Kramer-Gmeiner, for her extraordinary support in recruitment and project coordination.
Abbreviations
- BA
Bland-Altman
- BIKO
Bildungskompetenzen Organisieren / BIKO-Screening zur Entwicklung von Basiskompetenzen für 3- bis 6-Jährige (Organizing Education in Preschool)
- BRIEF-P
Behavior rating inventory of executive function—preschool version
- CFI
Comparative fit index
- CPD
Center for Preventive Medicine and Digital Health
- DESK 3-6
Dortmunder Entwicklungs Screening für den Kindergarten (Dortmund Developmental Screening for Kindergarten)
- GEDI
German version of the Early Development Instrument
- GEDI-SR scale
GEDI self-regulation scale
- ICC
Intraclass correlation coefficient
- KOMPIK
Kompetenzen und Interessen von Kindern (Competencies and Interests of Children)
- RMSEA
Root mean squared error of approximation
- SDQ
Strengths and difficulties questionnaire
- SES
Socioeconomic status
- SR
Self-regulation
- VSK
Verhaltensskalen für das Kindergartenalter (Kindergarten Behavioral Scales)
- VSK-SR
VSK self-regulation scale
Authors’ contributions [46]
Sabine Georg: Conceptualization, Investigation, Methodology, Formal analysis, Writing – original draft, Visualization, Project administration.
Bernd Genser: Methodology, Writing – Review & Editing.
Joachim E. Fischer: Conceptualization, Writing – Review & Editing, Funding acquisition.
Steffi Sachse: Conceptualization, Methodology, Writing – Review & Editing, Supervision.
Freia De Bock: Conceptualization, Methodology, Writing – Review & Editing, Supervision.
Funding
Open Access funding enabled and organized by Projekt DEAL. The work of Sabine Georg was funded by the Ministry of Science, Research and the Arts of the German federal state Baden-Württemberg. The authors declare no further conflict of interest. We declare that this second validation study was designed independently from the funder, the collection, analysis and interpretation of data was performed independently from the funder and the manuscript was written independently from the funder.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Declarations
Ethics approval and consent to participate
Ethical approval was granted by the Ethics committee of the Medical Faculty Mannheim, Heidelberg University (2015-640N-MA). The teachers’ participation was understood as an implicit consent to participate in our study. Written informed consent was obtained from parents. We confirm that all methods were performed in accordance with the relevant guidelines and regulations.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
References
- 1. Moffitt TE, Arseneault L, Belsky D, et al. A Gradient of Childhood Self-Control Predicts Health, Wealth, and Public Safety. Proc Natl Acad Sci. 2011;108:2693–2698. doi: 10.1073/pnas.1010076108.
- 2. Robson DA, Allen MS, Howard SJ. Self-regulation in childhood as a predictor of future outcomes: A meta-analytic review. Psychol Bull. 2020;146:324–354. doi: 10.1037/bul0000227.
- 3. Gawrilow C, Rauch W. Selbstregulationsfähigkeiten und exekutive Funktionen im Entwicklungsverlauf bei Vorschulkindern und Schulkindern. (Self-regulatory and executive functions over the developmental course in preschoolers and school-age children.). In: Hartmann U, Gold A, Marcus H (eds) Entwicklungsverläufe verstehen - Kinder mit Bildungsrisiken wirksam fördern - Forschungsergebnisse des Frankfurter IDeA-Zentrums. Stuttgart: Kohlhammer; 2017.
- 4. Diamond A. Executive functions. Annu Rev Psychol. 2013;64:135–168. doi: 10.1146/annurev-psych-113011-143750.
- 5. Calkins SD, Graziano PA, Keane SP. Cardiac vagal regulation differentiates among children at risk for behavior problems. Biol Psychol. 2007;74:144–153. doi: 10.1016/j.biopsycho.2006.09.005.
- 6. Eisenberg N, Valiente C, Spinrad TL, et al. Longitudinal Relations of Children’s Effortful Control, Impulsivity, and Negative Emotionality to Their Externalizing, Internalizing, and Co-Occurring Behavior Problems. Dev Psychol. 2009;45:988–1008. doi: 10.1037/a0016213.
- 7. Nigg JT. Annual Research Review: On the relations among self-regulation, self-control, executive functioning, effortful control, cognitive control, impulsivity, risk-taking, and inhibition for developmental psychopathology. J Child Psychol Psychiatry. 2017;58:361–383. doi: 10.1111/jcpp.12675.
- 8. Riggs NR, Kobayakawa Sakuma K-L, Pentz MA. Preventing Risk for Obesity by Promoting Self-Regulation and Decision-Making Skills. Pilot Results From the PATHWAYS to Health Program (PATHWAYS). Eval Rev. 2011;11:287–310.
- 9. Dierckens M, Richter M, Moor I, et al. Trends in material and non-material inequalities in adolescent health and health behaviours: A 12-year study in 23 European countries. Prev Med (Baltim); 157. Epub ahead of print 1 April 2022. doi: 10.1016/j.ypmed.2022.107018.
- 10. White BA, Jarrett MA, Ollendick TH. Self-regulation deficits explain the link between reactive aggression and internalizing and externalizing behavior problems in children. J Psychopathol Behav Assess. 2013;35:1–9. doi: 10.1007/s10862-012-9310-9.
- 11. Hölling H, Erhart M, Ravens-Sieberer U, et al. Verhaltensauffälligkeiten bei Kindern und Jugendlichen: Erste Ergebnisse aus dem Kinder- und Jugendgesundheitssurvey (KiGGS) (Behavioral problems in children and adolescents: Initial findings from the Child and Adolescent Health Survey.) Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz. 2007;50:784–793. doi: 10.1007/s00103-007-0241-7.
- 12. Klipker K, Baumgarten F, Göbel K, et al. Mental Health Problems in Children and Adolescents in Germany. Results of the Cross-Sectional KiGGS Wave 2 Study and Trends. J Health Monit. 2018;3:34–41. doi: 10.17886/RKI-GBE-2018-084.
- 13. Nakamura YM, Lehmann RJ. Mitarbeiter/Innenbeurteilung. Lebens- und Schulqualität PH Akzente. 2003;3:50–52.
- 14. Nodi M, Ackermann K, Eberhard U, et al. Arbeitsbedingungen, Belastungen und Ressourcen von Lehrpersonen und Schulleitungen im Kanton Aargau 2008. Ergebnisse der Untersuchung im Auftrag des Departements Bildung, Kultur und Sport. (Working Conditions, Burdens and Resources of Teachers and School Administrators in the Canton of Aargau 2008. Aarau: Results of the Study Commissioned by the Department of Education, Culture and Sport.); 2008.
- 15. Keller R, Kunz A, Luder R, et al. Schulentwicklung für eine inklusive und gesunde Schule am Beispiel der Projekte „SIS“ und „Challenge“. (School development for an inclusive and healthy school using the example of the ‘SIS’ and ‘Challenge’ projects.) In: Zala-Mezö E, Strauss N-C, Häbig J, et al., editors. Dimensionen von Schulentwicklung. Verständnis, Veränderung und Vielfalt eines Phänomens. Münster: Waxmann; 2018. pp. 187–204.
- 16. Campbell SB, Pierce EW, March CL, et al. Hard-to-Manage Preschool Boys: Symptomatic Behavior across Contexts and Time. Child Dev. 1994;65:836–851. doi: 10.2307/1131422.
- 17. Klora M, Zeidler J, Linder R, et al. Costs and treatment patterns of incident ADHD patients - a comparative analysis before and after the initial diagnosis. Health Econ Rev. 2015;5:1–9. doi: 10.1186/s13561-015-0078-y.
- 18. Ewest F, Reinhold T, Vloet TD, et al. Durch Jugendliche mit Störungen des Sozialverhaltens ausgelöste Krankenkassenausgaben: Eine gesundheitsökonomische Analyse von Versichertendaten einer gesetzlichen Krankenkasse. (Health insurance costs caused by adolescents with social behavior disorders: A Health Economic Analysis of Insured Data from a Public Health Insurance Fund.) Kindheit und Entwicklung. 2013;22:41–47. doi: 10.1026/0942-5403/a000097.
- 19. Blair C. How similar are fluid cognition and general intelligence? A developmental neuroscience perspective on fluid cognition as an aspect of human cognitive ability. Behavioral and Brain Sciences. 2006;29:109–125. doi: 10.1017/S0140525X06009034.
- 20. Ceci SJ. How much does schooling influence general intelligence and its cognitive components? A reassessment of the evidence. Dev Psychol. 1991;27:703–722. doi: 10.1037/0012-1649.27.5.703.
- 21. Cicchetti D. The impact of social experience on neurobiological systems: illustration from a constructivist view of child maltreatment. Cogn Dev. 2002;17:1407–1428. doi: 10.1016/S0885-2014(02)00121-1.
- 22. Diamond A, Lee K. Interventions shown to aid executive function development in children 4 to 12 years old. Science. 2011;333:959–964. doi: 10.1126/science.1204529.
- 23. Pandey A, Hale D, Das S, et al. Effectiveness of universal self-regulation-based interventions in children and adolescents: a systematic review and meta-analysis. JAMA Pediatr. 2018;172:566–575. doi: 10.1001/jamapediatrics.2018.0232.
- 24. Muir RA, Howard SJ, Kervin L. Interventions and Approaches Targeting Early Self-Regulation or Executive Functioning in Preschools: A Systematic Review. Educ Psychol Rev. 2023;35:27. doi: 10.1007/s10648-023-09740-6.
- 25. Blair C, Ku S. A Hierarchical Integrated Model of Self-Regulation. Front Psychol; 13. Epub ahead of print 4 March 2022. doi: 10.3389/fpsyg.2022.725828.
- 26. Epstein MH, Sharma JM. Behavioral and Emotional Rating Scale: A strength-based approach to Assessment. TX: PRO-ED; 1998.
- 27. Achenbach TM. The Child Behavior Checklist and related instruments. In: Maruish ME, editor. The use of psychological testing for treatment planning and outcomes assessment. Lawrence Erlbaum Associates Publishers; 1999. pp. 429–466.
- 28. Putnam SP, Rothbart MK. Development of Short and Very Short Forms of the Children’s Behavior Questionnaire. J Pers Assess. 2006;87:102–112. doi: 10.1207/s15327752jpa8701_09.
- 29. Bronson MB, Goodson BD, Layzer JI, et al. Child behavior rating scale. Cambridge MA: Abt Associates; 1990.
- 30. Conners CK. A teacher rating scale for use in drug studies with children. American J Psychiatry. 1969;126(6):884–8. doi: 10.1176/ajp.126.6.884.
- 31. Lebuffe PA, Naglieri JA. The Devereux Early Childhood Assessment (for children ages 2 through 5 years) 1998.
- 32. Lafreniere PJ, Dumas JE. Social Competence and Behavior Evaluation in Children Ages 3 to 6 Years: The Short Form (SCBE-30). Psychol Assess. 1996;8:369–377. doi: 10.1037/1040-3590.8.4.369.
- 33. Gouley KK, Brotman LM, Huang KY, et al. Construct validation of the social competence scale in preschool-age children. Soc Dev. 2008;17:380–398. doi: 10.1111/j.1467-9507.2007.00430.x.
- 34. Goodman R. The Strengths and Difficulties Questionnaire: a research note. J Child Psychol Psychiatry. 1997;38:581–586. doi: 10.1111/j.1469-7610.1997.tb01545.x.
- 35.Goodman R. Psychometric Properties of the Strengths and Difficulties Questionnaire. J Am Acad Child Adolesc Psychiatry. 2001;40:1337–1345. doi: 10.1097/00004583-200111000-00015. [DOI] [PubMed] [Google Scholar]
- 36.Gioia GA, Isquith PK, Guy SC, et al. Behavior Rating Inventory of Executive Function. Child Neuropsychol. 2000;6:235–238. doi: 10.1076/chin.6.3.235.3152. [DOI] [PubMed] [Google Scholar]
- 37.McCoy DC. Measuring Young Children’s Executive Function and Self-Regulation in Classrooms and Other Real-World Settings. Clin Child Fam Psychol Rev. 2019;22:63–74. doi: 10.1007/s10567-019-00285-1. [DOI] [PubMed] [Google Scholar]
- 38.Daseking M, Petermann F. Verhaltensinventar zur Beurteilung exekutiver Funktionen für das Kindergartenalter Deutschsprachige Adaptation des Behavior Rating Inventory of Executive Function® - Preschool Version (BRIEF®-P) von Gerard A. Gioia, Kimberly Andrews Espy und Peter K. Isquith. Göttingen: Hogrefe; 2013.
- 39.Koglin U, Petermann F. Verhaltensskalen für das Kindergartenalter. Göttingen: Hogrefe; 2016. [Google Scholar]
- 40.Seeger D, Holodynski M, Souvignier E. Testbesprechung. BIKO-Screening zur Entwicklung von Basiskompetenzen für 3- bis 6-Jährige. (Test review. BIKO screening for the development of basic skills for 3- to 6-year-olds.). Hogrefe Publishing Group, 2014. Epub ahead of print January 2014. 10.1026/0049-8637/a000122.
- 41.Tröster H, Flender J, Reineke D. Dortmunder Entwicklungsscreening für den Kindergarten (DESK 3–6). (Dortmund Development Screening for Kindergarten.) Kindheit und Entwicklung. 2005;14:140–149. doi: 10.1026/0942-5403.14.3.140. [DOI] [Google Scholar]
- 42.Bauer C, Krause M, Mayr T. Kompetenzen und Interessen von Kindern. Beobachtungs- und Einschätzbogen für Kinder von 3,5 bis 6 Jahren. (Children’s competencies and interests. Observation and assessment tools for children from 3.5 to 6 years.) Gütersloh: Bertelsmann Stiftung; 2010. [Google Scholar]
- 43.Janus M, Offord DR. Development and psychometric properties of the Early Development Instrument (EDI): A measure of children’s school readiness. Can J Behav Sci. 2007;39:1–22. doi: 10.1037/cjbs2007001. [DOI] [Google Scholar]
- 44.Georg S, Bosle C, Fischer JE, et al. Psychometric properties and contextual appropriateness of the German version of the Early Development Instrument. BMC Pediatr. 2020;20:339. doi: 10.1186/s12887-020-02191-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Brinkman SA, Gregory TA, Goldfeld S, et al. Data Resource Profile: The Australian Early Development Index (AEDI). Int J Epidemiol. 2014;43:1089–1096. doi: 10.1093/ije/dyu085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Curtin M, Madden J, Staines A, et al. Determinants of vulnerability in early childhood development in Ireland: a cross-sectional study. BMJ Open. 2013;3. doi: 10.1136/bmjopen-2012-002387. [DOI] [PMC free article] [PubMed]
- 47.Hagquist C, Hellström L. The Psychometric Properties of the Early Development Instrument: A Rasch Analysis Based on Swedish Pilot Data. Soc Indic Res. 2013;117:301–317. doi: 10.1007/s11205-013-0344-5. [DOI] [Google Scholar]
- 48.Ip P, Li SL, Rao N, et al. Validation study of the Chinese Early Development Instrument (CEDI). BMC Pediatr. 2013;13:146. doi: 10.1186/1471-2431-13-146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Woolfson LM, Geddes R, McNicol S, et al. A Cross-Sectional Pilot Study of the Scottish Early Development Instrument: A Tool for Addressing Inequality. BMC Public Health. 2013;13:1187. doi: 10.1186/1471-2458-13-1187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Equity from the Start - The Early Development Instrument, https://edi.offordcentre.com/about/what-is-the-edi/ (accessed 14 December 2022).
- 51.What is the EDI?, https://edi.offordcentre.com/about/what-is-the-edi/ (accessed 28 August 2023).
- 52.Mukaka MM. Statistics Corner: A guide to appropriate use of Correlation coefficient in medical research, www.mmj.medcol.mw (2012). [PMC free article] [PubMed]
- 53.Bailey R, Jones SM. An Integrated Model of Regulation for Applied Settings. Clin Child Fam Psychol Rev. 2019;22:2–23. doi: 10.1007/s10567-019-00288-y. [DOI] [PubMed] [Google Scholar]
- 54.Janus M. The Early Development Instrument: A Tool for Monitoring Children’s Development and Readiness for School. In: Early Child Development: From Measurement to Action. A Priority for Growth and Equity. 2006. pp. 141–155. [Google Scholar]
- 55.Ludwig-Mayerhofer W. ILMES - Internet-Lexikon der Methoden der empirischen Sozialforschung. (ILMES - Internet Encyclopedia of Methods in Empirical Social Research.), http://wlm.userweb.mwn.de/Ilmes/ilm_f3.htm (2016, accessed 14 December 2022).
- 56.Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Struct Equ Modeling. 1999;6:1–55. doi: 10.1080/10705519909540118.
- 57.MacCallum RC, Browne MW, Sugawara HM. Power Analysis and Determination of Sample Size for Covariance Structure Modeling. Psychol Methods. 1996;1:130–149. [Google Scholar]
- 58.Schumacker RE, Lomax RG. A Beginner’s Guide to Structural Equation Modeling. 4. New York: Routledge; 2015. [Google Scholar]
- 59.Loehlin JC, Beaujean AA. Latent Variable Models: An Introduction to Factor, Path, and Structural Equation Analysis. New York: Taylor & Francis; 2017. [Google Scholar]
- 60.Kyriazos TA. Applied Psychometrics: Sample Size and Sample Power Considerations in Factor Analysis (EFA, CFA) and SEM in General. Psychology. 2018;09:2207–2230. doi: 10.4236/psych.2018.98126. [DOI] [Google Scholar]
- 61.Koo TK, Li MY. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J Chiropr Med. 2016;15:155–163. doi: 10.1016/j.jcm.2016.02.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Altman DG, Bland JM. Measurement in Medicine: The Analysis of Method Comparison Studies. The Statistician. 1983;32:307. doi: 10.2307/2987937. [DOI] [Google Scholar]
- 63.Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;327:307–310. doi: 10.1016/S0140-6736(86)90837-8. [PubMed]
- 64.Bennetts SK, Mensah FK, Westrupp EM, et al. The Agreement between Parent-Reported and Directly Measured Child Language and Parenting Behaviors. Front Psychol. 2016;7. doi: 10.3389/fpsyg.2016.01710. [DOI] [PMC free article] [PubMed]
- 65.Bland JM, Altman DG. Applying the Right Statistics: Analyses of Measurement Studies. Ultrasound Obstet Gynecol. 2003;22:85–93. doi: 10.1002/uog.122. [DOI] [PubMed] [Google Scholar]
- 66.Cox NJ, Steichen TJ. CONCORD: Stata Module for Concordance Correlation. Statistical Software Components S404501, Boston College Department of Economics, https://ideas.repec.org/c/boc/bocode/s404501.html (2007, accessed 20 March 2020).
- 67.Savina E. Self-regulation in Preschool and Early Elementary Classrooms: Why It Is Important and How to Promote It. Early Childhood Educ J. 2021;49:493–501. doi: 10.1007/s10643-020-01094-w. [DOI] [Google Scholar]
- 68.Légaré F, Borduas F, Freitas A, et al. Development of a Simple 12-Item Theory-Based Instrument to Assess the Impact of Continuing Professional Development on Clinical Behavioral Intentions. PLoS One. 2014;9. doi: 10.1371/journal.pone.0091013. [DOI] [PMC free article] [PubMed]
- 69.Kumah EA, Bettany-Saltikov J, van Schaik P, et al. Development and validation of a questionnaire to assess evidence-based practice and evidence-informed practice knowledge, attitudes, understanding and behavior. Teaching and Learning in Nursing. Epub ahead of print 2023. 10.1016/j.teln.2023.07.006.
- 70.Moosbrugger H, Kelava A. Testtheorie und Fragebogenkonstruktion. (Test theory and questionnaire development.) 2013. doi: 10.1007/978-3-642-20072-4_2.
- 71.Van Dijk M, Van Geert P. The nature and meaning of intraindividual variability in development in the early life span. In: Diehl M, Hooker K, Sliwinski MJ, editors. Handbook of intraindividual variability across the life span. New York, East Sussex: Routledge; 2016. pp. 37–58. [Google Scholar]
Data Availability Statement
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.