Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Mar 25.
Published in final edited form as: Dev Neuropsychol. 2008;33(3):365–393. doi: 10.1080/87565640801982445

Standardized Assessment of Strategy Use and Working Memory in Early Mental Arithmetic Performance

Sarah S Wu 1, Meghan L Meyer 1, Uta Maeda 1, Valorie Salimpoor 5, Sylvia Tomiyama 2, David C Geary 6, Vinod Menon 1,3,4
PMCID: PMC3607623  NIHMSID: NIHMS86772  PMID: 18473204

Abstract

Although children’s use of a variety of strategies to solve arithmetic problems has been well documented, there is no agreed upon standardized and validated method for assessing this mix. We examined the convergent validity of typically achieving (TA, n = 39) and low achieving (LA, n = 20) 2nd and 3rd grade children’s strategy choices in simple addition using three different methods: child self-report, observer-report, and response time (RT). The high concordance between child and observer reports (Kappa = .948) in both groups suggests that the participants were aware of, and could accurately report, the strategies they used. The Receiver-Operator Characteristic (ROC) analysis showed that RT accurately differentiated between retrieval and counting (AUC = 82%). The specificity and sensitivity of the ROC profiles were significantly greater for the TA group than for LA group, even though the groups did not differ in the overall strategy mix. Our findings suggest that ROC analysis is more sensitive to group differences in the mechanisms governing strategy choice than observation or child report. Children’s use of retrieval strategies as well as accuracy during both retrieval and counting trials were all related to the central executive, but not the phonological and visuospatial sketchpad, component of working memory. We discuss the implication of these findings for early mathematical learning.

Keywords: Mathematical cognition, mental arithmetic, strategy use, working memory, automaticity


It is well established that children’s problem solving in many cognitive and academic domains is characterized by use of a mix of strategies and that development involves gradual change in this mix. The change is reflected in declining use of the least efficient–in terms of time, accuracy, and working memory demands–strategies and increasing use of the most efficient strategies (Siegler, 1984). One of the areas in which this pattern has been extensively documented is in the mix of strategies children use to solve arithmetic problems. These studies have shown that even young children are capable of using a variety of strategies to solve simple addition and subtraction problems, and that the strategy mix changes with development and schooling (Ashcraft, 1992; Baroody, 1987; Carpenter & Moser, 1984; Cooney, Swanson, & Ladd, 1988; Siegler & Robinson, 1982; Siegler & Shrager, 1984). The developmental pattern is most evident in typically achieving (TA) children who progress from the use of immature counting strategies to the more mature memory-based processes during the elementary school years (Ashcraft & Fierman, 1982; Geary, Widaman, Little, & Cormier, 1987; Fuson, 1982; Kaye, Post, Hall, & Dineen, 1986). Children with a mathematical learning disability (MLD) and, to a lesser extent, children who score lower on mathematics achievement tests than would be expected on their intelligence and reading scores (i.e. low achieving children, LA) use counting strategies for more years and show evidence of difficulties in use of memory-based processes (Geary, Hoard, Byrd-Craven, & DeSoto, 2004; Geary & Brown, 1991).

Researchers have employed a variety of methods to study the mix of strategies children use to solve arithmetic problems, but to date there is no agreed upon standardized measures for assessing these strategy choices. In fact, there have been concerns regarding the validity of the commonly used method of subject report and whether these reports are consistent with RT patterns (Ashcraft, 1992). We address this issue in the current study with the assessment of the convergent validity of the three methods that have been used for assessing strategy choices in arithmetic for groups of typically achieving and low achieving children; specifically, experimenter observation, child self-report, and RT. We also present the first Receiver-Operator Characteristic (ROC) analysis of RT measures of strategy choice and relate these parameters and those derived from the observational and self-report methods to individual and group differences in working memory.

Addition Strategy Choices

During the early elementary school years, the mix of strategies children use to solve simple addition problem is initially dominated by finger and verbal counting, although most children can use direct retrieval to solve a few problems, such as retrieving “three” for 1 + 2 (Ashcraft, 1982; Fuson, 1982; Groen & Parkman, 1972; Siegler & Shrager, 1984). With schooling, the use of counting gradually declines and is replaced by direct retrieval of answers from long-term memory, at least for simple problems.

In comparison to typically achieving children, children with MLD and their low achieving peers use counting as a problem solving strategy across more grades and commit more counting errors (Geary, 1993; Geary, Hoard, Byrd-Craven, Nugent, & Numtee, 2007; Hanich, Jordan, Kaplan, & Dick, 2001; Jordan & Montani, 1997; Ostad, 1997; Russell & Ginsberg, 1984). The most consistent finding is that children with MLD and, to a lesser extent, low achieving children show a deficit in the ability to use retrieval-based processes (Barrouillet, Fayol, & Lathuliére, 1997; Geary, 1990; Geary, Hamson, & Hoard, 2000; Jordan, Hanich, & Kaplan, 2003). It is not that these children never correctly retrieve answers, but that they show a persistent difference in the frequency with which they correctly retrieve basic facts, and sometimes in the pattern of retrieval errors.

Standardizing Strategy Use Assessments

To date, three methods have been used to evaluate strategy choices. The simplest of these is the self-report, which involves directly asking children which strategy they used to solve each problem (Carpenter & Moser, 1984; Houlihan & Ginsburg, 1981). The assumption is that children are capable of providing an accurate description of their strategies, if asked immediately after solving each problem (Siegler, 1987; Svenson, 1985). Observer-reports require the experimenter to watch a child solve each problem and note overt signs of counting, such as finger usage, lip movement, or audible counting. However, this type of classification becomes increasingly more difficult with age, because as children progress to implicit counting and to retrieval, overt signs of strategy use begin to disappear.

A third method is chronometry, that is, the measurement of RT for solving each problem (Ashcraft, 1982; Groen & Parkman, 1972), and use of regression equations to infer underlying strategies and processes. For instance, if problems are solved by means of counting, then RT should be linearly related to the magnitude of the counted addend and the corresponding raw regression coefficient should be consistent with independent estimates of speed of implicit counting (Geary & Widaman, 1987; Groen & Parkman, 1972; Svenson, 1985; Widaman, Little, Geary, & Cormier, 1992). For retrieval, the best predictor of RTs should be a variable that models the underlying structure of the long-term memory organization of addition facts (Ashcraft & Battaglia, 1978). The major advantages of the chronometric model are that the RTs can be reliably assessed, they are not dependent on subject report, and do not lose their sensitivity as observation of strategy choices becomes difficult (e.g., for retrieval). The primary disadvantage is that it is difficult to assess within-subject variation in strategy usage.

In this study, we used ROC methods, a signal detection method that determines how well a predictive equation correctly classifies a data set into two groups (Centor, 1991). In the domain of mathematical cognition, ROC analyses have been used to predict performance outcomes in early mathematics learning. For example, Mazzocco and Thompson (2005) used ROC curves to determine whether performance measures gathered in kindergarten could predict whether participants would eventually develop MLD in the 2nd and 3rd grade. Here we use ROC methods to assess whether RTs generated when 2nd and 3rd grade children solve additional problems can be used to accurately classify strategy choices, more specifically, to distinguish between retrieval and counting. We also determined the RT, or cut-off criterion, that maximizes the sensitivity and specificity of the classification. These analyses may be used to help develop a confirmatory and quantitative method of using RTs for determining strategy use.

Thus, the first set of aims of the current study are to: (1) test the level of convergence amongst experimenter reports and participant self-reports of children’s strategy choice, (2) validate the aforementioned qualitative methods of strategy assessment with chronometric data, (3) investigate a predictive model of strategy use based on RTs, (4) determine a RT cut-off threshold that maximizes the differentiability between the two strategies of counting and retrieval, and (5) compare strategy use in typically and low achieving children using all of these methods.

Strategy Use and Working Memory

The second aspect of our study concerns the working memory (WM) contributions to individual and group differences in children’s strategy choices. On the basis of Baddeley and Hitch’s (1974) model, the three key components of WM are the central executive, phonological loop, and visuospatial sketchpad. The central executive functions as a supervisory and control system, orchestrating the activity and flow of information represented in the phonological and visuospatial subsystems. The central executive is also engaged in other aspects of executive function more generally (Baddeley, 1996) including retrieval of information from long-term memory (Baddeley, 1996; Baddeley, Emslie, Kolodny, & Duncan, 1998). The phonological loop and visuospatial sketchpad act as short-term storage systems for verbal and visuospatial information, respectively (Baddeley, Thomson, & Buchanan, 1975; Della Sala, Gathercole & Adams, 1994; Della Salla, Gray, Baddeley, Allamano, & Wilson, 1999; Logie, 1986; Wilson & Swanson, 2001).

With respect to how children solve addition problems, the central executive can influence strategy choice in both task-specific and task-general ways (Menon, Meyer, Wu, In Press). In general, this component of WM can play an important role in making decisions about appropriate strategy use and then allocating attentional resources to implement the selected strategy (Shrager & Siegler, 1998). The role of the central executive may be particularly critical in 2nd and 3rd grade children, when these tasks are not well automated and effortful processing is required. If an answer cannot be retrieved from long-tem memory, the central executive must engage several sequential processing stages to perform the task. These include encoding, maintenance and manipulation of numbers in either the visuospatial and phonological buffers (Geary & Brown, 1991; Hitch, 1978; Logie, Gilhooly, & Wynn, 1994; Swanson, 2001).

Previous research has established that individual and group differences in mathematics achievement are related to WM capacity (Adams & Hitch, 1997; Bull, Johnston, & Roy, 1999; Geary, Hoard, Byrd-Craven, & DeSoto, 2004; McLean & Hitch, 1999; Swanson, 1993; Swanson, 2006; Swanson & Sachse-Lee, 2001); Barrouillet and Lepine (2005) found that 3rd and 4th grade children with high WM capacity, as determined by dot-counting and reading letter span tasks, retrieved answers to simple addition questions (e.g. 3+4=7) significantly more frequently and more quickly than did children with low WM capacities. Similarly, Bull, Johnston and Roy (1999) found a significant, negative correlation between frequency of arithmetic fact retrieval and perseveration on the Wisconsin Card Sorting Test, a measure of the central executive component of WM in 3rd grade children. Geary et al. (2004) found that the poor central executive capacity (as measured by a counting span task) was associated with more frequent use of counting to solve addition problems and more counting errors in 1st graders. This pattern was not found in the 3rd graders, suggesting a developmental shift in the importance of the central executive for solving simple addition problems sometime between 1st and 3rd grade. The age at which this shift predominantly occurs is not known as few studies have examined strategy use and the role of WM in 2nd grade children.

Despite these advances, the components of WM that contribute to this relation are not as well understood. For example, Barrouillet and Lepine (2005) used a measure for the central executive (i.e., the Dot-Counting task), but did not include standardized measures for the phonological loop and the visuospatial sketchpad. Bull et al. (1999) assessed both the central executive and visuospatial sketchpad, but these measures were not standardized. In addition, they did not examine the role of the phonological component of WM, although this component has been implicated as a strong predictor of 1st grade children’s arithmetic performance (Rasmussen & Bisanz, 2005). Hitch and McAuley (1991) did find that children with mathematics learning disabilities had lower phonological loop capacities, their study was confined to a limited sample of 15 boys.

Geary et al. (2007), however, recently demonstrated that, in mathematics achievement and in an array of mathematical cognition tasks, including an assessment of strategy choices in addition, individual and group differences were consistently related to the individual differences in the capacity of the central executive. The phonological loop and visuospatial sketchpad, in contrast, contributed to more circumscribed aspects of mathematical performance. We provide a follow up to this study by examining the relation between each of the three core components of WM and addition strategy choices assessed by the three different methods described above. We also examine the role of each WM component in the accurate and efficient performance on the addition trials in the TA and LA groups.

Method

Participants

Participants were 59 2nd (n = 26) or 3rd (n = 33) graders recruited from the San Francisco Bay Area. Participants included 31 boys and 28 girls with a mean age of 8.05 years (SD=0.66) at Time 1 testing. Advertisements in school and local newspapers, community and electronic bulletin boards, community organizations, and other public locations were the principal means of recruitment. Parents who wished to enroll their children in the study contacted the lab for initial screening and further information. Written consent was obtained from all participants and parents prior to any participation. Children with IQ scores between 75 and 125 were classified as low achieving (n = 28) if either their standardized Numerical Operations or Math Composite score from the Weschler Individual Achievement Test –II (WIAT-II; Weschler, 2005) was less than 90 (corresponding to 27th percentile). Children were classified as typically achieving (n = 31) if their score was greater than 89 on both measures. An independent sample t-test indicated no differences in age or grades between the typically and low achieving participants.

Measures

Intelligence was assessed using the Weschler Abbreviated Scales of Intelligence (WASI; Weschler, 1999). The Numerical Operations, Math Reasoning, and Word Reading subtests of the WIAT-II were used to assess achievement.

WM was assessed using four subtests of the Working Memory Test Battery for Children (WMTB-C; Pickering & Gathercole, 2001): Counting Recall, Backward Digit Recall, Digit Recall, and Block Recall. Counting Recall and Backward Digit Recall assess the central executive component of working memory, whereas Digit Recall and Block Recall assess phonological loop and visuospatial sketchpad components, respectively.

Counting Recall requires the child to count a set of 4, 5, 6, or 7 dots on a card, and then to recall the number of counted dots at the end of a series of cards. Backward Digit Recall is a standard format Backward Digit Span. Digit Recall is a span task in which the child is asked to repeat number words spoken by the experimenter in the same order as presented by the experimenter. Block Recall is another span task, but the stimuli consist of a board with nine raised blocks in what appears to the child as a “random” arrangement. The blocks have numbers on one side that can only be seen from the experimenter’s perspective. The experimenter taps a block (or series of blocks), and the child’s task is to repeat the tapping in the same order as presented by the experimenter.

Strategy Assessment

The stimuli included 24 addition problems in the format of ‘a + b = ?’ composed of random pairs of integers 2 to 19, with sums ranging from 6 to 25. Tie problems (e.g. 2+2, 5+5) and addends of 0 and 1 were excluded because answers are typically retrieved from long-term memory and thus do not show strategy variability even for young children (Groen & Parkman, 1972; Svenson, 1985). No repetition of either addend was allowed across consecutive problems. Four practice problems were provided at the beginning of each session in order to familiarize the participants with the task.

Procedure

Strategy Assessment

Following four practice problems, participants were asked to solve the same 24 addition problems, presented one at a time on the computer screen. The child was asked to solve each problem (without the use of paper and pencil) as quickly as possible without making too many mistakes and to immediately state the answer out loud. It was emphasized that the child could use whatever strategy was easiest to produce an answer; that is, they were told that they could count using their fingers, count verbally, or “just remember the answer.” For each problem, the experimenter took detailed notes of overt signs of counting, such as finger usage, lip movement, or audible counting. A timer was started at the initial display of each problem, and the experimenter measured the child’s RT by pressing a key on the keyboard as soon as a verbal response was given; all sessions were audio-recorded to check for RT precision. Immediately after each response, children were asked what strategy they used to solve the problem. Their reported strategies were classified as one of the four basic types: (a) counting fingers, (b) fingers (with no visible or audible verbal counting), (c) verbal counting, and (d) retrieval (Geary & Burlingham-Dupree, 1989; Geary, Hoard, Nugent, Byrd-Craven, 2007; Siegler & Shrager, 1984; Siegler, 1986). The experimenter’s observations and the participant’s descriptions were compared to determine the degree of agreement.

All strategies were reclassified as either “counting” (counting on fingers, finger usage without verbal counting, and verbal counting) or “retrieval”. All other strategies, such as decomposition, guessing, or counting by numbers (e.g. counting by 2) comprised approximately 7.14% of the total number of trials, and were excluded from further analysis. Trials which the children reported as having been solved by guessing were removed in order to retain only trials actively solved by retrieval. If it was unclear whether the child actively attempted to retrieve the answer or guessed, the trial was removed. Isolated trials (33 total) in which the RT was flawed due to the experimenter pressing the key too early or too late were removed from the analysis, as were trials with an RT > 3.5 SDs above or below the mean. Participants who counted on at least 60% of the trials were classified as “counters,” and likewise, participants who retrieved on at least 60% of the trials were considered “retrievers.” Those who used a roughly equal proportion of counting and retrieval were classified as “neither.”

ROC

To investigate the appropriateness of using RT to determine strategy, we used Receiver Operating Characteristic (ROC), a method of signal detection. ROC assesses how well a variable of interest classifies a data set into two groups (Centor, 1991; Katz & Foxman, 1993). For any particular threshold, all values above are classified into one group, whereas all values below are classified into a second group. Each threshold yields an associated proportion of cases that are correctly classified in a group (i.e., “sensitivity,” or percentage true positives), and the proportion that are incorrectly classified into the same group (i.e., 1 minus the “specificity,” or percentage false positives). The threshold is then varied to yield an ROC curve (Center, 1991). The ROC curve is an X-Y plot of all the sensitivity and specificity pairs in a unit square (i.e. maximum X and Y values are both 1) . The ROC curve can be used to determine the threshold point at which the sensitivity and specificity is optimal; thus maximizing the percentage of true positives while minimizing the percentage of false positives. ROC curves also provide a measure called the Area under the Curve (AUC), or, the likelihood that, given a true positive and a true negative, the ROC would correctly classify each as such (Hanley & McNeil, 1982; Lusted, 1971). An AUC of .5, for example, would indicate that the ROC was unreliable, assigning a correct classification in only 50% of the cases. An AUC of 1, however, indicates that the ROC was perfectly accurate in classification. In our study we used response times to differentiate between counting and retrieval trials. True positives here refer to the percentage of counting trials that were accurately identified as counting trials, and false positives refer to the percentage of retrieval trials that were inaccurately identified as counting trials.

Results

The results are presented in three sections. In the first section, we present results from the analysis of the correspondence between self- and observer-reports. In the second section, we present results from the ROC analyses, first with the entire group of participants, and then between-group comparisons of ROC curves. In the third section, we also present results detailing behavioral measures of performance stemming from strategy use. Finally, we discuss the relationship between strategy choice and the three components of WM. Throughout these sections, we focus on differences between the TA and LA groups.

As expected, the typically achieving (n = 39) participants had significantly greater WIAT-II Numerical Operations and Math Composite scores than the low achieving (n = 20) participants (Table 1). Both groups had average and statistically similar WIAT word reading and reading comprehension scores that were well above the reading disability cutoff score of 91.

Correspondence between self- and observer- reports

Our analysis revealed a Kappa’s α of .948 (p < .001) between the children’s self-report and experimenter’s observations of strategy use. The most common disagreement occurred when the child reported using retrieval but the experimenter observed subtle signs of counting, such as fingers fluttering or lips moving. The Kappa’s α was .944 and .954 (both p< .001) for the TA and LA groups, respectively.

ROC Profiles

Across all children, the ROC analysis yielded an AUC of 0.820 (SE = 0.012, asymptotic p < .001; Figure 1), suggesting that RT could successfully differentiate between the trials in which participants used counting and trials in which they used retrieval. The ROC data yielded a maximal cut-off of 3662.5 ms, with a sensitivity of 78.7% and a specificity of 72.1%. In other words, trials with RTs of less than 3662.5 ms most likely reflect the use of retrieval, whereas trials with RTs greater than 3662.5 ms reflect the use of counting. In using this cut-off threshold, 78.7% of the counting trials will be correctly classified as being solved by counting, whereas 27.9% (i.e. 100% - 72.1%) of the retrieval trials will have been incorrectly categorized as having been solved by counting.

Comparison of 2nd and 3rd graders

For the 2nd graders, the ROC analysis yielded an AUC of .801 (SE = 0.019), with a cut-off RT of 4047 ms, 77.4% sensitivity, and 72.5% specificity. The 3rd grade ROC data yielded an AUC of .831 (SE = 0.015), a cut-off RT of 3426 ms, 79.3% sensitivity, and 73.8% specificity. A z-score analysis (Hanley & McNeil, 1982) indicated no significant differences between the two curves (z = 1.198, p = .231).

Comparison of Typically and Low Achieving groups

The ROC analysis yielded an AUC of .839 for the TA group, and an AUC of .756 for the LA group. For the TA group, the maximal cut-off RT was 3603 ms, which yielded a sensitivity of 77.6% and a specificity of 77.1%. For the LA group, the maximal cut-off RT was 4340 ms, with a sensitivity of 74.1% and a specificity of 69.5%. A z-score analysis indicated that the AUC was significantly greater for the TA group than the LA group (z = 2.897, p = .004; Figure 2).

Strategy Use and Behavioral Performance

Within the TA group, 10 (25.6%) were retrievers, 19 were counters (48.7%) and 10 used the two strategies equally (25.6%). Within the LA group, 4 (20%) were retrievers, 9 (45%) were counters, and 7 (35%) used the two strategies equally. A chi-square test indicated no significant differences within the TA and LA groups in the distributions of retrievers, counters, or children who used both equally (χ2(2) = 0.618, p = .734). The findings, however, suggested that typically achieving children tended to rely more on retrieval (typically achieving, M = 52%, SD = 29%; low achieving, M = 38%, SD = 20%), whereas low achieving children relied more on counting F(1,57) = 3.881, p = .054.

Strategy use and RT

Across groups, average RTs were longer for counting trials than for retrieval trials, F(1,53) = 104.776, p < .001, and this was also the case for each group; F(1,33) = 63.813, p < .001, and F(1,19) = 40.487, p < .001, respectively for TA and LA groups (Figure 3). Overall, the typically achieving participants had faster RTs than the low achieving participants (F(1,57) = 5.721, p < .020; Table 1). An independent samples t-test indicated that typically and low achieving children did not differ in their counting RTs, t(52) = −.912, p = .336. However, typically achieving children were faster than the low achieving children in retrieving, t(57) = −3.523, p = .001.

Although the within-group regressions were non-significant, the data appear to indicate between-group differences in the strategy choice and RT profiles (Figure 4; typically achieving, standardized β = −.220, p = ns, R2 = .047; low achieving, standardized β = .296, p = ns, R2 = .087). A difference of slope test indicated that the slopes of RT regressed on percentage retrieved were significantly different (standardized β = .346, t(1) = 2.706, p = .007) for the two groups. This suggests that the RT profiles relative to strategy use for the two groups were significantly different. Typically achieving children become faster as they retrieve more, whereas low achieving children exhibit a significantly different profile, becoming slower as they depend more heavily on retrieval.

Strategy Use and Accuracy

Accuracy for counting and retrieval trials did not differ, F(1, 53) = 3.431, p = .070. The typically achieving children were more accurate overall (96.14 % correct) than their low achieving peers (89.97 % correct), F(1,57) = 11.996, p = .001. An independent samples t-test indicated that typically achieving children were more accurate than low achieving children in both counting, t(52) = 2.369, p = .022, and retrieving, t(57) = 2.492, p = .016. Within each group, the children were equally accurate for counting and retrieval trials; typically achieving, F(1,33) = 2.073, p = .159, low achieving, F(1,19) = 1.428, p = .247; Figure 5.

RT and sum size

Across groups, average RTs increased linearly with sum size, standardized β = .880, p < .001, R2 = .765. This was also the case within each group: TA, standardized β = .520, p < .001, R2=.270; LA, standardized β = .502, p < .001, R2 = 0.252.

Strategy use and problem size

As problem size increased, the participants were less likely to rely on retrieval and more likely to rely on counting (standardized β = −.744, p < .001, R2 = 0.554). Within groups, both typically achieving and low achieving children showed the same pattern; TA, standardized β = −.882, p < .001, R2 = 0.778; LA, standardized β = −.736, p = .001, R2 = 0.542; Figure 6.

Relationship between strategy use and specific components of WM

Across groups, regression analyses were used to examine the relation between performance on the strategy choice assessments and scores on the four components of WM (Table 2). Across groups, children with higher Counting Recall scores were more likely to retrieve, F(1,56) = 6.035, p = .015, R2 = .101. However, this relation did not hold within each group; TA, F(1,36) = 1.633, p = .210, R2 = .043; LA, F(1,18) = 3.831, p = .066, R2 = .175; Tables 3 and 4. A difference of slopes test indicated that these relationships between Counting Recall and retrieval were not significantly different between the two groups, t(1) = 1.179, p = .244.

Retrieved versus counted trials

Across groups, Backward Recall was inversely correlated with retrieved trial RT (β = −.366, p < .01, R2 = 0.135), and Counting Recall was positively correlated with correct counting (β = .325, p < .05, R2 = 0.106), and overall accuracy (β = .270, p < .05, R2 = 0.078; Table 3). For the TA group, Backward Recall was inversely correlated with retrieved RT (β = −.329, p < .05, R2 = 0.108), but no significant relations emerged for the LA group (Table 4).

Discussion

Concordance between Self- and Observer-reports

The high agreement between the child and observer reports of strategy use suggests that children in the 2nd and 3rd grade are aware of the strategies they use to solve arithmetic problems, and that these self responses often concur with the overt signs noted by observers. Agreements between the participants’ and the observers’ reports within the TA and LA groups were virtually identical, suggesting that both groups of children are aware of the strategies they use. These findings further indicate that any differences in actual strategy use between the two groups are not the result of group differences in subjective reporting of strategy use.

ROC Analysis of Strategy Use

As predicted, retrieval trials were solved significantly faster than counting trials, reflecting the well-established differences in processing speed between an effortful production of the answer versus more rapid retrieval from memory (Siegler, 1988). Although this suggests a relationship between retrieval use and RT, it does not provide information about how well RT can be used to infer strategy use. Here, for the first time, we used an ROC to investigate this question. We also investigated the validity of other methods used to categorize strategy use.

As previously discussed, ROC curves can be used to test the appropriateness of a measure or metric for classifying data into groups (Centor, 1991). The results of our ROC analysis suggests that an ROC curve can appropriately categorize the strategy used to solve the problem via RT. For children in 2nd and 3rd grade, a cut-off threshold of 3662.5 ms maximized the sensitivity and specificity of the curve, indicating that trials with RTs greater than the threshold are most likely counting trials, whereas those below are retrieval trials. With this threshold, the resultant sensitivity and specificity indicated that the ROC curve was able to accurately classify a large proportion of the counting and retrieval trials; 78.7% of the counting trials were correctly classified, whereas 72.1% of the retrieval trials were correctly classified.

Depending on whether researchers choose to maximize the sensitivity or specificity of the ROC curve, the cut-off threshold will change accordingly. Maximizing the sensitivity would increase the number of counting trials that were correctly categorized, but would also risk incorrectly categorizing more retrieval trials. On the other hand, maximizing the specificity would decrease the number of counting trials that were incorrectly categorized, but at the risk of fewer correct categorization of retrieval trials. The determination of RT cut-off thresholds that maximizes the differentiability between the two strategies of counting and retrieval is particularly important for the efficient design of functional brain imaging studies (Menon, Meyer, Wu, In Press).

Interestingly, no differences were found between the ROC curves of the 2nd and 3rd graders. In agreement with this, there were no differences in math performance, WM or strategy use. These findings suggest that significantly detectable shifts in strategy use may not occur between the 2nd and 3rd grades. Also, as noted by Siegel and Ryan (1989) and Siegler and Jenkins (1989), the shift in strategy choice is a gradual one, with children beginning to adopt retrieval while still using counting methods. Studies of changes in strategy choice across the school year confirm a gradual shift, at least for children in the United States (Geary, Bow-Thomas, Liu, & Siegler, 1996). Our data also point to considerable individual subject variability which may have precluded detection of any overall differences. An interesting question is whether changes in the relationship between WM and strategy use shifted between 1st and 3rd grades found by Geary et al. (2004) reflect cognitive changes that are not captured by strategy use. In any case, these results point to the need for longitudinal studies to continually trace the development of strategy use between the 1st through 3rd grades.

ROC analyses were also used to compare profiles of strategy use between the TA and LA groups. This analysis showed that the TA group had significantly higher AUC than the LA group, suggesting that RTs are related to strategy use with a higher sensitivity and specificity in the TA group. Our findings also suggest that our methods of strategy assessment are more optimal for detecting strategy use in typically achieving children. The differences in the ROCs between the two groups may be related to inconsistent strategy use and more variable performance in the LA group. For example, the LA group had slightly larger standard deviations in RT than the typically achieving children. Because of the variability in the response profile of counted and retrieved trials, the ROC analyses were less able to distinctly and accurately categorize strategy choices, as indicated by the lower peak sensitivity and specificity (True Positives and 1 - False Positives, respectively) for the LA group. Furthermore, the threshold for optimal sensitivity and specificity was also approximately 700 ms greater for the LA group than the TA group, suggesting that the low achieving children were slower overall in using both strategies, and most particularly when using the retrieval strategy.

The different ROC profiles between the two groups are consistent with previous data which have suggested that children with MLD tend to lag behind typically achieving children in the development of strategy choice (Geary et al., 2007; Siegler, 1988) by at least one year (Geary et al., 2004). TA children in the 2nd and 3rd grades do not differ in their ROC profiles, which suggests that, unless low achieving children have a significantly accelerated growth trajectory in higher grades, they are likely to lag behind the typically achieving group by more than a year. Our findings also add to the growing evidence that many low achieving children have difficulty with math in part because they continue to apply less mature strategies and do not progress to the more efficient memory retrieval strategy (Geary et al., 2004).

Taken together, these ROC analyses strongly suggest that the reaction time differences reflect actual strategy choices and that the differences are more significant in typically achieving children than low achieving children. For the entire group, these findings suggest that a predictive model of strategy use can be derived from RT. Additional predictive models that are created from data sets on participants with other developmental patterns, problem types, can inform us greatly as to whether strategy choices can be predicted from behavior. Most importantly, this analysis provides an additional means of assigning strategy choice, and one that can be used concomitantly with other methods.

Strategy Use and Behavioral Performance

For the entire group, our analysis confirmed the recent findings of Geary et al. (2004) and earlier findings of Siegler (1987), in which RTs were greater when counting was used and in which RTs increased linearly with problem size. Overall, participants were more likely to use retrieval on the smaller problems, and use counting on larger problems.

Interestingly, there were no differences in the proportions of counters and retrievers between typically and low achieving children. Whereas low achieving children did not differ significantly from typically achieving children in the mix of strategies used, they were slower and had more performance errors. Our data support the view that low achieving children experience a persistent difficulty in retrieving arithmetic facts from LTM (Barrouillet & Lepine, 2005; Geary et al., 2004). Typically achieving children were more efficient in the use of their strategies, responding with faster RTs and greater accuracy when solving mental arithmetic problems as compared to low achieving children. Within the TA group, children showed faster RTs and were more accurate when employing the retrieval strategy compared to counting. The typically achieving children were also more accurate in counting than the low achieving children. Typically achieving children who retrieved may have been faster and more accurate due to their direct retrieval of correctly stored arithmetic facts in LTM. There were no significant differences within the two groups in accuracy on trials during which they counted versus those they retrieved. Our data suggest that low achieving participants were simply less proficient than typically achieving children regardless of the strategy they used.

In contrast to typically achieving children, low achieving children did not exhibit the profile of increased speed when they switched to the retrieval strategy. This finding suggests that they failed to show improvements in speed during the use of a method that is typically more efficient. Alternatively, they may have been implicitly counting on these trials. Although low achieving children were spending more time than typically achieving children overall, they remained less accurate and the additional time used to count did not translate to any accuracy differences. This result is consistent with previous findings that low achieving children and children with MLD commit more errors when either finger counting or retrieval is used when solving both simple and complex problems (Geary et al., 2004). It is therefore likely that even when children with MLD eventually shift to retrieval of arithmetic facts from LTM, their processing is less efficient, perhaps due to weaker WM (Barrouillet & Lepine, 2005).

Strategy Use, Performance and Role of Specific WM Components

Our findings highlight the importance of the central executive, rather than the phonological loop and the visuospatial sketchpad, as the key component of WM that is related to strategy use. With the exception of Geary, Hoard, Byrd-Craven, Nugent, and Numtee (2007), little research has investigated how each of the specialized components of WM differentially contributes to strategy use in children. For the entire group, Counting Recall, a measure of the central executive component of WM, was positively correlated with the percentage of trials in which each child used retrieval. This result is in agreement with the finding that as counting span performance increases in 1st graders, reliance on finger counting and finger counting errors decreases (Geary et al., 2004). Our findings complement these results by suggesting that these relations extend beyond 1st grade to children in the 2nd and 3rd grades. An intriguing question for further research is how these processes might change between the 1st and 2nd grade.

Second, our findings clarify the role of all three components of WM in strategy use. Three previous studies (Barrouillet & Lepine, 2005; Geary et al., 2004; Geary et al., 2007) have examined the role of WM in strategy choice using measures associated with the central executive. The procedure used in the Geary et al. (2004) study, based on Hitch and McAuley (1991), is similar to the Counting Recall task in the WMBT-C. Briefly, children are presented with target red and distractor blue dots randomly placed on index cards and were asked to count, remember, and then recall the number of red dots on each card. Barrouillet and Lepine used a similar counting task with target and distracter dots that were interspersed with letters to be remembered. Given the similarities in the counting tasks used in the previous and present studies, the findings seem to converge on the central executive as the specific component of WM that mediates strategy choice. Our study builds upon these previous studies by also including another standardized measure of the central executive (i.e. Backwards Recall), and standardized measures of the phonological loop and the visual spatial sketchpad. A novel finding of our study is that these measures were not significantly correlated with retrieval use or with RTs, suggesting that the central executive may play the main role in orchestrating simple fact retrieval in 2nd and 3rd graders.

Our results also provide new insights into the influence of WM on the counted and retrieved trials, as assessed and validated by our measures of strategy use. The central executive influenced performance on both types of trials, but in slightly different ways. Counting Recall was related to accurate task performance on the counted trials, whereas Backwards Recall was related to the rapidity with which retrieval was used. These findings indicate that, within the same group of children, the central executive not only contributes to flexible strategy use, but also to efficient task performance on the counted and retrieved trials. The phonological loop and visuospatial sketchpad were not related to strategy use on either strategy, but Geary and colleagues (2007) did find that these components of WM were important for other areas of early mathematics learning.

Typically achieving children, compared to low achieving children, performed significantly better on Backward and Counting Recall tasks of the central executive component of WM. No significant differences were found between the two groups for the phonological loop and visuospatial sketchpad capacity. These findings concur with previous findings that demonstrate a persistent WM deficit in children with poor mathematics achievement scores (D'Amico & Guarnera, 2005; McLean & Hitch, 1999). Geary and colleagues (2004) found that in a sample of 1st graders, WM capacity mediated the greater use of verbal counting by the normally achieving children and contributed to the greater use of finger counting by the children in their MLD group. Consistent with these findings, typically achieving children in the current study tended to retrieve slightly more and were faster in responding during the strategy assessment.

Although the IQ scores of typically achieving children were not significantly different from low achieving children, the typically achieving children demonstrated higher WM capacities and significantly better mathematical achievement (as indexed by the WIAT-II) compared to the low achieving group. Thus, differences in WM and mathematical achievement most likely reflect a domain-specific, developmental delay, as opposed to fixed disparities in aptitude. Indeed, Geary and colleagues (2004) found that children with MLD present strategy choice and WM profiles typically one year behind those of their typically achieving peers. This finding has implications for the development of curricula geared towards children with MLD, and perhaps their low achieving peers. Further research is needed to determine whether this delay in children with MLD can be improved with intervention geared towards greater use of retrieval strategies (Fuchs, Fuchs, & Karns, 2001).

The influence of WM on performance also differed between the TA and LA groups. The TA group exhibited the same behavior as the entire group, wherein Backwards Recall was correlated with increased efficient retrieval. The LA group showed no such effects. As noted, compared to typically achieving children, low achieving children demonstrated significantly lower scores on the Backward Digit and Counting Recall tasks of the central executive. The central executive component of WM may orchestrate the storage of arithmetic facts calculated in WM into LTM. On the other hand, low achieving children may not experience difficulty in storing a fact in long-term memory, but instead difficulty in retrieving a stored fact. In this case, the episodic buffer, which is thought to bind representations from a subsystem and LTM into a unitary episodic representation (Baddeley, 2000) may not function adequately in low achieving children. Further research is needed to investigate how the central executive influences storage and consolidation in, and retrieval of facts from, LTM.

To summarize, low achieving children had lower scores on their central executive and phonological loop measures, despite high average IQ scores, and showed lower accuracy and slower RTs when they used retrieval strategies. In contrast with typically achieving children, WM components in low achieving children do not play a significant role in efficient retrieval. Developmental delays in the central executive in low achieving children may therefore contribute to their poor accuracy and slow RT when retrieving basic arithmetic facts from LTM.

Conclusion

ROC analysis provides new and more thorough validation of strategy use as developed by Siegler, Geary and others. Standardizing strategy choice assessment in 2nd and 3rd grade children is useful for discerning strategy maturation as well as for isolating strategy deficits in low achieving children. ROC analysis may be particularly helpful for designing brain imaging experiments based on individual differences in strategy use. Our study further emphasizes the role of the central executive, rather than the phonological loop and visuospatial sketchpad, component of WM as a key cognitive factor that mediates efficient strategy selection and use. Although the proportion of counters and retrievers were analogous in the two groups, low achieving children take more time and commit more errors than the typically achieving children. Our findings also suggest that the central executive influences performance on trials regardless of whether retrieval or counting strategies are used. Together with the observation that low achieving children scored significantly lower on standardized measures of the central executive, these results suggest that the Central Executive component of WM plays an important role in both the choice and execution of particular strategies.

Supplementary Material

Supplementary

Acknowledgments

We thank Katherine Keller, Jose Anguiano, and Sabina Lau for their assistance with data acquisition and scoring. This research was supported by NIH grant HD047520.

Abbreviations used

AUC

Area Under Curve

LA

Low Achieving

LTM

Long-Term Memory

MA

Mental Arithmetic

MLD

Mathematical Learning Disabilities

ROC

Receiver-Operator Characteristic

RT

Response Time

TA

Typically Achieving

WM

Working Memory

WMBT-C

Working Memory Test Battery for Children

References

  1. Adams JW, Hitch GJ. Working memory and children’s mental addition. Journal of Experimental Child Psychology. 1997;67:21–38. doi: 10.1006/jecp.1997.2397. [DOI] [PubMed] [Google Scholar]
  2. Ashcraft MH. The development of mental arithmetic: A chronometric approach. Developmental Review. 1982;2:213–236. [Google Scholar]
  3. Ashcraft MH. Cognitive arithmetic: A review of data and theory. Cognition. 1992;44:75–106. doi: 10.1016/0010-0277(92)90051-i. [DOI] [PubMed] [Google Scholar]
  4. Ashcraft MH, Battaglia J. Cognitive arithmetic: Evidence for retrieval and decision processes in mental addition. Journal of Experimental Psychology: Human Learning & Memory. 1978;4(5):527–538. [Google Scholar]
  5. Ashcraft MH, Fierman BA. Mental addition in third, fourth, and sixth graders. Journal of Experimental Child Psychology. 1982;33:216–234. [Google Scholar]
  6. Baddeley AD. Exploring the central executive. Quarterly Journal of Experimental Psychology. 1996;49A:5–28. [Google Scholar]
  7. Baddeley AD. The episodic buffer: A new component of WM? Trends in Cognitive Science. 2000;4(11):417–423. doi: 10.1016/s1364-6613(00)01538-2. [DOI] [PubMed] [Google Scholar]
  8. Baddeley AD, Hitch G. The psychology of learning and motivation. Vol. 8. Erlbaum; Hillsdale, NJ: 1974. Working Memory; pp. 646–667. [Google Scholar]
  9. Baddeley AD, Thomson N, Buchanan M. Word length and the structure of short-term memory. Journal of Verbal Learning and Verbal Behavior. 1975;14:575–589. [Google Scholar]
  10. Baddeley A, Emslie H, Kolodny J, Duncan J. Random generation and the executive control of working memory. The Quarterly Journal of Experimental Psychology. 1998;51A:819–852. doi: 10.1080/713755788. [DOI] [PubMed] [Google Scholar]
  11. Baroody AJ. A developmental framework for preschool, primary, and special education teachers. New York: Teachers College Press; 1987. Children’s mathematical thinking. [Google Scholar]
  12. Barrouillet P, Lepine R. WM and children's use of retrieval to solve addition problems. Journal of Experimental Child Psychology. 2005;91(3):183–204. doi: 10.1016/j.jecp.2005.03.002. [DOI] [PubMed] [Google Scholar]
  13. Barrouillet P, Fayol M, Lathulière E. Selecting between competitors in multiplication tasks: An explanation of the errors produced by adolescents with learning disabilities. International Journal of Behavioral Development. 1997;21:253–275. [Google Scholar]
  14. Bull R, Johnston RS, Roy JA. Exploring the roles of the visual-spatial sketch pad and central executive in children's arithmetical skills: Views from cognition and developmental neuropsychology. Developmental Neuropsychology. 1999;15(3):421–442. [Google Scholar]
  15. Carpenter TPM, Moser JM. The acquisition of addition and subtraction concepts in grades one through three. Journal for Research in Mathematics and Education. 1984;15(3):179–202. [Google Scholar]
  16. Centor RM. Signal detestability: The use of ROC curves and their analyses. Medical Decision Making. 1991;11(2):102–106. doi: 10.1177/0272989X9101100205. [DOI] [PubMed] [Google Scholar]
  17. Cooney JB, Swanson LH, Ladd SF. Acquisition of mental multiplication skill: Evidence for the transition between counting and retrieval strategies. Cognition and Instruction. 1988;5(4):323–345. [Google Scholar]
  18. D'Amico A, Guarnera M. Exploring WM in children with low arithmetical achievement. Learning and Individual Differences. 2005;15(3):189–202. [Google Scholar]
  19. Della Sala S, Gray C, Baddeley A, Allamano N, Wilson L. Pattern span: A tool for unwelding visuo-spatial memory. Neuropsychologia. 1999;37:1189–1199. doi: 10.1016/s0028-3932(98)00159-6. [DOI] [PubMed] [Google Scholar]
  20. Fuchs LS, Fuchs D, Karns K. Enhancing kindergartner’s mathematical development: Effects of Peer-Assisted Learning Strategies. The Elementary School Journal. 2001;101(5):495–510. [Google Scholar]
  21. Fuson KC. An analysis of the counting-on solution procedure in addition. In: Carpenter TP, Moser JM, Romberg TA, editors. Addition and subtraction: A cognitive perspective. Hillsdale, NJ: Lawrence Erlbaum; 1982. pp. 67–81. [Google Scholar]
  22. Geary DC. Mathematical disabilities: Cognitive, neuropsychological, and genetic components. Psychological Bulletin. 1993;114:345–362. doi: 10.1037/0033-2909.114.2.345. [DOI] [PubMed] [Google Scholar]
  23. Geary DC, Brown SC. Cognitive addition: Strategy choice and speed-of-processing differences in gifted, normal, and mathematically disabled children. Developmental Psychology. 1991;27:398–406. [Google Scholar]
  24. Geary DC, Burlingham-Dubree M. External validation of the strategy choice model for addition. Journal of Experimental Child Psychology. 1989;47:175–192. [Google Scholar]
  25. Geary DC, Widaman KF. Individual differences in cognitive arithmetic. Journal of Experimental Psychology: General. 1987;116:154–171. doi: 10.1037//0096-3445.116.2.154. [DOI] [PubMed] [Google Scholar]
  26. Geary DC, Hamson CO, Hoard MK. Numerical and arithmetical cognition: A longitudinal study of process and concept deficits in children with learning disability. Journal of Experimental Child Psychology. 2000;77:236–263. doi: 10.1006/jecp.2000.2561. [DOI] [PubMed] [Google Scholar]
  27. Geary DC, Hoard MK, Byrd-Craven J, DeSoto CM. Strategy choices in simple and complex addition: Contributions of WM and counting knowledge for children with mathematical disability. Journal of Experimental Child Psychology. 2004;88(88):121–151. doi: 10.1016/j.jecp.2004.03.002. [DOI] [PubMed] [Google Scholar]
  28. Geary DC, Hoard MK, Nugent L, Byrd-Craven J. Strategy use, long-term memory, and WM capacity. In: Berch DB, Mazzocco MM, editors. Why is Math So Hard for Some Children? Baltimore, MD: Paul H. Brookes Publishing Co; 2007. pp. 83–105. [Google Scholar]
  29. Geary DC, Widaman KF, Little TD, Cormier P. Cognitive addition: Comparison of learning-disabled and academically normal elementary school children. Cognitive Development. 1987;2(3):249–269. [Google Scholar]
  30. Geary DC, Hoard MK, Byrd-Craven J, Nugent L, Numtee C. Cognitive mechanisms underlying achievement deficits in children with mathematical learning disability. Child Development. 2007;78:1343–1359. doi: 10.1111/j.1467-8624.2007.01069.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Groen GJ, Parkman JM. A chronometric analysis of simple addition. Psychological Review. 1972;79:329–343. [Google Scholar]
  32. Hanich L, Jordan N, Kaplan D, Dick J. Performance across different areas of mathematical cognition in children with learning disabilities. Journal Educational Psychology. 2001;93:615–626. [Google Scholar]
  33. Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982;143:29–36. doi: 10.1148/radiology.143.1.7063747. [DOI] [PubMed] [Google Scholar]
  34. Hitch GJ. The role of short-term working memory in mental arithmetic. Cognitive Psychology. 1978;10:302–323. [Google Scholar]
  35. Hitch GJ, McAuley E. Working memory in children with specific arithmetical learning difficulties. British Journal of Psychology. 1991;82:375–386. doi: 10.1111/j.2044-8295.1991.tb02406.x. [DOI] [PubMed] [Google Scholar]
  36. Houlihan DM, Ginsburg HP. The addition methods of first- and second-grade children. Journal for Research in Mathematics Education. 1981;12(2):95–106. [Google Scholar]
  37. Jordan N, Montani T. Cognitive arithmetic and problem solving: A comparison of children with specific and general mathematics difficulties. Journal Learning Disabilities. 1997;30:624–634. doi: 10.1177/002221949703000606. [DOI] [PubMed] [Google Scholar]
  38. Jordan N, Hanich L, Kaplan D. A longitudinal study of mathematical competencies in children with specific mathematics difficulties versus children with comorbid mathematics and reading difficulties. Child Development. 2003;74:834–850. doi: 10.1111/1467-8624.00571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Katz D, Foxman B. How well do prediction equations predict? Using receiver operating characteristic curves and accuracy curves to compare validity and generalizability. Epidemiology. 1993;4:319–326. doi: 10.1097/00001648-199307000-00007. [DOI] [PubMed] [Google Scholar]
  40. Kaye DB, Post TA, Hall VC, Dineen JT. Emergence of information-retrieval strategies in numerical cognition: A developmental study. Cognition and Instruction. 1986;3(2):127–150. [Google Scholar]
  41. Logie RH. Visuo-spatial processing in working memory. Quarterly Journal of Experimental Psychology. 1986;38A:229–247. doi: 10.1080/14640748608401596. [DOI] [PubMed] [Google Scholar]
  42. Logie RH, Gilhooly KJ, Wynn V. Counting on working memory in arithmetic problem solving. Memory & Cognition. 1994;22(4):395–410. doi: 10.3758/bf03200866. [DOI] [PubMed] [Google Scholar]
  43. Lusted LB. Signal detectability and medical decision-making. Science: New Series. 1971;171(3977):1217–1219. doi: 10.1126/science.171.3977.1217. [DOI] [PubMed] [Google Scholar]
  44. Mazzocco M, Thompson RE. Kindergarten predictors of math learning disability. Learning Disabilities Research and Practice. 2005;20(3):142–155. doi: 10.1111/j.1540-5826.2005.00129.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. McLean JF, Hitch GJ. WM impairments in children with specific arithmetic learning difficulties. Journal of Experimental Child Psychology. Special Issue The Development of Mathematical Cognition: Arithmetic. 1999;74(3):240–260. doi: 10.1006/jecp.1999.2516. [DOI] [PubMed] [Google Scholar]
  46. Menon V, Meyer M, Wu S. In: Cognitive Developmental Neuroscience of Mental Arithmetic: Implications for Understanding Learning and Academic Achievement. Kelly AE, editor. Cambridge University Press; (In Press) [Google Scholar]
  47. Ostad SA. Developmental differences in addition strategies: a comparison of mathematically disabled and mathematically normal children. British Journal of Educational Psychology. 1997;67:345–357. doi: 10.1111/j.2044-8279.1997.tb01249.x. [DOI] [PubMed] [Google Scholar]
  48. Pickering SJ, Gathercole SE. WM Test Battery for Children (WMTB-C) Manual. London: The Psychological Corporation; 2001. [Google Scholar]
  49. Rasmussen C, Bisanz J. Representation and working memory in early arithmetic. Journal of Experimental Child Psychology. 2005;91(2):137–157. doi: 10.1016/j.jecp.2005.01.004. [DOI] [PubMed] [Google Scholar]
  50. Russell RL, Ginsburg HP. Cognitive analysis of children’s mathematics difficulties. Cognition and Instruction. 1984;1(2):217–224. [Google Scholar]
  51. Shrager J, Siegler RS. SCADS: A model of children’s strategy choices and strategy discoveries. Psychological Science. 1998;9:405–410. [Google Scholar]
  52. Siegel LS, Ryan EB. The development of WM in normally achieving and subtypes of learning disabled children. Child Development. 1989;60(4):973–980. doi: 10.1111/j.1467-8624.1989.tb03528.x. [DOI] [PubMed] [Google Scholar]
  53. Siegler RS. Unities across Domains in Children's Strategy Choices. Hillsdale, NJ: Erlbaum; 1986. [Google Scholar]
  54. Siegler RS. The perils of averaging data over strategies: An example from children’s addition. Journal of Experimental Psychology: General. 1987;116:250–264. [Google Scholar]
  55. Siegler RS. Individual differences in strategy choices: Good students, not-so-good students, and perfectionists. Child Development. 1988;59:833–851. [PubMed] [Google Scholar]
  56. Siegler RS, Jenkins E. How Children Discover New Strategies. Hillsdale, NJ: Lawrence Erlbaum Associates; 1989. [Google Scholar]
  57. Siegler RS, Robinson M. The development of numerical understanding. Advances in Child Development and Behavior. 1982;16:241–312. doi: 10.1016/s0065-2407(08)60072-5. [DOI] [PubMed] [Google Scholar]
  58. Siegler RS, Shrager J. Strategy choice in addition and subtraction: How do children know what to do. In: Sophian C, editor. Origins of Cognitive Skills. Hillside, NJ: Erlbaum; 1984. pp. 229–293. [Google Scholar]
  59. Swanson HL. Working memory in learning disability subgroups. Journal of Experimental Child Psychology. 1993;56(1):87–114. doi: 10.1006/jecp.1993.1027. [DOI] [PubMed] [Google Scholar]
  60. Swanson HL. Cross-sectional and incremental changes in working memory and mathematical problem solving. Journal of Educational Psychology. 2006;98(2):265–281. [Google Scholar]
  61. Swanson HL, Sachse-Lee C. Mathematical problem solving and working memory in children with learning disabilities: Both executive and phonological processes are important. Journal of Experimental Child Psychology. 2001;79(3):294–321. doi: 10.1006/jecp.2000.2587. [DOI] [PubMed] [Google Scholar]
  62. Svenson O. Memory retrieval of answers of simple additions as reflected in response latencies. Acta Psychologica. 1985;59:285–304. [Google Scholar]
  63. Weschler D. Weschler Individual Achievement Test. 2. San Antonio, TX: The Psychological Corporation; 2005. [Google Scholar]
  64. Weschler D. Weschler Abbreviated Scale of Intelligence. San Antonio, TX: Harcourt Assessment, Inc; 1999. [Google Scholar]
  65. Widaman KF, Little TD, Geary DC, Cormier P. Individual differences in the development of skill in mental addition: Internal and external validation of chronometric models. Learning & Individual Differences. 1992;4(3):167–213. [Google Scholar]
  66. Wilson KM, Swanson HL. Are mathematics disabilities due to a domain-general or a domain-specific working memory deficit? Journal of Learning Disabilities. 2001;34(3):237–248. doi: 10.1177/002221940103400304. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary

RESOURCES