Abstract
Objective
To identify relevant efficacy parameters essential in designing clinical trials for brain-penetrant therapies for Gaucher disease, we evaluated cognitive function longitudinally in 34 patients with Gaucher disease type 3 seen at the NIH Clinical Center.
Methods
Individuals were tested with age-appropriate Wechsler Intelligence Scales administered between 1 and 18 times over 29 years. Variation in all IQ domains was not linear with time and was best characterized with the coefficient of variation (SD/mean) for each individual. Mixed-effects regressions were used to determine whether IQ was associated with clinical features. Models were controlled for variation in test version, participant identification, and test administrator.
Results
Mean verbal, performance, and full-scale IQs were 81.77, 75.98, and 82.02, respectively, with a consistent discrepancy between verbal and performance IQs. Mean (SD) verbal, performance, and full-scale coefficient of variations were 0.07 (0.04), 0.09 (0.05), and 0.06 (0.02), respectively. IQ varied about a mean, with no clear trajectory, indicating no clear patterns of improvement or decline over time. EEG lateralization and behavioral issues were consistently associated with IQ.
Conclusions
The observed variation in IQ in Gaucher disease type 3 across the cohort and within single individuals over time may be characteristic of other neuronopathic diseases. Therefore, to reliably use IQ as an efficacy measure in any clinical trial of neurotherapeutics, a normal variation range must be established to assess the clinical relevance of any IQ change.
Gaucher disease (GD) is an autosomal recessive lysosomal storage disorder caused by mutations in GBA1, leading to a deficiency of the enzyme glucocerebrosidase. Phenotypic presentations vary widely, encompassing visceral, hematologic, skeletal, and neurologic symptoms.1 Individuals with GD type 3 (GD3) exhibit a spectrum of neurologic manifestations, which may include myoclonic epilepsy, generalized seizures, ataxia, and cognitive dysfunction.2–8 However, the hallmark and defining feature is slowing, looping, or absence of horizontal saccadic eye movements.9,10
Enzyme replacement therapy (ERT) and substrate reduction therapy are widely used therapies that dramatically improve the nonneurologic, systemic complications of GD.4,7,8 Although no formulation has been proven to affect the neurologic aspects of GD, brain-penetrant substrate reduction therapy, gene therapy, and small-molecule chaperones are being explored.4 However, because of the general efficacy of existing treatments, patients who would have succumbed to visceral disease in the pretreatment era are now living longer with fewer complications. With prolonged longevity, a new generation of ERT-treated individuals with GD3 are redefining the course of the disease, reporting additional, often subtle, neurologic symptoms.
A major challenge in developing novel therapeutics for neurologic disorders is the establishment of relevant efficacy parameters before initiating a clinical trial. To define a biomarker, a thorough natural history assessment must be performed, evaluating changes over time. In light of the increasing need for brain-penetrant therapy, we aimed to characterize the neurologic and neuropsychological aspects of GD3 through retrospective analysis of 34 patients with GD3 followed at the NIH Clinical Center from 1988 to 2017.
Methods
Standard protocol approvals, registrations, and patient consents
Thirty-four English-speaking patients with a clinical and molecular diagnosis of GD3 were enrolled in studies (NCT00001215, NCT00001289) approved by the National Human Genome Research Institute or the National Institute of Neurologic Disorders and Stroke Institutional Review Board, respectively, at the NIH between 1988 and 2017. Written informed consents/assents were signed by the legal guardian and/or patient as appropriate.
Study population
Patients were evaluated by members of the study team at the NIH Clinical Center. Studies performed included a physical and neurologic examination, abdominal MRI, skeletal x-rays, bone density scans (dual-energy x-ray absorptiometry), EEG, somatosensory and brainstem evoked potentials, neuro-ophthalmologic evaluations, and neurocognitive testing. The specific evaluations performed at each visit varied as clinically indicated, and not all studies were conducted at every visit. Details of these assessments have been previously reported.2,3,11
Molecular analyses
Samples were collected for clinical and research assays, as well as for DNA extraction. Genotypic analyses were performed by sequencing all exons of the GBA1 gene as previously described.12
Neuropsychological evaluations
Participant were evaluated with the age-appropriate Wechsler IQ Scale at the time of visit. The Wechsler IQ Scales are the standard metric for evaluating intelligence. Participants 3 to 5 years of age were assessed with the Wechsler Preschool and Primary Scale of Intelligence (WPPSI), specifically test versions WPPSI-R and WPPSI-III. Participants 6 to 16 years of age were assessed with the Wechsler Intelligence Scale for Children (WISC), test versions WISC-R, WISC-III, and WISC-IV. Participants ≥17 years of age were assessed with the Wechsler Adult Intelligence Scale (WAIS), test versions WAIS-R, WAIS-III, and WAIS-IV.3,13–18 Most individuals were tested with version III of each test. Although versions were updated over time, scores across versions have been shown to be comparable.3,13–18 The Wechsler scales assess 3 IQ domains: verbal (VIQ), performance (PIQ), and full-scale (FSIQ) IQ. The VIQ and PIQ scores are derived from subtests assessing different domains of cognition. These include verbal comprehension, working memory, perceptual organization, processing speed, and executive functioning, among others. The subtests are scored on a z scale derived from age-matched controls with a mean score of 10 ± 3. The VIQ and PIQ are combined to give the FSIQ and are reported as z-scaled scores, relative to age-matched controls, with a mean score of 100 ± 15. All 3 IQ scores are considered to be stable across time in a control population.19 Tests were administered by 2 neuropsychologists.
Statistical analyses
All analyses were performed with R (version 3.3.3)20 in RStudio (version 1.0.136; R Foundation for Statistical Computing, Vienna, Austria).21 To account for within-participant correlation, group-wise descriptive statistics were calculated from the individual means for that variable. For each individual, the mean for each parameter was calculated across all of the individual’s visits. The mean across all 34 values was then taken, excluding missing values (table 1). Analysis revealed an effect of test version on FSIQ. Therefore, to control for individual and test-related variation, FSIQ means were derived with a mixed-effects regression, with test version and participant identification as random effects. The individual FSIQ means are the intercept values for each participant (table 2). The reported FSIQ mean is the intercept of this regression model (table 3). The coefficient of variation (CV), defined as the SD divided by the mean, was calculated for each individual to assess change over time. To control for individual and test-related variation in FSIQ, individual CVs and CV means were derived from a mixed-effects regression, with test version and participant identification as random effects. The individual FSIQ CVs are the intercept values for each participant (table 2 and figure 2). The reported FSIQ CV mean is the intercept of the full regression model (figure 2). To evaluate the relationship between IQ and clinical and demographic variables, mixed-effects regressions were performed with the lme4 package.22,23
Table 1.
Participant characteristics
Table 2.
Individual IQ scores
Table 3.
Sample characteristics

Figure 2. Coefficient of variation demonstrates IQ fluctuations.
(A) Coefficient of variation (CV) of verbal IQ by participant. Average variation across the cohort was 0.066 (0.04), mean (SD). (B) CV of performance IQ by participant. Average variation across the cohort was 0.085 (0.05), mean (SD). (C) CV of full-scale IQ by participant. Average variation across the cohort was 0.061 (0.02), mean (SD).
The p values for the full regression models were obtained by a likelihood ratio test compared to the null model (intercept and random effects only). Ninety-five percent confidence intervals (CIs) were calculated via the lme4 basic bootstrap method on 1,000 iterations.22,24
Data availability
Anonymized data will be shared by request from any qualified investigator.
Results
Sample demographics
Thirty-four individuals 3 to 34 years of age evaluated at the Clinical Center of the NIH between 1988 and 2017 were included in this retrospective analysis (table 1). Of those, 2 individuals were never able to complete testing, while 2 others were unable to complete testing in a single instance but did so at others. Summary statistics for the full cohort are shown in table 3. Nineteen male and 15 female participants were evaluated. The ethnicities reported included non-Hispanic white (n = 16), Hispanic (n = 11), black (n = 5), Asian (n = 1), and Native American (n = 1). Genotypes were available on 32 individuals, and the majority (n = 26) carried the L444P/L444P (p.L483P/L483P) genotype. Four patients carried complex alleles that arose from recombination with the GBA1 pseudogene, together with a missense mutation on the second allele. The average age at symptom onset was 15.6 months, with an average age at diagnosis of 21.7 months. Organomegaly was consistently present at diagnosis. Of the 134 visits, there were 8 visits in 6 patients in which the individual was not on enzyme replacement therapy (ERT). The average duration of ERT treatment at the time of visit was 6.77 years. Of the 32 patients who completed testing, 19 had at least 2 evaluations, and 12 had ≥2 . The length of follow-up ranged from 0 years (single visit) to 23 years, with an average length of follow-up of 5.7 years.
IQ scores
IQ scores for each individual are presented in table 2. The reported score is the mean of all assessments performed. FSIQ scores were additionally controlled for test version. Scores in all 3 domains varied widely across the cohort (table 3) with a mean VIQ of 81.77, a mean PIQ of 75.98, and a mean FSIQ of 82.02. All 3 IQ means fall in the below-average to borderline range. Remarkably, the VIQs ranged from 46 to 147, and PIQs ranged from 46 to 122. We noted a consistent discrepancy between VIQ and PIQ, with a mean difference of 5.99 in favor of VIQ. Analysis of the core subtests of these 2 domains was performed to identify specific areas of weakness (table 3). It should be noted that subtests vary with test version, so the numbers of individuals tested are not equal. Verbal subtests include items from the Verbal Comprehension Index and Working Memory Index of the version IV Wechsler tests. Performance subtests include items from the Perceptual Organization Index and Processing Speed Index of the version IV Wechsler tests. Mean subtest scores were within a normal range of 10 ± 3 for 3 of 7 verbal and 1 of 5 performance subtests. The lowest subtest mean for the verbal items was 5.41 for Arithmetic. The lowest subtest mean for the performance items was 4.97 for Digit Symbol Coding. The low scores in the performance subtests were found primarily in timed subtests designed to test visuospatial skills.13–18
Longitudinal assessment of IQ
To better evaluate the individual test results for each participant over time, we constructed per-participant IQ trajectories, showing IQ scores relative to the age at evaluation (figure 1). Across all 3 IQ domains, individuals showed marked internal variation in IQ over time. No clear pattern of change was evident, and no trends were appreciated. There were no patterns of either improvement or decline. The IQ changes in our cohort were not linear with time, making a regression analysis or multivariate analysis of variance (MANOVA) inappropriate. Instead, the CV was used to evaluate change over time. CV was calculated for each individual by dividing the SD in IQ by the mean IQ for that individual. For FSIQ, CV was calculated by test version within each participant. A mixed-effects regression was performed to obtain controlled estimates of CV per participant and CV mean. Figure 2 shows scatterplots of the CV for each participant across the 3 IQ domains. The CV mean ± SD (range) values for VIQ, PIQ, and FSIQ were 0.066 ± 0.04 (0.00–0.18), 0.085 ± 0.05 (0.00–0.21), and 0.061 ± 0.02 (0.03–0.13), respectively. Subtests also did not vary linearly with respect to time. CVs were computed for each of the subtest scores and found to be higher than the composite score CVs (data not shown).
Figure 1. Variation in IQ over time is not linear.
(A) Verbal IQ scores over time. Each line represents a single individual with an evaluation at that age. (B) Performance IQ scores over time. Each line represents a single individual with an evaluation at that age. (C) Full-scale IQ scores over time. Each line represents a single individual with an evaluation at that age
Correlation of clinical features with IQ
To assess the relationship between specific clinical features and IQ scores, mixed-effects regressions were performed with each of the IQ domains as the outcome variable. Participant identification, test version, and test administrator were included as a random intercept effects in all models. Participant identification accounted for a significant proportion of variance in IQ in all models; test version accounted for a proportion of FSIQ variance; test administrator did not account for variance in IQ in any model (table 4). For each IQ domain, the model was constructed with the following clinical correlates: year of evaluation (to control for generational differences), sex, splenectomy (yes/no), bone pain (yes/no), slowed saccadic eye movements (yes/no), history of seizures (yes/no), abnormal EEG (yes/no), EEG lateralization (no, left, right, both), EEG with background slowing (yes/no), abnormal brainstem auditory evoked response (yes/no), genotype, behavioral issues (yes/no), psychiatric issues (yes/no), hearing loss (yes/no), ataxia (yes/no), tremor (yes/no), myoclonus (yes/no), attention-deficit disorder/attention-deficit/hyperactivity disorder (ADD/ADHD) (yes/no), being assigned an individualized education plan at school (yes/no), skeletal issues (mild vs severe), kyphosis (yes/no), scoliosis (yes/no), interstitial lung disease (yes/no), sleep apnea (yes/no), age at symptom onset (months), duration of ERT (years), and autism (yes/no). EEG lateralization was determined by evidence of either slowing or spike and wave activity on one side more than the other. When both of these were seen but were predominant on different sides, lateralization was designated “both.” Behavioral issues included outbursts and behavior problems in school or at home. Psychiatric issues were anxiety or depression. Mild skeletal issues included radiographic evidence of Erlenmeyer flask deformity, bone marrow infiltration, and alignment abnormalities. Severe skeletal involvement was defined as radiographic evidence of alignment abnormalities requiring surgical intervention, avascular necrosis, or spontaneous fractures. Interstitial lung disease was determined by radiograph and/or CT. ADD/ADHD, sleep apnea, and autism required formal diagnosis for a “yes” designation. Missing values were treated as a separate variable level in the regressions. The results of these models can be found in tables 4 and 5.
Table 4.
Model parameters, random effects, and fixed effects show correlations between IQ, demographics, and visceral features of GD3
Table 5.
Fixed effects show correlations between IQ and neurologic features of GD3
To evaluate the significance of associations between clinical variables and IQ scores, we used a basic bootstrap over 1,000 iterations to obtain 95% CIs. Intervals that do not include zero are more likely to have a true association. For VIQ, left-sided EEG lateralization was associated with an increase in VIQ of 5.88 points (95% CI 1.65–10.49) and behavioral issues with a decrease of 5.72 points (95% CI −11.28 to −0.51). A response of “not available” for interstitial lung disease also had a CI excluding zero.
For PIQ, splenectomy was associated with a decrease in PIQ of 28.61 points (95% CI −45.12 to −11.95), slow saccades with a decrease of 15.89 points (95% CI −30.17 to −0.67), both-sided EEG lateralization with a decrease of 8.43 points (95% CI −14.90 to −1.20), and behavioral issues with a decrease of 9.91 points (95% CI −16.32 to −2.96). A response of “not available” for EEG slowing, interstitial lung disease, and sleep apnea also had CIs excluding zero, in addition to 2 with missing genotypes.
For FSIQ, splenectomy was associated with a decrease in FSIQ of 16.93 points (95% CI −30.38 to −4.17), both-sided EEG lateralization with a decrease of 6.36 points (95% CI −11.52 to −1.15), behavioral issues with a decrease of 8.92 points (95% CI −13.62 to −4.12), and psychiatric issues with an increase of 6.01 points (95% CI 0.01–12.21). Diagnosis of autism and patient genotype also had CIs excluding zero. There was no correlation between any of the IQ scores and duration of ERT treatment.
Discussion
This retrospective analysis of cognition in a neurologic disorder identified many factors to consider when IQ is used as an efficacy parameter. Our evaluation of IQ in individuals with GD3 shows that cognition is variable both among patients and within a single patient over time. Our cohort consisted primarily of individuals with the most common GD3 genotype, L444P/L444P.25,26 Mean VIQ, PIQ, and FSIQ fell in the below-average to borderline ranges. However, this mean is not indicative of the wide spectrum of IQs seen throughout the cohort, with multiple individuals in the above-average and superior ranges. It is important to note that IQ cannot be assessed in individuals at the lowest end of the cognitive spectrum. We have included 2 such individuals (patients 33, 34) to give a more complete picture of the phenotypic variation.
Prior studies have reported a variety of neurologic symptoms associated with GD3. While some maintain that the neurologic manifestations are stable or even improve over time, others report progressive deterioration, leaving no clear consensus.2,3,6–8,11
As patients with GD3 have begun to experience less visceral symptoms, an increased emphasis has been placed on cognitive function and school performance. A commonly reported measure of cognitive function is the IQ. Previous studies have looked at the IQs in patients with GD3, reporting that IQs are typically below average, although the range of these values varies widely.2,3,11,27,28 However, most studies have been cross-sectional, and the few that are longitudinal are limited by population size or intervals of follow-up. This study evaluates the neurocognitive function in the largest GD3 cohort to date, spanning a course of 29 years.
Some of the individuals in this cohort were included in prior studies.2,11 These studies noted a discrepancy between VIQ and PIQ scores, with higher VIQ scores.3 This was seen in our larger cohort as well, with an average difference of ≈6 IQ points. However, the earlier studies in GD3 reported no IQ changes in their cohort over time, whereas we found a variation of 3% to 7% (VIQ), 4% to 14% (PIQ), and 4% to 8% (FSIQ). These different conclusions are likely due to the statistical methodology used. In 1 study, the investigators compared a baseline and single follow-up visit, which is a limited longitudinal analysis. Another study used MANOVA with time as a within factor. MANOVA analyses assume an underlying linearity of the data, which, given the results of our trajectory plots, is likely an invalid assumption. We propose that any longitudinal studies of IQ in neurologic diseases include an assessment of the linearity of the data before further analysis.
Our study also differs from the previous 2 studies in that we analyzed the subtest scores to probe the discrepancy between VIQ and PIQ. We found that while only 4 of 7 verbal subtest means fell >1 SD from the mean, 4 of 5 performance subtest means fell outside a single SD. Moreover, the performance subtests with the lowest scores were timed tests, incorporating fine motor skills, horizontal gaze tracking, and processing speed.13–18 This suggests that clinical deficits in these areas may be driving performance scores down.
Mixed-effects regressions were used to assess which clinical variables were associated with each of the IQ domains. Using participant identification, test version, and test administrator as random intercept effects, we obtained 3 unique models. Participant identification accounted for variance in IQ in all models, whereas test administrator did not account for variance in any model. Test version accounted for variance in FSIQ only. Significance of the association with IQ of fixed effects was evaluated by construction of 95% CIs by basic bootstrap method on 1,000 iterations.22,24 Significant effects are identified by CIs that exclude zero.
We found no association between IQ and year of evaluation, sex, bone pain, history of seizures, general EEG abnormality, abnormal brainstem auditory evoked response, hearing loss, ataxia, tremor, myoclonus, ADD/ADHD, having an individualized education plan in school, skeletal issues or structural abnormalities, age at disease onset, or the duration of ERT. The only clinical parameter found to be associated with all 3 IQ domains was behavioral issues, which may be indicative of compliance during testing or the degree of brain involvement. Left-sided EEG lateralization was associated with a minor increase in VIQ score, which may suggest increased activity or compensatory mechanism on the side of the brain involved with language, although EEG was not performed simultaneously with testing. Slow saccades were associated with a lower PIQ score, which corresponds with the results of the subtest analyses showing poorest performance on tests that were timed and required rapid horizontal eye tracking. Both-sided EEG lateralization and splenectomy were associated with lower PIQ and FSIQ and may be indicative of a global neurologic decline. This result should be validated in other neurologic diseases. Chart reviews indicate that all individuals who required splenectomy had severe systemic disease. In all but 1 case, splenectomy was performed before the advent of ERT. The remaining case was not responsive to ERT and continued to have severe visceral disease with life-threatening splenomegaly. We believe the association with splenectomy to be indicative of more aggressive disease.
Our analysis also revealed some associations that we do not believe to be legitimate, despite CIs that may indicate significance. A missing value for lung disease was associated with lower VIQ and PIQ. Missing values for EEG slowing and sleep apnea were also associated with PIQ changes. In cases of missing data, which were very few, this is likely not a real association. We also believe that very small sample sizes of only a single individual are responsible for the apparent associations with genotypes (VIQ and FSIQ) and autism (FSIQ). We assert that these apparent associations should be interpreted with extreme caution and are likely artifacts due to small sample size.
Sample size was a limitation of our study, as evidenced by the artifact associations. In addition, because of our limited sample size, we were unable to include random slopes in our model. A combined random slope and random intercept approach would have been beneficial for mixed-effects regressions because random intercept alone can overestimate the significance of associations. Although the model for VIQ did not differ significantly from a null model (p = 0.08), IQ variation was accounted for by inclusion of random effects. Perhaps a better measure of validation for our models is the relatively low residuals, indicating that a large portion of variance is accounted for in the full models, even without the random slopes. Small sample size, while a general limitation of our study, is not uncommon in rare disease research. With respect to change in IQ over time, there is a dearth of longitudinal studies of IQ in healthy controls, limiting interpretation of the CVs. However, IQ is generally considered to be stable over time, and normalization of data at each age is designed to keep consistency.13–18 Finally, we recognize that there is potential for selection bias because all participants were seen at the NIH Clinical Center, a tertiary referral center.
Despite these limitations, the strengths of our study include a relatively large and diverse sample considering that this is a rare disease. In fact, this is the largest longitudinal study of IQ in GD3.2, 11 All participants underwent detailed clinical examination, allowing us to assess associations with many different clinical parameters. Careful genotyping was performed by full sequencing of all exons of GBA1. This study also has the longest duration, including patients followed up for up to 20 years (mean 5 years). All evaluations were performed by only 2 neuropsychologists, limiting the effects of varying interpretations. Our study is the first study of IQ in GD3 to include analyses of the subtests and the first in which the analysis was performed in a data-driven manner, with no a priori assumption of linear change over time.
Characterization of the normal range of variation in IQ in GD3 is an essential part of understanding the course of the disease. This component is important for parents, providers, and educators working with GD3, and it must be recognized that any single increase or decrease in IQ cannot be interpreted as an absolute change. Fluctuations within the range of 3% to 7% (VIQ), 4% to 14% (PIQ), and 4% to 8% (FSIQ) are to be expected, and variations outside of that range should not necessarily be cause for alarm but should prompt investigation into the appropriate associated clinical parameters. As the field moves toward trials of brain-penetrant therapies, it is essential to understand the expected baseline fluctuations. To truly assess efficacy of a treatment, it will be essential to demonstrate an effect outside of the normal variation range. Moreover, it is important that studies control for the effects of the associated clinical parameters in their interpretation. The difference in FSIQ across test versions makes it important for future studies to control for test version, ideally by using only a single test within each age group. This study demonstrates that future longitudinal studies of cognition, in both GD and other pediatric-onset neurologic disorders, must establish baseline preintervention measures of IQ, define a range of expected variation without assuming linear change, and include >1 postintervention assessment to demonstrate an effect. Such measures should be incorporated early in study design to reliably assess efficacy. While multiple baseline studies at appropriate intervals are the ideal standard, in cases in which only one can be obtained, the normal variation ranges established here would provide a guideline for indication of a clinically meaningful change in IQ for individuals with GD3. We suggest that the following equations be used to calculate a range of expected IQ variation:
When multiple preintervention IQs (IQpre) are obtained and an individual CV is calculated, this equation should be used:
2. When only 1 preintervention IQ is obtained but a population CV range has been established, the following 2 equations apply:
![]() |
![]() |
A postintervention IQ score outside of these ranges would then be considered clinically meaningful. It is reasonable to assume that these findings may not be unique to GD3 and should be tested in other chronic neurologic conditions to establish normal variation ranges.
Acknowledgment
The authors thank Dona Lingerfelt for her advice and input and Dr. Dwight Dickinson for his helpful suggestions.
Glossary
- ADD/ADHD
attention-deficit disorder/attention-deficit/hyperactivity disorder
- CI
confidence interval
- CV
coefficient of variation
- ERT
enzyme replacement therapy
- FSIQ
full-scale IQ
- GD
Gaucher disease
- MANOVA
multivariate analysis of variance
- PIQ
performance IQ
- VIQ
verbal IQ
- WAIS
Wechsler Adult Intelligence Scale
- WISC
Wechsler Intelligence Scale for Children
- WPPSI
Wechsler Preschool and Primary Scale of Intelligence
Appendix. Authors

Study funding
This study was supported by the intramural research programs of the National Human Genome Research Institute, the National Institute of Neurological Disorders and Stroke, and the NIH.
Disclosure
A.M. Steward, E. Wiggs, T. Lindstrom, S. Ukwuani, E. Ryan, N. Tayebi, T. Roshan Lal, and G. Lopez report no disclosures relevant to the manuscript. R. Schiffmann received research support and honoraria from Protalix Biotherapeutics, Amicus Therapeutics, Shire, Inc, and Sanofi Genzyme. E. Sidransky reports no disclosures relevant to the manuscript. Go to Neurology.org/N for full disclosures.
References
- 1.Nalysnyk L, Rotella P, Simeone JC, Hamed A, Weinreb N. Gaucher disease epidemiology and natural history: a comprehensive review of the literature. Hematology 2017;22:65–73. [DOI] [PubMed] [Google Scholar]
- 2.Benko W, Ries M, Wiggs EA, Brady RO, Schiffmann R, Fitzgibbon EJ. The saccadic and neurological deficits in type 3 Gaucher disease. PLoS One 2011;6:e22410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Goker-Alpan O, Wiggs EA, Eblan MJ, et al. Cognitive outcome in treated patients with chronic neuronopathic Gaucher disease. J Pediatr 2008;153:89–94. [DOI] [PubMed] [Google Scholar]
- 4.Gary SE, Ryan E, Steward AM, Sidransky E. Recent advances in the diagnosis and management of Gaucher disease. Expert Rev Endocrinol Metab 2018;13:107–118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Campbell PE, Harris CM, Vellodi A. Deterioration of the auditory brainstem response in children with type 3 Gaucher disease. Neurology 2004;63:385–387. [DOI] [PubMed] [Google Scholar]
- 6.Erikson A, Karlberg J, Skogman A, Dreborg S. Gaucher disease (type III): intellectual profile. Pediatr Neurol 1987;3:87–89. [DOI] [PubMed] [Google Scholar]
- 7.Sechi A, Deroma L, Dardis A, et al. Long term effects of enzyme replacement therapy in an Italian cohort of type 3 Gaucher patients. Mol Genet Metab 2014;113:213–218. [DOI] [PubMed] [Google Scholar]
- 8.Bembi B, Zanatta M, Carrozzi M, et al. Enzyme replacement treatment in type 1 and type 3 Gaucher's disease. Lancet 1994;344:1679–1682. [DOI] [PubMed] [Google Scholar]
- 9.Pensiero S, Accardo A, Pittis MG, Ciana G, Bembi B, Perissutti P. Saccade testing in the diagnosis and treatment of type 3 Gaucher disease. Neurology 2005;65:1837. [DOI] [PubMed] [Google Scholar]
- 10.Goker-Alpan O, Schiffmann R, Park JK, Stubblefield BK, Tayebi N, Sidransky E. Phenotypic continuum in neuronopathic Gaucher disease: an intermediate phenotype between type 2 and type 3. J Pediatr 2003;143:273–276. [DOI] [PubMed] [Google Scholar]
- 11.Altarescu G, Hill S, Wiggs E, et al. The efficacy of enzyme replacement therapy in patients with chronic neuronopathic Gaucher's disease. J Pediatr 2001;138:539–547. [DOI] [PubMed] [Google Scholar]
- 12.Velayati A, Knight MA, Stubblefield BK, Sidransky E, Tayebi N. Identification of recombinant alleles using quantitative real-time PCR implications for Gaucher disease. J Mol Diagn 2011;13:401–405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wechsler D. WISC III (Wechsler Intelligence Scale for Children). San Antonio: Psychological Corp; 1992. [Google Scholar]
- 14.Wechsler D. WISC-4: Wechsler Intelligence Scale for Children: Technical and Interpretive Manual. New York, New York: Pearson; 2003. [Google Scholar]
- 15.Wechsler D. WAIS-R Manual: Wechsler Adult Intelligence Scale-Revised. San Antonio: Psychological Corp; 1981. [Google Scholar]
- 16.Wechsler D. Wechsler Adult Intelligence Scale: WAIS-IV; Technical and Interpretive Manual. New York, New York: Pearson; 2008. [Google Scholar]
- 17.Wechsler D, Corporation P. WPPSI-R, Manual: Wechsler Preschool and Primary Scale of Intelligence, Revised. San Antonio, Texas: Psychological Corp; 1989. [Google Scholar]
- 18.Wechsler D, Corporation P. WAIS-III. San Antonio: Psychological Corp;1997. [Google Scholar]
- 19.Deary IJ, Pattie A, Starr JM. The stability of intelligence from age 11 to age 90 years: the Lothian birth cohort of 1921. Psychol Sci 2013;24:2361–2368. [DOI] [PubMed] [Google Scholar]
- 20.R: A Language and Environment for Statistical Computing [computer program]. Vienna: R Foundation for Statistical Computing; 2017. [Google Scholar]
- 21.RStudio. Integrated Development Environment for R [computer program]. Boston: RStudio, Inc.; 2016. [Google Scholar]
- 22.Bates D, Maechler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw 2015;67:1–48. [Google Scholar]
- 23.Winter B. Linear Models and Linear Mixed Effects Models in R with Linguistic Applications. Merced, California: CoRR; 2013:abs/1308.5499. [Google Scholar]
- 24.Efron B. Bootstrap methods: another look at the Jackknife. Ann Stat 1979;7:1–26. [Google Scholar]
- 25.Koprivica V, Stone DL, Park JK, et al. Analysis and classification of 304 mutant alleles in patients with type 1 and type 3 Gaucher disease. Am J Hum Genet 2000;66:1777–1786. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Sidransky E, Bottler A, Stubblefield B, Ginns EI. DNA mutational analysis of type 1 and type 3 Gaucher patients: how well do mutations predict phenotype? Hum Mutat 1994;3:25–28. [DOI] [PubMed] [Google Scholar]
- 27.Erikson A. Gaucher disease: Norrbottnian type (III): neuropaediatric and neurobiological aspects of clinical patterns and treatment. Acta Paediatr Scand Suppl 1986;326:1–42. [PubMed] [Google Scholar]
- 28.Garvey MA, Toro C, Goldstein S, et al. Somatosensory evoked potentials as a marker of disease burden in type 3 Gaucher disease. Neurology 2001;56:391–394. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Anonymized data will be shared by request from any qualified investigator.








