Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Nov 1.
Published in final edited form as: Neurobiol Aging. 2022 Jul 1;119:22–35. doi: 10.1016/j.neurobiolaging.2022.06.010

Voice Biomarkers as Indicators of Cognitive Changes in Middle and Later Adulthood

Elizabeth Mahon a, Margie E Lachman a
PMCID: PMC9487188  NIHMSID: NIHMS1824377  PMID: 35964541

Abstract

Voice prosody measures have been linked with Alzheimer’s disease (AD), but it is unclear whether they are associated with normal cognitive aging. We assessed relationships between voice measures and 10-year cognitive changes in the MIDUS national sample of middle-aged and older adults ages 42 to 92, with a mean age of 64.09 (SD=11.23) at the second wave. Seven cognitive tests were assessed in 2003-2004 (Wave 2) and 2013-2014 (Wave 3). Voice measures were collected at Wave 3 (N=2585) from audio recordings of the cognitive interviews. Analyses controlled for age, education, depressive symptoms, and health. As predicted, higher jitter was associated with greater declines in episodic memory, verbal fluency, and attention switching. Lower pulse was related to greater decline in episodic memory, and fewer voice breaks were related to greater declines in episodic memory and verbal fluency, although the direction of these effects was contrary to hypotheses. Findings suggest that voice biomarkers may offer a promising approach for early detection of risk factors for cognitive impairment or AD.

Keywords: middle age, cognitive performance, voice prosody, aging biomarkers, cognitive change

1. Introduction

Alzheimer’s disease (AD), the most common type of dementia, which affects approximately one in three seniors (age 65+), is projected to affect nearly 13 million Americans by the year 2050 (Alzheimer’s Association, 2019; Petersen, 2004). With the emergence of voice biomarkers for dementia, it is of interest to explore their influences on cognitive aging before clinical impairment. Although cognitive dysfunction is apparent in AD, methods are needed to identify the covert neuropathological changes that begin years ahead of obvious clinical symptoms, which could help provide earlier treatment and possible intervention. Preclinical cognitive declines are subtle enough to be overlooked until the emergence of more apparent symptoms several years later, by which point there are significant cognitive impairments in life (Alzheimer’s Association, 2019; Elsey et al., 2015; Lopez-de-Ipina et al., 2017; Mundt et al., 2005; Sperling et al., 2011; Twamley et al., 2006). Moreover, recent findings on the most promising treatments suggest that early diagnosis provides the greatest hope for slowing progression, or ultimately reversing symptoms (König et al., 2015; Meilán et al., 2020). One promising biomarker involves voice features (Martinez-Sanchez et al., 2012; Meilán et al., 2020, 2014, 2012).

1.1. Voice Biomarkers

Recent studies suggest that voice measures, most commonly from the category of prosody, are associated with AD (Konig et al., 2019, 2018; Mirheidari et al., 2019; Testa et al., 2001). Cognitive difficulties in domains such as memory and executive function are associated with lower prosody measures of pitch (vibration rate of vocal folds producing higher or lower sound), pulse (burst of air when vocal cords open/close distinguishing voice quality), the number of voice breaks (vibration of vocal cords producing articulated sound), jitter (pitch instability), shimmer (volume instability), and higher measures of amplitude (vocal cord vibration producing highness or lowness of volume) (Martinez-Sanchez et al., 2012; Meilán et al., 2020, 2014, 2012).

Voice measures are sensitive enough to acutely discriminate between participants with healthy cognition, MCI, and mild dementia (Beltrami et al., 2018; Kato et al., 2018; Themistocleous et al., 2020; Thomas et al., 2020; Toth et al., 2017; Xue and Deliyski, 2001). Recent research has found that machine-learning classification models of temporal, spectral, and prosodic voice feature sets can distinguish between normal cognition and cognitive impairment or Alzheimer’s disease (Haider et al., 2020; Nagumo et al., 2020). However, it is of interest whether individual prosody features of voice can be used to identify possible early signs of risk for cognitive impairment. Past work suggests that speech may be an early predictor of cognitive decline years before AD clinical diagnosis (Le et al., 2018; Van Velzen et al., 2014). Voice analysis may be less expensive, less invasive, and more readily obtained to monitor cognitive health compared to traditional diagnostic data such as brain imaging, cerebrospinal fluid, blood plasma, blood serum, urinary cortisol, and salivary cortisol (Johnson et al., 2012; Karlamangla et al., 2005; Lu et al., 2015; Österberg et al., 2012; Paterson et al., 2018; Quadri et al., 2004; van Himbergen et al., 2012; Wolf et al., 2002). Therefore, voice measures may be promising to integrate with other biomarkers to monitor cognitive health, potentially improving detection of pre-symptomatic dementia biomarkers and facilitating early intervention and treatment.

Longitudinal work at the Framingham Heart Study examining midlife and older age found that dementia risk is related to acoustic voice features, such as higher jitter, which was associated with a greater risk of developing dementia during their 7-year follow-up period (Lin et al., 2020). However, their work mainly looked at temporal voice features. In comparison, they did not assess other acoustic features of prosody that were previously found to be significantly related to AD (such as the prosody measures of pitch, pulse, voice breaks, and amplitude that we included in our current study). (Martinez-Sanchez et al., 2012; Meilán et al., 2014; Nasreen et al., 2021; Nishikawa et al., 2022; Testa et al., 2001). These additional measures may also be promising as markers. It would be helpful to assess individual differences in cognitive changes before impairment, which may identify early warning signs for eventual cognitive impairment.

The Framingham Heart Study significantly identified voice measures, including prosody measures of jitter and shimmer, which identified healthy participants who were more likely to develop dementia over 7 years (Lin et al., 2020). Thus, there is evidence that jitter and shimmer are potential biomarkers for early detection of significant cognitive impairment. Overall, it seems that voice measures, which have been linked with AD diagnosis, may be related to individual differences in cognitive changes before impairment. The current study builds on the 7-year-long analysis of Lin et al., 2020 by analyzing 10-year longitudinal data based on cognitive tests administered twice over the 10-year period with voice prosody measures of jitter and shimmer that were analyzed from cognitive interviews. However, we added other measures of voice prosody previously linked to MCI and AD, such as pitch, pulse, voice breaks, and amplitude (Martinez-Sanchez et al., 2012; Meilán et al., 2014; Nasreen et al., 2021; Nishikawa et al., 2022; Testa et al., 2001). Compared to past studies, our study also adjusted for neurological conditions, depressive symptoms, and chronic conditions in addition to age, sex, and education (Xue et al., 2021; Zhang et al., 2021). We looked at voice features predicting cognitive change over 10 years, a more extended range in timespan than the study by Lin et al., 2020. Our dataset also had a lower average age (M=54.99±11) compared to Lin et al., 2020 (M= 63±15) and included a more extensive age range (33 to 83 at occasion 1). Thus, our research focused on more voice prosody measures in a more extensive range of cognitively healthy adults, specifically adding participants in middle age.

Additionally, to assess temporal voice features, Lin et al. 2020 analyzed voice using OpenSMILE, a program developed for speech-based emotion recognition (Eyben et al., 2010). In contrast, we used Praat, a program designed for analyzing, synthesizing, and manipulating speech (Boersma and van Heuven, 2001), a method frequently used to assess prosody measures (Martínez-Nicolás et al., 2021; Martínez-Sánchez et al., 2012; Meilán et al., 2014). Past work also noted limitations in the quality of voice recordings, which created the need for expensive and time-consuming manual transcription (Xue et al., 2021). Our measures were from high-quality voice recordings that did not require transcription. Our results suggest possibilities for developing efficient and non-invasive methods that could be used for long-term monitoring throughout cognitive aging.

Cognitive aging is accompanied by less perceivable physiological changes caused by decline in information integration in the memory and executive function cerebral cortex regions responsible for voice production and speech syntax (Funahashi, 2001; Guenther, 2006; Hirano et al., 2004; Lieberman, 2007). Therefore, changes detected in the voice tied to brain aging could help identify early signs of cognitive impairment and dementia. Further research is needed to investigate whether voice prosody measures could serve as biomarkers of individual differences in cognitive change. This method could differentiate normal and abnormal cognitive changes, which could be used to detect preclinical stages of MCI and resulting AD (Bondi et al., 1999).

1.2. Physiological Biomarkers

Other biomarkers besides voice are related to AD, such as neuroinflammation, allostatic load, vascular disease, APOE4 genetic predisposition, amyloid protein and tau (phosphorylated and total), measured by cerebrospinal fluid and blood, plasma proteins and lipids in blood, and neurofilament light chain in blood (Craig-Schapiro et al., 2011; Fransquet and Ryan, 2019; Heneka et al., 2006; Heutink, 2000; Holmes and Butchart, 2011; Humpel, 2011; Jones et al., 2014; Ling et al., 2017; Matos and De Souza-Talarico, 2019; Shahim et al., 2018; Shoji, 2011; Zou et al., 2020). These factors could potentially inform early interventions, which may decrease the risk of developing AD. However, these measures are collected through invasive methods, such as blood draws and spinal taps, which may also be expensive and cannot necessarily be measured repeatedly over time. In contrast, monitoring voice features remotely and more regularly may help to detect subtle changes that could indicate heightened risk. It is also possible that combining voice measures with the more traditional AD biomarkers may provide a more comprehensive assessment of cognitive status.

Further, past research suggests that neuropathobiological changes associated with AD risk, such as reduced mitochondrial functioning, reduced signaling in the insulin/IGF pathway, and reduced hypothalamic regulation of stress hormones, may appear years, even decades, before clinical manifestation of AD (Arenaza-Urquijo et al., 2015; Bishop et al., 2010; Blalock et al., 2004; Freiherr et al., 2013; Haigis and Guarente, 2006; Honer et al., 2012; Lesuis et al., 2018; Liang et al., 2008; Miller et al., 2008; Stacey et al., 2017; Wilcox et al., 2011). Thus, if researchers can develop a non-invasive and accessible method to identify early indicators of dementia risk, early treatment may be more readily provided, ultimately delaying or possibly preventing the progression to AD. Some of these changes include reduced mitochondrial functioning, reduced signaling in the insulin/IGF pathway, and reduced hypothalamic regulation of stress hormones (Arenaza-Urquijo et al., 2015; Bishop et al., 2010; Blalock et al., 2004; Haigis and Guarente, 2006; Honer et al., 2012; Lesuis et al., 2018; Liang et al., 2008; Miller et al., 2008; Stacey et al., 2017; Wilcox et al., 2011). Age-related changes to the autonomic nervous system and coordination of extraneural tissues have also been associated with declined function of the brain (Beer et al., 2017; Bishop et al., 2010; Goodall et al., 2018; McLean and Le Couteur, 2004; Parashar et al., 2016; Pfeifer et al., 1983; Shimazu et al., 2005; Strickland et al., 2019). Some of these biomarkers can predict cognitive change over a few years, whereas others are useful in assessing risk over decades (Albert et al., 2011).

Neuropathological changes in the brain may begin 10 or more years before overt clinical symptoms appear (Green et al., 2000; Tuladhar et al., 2016). Thus, it is helpful to identify the underlying mechanisms that link voice biomarkers with cognitive changes so that a risk of AD can be identified before significant neuropathology and functional impairments appear. Such risk factors, along with cognitive changes, may provide warning signs for cognitive impairment up to 13 years before a clinical diagnosis (Linn et al., 1995; Sperling et al., 2011; Stewart et al., 2009). Although researchers have explored these biomarkers to improve AD identification, there is currently no widely accepted method to detect the preclinical stages (Alzheimer’s Association, 2019, National Academies, 2017). Providing reliable methods to detect early warning signs of cognitive impairment is particularly important given the vast individual differences in the nature and timing of cognitive declines with aging (Dixon and Lachman, 2019). For example, healthy adults with lower baselines of cognitive functioning have a higher risk of developing neurocognitive disorders such as AD within just a few years (Chen et al., 2001; Elias et al., 2000; Fox et al., 1998). By identifying possible risk factors of abnormal cognitive aging, early detection and intervention may be possible, leading to more effective treatment (National Academies, 2017).

1.3. Cognitive Aging

The most conventional method of identifying cognitive impairment and AD, neuropsychological testing, allows for improved clinical diagnostic accuracy by assessing various cognitive factors (Albert et al., 2011; Ang et al., 2019; Ding et al., 2020). Resulting criteria have aided in identifying those performing below normative expectations who may be at risk or already experiencing progressive cognitive impairment. For example, participants with mild cognitive impairment (MCI) can be identified by 1 to 1.5 standard deviations below the mean score for their age and education on verbal recall tests (Albert et al., 2011; Jak et al., 2016). This result suggests that relatively low cognitive test scores could be indicative of MCI. Further, the risk of a 10-year decline from MCI to AD can be identified by impaired performance in memory and executive function (Albert et al., 2011). Thus, identifying those with impaired cognitive performances or decline may facilitate early intervention, potentially lowering their risk of conversion to dementia. Specifically, past work suggests that the range of cognitive score decline can differentiate the type of risk. For example, mean yearly cognitive declines indicative of change to MCI have been found for Episodic Memory measured by Word List Learning (0.14±0.44) and Executive Function measured by Category Fluency (0.18±0.28); and mean yearly cognitive declines indicative of change to dementia have been found for Episodic Memory (0.25±0.32) and Executive Function (0.18±0.28) (Carmichael et al., 2012). Therefore, examining decline in cognitive scores in previous research has provided information about cognitive impairment, such as MCI or dementia. Examining individual differences in cognitive changes in relation to voice measures among a relatively healthy sample could identify those at risk of later-life cognitive impairment and potential biomarkers for dementia.

1.4. Current Study

Research investigating preclinical cognitive changes in midlife and later life with previously-identified voice biomarkers of AD may provide insights into early indicators of dementia. We tested the hypothesis that voice biomarkers of AD would be associated with individual differences in cognitive changes in midlife and later life, specifically among community-residing and well-functioning middle-aged and older adults who are part of a long-term longitudinal study. We examined voice prosody and cognitive changes in the MIDUS national sample of middle-aged and older adults with a wide range in educational attainment, using a battery of cognitive tests at two time points over 10 years. The goal was to investigate whether voice prosody measures previously associated with AD are also related to individual differences in the extent of cognitive changes over 10 years. Based on previous findings with voice prosody and AD, we hypothesized that higher pitch, higher pulse, more voice breaks, higher jitter, higher shimmer, and lower amplitude would be associated with greater decline in cognitive tests while controlling for demographic and health factors.

2. Methods

2.1. Participants

The current study included participants from the second wave (M2) and third wave (M3) of the Midlife in the United States Study (MIDUS) (Radler and Ryff, 2010) who completed cognitive assessments at both time points. MIDUS is a national longitudinal study that was initiated in 1995-1996 through random digit dialing households in 48 states that had a telephone (with further data collection approximately 9 and 19 years later) to assess how social, psychological, and behavioral factors affect mental and physical health in English-speaking, non-institutionalized adults ranging from young adult to older life (Brim et al., 2004). The original participants in M1 (N=7100) ranged from age 24 to 75 years (M = 46.40±13.00), were 51.7% women, had a mean education of 13.21 years, and had a mean self-rated health on a 5-point scale (1=poor, 5=excellent) of 3.53(±1.02) (Brim et al., 2004). The second wave of data collection (M2) occurred an average of nine years later in 2003-4. Cognitive testing was added at the second and third waves. The third wave (M3) occurred an average of 9.12 years later in 2013-4 (N=3294). In prior publications, additional information about the samples and selective attrition is available (Hughes et al., 2018; Radler and Ryff, 2010).

Available Data for Participants.

Participants were included in our analysis sample if they had demographic and cognitive assessments from M2 and M3, as well as at least two of the three voice recordings selected from M3 cognitive interviews (N=2585). At M3, the participants ranged from age 42 to 92 years (M=64.09, SD=±11.23), were 55.20% women, and had an average education (out of 20 years) of 14.57 years (SD=±2.67). Information is summarized in Table 1. Additionally, 90.2% of participants in our analysis sample were white, 3.2% African American, 0.9% Native American, 0.3% Asian, and 5.4% identified as “other”.

Table 1-.

Subjects, demographics, voice variables, and cognitive variables included in our analysis sample.

N Minimum Maximum Mean Std. Deviation
Age 2585 42.00 92.00 64.09 11.23
Sex (55.20% women) 2585 1.00 2.00 1.55 0.50
Education (years) 2577 6.00 20.00 14.57 2.67
Neurological conditions 2579 0.00 3.00 0.12 0.36
Depressive Symptoms 2585 0.00 1.00 0.10 0.29
Chronic conditions 2531 0.00 22.00 2.22 2.07
Word List Immediate pitch 2549 12.00 609.00 167.34 42.84
Word List Immediate pulse 2558 0.00 30735.00 739.44 804.75
Word List Immediate voice breaks 2558 0.00 374.00 20.74 16.76
Word List Immediate jitter 2548 0.00 20.00 1.77 0.89
Word List Immediate shimmer 2547 2.00 38.00 11.74 2.66
Word List Immediate amplitude 2548 0.00 1.00 0.19 0.10
Word List Delayed pitch 2424 80.00 607.00 164.61 42.94
Word List Delayed pulse 2440 0.00 22022.00 626.51 708.37
Word List Delayed voice breaks 2440 0.00 536.00 17.82 18.49
Word List Delayed jitter 2424 0.00 18.00 1.92 1.10
Word List Delayed shimmer 2425 2.00 32.00 12.00 3.10
Word List Delayed amplitude 2423 0.00 10.00 0.20 .22
Category Fluency pitch 2574 0.00 610.00 159.55 40.01
Category Fluency pulse 2578 0.00 24495.00 1458.28 1006.60
Category Fluency voice breaks 2578 0.00 255.00 41.91 22.25
Category Fluency jitter 2572 0.00 12.00 1.76 0.87
Category Fluency shimmer 2574 2.00 45.00 11.69 2.66
Category Fluency amplitude 2573 0.00 69.00 0.21 1.35
Pitch composite 2581 87.48 608.60 164.06 39.49
Pulse composite 2585 3.67 25750.67 947.84 753.97
Voice breaks composite 2585 0.00 318.00 27.02 15.56
Jitter composite 2580 0.60 15.27 1.82 0.84
Shimmer composite 2580 2.44 30.55 11.81 2.53
Amplitude composite 2580 0.05 34.43 0.21 0.68
M2 Word List Immediate 2412 0.00 15.00 7.00 2.19
M2 Word List Delayed 2332 0.00 14.00 4.69 2.51
M2 Category Fluency 2416 1.00 42.00 19.70 6.00
M2 Backward Digit Span 2415 0.00 8.00 5.10 1.45
M2 Number Series 2414 0.00 5.00 2.51 1.51
M2 30 Second and Counting Task 2415 −2.00 90.00 38.76 11.19
M2 Stop and Go Switch Task 2346 −3.77 −0.61 −1.07 0.23
M3 Word List Immediate 2582 0.00 15.00 6.70 2.38
M3 Word List Delayed 2460 0.00 14.00 4.37 2.68
M3 Category Fluency 2582 0.00 40.00 18.80 6.05
M3 Backward Digit Span 2583 0.00 8.00 4.97 1.47
M3 Number Series 2517 0.00 5.00 2.33 1.56
M3 30 Second and Counting Task 2557 −10.00 90.00 36.40 11.50
M3 Stop and Go Switch Task 2492 −7.67 −0.42 −1.27 0.39

To investigate selective attrition, differences between the analysis sample and participants not included in analyses due to dropping out from the study at Wave 3 or not having usable voice data were assessed by independent samples t-tests. As commonly found in longitudinal studies (including MIDUS), retention in our analysis sample was higher among those who were white, married, and had higher education (Agrigoroaei and Lachman, 2011; Karlamangla et al., 2014; Radler and Ryff, 2010). Additionally, retained participants showed higher performance on all cognitive tests compared to dropouts (Hughes et al., 2018) (see Table 2 for means).

Table 2-.

Demographics and cognitive variables for subjects analyzed and excluded.

    Analysis Sample     Excluded

N Min Max Mean SD N Min Max Mean SD
M2 Age** 2585 33.00 83.00 54.99 11.23 2377 28.00 84.00 55.91 13.64
M2 Sex** 2585 (55.20% women) 2378 (51.30% women)
M2 Education (years)** 2582 6.00 20.00 14.58 2.65 2374 6.00 20.00 13.76 2.60
M2 Neurological conditions** 2556 0.00 3.00 0.11 0.34 2348 0.00 3.00 0.12 0.37
M2 Depressive Symptoms** 2585 (9.70% depressed) 2378 (11.40% depressed)
M2 Chronic conditions*** 2572 0.00 20.00 1.64 1.79 2373 0.00 20.00 1.64 2.07
M2 Word List Immediate** 2412 0.00 15.00 7.00 2.19 1777 0.00 15.00 6.35 2.37
M2 Word List Delayed** 2332 0.00 14.00 4.69 2.51 1664 0.00 14.00 4.05 2.72
M2 Category Fluency 2416 1.00 42.00 19.70 6.00 1776 0.00 40.00 17.53 6.16
M2 Backward Digit Span** 2415 0.00 8.00 5.10 1.45 1778 0.00 8.00 4.88 1.56
M2 Number Series** 2414 0.00 5.00 2.51 1.51 1752 0.00 5.00 1.94 1.46
M2 30 Second and Counting Task 2415 −2.00 90.00 38.76 11.19 1760 2.00 100.00 35.21 11.44
M2 Stop and Go Switch Task** 2346 −3.77 −0.61 −1.07 0.23 1672 −7.36 −0.22 −1.13 0.33
Valid N (listwise) 2235 1561
**

T-test between the two groups is significant at the 0.01 level (equal variance assumed).

*

T-test between the two groups is significant at the 0.05 level (equal variance assumed).

Compared to excluded participants (N=2378), those in the analysis sample (N=2585) at M2 were younger, had a higher proportion of women, had more years of education, had fewer neurological conditions, were less likely to have depressive symptoms, had fewer chronic conditions, and performed better on all cognitive assessments at Wave 2.

2.2. Cognitive Measures

All cognitive measures from M2 and M3 were collected with the Brief Test of Adult Cognition by Telephone (BTACT), a reliable and valid brief cognitive battery designed for telephone administration that assesses key cognitive domains related to cognitive aging (Lachman and Tun, 2008; Lachman et al., 2014; Tun and Lachman, 2006). Participants were administered the BTACT at their preferred time of day. They were instructed to be in a place without distractions, close their eyes to help with concentration, and not write anything during the seven cognitive tests.

The BTACT included measures of episodic memory (Word List Immediate (WLI) and Delayed (WLD); immediate and delayed free recall of 15 words), inductive reasoning (Number Series (NS); completing a pattern in a series of 5 numbers), category verbal fluency (Category Fluency (CAT); the number of words produced from the category of animals in 60 seconds), working memory span (Backward Digit Span (DS); the highest span achieved in repeating strings of digits in reverse order), processing speed (30 Second and Counting Task (BC); the number of digits produced by counting backward from 100 in 30 seconds), and attention switching and inhibitory control reaction time (Stop and Go Switch Task (SGST); (Lachman and Tun, 2008). For SGST, reaction times were calculated with the mean of switch and non-switch trials median latencies on a task requiring alternating between the “normal” condition (i.e., respond “Go” to the stimulus “Green” and “Stop” to the stimulus “Red”) and the “reverse” condition (i.e., respond “Stop” to the stimulus “Green” and “Go” to the stimulus “Red”). See Hughes et al., 2018 for more information about the BTACT and scoring procedures.

2.3. Voice Measures

Voice measures include the six prosody variables that have been most frequently associated with dementia in prior research: pitch was measured by the mean number of vocal cord vibrations per second; pulse was measured by the frequency of glottal pulses of air; number of voice breaks was measured by total time length of breaks divided by total time length of voice; jitter was measured by frequency instability; shimmer was measured by amplitude instability; and amplitude was measured by the average noise-to-signal ratio (Al-Hameed et al., 2019; Boersma and Weeink, 2019; Fraser et al., 2016; König et al., 2015; Martinez-Sanchez et al., 2012; Meilán et al., 2014; Rusz et al., 2014).

The voice data were extracted from the three tests from the cognitive interview which had the most extended segments of uninterrupted participant voice (WLI, WLD, and CAT), with a total of 7763 recordings. Two research assistants (RA’s) were randomly assigned to each voice recording to identify the time frame of the participant’s voice within the recording, to analyze that time frame for voice measures using Praat voice analysis software (Boersma and Weeink, 2019), and finally to document the voice measures in an assigned secure excel file. Reliability between the 2 RAs was calculated across all ratings by calculating the percentage of audio files with consistent values (97.14%). Afterward, the first author resolved any discrepancies by reanalyzing the voice segments. Voice recordings determined to be of poor quality or for which participants did not complete the cognitive measures were excluded from analyses (n=4.24%). For each voice measure, we computed an aggregate composite score from the Praat values averaged across the three cognitive tests and computed internal consistency reliability using coefficient alpha: pitch (α=.928), pulse (α=.871), voice breaks (α=.732), jitter (α=.852), shimmer (α =.878), and amplitude (α=.535).

2.4. Covariates

Based on their significant relationships with cognitive performance in prior literature, we included self-reported age (continuous), sex (male=0, female=1), education (in years) obtained from a telephone interview, as well as the number of neurological conditions, depressive symptoms, and the number of chronic conditions obtained from a self-administered questionnaire (Alzheimer’s Association, 2019, National Academies, 2017; Gatz et al., 2001; Karlamangla et al., 2014; Kessler et al., 2004; Meilán et al., 2020, 2014; Mielke, 2018; Podcasy and Epperson, 2016; Tsenkova and Karlamangla, 2016). The number of neurological conditions was calculated by a count of the number of “yes” answers on self-reported neurological conditions (stroke, serious head injury, Parkinson’s Disease, or other neurological disorder), with a higher score indicating more neurological conditions (M2 range: 0-3; M3 range: 0-3).

Depressive symptoms were calculated as a binary variable (0=no, 1=yes) using phone interview questions per the Diagnostic and Statistical Manual of Mental Disorders third edition (American Psychiatric Association, 1987). The depressive symptoms variable was coded as “yes” if, within the past 12 months, the participant experienced two weeks of either depressed mood or anhedonia most of the day or nearly every day and also had at least four common depression symptoms (such as poor eating, poor sleeping, low energy, low concentration, low feelings of self-worth, and suicidal thoughts or actions) (Kessler et al., 2004; Tsenkova and Karlamangla, 2016).

The number of chronic conditions was calculated at M3 by a count of the number of “yes” answers on self-reported health conditions in the last 12 months (asthma/bronchitis/emphysema, tuberculosis, other lung problems, arthritis/rheumatism/other bone or joint disease, thyroid disease, recurring stomach trouble/indigestion/diarrhea, urinary/bladder problems, being constipated all or most of the time, gall bladder trouble, AIDS/HIV infection, Lupus/other autoimmune disorders, persistent trouble with gums/mouth, persistent trouble with teeth, high blood pressure/hypertension, alcohol/drug problems, migraine headaches, chronic sleeping problems, diabetes/high blood sugar, ulcer, hernia/rupture, cancer, or heart trouble) with a higher score indicating more chronic conditions (M3 range: 0-22). Participants were not assessed for neurocognitive disorders, although this does not exclude the possibility of some undiagnosed cognitive impairment or AD.

2.5. Data Analyses

Descriptive statistics were conducted in SPSS 28.0 (IBM Corp., 2021). We used longitudinal multi-level modeling (MLM) (Bolger and Laurenceau, 2013) and the lme4 package (Bates et al., 2011) in R (R Core Team, 2021) to examine whether voice measures of pitch, pulse, voice breaks, jitter, shimmer, and amplitude were associated with individual differences in change in each cognitive test over 10 years. Time was coded linearly as 0 = M2 and 1=M3. Age, gender, education, neurological conditions, depressive symptoms, and chronic conditions were the control variables. The M2 and M3 cognitive measures were standardized using means and standard deviations from M2 scores. The SGST Latency variable was multiplied by (−1) to keep the direction consistent with other cognitive tests (i.e., a higher score indicates better (faster) performance).

The following multi-level model was tested for each of the seven cognitive tests:

Cognitive_test_scoreti=β0i+timetipitchli+timetipulseli+timetivoicebreaksli+timetijitterli+timetishimmerli+timetiamplitudeli+ageli+sexli+educationli+neurological_conditionsli+depressive_symptomsli+chronic_conditionsli+et+ei

Equation 1 shows the simple linear growth model with the interaction effect of time, and it is based on the two waves of cognitive test scores for the i-th participant: time (t) represents the two waves (0=M2 and 1=M3). β0i is the average cognitive test score at M2 when each covariate equals zero.

3. Results

3.1. Covariates associated with voice measures

The correlations of the covariates with voice variables were consistent with prior work (Gatz et al., 2001; Martinez-Sanchez et al., 2012; Meilán et al., 2020, 2014, 2012; Mielke, 2018; Podcasy and Epperson, 2016). Age was negatively correlated with pulse and voice breaks and positively correlated with jitter and shimmer. Sex was negatively correlated with jitter, shimmer, and amplitude and positively correlated with pitch, pulse, and voice breaks. Education was negatively correlated with pitch and positively correlated with pulse, voice breaks, and shimmer. Neurological conditions were negatively correlated with pulse. Depressive symptoms were positively correlated with pitch and negatively correlated with shimmer. Chronic conditions were negatively correlated with jitter and positively correlated with pulse and voice breaks. The correlations are presented in Table 3.

Table 3-.

Correlations of demographic measures and cognitive measures with voice measures

Pitch composite Pulse composite Voice breaks composite Jitter composite Shimmer composite Amplitude composite
Age Pearson Correlation .012 −.082** −.046** .159** .080** −.004
Sig. (2-tailed) .542 <.001 .020 <.001 <.001 .836
N 2581 2585 2585 2580 2580 2580
Sex Pearson Correlation .590 .309** .099** −.240** −.496** −.068**
Sig. (2-tailed) <.001 <.001 <.001 <.001 <.001 <.001
N 2581 2585 2585 2580 2580 2580
Education Pearson Correlation −.055** .053** .108** .026 .062** .000
Sig. (2-tailed) .005 .007 <.001 .194 .002 .996
N 2573 2577 2577 2572 2572 2572
Neurological Pearson Correlation −.011 −.040** −.025 .050* .059** −.001
conditions Sig. (2-tailed) .581 .042 .205 .011 .003 .972
N 2575 2579 2579 2574 2574 2574
Depressive Symptoms Pearson Correlation .042* .036 .007 −.022 −.050* −.011
Sig. (2-tailed) .033 .068 .734 .270 .011 .567
N 2581 2585 2585 2580 2580 2580
Chronic Pearson Correlation .086** −.018 −.021 .060** .010 .015
conditions Sig. (2-tailed) <.001 .355 .288 .003 .623 .451
N 2528 2531 2531 2527 2527 2527
M2 Word List Immediate Pearson Correlation .147** .179** .125** −.112** −.158** −.105**
Sig. (2-tailed) <.001 <.001 <.001 <.001 <.001 <.001
N 2410 2410 2410 2409 2409 2409
M2 Word List Delayed Pearson Correlation .136** .174** .101** −.117** −.176** −.105**
Sig. (2-tailed) <.001 <.001 <.001 <.001 <.001 <.001
N 2330 2330 2330 2329 2329 2329
M2 Category Fluency Pearson Correlation −.038 .110** .170** −.034 .006 −.018
Sig. (2-tailed) .062 <.001 <.001 .099 .780 .366
N 2414 2414 2414 2413 2413 2413
M2 Backward Digit Span Pearson Correlation .028 .088** .081** .000 .015 −.009
Sig. (2-tailed) .163 <.001 <.001 .990 .469 .665
N 2413 2413 2413 2412 2412 2412
M2 Number Series Pearson Correlation −.065** .051* .096** −.011 .032 −.001
Sig. (2-tailed) .001 .012 <.001 .590 .113 .959
N 2412 2412 2412 2411 2411 2411
M2 30 Second and Counting Task Pearson Correlation −.071** .053** .089** .009 .053** .006
Sig. (2-tailed) <.001 .009 <.001 .644 .009 .759
N 2413 2413 2413 2412 2412 2412
M2 Stop and Go Switch Task Pearson Correlation −.078** .011 .039 −.016 .023 −.032
Sig. (2-tailed) <.001 .578 .059 .437 .271 .118
N 2344 2344 2344 2343 2343 2343
M3 Word List Immediate Pearson Correlation .126** .247** .202** −.168** −.200** −.056**
Sig. (2-tailed) <.001 <.001 <.001 <.001 <.001 .004
N 2581 2581 2581 2580 2580 2580
M3 Word List Delayed Pearson Correlation .148** .248** .187** −.169** −.225** −.035
Sig. (2-tailed) <.001 <.001 <.001 <.001 <.001 .087
N 2460 2460 2460 2460 2459 2459
M3 Category Fluency Pearson Correlation −.016 .185** .247** −.073** −.025 −.026
Sig. (2-tailed) .419 <.001 <.001 <.001 .208 .181
N 2581 2581 2581 2580 2580 2580
M3 Backward Digit Span Pearson Correlation .004 .089** .117** −.025 −.015 −.006
Sig. (2-tailed) .852 <.001 <.001 .204 .446 .776
N 2582 2582 2582 2581 2581 2581
M3 Number Series Pearson Correlation −.078** .041* .090** −.026 .038 −.019
Sig. (2-tailed) <.001 .038 <.001 .190 .056 .352
N 2517 2517 2517 2516 2516 2516
M3 30 Second and Counting Task Pearson Correlation −.069** .056** .104** −.017 .029 −.018
Sig. (2-tailed) <.001 .005 <.001 .400 .145 .354
N 2556 2556 2556 2555 2555 2555
M3 Stop and Go Switch Task Pearson Correlation −.062** .057** .053** −.070** −.005 −.030
Sig. (2-tailed) .002 .004 .008 <.001 .818 .140
N 2492 2492 2492 2491 2491 2491
**

Correlation is significant at the 0.01 level (2-tailed).

*

Correlation is significant at the 0.05 level (2-tailed).

Because voice was measured from the same time point as M3 cognitive tests, we compared results from our primary longitudinal analyses with correlations between voice measures and M3 cognitive performance. All of the multi-level results were replicated with the M3 correlations. However, there were some other significant relationships in the correlations for concurrent assessments of cognition and voice at M3 that did not appear in the primary analyses of cognitive change: WLI was positively correlated with pitch and negatively correlated with shimmer and amplitude; WLD was positively correlated with pulse and pitch, and negatively correlated with shimmer; CAT was positively correlated with pulse; DB was positively correlated with pulse and voice breaks; NS was positively correlated with pulse and voice breaks, and negatively correlated with pitch; BC was positively correlated with pulse and voice breaks, and negatively correlated with pitch; SGST was positively correlated with pulse and voice breaks, and negatively correlated with pitch. All correlations are presented in Table 3.

3.2. Multi-level Models of Voice Relating to Cognitive Measures

For each of the seven cognitive tests, we applied a linear mixed-effects multi-level model (LMER) to investigate whether voice measures were associated with change in cognitive performance over time when controlling for the covariates. Results for cognitive tests that were used for the voice composites (WLI, WLD, and CAT) are shown in Tables 46, and results for the relationships between changes in cognitive tests from M2 to M3 and significant voice composites are shown in Figures 13: as expected, higher jitter was associated with greater decline in WLI (p=.033), WLD (p=.010), and CAT (p=.006) (see Figure 1 graphs showing greater 10-year declines in WLI, WLD, and CAT when M3 jitter is higher). Contrary to hypotheses, lower pulse was associated with greater decline in WLI (p=.038) (see Figure 2 graphs showing greater 10-year decline in WLI when M3 pulse is higher); fewer voice breaks were associated with greater decline in WLI (p=.005), WLD (p<.001), and CAT (p<.001) (see Figure 3 graphs showing greater 10-year declines in WLI, WLD, and CAT when M3 amount of voice breaks is lower). Results for cognitive tests that were not used for the voice composites (DB, NS, BC, and SGST) are shown in Tables 710. Results for SGST, a cognitive test not used in voice composite, are summarized in Table 7; as expected, higher jitter was associated with greater decline in SGST reaction time (p=.001) (see Figure 2 graphs showing greater 10-year decline in SGST when M3 jitter is higher).

Table 4.

Multilevel Model Results for Word List Immediate

Model B SE p CI95
1 (Intercept) −.217 .331 .513 −0.866; 0.433
Time −.036 .179 .840 −0.388; 0.315
Pitch .001 .001 .306 −0.001; 0.003
Pulse .000 .000 .897 0.000; 0.000
Voice breaks −.001 .003 .605 −0.007; 0.004
Jitter .022 .062 .718 −0.099; 0.143
Shimmer .001 .020 .956 −0.039; 0.041
Amplitude .384 .409 .347 −0.417; 1.185
Age −.021*** .001 <.001 −0.024; −0.018
Sex .398*** .042 <.001 0.316; 0.5481
Education .067*** .006 <.001 0.055; 0.078
Neurological conditions −.157*** .042 <.001 −0.240; −0.075
Depressive Symptoms −.018 .052 .730 −0.119; 0.084
Chronic conditions −.017* .008 .033 −0.032; −0.001
Time x pitch −.001 .001 .288 −0.002; 0.001
Time x pulse .000* .000 .045 0.000; 0.000
Time x voice breaks .005** .002 .006 0.001; 0.008
Time x jitter −.073* .036 .040 −0.143; −0.003
Time x shimmer .000 .012 .989 −0.024; 0.023
Time x amplitude −.205 .205 .317 −0.607; 0.197
***

significant at the 0.001 level (2-tailed).

**

significant at the 0.01 level (2-tailed).

*

significant at the 0.05 level (2-tailed).

Table 6.

Multilevel Model Results for Category Fluency

Model B SE p CI95
1 (Intercept) .263 .295 .373 −0.315; 0.840
Time −.401** .146 .005 −0.692; −0.121
Pitch −.001 .001 .420 −0.003; 0.001
Pulse .000 .000 .987 0.000; 0.000
Voice breaks .004 .002 .102 −0.001; 0.009
Jitter −.005 .053 .920 −0.109; 0.099
Shimmer .010 .018 .556 −0.024; 0.045
Amplitude .108 .355 .761 −0.587; 0.803
Age −.021*** .001 <.001 −0.023; −0.018
Sex −.054 .043 .222 −0.139; 0.031
Education .095*** .006 <.001 0.083; 0.107
Neurological conditions −.104* .043 .017 −0.189; −0.019
Depressive Symptoms .036 .053 .504 −0.069; 0.140
Chronic conditions −.024** .008 .003 −0.040; −0.008
Time x pitch .001 .001 .283 0.000; 0.002
Time x pulse .000 .000 .131 0.000; 0.000
Time x voice breaks .005*** .001 .001 0.002; 0.007
Time x jitter −.077** .029 .009 −0.134; −0.019
Time x shimmer .014 .010 .156 −0.005; 0.033
Time x amplitude −.068 .178 .703 −0.416; 0.281
***

significant at the 0.001 level (2-tailed).

**

significant at the 0.01 level (2-tailed).

*

significant at the 0.05 level (2-tailed).

Figure 1.

Figure 1.

Relationship of Jitter to Cognitive Change

Figure 3.

Figure 3.

Relationship of Voice Breaks to Cognitive Change

Figure 2.

Figure 2.

Relationship of Pulse to Cognitive Change

Table 7.

Multilevel Model Results for Stop and Go Switch Task

Model B SE p CI95
 (Intercept) 1.830*** .447 <.001 0.954; 2.705
 Time −.585* .260 .024 −1.094; −0.076
 Pitch .001 .002 .524 −0.002; 0.004
 Pulse .000 .000 .220 0.000; 0.000
 Voice breaks −.001 .004 .890 −0.008; 0.007
 Jitter .197* .085 .020 0.031; 0.362
 Shimmer −.017 .027 .542 −0.069; 0.036
 Amplitude −.366 .527 .487 −1.400; 0.668
 Age −.021*** .002 <.001 −0.024; −0.017
 Sex −.194*** .051 <.001 −0.294; −0.094
 Education .040*** .007 <.001 −0.026; 0.053
 Neurological conditions −.262*** .051 <.001 −0.362; −0.161
 Depressive Symptoms −.002 .063 .972 −0.125; 0.121
 Chronic conditions −.022* .010 .020 −0.041; −0.004
 Time x pitch −.001 .001 .213 −0.003; 0.001
 Time x pulse .000 .000 .129 0.000; 0.000
 Time x voice breaks .002 .003 .558 −0.004; 0.007
 Time x jitter −.166** .053 .002 −0.270; −0.061
 Time x shimmer .013 .016 .444 −0.020; 0.045
 Time x amplitude .163 .264 .538 −0.356; 0.682
***

significant at the 0.001 level (2-tailed).

**

significant at the 0.01 level (2-tailed).

*

significant at the 0.05 level (2-tailed).

Table 10.

Multilevel Model Results for Backwards Counting

Model B SE p CI95
 (Intercept) 1.345*** .254 <.001 0.847; 1.843
 Time −.191 .010 .055 −0.386; 0.004
 Pitch .000 .001 .907 −0.002; 0.001
 Pulse .000 .000 .321 0.000; 0.000
 Voice breaks .001 .002 .457 −0.002; 0.005
 Jitter .049 .042 .245 −0.033; 0.131
 Shimmer .015 .014 .275 −0.012; 0.043
 Amplitude −.176 .258 .495 −0.682; 0.330
 Age −.032*** .002 <.001 −0.035; −0.029
 Sex −.254*** .046 <.001 −0.343; −0.164
 Education .082*** .006 <.001 0.070; 0.095
 Neurological conditions −.149** .046 .001 −0.239; −0.059
 Depressive Symptoms .001 .056 .980 −0.109; 0.112
 Chronic conditions −.029*** .009 .001 −0.045; −0.012
 Time x pitch .000 .000 .376 0.000; 0.001
 Time x pulse .000 .000 .422 0.000; 0.000
 Time x voice breaks .002 .001 .077 0.000; 0.003
 Time x jitter −.027 .020 .182 −0.066; 0.013
 Time x shimmer −.005 .007 .424 −0.018; 0.008
 Time x amplitude .068 .129 .600 −0.185; 0.320
***

significant at the 0.001 level (2-tailed).

**

significant at the 0.01 level (2-tailed).

*

significant at the 0.05 level (2-tailed).

3.3. Sensitivity Analyses

Given that voice prosody could reflect how one performs on a given test, we conducted sensitivity analyses to explore this further for the three tests used to measure voice prosody. We added to the models the measures of total voice duration for the test and the number of intrusions or errors made on the test. These are indirect measures of how well the participant performed on a test and could potentially result in more voice breaks. In other words, those with longer voice duration presumably remember more words. However, having more intrusions also adds to duration but not to the test score. The sensitivity analysis results were consistent with the original findings: as expected, higher jitter was associated with greater decline in WLI (p=.038), WLD (p=.010), and CAT (p=.006). Also consistent with the primary analyses, lower pulse was associated with greater decline in WLI (p=.040); and fewer voice breaks were associated with greater decline in WLI (p=.006), WLD (p<.001), and CAT (p=.001). As the MIDUS sample includes some siblings, we tested whether including within-family dependence (including an additional random effects term for family) would change the results. We confirmed that for this test, the estimates of the key covariates and their significance remained the same. Thus, all reported analyses did not include the family random effects term in the model.

4. Discussion

The current study is the first, to our knowledge, to examine a selection of 6 voice prosody measures previously associated with AD as possible indicators of risk for cognitive impairment by examining their relation to cognitive changes over 10 years in a large community-residing national sample of midlife and older adults. We collected voice measures from participant voice segments, a method used in work measuring temporal and spectral voice features to assess dementia risk in participants (Lin et al., 2020; Nasreen et al., 2021). Our study extended this work by assessing the relationships between these voice prosody measures and cognitive changes in a sample with a larger age range while controlling for health and demographic factors. Our results replicate prior significant findings for jitter by Lin et al. 2020, but not for shimmer. One possible reason is that our model controlled for additional covariates, including neurological conditions such as Parkinson’s disease, which is associated with higher jitter and shimmer (Hertrich and Ackermann, 1995; Ramig et al., 1988). In addition, our study suggests new relationships linking 10-year cognitive changes with prosody measures of pulse and voice breaks in middle-aged and older participants.

Our study examined individual differences in cognitive change over 10 years, assuming that more decline is worse than less or no decline and that more decline is potentially associated with a greater risk of later impairment. The goal was to examine whether voice prosody measures show meaningful relationships with individual differences in change over the 10 years. Although having two occasions is not ideal for examining patterns of age-related changes, the results are consistent with previous aging research investigating different domains of cognitive functioning (Hultsch et al., 1990; Karlamangla et al., 2009; Salthouse, 1996). First, the expected relationships with age were found, with older people showing greater declines over the 10 years compared to younger people. Second, the extent of change across measures is consistent with previous work; for example, Digit Symbol Backwards does not show significant declines until later in life, whereas Speed of Processing (Backward Counting) shows significant declines that begin in midlife. Although not definitive, these longitudinal results provide some evidence for the validity of the changes found. The consistency of results with past work supports the assumption that the changes reflect meaningful differences in cognitive performance over 10 years. In future work, it will be essential to include additional time points for both cognition and voice measures to better understand the trajectory of cognitive changes related to voice changes and to link cognitive and voice changes directly with risk of impairment.

Some results were consistent with predictions based on past work on voice and dementia. The results for jitter supported past AD work in that higher jitter was associated with greater decline in WLI, WLD, CAT, and SGST reaction time. These cognitive changes are consistent with other studies of cognitive aging. Jitter is generally explained as tiny tremors in the voice, inaudible to the human ear, due to the unstable change of pitch frequency affected by executive function decline over time. Jitter has been significantly associated with AD in past work. The present findings extend this to a link with cognitive changes in midlife and later life adults who have not been diagnosed with MCI or AD.

Some results were inconsistent with past research on voice and AD. First, lower pulse was associated greater decline in WLI in contrast to past work in which higher pulse has been associated with AD. However, pulse is less commonly researched as a voice biomarker of AD than other voice prosody measures. Past work on pulse and AD has mainly used small clinical samples. Our results contribute information on the relationships between voice measures and cognitive change in a broader sample and suggest the need for more research on low pulse as an indicator of cognitive impairment. Additionally, the association between fewer voice breaks and greater decline in WLI, WLD, and CAT is the reverse of what was expected based on prior AD literature. This result could be due to using voice recordings from cognitive assessments rather than conversational speech. Breaks in conversation may indicate cognitive problems, whereas breaks during responses to a cognitive test may reflect something about test performance. In other words, those who remember more words on a memory or category fluency test may have more voice breaks if they pause between words recalled. Future work could compare voice breaks in different protocols to examine whether voice breaks are valuable biomarkers for cognitive changes in midlife and later life.

4.1. Limitations

The study was limited in that the voice measures were obtained from several tests used to examine cognitive change. Significant voice measures associated with cognitive change were found mainly for the three tests from which the voice measures were derived. Thus, a dependency between test performance and voice measures should be addressed in future research. Voice prosody may reflect the level of performance on a given test. For example, if someone is doing well on a memory test, they may have less jitter. We attempted to address this limitation with sensitivity analyses that controlled for speech duration (related to how many words were recalled) and intrusion errors. The same results were obtained as the primary analyses. It is also noteworthy that the voice measures were related to change in cognition rather than the level of performance. In future work, voice measures could be obtained from independent speech that is not tied to cognitive testing. Nevertheless, there are also some advantages of using cognitive interviews to obtain voice measures in that it controls the possible range of responses across participants. However, the results may not generalize to free conversational speech.

Another potential limitation was that the audio files were recorded in MP3 format, which is not considered optimal for voice quality. However, although participant voice recordings were not recorded in an ideal format (WAV), the optimization function in Praat enhanced the quality of the voice data. Many past studies have used the Praat optimization method to make their audio analyses of various formats more reliable. Therefore, this is a minor concern that likely did not affect the parameters of our study. Future research should record participant voices in WAV format, an option that can be set as default in most recording devices currently available.

The MIDUS data set we used had some limiting factors. Although ~10% of the original cognitive sample at M2 was non-white, there was a larger percentage of participants in the dropout group from M2 to M3 that were non-white (12.5%) compared to the longitudinal group (7.5%) (Hughes et al., 2018; Radler and Ryff, 2010). Future research with more diverse samples could provide more generalizable results. Future research should include additional covariates such as medication use, smoking, and anxiety to control for other possible factors that could be related to vocal qualities. Also, the sample did not have assessments of cognitive status or clinical diagnosis. Thus, we do not know if any participants have some form of cognitive impairment. Future studies should consider examining the relationships between voice prosody measures and cognitive change in a sample that has midlife and later life participants assessed as healthy or with mild cognitive impairment or dementia by medical professionals.

4.2. Future Work

When considering the next steps for this area of research, it will be essential to test possible mechanisms linking voice features and cognitive changes. Voice features can potentially be affected over time by peripheral mechanisms in the body. For example, changes in pulmonary function can impact vocal cord vibrations and result in altered jitter and shimmer or can impact subglottal pressure in the vocal tract and increase amplitude (Johns et al., 2011). Further, respiratory illness and smoking can cause excessive larynx constriction, resulting in changed pitch (Garellek, 2014). Though less harmful, smaller mechanisms that are common in respiratory illnesses, such as sustained contractions of the laryngeal muscle and excessive cricothyroid muscle tightening over the vocal fold, can raise pitch and amplitude due to peripheral vocal fatigue (Roubeau et al., 1997; Sieck and Prakash, 1995; Simpson and Rosen, 2008). Even without these dysfunctions, the aging of these muscles over time may lead to dopaminergic deficit, resulting in lower pitch and amplitude (Baker, 1998; Rampello et al., 2016; Zarzur et al., 2007).

The central nervous system can also be explored as a mechanism for understanding the association between voice prosody and cognitive changes. Brain damage due to trauma or disease is associated with changes in voice features. Specifically, lesions to the central nervous systems can result in higher pitch and lower amplitude (Arenaza-Urquijo et al., 2015; Honer et al., 2012; Ijitona et al., 2016; Lesuis et al., 2018; Stacey et al., 2017). However, because the brain plays a central role throughout life in physiological changes, brain aging is the ultimate cause of cognitive decline and degeneration of voice. Age-related changes in cognition and voice are both associated with more structural alterations, more cortical thickness, and reduced gray matter volume in cortical and subcortical regions related to executive function and speech production (Diaconescu et al., 2013; Funahashi, 2001; Guenther, 2006; Hirano et al., 2004; Lieberman, 2007; Porter et al., 2011; Tremblay and Deschamps, 2016). In these regions, subtle age-related changes in the dopamine binding potential and receptor densities of the neural networks can later result in significant deterioration in more overt voice parameters, such as speech response time and articulation accuracy (Bäckman et al., 2010; Duchin and Mysak, 1987; Erixon-Lindroth et al., 2005; MacDonald et al., 2009; Nyberg et al., 2016; Wang et al., 1998). Peripheral changes due to aging may happen years before cognitive decline is apparent. It is vital to identify the underlying mechanisms that link voice biomarkers with cognitive change so that AD risk can be calculated decades before serious neuropathology appears.

4.3. Conclusion

As the prevalence of AD continues to rise, identifying biomarkers that can distinguish significant patterns of cognitive change from normal cognitive aging as early as possible could be useful for delaying or reducing dementia. Voice, the primary method of human communication, has the potential for long-term self-monitoring and medical monitoring of cognitive status because specific vocal prosody measures are related to changes in cognitive functioning. Voice recordings are a convenient and low-burden way to obtain biomarkers that may be useful for early detection of cognitive changes. Analysis of voice biomarkers has the potential to become a simple and non-invasive procedure for rapid and precise data collection at any timepoint, ultimately enabling the identification of risks for cognitive changes and impairment and early diagnosis of dementia.

Table 5.

Multilevel Model Results for Word List Delayed

Model B SE p CI95
1 (Intercept) .509 .338 .133 −0.155; 1.172
Time −.267 .180 .139 −0.620; 0.086
Pitch −.001 .001 .626 −0.003; 0.002
Pulse .000 .000 .462 0.000; 0.000
Voice breaks −.005 .003 .056 −0.011; 0.000
Jitter .075* .061 .223 −0.045; 0.195
Shimmer −.023 .021 .253 −0.064; 0.017
Amplitude .352 .405 .384 −0.441; 1.145
Age −.021*** .001 <.001 −0.024; −0.018
Sex .391*** .043 <.001 0.307; 0.476
Education .056*** .006 <.001 0.044; 0.067
Neurological conditions −.107* .043 .013 −0.192; −0.022
Depressive Symptoms −.035 .053 .518 −0.139; 0.070
Chronic conditions −.028*** .008 .001 −0.044; −0.012
Time x pitch .000 .001 .586 −0.001; 0.002
Time x pulse .000 .000 .188 0.000; 0.000
Time x voice breaks .006*** .002 .000 0.003; 0.009
Time x jitter −.093** .035 .008 −0.162; −0.024
Time x shimmer .006 .012 .603 −0.017; 0.030
Time x amplitude −.169 .203 .404 −0.567; 0.228
***

significant at the 0.001 level (2-tailed).

**

significant at the 0.01 level (2-tailed).

*

significant at the 0.05 level (2-tailed).

Table 8.

Multilevel Model Results for Digits Backward

Model B SE p CI95
 (Intercept) −.930** .335 .005 −1.586; −0.274
 Time .182 .176 .302 −0.164; 0.528
 Pitch .001 .001 .346 −0.001; 0.003
 Pulse .000 .000 .170 0.000; 0.000
 Voice breaks .000 .003 .966 −0.005; 0.006
 Jitter −.031 .062 .610 −0.152; 0.090
 Shimmer .047\6* .020 .026 0.006; 0.086
 Amplitude −.014 .413 .973 −0.823; 0.795
 Age −.012*** .002 <.001 −0.001; −0.009
 Sex .081 .045 .069 −0.006; 0.167
 Education .069*** .006 <.001 0.058; 0.081
 Neurological conditions −.020 .045 .648 −0.108; 0.067
 Depressive Symptoms −.001 .055 .987 −0.109; 0.107
 Chronic conditions −.024** .008 .005 −0.040; −0.007
 Time x pitch −.001 .001 .310 −0.002; 0.001
 Time x pulse .000 .000 .444 0.000; 0.000
 Time x voice breaks .002 .002 .143 −0.001; 0.006
 Time x jitter .003 .035 .935 −0.066; 0.072
 Time x shimmer −.017 .012 .140 −0.041; 0.006
 Time x amplitude .006 .207 .977 −0.399; 0.412
***

significant at the 0.001 level (2-tailed).

**

significant at the 0.01 level (2-tailed).

*

significant at the 0.05 level (2-tailed).

Table 9.

Multilevel Model Results for Number Series

Model B SE p CI95
 (Intercept) −.066 .303 .828 −0.659; 0.527
 Time −.308* .152 .043 −0.605; −0.010
 Pitch .000 .001 .799 −0.002; 0.002
 Pulse .000 .000 .583 0.000; 0.000
 Voice breaks .003 .002 .243 −0.002; 0.008
 Jitter .020 .055 .721 −0.088; 0.127
 Shimmer −.016 .018 .381 −0.051; 0.020
 Amplitude .117 .364 .749 −0.598; 0.831
 Age −.019*** .002 <.001 −0.022; −0.016
 Sex −.180*** .044 <.001 −0.267; −0.094
 Education .138*** .006 <.001 0.126; 0.150
 Neurological conditions −.077 .044 .082 −0.164; 0.010
 Depressive Symptoms −.062 .055 .253 −0.169; 0.045
 Chronic conditions −.026** .008 .002 −0.042; −0.010
 Time x pitch .000 .001 .595 −0.001; 0.001
 Time x pulse .000 .000 .734 0.000; 0.000
 Time x voice breaks .001 .001 .666 −0.002; 0.003
 Time x jitter −.046 .030 .128 −0.106; 0.013
 Time x shimmer .018 .010 .071 −0.002; 0.038
 Time x amplitude −.077 .183 .674 −0.435; 0.281
***

significant at the 0.001 level (2-tailed).

**

significant at the 0.01 level (2-tailed).

*

significant at the 0.05 level (2-tailed).

Highlights.

  • We assessed voice prosody and 10-year cognitive changes in midlife and older adults

  • Higher jitter is linked to greater declines in memory, fluency, and attention

  • Lower pulse was associated with greater declines in memory

  • More voice breaks were related to greater declines in memory and fluency

  • Voice biomarkers may help detect cognitive changes and dementia risk

Acknowledgments

1. This research was supported by the National Institute on Aging grants P01-AG020166 and U19-AG051426. The research was approved by the MIDUS Investigators’ Institutional Review Boards.

2. The authors have no competing financial or other interests that might have affected this research.

3. We verify that the voice data and the analyses contained in the manuscript being submitted have not been previously published, have not been submitted elsewhere and will not be submitted elsewhere while under consideration at Neurobiology of Aging.

4. The study was conducted according to the Good Clinical Practice guidelines, the Declaration of Helsinki, and U.S. 21 CFR Part 50 – Protection of Human Participants, and Part 56 – Institutional Review Boards. Informed consent was obtained from all participants before protocol-specific procedures were performed.

5. We verify that all authors have reviewed the contents of the manuscript being submitted, approve of its contents and validate the accuracy of the data.

Footnotes

Disclosure Statement

The authors have no potential financial or personal conflicts of interest including relationships with other people or organizations within three years of beginning the work submitted that could influence this work.

References

  1. Agrigoroaei S, Lachman ME, 2011. Cognitive functioning in midlife and old age: Combined effects of psychosocial and behavioral factors. Journals Gerontol. Ser. B Psychol. Sci. Soc. Sci 66B, 130–140. 10.1093/geronb/gbr017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Al-Hameed S, Benaissa M, Christensen H, Mirheidari B, Blackburn D, Reuber M, 2019. A new diagnostic approach for the identification of patients with neurodegenerative cognitive complaints. PLoS One 14, 1–18. 10.1371/journal.pone.0217388 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Albert MS, DeKosky ST, Dickson D, Dubois B, Feldman HH, Fox NC, Gamst A, Holtzman DM, Jagust WJ, Petersen RC, Snyder PJ, Carrillo MC, Thies B, Phelps CH, 2011. The diagnosis of mild cognitive impairment due to Alzheimer’s disease: recommendations from the National Institute on aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Focus (Madison). 11, 96–106. 10.1176/appi.focus.11.1.96 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Alzheimer’s Association, 2019.
  5. American Psychiatric Association, 1987. Diagnostic and statistical manual of mental disorders (3rd ed., rev.).
  6. Ang TFA, An N, Ding H, Devine S, Auerbach SH, Massaro J, Joshi P, Liu X, Liu Y, Mahon E, Au R, Lin H, 2019. Using data science to diagnose and characterize heterogeneity of Alzheimer’s disease. Alzheimer’s Dement. Transl. Res. Clin. Interv 5, 264–271. 10.1016/j.trci.2019.05.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Arenaza-Urquijo EM, Wirth M, Chételat G, 2015. Cognitive reserve and lifestyle: Moving towards preclinical Alzheimer’s disease. Front. Aging Neurosci 7, 1–12. 10.3389/fnagi.2015.00134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bäckman L, Lindenberger U, Li SC, Nyberg L, 2010. Linking cognitive aging to alterations in dopamine neurotransmitter functioning: Recent data and future avenues. Neurosci. Biobehav. Rev 34, 670–677. 10.1016/j.neubiorev.2009.12.008 [DOI] [PubMed] [Google Scholar]
  9. Baker J, 1998. Psychogenic dysphonia: Peeling back the layers. J. Voice 12, 527–535. 10.1016/S0892-1997(98)80061-1 [DOI] [PubMed] [Google Scholar]
  10. Bates D, Maechler M, Bolker B, Walker S, Christensen RH, Singmann H, …, Grothendieck G, 2011. Package ‘lme4’. Linear mixed-effects models using S4 classes. R package version [WWW Document]. [Google Scholar]
  11. Beer NR, Soroker N, Bornstein NM, Leurer MK, 2017. Association between cardiac autonomic control and cognitive performance among patients post stroke and age-matched healthy controls—an exploratory pilot study. Neurol. Sci 38, 2037–2043. 10.1007/s10072-017-3097-0 [DOI] [PubMed] [Google Scholar]
  12. Beltrami D, Gagliardi G, Favretti RR, Ghidoni E, Tamburini F, Calzà L, 2018. Speech analysis by natural language processing techniques: A possible tool for very early detection of cognitive decline? Front. Aging Neurosci 10. 10.3389/fnagi.2018.00369 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bishop NA, Lu T, Yankner BA, 2010. Neural mechanisms of ageing and cognitive decline. Nature 464, 529–535. 10.1038/nature08983 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Blalock EM, Geddes JW, Chen KC, Porter NM, Markesbery WR, Landfield PW, 2004. Incipient Alzheimer’s disease: Microarray correlation analyses reveal major transcriptional and tumor suppressor responses. Proc. Natl. Acad. Sci. U. S. A 101, 2173–2178. 10.1073/pnas.0308512100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Boersma P, van Heuven V, 2001. Speak and unSpeak with Praat. Glot Int. 5, 341–347. [Google Scholar]
  16. Boersma P, Weeink D, 2019. Praat: Doing phonetics by computer [Computer program]. Version 6.1.08, retrieved 5 December 2019 from http://www.praat.org/.
  17. Bolger N, Laurenceau J-P, 2013. Intensive longitudinal methods: An introduction to diary and experience sampling research., Methodology in the social sciences. Guilford Press, New York, NY, US. [Google Scholar]
  18. Bondi MW, Salmon DP, Galasko D, Thomas RG, Thal LJ, 1999. Neuropsychological function and apolipoprotein E genotype in the preclinical detection of Alzheimer’s disease. Psychol. Aging 14, 295–303. 10.1037/0882-7974.14.2.295 [DOI] [PubMed] [Google Scholar]
  19. Brim OG, Ryff CD, Kessler RC, 2004. The MIDUS National Survey: An Overview., in: How Healthy Are We?: A National Study of Well-Being at Midlife. University of Chicago Press, Chicago, IL, pp. 1–34. [Google Scholar]
  20. Carmichael O, Mungas D, Beckett L, Harvey D, Tomaszewski Farias S, Reed B, Olichney J, Miller J, DeCarli C, 2012. MRI predictors of cognitive change in a diverse and carefully characterized elderly population. Neurobiol. Aging 33, 83–95.e2. 10.1016/j.neurobiolaging.2010.01.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Chen P, Ratcliff G, Belle SHSH, Cauley JAJA, DeKosky STSTST, Ganguli M, 2001. Patterns of cognitive decline in presymptomatic Alzheimer disease: A prospective community study. Arch. Gen. Psychiatry 58, 853–858. 10.1001/archpsyc.58.9.853 [DOI] [PubMed] [Google Scholar]
  22. Craig-Schapiro R, Kuhn M, Xiong C, Pickering EH, Liu J, Misko TP, Perrin RJ, Bales KR, Soares H, Fagan AM, Holtzman DM, 2011. Multiplexed immunoassay panel identifies novel CSF biomarkers for alzheimer’s disease diagnosis and prognosis. PLoS One 6. 10.1371/journal.pone.0018850 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Diaconescu AO, Hasher L, McIntosh AR, 2013. Visual dominance and multisensory integration changes with age. Neuroimage 65, 152–166. 10.1016/j.neuroimage.2012.09.057 [DOI] [PubMed] [Google Scholar]
  24. Ding H, An N, Au R, Devine S, Auerbach SH, MAssaro J, Joshi P, Liu X, Liu Y, MAhon E, Ang TF, Lin H, 2020. Exploring the hierarchical influence of cognitive functions for Alzheimer disease: The Framingham Heart Study. J. Med. Internet Res 22, 1–14. 10.2196/15376 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Dixon RA, Lachman ME, 2019. Risk and protective factors in cognitive aging: Advances in assessment, prevention, and promotion of alternative pathways. aging brain Funct. Adapt. across adulthood. 217–263. 10.1037/0000143-009 [DOI] [Google Scholar]
  26. Duchin SW, Mysak ED, 1987. Disfluency and rate characteristics of young adult, middle-aged, and older males. J. Commun. Disord 20, 245–257. 10.1016/0021-9924(87)90022-0 [DOI] [PubMed] [Google Scholar]
  27. Elias MF, Beiser A, Wolf PA, Au R, White RF, D’Agostino RB, 2000. The preclinical phase of Alzheimer disease: A 22-year prospective study of the Framingham cohort. Arch. Neurol 57, 808–813. 10.1001/archneur.57.6.808 [DOI] [PubMed] [Google Scholar]
  28. Elsey C, Drew P, Jones D, Blackburn D, Wakefield S, Harkness K, Venneri A, Reuber M, 2015. Towards diagnostic conversational profiles of patients presenting with dementia or functional memory disorders to memory clinics. Patient Educ. Couns 98, 1071–1077. 10.1016/j.pec.2015.05.021 [DOI] [PubMed] [Google Scholar]
  29. Erixon-Lindroth N, Farde L, Wahlin TBR, Sovago J, Halldin C, Bäckman L, 2005. The role of the striatal dopamine transporter in cognitive aging. Psychiatry Res. - Neuroimaging 138, 1–12. 10.1016/j.pscychresns.2004.09.005 [DOI] [PubMed] [Google Scholar]
  30. Eyben F, Wöllmer M, Schuller B, 2010. OpenSMILE - The Munich versatile and fast open-source audio feature extractor. MM’10 - Proc. ACM Multimed. 2010 Int. Conf. 1459–1462. 10.1145/1873951.1874246 [DOI] [Google Scholar]
  31. Fox NC, Warrington EK, Seiffer AL, Agnew SK, Rossor MN, 1998. Presymptomatic cognitive deficits in individuals at risk of familial Alzheimer’s disease. A longitudinal prospective study. Brain 121, 1631–1639. 10.1093/brain/121.9.1631 [DOI] [PubMed] [Google Scholar]
  32. Fransquet PD, Ryan J, 2019. The current status of blood epigenetic biomarkers for dementia. Crit. Rev. Clin. Lab. Sci 56, 435–457. [DOI] [PubMed] [Google Scholar]
  33. Fraser KC, Rudzicz F, Hirst G, 2016. Detecting late-life depression in Alzheimer’s disease through analysis of speech and language 1–11. 10.18653/v1/w16-0301 [DOI] [Google Scholar]
  34. Freiherr J, Lundström JN, Habel U, Reetz K, 2013. Multisensory integration mechanisms during aging. Front. Hum. Neurosci 7, 1–6. 10.3389/fnhum.2013.00863 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Funahashi S, 2001. Neuronal mechanisms of executive control by the prefrontal cortex. Neurosci. Res 39, 147–165. 10.1016/S0168-0102(00)00224-8 [DOI] [PubMed] [Google Scholar]
  36. Garellek M, 2014. Voice quality strengthening and glottalization. J. Phon 45, 106–113. 10.1016/j.wocn.2014.04.001 [DOI] [Google Scholar]
  37. Gatz M, Svedberg P, Pedersen NL, Mortimer JA, Berg S, Johansson B, 2001. Education and the risk of Alzheimer’s disease: Findings from the study of dementia in Swedish twins. Journals Gerontol. - Ser. B Psychol. Sci. Soc. Sci 56, 292–300. 10.1093/geronb/56.5.P292 [DOI] [PubMed] [Google Scholar]
  38. Goodall EF, Wang C, Simpson JE, Baker DJ, Drew DR, Heath PR, Saffrey MJ, Romero IA, Wharton SB, 2018. Age-associated changes in the blood-brain barrier: comparative studies in human and mouse. Neuropathol. Appl. Neurobiol 44, 328–340. 10.1111/nan.12408 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Green MS, Kaye JA, Ball MJ, 2000. The Oregon brain aging study: neuropathology accompanying healthy aging in the oldest old. Neurology 54, 105–113. 10.1212/wnl.54.1.105 [DOI] [PubMed] [Google Scholar]
  40. Guenther FH, 2006. Cortical interactions underlying the production of speech sounds. J. Commun. Disord 39, 350–365. [DOI] [PubMed] [Google Scholar]
  41. Haider F, De La Fuente S, Luz S, 2020. An assessment of paralinguistic acoustic features for detection of Alzheimer’s dementia in spontaneous speech. IEEE J. Sel. Top. Signal Process 14, 272–281. 10.1109/JSTSP.2019.2955022 [DOI] [Google Scholar]
  42. Haigis MC, Guarente LP, 2006. Mammalian sirtuins - Emerging roles in physiology, aging, and calorie restriction. Genes Dev. 20, 2913–2921. 10.1101/gad.1467506 [DOI] [PubMed] [Google Scholar]
  43. Heneka MT, Ramanathan M, Jacobs AH, Dumitrescu-Ozimek L, Bilkei-Gorzo A, Debeir T, Sastre M, Galldiks N, Zimmer A, Hoehn M, Heiss WD, Klockgether T, Staufenbiel M, 2006. Locus ceruleus degeneration promotes Alzheimer pathogenesis in amyloid precursor protein 23 transgenic mice. J. Neurosci 26, 1343–1354. 10.1523/JNEUROSCI.4236-05.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Hertrich I, Ackermann H, 1995. Gender-Specific Vocal Dysfunctions in Parkinson’s Disease: Electroglottographic and Acoustic Analyses. Ann. Otol. Rhinol. Laryngol 104, 197–202. 10.1177/000348949510400304 [DOI] [PubMed] [Google Scholar]
  45. Heutink P, 2000. Untangling tau-related dementia. Hum. Mol. Genet 9, 979–986. 10.1093/hmg/9.6.979 [DOI] [PubMed] [Google Scholar]
  46. Hirano S, Bless DM, Muñoz Del Río A, Connor NP, Ford CN, 2004. Therapeutic potential of growth factors for aging voice. Laryngoscope 114, 2161–2167. 10.1097/01.mlg.0000149450.37640.db [DOI] [PubMed] [Google Scholar]
  47. Holmes C, Butchart J, 2011. Systemic inflammation and Alzheimer’s disease, Biochemical Society Transactions. 10.1042/BST0390898 [DOI] [PubMed] [Google Scholar]
  48. Honer WG, Barr AM, Sawada K, Thornton AE, Morris MC, Leurgans SE, Schneider JA, Bennett DA, 2012. Cognitive reserve, presynaptic proteins and dementia in the elderly. Transl. Psychiatry 2, 1–8. 10.1038/tp.2012.38 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Hughes ML, Agrigoroaei S, Jeon M, Bruzzese M, Lachman ME, 2018. Change in cognitive performance from midlife into old age: findings from the Midlife in the United States (MIDUS) Study. J. Int. Neuropsychol. Soc 24, 805–820. 10.1017/S1355617718000425 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Hultsch DF, Hertzog C, Dixon RA, 1990. Ability correlates of memory performance in adulthood and aging. Psychol. Aging 5, 356–368. 10.1037//0882-7974.5.3.356 [DOI] [PubMed] [Google Scholar]
  51. Humpel C, 2011. Identifying and validating biomarkers for Alzheimer’s disease. Trends Biotechnol. 29, 26–32. 10.1016/j.tibtech.2010.09.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. IBM Corp., 2021. IBM SPSS Statistics for Mac OS, Version 28.0. [Google Scholar]
  53. Ijitona T, Caterina G. Di, Yue H, Soraghan J, 2016. Prosodic feature extraction for assessment and treatment of dysarthria 1, 2016. [Google Scholar]
  54. Jak AJ, Preis SR, Beiser AS, Seshadri S, Wolf PA, Bondi MW, Au R, 2016. Neuropsychological Criteria for Mild Cognitive Impairment and Dementia Risk in the Framingham Heart Study. J. Int. Neuropsychol. Soc 22, 937–943. 10.1017/S1355617716000199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Johns MM, Arviso LC, Ramadan F, 2011. Challenges and opportunities in the management of the aging voice. Otolaryngol. - Head Neck Surg 145, 1–6. 10.1177/0194599811404640 [DOI] [PubMed] [Google Scholar]
  56. Johnson KA, Fox NC, Sperling RA, Klunk WE, 2012. Brain imaging in Alzheimer disease. Cold Spring Harb. Perspect. Med 2, 1–23. 10.1101/cshperspect.a006213 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Jones SAV, Breakey RW, Evans PJ, 2014. Heading in football, long-term cognitive decline and dementia: evidence from screening retired professional footballers. Br. J. Sports Med 48, 159–161. [DOI] [PubMed] [Google Scholar]
  58. Karlamangla AS, Miller-Martinez D, Aneshensel CS, Seeman TE, Wight RG, Chodosh J, 2009. Trajectories of cognitive function in late life in the United States: Demographic and socioeconomic predictors. Am. J. Epidemiol 170, 331–342. 10.1093/aje/kwp154 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Karlamangla AS, Miller-Martinez D, Lachman ME, Tun PA, Koretz BK, Seeman TE, 2014. Biological correlates of adult cognition: Midlife in the United States (MIDUS). Neurobiol. Aging 35, 387–394. 10.1016/j.neurobiolaging.2013.07.028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Karlamangla AS, Singer BH, Chodosh J, McEwen BS, Seeman TE, 2005. Urinary cortisol excretion as a predictor of incident cognitive impairment. Neurobiol. Aging 26, 80–84. 10.1016/j.neurobiolaging.2005.09.037 [DOI] [PubMed] [Google Scholar]
  61. Kato S, Homma A, Sakuma T, 2018. Easy screening for mild Alzheimer’s disease and mild cognitive impairment from elderly speech. Curr. Alzheimer Res 15, 104–110. [DOI] [PubMed] [Google Scholar]
  62. Kessler RC, Mickelson KD, Walters EE, Zhao S, Hamilton L, 2004. Age and depression in the MIDUS survey., in: How Healthy Are We?: A National Study of Well-Being at Midlife., The John D and Catherine T MacArthur Foundation Series on Mental Health and Development. Studies on Successful Midlife Development The University of Chicago Press, Chicago, IL, US, pp. 227–251. [Google Scholar]
  63. Konig A, Linz N, Zeghari R, Klinge X, Troger J, Alexandersson J, Robert P, 2019. Detecting apathy in older adults with cognitive disorders using automatic speech analysis. J. Alzheimer’s Dis 69, 1183–1193. [DOI] [PubMed] [Google Scholar]
  64. Konig A, Satt A, Sorin A, Hoory R, Derreumaux A, David R, Robert PH, 2018. Use of speech analyses within a mobile application for the assessment of cognitive impairment in elderly people. Curr. Alzheimer Res 15, 120–129. [DOI] [PubMed] [Google Scholar]
  65. Konig A, Satt A, Sorin A, Hoory R, Toledo-Ronen O, Derreumaux A, Manera V, Verhey F, Aalten P, Robert PH, David R, 2015. Automatic speech analysis for the assessment of patients with predementia and Alzheimer’s disease. Alzheimer’s Dement. Diagnosis, Assess. Dis. Monit 1, 112–124. 10.1016/j.dadm.2014.11.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Lachman ME, Tun PA, 2008. Cognitive testing in large-scale surveys: Assessment by telephone. Hofer SM Alwin DF (Eds.), Handb. Cogn. aging Interdiscip. Perspect (p. 506–523). Sage Publications, Inc. [Google Scholar]
  67. Lachman MEME, Agrigoroaei S, Tun PA, Weaver SLSL, 2014. Monitoring cognitive functioning: psychometric properties of the Brief Test of Adult Cognition by Telephone. Assessment 21, 404–417. 10.1177/1073191113508807 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Le D, Licata K, Mower Provost E, 2018. Automatic quantitative analysis of spontaneous aphasic speech. Speech Commun. 100, 1–12. 10.1016/j.specom.2018.04.001 [DOI] [Google Scholar]
  69. Lesuis SL, Hoeijmakers L, Korosi A, De Rooij SR, Swaab DF, Kessels HW, Lucassen PJ, Krugers HJ, 2018. Vulnerability and resilience to Alzheimer’s disease: Early life conditions modulate neuropathology and determine cognitive reserve 11 Medical and Health Sciences 1109 Neurosciences 17 Psychology and Cognitive Sciences 1701 Psychology Rik Ossenkoppele. Alzheimer’s Res. Ther 10, 1–20. 10.1186/s13195-018-0422-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Liang WS, Reiman EM, Valla J, Dunckley T, Beach TG, Grover A, Niedzielko TL, Schneider LE, Mastroeni D, Caselli R, Kukull W, Morris JC, Hulette CM, Schmechel D, Rogers J, Stephan DA, 2008. Alzheimer’s disease is associated with reduced expression of energy metabolism genes in posterior cingulate neurons. Proc. Natl. Acad. Sci. U. S. A 105, 4441–4446. 10.1073/pnas.0709259105 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Lieberman P, 2007. The evolution of human speech: its anatomical and neural bases. Curr. Anthropol 48, 39–66. [Google Scholar]
  72. Lin H, Karjadi C, Ang TFA, Prajakta J, McManus C, Alhanai TW, Glass J, Au R, 2020. Identification of digital voice biomarkers for cognitive health. Explor. Med 1, 406–417. 10.37349/emed.2020.00028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Ling H, Morris HR, Neal JW, Lees AJ, Hardy J, Holton JL, Revesz T, Williams DDR, 2017. Mixed pathologies including chronic traumatic encephalopathy account for dementia in retired association football (soccer) players. Acta Neuropathol. 133, 337–352. 10.1007/s00401-017-1680-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Linn RT, Wolf P, Bachman DL, Knoefel JE, Cobb JL, Belanger AJ, Kaplan EF, D’Agostino RB, 1995. The “preclinical phase” of probable Alzheimer’s disease. A 13-year prospective study of the Framingham cohort. Arch. Neurol 52, 485–490. [DOI] [PubMed] [Google Scholar]
  75. Lopez-de-Ipina K, Martinez-de-Lizarduy U, Calvo PM, Beitia B, Garcia-Melero J, Ecay-Torres M, Estanga A, Faundez-Zanuy M, 2017. Analysis of disfluencies for automatic detection of mild cognitive impairment: a deep learning approach International Conference and Workshop on Bioinspir. [Google Scholar]
  76. Lu H, Zhu XC, Jiang T, Yu JT, Tan L, 2015. Body fluid biomarkers in Alzheimer’s disease. Ann. Transl. Med 3, 1–15. 10.3978/j.issn.2305-5839.2015.02.13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. MacDonald SWS, Cervenka S, Farde L, Nyberg L, Bäckman L, 2009. Extrastriatal dopamine D2 receptor binding modulates intraindividual variability in episodic recognition and executive functioning. Neuropsychologia 47, 2299–2304. 10.1016/j.neuropsychologia.2009.01.016 [DOI] [PubMed] [Google Scholar]
  78. Martínez-Nicolás I, Llorente TE, Martínez-Sánchez F, Meilán JJG, 2021. Ten years of research on automatic voice and speech analysis of people With Alzheimer’s disease and mild cognitive impairment: a systematic review article. Front. Psychol 12, 1–15. 10.3389/fpsyg.2021.620251 [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Martinez-Sanchez F, Garcia Meilan JJ, Perez E, Carro J, Arana JM, 2012. Expressive prosodic patterns in individuals with Alzheimer’s disease. Psicothema 24, 16–21. [PubMed] [Google Scholar]
  80. Martínez-Sánchez F, Meilán JJG, Pérez E, Carro J, Arana JM, 2012. Patrones de prosodia expresiva en pacientes con enfermedad de alzheimer. Psicothema 24, 16–21. [PubMed] [Google Scholar]
  81. Matos TM, De Souza-Talarico JN, 2019. How stress mediators can cumulatively contribute to Alzheimer’s disease: An allostatic load approach. Dement. e Neuropsychol 13, 11–21. 10.1590/1980-57642018dn13-010002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. McLean AJ, Le Couteur DG, 2004. Aging biology and geriatric clinical pharmacology. Pharmacol. Rev 56, 163 LP–184. 10.1124/pr.56.2.4 [DOI] [PubMed] [Google Scholar]
  83. Meilán JJG, Martínez-Sánchez F, Carro J, López DE, Millian-Morell L, Arana JM, 2014. Speech in Alzheimer’s disease: Can temporal and acoustic parameters discriminate dementia? Dement. Geriatr. Cogn. Disord 37, 327–334. 10.1159/000356726 [DOI] [PubMed] [Google Scholar]
  84. Meilán JJG, Martínez-Sánchez F, Carro J, Sánchez JA, Pérez E, 2012. Acoustic markers associated with impairment in language processing in Alzheimer’s disease. Span. J. Psychol 15, 487–494. 10.5209/rev_sjop.2012.v15.n2.38859 [DOI] [PubMed] [Google Scholar]
  85. Meilán JJG, Martínez-Sánchez F, Martínez-Nicolás I, Llorente TE, Carro J, 2020. Changes in the rhythm of speech difference between people with nondegenerative mild cognitive impairment and with preclinical dementia. Behav. Neurol 2020. 10.1155/2020/4683573 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Mielke MM, 2018. Sex and gender differences in Alzheimer disease dementia. Psychiatr. Times 35, 14–15. [PMC free article] [PubMed] [Google Scholar]
  87. Miller JA, Oldham MC, Geschwind DH, 2008. A systems level analysis of transcriptional changes in Alzheimer’s disease and normal aging. J. Neurosci 28, 1410–1420. 10.1523/JNEUROSCI.4098-07.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Mirheidari B, Blackburn D, Walker T, Reuber M, Christensen H, 2019. Dementia detection using automatic analysis of conversations. Comput. Speech Lang 53, 65–79. 10.1016/j.csl.2018.07.006 [DOI] [Google Scholar]
  89. Mundt JC, Moore HK, Greist JH, 2005. A novel interactive voice response (IVR) system for dementia screening, education, and referral: One-year summary. Alzheimer Dis. Assoc. Disord 19, 143–147. [DOI] [PubMed] [Google Scholar]
  90. Nagumo R, Zhang Y, Ogawa Y, Hosokawa M, Abe K, Ukeda T, Sumi S, Kurita S, Nakakubo S, Lee S, Doi T, Shimada H, 2020. Automatic detection of cognitive impairments through acoustic analysis of speech. Curr. Alzheimer Res 17, 60–68. 10.2174/1567205017666200213094513 [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Nasreen S, Hough J, Purver M, 2021. Detecting alzheimer’s disease using interactional and acoustic features from spontaneous speech. Proc. Annu. Conf. Int. Speech Commun. Assoc. INTERSPEECH 1, 306–310. 10.21437/Interspeech.2021-1526 [DOI] [Google Scholar]
  92. National Academies, 2017.
  93. Nishikawa K, Akihiro K, Hirakawa R, Kawano H, Nakatoh Y, 2022. Machine learning model for discrimination of mild dementia patients using acoustic features. Cogn. Robot 2, 21–29. 10.1016/j.cogr.2021.12.003 [DOI] [Google Scholar]
  94. Nyberg L, Karalija N, Salami A, Andersson M, Wåhlin A, Kaboovand N, Köhncke Y, Axelsson J, Rieckmann A, Papenberg G, Garrett DD, Riklund K, Lövdén M, Lindenberger U, Bäckman L, 2016. Dopamine D2 receptor availability is linked to hippocampal-caudate functional connectivity and episodic memory. Proc. Natl. Acad. Sci. U. S. A 113, 7918–7923. 10.1073/pnas.1606309113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Österberg K, Karlson B, Malmberg B, Hansen ÅM, 2012. A follow-up of cognitive performance and diurnal salivary cortisol changes in former burnout patients. Stress 15, 589–600. 10.3109/10253890.2011.648972 [DOI] [PubMed] [Google Scholar]
  96. Parashar R, Amir M, Pakhare A, Rathi P, Chaudhary L, 2016. Age related changes in autonomic functions. J. Clin. Diagnostic Res 10, CC11–CC15. 10.7860/JCDR/2016/16889.7497 [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Paterson RW, Slattery CF, Poole T, Nicholas JM, Magdalinou NK, Toombs J, Chapman MD, Lunn MP, Heslegrave AJ, Foiani MS, Weston PSJ, Keshavan A, Rohrer JD, Rossor MN, Warren JD, Mummery CJ, Blennow K, Fox NC, Zetterberg H, Schott JM, 2018. Cerebrospinal fluid in the differential diagnosis of Alzheimer’s disease: Clinical utility of an extended panel of biomarkers in a specialist cognitive clinic. Alzheimer’s Res. Ther 10, 1–11. 10.1186/s13195-018-0361-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Petersen RC, 2004. Mild cognitive impairment as a diagnostic entity. J. Intern. Med 256, 183–194. 10.1111/j.1365-2796.2004.01388.x [DOI] [PubMed] [Google Scholar]
  99. Pfeifer MA, Weinberg CR, Cook D, Best JD, Reenan A, Halter JB, 1983. Differential changes of autonomic nervous system function with age in man. Am. J. Med 75, 249–258. 10.1016/0002-9343(83)91201-9 [DOI] [PubMed] [Google Scholar]
  100. Podcasy JL, Epperson CN, 2016. Considering sex and gender in Alzheimer disease and other dementias. Dialogues Clin. Neurosci 18, 437–446. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Porter BA, Rosenthal TR, Ranasinghe KG, Kilgard MP, 2011. Discrimination of brief speech sounds is impaired in rats with auditory cortex lesions. Behav. Brain Res 219, 68–74. 10.1016/j.bbr.2010.12.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Quadri P, Fragiacomo C, Pezzati R, Zanda E, Forloni G, Tettamanti M, Lucca U, 2004. Homocysteine, folate, and vitamin B-12 in mild cognitive impairment, Alzheimer disease, and vascular dementia. Am. J. Clin. Nutr 80, 114–122. 10.1093/ajcn/80.1.114 [DOI] [PubMed] [Google Scholar]
  103. R Core Team, 2021. R: A language and environment for statistical computing.
  104. Radler BT, Ryff CD, 2010. Who participates? Accounting for longitudinal retention in the MIDUS national study of health and wellbeing. J. Aging Health 22, 307–331. 10.1177/0898264309358617 [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Ramig LA, Titze IR, Scherer RC, Ringel SP, 1988. Acoustic Analysis of Voices of Patients with Neurologic Disease: Rationale and Preliminary Data. Ann. Otol. Rhinol. Laryngol 97, 164–172. 10.1177/000348948809700214 [DOI] [PubMed] [Google Scholar]
  106. Rampello Liborio, Rampello Luigi, Patti F, Zappia M, 2016. When the word doesn’t come out: A synthetic overview of dysarthria. J. Neurol. Sci 369, 354–360. 10.1016/j.jns.2016.08.048 [DOI] [PubMed] [Google Scholar]
  107. Roubeau B, Chevrie-Muller C, Saint Guily JL, 1997. Electromyographic activity of strap and cricothyroid muscles in pitch change. Acta Otolaryngol. 117, 459–464. 10.3109/00016489709113421 [DOI] [PubMed] [Google Scholar]
  108. Rusz J, Saft C, Schlegel U, Hoffman R, Skodda S, 2014. Phonatory dysfunction as a preclinical symptom of huntington disease. PLoS One 9, 1–7. 10.1371/journal.pone.0113412 [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Salthouse TA, 1996. The processing-speed theory of adult age differences in cognition. Psychol. Rev 103, 403–428. 10.1037/0033-295x.103.3.403 [DOI] [PubMed] [Google Scholar]
  110. Shahim P, Tegner Y, Marklund N, Blennow K, Zetterberg H, 2018. Neurofilament light and tau as blood biomarkers for sports-related concussion. Neurology 90, E1780–E1788. 10.1212/WNL.0000000000005518 [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Shimazu T, Tamura N, Shimazu K, 2005. Aging of the autonomic nervous system. Japanese J. Clin. Med 63, 973–977. [PubMed] [Google Scholar]
  112. Shoji M, 2011. Biomarkers of the dementia. Int. J. Alzheimers. Dis 2011. 10.4061/2011/564321 [DOI] [PMC free article] [PubMed] [Google Scholar]
  113. Sieck GC, Prakash YS, 1995. Fatigue at the neuromuscular junction, in: Fatigue. pp. 83–100. [PubMed] [Google Scholar]
  114. Simpson B, Rosen C, 2008. Glottic insufficiency: vocal fold paralysis, paresis, and atrophy. Oper. Tech. Laryngol 29–35. [Google Scholar]
  115. Sperling RA, Aisen PS, Beckett LA, Craft S, 2011. Toward defining the preclinical stages of alzheimer’s disease: Recommendations from the national institute on aging. Alzheimer’s Dement. 7, 280–292. 10.1016/j.jalz.2011.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  116. Stacey D, Ciobanu LG, Baune BT, 2017. A systematic review on the association between inflammatory genes and cognitive decline in non-demented elderly individuals. Eur. Neuropsychopharmacol 27, 568–588. 10.1016/j.euroneuro.2015.12.017 [DOI] [PubMed] [Google Scholar]
  117. Stewart R, Xue QL, Masaki K, Petrovitch H, Ross GW, White LR, Launer LJ, 2009. Change in blood pressure and incident dementia: A 32-year prospective study. Hypertension 54, 233–240. 10.1161/HYPERTENSIONAHA.109.128744 [DOI] [PMC free article] [PubMed] [Google Scholar]
  118. Strickland M, Yacoubi-Loueslati B, Bouhaouala-Zahar B, Pender SLF, Larbi A, 2019. Relationships between ion channels, mitochondrial functions and inflammation in human aging. Front. Physiol 10, 1–22. 10.3389/fphys.2019.00158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  119. Testa JA, Beatty WW, Gleason C, Orbelo DM, Ross ED, 2001. Impaired affective prosody in AD. Neurology 57, 1474–1481. [DOI] [PubMed] [Google Scholar]
  120. Themistocleous C, Eckerström M, Kokkinakis D, 2020. Voice quality and speech fluency distinguish individuals with Mild Cognitive Impairment from Healthy Controls. PLoS One 15, 1–13. 10.1371/journal.pone.0236009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  121. Thomas JA, Burkhardt HA, Chaudhry S, Ngo AD, Sharma S, Zhang L, Au R, Ghomi RH, 2020. Assessing the utility of language and voice biomarkers to predict cognitive impairment in the Framingham Heart Study Cognitive Aging cohort data. J. Alzheimer’s Dis 76, 905–922. 10.3233/JAD-190783 [DOI] [PubMed] [Google Scholar]
  122. Toth L, Hoffmann I, Gosztolya G, Vincze V, Szatloczki G, Banreti Z, Pakaski M, Kalman J, 2017. A speech recognition-based solution for the automatic detection of mild cognitive impairment from spontaneous speech. Curr. Alzheimer Res 14, 130–138. 10.2174/1567205014666171121114930 [DOI] [PMC free article] [PubMed] [Google Scholar]
  123. Tremblay P, Deschamps I, 2016. Structural brain aging and speech production: a surface-based brain morphometry study. Brain Struct. Funct 221, 3275–3299. 10.1007/s00429-015-1100-1 [DOI] [PubMed] [Google Scholar]
  124. Tsenkova VK, Karlamangla A, 2016. Depression amplifies the influence of central obesity on 10-year incidence of diabetes: Findings from MIDUS. PLoS One 11, 1–9. 10.1371/journal.pone.0164802 [DOI] [PMC free article] [PubMed] [Google Scholar]
  125. Tuladhar AM, Van Uden IWM, Rutten-Jacobs LCA, Lawrence A, Van Der Holst H, Van Norden A, De Laat K, Van Dijk E, Claassen JAHR, Kessels RPC, Markus HS, Norris DG, De Leeuw FE, 2016. Structural network efficiency predicts conversion to dementia. Neurology 86, 1112–1119. 10.1212/WNL.0000000000002502 [DOI] [PMC free article] [PubMed] [Google Scholar]
  126. Tun PA, Lachman ME, 2006. Telephone assessment of cognitive function in adulthood: the Brief Test of Adult Cognition by Telephone. Age Ageing 35, 627–629. 10.1093/ageing/afl099 [DOI] [PubMed] [Google Scholar]
  127. Twamley EW, Legendre Ropacki SA, Bondi MW, 2006. Neuropsychological and neuroimaging changes in preclinical Alzheimer’s disease. J. Int. Neuropsychol 12, 707–735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  128. Van Himbergen TM, Beiser AS, Ai M, Seshadri S, Otokozawa S, Au R, Thongtang N, Wolf PA, Schaefer EJ, 2012. Biomarkers for insulin resistance and inflammation and the risk for all-cause dementia and Alzheimer disease: results from the Framingham Heart Study. Arch. Neurol 69, 594–600. 10.1001/archneurol.2011.670 [DOI] [PMC free article] [PubMed] [Google Scholar]
  129. Van Velzen MH, Nanetti L, De Deyn PP, 2014. Data modelling in corpus linguistics: How low may we go? Cortex 55, 192–201. 10.1016/j.cortex.2013.10.010 [DOI] [PubMed] [Google Scholar]
  130. Wang Y, Chan GLY, Holden JE, Dobko T, Mak E, Schulzer M, Huser JM, Snow BJ, Ruth TJ, Calne DB, Stoessl AJ, 1998. Age-dependent decline of dopamine D1 receptors in human brain: A PET study. Synapse 30, 56–61. [DOI] [PubMed] [Google Scholar]
  131. Wilcox KC, Lacor PN, Pitt J, Klein WL, 2011. Aβ oligomer-induced synapse degeneration in Alzheimer’s disease. Cell. Mol. Neurobiol 31, 939–948. 10.1007/s10571-011-9691-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  132. Wolf OT, Convit A, Thorn E, De Leon MJ, 2002. Salivary cortisol day profiles in elderly with mild cognitive impairment. Psychoneuroendocrinology 27, 777–789. 10.1016/S0306-4530(01)00079-8 [DOI] [PubMed] [Google Scholar]
  133. Xue C, Karjadi C, Paschalidis IC, Au R, Kolachalama VB, 2021. Detection of dementia on raw voice recordings using deep learning: A Framingham Heart Study. SSRN Electron. J 1–15. 10.2139/ssrn.3788945 [DOI] [PMC free article] [PubMed] [Google Scholar]
  134. Xue SA, Deliyski D, 2001. Effects of aging on selected acoustic voice parameters: Preliminary normative data and educational implications. Educ. Gerontol 27, 159–168. 10.1080/03601270151075561 [DOI] [Google Scholar]
  135. Zarzur AP, Duprat AC, Shinzato G, Eckley CA, 2007. laryngeal electromyography in adults With Parkinson’s disease and voice complaints. Laryngoscope 117, 831–834. 10.1097/MLG.0b013e3180333145 [DOI] [PubMed] [Google Scholar]
  136. Zhang L, Ngo A, Thomas JA, Burkhardt HA, Parsey CM, Au R, Hosseini Ghomi R, 2021. Neuropsychological test validation of speech markers of cognitive impairment in the Framingham Cognitive Aging Cohort. Explor. Med 2, 232–252. 10.37349/emed.2021.00044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  137. Zou K, Abdullah M, Michikawa M, 2020. Current biomarkers for alzheimer’s disease: From csf to blood. J. Pers. Med 10, 1–15. 10.3390/jpm10030085 [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES