Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2009 Oct 1.
Published in final edited form as: Int Psychogeriatr. 2009 Mar 17;21(5):869–878. doi: 10.1017/S104161020900876X

Minimizing attrition bias: a longitudinal study of depressive symptoms in an elderly cohort

Chung-Chou H Chang 1,2, Hsiao-Ching Yang 2, Gong Tang 2, Mary Ganguli 3,4,5
PMCID: PMC2733930  NIHMSID: NIHMS115565  PMID: 19288971

Abstract

Background

Attrition from mortality is common in longitudinal studies of the elderly. Ignoring the resulting non-response or missing data can bias study results.

Methods

1260 elderly participants underwent biennial follow-up assessments over 10 years. Many missed one or more assessments over this period. We compared three statistical models to evaluate the impact of missing data on an analysis of depressive symptoms over time. The first analytic model (generalized mixed model) treated non-response as data missing at random. The other two models used shared parameter methods; each had different specifications for dropout but both jointly modeled both outcome and dropout through a common random effect.

Results

The presence of depressive symptoms was associated with being female, having less education, functional impairment, using more prescription drugs, and taking antidepressant drugs. In all three models, the same variables were significantly associated with depression and in the same direction. However, the strength of the associations differed widely between the generalized mixed model and the shared parameter models. Although the two shared parameter models had different assumptions about the dropout process, they yielded similar estimates for the outcome. One model fitted the data better, and the other was computationally faster.

Conclusions

Dropout does not occur randomly in longitudinal studies of the elderly. Thus, simply ignoring it can yield biased results. Shared parameter models are a powerful, flexible, and easily implemented tool for analyzing longitudinal data while minimizing bias due to nonrandom attrition.

Keywords: discrete failure time model, dropout, non-ignorable nonresponse, shared parameter model, Weibull model

Introduction

In longitudinal studies, it is almost inevitable for some follow-up data to be missing. This is particularly true in studies of older adults, among whom attrition is often related to mortality, which increases dramatically with age. Other reasons for attrition can include participants’ reluctance to continue in the study, illness, or relocation from the study area. Different factors can be associated with different causes of attrition (van Beijsterveldt et al., 2002; Matthews et al., 2004; 2006). When elderly individuals develop depressive symptoms, they may become particularly unmotivated to participate in research assessments. They also have altered health indices, functional ability, health services utilization, and elevated mortality rates (Ganguli et al., 2002). Individuals may in fact have symptoms shortly before they die (Katona and Shankar, 2004) or drop out of studies (Casey et al., 2008; Ong et al., 2008). The course and relationships of depressive symptoms in these individuals may therefore differ substantially from those observed in individuals who complete the study.

Our objective in the present study was to examine the impact of attrition on the analysis of factors associated with depressive symptoms over time, and to compare different statistical approaches to handling attrition effects in population-based data. In this study, we examined factors associated with depressive symptoms over multiple years of follow-up, using three models: a generalized mixed model and two shared parameter models which simultaneously modeled depression and dropout. Specifically, we used three well-established analytical methods for longitudinal data to compare the estimated effect sizes for different covariates of depressive symptoms, and subsequently determine whether the results would change in magnitude or interpretation. The three models are based on different statistical assumptions as to whether or not the dropout data were missing at random (Rubin, 1976; Little and Rubin, 2002).

Methods

Study setting, design and participants

The Monongahela Valley Independent Elders Survey (MoVIES) was conducted from 1987 to 2002 in a largely rural American region with a population that was economically depressed and had relatively low levels of education. Inclusion criteria for study participation included being at least 65 years old and community-dwelling at the time of recruitment, being fluent in English, and having a sixth-grade education or higher (Ganguli et al., 1993). The initial study cohort consisted of 1681 participants assessed at study entry (wave 1) (Ganguli et al., 1998) and reassessed in a series of approximately biennial data collection waves. Between waves 1 and 2, a total of 340 participants died, relocated or dropped out of the study, leaving 1341 participants to be assessed at wave 2.

Data on depressive symptoms were collected for the first time in wave 2 (1989–1991), which we therefore designated as the data baseline for the current analyses. The modified Center for Epidemiological Studies–Depression Scale (mCES-D Scale), in which scores range from 0 to 20, was used to represent the number of self-reported depressive symptoms experienced over most of the preceding week. The score was dichotomized as 0–4 symptoms vs. 5 or more symptoms, with this threshold representing the 90th percentile of the cohort (Ganguli et al., 1995). The scale was administered at waves 2 through 6. Here, an mCES-D score <5 indicates the absence of substantial depressive symptoms (“not depressed” for the purpose of this article) while a score ≥ 5 indicates the presence of substantial depressive symptoms (“depressed”).

To ensure validity of responses to the mCES-D depression questions and to avoid confounding by dementia, we excluded from these analyses 48 participants who were classified as having dementia at baseline, and 12 participants who did not complete the Mini-mental State Examination (MMSE; Folstein et al., 1975), a screening measure of global cognitive functioning. We further excluded 21 participants whose mCES-D depression data were incomplete.

For the current analyses, we used data from the remaining 1260 participants, whose demographic and clinical characteristics have been described previously (Andreescu et al., 2008). Of these 1260 participants, 48 (3.81%) completed waves 2 and 6 but skipped one or more waves in between. For these participants, we imputed the missing mCES-D score by using the average of the scores derived from the waves immediately before and after the missing wave.

Statistical methods

Descriptive statistics

The overall cohort, and the subgroup that dropped out over the course of the study, were characterized using means and proportions with regard to demographic characteristics (age, sex, and education); mCES-D depression symptom scores (dichotomized as <5 vs. ≥ 5); number of prescription drugs being used (dichotomized as <4 vs. ≥ 4), as a measure of overall morbidity; whether antidepressant drugs were being used; and functional impairment with regard to instrumental activities of daily living (IADLs). We used the χ2 or Fisher’s exact test to compare the percentage with presence of substantial depressive symptoms between people who dropped out before or at wave 6 and people who completed the study, and also to compare the percentage with presence of substantial depressive symptoms at baseline between subgroups with different characteristics (e.g. female vs. male).

Overview of missing data mechanisms

Rubin (1976) and Little and Rubin (2002) classified the missing data process into three mechanisms: missing completely at random (MCAR), missing at random (MAR), and non-ignorable missing (NIM). In the study of depressive symptoms over time, missing data will be MCAR if the probability of attrition does not depend on the presence or severity of depression (i.e. the number of depressive symptoms). It will be MAR if the probability of attrition depends on the severity of depression observed at some time point(s) before dropout. Finally, it will be NIM if the probability of attrition depends on the severity of depression at the time of dropout, but the severity is unobserved, i.e. was not measured at the time of dropout. Preliminary analysis suggested that attrition here depended on the magnitude of depressive symptoms (Table 1). Therefore methods under the MCAR assumption were ruled out.

Table 1.

Baseline characteristics for individuals who completed all follow up (n = 591) and those who did not complete all follow up (n = 669)

BASELINE CHARACTERISTIC COMPLETE FOLLOW-UP GROUP INCOMPLETE FOLLOW-UP GROUP P VALUE*
Depressive symptoms ≥ 5 46 (7.8%) 81 (12.1%) 0.01
Age (in years) 65–74 446 (75.5%) 303 (51.3%) <0.001
75–84 140 (23.7%) 313 (53.0%)
≥ 85 5 (0.8%) 55 (9.3%)
Female 396 (67.0%) 370 (55.3%) <0.001
At least high school completion 396 (67.0%) 374 (55.9%) <0.001
Use of ≥ 4 prescription drugs 70 (11.8%) 204 (30.5%) <0.001
Use of antidepressant drugs 11 (1.9%) 25 (3.7%) 0.06
Any IADL impairment 169 (29.3%) 277 (46.8%) <0.001
*

Based on a χ2 or Fisher’s exact test.

Models for longitudinal out come with missing data

Traditionally, the models for analyzing longitudinal data include the multivariate analysis of variance (MANOVA), the model with generalized estimating equation (GEE), and the mixed effect model. These models operate under the assumption of MCAR (MANOVA and GEE) or MAR (weighted version of GEE, GEE with imputations, and mixed effect model). Several methods have been developed specifically for NIM data under the framework of selection models or pattern-mixture models (Little, 1995; Roy, 2003). Shared parameter models are a popular class of selection models that use common unobserved random effects to predict the courses of outcome and dropout over time.

Modeling approaches

We fit three separate models to evaluate and compare the impact of missingness/nonresponse on parameter estimates. The first was the generalized linear mixed model that assumes dropout as MAR. The other two methods were shared parameter models that jointly modeled the outcome (depressive symptoms at each assessment wave) and the likelihood of attrition, through a common unobserved random effect component in order to account for the NIM (Wu and Carroll, 1988; Wulfsohn and Tsiatis, 1997; Lin et al., 2002; Roy, 2003; Tsiatis and Davidian, 2004; Beunckens et al., 2005; Vonesh et al., 2006). Shared parameter models have two key advantages: they provide a flexible framework for handling intermittently missing patterns, and they can be applied when study participants do not all follow the same schedule of follow-up interviews or examinations.

In the generalized mixed model, depressive symptom scores were fit by a logistic regression model with random intercepts representing the subject-specific depression status at baseline. In the shared parameter models, the time to dropout was modeled by a regression model in which a random effect from the generalized linear mixed model acted as a predictor that represents the association between the repeatedly measured mCES-D scores and the dropout process. We fit two shared parameter models, a Weibull accelerated failure time (AFT) model and a discrete failure time model, both of which were conditional on the subject-specific random effect, which served as a subject-specific covariate. We compared the results from the shared parameter models (which jointly modeled both depression and dropout as outcomes) with the results from a generalized logistic random-effects model (which only modeled depression as outcome). Note that dropout was identified consistently at each follow-up wave (waves 3–6) and not in between waves; thus it was a discrete rather than a continuous type of event. However, several researchers still use continuous-type survival models to fit discrete-type survival data. Compared with the previous literature, we used both continuous-type and discrete-type survival regressions to model the dropout process, and compared the results as a sensitivity analysis. We selected the Weibull AFT model among various continuous-type survival models since our data met the assumption of monotonic hazards for the Weibull model (the hazard function showed increasing trend).

The generalized mixed model and the longitudinal part (depressive symptoms at multiple waves) of the shared parameter models included the following baseline (wave 2) covariates: gender, age (65–74 years, 75–84 years, and ≥ 85 years), education level (less than high school vs. at least high school completion), total number of prescription drugs used (<4 or ≥ 4), use of antidepressant drugs (yes or no), and functional impairment (any or no IADL impairment). To measure time to dropout, we defined the failure date as the date of dropout and the censoring date as the study end date (wave 6). If the actual date of dropout was not available, we approximated the date as 2.4 years after the last interview date, this being the average interval between two consecutive interview dates, in this cohort.

To determine how well the models fit the data, we first checked whether a NIM model performs better than a MAR model by testing the variance of the random effects component in the shared parameter model being significantly different from zero. We further used the Akaike information criteria (AIC) to compare the goodness of fit between the two shared parameter models. A smaller AIC value indicates a better fit (Pawitan, 2004).

One of the complications of using a generalized linear mixed model is the computational efficiency (Jiang, 1998), specifically the time (and therefore the cost) of running the models. A shorter central processing unit (CPU) running time indicates a computationally more efficient model.

The notation and shared parameter models, as well as the corresponding estimation procedures, are provided in Appendix S1 (available as supplementary material attached to the electronic version of this paper at www.journals.cambridge.org/jid_IPG).

Results

At baseline, the 1260 participants eligible for these analyses had a mean (SD) age of 74.6 (5.3) years. Women comprised 60.79% (n = 766) of the cohort, and 61.11% (n = 770) had at least high school education. Of the 1260 participants, a total of 669 (53.10%) dropped out before or at wave 6, with 171 (13.57%) of the dropouts occurring between waves 2 and 3, 154 (12.22%) between waves 3 and 4, 164 (13.02%) between waves 4 and 5, and180 (14.29%) between waves 5 and 6. The reasons for dropouts are shown in Figure 1. We found that 81 of 669 participants (12.11%) who dropped out before or at wave 6, compared to 46 of 591 participants (7.78%) who completed wave 6, had substantial depressive symptoms (mCES-D ≥ 5). This difference was significant (χ2 with 1 degree of freedom = 6.47, P = 0.01). As Figure 2 shows, there was a rise in the percentage of participants who had substantial depressive symptoms before their dropouts occurred, and depression was least common in the participants who completed the most waves.

Figure 1.

Figure 1

Flow diagram of reasons for attrition between data collection waves.

Figure 2.

Figure 2

Proportion of cohort with at least 5 depressive symptoms among subgroups with different lengths of follow up.

For interested readers, the unadjusted relationships between presence of depressive symptoms and each covariate at each wave are provided in Appendix S2 (available as supplementary material attached to the electronic version of this paper at www.journals.cambridge.org/jid_IPG). Regardless of the number of completed waves, depression at baseline was significantly more frequent in women than in men (χ2 with 1 degree of freedom = 11.63, P = 0.001); in participants aged 75–84 years than in those aged 65–74 years (χ2 with 1 degree of freedom = 4.37, P = 0.001); in participants who did not complete high school than in those who did (χ2 with 1 degree of freedom = 17.21, P <0.001); in participants who used <4 prescription drugs than in those who used ≥ 4 drugs (χ2 with 1 degree of freedom = 30.59, P <0.001); in participants who did use antidepressant drugs than in those who did not (Fisher’s exact test P <0.001); and in participants who had any functional impairment in IADL than in those who had none (χ2 with 1 degree of freedom = 24.83, P <0.001). Note that none of the nonsignificant findings were due to insufficient power (<80%). The percentages with ≥ 5 depressive symptoms by each level of each factor can be found in Appendix S2 online.

The multivariable relationship between factors and depression is summarized in Table 2, which includes results from all three models (the generalized mixed model, the shared parameter model with Weibull dropout, and the shared parameter model with discrete failure dropout). The probability of having substantial (five or more) depressive symptoms was not affected by age, but it was affected by gender, education, number of regularly used prescription drugs, and functional limitations in IADL. In all three models, the direction of significance was the same: factors that increased the probability of depressive symptoms were being female, having less than a high school education, using four or more prescription drugs, using antidepressant drugs, and IADL impairment. However, for each of the factors, the strength of the association, as represented by the odds ratios, were similar in the two shared parameter models and differed from those in the generalized mixed model. For example, in the shared parameter models, the odds of depression for people using antidepressant drugs was 16 times greater than that for people not using these drugs. However, in the generalized mixed model, the odds were only about three times greater. Being male and using antidepressant drugs were positively associated with time to dropout in the Weibull model but not in the discrete time model.

Table 2.

Comparison of three modelsa in terms of their covariate effects on the presence of concurrent depression (having five or more depressive symptoms)

MIXED MODEL
WEIBULL MODEL
DISCRETE TIME MODEL
COVARIATES ESTIMATED OR (95% CI) P VALUE ESTIMATED OR (95% CI) P VALUE ESTIMATED OR (95% CI) P VALUE
Age 75–84 years 1.13 (0.80, 1.60) 0.49 1.04 (0.51, 2.10) 0.92 1.04 (0.51, 2.10) 0.92
Age ≥85 years 0.88 (0.40, 1.97) 0.76 0.63 (0.13, 3.05) 0.57 0.63 (0.13, 3.08) 0.56
Female gender 1.80 (1.27, 2.56) 0.001 2.05 (1.03, 4.08) 0.04 2.05 (1.03, 4.09) 0.04
At least high school education 0.43 (0.31, 0.59) <0.001 0.33 (0.17, 0.63) <0.001 0.33 (0.17, 0.63) 0.001
Use of ≥4 prescription drugs 2.13 (1.48, 3.08) <0.001 3.51 (1.65, 7.45) 0.001 3.50 (1.64, 7.44) 0.001
Use of an antidepressant drug 2.93 (1.38, 6.18) 0.005 16.03 (2.36, 108.80) 0.005 16.15 (2.35, 110.87) 0.005
Functional impairment 2.75 (1.96, 3.87) <0.001 4.73 (2.41, 9.31) <0.001 4.72 (2.40, 9.31) <0.001
Time = wave 3 0.74 (0.54, 1.02) 0.06 0.61 (0.39, 0.95) 0.03 0.61 (0.39, 0.95) 0.03
Time = wave 4 0.69 (0.49, 0.98) 0.04 0.58 (0.36, 0.93) 0.02 0.57 (0.36, 0.93) 0.02
Time = wave 5 0.61 (0.41, 0.90) 0.01 0.51 (0.30, 0.87) 0.01 0.51 (0.30, 0.86) 0.01
Time = wave 6 0.75 (0.50, 1.14) 0.18 0.73 (0.42, 1.29) 0.28 0.73 (0.42, 1.29) 0.28
Variance of the shared random effect N/A N/A 18.70 (7.46, 29.94) 0.001 18.94 (7.56, 30.33) 0.001
CPU (min:sec) 1.03 15:19.10 31:03.37
AIC N/A 3599.7 3453.9
a

All models are multivariable models with intercept (estimate not shown).

Effect of wave at which data were collected (with wave 2 as the reference group).

OR = odds ratio; CI = confidence interval; CPU = central processing unit (shorter is computationally faster); AIC = Akaike information criteria (smaller indicates a better fit); N/A = not applicable.

In both shared parameter models, the estimated variances were significantly different from zero (estimated variance = 18.70 and P = 0.001 for the Weibull shared model; estimated variance = 18.94 and P = 0.001 for the discrete time model). Therefore, a generalized linear mixed model did not sufficiently account for the total variability. This suggested that an NIM model was a better fit. Comparing the two shared parameter models, the Akaike information criteria (AIC) showed that the discrete time dropout model had a better fit than the Weibull model (AIC of 3453.9 vs. 3599.7). This was as expected because our dropout process was actually discrete type time-to-event data.

To estimate computational efficiency, we measured the CPU times of the two models. The discrete time dropout model used approximately 31 minutes of CPU time compared to 15 minutes of CPU time use for the Weibull model, running on a server with 8 Xeon processors of 2.66 GHz with 32 GB of RAM and a 4-disk striped RAID array.

Discussion

We found that the presence of substantial depressive symptoms in older adults was associated over time with female gender, less education, overall morbidity as reflected in number of prescription medications, use of antidepressant drugs, and functional limitations. These findings are consistent with the literature on late-life depression (Katona and Shankar, 2004). Our study also showed that the presence of depression is significantly associated with attrition from the study, confirming previous findings that mortality is often preceded by depression or drop in mood (Jorm et al., 1991; Katona and Shankar, 2004; Andreescu et al., 2008). This drop parallels observations regarding a decrease in cognitive function before death, the phenomenon referred to as “terminal decline” (Wilson et al., 2007; Lavery et al., 2008). These findings raise the question of the potential impact of attrition on studies of factors that are themselves related to attrition.

Some degree of attrition is virtually unavoidable in longitudinal studies and must be handled appropriately. Our approach was to compare the results of a model that was unadjusted for non-ignorable attrition (the mixed model) with the results of two models that were adjusted for attrition (the shared parameter models). We found that the direction of the covariate effects was the same in all three models but that the strength of the association was not the same. Information available on the dropouts in this study suggests that attrition was non-ignorable and that the study participants showed an increase in depressive symptoms before they dropped out. Therefore, it appears reasonable to conclude that a shared parameter model provides a less distorted picture of depression symptoms over time than does a mixed model that assumes that attrition occurred randomly. In studies of other populations, or studies examining other relationships, both the direction and the strength of the associations may vary when different models are used to account for attrition. Replications in other datasets should determine under which circumstances the estimation results or the inference results – or both – will vary. Further investigations are also needed to determine under what conditions the bias caused by non-ignorable dropout can and cannot be eliminated after adjusting for it in the shared parameter model framework. These are important but frequently ignored aspects of the analysis of longitudinal data with substantial attrition.

Although most investigators make strenuous efforts to minimize attrition in longitudinal studies, many often choose to ignore attrition mechanisms while carrying out data analysis. For example, some investigators impose their analysis on participants who completed the study, which makes the de facto assumption that attrition took place randomly, or the type of attrition can be ignored. As demonstrated in several simulation studies, any given reason for attrition is associated with multiple factors that could themselves be under investigation in the same study. Therefore, mishandling missing data at the analysis stage could cause the estimated effect sizes to be biased and lead to a distorted understanding of the outcome over time. Touloumi et al. (2001) compared the impact on estimation bias and efficiency (the variability of the estimator) among different models for analysis of longitudinal data with various attrition mechanisms. Kristman et al. (2005) showed that when data are NIM, imputation and weighting methods may not reduce bias due to attrition efficiently. Eerola et al. (2005) found poor predictive ability when the attrition mechanism is ignored. Funatogawa et al. (2008) showed that linear mixed effect models could give biased estimators when data were simulated under an NIM mechanism. Therefore it is important to choose a model that accounts for the missing data mechanism appropriately.

In order to choose the correct models, the first step is to determine the correct attrition mechanism for the given data. To distinguish between MCAR and NIM, researchers usually compare the outcome measures between subjects who do and do not complete the study. Where possible, the judgment should also be made on the basis of prior knowledge and experience as to whether participants’ propensity to complete the study is likely to be highly correlated with the outcome. There is no systematic method to distinguish between MAR and NIM. Researchers tend to make this judgment based on experience but can also use the Index of Local Sensitivity to Non-Ignorable dropout (ISNI) method as an analysis of sensitivity (Troxell et al., 2004). If the data justify the assumption of MCAR, MANOVA models or models with GEE are typically used. When data are MAR, GEE with weighting, GEE with imputations, or mixed models are the common choices. Selection models and pattern-mixture models are specifically designed for data with NIM. Recently, the joint modeling of longitudinal outcomes and attrition process has gained popularity because of the ease of building the models into standard statistical packages and natural interpretation. van Beijsterveldt et al. (2002) has indicated that even though the effect of attrition on the evolution of the outcome could be small, careful selection of analytical methods is helpful in correcting the estimation bias and allowing a better interpretation of results.

Matthews et al. (2004) recommends that all longitudinal studies should investigate attrition, which may help with aspects of design and with the testing of specific hypotheses. In our study, factors associated with overall attrition were age, sex, education, depressive symptoms, IADL impairment, use of more prescription drugs, and use of antidepressant drugs, most of which were also associated with the presence of depressive symptoms. Previous studies have identified different factors associated with different reasons for attrition. Those who are older, male, smokers, have poor cognitive ability, or have impaired ADL have higher attrition rates due to death (van Beijsterveldt et al., 2002; Matthews et al., 2004). Participants who are women, have lower educational levels, or have lower baseline scores on neurocognitive tests have higher rates of dropout due to refusal to continue (van Beijsterveldt et al., 2002; Matthews et al., 2004; 2006). Those who are single, smokers, demented or depressed have higher attrition rates due to relocation or becoming uncontactable (Matthews et al., 2004).

The strengths of the MoVIES dataset are the study cohorts that are representative of the community, with large sample size and relatively long period of follow-up with repeated measures. Potential limitations include the fact that assessments were carried out biennially rather than annually, and that attrition increased with time; however, the latter allowed us to carry out the analyses reported here. We measured self-reported depressive symptoms and did not diagnose depressive illness clinically. The strengths of the shared parameter models are that they are a powerful, flexible and easily implemented tool for analyzing longitudinal data while minimizing attrition bias. We used parametric models to fit both the longitudinal and dropout parts of the shared parameter models. Therefore, it was not difficult to incorporate them into standard software packages. We used the NLMIXED procedure in SAS, and have provided our codes in Appendix S1 (available in the online version of this paper at www.journals.cambridge.org/jid.ipg). Depending on the equipment used, the sample size and the likelihood function derived from the data, the calculation of results could be extremely computationally demanding. Therefore, if multiple shared parameter models are used to fit the data, the performance of the models should be judged both by how well a given model fits the data (goodness-of-fit statistics) and by its computational efficiency.

Attrition is often related to the outcomes of interest in longitudinal studies. Given the potential for attrition bias, investigators should make every effort to maximize cohort retention and minimize attrition in longitudinal studies. Since some attrition is inevitable, efforts should be made to minimize the resulting bias. Any information that can be obtained about the reasons for dropouts will help to determine the extent to which attrition can be ignored. At the level of data analysis, appropriate statistical methods should be employed to minimize bias. Intervention trials might also find the new approaches more appropriate to handling dropouts than merely carrying the last observation forward (Gomeni et al., 2009). The shared parameter models presented here are valuable to other researchers engaged in longitudinal studies, particularly studies of older adults.

Supplementary Material

Arrays

Acknowledgments

The work reported here was supported in part by research grant R01 AG07562 and career development grants K24 AG022035 and K25 DK059928 from the National Institutes of Health, U.S. Department of Health and Human Services. The authors thank the editor and two referees for providing insightful comments and suggestions.

Footnotes

Online supplementary materials

The SAS codes used to generate Table 2 and the unadjusted relationships between the presence of depressive symptoms and each covariate at each wave are available as online supplementary material (Appendix S1 and Appendix S2, respectively) at www.journals.cambridge.org/jid_IPG.

Conflict of interest

None.

Description of authors’ roles

Dr. Chang had full access to the data in the study, confirmed the accuracy of the data analysis, drafted the manuscript, and provided critical revision of the manuscript for important intellectual content. Ms. Yang performed the statistical analyses and drafted the manuscript. Dr. Tang drafted the manuscript, confirmed the accuracy of the methods used in the study, and provided intellectual contributions to the missing data background reviews and interpretations of the data. Dr. Ganguli had full access to the data in the study and takes responsibility for the integrity of the data. In addition, she drafted the manuscript and provided critical revision of the manuscript for important intellectual content, and was the Principal Investigator of research grant R01 AG07562 that funded the MoVIES study.

References

  1. Andreescu C, Chang CH, Mulsant B, Ganguli M. Twelve-year depressive symptom trajectories and their predictors in a community sample of older adults. International Psychogeriatrics. 2008;20:221–236. doi: 10.1017/S1041610207006667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Beunckens C, Molenberghs G, Kenward MG. Direct likelihood analysis versus simple forms of imputation for missing data in randomized clinical trials. Clinical Trials. 2005;2:379–386. doi: 10.1191/1740774505cn119oa. [DOI] [PubMed] [Google Scholar]
  3. Casey E, Hughes JW, Waechter D, Josephson R, Rosneck J. Depression predicts failure to complete phase-II cardiac rehabilitation. Journal of Behavioral Medicine. 2008;31:421–431. doi: 10.1007/s10865-008-9168-1. [DOI] [PubMed] [Google Scholar]
  4. Eerola M, Huurre T, Aro H. The problem of attrition in a Finnish longitudinal survey on depression. European Journal of Epidemiology. 2005;20:113–120. doi: 10.1007/s10654-004-1657-0. [DOI] [PubMed] [Google Scholar]
  5. Folstein MF, Folstein SE, McHugh PR. “Mini-mental state”: a practical method for grading the cognitive state of patients for the clinician. Journal of Psychiatric Research. 1975;12:189–198. doi: 10.1016/0022-3956(75)90026-6. [DOI] [PubMed] [Google Scholar]
  6. Funatogawa T, Funatogawa I, Takeuchi M. An autoregressive linear mixed effects model for the analysis of longitudinal data which include dropouts and show profiles approaching asymptotes. Statistics in Medicine. 2008;27:6351–6366. doi: 10.1002/sim.3417. [DOI] [PubMed] [Google Scholar]
  7. Ganguli M, et al. Sensitivity and specificity for dementia of population-based criteria for cognitive impairment: the MoVIES project. Journals of Gerontology Series A: Biological Sciences and Medical Sciences. 1993;48:M152–M161. doi: 10.1093/geronj/48.4.m152. [DOI] [PubMed] [Google Scholar]
  8. Ganguli MJ, Gilby J, Seaberg E, Belle S. Depressive symptoms and associated factors in a rural elderly population: the MoVIES project. American Journal of Geriatric Psychiatry. 1995;3:144–160. doi: 10.1097/00019442-199500320-00006. [DOI] [PubMed] [Google Scholar]
  9. Ganguli M, Lytle ME, Reynolds MD, Dodge HH. Random versus volunteer selection for a community-based study. Journal of Gerontology Series A: Biological Sciences and Medical Sciences. 1998;53A:M39–M46. doi: 10.1093/gerona/53a.1.m39. [DOI] [PubMed] [Google Scholar]
  10. Ganguli M, Dodge HH, Mulsant BH. Rates and predictors of mortality in an aging, rural, community-based cohort: the role of depression. Archives of General Psychiatry. 2002;59:1046–1052. doi: 10.1001/archpsyc.59.11.1046. [DOI] [PubMed] [Google Scholar]
  11. Gomeni R, Lavergne A, Merlo-Pich E. Modelling placebo response in depression trials using a longitudinal model with informative dropout. European Journal of Pharmaceutical Sciences. 2009;36:4–10. doi: 10.1016/j.ejps.2008.10.025. [DOI] [PubMed] [Google Scholar]
  12. Jiang J. Consistent estimators in generalized linear mixed models. Journal of the American Statistical Association. 1998;93:720–729. [Google Scholar]
  13. Jorm AF, Henderson AS, Kay DWK, Jacomb PA. Mortality in relation to dementia, depression and social integration in an elderly community sample. International Journal of Geriatric Psychiatry. 1991;6:5–11. [Google Scholar]
  14. Katona CLE, Shankar KK. Depression in old age. Reviews in Clinical Gerontology. 2004;14:283–306. [Google Scholar]
  15. Kristman VL, Manno M, Côté P. Methods to account for attrition in longitudinal data: do they work? A simulation study. European Journal of Epidemiology. 2005;20:657–662. doi: 10.1007/s10654-005-7919-7. [DOI] [PubMed] [Google Scholar]
  16. Lavery LL, Dodge HH, Snitz B, Ganguli M. Cognitive decline and mortality in a community-based cohort: the MoVIES Study. Journal of the American Geriatrics Society. 2008 doi: 10.1111/j.1532&#x02013;5415.2008.02052. E-published ahead of print. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Lin H, McCulloch CE, Mayne ST. Maximum likelihood estimation in the joint analysis of time-to-event and multiple longitudinal variables. Statistics in Medicine. 2002;21:2369–2382. doi: 10.1002/sim.1179. [DOI] [PubMed] [Google Scholar]
  18. Little RJA. Modeling the drop-out mechanism in longitudinal studies. Journal of the American Statistical Association. 1995;90:1112–1121. [Google Scholar]
  19. Little RJA, Rubin DB. Statistical Analysis with Missing Data. 2. New York: Wiley; 2002. [Google Scholar]
  20. Matthews FE, Chatfield M, Freeman C, McCracken C, Brayne C MRC-CFAS. Attrition and bias in the MRC cognitive function and ageing study: an epidemiological investigation. BMC Public Health. 2004;4:12. doi: 10.1186/1471&#x02013;2458&#x02013;4&#x02013;12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Matthews FE, Chatfield M, Brayne C MRC-CFAS. An investigation of whether factors associated with short-term attrition change or persist over ten years: data from the Medical Research Council Cognitive Function and Ageing Study (MRC-CFAS) BMC Public Health. 2006;6:185. doi: 10.1186/1471&#x02013;2458&#x02013;6&#x02013;185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Ong JC, Kuo TF, Manber R. Who is at risk for dropout from group cognitive-behavior therapy for insomnia? Journal of Psychosomatic Research. 2008;64:419–425. doi: 10.1016/j.jpsychores.2007.10.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Pawitan Y. All Likelihood: Statistical Modelling and Inference Using Likelihood. Oxford: Oxford University Press; 2004. [Google Scholar]
  24. Roy J. Modeling longitudinal data with nonignorable dropouts using a latent dropout class model. Biometrics. 2003;59:829–836. doi: 10.1111/j.0006-341x.2003.00097.x. [DOI] [PubMed] [Google Scholar]
  25. Rubin DB. Inference and missing data. Biometrika. 1976;63:581–592. [Google Scholar]
  26. Touloumi G, Babiker AG, Pocock SJ, Darbyshire JH. Impact of missing data due to drop-outs on estimators for rates of change in longitudinal studies: a simulation study. Statistics in Medicine. 2001;20:3715–3728. doi: 10.1002/sim.1114. [DOI] [PubMed] [Google Scholar]
  27. Tsiatis AA, Davidian MA. Joint modeling of longitudinal and time-to-event data: an overview. Statistica Sinica. 2004;14:809–834. [Google Scholar]
  28. Troxell AB, Ma G, Heitjan1 DF. An index of local sensitivity to nonignorability. Statistica Sinica. 2004;14:1221–1237. [Google Scholar]
  29. van Beijsterveldt CEM, van Boxtel MPJ, Bosma H, Houx PJ, Buntinx F, Jolles J. Predictors of attrition in a longitudinal cognitive aging study: the Maastricht Aging Study (MAAS) Journal of Clinical Epidemiology. 2002;55:216–223. doi: 10.1016/s0895-4356(01)00473-5. [DOI] [PubMed] [Google Scholar]
  30. Vonesh EF, Greene T, Mark D, Schluchter MD. Shared parameter models for the joint analysis of longitudinal data and event times. Statistics in Medicine. 2006;25:143–163. doi: 10.1002/sim.2249. [DOI] [PubMed] [Google Scholar]
  31. Wilson RS, Beck TK, Bienias JL, Bennett DA. Terminal cognitive decline: accelerated loss of cognition in the last years of life. Psychosomatic Medicine. 2007;69:131–137. doi: 10.1097/PSY.0b013e31803130ae. [DOI] [PubMed] [Google Scholar]
  32. Wu M, Carroll R. Estimation and comparison of changes in the presence of informative right censoring by modeling the censoring process. Biometrics. 1988;44:175–188. [PubMed] [Google Scholar]
  33. Wulfsohn MS, Tsiatis AA. A joint model for survival and longitudinal data measured with error. Biometrics. 1997;53:330–339. [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Arrays

RESOURCES