A MULTIVARIATE FINITE MIXTURE LATENT TRAJECTORY MODEL WITH APPLICATION TO DEMENTIA STUDIES

Dongbing Lai; Huiping Xu; Daniel Koller; Tatiana Foroud; Sujuan Gao

doi:10.1080/02664763.2016.1141181

. Author manuscript; available in PMC: 2017 Feb 22.

Published in final edited form as: J Appl Stat. 2016 Feb 22;43(14):2503–2523. doi: 10.1080/02664763.2016.1141181

A MULTIVARIATE FINITE MIXTURE LATENT TRAJECTORY MODEL WITH APPLICATION TO DEMENTIA STUDIES

Dongbing Lai ¹, Huiping Xu ², Daniel Koller ¹, Tatiana Foroud ¹, Sujuan Gao ^2,^*

PMCID: PMC5021196 NIHMSID: NIHMS811792 PMID: 27642206

Abstract

Dementia patients exhibit considerable heterogeneity in individual trajectories of cognitive decline, with some patients showing rapid decline following diagnoses while others exhibiting slower decline or remaining stable for several years. Dementia studies often collect longitudinal measures of multiple neuropsychological tests aimed to measure patients’ decline across a number of cognitive domains. We propose a multivariate finite mixture latent trajectory model to identify distinct longitudinal patterns of cognitive decline simultaneously in multiple cognitive domains, each of which is measured by multiple neuropsychological tests. EM algorithm is used for parameter estimation and posterior probabilities are used to predict latent class membership. We present results of a simulation study demonstrating adequate performance of our proposed approach and apply our model to the Uniform Data Set (UDS) from the National Alzheimer’s Coordinating Center (NACC) to identify cognitive decline patterns among dementia patients.

Keywords: Multivariate finite mixture latent trajectory, cognitive decline, dementia

1. Introduction

Dementia is common in the elderly population, with Alzheimer’s disease (AD) the leading cause. [1]. It is characterized by the progressive decline of cognitive function, leading to impairment in the individual’s ability to perform daily activities and the consequent loss of independence. There is considerable heterogeneity in the individual trajectories of cognitive decline among dementia patients, with some patients showing rapid decline while others exhibiting a slower decline or remaining stable [2]. The heterogeneity of the cognitive decline also varies across cognitive domains such as memory and language. Prior research has shown that typical patients with AD have primarily memory deficits often accompanied by deficits in language [3–5], whereas patients with frontotemporal dementia (FTD) had greater impairment in language and less impairment in the memory domain [6–10].

Research on the identification of distinct longitudinal trajectories of cognitive decline has focused on univariate cognitive outcomes, measured by a single neuropsychological test using group-based trajectory models (GBTM) [11–13] and growth mixture models (GMM) [14–16]. GBTM is also known as latent class growth analysis (LCGA) [15] and was proposed by Nagin and colleagues [11–13] while GMM was developed by Muthén et al [14–16]. Both models assume that subjects belong to one of several subpopulations/groups/latent classes, each characterized by a unique longitudinal trajectory. A key difference between GBTM and GMM is that GBTM assumes conditional independence, i.e. longitudinal measures across time for a given subject are independent, whereas the GMM includes a random effect allowing correlation of longitudinal outcomes for a subject [15]. Therefore, GBTM can be considered a special case of GMM [15].

Despite the successful application of the GBTM and GMM models in many research studies [12, 15], these models are not well-suited for the longitudinal analyses of dementia due to the fact that each latent construct can only be from one test. Many studies of cognitive decline in dementia patients have collected data across several cognitive domains with multiple neuropsychological tests in each domain. Tests within each domain are measures of same underline latent construct from different prospective. Proust-Lima and colleagues extended the GBTM and GMM models to multivariate longitudinal data by treating the multiple tests as reflecting a single unobserved quantity following a latent process that exhibits distinct longitudinal patterns across groups [17, 18]. Although this approach has the flexibility to handle multivariate cognitive outcomes, it allows only exploration of the longitudinal patterns of a single underlying quantity and does not make use of the full capability of the models. In dementia studies where many neuropsychological tests are used to measure different aspects of cognitive function including memory, language, and executive function, it is more realistic to assume multiple latent quantities and identify longitudinal patterns associated with these various cognitive domains.

In this paper, we extend the model proposed by Proust-Lima et al by allowing more than one latent quantity, each of which can be measured by multiple tests, and identifying subpopulations of patients who exhibit distinct longitudinal patterns in these latent quantities. Our work was directly motivated by studies of cognitive decline among dementia patients. Our proposed approach is aimed at identifying longitudinal patterns of cognitive decline defined in multiple cognitive domains. The identified sub-populations share the similar cognitive decline patterns therefore may share the same disease etiology; thus, it can help us to find better patient care and treatment. In addition, these phenotypically homogeneous sub groups can be used to improve the ability of searching dementia genes in genetic studies.

The remainder of this paper is organized as follows. In Section 2, we introduce the multivariate finite mixture latent trajectory model. We discuss parameter estimation using the EM algorithm Section 3. We present results from simulation studies in Section 4. Section 5 includes results from the application of our model to the Uniform Data Set (UDS) from the National Alzheimer’s Coordinating Center (NACC) [19]. We present a discussion and conclude the paper in section 6.

2. The Multivariate Finite Mixture Latent Trajectory Model

Assume that the population consists of G subpopulations represented by G latent classes. For individual i, i = 1, …, N, we define a G-dimentional vector ω_i denoting the latent class membership, with ω_ig = 1 if individual i belongs to class g and 0 otherwise. Suppose there are K neuropsychological tests with continuous outcomes representing cognitive function in D cognitive domains. Let $y_{i} = {(y_{i 1}^{T}, \dots, y_{i k}^{T}, \dots, y_{i K}^{T})}^{T}$ be the vector of all measurements for individual i. y_ik is then a vector of length n_ik, which denotes the number of longitudinal measurements on individual i for test k (k = 1, …, K); hence the length of y_i is $\sum_{k = 1}^{K} n_{i k}$ . Let X_1i(t) and Z_i(t) be the matrices of covariates collected for individual i. Z_i(t) can have partial or all columns of X_1i(t) but contains at least one time variable. Then a measurement model if individual i is in latent class g is:

y_{i | ω_{i g} = 1} = Λ_{i | ω_{i g} = 1} (t) + V_{i} c_{i} + ε_{i},

(1)

Where the latent trajectory is defined as:

Λ_{i | ω_{i g = 1}} (t) = X_{1 i} (t) β_{g} + Z_{i} (t) b_{i g},

(2)

The length of latent process Λ_{i|ω_{ig = 1}}(t) is also $\sum_{k = 1}^{K} n_{i k}$ . Note that tests in the same domain share the same latent process by having the same values in Λ_{i|ω_{ig = 1}}(t). β_g is the vector of class-specific fixed effects from all cognitive domains in latent class g. Its length is P × D, where P is the number of covariates. b_ig is the class-specific random effects for all domains in latent class g. Similar to β_g, b_ig has length q × D, where q is the number of random effects. We assume that b_ig has a multivariate normal distribution $N (0, W_{g}^{2} B)$ with $W_{1}^{2} = 1$ and B is the covariance matrix of the first latent class, similarly defined as in Proust et al [17]. c_i in (1) is the K-vector of test-specific random intercept. It introduces correlation among scores of the same test from the same individual. Here we assume c_i is distributed as N(0, Σ_c), where Σ_c is a diagonal matrix with $σ_{c k}^{2}$ in its diagonal. Design matrix V_i in (1) is a $\sum_{k = 1}^{K} n_{i k} \times K$ block matrix with the following structure:

V_{i} = [\begin{matrix} 1 & \dots & 0 \\ ⋮ & ⋱ & ⋮ \\ 0 & \dots & 1 \end{matrix}]

where 1 is a column vector of 1s. In the k^th column, the column vector of 1s has length n_ik. ε_i in (1) is a vector of random errors with distribution N(0, Σ_ε), where Σ_ε is a block matrix with $σ_{ε k}^{2} I_{n_{i k}}$ on the diagonal and all other entries are 0.

Accordingly, covariate matrix X_1i(t) has the following structure:

X_{1 i} (t) = [\begin{matrix} X_{1 i 1} & \dots & 0 \\ ⋮ & ⋱ & ⋮ \\ 0 & \dots & X_{1 i D} \end{matrix}]

Each X_1id has all covariates for all the tests comprising domain d, d = 1, …, D with dimension n_ik × P. Similarly the design matrix Z_i(t) has the following structure:

Z_{i} (t) = [\begin{matrix} Z_{i 1} & \dots & 0 \\ ⋮ & ⋱ & ⋮ \\ 0 & \dots & Z_{i D} \end{matrix}]

where Z_id is a matrix of time polynomials of degree q − 1 with dimension n_ik × q. For example, if n_ik = 3, for a quadratic model, each Z_id has structure as follows:

Z_{i d} = [\begin{matrix} 1 & t_{i 1} & t_{i 1}^{2} \\ 1 & t_{i 2} & t_{i 2}^{2} \\ 1 & t_{i 3} & t_{i 3}^{2} \end{matrix}]

Note that, although individuals typically have the same measurement times, this is not required for our proposed model. Based on the definition of X_1i(t) and Z_i(t), individuals can have tests measured at different time points.

We assume that b_ig, c_i and ε_i are mutually independent.

For individual i, i = 1, …, N, the probability that this individual belongs to a latent class g, g = 1, …, G, is π_ig, with $\sum_{g = 1}^{G} π_{i g} = 1$ . This can be modeled through a multinomial logistic regression as:

π_{i g} = P (ω_{i g} = 1 | X_{2 i}^{T}) = \frac{exp (X_{2 i}^{T} γ_{g})}{1 + \sum_{h = 1}^{G - 1} exp (X_{2 i}^{T} γ_{h})},

(3)

where γ_g is the vector of the class-specific regression coefficients. For identifiability purposes, γ_G are set to 0. Covariates $X_{2 i}^{T}$ used here can be the same or different from X_1i(t) in equation (2).

3. Parameter Estimation

Since the latent class memberships are unobserved and there are also multiple random effects, the expectation-maximization (EM) algorithm can be used for obtaining parameter estimates [20–23]. Let $Ψ = (β_{1}, \dots, β_{g}, \dots, β_{G}, W_{2}^{2}, \dots, W_{g}^{2}, \dots, W_{G}^{2}, B, Σ_{c}, Σ_{ε}, γ_{2}, \dots, γ_{g}, \dots, γ_{G})$ be the parameters to be estimated, and f_ig(y_i) be the density function of y_i in latent class g; then the observed-data likelihood is:

L (Ψ) = \prod_{i = 1}^{N} \sum_{g = 1}^{G} π_{i g} f_{i g} (y_{i})

(4)

f_ig(y_i) has distribution N(X_1i(t)β_g, Σ_ig), where $Σ_{i g} = Z_{i} (t) W_{g}^{2} B Z_{i} {(t)}^{T} + V_{i} Σ_{c} V_{i}^{T} + Σ_{ε}$ .

Augmenting the observed data y_i with unobserved variables (ω_i, b_i1 …, b_ig, …, b_iG, c_i), the complete-data likelihood function is:

L^{c} (Ψ) = \prod_{i = 1}^{N} \prod_{g = 1}^{G} {π_{i g} f (y_{i} | b_{i g}, c_{i}) f (b_{i g}) f (c_{i})}^{ω_{i g}}

The log-likelihood for the complete data is

log (L^{c} (Ψ)) = \sum_{i = 1}^{N} \sum_{g = 1}^{G} ω_{i g} {log (π_{i g}) + log (f (y_{i} | b_{i g}, c_{i})) + log (f (b_{i g})) + log (f (c_{i}))} = \sum_{i = 1}^{N} \sum_{g = 1}^{G} ω_{i g} log (π_{i g}) - \frac{\sum_{k = 1}^{K} n_{i k} + l + K}{2} \sum_{i = 1}^{N} \sum_{g = 1}^{G} ω_{i g} log (2 π) - \frac{1}{2} \sum_{i = 1}^{N} \sum_{g = 1}^{G} ω_{i g} log | Σ_{ε} | - \frac{1}{2} \sum_{i = 1}^{N} \sum_{g = 1}^{G} ω_{i g} {(y_{i} - X_{1 i} (t) β_{g} - Z_{i} (t) b_{i g} - V_{i} c_{i})}^{T} Σ_{ε}^{- 1} (y_{i} - X_{1 i} (t) β_{g} - Z_{i} (t) b_{i g} - V_{i} c_{i}) - \frac{1}{2} \sum_{i = 1}^{N} \sum_{g = 1}^{G} ω_{i g} log | B | - \frac{1}{2} \sum_{i = 1}^{N} \sum_{g = 1}^{G} l * ω_{i g} log (W_{g}^{2}) - \frac{1}{2} \sum_{i = 1}^{N} \sum_{g = 1}^{G} ω_{i g} b_{i g}^{T} {(W_{g}^{2} B)}^{- 1} b_{i g} - \frac{1}{2} \sum_{i = 1}^{N} \sum_{g = 1}^{G} ω_{i g} log | Σ_{c} | - \frac{1}{2} \sum_{i = 1}^{N} \sum_{g = 1}^{G} ω_{i g} c_{i}^{T} Σ_{c}^{- 1} c_{i},

(5)

where l is the dimension of square matrix B.

3.1. The EM algorithm

The EM algorithm involves taking the conditional expectation of the complete-data log-likelihood and updating the parameters by maximizing the conditional expectation. Based on (5), we can see that the E step at o^th iteration involves evaluating the following conditional expectations for each subject:

E_{Ψ^{(o)}} (ω_{i g} | y_{i}); E_{Ψ^{(o)}} (ω_{i g} b_{i g} | y_{i}); E_{Ψ^{(o)}} (ω_{i g} b_{i g} b_{i g}^{T} | y_{i});

E_{Ψ^{(o)}} (ω_{i g} c_{i} | y_{i}); E_{Ψ^{(o)}} (ω_{i g} c_{i} c_{i}^{T} | y_{i}); E_{Ψ^{(o)}} (ω_{i g} c_{i} b_{i g}^{T} | y_{i}) .

Calculation of the first conditional expectation is straightforward:

E_{Ψ^{(o)}} (ω_{i g} | y_{i}) = Pr (ω_{i g} = 1 | y_{i}) = \frac{π_{i g} f_{i g} (y_{i})}{\sum_{h = 1}^{G} π_{i h} f_{i h} (y_{i})} = τ_{i g}^{(o)},

(6)

which is the posterior probability of subject i belonging to latent class g at the current parameter estimate.

In addition,

E_{Ψ^{(o)}} (ω_{i g} b_{i g} | y_{i}) = E_{Ψ^{(o)}} (b_{i g} | y_{i}, ω_{i g} = 1) = E_{Ψ^{(o)}} (b_{i g} | y_{i}, ω_{i g} = 1) Pr (ω_{i g} = 1 | y_{i}) = τ_{i g}^{(o)} E_{Ψ^{(o)}} (b_{i g} | y_{i}, ω_{i g} = 1)

Therefore, similarly, we only need to calculate:

E_{Ψ^{(o)}} (b_{i g} | y_{i}); E_{Ψ^{(o)}} (b_{i g} b_{i g}^{T} | y_{i}); E_{Ψ^{(o)}} (c_{i} | y_{i}); E_{Ψ^{(o)}} (c_{i} c_{i}^{T} | y_{i}); E_{Ψ^{(o)}} (c_{i} b_{i g}^{T} | y_{i}) .

The joint distribution of ${(y_{i}^{T}, b_{i g}^{T}, c_{i}^{T})}^{T}$ is a multivariate normal distribution with mean:

[\begin{matrix} X_{1 i} (t) β_{g} \\ 0 \\ 0 \end{matrix}],

and variance matrix:

[\begin{matrix} Z_{i} (t) W_{g}^{2} B Z_{i} {(t)}^{T} + V_{i} Σ_{C} V_{i}^{T} + Σ_{e} & Z_{i} (t) W_{g}^{2} B & V_{i} Σ_{c} \\ {(Z_{i} (t) W_{g}^{2} B)}^{T} & W_{g}^{2} B & 0 \\ {(V_{i} Σ_{c})}^{T} & 0 & Σ_{c} \end{matrix}]

Let $Σ_{i b c g} = [Z_{i} (t) W_{g}^{2} B V_{i} Σ_{c}]$ and $Σ_{b c g} = [\begin{matrix} W_{g}^{2} B & 0 \\ 0 & Σ_{c} \end{matrix}]$ ; therefore the joint distribution of b_ig, c_i conditional on y_i is a multivariate normal distribution with mean

E_{Ψ^{(o)}} (b_{i g}, c_{i} | y_{i}) = Σ_{i b c g}^{T} {(Z_{i} (t) W_{g}^{2} B Z_{i} {(t)}^{T} + V_{i} Σ_{C} V_{i}^{T} + Σ_{e})}^{- 1} (y_{i} - X_{1 i} (t) β_{g})

(7)

And variance-covariance matrix

{var}_{Ψ^{(o)}} (b_{i g}, c_{i} | y_{i}) = Σ_{b c g} - Σ_{i b c g}^{T} {(Z_{i} (t) W_{g}^{2} B Z_{i} {(t)}^{T} + V_{i} Σ_{c} V_{i}^{T} + Σ_{ε})}^{- 1} Σ_{i b c g} = E_{Ψ^{(o)}} [\begin{matrix} b_{i g} b_{i g}^{T} | y_{i} & b_{i g} c_{i}^{T} | y_{i} \\ c_{i} b_{i g}^{T} | y_{i} & c_{i} c_{i}^{T} | y_{i} \end{matrix}] - [\begin{matrix} b_{i g}^{(o)} b_{i g}^{{(o)}^{T}} & b_{i g}^{(o)} c_{i}^{{(o)}^{T}} \\ c_{i}^{(o)} b_{i g}^{{(o)}^{T}} & c_{i}^{(o)} c_{i}^{{(o)}^{T}} \end{matrix}]

(8)

From (8) above:

E_{Ψ^{(o)}} [\begin{matrix} b_{i g} b_{i g}^{T} | y_{i} & b_{i g} c_{i}^{T} | y_{i} \\ c_{i} b_{i g}^{T} | y_{i} & c_{i} c_{i}^{T} | y_{i} \end{matrix}] = Σ_{b c g} - Σ_{i b c g}^{T} {(Z_{i} (t) W_{g}^{2} B Z_{i} {(t)}^{T} + V_{i} Σ_{c} V_{i}^{T} + Σ_{ε})}^{- 1} Σ_{i b c g} + [\begin{matrix} b_{i g}^{(o)} b_{i g}^{{(o)}^{T}} & b_{i g}^{(o)} c_{i}^{{(o)}^{T}} \\ c_{i}^{(o)} b_{i g}^{{(o)}^{T}} & c_{i}^{(o)} c_{i}^{{(o)}^{T}} \end{matrix}]

(9)

Thus, all conditional expectations can be obtained from (9).

Implementing the M-step is relatively trivial since there exists a closed-form solution to the maximization of the conditional expectation of the complete-data log-likelihood for the majority of the parameters except for $γ_{g}^{(o + 1)}$ in the model for $τ_{i g}^{(o)}$ , which has to be updated numerically. For all other parameters, closed-form solutions are available and are given below:

σ_{ε k}^{2 (o + 1)} = \frac{\sum_{i = 1}^{N} \sum_{g = 1}^{G} \sum_{j = 1}^{n_{i k}} τ_{i g}^{(o)} {(y_{i j} - X_{1 i j} (t) β_{g}^{(o)} - Z_{i j} (t) b_{i g}^{(o)} - V_{i j} c_{i}^{(o)})}^{2}}{\sum_{i = 1}^{N} \sum_{g = 1}^{G} \sum_{j = 1}^{n_{i k}} τ_{i g}^{(o)}}

σ_{c k}^{2 (o + 1)} = \frac{\sum_{i = 1}^{N} \sum_{g = 1}^{G} τ_{i g}^{(o)} {(c_{i k}^{(o)})}^{2}}{\sum_{i = 1}^{N} \sum_{g = 1}^{G} τ_{i g}^{(o)}}

W_{g}^{2 (o + 1)} = \frac{\sum_{i = 1}^{N} τ_{i g}^{(o)} b_{i g}^{{(o)}^{T}} B^{{(o)}^{- 1}} b_{i g}^{o}}{\sum_{i = 1}^{N} l * τ_{i g}^{(o)}}

B^{(o + 1)} = \frac{\sum_{i = 1}^{N} \sum_{g = 1}^{G} τ_{i g}^{(o)} b_{i g}^{(o)} b_{i g}^{{(o)}^{T}}}{\sum_{i = 1}^{N} \sum_{g = 1}^{G} τ_{i g}^{(o)} W_{g}^{2 (o)}}

β_{g}^{(o + 1)} = {\sum_{i = 1}^{N} τ_{i g}^{(o)} X_{1 i} {(t)}^{T} Σ_{ε}^{{(o)}^{- 1}} X_{1 i} (t)}^{- 1} \sum_{i = 1}^{N} τ_{i g}^{(o)} X_{1 i} {(t)}^{T} Σ_{ε}^{{(o)}^{- 1}} (y_{i} - X_{1 i} (t) b_{i g}^{(o)} - V_{i} c_{i}^{(o)})

The E-step and M-step will be repeated until the difference of observed likelihood becomes smaller than a pre-specified threshold. For model fitting and parameter estimation we used SAS PROC IML. Initial parameter estimates used in the iterative program were obtained using SAS PROC NLMIXED without any random effects. To avoid local maxima, different initial parameter values around the estimates from PROC NLMIXED were used. Variance covariance matrix was calculated using the negative inversion of Hessian matrix by using the SAS function NLPFDD through the finite-differences method.

3.2 Posterior classification and model selection

Assignment of each subject into a latent class can be achieved using the posterior probability defined in (6) and estimated based on the maximum likelihood estimates of the parameters. As can be seen from (6), group membership is jointly determined by both probabilities in each group and group specific latent trajectories [12]. A subject is classified in the latent class for which he or she has the highest posterior probability [12]. While this class assignment appears subjective, it was shown to be an optimal choice under the normal distribution assumption in terms of minimizing the probability of error, i.e., assigning the subject to the wrong class [24]. If the normal distribution assumption is violated, this method of assignment may not retain its optimal property. Other classification approaches may be explored including the K-nearest neighbor classification method and random assignment [25, 26]. These posterior probabilities are then used to evaluate the degree to which the latent classes can be distinguished by the data [27]. Specifically, we will calculate a G × G classification table, with each row representing the average posterior probabilities for each latent class among subjects assigned to a given latent class [27]. High diagonal values close to 1 and low off-diagonal values close to 0 indicate good classification quality.

The number of latent classes for each data set is unknown and needs to be pre-specified before each model estimation procedure. Many model selection procedures can be used to select the ‘best’ model when varying numbers of latent classes are used. In this study, Bayesian information criterion (BIC) [28] will be used to select the number of latent classes. Although there has been conflicting conclusions about its performance [12, 20, 29–31], BIC has been found to have superior performances in several studies [12, 29]. With its ease to implement, BIC has been recommended by many researchers to select the number of latent class [12, 13, 15].

4. Simulation Studies

For the simulation study, we focused on parameter estimation and latent class classification, i.e., whether we can identify the true trajectories and assign individuals into the correct latent classes. Data were simulated under two cognitive domains with two tests in each domain. For each domain, linear trajectory was assumed. In addition to a time variable, a binary variable and a continuous variable were simulated as covariates for the domain specific fixed effects. For class specific random effects, both intercept and slope were assumed. To determine the latent class membership, one continuous variable was simulated. We designed the simulation studies to mimic longitudinal data arising from medical research, which usually contain missing data. Each sample was first simulated to have measurements at three time points and test scores at one to two time points were then randomly set to be missing for some randomly chosen samples.

Five scenarios with 2, 3, 4, 5, 6 latent classes were simulated. For each scenario, we generated 500 replications with each replication consisting 1500 subjects. Under each scenario we fitted a latent trajectory model with the true number of latent classes, e.g., for data that consist of 4 latent classes, only the 4-class model was fitted.

Table 1 shows the resulting parameter estimates for the two-class model. It appears that our proposed method yields adequate parameter and standard error estimates for all model parameters. Similar results were obtained for the 3 to 6-class models (Appendix). Table 2 includes average coverage probabilities of 95% confidence intervals for all model parameters and misclassification rates. Coverage estimates are defined as the percentage of times that the 95% confidence interval of a parameter estimate contains the true parameter across all replications. Misclassification rate is calculated as the percentage of samples that are assigned to the wrong latent class according to the posterior probability. In our simulations, misclassification rates ranged from near 0 to 13.97%, with misclassification rates tending to increase with the number of latent classes. This is expected since there is more opportunities for classification error when there are more latent classes. In addition, for a fixed sample size, as the number of latent classes increases, the number of samples within each latent class decreases, leading to increased standard error estimates of class-specific parameters. Classification error thus increases with less well- separated classes.

Table 1.

Mean parameter estimates, asymptotic standard error (SE) and empirical standard error from 500 replications in simulations for a two-class model

Parameter

True
Value

Mean
estimates

Asymptotic
SE

Empirical
SE

γ₁₁

6.00

6.02

0.32

γ₁₂

−12.00

−12.05

0.61

0.62

β₁₁

5.60

5.58

0.13

β₁₂

1.60

1.57

0.04

0.05

β₁₃

1.60

0.12

β₁₄

1.60

0.19

0.20

β₁₅

1.20

1.18

0.13

0.14

β₁₆

1.20

1.16

0.04

0.06

β₁₇

1.20

0.13

β₁₈

1.20

1.19

0.20

0.22

β₂₁

−5.60

−5.58

0.13

β₂₂

−1.60

−1.56

0.04

0.05

β₂₃

−1.60

0.12

β₂₄

−1.60

0.20

β₂₅

−1.20

−1.18

0.13

β₂₆

−1.20

−1.17

0.04

0.05

β₂₇

−1.20

0.12

β₂₈

−1.20

−1.18

0.21

σ_ε1

0.80

0.02

σ_ε2

1.20

0.02

σ_ε3

0.90

0.02

σ_ε4

1.10

0.02

σ_c1

1.10

0.04

σ_c2

0.90

0.05

σ_c3

1.20

0.04

σ_c4

0.80

0.79

0.06

B₁

1.20

1.19

0.05

B₂

1.40

0.03

B₃

1.30

1.29

0.05

B₄

1.50

0.03

B₁₂

0.65

0.04

0.03

B₁₃

0.15

0.05

B₁₄

0.15

0.04

B₂₃

0.15

0.04

B₂₄

0.15

0.03

B₃₄

0.58

0.03

W_{2}^{2}

0.90

0.05

Open in a new tab

Note: Asymptotic standard error is the average of SE from 500 replicates. Empirical SE is standard error of estimates from 500 replications. γ, σ_ε, σ_c, β, W², B are as defined in section 2. B₁ to B₄ are the square roots of the diagonal elements of matrix B, and all other B parameters are correlation coefficients of the corresponding terms as indicated by numbers in the subscripts.

Table 2.

Average coverage probabilities of 95% confidence intervals for all parameters in a given model, and misclassification rates of simulation results.

Number of classes	Average coverage (range)	Misclassification rate
2	94.89% (92.60%–98.80%)	0.001%
3	95.16% (92.80%–99.00%)	2.29%
4	95.61% (91.40%–99.80%)	8.60%
5	95.73% (93.00%–100.00%)	12.29%
6	96.15% (92.60%–100.00%)	13.97%

Open in a new tab

5. Application to the UDS data

The proposed multivariate finite mixture latent trajectory model was applied to examine longitudinal patterns of cognitive decline among dementia patients using data from the UDS in the NACC data repository. The UDS is an ongoing data collection that was implemented in 2005 at 34 past and present NIA-founded Alzheimer’s Disease Centers (ADC) around the country [19]. Patients were recruited into the ADCs and followed annually to collect information relevant to aging and dementia, including performance measures on neuropsychological tests in multiple cognitive domains such as memory and language [32, 33].

We used UDS Jan. 2014 data freeze. The sample in the analysis included Caucasian patients with any type of dementia who had at least four annual cognitive evaluations. We also restricted our analyses to those whose cognitive decline began after 60 years of age in order to exclude patients with early onset dementia. Tests from two cognitive domains were used: logical memory immediate and delayed recalls tests for the memory domain; Animal Fluency Test and the Boston Naming Test for the language domain. As indicated by Weintraub et al, age of onset, gender and education had significant effects on test scores and were included in both the class membership model and the latent trajectory model [33]. Final analysis data set included 30,004 observations from 1517 patients, of whom 52.74% were male with 15.07 mean years of education and 73.33 as mean age of onset. Since these four test scores have different ranges, all outcomes were rescaled to be between 0 and 10 to achieve computational efficiency. In addition, education (in number of years) and age of onset (in years) were rescaled to be between 0 and 1. The time variable, age, was measured by decades and centered on the mean age. We tested linear and quadratic trajectories with the assumed number of latent classes ranging from 2 to 6. We present the estimated log likelihood and BIC for all models in table 3.

Table 3.

Estimated log likelihoods and BICs in the UDS data for various models assuming different numbers of latent classes.

	Linear Trajectory			Quadratic Trajectory

number of classes	number of parameters	Log Likelihood	BIC	number of parameters	Log Likelihood	BIC
2	43	−47711.57	95738.10	47	−47675.48	95695.22
3	58	−47394.82	95214.45	64	−47329.63	95128.02
4	73	−46992.81	94520.31	81	−46971.34	94535.97
5	88	−46871.30	94387.15	98	−46832.84	94383.48
6	103	−46791.45	94337.31	115	−46713.23	94268.78

Open in a new tab

Note: For the linear trajectory model with 2 classes, there are 4 parameters for Σ_ε; 4 parameters for Σ_c; 10 parameters for covariance matrix of the first latent class B; 1 parameter for $W_{2}^{2}$ ; 4 parameters for γ₂; 10 parameters for β₁ and β₂ respectively, leading to a total of 43 parameters. With each increasing number of classes, 1 parameter for $W_{2}^{g}$ ; 4 parameters for γ_g; 10 parameters for β_g are added. For quadratic modesl, 2 quadratic parameters for memory and language domain are added for each β_g.

It can be seen that the differences between linear and quadratic models are small relative to the complexity of the models. Therefore, we chose the linear model for its ease of interpretation. The decrease in BIC was more pronounced when the number of latent classes increased from 2 to 4, but the BIC values became relatively flat with 4 or more latent classes; thus, we chose the model with 4 latent classes as the final model following the recommended practice by several authors [12, 13, 15]. Parameters estimates in the multinomial model for latent class memberships and for the fixed effects in the latent trajectory models are presented in table 4.

Table 4.

Parameter estimates for the latent class memebership model and for the fixed effects in the latent trajactory model.

		Class 1		Class 2		Class 3		Class 4

Models	Parameter	Est.	SE	Est.	SE	Est.	SE	Est.	SE
Multinomial
	Intercept	3.08	0.63	4.71	0.63	5.53	0.61	0	Ref
	Sex	−0.20	0.28	−0.63	0.28	−0.20	0.32	0	Ref
	Education	−1.65	0.99	−3.99	0.97	−5.98	0.80	0	Ref
	Age of onset	−4.16	1.16	−1.98	1.19	−1.52	1.72	0	Ref
Trajectory
Memory domain	Intercept	−2.28	0.26	−0.37	0.36	−0.14	0.10	−9.19	1.58
	Linear slope	−2.18	0.12	−1.90	0.16	−0.50	0.04	−8.42	0.39
	Sex	−0.08	0.08	−0.29	0.17	−0.03	0.03	−0.39	0.85
	Education	0.19	0.34	1.40	0.40	0.02	0.12	0.34	1.93
	Age of onset	8.50	0.54	6.30	0.89	1.92	0.21	34.32	2.01
Language domain	Intercept	−6.98	1.51	1.49	0.53	0.22	0.41	−0.73	0.37
	Linear slope	−8.17	0.17	−2.35	0.14	−2.92	0.11	−4.91	0.30
	Sex	−0.26	0.33	−0.38	0.16	−0.35	0.14	−0.40	0.54
	Education	0.31	1.87	2.28	0.66	2.25	0.57	−1.47	1.02
	Age of onset	31.10	0.97	7.41	0.70	9.67	0.65	19.03	1.63

Open in a new tab

Note: male is the reference for gender.

In figure 1, we present the predicted trajectories of male patients with 15 years of education and age of onset at 73 (chosen as the sample means) in four latent classes. Latent class 1 has the steepest decline in language but is relatively flat in memory decline; latent class 4 has the fastest decline in memory and also the second fastest decline in language; patients in latent classes 2 and 3 have less decline in both language and memory domains than those in latent classes 1 and 4.

Firgure 1 — Estimated trajectories of language (left) and memory (right) decline for male dementia patients with education and age of onset at the sample means in four latent classes.

We further examined the association between patient characteristics and the four identified latent classes and present the results in Table 5. Since APOE e4 allele is an important risk factor for AD and about 60% of AD patients carrys this allele [19, 34–37], we also included percentages of samples having it in each latent class. Latent classes 3 and 2 captured the majority of patients followed by latent class 1 and 4. More than 70% of the patients in latent classes 1 and 3 were clinically diagnosed as probable AD only as defined by NINCDS-ADRDA criteria [38, 39] and no any other form of dementia, while class 2 had the lowest percentage of probable AD only (41.37%) and the highest percentage of other demenitas. Patients in latent class 1 also had the highest percentage of being an APOE e4 carrier compared to patients in the other classes. For latent classes 2 and 4, about half of samples have other types of dementia and not surprisingly, less than half of samples have APOE e4 allele. However, just as there were differences between latent classes 1 and 3, latent classes 2 and 4 also differ in gender composition, years of education, and age of onset pointing to potentially different etiologies.

Table 5.

Patients characteristics by the four identified latent classes.

class	number of patients	male %	Average years of Education (SD)	Average age of onset (SD)	APOE e4 carrier (%)	Probable AD only (%)	Other Dementia (%)
1	300	54.00	15.91(2.89)	70.66(6.56)	72.69	76.00	22.67
2	510	59.80	15.18(3.30)	73.35(7.21)	44.57	41.37	55.88
3	560	46.96	14.16(3.24)	74.03(6.88)	57.79	71.96	27.14
4	147	47.62	16.38(2.10)	76.06(9.00)	47.06	51.02	42.86

Open in a new tab

To evaluate model fit, the average posterior probabilities for the linear model with 4 latent classes are presented in table 6. The diagonal values are all greater than 0.84, indicating a good separation of the four latent classes.

Table 6.

Average posterior probabilities of 4 latent classes identified

Classified class	1	2	3	4
1	0.87	0.04	0.07	0.02
2	0.02	0.88	0.04	0.06
3	0.05	0.10	0.84	0.00
4	0.02	0.09	0.00	0.89

Open in a new tab

6. Discussion

In this paper, we proposed a multivariate finite mixture latent trajectory model targeted to data that are often encountered in dementia studies. In these studies, there are multiple domains of cognitive performance, each of which may be measured by multiple neuropsychological tests. Our model is an extension of GBTM, GMM and the non-linear latent class model proposed by Proust-Lima et al, and can be used in studies where more than one test for the same underlying variable has been measured. We applied our method to the UDS data and identified four latent cognitive decline patterns.

Given that multiple cognitive measures are routinely collected in dementia and aging studies, appropriate statistical models with realistic assumptions for multiple tests in more than one domain are extremely important. The naïve method of analyzing data with multiple tests by modeling each test with a separate latent trajectory and without combining the information across different cognitive domains can lead to the spurious identification of many latent classes. There are some existing methods that attempt to reduce dimensionality by combining tests within the same domain using summed or weighted average test scores. However, as indicated by Gray and Brookmeyer, data reduction may cause loss of information and the results may be difficult to interpret [40]. In our method, by jointly modeling tests within the same domain, the number of model parameters is greatly reduced. By adding random test-specific effects, the differences and correlations among tests are accounted for. Furthermore, since these tests are often measurements of the same underlying latent construct, joint modeling can improve our ability to identify this construct.

The identified latent classes can be used for therapeutic and research purpose. Since patients in the same latent classes share similar cognitive decline patterns, this can help care providers and clinicians for better patients care and treatment. In addition, patients in each latent class may share the same disease etiology and may be caused by same genes; therefore the power in genetic studies that look for genes related dementia can be improved. For example, based on recent summary at ALzGene database, 695 genes related AD are found from 1395 studies, however, only a few of them are confirmed by multiple studies [34, 35]. The reason for this is, although patients are all clinically diagnosed as AD, their cognitive decline patterns differ dramatically and this heterogeneity makes results from a given study hard to replicate thus the ability to find true genes is greatly reduced. Our method can be used to find samples that have similar cognitive decline patterns and genetics studies from these phenotypically homogenous sub groups will be more comparable.

In our model, we assumed a normal distribution due to its tractability and ease of implementation. In our application, the distributions of logical memory immediate recall, Animal Fluency Test and the Boson Naming Test were approximately normal while logical memory delayed recall showed skewness to the right. However, the normal distribution assumption may not apply for tests that have categorical or binary responses. In the future, we will extend our work to model non-normal variables and/or mixed types of variables. Another limitation of using the normal assumption lies in selecting the number of classes using information-based criterion like BIC. It has been observed in this and many other studies that BIC always decreases as more classes are added [12, 20]. This problem is more pronounced when the sample size is large and sample sizes in each class are unbalanced. In these cases, the latent classes with larger sample sizes can be split into two or more latent classes [12] and currently the best way to address this problem is using external information to aid in model selection.

A common problem encountered in many dementia studies is floor or ceiling effects associated with some of the test scores. Proust et al proposed a transformation for the test scores using the cumulative beta distribution and demonstrated that the transformation fits the data well under these circumstances [17]. Jacqmin-Gadda et al have proposed a semi-parametric latent process model to address the particular problem of differential sensitivity of tests at different dementia stages [41]. In situations where a test consists of multiple items, an alternative approach is to model the distribution of individual item scores instead of the aggregated test score using the item response theory (IRT) [42–44]. This approach has the advantage of handling the floor and ceiling effect. However, it was designed to allow a single latent trajectory and additional research is needed to extend it to include multiple latent trajectories from more than one cognitive domain. Future research will be needed to generalize our models to increase their robustness given these additional challenges.

Acknowledgments

The work is supported by National Institutes of Health (NIH) under Grants R01 AG019181, and P30 AG10133.

The NACC database is funded by NIA/NIH Grant U01 AG016976. NACC data are contributed by the NIA-funded ADCs: P30 AG019610 (PI Eric Reiman, MD), P30 AG013846 (PI Neil Kowall, MD), P50 AG008702 (PI Scott Small, MD), P50 AG025688 (PI Allan Levey, MD, PhD), P30 AG010133 (PI Andrew Saykin, PsyD), P50 AG005146 (PI Marilyn Albert, PhD), P50 AG005134 (PI Bradley Hyman, MD, PhD), P50 AG016574 (PI Ronald Petersen, MD, PhD), P50 AG005138 (PI Mary Sano, PhD), P30 AG008051 (PI Steven Ferris, PhD), P30 AG013854 (PI M. Marsel Mesulam, MD), P30 AG008017 (PI Jeffrey Kaye, MD), P30 AG010161 (PI David Bennett, MD), P30 AG010129 (PI Charles DeCarli, MD), P50 AG016573 (PI Frank LaFerla, PhD), P50 AG016570 (PI David Teplow, PhD), P50 AG005131 (PI Douglas Galasko, MD), P50 AG023501 (PI Bruce Miller, MD), P30 AG035982 (PI Russell Swerdlow, MD), P30 AG028383 (PI Linda Van Eldik, PhD), P30 AG010124 (PI John Trojanowski, MD, PhD), P50 AG005133 (PI Oscar Lopez, MD), P50 AG005142 (PI Helena Chui, MD), P30 AG012300 (PI Roger Rosenberg, MD), P50 AG005136 (PI Thomas Montine, MD, PhD), P50 AG033514 (PI Sanjay Asthana, MD, FRCP), and P50 AG005681 (PI John Morris, MD).

Appendix

Mean parameter estimates, asymptotic standard error (SE) and empirical standard error from 500 replications in simulations for 2 to 6-class model

A: 2-class model

Parameter

True

Value

Mean
estimates

Asymptotic
SE

Empirical
SE

γ₁₁

6.00

6.02

0.32

γ₁₂

−12.00

−12.05

0.61

0.62

β₁₁

5.60

5.58

0.13

β₁₂

1.60

1.57

0.04

0.05

β₁₃

1.60

0.12

β₁₄

1.60

0.19

0.20

β₁₅

1.20

1.18

0.13

0.14

β₁₆

1.20

1.16

0.04

0.06

β₁₇

1.20

0.13

β₁₈

1.20

1.19

0.20

0.22

β₂₁

−5.60

−5.58

0.13

β₂₂

−1.60

−1.56

0.04

0.05

β₂₃

−1.60

0.12

β₂₄

−1.60

0.20

β₂₅

−1.20

−1.18

0.13

β₂₆

−1.20

−1.17

0.04

0.05

β₂₇

−1.20

0.12

β₂₈

−1.20

−1.18

0.21

σ_ε1

0.80

0.02

σ_ε2

1.20

0.02

σ_ε3

0.90

0.02

σ_ε4

1.10

0.02

σ_c1

1.10

0.04

σ_c2

0.90

0.05

σ_c3

1.20

0.04

σ_c4

0.80

0.79

0.06

B₁

1.20

1.19

0.05

B₂

1.40

0.03

B₃

1.30

1.29

0.05

B₄

1.50

0.03

B₁₂

0.65

0.04

0.03

B₁₃

0.15

0.05

B₁₄

0.15

0.04

B₂₃

0.15

0.04

B₂₄

0.15

0.03

B₃₄

0.58

0.03

W_{2}^{2}

0.90

0.05

Open in a new tab

B: 3-class model

Parameter

True
Value

Mean
estimates

Asymptotic
SE

Empirical
SE

γ₁₁

11.00

11.05

0.68

0.69

γ₁₂

−14.00

−14.13

0.74

0.72

γ₂₁

7.00

6.99

0.69

0.70

γ₂₂

−7.00

−6.99

0.63

β₁₁

5.60

5.57

0.13

0.14

β₁₂

1.60

1.57

0.04

0.06

β₁₃

1.60

0.13

β₁₄

1.60

0.22

β₁₅

1.20

1.17

0.15

β₁₆

1.20

1.16

0.04

0.07

β₁₇

1.20

0.13

0.14

β₁₈

1.20

1.22

0.24

β₂₁

−5.60

−5.57

0.14

0.15

β₂₂

−1.60

−1.57

0.05

0.06

β₂₃

−1.60

0.14

β₂₄

−1.60

−1.62

0.23

0.24

β₂₅

−1.20

−1.17

0.16

β₂₆

−1.20

−1.17

0.05

0.07

β₂₇

−1.20

0.14

0.15

β₂₈

−1.20

−1.19

0.26

0.25

β₃₁

6.00

5.98

0.15

β₃₂

2.00

1.97

0.05

0.06

β₃₃

2.00

0.14

0.13

β₃₄

2.00

0.25

0.23

β₃₅

1.60

1.58

0.15

β₃₆

1.60

1.57

0.05

0.06

β₃₇

1.60

1.59

0.15

0.14

β₃₈

1.60

1.61

0.24

σ_ε1

0.80

0.01

σ_ε2

1.20

0.02

σ_ε3

0.90

0.02

0.01

σ_ε4

1.10

0.02

σ_c1

1.10

0.04

σ_c2

0.90

0.89

0.04

σ_c3

1.20

0.04

σ_c4

0.80

0.05

B₁

1.20

1.19

0.05

B₂

1.40

0.03

0.04

B₃

1.30

1.29

0.05

B₄

1.50

0.04

B₁₂

0.65

0.03

B₁₃

0.58

0.03

B₁₄

0.15

0.16

0.05

B₂₃

0.15

0.04

B₂₄

0.15

0.04

0.03

B₃₄

0.15

0.03

W_{2}^{2}

0.90

0.05

W_{3}^{2}

0.80

0.05

Open in a new tab

C: 4-class model

Parameter

True
Value

Mean
estimates

Asymptotic
SE

Empirical
SE

γ₁₁

15.00

15.16

1.48

1.60

γ₁₂

−14.00

−14.17

1.10

1.19

γ₂₁

12.00

12.15

1.46

1.58

γ₂₂

−9.00

−9.13

1.03

1.12

γ₃₁

8.00

8.04

1.24

1.36

γ₃₂

−6.00

−6.03

0.79

0.88

β₁₁

5.60

0.16

0.17

β₁₂

1.60

0.04

0.07

β₁₃

1.60

0.15

β₁₄

1.60

0.26

β₁₅

1.50

1.49

0.17

0.18

β₁₆

1.50

0.04

0.08

β₁₇

1.50

0.16

β₁₈

1.50

1.51

0.27

0.28

β₂₁

7.60

0.16

0.17

β₂₂

−1.60

0.04

0.07

β₂₃

−1.60

0.15

β₂₄

−1.60

0.24

0.26

β₂₅

−1.50

−1.51

0.17

0.18

β₂₆

−1.50

0.04

0.08

β₂₇

−1.50

0.17

0.16

β₂₈

−1.50

−1.49

0.28

β₃₁

4.00

3.99

0.27

β₃₂

1.20

0.06

0.11

β₃₃

1.20

0.23

0.24

β₃₄

1.20

1.22

0.42

β₃₅

1.10

1.11

0.28

0.29

β₃₆

1.10

0.06

0.12

β₃₇

1.10

0.27

0.26

β₃₈

1.10

1.09

0.45

0.46

β₄₁

6.00

0.16

β₄₂

−1.20

0.04

0.06

β₄₃

−1.20

−1.19

0.15

β₄₄

−1.20

0.25

β₄₅

−1.10

0.17

β₄₆

−1.10

0.04

0.06

β₄₇

−1.10

0.15

0.16

β₄₈

−1.10

−1.12

0.27

σ_ε1

0.80

0.01

σ_ε2

1.20

0.02

σ_ε3

0.90

0.01

σ_ε4

1.10

0.02

σ_c1

1.10

0.04

σ_c2

0.90

0.04

σ_c3

1.20

0.04

σ_c4

0.80

0.05

B₁

1.20

1.19

0.05

0.06

B₂

1.40

0.04

B₃

1.30

1.29

0.06

B₄

1.50

0.05

0.04

B₁₂

0.65

0.04

B₁₃

0.58

0.59

0.03

B₁₄

0.15

0.06

B₂₃

0.15

0.04

B₂₄

0.15

0.04

B₃₄

0.15

0.03

W_{2}^{2}

0.90

0.07

0.06

W_{3}^{2}

0.80

0.08

W_{4}^{2}

0.60

0.05

0.04

Open in a new tab

D: 5-class model

Parameter

True
Value

Mean
estimates

Asymptotic
SE

Empirical
SE

γ₁₁

20.00

20.38

1.67

1.77

γ₁₂

−15.00

−15.28

1.23

1.31

γ₂₁

17.00

17.35

1.62

1.73

γ₂₂

−11.00

−11.23

1.12

1.19

γ₃₁

14.00

14.26

1.35

1.42

γ₃₂

−8.00

−8.15

0.74

0.77

γ₄₁

10.00

10.14

1.10

1.18

γ₄₂

−5.00

−5.07

0.54

0.58

β₁₁

5.60

5.61

0.18

0.19

β₁₂

1.60

0.05

0.08

β₁₃

1.60

0.17

β₁₄

1.60

0.28

β₁₅

1.50

1.51

0.18

0.19

β₁₆

1.50

0.05

0.08

β₁₇

1.50

1.51

0.18

β₁₈

1.50

1.48

0.31

0.30

β₂₁

7.60

0.26

β₂₂

−1.60

0.06

0.11

β₂₃

−1.60

−1.61

0.25

0.23

β₂₄

−1.60

0.39

0.41

β₂₅

−1.50

−1.51

0.27

β₂₆

−1.50

0.06

0.12

β₂₇

−1.50

−1.49

0.25

β₂₈

−1.50

−1.49

0.45

0.44

β₃₁

4.00

0.23

0.25

β₃₂

1.20

0.06

0.10

β₃₃

1.20

1.21

0.22

β₃₄

1.20

1.18

0.37

0.38

β₃₅

1.10

1.09

0.26

0.27

β₃₆

1.10

0.06

0.11

β₃₇

1.10

1.09

0.25

0.24

β₃₈

1.10

1.13

0.40

0.42

β₄₁

6.00

6.02

0.19

0.20

β₄₂

−1.20

0.05

0.08

β₄₃

−1.20

−1.19

0.18

β₄₄

−1.20

−1.23

0.30

β₄₅

−1.10

−1.09

0.19

0.20

β₄₆

−1.10

0.05

0.08

β₄₇

−1.10

0.19

β₄₈

−1.10

−1.12

0.32

β₅₁

3.00

3.01

0.21

0.22

β₅₂

1.40

0.05

0.11

β₅₃

1.40

0.20

β₅₄

1.40

1.39

0.34

β₅₅

−1.30

−1.29

0.22

0.23

β₅₆

−1.30

0.05

0.10

β₅₇

−1.30

−1.31

0.22

0.21

β₅₈

−1.30

0.37

0.36

σ_ε1

0.80

0.01

σ_ε2

1.20

0.02

σ_ε3

0.90

0.01

σ_ε4

1.10

0.02

σ_c1

1.10

0.04

σ_c2

0.90

0.05

σ_c3

1.20

0.04

σ_c4

0.80

0.05

B₁

1.20

1.19

0.06

B₂

1.40

0.04

B₃

1.30

1.29

0.06

B₄

1.50

0.05

B₁₂

0.65

0.66

0.04

B₁₃

0.58

0.59

0.03

B₁₄

0.15

0.06

B₂₃

0.15

0.04

B₂₄

0.15

0.04

B₃₄

0.15

0.03

W_{2}^{2}

0.90

0.08

W_{3}^{2}

0.80

0.79

0.08

W_{4}^{2}

0.60

0.05

W_{5}^{2}

1.10

0.09

Open in a new tab

E: 6-class model

Parameter

True
Value

Mean
estimates

Asymptotic
SE

Empirical
SE

γ₁₁

21.00

21.31

1.60

1.85

γ₁₂

−14.00

−14.26

1.04

1.12

γ₂₁

20.00

20.28

1.56

1.83

γ₂₂

−10.00

−10.18

0.81

0.93

γ₃₁

17.00

17.21

1.47

1.72

γ₃₂

−8.00

−8.11

0.67

0.78

γ₄₁

14.00

14.17

1.43

1.63

γ₄₂

−6.00

−6.07

0.59

0.68

γ₅₁

10.00

10.11

1.31

1.37

γ₅₂

−4.00

−4.04

0.51

0.53

β₁₁

5.60

5.61

0.31

0.32

β₁₂

1.60

0.07

0.13

β₁₃

1.60

1.58

0.29

0.28

β₁₄

1.60

0.49

0.50

β₁₅

1.50

0.31

0.34

β₁₆

1.50

0.08

0.14

β₁₇

1.50

0.30

β₁₈

1.50

1.49

0.51

0.55

β₂₁

7.60

0.17

0.18

β₂₂

−1.60

−1.61

0.05

0.07

β₂₃

−1.60

0.15

0.16

β₂₄

−1.60

−1.59

0.26

0.27

β₂₅

−1.50

−1.51

0.17

0.18

β₂₆

−1.50

0.05

0.08

β₂₇

−1.50

0.18

0.17

β₂₈

−1.50

−1.49

0.28

0.29

β₃₁

4.00

3.99

0.26

0.30

β₃₂

1.20

0.06

0.12

β₃₃

1.20

1.19

0.24

0.25

β₃₄

1.20

1.21

0.42

0.46

β₃₅

1.10

1.09

0.29

0.33

β₃₆

1.10

1.09

0.07

0.13

β₃₇

1.10

0.27

β₃₈

1.10

1.11

0.48

0.52

β₄₁

6.00

6.03

0.29

0.32

β₄₂

−1.20

0.07

0.12

β₄₃

−1.20

0.26

0.27

β₄₄

−1.20

−1.23

0.45

0.49

β₄₅

−1.10

−1.08

0.28

0.32

β₄₆

−1.10

0.06

0.12

β₄₇

−1.10

0.28

0.29

β₄₈

−1.10

−1.13

0.46

0.54

β₅₁

3.00

0.24

0.25

β₅₂

1.40

0.06

0.13

β₅₃

1.40

1.41

0.22

0.23

β₅₄

1.40

1.39

0.39

0.40

β₅₅

−1.30

−1.31

0.25

0.28

β₅₆

−1.30

0.07

0.12

β₅₇

−1.30

0.25

β₅₈

−1.30

0.42

0.45

β₆₁

6.00

6.01

0.24

β₆₂

−1.40

0.06

0.11

β₆₃

−1.40

0.22

β₆₄

−1.40

−1.41

0.37

0.38

β₆₅

1.30

1.29

0.25

0.27

β₆₆

1.30

0.06

0.12

β₆₇

1.30

1.31

0.24

β₆₈

1.30

0.42

σ_ε1

0.80

0.01

σ_ε2

1.20

0.02

σ_ε3

0.90

0.01

σ_ε4

1.10

0.02

σ_c1

1.10

0.04

σ_c2

0.90

0.05

σ_c3

1.20

0.04

σ_c4

0.80

0.06

0.05

B₁

1.20

1.18

0.07

B₂

1.40

1.39

0.07

B₃

1.30

1.28

0.08

B₄

1.50

1.49

0.07

B₁₂

0.65

0.66

0.04

B₁₃

0.58

0.59

0.03

B₁₄

0.15

0.06

B₂₃

0.15

0.04

B₂₄

0.15

0.04

B₃₄

0.15

0.03

W_{2}^{2}

0.90

0.91

0.10

W_{3}^{2}

0.80

0.11

W_{4}^{2}

0.60

0.09

W_{5}^{2}

1.10

1.12

0.12

0.13

W_{6}^{2}

1.05

1.06

0.12

Open in a new tab

Note: Asymptotic standard error is the average of SE from 500 replications; Empirical SE is standard error of estimates from 500 replications; γ, σ_ε, σ_c, β, W², B are defined in section 2; B₁ to B₄ are the square roots of the diagonal elements of matrix B, and all other B parameters are correlation coefficients of the corresponding terms as indicated by numbers in the subscripts.

References

1.Salmon DP, Bondi MW. Neuropsychological assessment of dementia. Annu Rev Psychol. 2009;60:257–282. doi: 10.1146/annurev.psych.57.102904.190024. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Hayden KM, et al. Cognitive decline in the elderly: an analysis of population heterogeneity. Age Ageing. 2011;40(6):684–689. doi: 10.1093/ageing/afr101. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Galton CJ, et al. Atypical and typical presentations of Alzheimer's disease: a clinical, neuropsychological, neuroimaging and pathological study of 13 cases. Brain. 2000;123(Pt 3):484–498. doi: 10.1093/brain/123.3.484. [DOI] [PubMed] [Google Scholar]
4.Martin A, et al. Towards a behavioral typology of Alzheimer's patients. J Clin Exp Neuropsychol. 1986;8(5):594–610. doi: 10.1080/01688638608405178. [DOI] [PubMed] [Google Scholar]
5.Neary D, et al. Neuropsychological syndromes in presenile dementia due to cerebral atrophy. J Neurol Neurosurg Psychiatry. 1986;49(2):163–174. doi: 10.1136/jnnp.49.2.163. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Miller BL, et al. Frontal lobe degeneration: clinical, neuropsychological, and SPECT characteristics. Neurology. 1991;41(9):1374–1382. doi: 10.1212/wnl.41.9.1374. [DOI] [PubMed] [Google Scholar]
7.Neary D, Snowden J, Mann D. Frontotemporal dementia. Lancet Neurol. 2005;4(11):771–780. doi: 10.1016/S1474-4422(05)70223-4. [DOI] [PubMed] [Google Scholar]
8.Neary D, et al. Frontotemporal lobar degeneration: a consensus on clinical diagnostic criteria. Neurology. 1998;51(6):1546–1554. doi: 10.1212/wnl.51.6.1546. [DOI] [PubMed] [Google Scholar]
9.Neary D, et al. Dementia of frontal lobe type. J Neurol Neurosurg Psychiatry. 1988;51(3):353–361. doi: 10.1136/jnnp.51.3.353. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Perry RJ, Hodges JR. Differentiating frontal and temporal variant frontotemporal dementia from Alzheimer's disease. Neurology. 2000;54(12):2277–2284. doi: 10.1212/wnl.54.12.2277. [DOI] [PubMed] [Google Scholar]
11.Nagin DS. Analyzing developmental trajectories: A semiparametric, group-based approach. Psychol Methods. 1999;4:139–157. doi: 10.1037/1082-989x.6.1.18. [DOI] [PubMed] [Google Scholar]
12.Nagin DS. Group-based modeling of development. Cambridge, Mass: Harvard University Press; 2005. [Google Scholar]
13.Nagin DS, Odgers CL. Group-based trajectory modeling in clinical research. Annu Rev Clin Psychol. 2010;6:109–138. doi: 10.1146/annurev.clinpsy.121208.131413. [DOI] [PubMed] [Google Scholar]
14.Muthen B. Beyond SEM: General latent variable modeling. Behaviormetrika. 2002;29:81–117. [Google Scholar]
15.Muthen B. Latent variable analysis: growth mixture modeling and related techniques for longitudinal data. In: Kaplan D, editor. The Sage Handbook of Quantitative Methodology for the Social Sciences. Newbury Park, CA: 2004. pp. 345–368. [Google Scholar]
16.Muthen B, Shedden K. Finite mixture modeling with mixture outcomes using the EM algorithm. Biometrics. 1999;55:463–469. doi: 10.1111/j.0006-341x.1999.00463.x. [DOI] [PubMed] [Google Scholar]
17.Proust C, et al. A nonlinear model with latent process for cognitive evolution using multivariate longitudinal data. Biometrics. 2006;62(4):1014–1024. doi: 10.1111/j.1541-0420.2006.00573.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Proust-Lima C, Letenneur L, Jacqmin-Gadda H. A nonlinear latent class model for joint analysis of multivariate longitudinal data and a binary outcome. Statistics in Medicine. 2007;26:2229–2245. doi: 10.1002/sim.2659. [DOI] [PubMed] [Google Scholar]
19.The National Alzheimer's Coordinating Center. Available from: https://www.alz.washington.edu/ [Google Scholar]
20.McLachlan GJ, Peel D. Finite Mixture Models. New York: Wiley-Intersci; 2004. [Google Scholar]
21.McLachlan GJ, Krishnan T. The EM Algorithm and Extensions. New York: John Wiley & Sons, Inc.; 1997. [Google Scholar]
22.Ng SK, et al. A mixture model with random-effects components for clustering correlated gene-expression profiles. Bioinformatics. 2006;22(14):1745–1752. doi: 10.1093/bioinformatics/btl165. [DOI] [PubMed] [Google Scholar]
23.Dempster AP, Laird NM, Rubin DB. Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, Series B. 1977;39(1):1–38. [Google Scholar]
24.Figueiredo MA. Lecture notes on bayesian estimation and classification. 2004 [Google Scholar]
25.Altman NS. An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression. American Statistician. 1992;46(3):175–185. [Google Scholar]
26.Goodman LA. On the Assignment of Individuals to Latent Classes. Sociological Methodology. 2007;37:1–22. 2007. 37. [Google Scholar]
27.Muthen B, et al. General growth mixture modeling for randomized preventive interventions. Biostatistics. 2002;3(4):459–475. doi: 10.1093/biostatistics/3.4.459. [DOI] [PubMed] [Google Scholar]
28.Schwartz G. Estimating the dimension of a model. Ann. Stat. 1978;(6):461–464. [Google Scholar]
29.Nylund KL, Asparouhov T, Muthen BO. Deciding on the number of classes in latent class analysis and growth mixture modeling: a Monte Carlo simulation study. Struct. Equ. Model. 2007;14:535–569. [Google Scholar]
30.Tofighi D, Enders CK. Identifying the correct number of classes in growth mixture models. Advances in Latent Variable Mixture Models. 2008:317–341. [Google Scholar]
31.Yang C-C. Evaluating latent class analysis models in qualitative phenotype identification. Computational Statistics & Data Analysis. 2006;50(4):1090–1104. [Google Scholar]
32.Morris JC, et al. The Uniform Data Set (UDS): clinical and cognitive variables and descriptive data from Alzheimer Disease Centers. Alzheimer Dis Assoc Disord. 2006;20(4):210–216. doi: 10.1097/01.wad.0000213865.09806.92. [DOI] [PubMed] [Google Scholar]
33.Weintraub S, et al. The Alzheimer's Disease Centers' Uniform Data Set (UDS): the neuropsychologic test battery. Alzheimer Dis Assoc Disord. 2009;23(2):91–101. doi: 10.1097/WAD.0b013e318191c7dd. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Alzheimer's Disease Genetics Consortium. Available from: https://alois.med.upenn.edu/adgc/about/overview.html. [Google Scholar]
35.The AlzGene database. Available from: http://www.alzgene.org/ [Google Scholar]
36.Corder EH, et al. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer's disease in late onset families. Science. 1993;261(5123):921–923. doi: 10.1126/science.8346443. [DOI] [PubMed] [Google Scholar]
37.Strittmatter WJ, et al. Apolipoprotein E high-avidity binding to beta-amyloid and increased frequency of type 4 allele in late-onset familial Alzheimer disease. Proc Natl Acad Sci U S A. 1993;90(5):1977–1981. doi: 10.1073/pnas.90.5.1977. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.McKhann G, et al. Clinical diagnosis of Alzheimer's disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer's Disease. Neurology. 1984;34(7):939–944. doi: 10.1212/wnl.34.7.939. [DOI] [PubMed] [Google Scholar]
39.McKhann GM, et al. The diagnosis of dementia due to Alzheimer's disease: recommendations from the National Institute on Aging-Alzheimer's Association workgroups on diagnostic guidelines for Alzheimer's disease. Alzheimers Dement. 2011;7(3):263–269. doi: 10.1016/j.jalz.2011.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Gray SM, Brookmeyer R. Estimating a treatment effect from multidimensional longitudinal data. Biometrics. 1998;54(3):976–988. [PubMed] [Google Scholar]
41.Jacqmin-Gadda H, Proust-Lima C, Amieva H. Semi-parametric latent process model for longitudinal ordinal data: Application to cognitive decline. Statistics in Medicine. 2010;29(26):2723–2731. doi: 10.1002/sim.4035. [DOI] [PubMed] [Google Scholar]
42.Fox JP. Bayesian Item Response Modeling: Theory and Applications. Bayesian Item Response Modeling: Theory and Applications. 2010:1–313. [Google Scholar]
43.Klein Entink RH, Fox JP, van den Hout A. A mixture model for the joint analysis of latent developmental trajectories and survival. Stat Med. 2011;30(18):2310–2325. doi: 10.1002/sim.4266. [DOI] [PubMed] [Google Scholar]
44.van den Hout A, Fox JP, Muniz-Terrera G. Longitudinal mixed-effects models for latent cognitive function. Statistical Modelling. 2015;15(4):366–387. [Google Scholar]

[R1] 1.Salmon DP, Bondi MW. Neuropsychological assessment of dementia. Annu Rev Psychol. 2009;60:257–282. doi: 10.1146/annurev.psych.57.102904.190024. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] 2.Hayden KM, et al. Cognitive decline in the elderly: an analysis of population heterogeneity. Age Ageing. 2011;40(6):684–689. doi: 10.1093/ageing/afr101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Galton CJ, et al. Atypical and typical presentations of Alzheimer's disease: a clinical, neuropsychological, neuroimaging and pathological study of 13 cases. Brain. 2000;123(Pt 3):484–498. doi: 10.1093/brain/123.3.484. [DOI] [PubMed] [Google Scholar]

[R4] 4.Martin A, et al. Towards a behavioral typology of Alzheimer's patients. J Clin Exp Neuropsychol. 1986;8(5):594–610. doi: 10.1080/01688638608405178. [DOI] [PubMed] [Google Scholar]

[R5] 5.Neary D, et al. Neuropsychological syndromes in presenile dementia due to cerebral atrophy. J Neurol Neurosurg Psychiatry. 1986;49(2):163–174. doi: 10.1136/jnnp.49.2.163. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Miller BL, et al. Frontal lobe degeneration: clinical, neuropsychological, and SPECT characteristics. Neurology. 1991;41(9):1374–1382. doi: 10.1212/wnl.41.9.1374. [DOI] [PubMed] [Google Scholar]

[R7] 7.Neary D, Snowden J, Mann D. Frontotemporal dementia. Lancet Neurol. 2005;4(11):771–780. doi: 10.1016/S1474-4422(05)70223-4. [DOI] [PubMed] [Google Scholar]

[R8] 8.Neary D, et al. Frontotemporal lobar degeneration: a consensus on clinical diagnostic criteria. Neurology. 1998;51(6):1546–1554. doi: 10.1212/wnl.51.6.1546. [DOI] [PubMed] [Google Scholar]

[R9] 9.Neary D, et al. Dementia of frontal lobe type. J Neurol Neurosurg Psychiatry. 1988;51(3):353–361. doi: 10.1136/jnnp.51.3.353. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] 10.Perry RJ, Hodges JR. Differentiating frontal and temporal variant frontotemporal dementia from Alzheimer's disease. Neurology. 2000;54(12):2277–2284. doi: 10.1212/wnl.54.12.2277. [DOI] [PubMed] [Google Scholar]

[R11] 11.Nagin DS. Analyzing developmental trajectories: A semiparametric, group-based approach. Psychol Methods. 1999;4:139–157. doi: 10.1037/1082-989x.6.1.18. [DOI] [PubMed] [Google Scholar]

[R12] 12.Nagin DS. Group-based modeling of development. Cambridge, Mass: Harvard University Press; 2005. [Google Scholar]

[R13] 13.Nagin DS, Odgers CL. Group-based trajectory modeling in clinical research. Annu Rev Clin Psychol. 2010;6:109–138. doi: 10.1146/annurev.clinpsy.121208.131413. [DOI] [PubMed] [Google Scholar]

[R14] 14.Muthen B. Beyond SEM: General latent variable modeling. Behaviormetrika. 2002;29:81–117. [Google Scholar]

[R15] 15.Muthen B. Latent variable analysis: growth mixture modeling and related techniques for longitudinal data. In: Kaplan D, editor. The Sage Handbook of Quantitative Methodology for the Social Sciences. Newbury Park, CA: 2004. pp. 345–368. [Google Scholar]

[R16] 16.Muthen B, Shedden K. Finite mixture modeling with mixture outcomes using the EM algorithm. Biometrics. 1999;55:463–469. doi: 10.1111/j.0006-341x.1999.00463.x. [DOI] [PubMed] [Google Scholar]

[R17] 17.Proust C, et al. A nonlinear model with latent process for cognitive evolution using multivariate longitudinal data. Biometrics. 2006;62(4):1014–1024. doi: 10.1111/j.1541-0420.2006.00573.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] 18.Proust-Lima C, Letenneur L, Jacqmin-Gadda H. A nonlinear latent class model for joint analysis of multivariate longitudinal data and a binary outcome. Statistics in Medicine. 2007;26:2229–2245. doi: 10.1002/sim.2659. [DOI] [PubMed] [Google Scholar]

[R19] 19.The National Alzheimer's Coordinating Center. Available from: https://www.alz.washington.edu/ [Google Scholar]

[R20] 20.McLachlan GJ, Peel D. Finite Mixture Models. New York: Wiley-Intersci; 2004. [Google Scholar]

[R21] 21.McLachlan GJ, Krishnan T. The EM Algorithm and Extensions. New York: John Wiley & Sons, Inc.; 1997. [Google Scholar]

[R22] 22.Ng SK, et al. A mixture model with random-effects components for clustering correlated gene-expression profiles. Bioinformatics. 2006;22(14):1745–1752. doi: 10.1093/bioinformatics/btl165. [DOI] [PubMed] [Google Scholar]

[R23] 23.Dempster AP, Laird NM, Rubin DB. Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, Series B. 1977;39(1):1–38. [Google Scholar]

[R24] 24.Figueiredo MA. Lecture notes on bayesian estimation and classification. 2004 [Google Scholar]

[R25] 25.Altman NS. An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression. American Statistician. 1992;46(3):175–185. [Google Scholar]

[R26] 26.Goodman LA. On the Assignment of Individuals to Latent Classes. Sociological Methodology. 2007;37:1–22. 2007. 37. [Google Scholar]

[R27] 27.Muthen B, et al. General growth mixture modeling for randomized preventive interventions. Biostatistics. 2002;3(4):459–475. doi: 10.1093/biostatistics/3.4.459. [DOI] [PubMed] [Google Scholar]

[R28] 28.Schwartz G. Estimating the dimension of a model. Ann. Stat. 1978;(6):461–464. [Google Scholar]

[R29] 29.Nylund KL, Asparouhov T, Muthen BO. Deciding on the number of classes in latent class analysis and growth mixture modeling: a Monte Carlo simulation study. Struct. Equ. Model. 2007;14:535–569. [Google Scholar]

[R30] 30.Tofighi D, Enders CK. Identifying the correct number of classes in growth mixture models. Advances in Latent Variable Mixture Models. 2008:317–341. [Google Scholar]

[R31] 31.Yang C-C. Evaluating latent class analysis models in qualitative phenotype identification. Computational Statistics & Data Analysis. 2006;50(4):1090–1104. [Google Scholar]

[R32] 32.Morris JC, et al. The Uniform Data Set (UDS): clinical and cognitive variables and descriptive data from Alzheimer Disease Centers. Alzheimer Dis Assoc Disord. 2006;20(4):210–216. doi: 10.1097/01.wad.0000213865.09806.92. [DOI] [PubMed] [Google Scholar]

[R33] 33.Weintraub S, et al. The Alzheimer's Disease Centers' Uniform Data Set (UDS): the neuropsychologic test battery. Alzheimer Dis Assoc Disord. 2009;23(2):91–101. doi: 10.1097/WAD.0b013e318191c7dd. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R34] 34.Alzheimer's Disease Genetics Consortium. Available from: https://alois.med.upenn.edu/adgc/about/overview.html. [Google Scholar]

[R35] 35.The AlzGene database. Available from: http://www.alzgene.org/ [Google Scholar]

[R36] 36.Corder EH, et al. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer's disease in late onset families. Science. 1993;261(5123):921–923. doi: 10.1126/science.8346443. [DOI] [PubMed] [Google Scholar]

[R37] 37.Strittmatter WJ, et al. Apolipoprotein E high-avidity binding to beta-amyloid and increased frequency of type 4 allele in late-onset familial Alzheimer disease. Proc Natl Acad Sci U S A. 1993;90(5):1977–1981. doi: 10.1073/pnas.90.5.1977. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R38] 38.McKhann G, et al. Clinical diagnosis of Alzheimer's disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer's Disease. Neurology. 1984;34(7):939–944. doi: 10.1212/wnl.34.7.939. [DOI] [PubMed] [Google Scholar]

[R39] 39.McKhann GM, et al. The diagnosis of dementia due to Alzheimer's disease: recommendations from the National Institute on Aging-Alzheimer's Association workgroups on diagnostic guidelines for Alzheimer's disease. Alzheimers Dement. 2011;7(3):263–269. doi: 10.1016/j.jalz.2011.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R40] 40.Gray SM, Brookmeyer R. Estimating a treatment effect from multidimensional longitudinal data. Biometrics. 1998;54(3):976–988. [PubMed] [Google Scholar]

[R41] 41.Jacqmin-Gadda H, Proust-Lima C, Amieva H. Semi-parametric latent process model for longitudinal ordinal data: Application to cognitive decline. Statistics in Medicine. 2010;29(26):2723–2731. doi: 10.1002/sim.4035. [DOI] [PubMed] [Google Scholar]

[R42] 42.Fox JP. Bayesian Item Response Modeling: Theory and Applications. Bayesian Item Response Modeling: Theory and Applications. 2010:1–313. [Google Scholar]

[R43] 43.Klein Entink RH, Fox JP, van den Hout A. A mixture model for the joint analysis of latent developmental trajectories and survival. Stat Med. 2011;30(18):2310–2325. doi: 10.1002/sim.4266. [DOI] [PubMed] [Google Scholar]

[R44] 44.van den Hout A, Fox JP, Muniz-Terrera G. Longitudinal mixed-effects models for latent cognitive function. Statistical Modelling. 2015;15(4):366–387. [Google Scholar]

PERMALINK

A MULTIVARIATE FINITE MIXTURE LATENT TRAJECTORY MODEL WITH APPLICATION TO DEMENTIA STUDIES

Dongbing Lai

Huiping Xu

Daniel Koller

Tatiana Foroud

Sujuan Gao

Abstract

1. Introduction

2. The Multivariate Finite Mixture Latent Trajectory Model

3. Parameter Estimation

3.1. The EM algorithm

3.2 Posterior classification and model selection

4. Simulation Studies

Table 1.

Table 2.

5. Application to the UDS data

Table 3.

Table 4.

Firgure 1.

Table 5.

Table 6.

6. Discussion

Acknowledgments

Appendix

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

A MULTIVARIATE FINITE MIXTURE LATENT TRAJECTORY MODEL WITH APPLICATION TO DEMENTIA STUDIES

Dongbing Lai

Huiping Xu

Daniel Koller

Tatiana Foroud

Sujuan Gao

Abstract

1. Introduction

2. The Multivariate Finite Mixture Latent Trajectory Model

3. Parameter Estimation

3.1. The EM algorithm

3.2 Posterior classification and model selection

4. Simulation Studies

Table 1.

Table 2.

5. Application to the UDS data

Table 3.

Table 4.

Firgure 1.

Table 5.

Table 6.

6. Discussion

Acknowledgments

Appendix

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases