Identifying typical trajectories in longitudinal data: modelling strategies and interpretations

Moritz Herle; Nadia Micali; Mohamed Abdulkadir; Ruth Loos; Rachel Bryant-Waugh; Christopher Hübel; Cynthia M Bulik; Bianca L De Stavola

doi:10.1007/s10654-020-00615-6

. 2020 Mar 5;35(3):205–222. doi: 10.1007/s10654-020-00615-6

Identifying typical trajectories in longitudinal data: modelling strategies and interpretations

Moritz Herle ^1,², Nadia Micali ^2,^3,⁴, Mohamed Abdulkadir ³, Ruth Loos ⁵, Rachel Bryant-Waugh ², Christopher Hübel ^6,^7,⁸, Cynthia M Bulik ^8,^9,¹⁰, Bianca L De Stavola ^2,^✉

PMCID: PMC7154024 PMID: 32140937

Abstract

Individual-level longitudinal data on biological, behavioural, and social dimensions are becoming increasingly available. Typically, these data are analysed using mixed effects models, with the result summarised in terms of an average trajectory plus measures of the individual variations around this average. However, public health investigations would benefit from finer modelling of these individual variations which identify not just one average trajectory, but several typical trajectories. If evidence of heterogeneity in the development of these variables is found, the role played by temporally preceding (explanatory) variables as well as the potential impact of differential trajectories may have on later outcomes is often of interest. A wide choice of methods for uncovering typical trajectories and relating them to precursors and later outcomes exists. However, despite their increasing use, no practical overview of these methods targeted at epidemiological applications exists. Hence we provide: (a) a review of the three most commonly used methods for the identification of latent trajectories (growth mixture models, latent class growth analysis, and longitudinal latent class analysis); and (b) recommendations for the identification and interpretation of these trajectories and of their relationship with other variables. For illustration, we use longitudinal data on childhood body mass index and parental reports of fussy eating, collected in the Avon Longitudinal Study of Parents and Children.

Electronic supplementary material

The online version of this article (10.1007/s10654-020-00615-6) contains supplementary material, which is available to authorized users.

Keywords: Growth mixture models, Latent class growth analysis, Longitudinal latent class analysis, Mixed effects models, ALSPAC

Introduction

Repeated observations of the same variable over time are increasingly frequent not only in purposely designed observational studies but also in large linked administrative health databases. In most applications, this type of data is analysed using mixed effects models [1, 2], leading to estimates of a population average trajectory, parametrised in terms of fixed effects, and the variation of the individual trajectories around this average. The latter is captured by the variances and covariances of subject-specific random effects. More recently, the focus of modelling such data has moved towards investigating whether there are multiple typical trajectories (see for example adolescent smoking [3], treatment response [4] and comorbidity [5]), leading to the characterisation of latent subgroups of individuals who share a common profile over time. Such groups are often referred to as “phenotypes” (e.g., early onset versus late onset of illness). Aiming to classify individuals into subgroups based on their longitudinal data has been described as being a person-centred approach, as opposed to the variable-centred approach typical of many regression analyses [6]. Often however these latent classes are studied in relation to explanatory variables [7–9] and/or later outcomes [10–12], and thus a person-centred classification may itself become a variable in a regression model, thereby blurring this distinction.

There are several modelling approaches that focus on identifying these trajectories, with alternative strategies available to relate them to earlier variables or later outcomes. The common feature of these approaches is that they all assume that a latent variable, composed of several classes, underlies the heterogeneity in how the variables evolve over time. These common approaches are:

Growth mixture models
Latent class growth analysis, also known as group-based trajectory models
Longitudinal latent class analysis

In this paper, we provide an overview of these three approaches and compare them in terms of assumptions, feasibility, and interpretation of the derived classes using mixed effects models as a reference. Another class of methods for the identification of latent trajectories are generalizations of cluster analysis (e.g., extentions of k-means clustering to longitudinal data [13]). As these methods do not invoke models, but rather rely on algorithms to classify individuals, they are not considered here. Their performance, however, has been found to be closely related to that of latent class growth analysis when trajectories vary smoothly with time [14].

To discuss the practical implications of adopting each of these modelling approaches above, and to illustrate how differences in resulting classes may derive, we analyse data derived from the Avon Longitudinal Study of Parents and Children (ALSPAC [15, 16]).

Latent class trajectory models

Mixed effects models

Mixed effects models when applied to longitudinal data, relate outcomes collected on the same individual to their observation times, allowing for the shape of this relationship to vary across individuals. Consider a single outcome variable, $Z_{ij}$ , observed on individual i at times $t_{ij}$ , where i = 1, 2, …, N, and j = 0, 1, …, J. A typical specification of a mixed effects models for continuous outcomes, assuming a linear relationship with time, and the same observation times for all individuals, $t_{j},$ is

Z_{ij} = β_{0 i} + β_{1 i} t_{j} + ε_{ij},

where $β_{0 i}$ and $β_{1 i}$ are individual-specific coefficients, which have fixed ( $β_{0}$ and $β_{1}$ ) and random ( $u_{0 i}$ and $u_{1 i}$ ) components, with $β_{0 i} = β_{0} + u_{0 i}$ and $β_{1 i} = β_{1} + u_{1 i}$ . The fixed coefficients $β_{0}$ and $β_{1}$ are shared by all individuals, while the error terms $u_{i} = (u_{0 i}, u_{1 i})$ are unobserved random variables that capture the individual departures from the population average trajectory, $(β_{0} + β_{1} t_{j})$ . The error terms $u_{i}$ are usually assumed to be jointly normally distributed with mean zero and free covariance matrix $Ω_{u},$ and the residual errors $ε_{ij}$ to be independently and normally distributed, conditionally on $u_{i}$ and t, with constant variance $σ_{ε}^{2}$ . The $ε_{ij}$ capture the distance between the observed data for the i-th individual to the true individual-specific trajectory, $(β_{0 i} + β_{1 i} t_{j})$ (Fig. 1a). Here we consider $t_{j}$ to indicate the actual observation time, so that the relationship with time is properly captured. When information is gathered in terms of waves, as in panel data, we would recommend translating this information into an appropriate time-scale.

Fig. 1 — Graphical representation of alternative longitudinal models: a mixed effects model; b growth mixture model (GMM); c latent class growth analysis (LCGA); d longitudinal latent class analysis (LLCA). Black line: population mean trajectory; blue line: individual-specific trajectory; red and green lines: class-specific trajectories; red and green triangles: class-specific values; x: observations for individual i

When $Z_{ij}$ is an ordered categorical variable, with (K + 1) categories, a mixed effects model is usually specified in terms of a latent continuous variable $Z_{ij}^{^{'}}$ specified as

Z_{ij}^{^{'}} = β_{0 i} + β_{1 i} t_{j} + ϵ_{ij},

where $β_{0 i}$ and $β_{1 i}$ are defined as before but with the independent error $ϵ_{ij}$ following a logistic distribution with mean 0 and variance $\frac{π^{2}}{3}$ (where $π is the$ constant representing the ratio of a circle’s circumference over its diameter). The observed categorical variable $Z_{ij}$ is assumed to have been generated from this latent variable according to unobserved cut-points (“thresholds”) $τ_{k}$ , k = 1, …, K, with $Z_{ij} = 1$ if $Z_{ij}^{^{'}} \leq τ_{1}$ ; $Z_{ij} = 2$ if $τ_{1} < Z_{ij}^{^{'}} \leq τ_{2}$ ; …; $Z_{ij} = (K + 1)$ if $Z_{ij}^{^{'}} > τ_{K}$ . The thresholds are the expected values of the latent variable $Z_{ij}^{^{'}}$ at which an individual transitions from a value k to a value (k + 1) on the categorical outcome variable $Z_{ij}$ .

Generalisations of models (1) and (2) that include non-linear relationships with time are straightforward, likewise models where the coefficients for these additional non-linear terms include random components, as in

Z_{ij} = β_{0 i} + β_{1 i} t_{j} + β_{2 i} t_{j}^{2} + ε_{ij} .

Estimation is generally by maximum likelihood (ML, or restricted maximum likelihood when the study is small [17]), with the estimation-maximisation algorithm used in the presence of missing outcome data under the missing at random (MAR) assumption [18].

When individuals are observed at the same times $t_{j}$ , as assumed here, there is an alternative formalization of mixed effects models that arises from to the confirmatory factor analysis framework (and, more generally, the structural equation modelling [SEM] literature). This framework views the random coefficients of a mixed effects model as latent factors, “manifested” by the joint distribution of the longitudinal observations, $Z_{i} = (Z_{i 1}, Z_{i 2},, \dots, Z_{iJ})$ [19]. Model (1) for example could also be written as

Z_{ij} = β_{0 i} + λ_{j} β_{1 i} + ε_{ij},

where $β_{0 i}$ and $β_{1 i}$ are the original individual-specific coefficients that are now viewed as latent variables. The regression coefficients $λ_{j}$ (referred to as “factor loadings” in the SEM literature) are not estimated but are pre-determined according to the timing of the observations. For model (1) the factor loadings would be: $λ_{1} = 0, λ_{2} = (t_{2} - t_{1}), λ_{3} = (t_{3} - t_{2}),$ etc. This representation of model (1) can be viewed graphically in Fig. 2a, where the factor loadings are shown above the arrows linking the latent individual-specific coefficients to the observed data. Adopting this approach has several advantages, in particular the option of using SEM software for estimation, and also extending the model for example by allowing the error terms $ε_{ij}$ to have time-specific variances, $σ_{ε j}^{2}$ , or more complex extensions as discussed below.

Fig. 2 — Structural equation modelling representation of: a mixed effects model; b growth mixture model; c growth mixture model with predictors; d growth mixture model with distal outcome

Growth mixture models

Growth mixture models assume that there are multiple mixed effects models, each representing a subgroup (i.e. “class”) of trajectories that share a common mean and shape (with, potentially, class-specific error variance structures) [20, 21]. Growth mixture models are therefore generalisations of mixed effects models (Fig. 1b).

Formally, they are specified as follows. Let C indicate the number of latent classes in the population, distributed with probabilities $p_{c}$ , c = 1,…, C, with $0 \leq p_{c} \leq 1$ and $\sum_{c = 1}^{C} p_{c} = 1$ [22]. As the latent classes are unknown, we model the observed data using as a mixed effects model specific to the latent class c each individual belongs to, with the joint distribution of the data then being a mixture of these distributions, weighted by the probability of each class, $p_{c}$ . For example, a growth mixture model generalisation of model (1) is,

Z_{i j | c} = β_{0 i}^{c} + β_{1 i}^{c} t_{j} + ε_{ij}^{c}, for c = 1, \dots, C,

where $β_{0 i}^{c} = β_{0}^{c} + u_{0 i}^{c}$ , $β_{1 i}^{c} = β_{1}^{c} + u_{1 i}^{c}$ , $u_{i}^{c} = (u_{0 i}^{c}, u_{1 i}^{c})$ and $ε_{ij}^{c}$ are defined as before, although specifically for each class c. The graphical representation of this model is shown in Fig. 2b. Assuming that all classes have the same error structure may be unrealistic; therefore class-specific covariances $Ω_{u}^{c}$ for the individual-level error terms are often considered.

For categorical variables, we would specify $Z_{i j | c}^{'} = β_{0 i}^{c} + β_{1 i}^{c} t_{j} + ϵ_{ij}^{c}$ , with $ϵ_{ij}^{c}$ following a logistic distribution.

Because the number of classes is unknown, the estimation is carried out conditionally on a pre-specified number of classes. Estimation is by Maximum likelihood (ML) with the expectation–maximization (EM) algorithm because the classes are unobserved [23]. As several local maxima for the likelihood are expected to be found with such complex models, multiple starting points for the estimation routine are recommended, before maximization is deemed to have been reached [19]. Following estimation, posterior class probabilities can be derived and used to assign individuals to classes according to their largest value (“modal assignment”), or to weigh individuals when calculating predicted class frequencies.

In order to identify the number of classes that best fits the data, a number of goodness-of-fit criteria are compared. Those commonly recommended in the literature [24] are the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC), and its sample size-corrected version (c-BIC). For each of these, lower scores indicate (relatively) better fitting models. The parametric bootstrap likelihood ratio test (BLRT) has also been recommended as an additional comparative tool given its performance in simulations [25]. However it is disadvantaged by being computationally intensive and affected by poor performance in small samples [19]. These goodness-of-fit criteria do not necessarily agree, in the sense that they may not all point to selecting the same model. Hence, additional considerations are often invoked, such as interpretability of the latent trajectories, and the avoidance of too small classes (e.g. < 5% of the study population) that may lead to lack of reproducibility of the results.

The quality of the classification of a model, the so-called “entropy”, is also often reported, with values close to 1 indicating good classification. Specifically, this is a summary measure that captures how well class membership is predicted given the observed outcomes. However, this interpretation requires the model to be correct, and thus entropy values should not be overinterpreted [25].

As described, these criteria are applied sequentially on models with increasing numbers of classes using the same dataset. It has been suggested that cross-validation should be used instead [26]. This would involve fitting the model with a given number of classes on a subset of the data, followed by using the selected model on the remaining data and assessing its goodness of fit. A more sophisticated version of this would involve k-fold cross-validation. This approach, however, requires larger datasets than those usually available in typical epidemiological studies and would still depend on which goodness of fit criterion is used.

Latent class growth analysis

Latent class growth analysis [27] specifies models that are similar to growth mixture models. However, latent class growth analysis models assume no individual-level random variation within each class, and therefore individuals assigned to the same class share exactly the same trajectory.

Formally, latent class growth analysis specifies models with the same structure as model (5) but with fixed effects regression coefficients, albeit specific to each class. Denoting a latent class growth analysis class by s, this model is expressed as,

Z_{i j | s} = β_{0}^{s} + β_{1}^{s} t_{j} + e_{ij}^{s}, s = 1, \dots, S

where Z is a continuous variable and $e_{ij}^{s}$ are independently distributed error terms. Because there is no within-cluster variation (i.e. there are no $u_{i}^{s}$ and the class-specific coefficients $β_{0}^{s}$ and $β_{1}^{s}$ are the same for every member of class s), these error terms capture random perturbations of each observed data point from their class specific trajectory (Fig. 1c). The assumption that these errors are independently distributed, as implicit in most software [28, 29], may be unrealistic however as one would expect individual trajectories that belong to the same class to be heterogeneous and the individual-specific departures from the class-specific trajectories to be correlated. Departures from this assumption can have consequences, as discussed in “Assumptions”.

Longitudinal latent class analysis

These models are a variation of latent class growth analysis models that ignores the longitudinal nature of the data. The model for an individual belonging to the longitudinal latent class r is specified as,

Z_{i j | r} = β_{0}^{r} + \sum_{j = 2}^{J} β_{j}^{r} I_{t = t_{j}}^{} +_{} e_{ij}^{r}, r = 1, \dots, R,

where $I_{t = t_{j}}^{}$ are dummy (0/1) indicators of the times when $Z_{ij}$ is observed (Fig. 1d). Hence, latent classes are identified without exploiting the information on the time order of the observations, but also without forcing any parametric relationship between the outcomes and time.

Comments

Assumptions

Mixed effects models, growth mixture models, and latent class growth analysis rely on parametric assumptions for the relationship between the observed outcomes and time. These models, together with longitudinal latent class analysis, rely on distributional assumptions for the error terms. Mixed effects models and growth mixture models make additional assumptions regarding the within-subject correlations (parametrized by $Ω_{u}$ and $Ω_{u}^{c}$ , respectively). Violations of these assumptions have different consequences depending on the type of outcome and modelling approach. Misspecified distributions and correlation structures in mixed effects models do not impact on the consistency of the fixed effect estimates when the observed outcomes are continuous, but they may bias inferences [1]. If the outcomes are categorical, however, bias will affect the fixed effects estimates as well [1, 30]. Non-parametric specifications of the random effect distributions have been proposed to deal with these issues [31], as described below.

The impact of these misspecifications may also influence the estimated number of classes of a growth mixture model. If, the assumed covariance structure is too simple, the number of classes may be greater because more are needed to capture the variability in the data [32]. For this reason, and as demonstrated in simulations [33], when selecting the number of classes for growth mixture models, one should in principle allow for general specifications, e.g., with class-specific covariance matrices $Ω_{u}^{c} ._{}$ and time-specific residual error variance $σ_{ε j}^{2}$ [33]. How general these matrices can be, will be limited by the study size and may not be suitable with binary outcome data when their prevalence is low [33].

The assumption of independence for the residual errors $e_{ij}^{s}$ , conditional on class s, which is usually made when performing latent class growth analysis, is most likely to be incorrect, especially when there are several observations per individual. Violations may lead to biased estimates of the class-specific regression coefficients [33] unless the classes are well separated, e.g. entropy > 0.8 [32]. Such bias is more prominent when the true covariance structure is complex, the study size is small (< 500), or the outcomes are binary [30].

Another assumption often made with longitudinal data is that of the outcome data being missing at random (MAR) [18]. This assumes that the propensity of missing an observation, possibly because of an individual dropping-out of the study, depends on the observed data only. If met, model estimation by ML (for mixed effects models), or ML with EM (for growth mixture models), based on incomplete data is not affected by selection bias [17, 34]. It is often the case, however, that missingness depends on other variables, most commonly social factors. In such circumstances, one could include the predictors of missingness in the model, as discussed in “Relating latent classes to earlier explanatory variables or to later outcomes”.

Interpretation

In interpreting the results of whichever approach, one has to take into consideration all of the issues described above. Of note is that latent class growth analysis models were initially proposed as a semi-parametric version of mixed effects models where the variation in trajectories around a single class is approximated by a number of fixed trajectories, as opposed to assuming jointly normally distributed random effects [35]. In other words, the classes are used to capture the overall variation so that, when the data are truly from a mixture of K classes (as in growth mixture models), a larger number of classes will be needed to extract the main features of the data when adopting latent class growth analysis [23, 27]. Thus, interpreting the resulting classes as if they had a theoretical underpinning would be inappropriate in most settings. In contrast, growth mixture models distinguish the typologies represented by the latent classes from the within-class variation. Again, however interpretation should be cautious because of their stronger parametric assumptions.

Analytical strategy

These observations highlight the need for a comprehensive set of model specifications to be considered and then compared, ranging from single class mixed effects models to growth mixture models and then latent class growth analysis and longitudinal latent class analysis models, before concluding whether there are multiple trajectory types and what they capture.

As a first step, we would recommend fitting the most general mixed effects model that the data can identify in order to investigate the extent of between-individual heterogeneities. The distributions and correlations of the predicted random effects from such a model could then be used to aid the interpretation of the best fitting growth mixture and best fitting latent class growth analysis (or longitudinal latent class analysis) models. Comparing the classes predicted from these different model specifications, numerically and/or graphically, would also help clarify whether similar typologies emerge when adopting different modelling approaches.

If, even after allowing for the fact that some of the classes from a latent class growth analysis model actually will aim to capture the distribution of individual trajectories within a particular “true” class, little agreement is found, one should investigate whether model misspecifications might explain the discrepancies. As discussed in “Assumptions”, these may lead to biased parameter estimates and/or incorrect selection of the number of classes. Examination of the distributions of the estimated time-specific residuals derived for each class might indicate for example that the model is not properly reflecting the data if they were found to be skewed. This would happen for example if the relationship with time were misspecified in one of the classes.

Relating latent classes to earlier explanatory variables or to later outcomes

Once classes are derived, it is possible to relate them to earlier explanatory variables or later outcomes. Any inferences drawn on these relationships, however, should account for the fact that the classes are not directly observed but derived under certain modelling assumptions. There are two main approaches to achieve this.

The first approach—the “1-step approach”—consists of extending the original model for the latent trajectories to include associations with the explanatory or the later outcome variables of interest. This is easily achieved within an SEM framework (Fig. 2c, d), with the joint estimation of the latent classes and their relationship with other variables (respectively the “measurement” and “structural” parts of the SEM model) accounting for the uncertainties of class assignment.

The second commonly used approach breaks down the estimation into three steps (“3-step approach”). The best fitting latent trajectory model is fitted (1st step) and then used to assign individuals to their most likely class using the predicted posterior probabilities of belonging to each class (2nd step). These classifications are then included as outcomes or predictors in the relevant new analyses, accounting for the uncertainty of the classification performed in step 2 (via the probabilities of the true class given the assigned class estimated in step 1) [36].

The first approach is not generally recommended when the aim is to relate explanatory variables to the latent classes because the identification of the latent classes is potentially affected by which variables are included in the model [37]. One exception to this concern is when the reason for including the covariates in a 1-step analysis is to meet the MAR assumption when the longitudinal outcome data are affected by missingness. In this case, one would want to condition on these covariates to avoid the bias that would arise from analysing incomplete data.

More serious concerns arise when relating latent classes to a later outcome, because in the latter case the outcome has the same direction of association with the classes as the longitudinal variables that lead to their identification (see Fig. 2d; [36]).

When the entropy of the latent class model is greater than 0.80, results from the 1- or 3-step approach have been found to be similar [36]. In practice, however, the 1-step approach may be unfeasible, especially when the longitudinal data are categorical, so that the 3-step approach should be adopted (with multiple imputation if missingness depends on covariates, and with the selection of the number of classes made from the most frequently best solution among the imputed sets).

The ALSPAC study

Participants

We analysed data from the Avon Longitudinal Study of Parents and Children (ALSPAC), a population based, longitudinal cohort of mothers and their children born in the southwest of England, to illustrate the different modelling strategies. Details of the study are given elsewhere [15, 16]. Briefly, all pregnant women expected to give birth between the 1st April 1991 and 31st December 1992 were invited to enrol in the study. From all pregnancies (n = 14,676), 14,451 mothers opted to take part, and 13,988 of their children were alive at 1 year. Analyses are restricted to girls only for simplicity, after randomly selecting one child per set when birth was from a multiple pregnancy. Please note that the study website contains details of all the data that are available through a fully searchable data dictionary and variable search tool: http://www.bristol.ac.uk/alspac/researchers/our-data/.

Variables

Longitudinal variables

We aimed to model the repeated measures of a continuous and an ordinal variable:

Body mass index (BMI; in kg/m²), objectively measured up to six times when participants were (around) 8, 10, 11, 12, 13, and 16 years. Height was measured to the nearest millimetre with the use of a Harpenden Stadiometer (Holtain Ltd.). Weight was measured with a Tanita Body Fat Analyzer (Tanita TBF UK Ltd.) to the nearest 50 g.
Parental reporting of fussy eating consisted of responses to the question “How worried are you because your child is choosy?” for which there were three possible answers: “No/did not happen”, “Not worried”, and “A bit/greatly worried”. These were observed up to eight times during the first ten years of life, specifically at around 1.3, 2.0, 3.2, 4.6, 5.5, 6.9, 8.7, and 9.6 years. A more detailed description of these data can be found in Herle et al. [38].

Explanatory variable

Birth weight (in kg) was used as the explanatory variable of interest in our examples. This variable was available on 4462 (99%) girls among those with at least one longitudinal BMI and on 5750 (99%) girls with at least one fussy eating measurement. Mean birth weight was 3.37 kg (SD = 0.51) and 3.36 kg (SD = 0.51) in these two subgroups. It was internally standardized using these means and SDs in the analyses.

Later outcome

Body fat mass index (FMI) [39] at age 18 years was the later outcome of interest. It was defined as the ratio of total body fat mass (in kg) over height (in metres) squared. Body fat was objectively measured using the Tanita Body Fat Analyser (Model TBF 401A) and height as described above. Data on FMI were available on 2443 (55%) girls with at least one longitudinal BMI measurement and 2464 (42%) girls with at least one longitudinal fussy eating measurement. Mean FMI was 21.57 kg (SD = 9.56) and 21.62 kg (SD = 9.52), respectively.

Computer code

Examples of Mplus and Stata code used for these analyses can be found in https://github.com/MoritzHerle/Identifying-typical-trajectories-in-longitudinal-data. Some of these analyses can also be performed in R (with the lcmm package); the relevant code can also be found in this depository.

Ethics

The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008. Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee and the Local Research Ethics Committees.

Data description

Figure 3a shows the observed individual BMI trajectories for all participants with at least one BMI observation (“spaghetti plot”), while Fig. 3b shows the equivalent plot (“lasagne plot”) for the categorical fussy eating variable, with a change in colour along time representing a change in category. Both variables show considerable and increasing variation over time, as well as an increasing frequency of missing data. Details of the completeness of the longitudinal BMI data are given in Supplementary Tables 1 and 2; they highlight that the majority of the participants included in these analyses had six data points and are therefore quite complete. A total of 4517 girls had at least one longitudinal BMI measure and 5824 girls had at least one longitudinal parental report of fussy eating. In the following, we assume that MAR was satisfied and included in the analyses all girls with at least one longitudinal observation of the relevant outcome variable.

Longitudinal phenotypes