Abstract
Between-within designs that include a person group (i.e., a between-subjects factor) and repeated measures of binary responses over time (i.e., a within-subjects factor) are common in educational and psychological research. This software note describes how explanatory item response models can be specified to analyze longitudinal item-level data to detect fixed effects in Mplus for between-within designs. In particular, a necessary parameter transformation is illustrated in detail to obtain the fixed effects in Mplus.
Keywords: between-within design, longitudinal item response model
Description
Explanatory item response models (EIRM; De Boeck & Wilson, 2004) can be used when the interest is in the comparison of means between groups on the latent true score scale for longitudinal item-level data. The aim of this note is to present how fixed effects in the between-within design (i.e., interaction effects between the within-subjects factor and the between-subjects factor or the main effect of the between-subjects factor or the main effect of the within-subjects factor) can be estimated in a latent true score scale using Mplus (Muthén & Muthén, 1998-2016). The authors focus on a design that has one between-subjects factor and one within-subjects factor, and a unidimensional test is assumed at each time point.
Motivation: Software Comparisons
In fitting EIRM, the fixed effects for the between-within design can be specified directly in the SAS NLMIXED procedure (e.g., Nandakumar & Hotchkiss, 2012, for item response theory [IRT] parameter estimation), the glmer function in the lme4 package (Bates., Mächler, Bolker, & Walker, 2015), and the FLIRT R package (Jeon, Rijmen, & Rabe-Hesketh, 2014). However, parameter transformation is required to obtain these effects in Mplus because they cannot be estimated directly. Compared with other software packages, Mplus provides flexibility in measurement model specification, estimation method options, and adjustment for stratification (see the supplementary note for details). To take advantage of these flexibilities in Mplus, it may be instructive to show how Mplus parameters can be transformed to fixed effects in between-within designs.
EIRM Specification for Binary Responses
EIRM
A measurement model in EIRM is a four-parameter version of the longitudinal item response model (Anderson, 1985), described as follows:
where y represents the binary item response, is a latent variable for person j in a group g at a time point t, is a time-invariant item discrimination for item i, is a time-invariant item location for item i, is a time-invariant lower asymptote item parameter for item i, and is a time-invariant upper asymptote item parameter for item i. Three-parameter EIRM can be obtained with in Equation 1. Two-parameter EIRM can be specified with and in Equation 1. The structural model of EIRM for includes the effects of the between-subjects factor (), the within-subjects factor (), and their interaction ():
where is the fixed parameter for the intercept or the grand mean, is the fixed parameter for the between-subjects factor, is the fixed parameter for the within-subjects factor, is the fixed parameter for the interaction between the within-subjects factor and between-subjects factor, and is the random residual at each time point, following a multivariate normal (MVN) distribution (). When the item location parameter ( in Equation 1) is a fixed effect, the grand mean ( in Equation 2) cannot be estimated. The grand mean is the average of the item location parameter estimates across items. Thus, it can be calculated as .
Specifying fixed effects in EIRM
With the unweighted effect codes of , , and (an example of the unweighted effect codes is provided in the supplementary note), the fixed effects (, , and ) can be directly obtained. EIRM is identified with the fixed effects using effect codes: , , for all , and for all .
In addition to the constraints, an additional model identification constraint on item discriminations is required for two-parameter or three-parameter or four-parameter EIRM. In this study, item discrimination for one of the items (e.g., the first item) is fixed to 1.
Implementation in Mplus
The measurement model specified in Equation 1 can be specified in Mplus. The structural model in Mplus is first described and then the parameter transformation is specified to obtain the fixed effects, , , and , in Equation 2 based on the parameters of the structural model in Mplus, as described below.
In the structural model, a covariate, (i.e., the between-subjects factor), can be added to explain :
where is a mean at each time point, is the fixed parameter for at each time point, and is the random residual at each time point.
Specifying fixed effects in Mplus
The following three constraints can be set to obtain , , and in Equation 2, using parameters in the structural model with Mplus, and in Equation 3: (a) , (b) with a constraint, , and (c) in unweighted effect coding, represents the discrepancy of the corresponding group mean () from the unweighted average group mean (): .
Testing fixed effects
To determine whether the fixed effects in EIRM are statistically significant, null hypothesis significance testing for the effect has been used in most EIRM applications (De Boeck & Wilson, 2004). Test statistics (i.e., the z-statistic in the current study) and corresponding p values are used to test whether the population value of the effect differs from a specified value (generally 0). Standard errors of the derived estimates for , , and are calculated using a delta method in Mplus (support@statmodel.com, personal communication, April 19, 2016).
Availability
A supplementary note including an illustrative example, Mplus code, its detailed description, and an example dataset is available at no charge by sending a request by emailing a request to sj.cho@vanderbilt.edu.
Footnotes
Declaration of Conflicting Interests: The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding: The author(s) received no financial support for the research, authorship, and/or publication of this article.
References
- Anderson E. B. (1985). Estimating latent correlations between repeated testings. Psychometrika, 50, 3-16. [Google Scholar]
- Bates D., Mächler M., Bolker B., Walker S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67, 1-48. [Google Scholar]
- De Boeck P., Wilson M. (2004). Explanatory item response models: A generalized linear and nonlinear approach. New York, NY: Springer. [Google Scholar]
- Jeon M., Rijmen F., Rabe-Hesketh S. (2014). Flexible item response theory modeling with FLIRT. Applied Psychological Measurement, 38, 404-405. [Google Scholar]
- Muthén L. K., Muthén B. O. (1998-2016). Mplus [Computer program]. Los Angeles, CA: Author. [Google Scholar]
- Nandakumar R., Hotchkiss L. (2012). PROC NLMIXED: For estimating parameters of IRT models. Applied Psychological Measurement, 36, 60-63. [Google Scholar]
