Skip to main content
Applied Psychological Measurement logoLink to Applied Psychological Measurement
. 2016 Nov 15;41(2):155–157. doi: 10.1177/0146621616676483

Obtaining Fixed Effects for Between-Within Designs in Explanatory Longitudinal Item Response Models Using Mplus

Sun-Joo Cho 1,, Youngsuk Suh 2
PMCID: PMC5978622  PMID: 29881085

Abstract

Between-within designs that include a person group (i.e., a between-subjects factor) and repeated measures of binary responses over time (i.e., a within-subjects factor) are common in educational and psychological research. This software note describes how explanatory item response models can be specified to analyze longitudinal item-level data to detect fixed effects in Mplus for between-within designs. In particular, a necessary parameter transformation is illustrated in detail to obtain the fixed effects in Mplus.

Keywords: between-within design, longitudinal item response model

Description

Explanatory item response models (EIRM; De Boeck & Wilson, 2004) can be used when the interest is in the comparison of means between groups on the latent true score scale for longitudinal item-level data. The aim of this note is to present how fixed effects in the between-within design (i.e., interaction effects between the within-subjects factor and the between-subjects factor or the main effect of the between-subjects factor or the main effect of the within-subjects factor) can be estimated in a latent true score scale using Mplus (Muthén & Muthén, 1998-2016). The authors focus on a design that has one between-subjects factor and one within-subjects factor, and a unidimensional test is assumed at each time point.

Motivation: Software Comparisons

In fitting EIRM, the fixed effects for the between-within design can be specified directly in the SAS NLMIXED procedure (e.g., Nandakumar & Hotchkiss, 2012, for item response theory [IRT] parameter estimation), the glmer function in the lme4 package (Bates., Mächler, Bolker, & Walker, 2015), and the FLIRT R package (Jeon, Rijmen, & Rabe-Hesketh, 2014). However, parameter transformation is required to obtain these effects in Mplus because they cannot be estimated directly. Compared with other software packages, Mplus provides flexibility in measurement model specification, estimation method options, and adjustment for stratification (see the supplementary note for details). To take advantage of these flexibilities in Mplus, it may be instructive to show how Mplus parameters can be transformed to fixed effects in between-within designs.

EIRM Specification for Binary Responses

EIRM

A measurement model in EIRM is a four-parameter version of the longitudinal item response model (Anderson, 1985), described as follows:

P(yjgti=1|θjgt)=ci+(dici)11+exp[(aiθjgtbi)],

where y represents the binary item response, θjgt is a latent variable for person j in a group g at a time point t, ai is a time-invariant item discrimination for item i, bi is a time-invariant item location for item i, ci is a time-invariant lower asymptote item parameter for item i, and di is a time-invariant upper asymptote item parameter for item i. Three-parameter EIRM can be obtained with di=0 in Equation 1. Two-parameter EIRM can be specified with ci=0 and di=0 in Equation 1. The structural model of EIRM for θjgt includes the effects of the between-subjects factor (Ag), the within-subjects factor (Wt), and their interaction (AgWt):

θjgt=γ0+γ1g·Ag+γ2t·Wt+γ3gt·AgWt+εjgt,

where γ0 is the fixed parameter for the intercept or the grand mean, γ1g is the fixed parameter for the between-subjects factor, γ2t is the fixed parameter for the within-subjects factor, γ3gt is the fixed parameter for the interaction between the within-subjects factor and between-subjects factor, and εjgt is the random residual at each time point, following a multivariate normal (MVN) distribution (εjg=[εjg1,,εjgT]~MVN(0(T×1),Σ(T×T))). When the item location parameter (bi in Equation 1) is a fixed effect, the grand mean (γ0 in Equation 2) cannot be estimated. The grand mean is the average of the item location parameter estimates across items. Thus, it can be calculated as γ0=i=1Ibi^/I.

Specifying fixed effects in EIRM

With the unweighted effect codes of Ag, Wt, and AgWt (an example of the unweighted effect codes is provided in the supplementary note), the fixed effects (γ1g, γ2t, and γ3gt) can be directly obtained. EIRM is identified with the fixed effects using effect codes: g=1Gγ1g=0, t=1Tγ2t=0, g=1Gγ3gt=0 for all t=1,,T, and t=1Tγ3gt=0 for all g=1,,G.

In addition to the constraints, an additional model identification constraint on item discriminations is required for two-parameter or three-parameter or four-parameter EIRM. In this study, item discrimination for one of the items (e.g., the first item) is fixed to 1.

Implementation in Mplus

The measurement model specified in Equation 1 can be specified in Mplus. The structural model in Mplus is first described and then the parameter transformation is specified to obtain the fixed effects, γ1g, γ2t, and γ3gt, in Equation 2 based on the parameters of the structural model in Mplus, as described below.

In the structural model, a covariate, Ag (i.e., the between-subjects factor), can be added to explain θjgt:

θjgt=μt+ωgt·Ag+εjgt,

where μt is a mean at each time point, ωgt is the fixed parameter for Ag at each time point, and εjgt is the random residual at each time point.

Specifying fixed effects in Mplus

The following three constraints can be set to obtain γ1g, γ2t, and γ3gt in Equation 2, using parameters in the structural model with Mplus, μt and ωgt in Equation 3: (a) t=1Tωgt/T=γ1g, (b) μt=γ2t with a constraint, μT=t=1T1μt, and (c) in unweighted effect coding, γ3gt represents the discrepancy of the corresponding group mean (ωgt) from the unweighted average group mean (γ1g): ωgt=γ3gt+γ1gγ3gt=ωgtγ1g.

Testing fixed effects

To determine whether the fixed effects in EIRM are statistically significant, null hypothesis significance testing for the effect has been used in most EIRM applications (De Boeck & Wilson, 2004). Test statistics (i.e., the z-statistic in the current study) and corresponding p values are used to test whether the population value of the effect differs from a specified value (generally 0). Standard errors of the derived estimates for γ1g, γ2t, and γ3gt are calculated using a delta method in Mplus (support@statmodel.com, personal communication, April 19, 2016).

Availability

A supplementary note including an illustrative example, Mplus code, its detailed description, and an example dataset is available at no charge by sending a request by emailing a request to sj.cho@vanderbilt.edu.

Footnotes

Declaration of Conflicting Interests: The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding: The author(s) received no financial support for the research, authorship, and/or publication of this article.

References

  1. Anderson E. B. (1985). Estimating latent correlations between repeated testings. Psychometrika, 50, 3-16. [Google Scholar]
  2. Bates D., Mächler M., Bolker B., Walker S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67, 1-48. [Google Scholar]
  3. De Boeck P., Wilson M. (2004). Explanatory item response models: A generalized linear and nonlinear approach. New York, NY: Springer. [Google Scholar]
  4. Jeon M., Rijmen F., Rabe-Hesketh S. (2014). Flexible item response theory modeling with FLIRT. Applied Psychological Measurement, 38, 404-405. [Google Scholar]
  5. Muthén L. K., Muthén B. O. (1998-2016). Mplus [Computer program]. Los Angeles, CA: Author. [Google Scholar]
  6. Nandakumar R., Hotchkiss L. (2012). PROC NLMIXED: For estimating parameters of IRT models. Applied Psychological Measurement, 36, 60-63. [Google Scholar]

Articles from Applied Psychological Measurement are provided here courtesy of SAGE Publications

RESOURCES