Published in final edited form as: Organ Res Methods. 2011 Jan;14(1):91–115. doi: 10.1177/1094428110374649

The Psychometric Latent Agreement Model (PLAM) for Discrete Latent Variables Measured by Multiple Items

Levent Dumenci

Abstract

The Psychometric Latent Agreement Model (PLAM) is proposed for estimating the subpopulation membership of individuals (e.g., satisfactory performers vs. unsatisfactory performers) at discrete levels of multiple latent trait variables. A binary latent Type variable is introduced to take account of the possibility that, for a given set of observed variables, the latent group memberships of some individuals are indeterminate. The latent Type variable allows for separating individuals who can reliably be assigned to satisfactory versus unsatisfactory performers classes from those individuals whose ratings do not contain the necessary information to make the class assignment possible for a particular set of rating items. Agreements among discrete latent trait variables are also estimated. The PLAM was illustrated with two examples using real data on behavioral rating measures. One example involved ratings of two behavioral constructs by a single rater type, whereas the other involved ratings of one construct by three rater types. Implications were presented for using behavioral ratings to determine the subpopulation membership, such as qualified versus unqualified groupings in hiring decisions and pass versus fail groupings in performance evaluations.

Keywords: latent class analysis, latent agreement, latent Type variable, survey research, quantitative multivariate research, measurement design, research design


Since the seminal paper by Charles Spearman (1904) more than a century ago, the latent variable conceptualization of behavioral attributes has been widely used in a variety of measurement contexts. A latent variable is conceptualized as having a continuous distribution in traditional measurement models (i.e., models originating from factor analysis [FA] and item-response theory [IRT]). Continuously distributed latent variables have been found useful in conceptualizing attitudes, personality, achievement, attainment, intelligence, and many other behavioral and organizational attributes, where the purpose of testing is to determine the location of individuals on a latent continuum. In addition to locating individuals on latent continua, rating scales are sometimes used to make categorical judgments about individuals. The purpose of this study was to propose a model-based approach, that is, the Psychometric Latent Agreement Model (PLAM), for estimating group membership of individuals at the latent variable level while taking account of the possibility that some individuals’ observed responses fail to distinguish between the categories of discrete latent trait variables.

Traditionally, supervisors rate employees’ performance on one or more dimensions during the evaluation process (e.g., Funderburgh & Levy, 1997). For each dimension, supervisory ratings are used to locate employees on a performance continuum as accurately as possible. This practice has limitations. First, supervisory ratings are often collected to make categorical judgments about employees, such as promotion, continuation, or termination of employment. Yet there is no one-to-one correspondence between performance measured on a continuum and performance categories. Setting cut scores on the continuum is often the method of choice for making this translation. Despite their widespread use and intuitive appeal, however, cut scores may cause more problems than they resolve. First, test scores obtained from traditional measurement models assume that individuals represent a sample drawn from a single population (i.e., the homogeneity assumption), yet cut scores assume population heterogeneity. It seems contradictory to assume that the same population is simultaneously homogeneous and heterogeneous. Second, group assignments based on a cut score imply categorical certainty (e.g., unsatisfactory performance below the cut score and satisfactory performance above it). It may be more realistic to assume that employees who score close to the cut point belong to either category with less certainty than those who score farther away from it. Third, equally well-designed standard-setting procedures (e.g., consensus among experts, agreement with an external criterion) may not necessarily yield the same cut scores. Evidence supporting a particular cut score does not necessarily invalidate all other possible cut scores. Finally, the cut score is not a part of the measurement model that is used to estimate the continuous performance scores. Consequently, inferences (i.e., test score interpretations) made from the cut score are not justified by the measurement model. Most recently, Rupp, Templin, and Henson (2010) advocated model-based approaches involving discrete latent variables as an alternative to the cut score method.

The second limitation of traditional supervisory performance ratings is that they provide only a one-sided view of employees’ performance (Campbell & Fiske, 1959; Lawler, 1967). To overcome this limitation, the multirater approach, also known as 360° feedback, has been widely adopted in organizational research and practice (Brett & Atwater, 2001). The 360° feedback system involves ratings from individuals at different levels of the organizational hierarchy, including employees, coworkers, supervisors, and subordinates (Funderburgh & Levy, 1997). Edwards and Ewen (1996) reported that the vast majority of Fortune 500 companies have adopted a form of the 360° feedback system. The transition from traditional supervisory ratings to the 360° feedback system effectively overcomes the one-sided view of employee performance (Maylett, 2009) but, at the same time, exacerbates the problem of making categorical performance judgments from the commonly employed continuously distributed rating scores. The discrete latent variable framework is well suited to address this issue. This study illustrates the extension of traditional latent class analysis involving one discrete latent variable to latent class models with two or more discrete latent variables, each of which represents the rater-specific classes of individuals.

The 360° feedback system highlights the issue of agreement among raters. For example, Brett and Atwater (2001) reported that correlations among self, boss, peer, and direct-report ratings are low to moderate (range: .04–.48). Their subsequent tests of relationships between rating level and the accuracy of, and reactions to, ratings further revealed that the predictive relationships are rater-specific. The use of cut scores further complicates the issue of rater agreement in this multirater context. For example, equally plausible cut scores would not necessarily yield the same magnitude of agreement. Because true group memberships are unknown, the use of sensitivity/specificity analyses and chance-corrected agreement to validate cut scores is of limited value without an error-free external criterion that can serve as a gold standard (Glarus & Kline, 1988; Guggenmoos-Holzmann & Vonk, 1998; Rindskopf & Rindskopf, 1986).

In organizational research, recent advances in testing rater agreement for multi-item scales are illustrated within the continuous observed variable framework by Cohen, Doveh, and Nahum-Shani (2010). Pasisz and Hurtz (2010) discuss between-group differences in within-rater agreement. Most recently, Cheung (2010) proposed a latent congruence model to estimate rater agreement within the confirmatory factor analytic framework with predictors and outcomes of congruence. Several probability models have also been proposed to study rater agreement within the discrete latent variable framework (e.g., Agresti & Lang, 1993; Bergan, Schwarz, & Reddy, 1999; Flaherty, 2002; Guggenmoos-Holzmann & Vonk, 1998; Rindskopf & Rindskopf, 1986; Uebersax & Grove, 1990). By extending Aickin’s (1990) model, Schuster and Smith (2002) proposed the target-type approach to rater agreement within the discrete latent variable framework and showed its conceptual and parametric relations with previously proposed latent agreement models originating from the target-type approach (e.g., Agresti, 1989; Guggenmoos-Holzmann, 1996). The response-error approach has also been widely used to study rater agreement under the discrete latent variable framework (Agresti, 1988; Becker, 1989; Darroch & McCloud, 1986; Dayton & Macready, 1976, 1980; Dillon & Mulani, 1984; Macready & Dayton, 1977; Tanner & Young, 1985). By extending Goodman’s (1979) model, Schuster and Von Eye (2001) proposed a latent agreement model and showed its relations with previously proposed latent agreement models using the response-error approach. Schuster (2006) has provided an overview of response-error and target-type approaches to modeling rater agreement, as well as their strengths and weaknesses.

The one-discrete-observed-variable design has been the dominant paradigm in modeling agreement under the discrete latent variable framework. In this paradigm, raters are asked to independently assign each of N individuals (or objects, targets) to exhaustive and mutually exclusive categories of one observed variable. Observed cross-classification tables between raters then contain the information needed to estimate the parameters of a particular latent agreement model. In contrast, the multiple-discrete-observed-variable paradigm has been dominant in traditional psychometric models, particularly in measuring employee performance, job satisfaction, and a host of behavioral constructs using binary or ordinal (i.e., Likert-type) items, where latent variables are conceptualized as continuously distributed. The modeling approach adopted in the current study is a hybrid of the multiple-discrete-observed-variable paradigm drawn from traditional psychometric models and the discrete latent variable paradigm drawn from rater agreement models.

Although differing in their parameterizations, two kinds of discrete latent variables have been considered in the past. The first kind, labeled the “discrete latent trait,” represents the subpopulation status of individuals. Examples of discrete latent traits include job performance with unsatisfactory, adequate, and excellent categories and job satisfaction with unsatisfied and satisfied categories. The second kind of discrete latent variable, labeled “Type,” represents whether the subpopulation status of individuals is determined by a systematic process (Type = informative) or by a random process (Type = uninformative). For example, a particular group of employees might rate their own performance by responding to a set of items haphazardly for one reason or another; for example, they may think that the self ratings cannot possibly be taken seriously by anyone to make judgments about their job performance. Thus, item responses from this group of employees do not tell us about (i.e., are uninformative regarding) their job performance. The introduction of the latent Type variable allows for testing the presence or absence of such a group of employees in organizational settings. The discrete latent trait and Type variables would both be classified as latent class variables by Bartholomew and Knott (1999) because both have discrete distributions.

In the past, systematic processes have been used to define a subpopulation of individuals whose observed response patterns conform to Guttman’s scaling (i.e., intrinsically scalable type; Goodman, 1975), a latent class model (i.e., obvious type; Schuster & Smith, 2002), and an item-response model (IRT; Gitomer & Yamamoto, 1991). It may be unreasonable to assume that all responses follow a particular stochastic process. Random processes have, therefore, been used to define a subpopulation of individuals whose observed response patterns are inconsistent with systematic processes, such as the intrinsically unscalable type proposed by Goodman (1975), the ambiguous type proposed by Schuster and Smith (2002), and the types of individuals whose responses do not follow an IRT model as proposed by Gitomer and Yamamoto (1991). Despite conceptual similarities among models involving one discrete latent trait variable and one latent Type variable, these models are not parametrically equivalent.

Traditionally, models with a latent Type variable have had one discrete latent trait variable. This article introduces the PLAM, in which the systematic process is defined in terms of two or more discrete latent traits (i.e., a latent class model with two or more latent class variables), whereas the random process is defined in terms of a binary latent Type variable. The development of the PLAM proceeds from a traditional latent class model involving one discrete latent variable, with and without a latent Type variable, to a model involving multiple discrete latent trait variables, with and without a latent Type variable. Prior to the parametric introduction of these models, examples from the 360° feedback system are used to illustrate possible applications of discrete latent variable modeling in organizational research. Examples using empirical data, however, come from a substantive field outside mainstream organizational research: adolescent problem behavior. Specifically, the PLAM is illustrated with two data sets, one involving teachers’ ratings of attention-deficit hyperactivity (ADH) and the other involving mother, teacher, and self ratings of oppositional defiant (OD) behavior, to estimate latent subtypes of problem behavior.

Modeling One Discrete Latent Trait Variable

Employees’ self ratings of job satisfaction are sometimes used to identify individuals dissatisfied with their current jobs. Instead of conceptualizing job satisfaction as a unidimensional continuous trait, it may be better in this example to treat job satisfaction as a discrete latent variable with satisfied, tolerable, and unsatisfied categories. Consider three manifest categorical variables, $y_1$, $y_2$, and $y_3$, with $i = 1, \ldots, I$, $j = 1, \ldots, J$, and $k = 1, \ldots, K$ categories, respectively. The dependencies among the observed categorical variables are explained by a discrete latent trait variable, denoted as $R$, with $p = 1, \ldots, P$ categories. Let $\pi_{ijk}^{y_1 y_2 y_3}$ represent the probability of observing a response pattern for the manifest variables, that is, $P(y_1 = i, y_2 = j, y_3 = k)$. The latent class model can then be expressed in terms of marginal probabilities of the joint distribution of $y_1$, $y_2$, $y_3$, and $R$ as:

$$\pi_{ijk}^{y_1 y_2 y_3} = \sum_{p=1}^{P} \pi_{ijkp}^{y_1 y_2 y_3 R}. \tag{1}$$

Under the assumption of local independence, that is, the independence of the manifest variables conditional on the discrete latent trait variable, the joint probabilities can be expressed as a function of conditional and unconditional probabilities (Lazarsfeld & Henry, 1968):

$$\pi_{ijkp}^{y_1 y_2 y_3 R} = \pi_{p}^{R}\,\pi_{ip}^{y_1 R}\,\pi_{jp}^{y_2 R}\,\pi_{kp}^{y_3 R}, \tag{2}$$

where the unconditional probability is $\pi_{p}^{R} = P(R = p)$ and the conditional probabilities are $\pi_{ip}^{y_1 R} = P(y_1 = i \mid R = p)$, $\pi_{jp}^{y_2 R} = P(y_2 = j \mid R = p)$, and $\pi_{kp}^{y_3 R} = P(y_3 = k \mid R = p)$.

The latent class model for one discrete latent trait variable is obtained by inserting equation 2 into equation 1:

$$\pi_{ijk}^{y_1 y_2 y_3} = \sum_{p=1}^{P} \pi_{p}^{R}\,\pi_{ip}^{y_1 R}\,\pi_{jp}^{y_2 R}\,\pi_{kp}^{y_3 R}. \tag{3}$$

Analogous to factor score estimates in FA and ability estimates in item-response theory models, posterior probabilities of class membership can be estimated by fitting discrete latent variable models. Posterior probability estimates are informative about class separation.
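To make the mechanics concrete, the following Python sketch evaluates equation 3 for a single response pattern and derives the posterior class probabilities just described via Bayes’ rule. All parameter values are hypothetical illustrations, not estimates from this article.

```python
import numpy as np

# Hypothetical two-class model (P = 2) for three binary items (I = J = K = 2).
# pi_R[p] = P(R = p); cond[v][p, i] = P(y_v = i | R = p).
pi_R = np.array([0.7, 0.3])
cond = [
    np.array([[0.90, 0.10], [0.20, 0.80]]),  # P(y1 = i | R = p)
    np.array([[0.80, 0.20], [0.30, 0.70]]),  # P(y2 = j | R = p)
    np.array([[0.85, 0.15], [0.25, 0.75]]),  # P(y3 = k | R = p)
]

def pattern_probability(pattern):
    """Equation 3: P(y1, y2, y3) = sum_p pi_p^R * prod_v P(y_v | R = p)."""
    joint = pi_R * np.prod([cond[v][:, pattern[v]] for v in range(3)], axis=0)
    return joint.sum(), joint

def posterior(pattern):
    """Posterior class probabilities P(R = p | y1, y2, y3)."""
    marginal, joint = pattern_probability(pattern)
    return joint / marginal

print(posterior((1, 1, 1)))  # a consistently elevated response pattern
```

Well-separated classes push the posterior for most patterns toward 0 or 1; posteriors near .5 signal weak class separation.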

Modeling One Discrete Latent Trait Variable and the Latent Type Variable

Ideally, equation 3 holds for each and every member of the population. Yet, it is possible that, for a particular group of individuals, the observed variables y1, y2, and y3 do not contain the necessary information to distinguish P categories of the discrete latent variable R. In employees’ self rating of job satisfaction, for example, a particular group of individuals may fill out the rating scales without paying close attention to items by thinking that their ratings will be inconsequential. Similarly, it is not uncommon to observe a small group of individuals with low ability responding to difficult items correctly or vice versa in ability testing. The latent Type variable, denoted as T with x = 1 (informative) and x = 2 (uninformative) categories, is introduced to take account of the possibility that equation 3 may not hold for a group of individuals. Specifically, equation 3 holds for x = 1, whereas it holds for x = 2 with the following restrictions on the conditional probabilities:

$$\pi_{ip}^{y_1 R} = \pi_{i}^{y_1}, \quad \pi_{jp}^{y_2 R} = \pi_{j}^{y_2}, \quad \pi_{kp}^{y_3 R} = \pi_{k}^{y_3}, \quad \forall\, i, j, k, p.$$

These restrictions imply that the probability of responding to a category of an observed variable remains the same across the levels of the discrete latent trait variable for individuals in the uninformative category of the latent Type variable. Therefore, item responses are simply “uninformative” regarding the latent trait status of individuals in this subpopulation. The latent Type variable $T$ enters equation 1 as an additional discrete latent variable:

$$\pi_{ijk}^{y_1 y_2 y_3} = \sum_{p=1}^{P} \sum_{x=1}^{2} \pi_{ijkpx}^{y_1 y_2 y_3 R T}. \tag{4}$$

With the restrictions on the conditional probabilities for x = 2 and under the assumption of local independence, the observed response patterns are modeled as:

$$\pi_{ijk}^{y_1 y_2 y_3} = \sum_{p=1}^{P} \sum_{x=1}^{2} \pi_{px}^{RT}\,\pi_{ipx}^{y_1 RT}\,\pi_{jpx}^{y_2 RT}\,\pi_{kpx}^{y_3 RT}. \tag{5}$$
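A minimal sketch of equation 5, continuing the hypothetical three-item example above: under the restrictions, the uninformative type contributes a single product term whose item probabilities do not depend on the latent class.

```python
import numpy as np

# Hypothetical parameters. pi_RT[p, x]: joint of class p and Type x
# (x = 0 informative, x = 1 uninformative); the 0.15 uninformative mass is
# split arbitrarily across p because only its total matters under the restriction.
pi_RT = np.array([[0.60, 0.075],
                  [0.25, 0.075]])
cond_inf = [  # P(y_v = i | R = p, T = informative), as in equation 3
    np.array([[0.90, 0.10], [0.20, 0.80]]),
    np.array([[0.80, 0.20], [0.30, 0.70]]),
    np.array([[0.85, 0.15], [0.25, 0.75]]),
]
cond_uninf = [  # P(y_v = i | T = uninformative): identical across classes p
    np.array([0.50, 0.50]),
    np.array([0.60, 0.40]),
    np.array([0.55, 0.45]),
]

def pattern_probability(pattern):
    """Equation 5: sum over classes p and Type categories x."""
    p_inf = pi_RT[:, 0] * np.prod([cond_inf[v][:, pattern[v]] for v in range(3)], axis=0)
    p_uninf = pi_RT[:, 1].sum() * np.prod([cond_uninf[v][pattern[v]] for v in range(3)])
    return p_inf.sum() + p_uninf

print(pattern_probability((1, 0, 1)))
```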

Modeling Two Discrete Latent Trait Variables

In addition to self-reported ratings of job satisfaction, organizational researchers may be interested in self-reported job performance, which is also conceptualized as a discrete latent variable with unsatisfactory, satisfactory, and excellent categories. It may also be of interest to estimate the agreement between job satisfaction and job performance at the latent variable level. Another example of latent agreement would be between the self and supervisory ratings of employees’ job satisfaction. Therefore, in addition to the discrete latent trait variable $R$, consider a second discrete latent trait variable $S$ with $q = 1, \ldots, Q$ categories that underlies the dependencies among three manifest categorical variables $y_4$, $y_5$, and $y_6$ with $l = 1, \ldots, L$, $m = 1, \ldots, M$, and $n = 1, \ldots, N$ categories, respectively. The model for two discrete latent trait variables can be expressed in terms of the marginal joint probability distribution of $y_1, \ldots, y_6$, $R$, and $S$:

$$\pi_{i \ldots n}^{y_1 \ldots y_6} = \sum_{p=1}^{P} \sum_{q=1}^{Q} \pi_{i \ldots n\, p q}^{y_1 \ldots y_6 R S}. \tag{6}$$

Under the assumption of local independence:

$$\pi_{i \ldots n\, p q}^{y_1 \ldots y_6 R S} = \pi_{pq}^{RS}\,\pi_{ip}^{y_1 R}\,\pi_{jp}^{y_2 R}\,\pi_{kp}^{y_3 R}\,\pi_{lq}^{y_4 S}\,\pi_{mq}^{y_5 S}\,\pi_{nq}^{y_6 S}. \tag{7}$$

Note that R influences y1, y2, and y3, whereas S influences y4, y5, and y6. Inserting equation 7 into equation 6 gives the latent class model for two discrete latent trait variables (e.g., Croon, 2002):

$$\pi_{i \ldots n}^{y_1 \ldots y_6} = \sum_{p=1}^{P} \sum_{q=1}^{Q} \pi_{pq}^{RS}\,\pi_{ip}^{y_1 R}\,\pi_{jp}^{y_2 R}\,\pi_{kp}^{y_3 R}\,\pi_{lq}^{y_4 S}\,\pi_{mq}^{y_5 S}\,\pi_{nq}^{y_6 S}. \tag{8}$$

It is of interest to examine the association (i.e., agreement) between the discrete latent trait variables $R$ and $S$, analogous to a factor correlation matrix for continuously distributed latent trait variables. To ease the presentation, consider the simple case of two binary latent trait variables (i.e., $P = Q = 2$). The cross-classification of $R$ and $S$ appears in Table 1. Analogous to Cohen’s kappa (κ) for chance-corrected agreement between two observed variables, the chance-corrected agreement between two binary latent trait variables, denoted as $\kappa_l(R,S)$, is given by:

$$\kappa_l(R,S) = \frac{\left(\pi_{11}^{RS} + \pi_{22}^{RS}\right) - \left(\pi_{1}^{R}\pi_{1}^{S} + \pi_{2}^{R}\pi_{2}^{S}\right)}{1 - \left(\pi_{1}^{R}\pi_{1}^{S} + \pi_{2}^{R}\pi_{2}^{S}\right)}. \tag{9}$$

Table 1.

Cross-tabulation of two binary latent variables designated as R and S

            S = 1               S = 2               Total
R = 1       $\pi_{11}^{RS}$     $\pi_{12}^{RS}$     $\pi_{1}^{R}$
R = 2       $\pi_{21}^{RS}$     $\pi_{22}^{RS}$     $\pi_{2}^{R}$
Total       $\pi_{1}^{S}$       $\pi_{2}^{S}$       1

From Table 1, $\pi_{1}^{R}$, $\pi_{2}^{R}$, $\pi_{1}^{S}$, and $\pi_{2}^{S}$ represent the marginal distributions of the discrete latent trait variables; for example, $\pi_{1}^{R} = \pi_{11}^{RS} + \pi_{12}^{RS}$. Similar to Cohen’s unweighted κ for three observed variables (Von Eye & Mun, 2005), the latent agreement for three variables is given by:

$$\kappa_l(R,S,Z) = \frac{\left(\pi_{111}^{RSZ} + \pi_{222}^{RSZ}\right) - \left(\pi_{1}^{R}\pi_{1}^{S}\pi_{1}^{Z} + \pi_{2}^{R}\pi_{2}^{S}\pi_{2}^{Z}\right)}{1 - \left(\pi_{1}^{R}\pi_{1}^{S}\pi_{1}^{Z} + \pi_{2}^{R}\pi_{2}^{S}\pi_{2}^{Z}\right)}, \tag{10}$$

where Z is a binary latent trait variable.
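Both statistics are simple functions of the estimated joint distribution of the latent traits; a sketch with hypothetical probabilities:

```python
import numpy as np

def kappa_latent_2(joint_rs):
    """Equation 9: chance-corrected agreement between two binary latent traits.
    joint_rs[p, q] = P(R = p, S = q), a 2 x 2 array summing to 1."""
    p_r = joint_rs.sum(axis=1)        # marginal distribution of R
    p_s = joint_rs.sum(axis=0)        # marginal distribution of S
    observed = np.trace(joint_rs)     # pi_11^RS + pi_22^RS
    chance = (p_r * p_s).sum()        # pi_1^R pi_1^S + pi_2^R pi_2^S
    return (observed - chance) / (1 - chance)

def kappa_latent_3(joint_rsz):
    """Equation 10: agreement among three binary latent traits R, S, and Z.
    joint_rsz[p, q, r] = P(R = p, S = q, Z = r), a 2 x 2 x 2 array."""
    p_r = joint_rsz.sum(axis=(1, 2))
    p_s = joint_rsz.sum(axis=(0, 2))
    p_z = joint_rsz.sum(axis=(0, 1))
    observed = joint_rsz[0, 0, 0] + joint_rsz[1, 1, 1]
    chance = (p_r * p_s * p_z).sum()
    return (observed - chance) / (1 - chance)

# Hypothetical joint distribution with strong but imperfect latent agreement.
joint = np.array([[0.55, 0.05],
                  [0.08, 0.32]])
print(round(kappa_latent_2(joint), 3))  # about .73
```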

The PLAM

In the examples of both the self ratings of job satisfaction and job performance (two discrete traits with one rater type) and the self and supervisory ratings of employees’ job satisfaction (one discrete trait with two rater types), it may be of interest to test the presence of a subpopulation whose ratings are uninformative to make inferences about the discrete latent traits. The PLAM is defined as a latent class model for two or more discrete latent trait variables, plus a binary latent Type variable. Each latent trait variable explains associations among a unique set of categorical observed variables. The binary latent Type variable distinguishes individuals whose response patterns are consistent with the multiple discrete latent trait model from those whose responses are uninformative about the discrete latent trait variables. The PLAM with two discrete latent trait variables R and S and the binary latent Type variable of T is given by:

$$\pi_{i \ldots n}^{y_1 \ldots y_6} = \sum_{p=1}^{P} \sum_{q=1}^{Q} \sum_{x=1}^{2} \pi_{i \ldots n\, p q x}^{y_1 \ldots y_6 R S T}, \tag{11}$$

with the following restrictions on the conditional probabilities:

$$\pi_{ip2}^{y_1 RT} = \pi_{i2}^{y_1 T}, \quad \pi_{jp2}^{y_2 RT} = \pi_{j2}^{y_2 T}, \quad \ldots, \quad \pi_{nq2}^{y_6 ST} = \pi_{n2}^{y_6 T}, \quad \forall\, i, \ldots, n, p, q.$$

The joint probability distribution of discrete observed and latent variables can be rewritten as:

$$\pi_{i \ldots n\, p q x}^{y_1 \ldots y_6 R S T} = \left(\pi_{pq1}^{RST}\,\pi_{ip1}^{y_1 RT}\,\pi_{jp1}^{y_2 RT}\,\pi_{kp1}^{y_3 RT}\,\pi_{lq1}^{y_4 ST}\,\pi_{mq1}^{y_5 ST}\,\pi_{nq1}^{y_6 ST}\right) + \left(\pi_{pq2}^{RST}\,\pi_{i2}^{y_1 T}\,\pi_{j2}^{y_2 T}\,\pi_{k2}^{y_3 T}\,\pi_{l2}^{y_4 T}\,\pi_{m2}^{y_5 T}\,\pi_{n2}^{y_6 T}\right), \tag{12}$$

hence,

$$\pi_{i \ldots n}^{y_1 \ldots y_6} = \sum_{p=1}^{P} \sum_{q=1}^{Q} \left(\pi_{pq1}^{RST}\,\pi_{ip1}^{y_1 RT}\,\pi_{jp1}^{y_2 RT}\,\pi_{kp1}^{y_3 RT}\,\pi_{lq1}^{y_4 ST}\,\pi_{mq1}^{y_5 ST}\,\pi_{nq1}^{y_6 ST}\right) + \left(\pi_{pq2}^{RST}\,\pi_{i2}^{y_1 T}\,\pi_{j2}^{y_2 T}\,\pi_{k2}^{y_3 T}\,\pi_{l2}^{y_4 T}\,\pi_{m2}^{y_5 T}\,\pi_{n2}^{y_6 T}\right). \tag{13}$$

For Type = informative, $\kappa_l$ provides the chance-corrected agreement between the two binary latent trait variables $R$ and $S$ because this subpopulation follows equation 8. For Type = uninformative, however, the restrictions on the conditional probabilities imply:

$$\pi_{112}^{RST} = \pi_{122}^{RST} = \pi_{212}^{RST} = \pi_{222}^{RST},$$

hence, $\kappa_l(R,S) = 0$ for this subpopulation. Intuitively, the agreement between two discrete latent trait variables is due solely to chance when the categories of these variables are indistinguishable.
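A sketch of equation 13 for binary $R$, $S$, and $T$ with six items, three per trait; the structure mirrors the equations, and all parameter values are hypothetical:

```python
import numpy as np

def plam_pattern_probability(pattern, pi_RST, cond_R, cond_S, cond_U):
    """Equation 13 for binary R, S, T and six items (items 1-3 load on R, 4-6 on S).
    pi_RST[p, q, x]: joint latent distribution (x = 0 informative, x = 1 uninformative).
    cond_R[v][p, i]: P(y_v = i | R = p, T = informative); cond_S analogous.
    cond_U[v][i]: P(y_v = i | T = uninformative), constant across p and q."""
    # Informative part: class-specific item probabilities for each trait.
    prob_R = np.prod([cond_R[v][:, pattern[v]] for v in range(3)], axis=0)      # shape (P,)
    prob_S = np.prod([cond_S[v][:, pattern[3 + v]] for v in range(3)], axis=0)  # shape (Q,)
    informative = (pi_RST[:, :, 0] * np.outer(prob_R, prob_S)).sum()
    # Uninformative part: item probabilities do not depend on R or S.
    uninformative = pi_RST[:, :, 1].sum() * np.prod([cond_U[v][pattern[v]] for v in range(6)])
    return informative + uninformative

# Tiny demo with arbitrary values.
pi = np.full((2, 2, 2), 1 / 8)  # uniform latent distribution, for illustration only
cR = [np.array([[0.90, 0.10], [0.20, 0.80]])] * 3
cS = [np.array([[0.85, 0.15], [0.30, 0.70]])] * 3
cU = [np.array([0.50, 0.50])] * 6
print(plam_pattern_probability((1, 1, 0, 1, 0, 1), pi, cR, cS, cU))
```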

In the sections that follow, the PLAM is illustrated with two data examples. The first example involves teachers’ ratings of five inattentive (Ina) and seven hyperactivity-impulsivity (H-I) items (similar to the self ratings of job satisfaction and job performance example), whereas the second example involves ratings of four OD items by mother, teacher, and self (similar to the 360° feedback system where the job performance ratings are obtained from employees, supervisors, and subordinates). Each example is presented with a rationale, data characteristics, graphical display of the PLAM, model selection strategies, and interpretation of parameter estimates. For pedagogical purposes, analogies are drawn between the PLAM and the traditional measurement models originating from FA and IRT, where the latent trait variables are conceptualized as continuously distributed.

PLAM Example I: Attention Deficit/Hyperactivity (ADH)

Rationale

The Diagnostic and Statistical Manual of Mental Disorders (DSM) typology of ADH disorder classifies children into five categories: (a) children without the disorder; (b) children with the predominantly Ina type; (c) children with the predominantly H-I type; (d) children with both Ina and H-I (i.e., the combined type); and (e) children with the not-otherwise-specified (NOS) type of the disorder. The ADH types cannot be observed directly. Each component (i.e., Ina and H-I) is best represented as a latent trait variable with two categories: case versus noncase. The first four ADH types can therefore be arranged in a 2 × 2 table representing the cross-classification of the two binary latent trait variables Ina and H-I. Because there is no error-free observed variable to determine the true ADH types, multiple items are needed to represent the latent diagnostic status.

Data Characteristics

The Teacher’s Report Form (TRF; Achenbach, 1991; Achenbach & Rescorla, 2001) was completed by the teachers of 2,619 students who were representative of the U.S. population (49.1% female). The students were 6–18 years old with a mean age of 12.0 years (SD = 3.4). The mean SES was 5.7 (SD = 2.1; range: 1–9) on Hollingshead’s (1975) 9-step scale for parents’ occupations. The students were 72.6% Caucasian, 13.2% African American, 6.9% Latino/Latina, and 7.3% mixed or other. In the preceding 12 months, 4.7% of children had been referred for mental health or special education services for a wide range of psychological and educational problems.

The TRF is a standardized instrument for obtaining teacher reports of academic and adaptive functioning and behavioral/emotional problems. The ADH example focused on 5 Ina and 7 H-I items scored from the TRF. The Ina items were i4. Fails to finish things he/she starts, i8. Can’t concentrate, can’t pay attention for long, i22. Difficulty following directions, i78. Inattentive or easily distracted, and i100. Fails to carry out assigned tasks. The H-I items were h10. Can’t sit still, restless, or hyperactive, h15. Fidgets, h24. Disturbs other pupils, h41. Impulsive or acts without thinking, h53. Talks out of turn, h67. Disrupts class discipline, and h93. Talks too much. These items were used to score the TRF DSM-oriented Inattention and Hyperactivity-Impulsivity scales. Based on the preceding 2 months, teachers rated items on a 3-point Likert-type scale as 0 = not true (as far as you know); 1 = somewhat or sometimes true; and 2 = very true or often true.

Model Specification

The PLAM for ADH appears in Figure 1. Five items were used to measure the binary latent trait variable of Ina, and seven items were used to measure the binary latent trait variable of H-I. The one-way arrows leading from the Ina and H-I latent trait variables to the items represent conditional probabilities, for example, the probability of obtaining a teacher rating of 2 on i22. Difficulty following directions given that the student had Ina problems (i.e., Ina = case). The two-sided arrow represents a two-way contingency table between the latent Ina and H-I variables. Each cell in this contingency table defines a latent subpopulation of ADH: (a) no ADH (Ina = noncase and H-I = noncase), (b) Inattentive type (Ina = case and H-I = noncase), (c) Hyperactive-Impulsive type (Ina = noncase and H-I = case), and (d) combined type (Ina = case and H-I = case). The two categories of the latent Type variable refer to two different models, separated by the dotted line in Figure 1. The binary latent Type variable thus defines two sets of the four ADH types: informative versus uninformative. When Type = informative, the conditional probabilities are expected to be larger when the Ina and H-I variables take the value of case rather than noncase. For Type = uninformative, the conditional probabilities are defined as equal whether Ina or H-I takes the value of case or noncase.

Figure 1. The PLAM for ADH: Graphical representation. Note. PLAM = Psychometric Latent Agreement Model; ADH = attention-deficit hyperactivity.

Model Selection

The PLAM specification may well be more complex than necessary in this example. To evaluate this possibility, three competing models were specified a priori by placing restrictions on the PLAM. First, a model with two binary latent trait variables (see equation 8) was obtained by eliminating the latent Type variable from the PLAM. The comparison between the two-binary-latent-trait model (c = 4, where c represents the total number of classes in a discrete latent variable model) and the PLAM indicates whether the informative versus uninformative distinction is supported by the teachers’ ratings. Second, the one-binary-latent-trait model (c = 2) was obtained by using all 12 variables as indicators of a single binary ADH latent trait variable with case and noncase categories. The comparison between the one- and two-binary-latent-trait models indicates whether the Ina versus H-I distinction is supported by the teachers’ ratings. Third, the independence model (c = 1) was obtained by eliminating the binary ADH latent variable from the one-binary-latent-trait model. The independence model states that there is no association among the 12 observed variables. Three indices of fit were used to compare the models: the Akaike information criterion (AIC; Akaike, 1987), the Bayesian information criterion (BIC; Schwarz, 1978), and the sample-size adjusted BIC, in which n is replaced by (n + 2)/24 (BIC_n; Sclove, 1987). All three indices penalize model complexity. The model with the smallest fit value is regarded as the optimal model. Mplus (v.4.1; Muthén & Muthén, 1998–2006) was used to estimate the models. Mplus input files are provided in the appendix.

The AIC, BIC, and BIC_n values appear in Figure 2. Of the four competing models, the PLAM had the smallest values for all three fit indices, indicating that it was the optimal model. The PLAM had a loglikelihood value of −17,868.38 with 78 free parameters.

Figure 2. The PLAM for ADH: Model comparison. Note. PLAM = Psychometric Latent Agreement Model; ADH = attention-deficit hyperactivity.
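All three indices can be reproduced from the loglikelihood, the number of free parameters, and the sample size; a minimal sketch using the PLAM values just reported (loglikelihood = −17,868.38, 78 parameters, n = 2,619):

```python
import math

def fit_indices(loglik, k, n):
    """AIC, BIC, and the sample-size adjusted BIC (n replaced by (n + 2) / 24)."""
    aic = -2 * loglik + 2 * k
    bic = -2 * loglik + k * math.log(n)
    bic_n = -2 * loglik + k * math.log((n + 2) / 24)
    return aic, bic, bic_n

print(fit_indices(-17868.38, 78, 2619))
```

Because all three indices penalize the 78 free parameters, the PLAM is preferred only when its loglikelihood gain over the restricted models outweighs the added complexity.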

Parameter Estimates

Three sets of conditional probability estimates appear in Figure 3. The first set (the first 12 bars in Figure 3) describes the probability of rating an item 0, 1, or 2 when Type = informative and Ina/H-I = noncase. For example, i22. Difficulty following directions had a .907 probability of being rated 0, a .091 probability of being rated 1, and a .002 probability of being rated 2 when Type = informative and the latent Ina variable takes the value of noncase. When Type = informative and the latent Ina variable takes the value of case, the conditional probabilities of obtaining ratings of 0, 1, and 2 on this item were .473, .495, and .032, respectively. There is only one set of unconditional probabilities for Type = uninformative because these probabilities are the same whether the binary latent trait variables (i.e., Ina and H-I) take the value of case or noncase. For Type = uninformative, the 0, 1, and 2 response probabilities for i22 were .474, .495, and .032, respectively. Under the condition of Informative–Noncase in Figure 3, all items were strong indicators of the absence of teacher-reported Ina and H-I because the probability of a rating of 0 was approximately .90 for all 12 variables. When Type = informative and the binary latent trait variables take the value of case, the probability of observing a rating of 0 was low (i.e., .10 to .20).

Figure 3. The PLAM for ADH: Conditional probability estimates. Note. PLAM = Psychometric Latent Agreement Model; ADH = attention-deficit hyperactivity.

The unconditional probabilities represent the estimated class sizes. The PLAM classified 79.7% of the 2,619 students as belonging in the informative category of the latent Type variable (T = 1), with the following classification for ADH: (a) 57.0% were classified as having neither Ina nor H-I (Type = informative; Ina = noncase; H-I = noncase); (b) 13.1% as having both Ina and H-I, that is, the combined type (Type = informative; Ina = case; H-I = case); (c) 7.9% as having Ina only (Type = informative; Ina = case; H-I = noncase); and (d) 1.7% as having H-I only (Type = informative; Ina = noncase; H-I = case). For Type = informative, $\kappa_l$(Ina, H-I) = .80. The remaining 20.3% of students comprised the uninformative category of the latent Type variable (T = 2). That is, teachers’ ratings of the 12 items are uninformative about Ina and H-I status for one fifth of the students; $\kappa_l$(Ina, H-I) = 0 in this subpopulation, by definition.

Posterior probability estimates for five selected students appear in Table 2. In the PLAM, individuals belong to the levels of latent variables probabilistically. Therefore, the probabilities of a person belonging to each of the latent subpopulations add up to unity. Student A in Table 2, for instance, belongs to the informative–combined type with a probability of .998, whereas Student A’s probability of belonging to the uninformative category of the latent Type variable was .002. Student B’s response pattern yielded a .992 probability of belonging to the informative–Ina type. The response pattern for Student C indicates a .886 chance of belonging to the informative–H-I type, and Student D had a .914 chance of belonging to the informative–noncase type. The response pattern exhibited by Student E had its highest posterior probability for Type = uninformative, meaning that Student E could not be reliably assigned to one of the four ADH types.

Table 2.

The PLAM for ADH: Posterior Probability Estimates for 5 Selected Students

Latent Variable Type    Informative                                              Uninformative
                        Ina = Case                 Ina = Noncase
                        H-I Case    H-I Noncase    H-I Case    H-I Noncase
Student A               .998        .000           .000        .000              .002
Student B               .000        .992           .000        .000              .008
Student C               .102        .000           .886        .000              .012
Student D               .000        .062           .000        .914              .024
Student E               .000        .001           .009        .214              .785

Note. Observed response patterns of 5 Ina and 7 H-I items are (22111, 1212101) for Student A, (21112, 0100000) for Student B, (10001, 1022211) for Student C, (10101, 0000000) for Student D, and (11000, 00010100) for Student E. PLAM = Psychometric Latent Agreement Model; ADH = attention-deficit hyperactivity; Ina = inattentive; H-I = hyperactivity-impulsivity.
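As a worked illustration of how Table 2 is read, the sketch below assigns each student to the modal (highest posterior) latent subpopulation and reports the accompanying misclassification risk; the probabilities are copied from Table 2.

```python
import numpy as np

labels = [
    "Informative: Ina case, H-I case (combined)",
    "Informative: Ina case, H-I noncase (Ina only)",
    "Informative: Ina noncase, H-I case (H-I only)",
    "Informative: Ina noncase, H-I noncase",
    "Uninformative",
]
# Posterior probabilities for Students A-E from Table 2.
posteriors = np.array([
    [.998, .000, .000, .000, .002],
    [.000, .992, .000, .000, .008],
    [.102, .000, .886, .000, .012],
    [.000, .062, .000, .914, .024],
    [.000, .001, .009, .214, .785],
])

for student, row in zip("ABCDE", posteriors):
    modal = row.argmax()
    # The risk of the modal assignment is 1 minus its posterior probability.
    print(f"Student {student}: {labels[modal]} (risk = {1 - row[modal]:.3f})")
```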

PLAM Example II: Mother, Teacher, and Self Ratings of OD

Rationale

OD can be conceptualized as a latent variable with two categories: OD versus no OD (i.e., case vs. noncase). Mother, teacher, and self ratings provide data from a wide range of contexts in which youths might exhibit OD. Directly asking an informant whether a youth has OD may not be an optimal procedure for determining the youth’s diagnostic status. Consistent with the psychometric literature, multiple indicators of OD may be expected to provide a more reliable assessment than the response to a single item (i.e., whether the youth has OD). It should also be recognized that stochastic measurement models are simplified versions of complex processes. Models should therefore take account of the possibility that some youths cannot be reliably classified into OD versus no OD groupings.

Data Characteristics

Data consisted of mother, teacher, and self ratings for 2,030 youths from two U.S. national general population samples (N = 1,245; 50.1% female) and from a clinical sample (N = 785; 28.7% female). The youths were 11 to 18 years old, with a mean age of 13.8 years (SD = 2.1). The mean SES was 5.7 (SD = 2.1; range 1–9) on Hollingshead’s (1975) 9-step scale for parents’ occupations. The youths were 82.9% Caucasian, 8.1% African American, 4.4% Latino/Latina, and 4.6% mixed or other. The general population samples were obtained in home interview surveys conducted in 1989 and 1999 (Achenbach, 1991; Achenbach & Rescorla, 2001). The samples were representative of the 48 contiguous states, with a 90% completion rate in 1989 and 93% in 1999. Of the 1,245 youths in the general population samples, 174 had been referred to mental health services in the preceding 12 months. The reasons for referral included a wide variety of emotional, social, and behavioral problems. The clinical sample included youths from 27 mental health and special education settings (Achenbach & Rescorla, 2001).

The assessment instruments were the Child Behavior Checklist (CBCL), TRF, and Youth Self-Report (YSR; Achenbach, 1991; Achenbach & Rescorla, 2001), which are standardized forms for obtaining parent, teacher, and self reports of academic and adaptive functioning and behavioral/emotional problems. PLAM example II focuses on 4 OD items rated by mothers, teachers, and youths themselves: 3. Argues a lot, 23. Disobedient at school, 86. Stubborn, sullen, or irritable, and 95. Temper tantrums or hot temper. Items were rated on the following 3-point scale: 0 = not true (as far as you know); 1 = somewhat or sometimes true; and 2 = very true or often true. These items were included in the DSM-oriented OD Problems scale from the CBCL, TRF, and YSR.

Model Specification

The graphical representation of the PLAM appears in Figure 4. The PLAM specifies two sets of discrete latent variables to characterize the population of interest. In the first set, each group of raters (or simply each rater) is represented as a discrete latent trait variable with case and noncase categories. For the OD example, three binary latent trait variables are used to characterize the diagnostic status of youths from the perspectives of three groups of raters: mother (M), teacher (T), and self (S). Ratings on four items (i.e., observed variables) are used to measure the diagnostic status of youths on each latent trait variable in the first set. The arrows from the first set of latent trait variables to the OD items represent the measurement relations (i.e., conditional probabilities). The two-sided arrows represent a three-way contingency table among the first set of binary latent trait variables. Each cell in this three-way contingency table represents the diagnostic status of individuals. Two of these eight cells are called complete agreement classes because all three raters agree on the diagnostic status at the latent variable level: the complete agreement class of cases (M+T+S+) and the complete agreement class of noncases (M−T−S−). The remaining six cells (2³ − 2 = 6) represent partial agreement classes because the latent diagnostic status according to one rater disagrees with that according to the remaining two raters, that is, M+T+S−, M+T−S+, M+T−S−, M−T+S−, M−T−S+, and M−T+S+.

Figure 4. The PLAM for OD: Graphical representation. Note. PLAM = Psychometric Latent Agreement Model; OD = oppositional defiant.

The second set consists of one latent variable, labeled as Type, which takes two values: informative or uninformative. The latent Type variable generates two sets of three-way contingency tables among the three binary latent trait variables. The first set of three-way contingency tables classifies individuals into their informative diagnostic groups (i.e., Type = informative). However, the second set of three-way contingency tables cannot tell us anything about the youths’ true diagnostic status (i.e., Type = uninformative). The difference between informative and uninformative categories is defined in terms of conditional probabilities such that, when Type = uninformative, the probabilities of item responses do not change as a function of the value the discrete latent trait variable takes (i.e., case vs. noncase). When Type = informative, however, the response probabilities are conditioned on the discrete latent trait variable.

Model Selection

The rule of parsimony dictates that simpler models should be preferred to complex ones. Therefore, in addition to the PLAM, three models of decreasing complexity were tested: (a) the PLAM without the latent Type variable (c = 8); (b) a single binary latent trait variable of OD with case and noncase categories (c = 2); and (c) the independence model (c = 1). The AIC, BIC, and BIC_n were used to compare the models. For all three indices, smaller values indicate better fit.

Relative fit indices appear in Figure 5. Of the four models, the PLAM was the optimal model according to all three fit indices (i.e., the smallest AIC, BIC, and BIC_n values). The difference in fit between the PLAM and the PLAM without the latent Type variable (c = 8) indicated that not all individuals could be reliably assigned to the case or noncase categories of the three latent trait variables. The difference in fit between the PLAM without the latent Type variable and the latent OD variable with case and noncase categories (c = 2) indicated that the two complete agreement classes are insufficient to classify individuals and that the partial agreement classes are needed. Of the four competing models, the independence model had the largest fit values, indicating the worst fit. The PLAM had a loglikelihood value of −19,266.70, with 82 free parameters.

Figure 5. The PLAM for OD: Model comparison. Note. PLAM = Psychometric Latent Agreement Model; OD = oppositional defiant.

Parameter Estimates

Three sets of parameter estimates are of special interest in the PLAM: (a) conditional probabilities, (b) unconditional probabilities, and (c) posterior probabilities. The conditional probability estimates appear in Figure 6. The conditional probabilities are analogous to the discrimination parameters in IRT models and the factor loadings in FA. Similar to item-response functions in IRT models, Figure 6 plots the probability of rating an item 0, 1, or 2 as a function of the latent variables. For Type = informative, for example, the probability of mothers rating item 95. Temper tantrums or hot temper as 0, 1, or 2 was .87, .12, and .01, respectively, for M = noncase and .11, .41, and .48 for M = case. For Type = uninformative, the conditional probabilities were .31, .51, and .18 for both the case and noncase categories of the latent Mother variable because the conditional probabilities do not change across the levels of the latent trait variables when Type = uninformative. Therefore, youths in the latent subpopulation of Type = uninformative cannot be reliably classified into case or noncase categories. In IRT terms, Type = uninformative is analogous to a measurement model with zero discrimination (i.e., a flat item-response curve). Similarly, in factor-analytic terms, Type = uninformative corresponds to an FA model with all factor loadings equal to zero. As expected from the PLAM parameterization, for Type = informative the probability of rating an item zero is largest when the latent trait variables take the value of noncase and smallest when they take the value of case. For Type = uninformative, the unconditional probabilities take intermediate values between the informative–noncase and informative–case probabilities (see Figure 6).

Figure 6. The PLAM for OD: Conditional probability estimates. Note. PLAM = Psychometric Latent Agreement Model; OD = oppositional defiant.

Unconditional probability estimates appear in Figure 7. Unconditional probabilities indicate the size of subpopulations. The latent Type variable divides individuals into informative (78%) and uninformative (22%) categories. Results indicated that 22% of youths cannot be reliably assigned into case or noncase categories of the latent trait variables. For Type = informative, the complete agreement class of cases (i.e., true complete agreement class of cases or M+T+S+) and noncases (i.e., true complete agreement class of noncases or M−T−S−) encompassed 20% and 39% of the youths, respectively. The six true partial agreement classes encompassed 19% of the youths ranging from 1% (M−T+S+) to 4% (M+T+S− and M−T+S−). This is consistent with the finding that the c = 8 model fit better than the c = 2 model (see Figure 5).

Figure 7. The PLAM for OD: Unconditional probability estimates. Note. PLAM = Psychometric Latent Agreement Model; OD = oppositional defiant.

For Type = uninformative, $\kappa_l(M,T,S) = \kappa_l(M,T) = \kappa_l(M,S) = \kappa_l(T,S) = 0$, by definition. For Type = informative, $\kappa_l(M,T,S) = .66$ (see equation 10). From equation 9, $\kappa_l(M,T) = .76$, $\kappa_l(M,S) = .77$, and $\kappa_l(T,S) = .67$. The $\kappa_l$ values between pairs of discrete latent trait variables were obtained by collapsing the unconditional probabilities over the levels of the third discrete latent trait variable.
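The collapsing step is a simple marginalization of the three-way latent distribution; a sketch with a hypothetical informative-type joint distribution of M, T, and S (the helper restates equation 9):

```python
import numpy as np

def kappa_2(joint):
    """Equation 9 applied to a 2 x 2 latent joint distribution."""
    marg_rows, marg_cols = joint.sum(axis=1), joint.sum(axis=0)
    chance = (marg_rows * marg_cols).sum()
    return (np.trace(joint) - chance) / (1 - chance)

# Hypothetical P(M, T, S | Type = informative); axes are (M, T, S), index 0 = case.
joint_mts = np.array([[[0.26, 0.03], [0.04, 0.05]],
                      [[0.02, 0.05], [0.05, 0.50]]])

print(round(kappa_2(joint_mts.sum(axis=2)), 2))  # kappa_l(M, T): collapse over S
print(round(kappa_2(joint_mts.sum(axis=1)), 2))  # kappa_l(M, S): collapse over T
print(round(kappa_2(joint_mts.sum(axis=0)), 2))  # kappa_l(T, S): collapse over M
```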

Posterior probability estimates for four selected youths appear in Table 3. For a given response pattern, the posterior probabilities indicate the degree of subpopulation membership. For example, Youth A has a .819 chance of belonging to the true complete agreement class of noncases (M−T−S−; Type = informative, M = noncase, T = noncase, and S = noncase), a .156 chance of belonging to the true partial agreement class of M−T−S+ (Type = informative, M = noncase, T = noncase, and S = case), and a .025 chance of belonging to the uninformative category of the latent Type variable. Youth A may thus be assigned to the true complete agreement class of noncases as the most likely category. This assignment, however, carries a .181 probability of misclassifying Youth A (i.e., 1 − .819). A visual inspection of the response patterns and the probabilities of being in the informative versus uninformative categories of the latent Type variable reveals that within-rater inconsistency across the four OD items increases the probability of belonging to the uninformative subpopulation.

Table 3.

The PLAM for OD: Posterior Probability Estimates for Four Selected Youths

Latent Variable Type    Informative (+ = case, − = noncase)                                        Uninformative
             M+T+S+   M+T+S−   M+T−S+   M+T−S−   M−T+S+   M−T+S−   M−T−S+   M−T−S−
Youth A      .000     .000     .000     .000     .000     .000     .156     .819     .025
Youth B      .000     .000     .000     .000     .000     .000     .013     .155     .832
Youth C      .933     .000     .009     .000     .008     .000     .000     .000     .050
Youth D      .282     .005     .000     .000     .058     .020     .000     .000     .634

Note. Observed response patterns of mothers, teachers, and youths for items 3, 23, 86, and 95 are (1000, 0000, 1111) for Youth A, (1010, 0010, 1021) for Youth B, (2110, 2010, 1122) for Youth C, and (1011, 2002, 2021) for Youth D, respectively. OD = oppositional defiant.

Conclusions

A variety of tests are used to classify individuals into discrete groups. Examples include use of rating scales by organizations to classify prospective and current employees to make employment and placement decisions (e.g., hire vs. not hire, entry-level vs. mid-level managerial assignments), use of standardized English proficiency tests by universities to determine the readiness of applicants for educational programs (e.g., qualified vs. unqualified), and use of test scores by state agencies to award licenses for professional practice (e.g., accountants, builders). Continuous latent trait conceptualization of constructs embedded in traditional psychometric models may not be an optimal strategy for grouping individuals. Ideally, applied organizational researchers would make an informed decision about the latent trait distribution based on substantive theory and the kinds of inferences that they wish to make from item responses. As an extension of latent class models, the PLAM provides a probabilistic formulation of observed responses as a function of discrete latent variables.

The PLAM is especially useful when (a) attributes cannot be observed directly (e.g., job satisfaction), (b) attributes are believed to consist of a relatively small number of categories (e.g., satisfied vs. unsatisfied), (c) multiple items are available to make inferences on discrete latent traits (e.g., rating scales), (d) agreement between discrete latent trait variables is of interest, and (e) given the subjectivity of item responses (e.g., ratings of attitude, personality, and satisfaction), there may be individuals whose responses do not distinguish between categories of discrete latent trait variables.

The PLAM combines multiple discrete latent variables with a particular latent Type variable. Latent class models involving multiple discrete latent trait variables are not new (e.g., Croon, 2002). The concept of nonfitting individuals has also been used in a variety of measurement models including IRT, Guttman scaling, and latent class models (Gitomer & Yamamoto, 1991; Goodman, 1975; Schuster & Smith, 2002). The PLAM definition of the latent Type variable is similar to the ones used in these models in that it specifies a probability distribution for both fitting and nonfitting groups. Alternatively, an unspecified probability distribution may be assumed for the nonfitting group (Dayton, 2006). In this case, the purpose becomes finding the maximum number of individuals whose responses are consistent with a specified probability distribution. Consequently, no distributional assumption is necessary for the remaining group of individuals whose responses are inconsistent with the specified probability distribution.

Despite conceptual similarities, the intended meaning and formal parameterizations of the latent Type variable were different in these models. One unique contribution of the PLAM to the measurement literature lies in the definition and integration of the latent Type variable into the multiple discrete latent trait models. For Type = informative, the underlying model is simply the multiple discrete latent trait model. For Type = uninformative, however, the multiple discrete latent trait model is modified by setting up the categories of discrete latent variables to be indistinguishable from one another. This is achieved by fixing the conditional probabilities to be equal across the levels of discrete latent trait variables. This definition of the latent Type variable is unique to the PLAM. The uninformative category of the PLAM is defined in reference to the discrete latent traits with indistinguishable categories, rather than in reference to a group of individuals whose responses are simply inconsistent with the multiple discrete latent trait model. It is this specific definition that results in κl = 0 for Type = uninformative.

Within the rater agreement literature, the PLAM is the first to represent each rater type as a discrete latent trait (Dumenci, 2005). Consequently, the PLAM distinguishes true partial agreement (e.g., Type = informative, M = case, T = case, S = noncase) from chance partial agreement (e.g., Type = uninformative, M = case, T = case, S = noncase), as illustrated with the OD example. The idea behind the true partial agreement is that rater disagreements may not necessarily reflect the characteristics of raters. Rather, rater disagreements may reflect individual difference characteristics of the people being rated (e.g., Achenbach, McConaughy, & Howell, 1987). The PLAM incorporates the concept of true partial agreement into psychometric modeling. The overall size of the true partial agreement classes was found to approximate the size of the true complete agreement class of cases in the OD example. That is, the model would be misspecified without the true partial agreement classes in this example. The true partial agreement classes can be evaluated statistically, as illustrated in the model selection strategies.

The PLAM is appropriate for models involving ratings of multiple observed variables (i.e., items) in a traditional psychometric sense. Existing latent agreement models deal only with the rater facet of measurement design. By contrast, the PLAM may be used to model latent agreements for additional facets of measurement designs, such as traits. As illustrated in the ADH example, the combined ADH type was the most prevalent (13.1%), the H-I type was the least prevalent (1.7%), and the Ina type was moderately prevalent (7.9%). These estimates are comparable to those reported in general population surveys (e.g., Pinada, Lopera, Palacio, Ramirez, & Henao, 2003).

General latent class agreement models provided by Schuster and Von Eye (2001) and Schuster and Smith (2002) should be consulted first when the design characteristics call for each rater assigning each of N subjects into one category of one observed variable. Schuster and colleagues showed how their models can be reparameterized to obtain previously proposed latent class agreement models. Conceptual and parametric similarities and differences between their models and existing models of latent agreement have been well documented (Schuster, 2006; Schuster & Smith, 2002; Schuster & Von Eye, 2001).

The continuously distributed latent variable conceptualization of behavioral attributes serves well in the many measurement contexts where the PLAM is not useful. Although the choices among latent variable distributions are restricted in practice, a particular choice of latent variable distribution should ideally be justified over its alternatives on theoretical, empirical, and practical grounds, whether it is continuous, discrete, or both (i.e., a factor mixture model). It becomes questionable to use a particular set of items to assign individuals to the categories of discrete latent traits when the unconditional probability estimate of Type = uninformative (i.e., $\pi_2^T$) is relatively large. The question of “how large is large” needs to be answered by substantive researchers. Yet researchers may be well advised to reconsider their selection of items when $\pi_2^T$ exceeds 1/3. Although several features of the PLAM can be evaluated statistically (e.g., one vs. multiple discrete latent traits, the need for the latent Type variable), the optimal model among the alternatives may not necessarily achieve close fit. The PLAM thus shares the limitations of absolute fit indices with other latent class models. Visual inspection of the response patterns that make the largest contributions to the loglikelihood values may reveal clues to some of the weaknesses of the model. In the model-fitting context, statistical evidence is not sufficient to claim the superiority of the PLAM. It is highly likely that there are many models (not tested here) that would fit the data better than the PLAM. However, the PLAM offers a logically consistent, practically useful, and statistically parsimonious interpretation.1

Another limitation that the PLAM shares with other latent class models is that, given a fixed N, the sparseness of the contingency table due to an increase in the number of items and item-response categories will eventually lead to model identification problems (Collins & Wugalter, 1992). When this occurs, researchers might consider reducing the number of parameter estimates to achieve identification by imposing parameter constraints, for example, equality of unconditional probabilities across some or all facets of measurement designs (e.g., traits, raters). Eid, Langeheine, and Diener (2003) have provided an overview of such parameter constraints within the context of measurement invariance for discrete latent traits.
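The scale of the sparseness problem is easy to quantify. For instance, with the 12 three-category items of the ADH example and n = 2,619:

```python
n_items, n_categories, n = 12, 3, 2619   # ADH example
cells = n_categories ** n_items          # possible response patterns
print(f"{cells:,} cells")                # 531,441
print(f"{n / cells:.4f} observations per cell on average")  # about 0.0049
```

With far fewer observations than cells, parameter constraints of the kind described above are often the only route to an identified model.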

The PLAM opens the door for two promising lines of research: the PLAM with continuous observed variables and the PLAM with covariates. The PLAM is developed for discrete observed variables as an extension of latent class analysis. Because latent profile analysis involves the estimation of discrete latent variables from continuous observed variables (Bartholomew & Knott, 1999), it is possible to further develop latent profile analysis to include certain characteristics of the PLAM, including discrete latent variable representations of raters and traits, as well as the latent Type variable. Another possible extension of the PLAM is the introduction of covariates into the model. This is an important extension because applied researchers are interested in distinguishing the background characteristics of individuals for whom the test items work as intended from the characteristics of those for whom the items do not work. The PLAM with covariates can be used, for example, to test the hypothesis that employees with low levels of organizational cohesion are more likely to belong to the uninformative subpopulation than employees with high levels of organizational cohesion when rating scales are used to classify employees into latent job satisfaction categories. In fact, Muthén (2003) showed that latent mixture models perform better when covariates are included. A drawback of adding covariates to the PLAM is that doing so increases model complexity, which, in turn, may decrease the likelihood of obtaining an admissible solution.

Finally, the chance-corrected agreement statistic (i.e., $\kappa_l$) presented in equations 9 and 10 is simply one way to assess agreement. The extensive literature on agreement statistics estimated from observed cross-classification tables is beyond the scope of this study (for comprehensive treatments, see Agresti, 2002; Shoukri, 2004; and Von Eye & Mun, 2005). It may suffice to note that the cross-classification of discrete latent variables for Type = informative (e.g., Table 1) provides the information needed to estimate any agreement statistic, however defined, that was originally developed for discrete observed variables.

In conclusion, the PLAM is useful for modeling discrete item responses when tests are administered to determine the group membership of individuals in situations where the group memberships are not directly observable (e.g., pass/fail, qualified/unqualified). Examples include licensure, certification, and qualification examinations, as well as employment and performance tests. Loosely speaking, the PLAM is analogous to confirmatory factor analytic models for continuously distributed latent variables. It is confirmatory in the sense that latent variables and their relations with each other, as well as with observed discrete variables, are specified a priori. In the search for a parsimonious model, alternative models specified a priori should be estimated and contrasted with the full specification of PLAM. Alternative specifications may include the PLAM without the latent Type variable and the traditional one-latent class variable model. Information-based relative fit indices that take account of model complexity may be used to compare alternative models. Visual inspection of discrepancies between observed and model-implied response patterns may be used heuristically to detect shortcomings of the PLAM specification.

Acknowledgments

I am grateful to Thomas M. Achenbach and Leslie A. Rescorla for helpful suggestions and discussions. A part of Example 1 used in this study was presented at the International Meeting of the Psychometric Society, Tilburg, the Netherlands (July 2005).

Funding

The author(s) disclosed receipt of the following financial support for the research and/or authorship of this article: This research was supported by National Institute of Mental Health Grant No. R03 MH64474 and the Research Center for Children, Youth, and Families at University of Vermont.

Biography

Levent Dumenci is an associate professor of Social and Behavioral Health at Virginia Commonwealth University. His research focuses on psychometric models and related statistical methods, including continuous and discrete latent variable modeling, analysis of the multitrait-multimethod matrix using structural equation modeling parameterizations, and measurement models for behavioral change.

APPENDIX. Mplus Input Files

I. Mplus input file used to estimate the PLAM in Example 1: ADH

TITLE: ADH - PLAM
DATA:
FILE IS odd.dat;
FORMAT IS FREE;
VARIABLE:
NAMES ARE t4 t8 t22 t78 t100 t10 t15 t24 t41 t53 t67 t93;
CATEGORICAL ARE t4 t8 t22 t78 t100 t10 t15 t24 t41 t53 t67 t93;
CLASSES = t (2) i (2) h (2);
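! CLASSES (interpretation assumed from the text): t = the binary latent Type
! variable; i and h = the two discrete latent trait variables, 2 classes each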
ANALYSIS:
TYPE IS MIXTURE;
   PARAMETERIZATION = LOGLINEAR;
   LOGHIGH = +15;
   LOGLOW = -15;
   UCELLSIZE = 0.01;
   ESTIMATOR IS MLR;
   LOGCRITERION = 0.0000001;
   ITERATIONS = 1000;
   CONVERGENCE = 0.000001;
   MITERATIONS = 1000;
   MCONVERGENCE = 0.000001;
   MIXC = ITERATIONS;
   MCITERATIONS = 2;
   MIXU = ITERATIONS;
   MUITERATIONS = 2;
MODEL:
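   ! %OVERALL% specifies the class-size means and the Type-trait associations;
   ! MODEL T fixes the i-h association to zero in one Type class and frees it
   ! in the other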
   %OVERALL%
   [i#1 h#1];
   i#1 with t#1;
   h#1 with t#1;
MODEL T:
   %t#1%
   i#1 with h#1@0;
   %t#2%
   i#1 with h#1;
MODEL T.I:
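   ! Within t#1, the numbered labels (1)-(10) equate item thresholds across the
   ! i classes, so the items carry no information about i in that Type class;
   ! within t#2, thresholds are free, with opposite-signed starting values
   ! separating the two trait classes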
   %t#1.i#1%
   [t4$1] (1);  [t8$1] (2);  [t22$1] (3);  [t78$1] (4);  [t100$1] (5);
   [t4$2] (6);  [t8$2] (7);  [t22$2] (8);  [t78$2] (9);  [t100$2] (10);
   %t#1.i#2%
   [t4$1] (1);  [t8$1] (2);  [t22$1] (3);  [t78$1] (4);  [t100$1] (5);
   [t4$2] (6);  [t8$2] (7);  [t22$2] (8);  [t78$2] (9);  [t100$2] (10);
   %t#2.i#1%
   [t4$1*3]; [t8$1*3];  [t22$1*3];  [t78$1*3];  [t100$1*3];  [t4$2*5];
   [t8$2*5]; [t22$2*5];  [t78$2*5];  [t100$2*5];
   %t#2.i#2%
   [t4$1*-5];  [t8$1*-5];  [t22$1*-5];  [t78$1*-5];  [t100$1*-5]; [t4$2*-3];
   [t8$2*-3];  [t22$2*-3];  [t78$2*-3];  [t100$2*-3];
MODEL T.H:
   %t#1.h#1%
   [t10$1] (11);  [t15$1] (12);  [t24$1] (13); [t41$1] (14);  [t53$1] (15);
   [t67$1] (16);  [t93$1] (17);  [t10$2] (18); [t15$2] (19);  [t24$2] (20);
   [t41$2] (21);  [t53$2] (22);  [t67$2] (23); [t93$2] (24);
   %t#1.h#2%
   [t10$1] (11);  [t15$1] (12);  [t24$1] (13); [t41$1] (14);  [t53$1] (15);
   [t67$1] (16);  [t93$1] (17);  [t10$2] (18); [t15$2] (19);  [t24$2] (20);
   [t41$2] (21);  [t53$2] (22);  [t67$2] (23); [t93$2] (24);
   %t#2.h#1%
   [t10$1*3];  [t15$1*3];  [t24$1*3];  [t41$1*3];  [t53$1*3];  [t67$1*3];
   [t93$1*3];  [t10$2*5];  [t15$2*5];  [t24$2*5];  [t41$2*5];  [t53$2*5];
   [t67$2*5];  [t93$2*5];
   %t#2.h#2%
   [t10$1*-5];  [t15$1*-5];   [t24$1*-5];  [t41$1*-5];  [t53$1*-5];  [t67$1*-5];
   [t93$1*-5];  [t10$2*-3];   [t15$2*-3];  [t24$2*-3];  [t41$2*-3];  [t53$2*-3];
   [t67$2*-3];  [t93$2*-3];
SAVEDATA:
   FILE IS adhplam.dat;  save = cprob;
OUTPUT:  TECH10;

II. Mplus input file used to estimate the PLAM in Example 2: ODD

TITLE: ODD - PLAM
DATA:
   FILE IS odd1.dat;
FORMAT IS FREE;
VARIABLE:
   NAMES ARE m3 m23 m86 m95 t3 t23 t86 t95 s3 s23 s86 s95;
   CATEGORICAL ARE m3 m23 m86 m95 t3 t23 t86 t95 s3 s23 s86 s95;
    CLASSES = c (2) m (2) t (2) s (2);
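    ! CLASSES (interpretation assumed from the text): c = the binary latent Type
    ! variable; m, t, and s = rater-specific discrete latent trait variables,
    ! one per rater type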
ANALYSIS:
   TYPE IS MIXTURE;
   PARAMETERIZATION = LOGLINEAR;
   LOGHIGH = +15;
   LOGLOW = -15;
   UCELLSIZE = 0.01;
   ESTIMATOR IS MLR;
   LOGCRITERION = 0.0000001;
   ITERATIONS = 1000;
   CONVERGENCE = 0.000001;
   MITERATIONS = 1000;
   MCONVERGENCE = 0.000001;
   MIXC = ITERATIONS;
   MCITERATIONS = 2;
   MIXU = ITERATIONS;
   MUITERATIONS = 2;
MODEL:
   %OVERALL%
   [m#1 t#1 s#1];
   m#1 with c#1;
   t#1 with c#1;
   s#1 with c#1;
MODEL C:
   %c#1%
   s#1 with m#1@0;
   s#1 with t#1@0;
   t#1 with m#1@0;
   %c#2%
   s#1 with m#1;
   s#1 with t#1;
   t#1 with m#1;
MODEL C.M:
   %c#1.m#1%
   [m3$1] (1);  [m23$1] (2);  [m86$1] (3);  [m95$1] (4);
   [m3$2] (5);  [m23$2] (6);  [m86$2] (7);  [m95$2] (8);
   %c#1.m#2%
   [m3$1] (1);  [m23$1] (2);  [m86$1] (3);  [m95$1] (4);
   [m3$2] (5);  [m23$2] (6);  [m86$2] (7);  [m95$2] (8);
   %c#2.m#1%
   [m3$1*+3];  [m23$1*+3];  [m86$1*+3];  [m95$1*+3];
   [m3$2*+5];  [m23$2*+5];  [m86$2*+5];  [m95$2*+5];
   %c#2.m#2%
   [m3$1*-5];  [m23$1*-5];  [m86$1*-5];  [m95$1*-5];
   [m3$2*-3];  [m23$2*-3];  [m86$2*-3];  [m95$2*-3];
MODEL C.T:
   %c#1.t#1%
   [t3$1] (9);  [t23$1] (10); [t86$1] (11);  [t95$1] (12);
   [t3$2] (13);  [t23$2] (14); [t86$2] (15);  [t95$2] (16);
   %c#1.t#2%
   [t3$1] (9);  [t23$1] (10); [t86$1] (11);  [t95$1] (12);
   [t3$2] (13);  [t23$2] (14); [t86$2] (15);  [t95$2] (16);
   %c#2.t#1%
   [t3$1*+3];   [t23$1*+3]; [t86$1*+3];  [t95$1*+3];
   [t3$2*+5];   [t23$2*+5]; [t86$2*+5];  [t95$2*+5];
   %c#2.t#2%
   [t3$1*-5];   [t23$1*-5]; [t86$1*-5];   [t95$1*-5];
   [t3$2*-3];   [t23$2*-3]; [t86$2*-3];  [t95$2*-3];
MODEL C.S:
   %c#1.s#1%
   [s3$1] (17);  [s23$1] (18); [s86$1] (19);  [s95$1] (20);
   [s3$2] (21);  [s23$2] (22); [s86$2] (23);  [s95$2] (24);
   %c#1.s#2%
   [s3$1] (17);  [s23$1] (18); [s86$1] (19);  [s95$1] (20);
   [s3$2] (21);  [s23$2] (22); [s86$2] (23);  [s95$2] (24);
   %c#2.s#1%
   [s3$1*+3];  [s23$1*+3]; [s86$1*+3];  [s95$1*+3];
   [s3$2*+5];  [s23$2*+5]; [s86$2*+5];  [s95$2*+5];
   %c#2.s#2%
   [s3$1*-5];  [s23$1*-5]; [s86$1*-5];   [s95$1*-5];
   [s3$2*-3];  [s23$2*-3]; [s86$2*-3];   [s95$2*-3];
SAVEDATA:
   FILE IS oddplam.dat;  save = cprob;
OUTPUT:  TECH10;

Footnotes

1. We are grateful to an anonymous reviewer for directing us to make this statement.

Declaration of Conflicting Interests

The author(s) declared no conflicts of interest with respect to the authorship and/or publication of this article.

References

  1. Achenbach TM. Manual for the Child Behavior Checklist/4–18 and 1991 profile. Burlington, VT: University of Vermont, Department of Psychiatry; 1991.
  2. Achenbach TM, McConaughy SH, Howell CT. Child/adolescent behavioral and emotional problems: Implications of cross-informant correlations for situational specificity. Psychological Bulletin. 1987;101:213–232.
  3. Achenbach TM, Rescorla LA. Manual for the ASEBA school-age forms & profiles. Burlington, VT: University of Vermont, Center for Children, Youth, and Families; 2001.
  4. Agresti A. A model for agreement between ratings on an ordinal scale. Biometrics. 1988;44:539–548.
  5. Agresti A. An agreement model with kappa as parameter. Statistics & Probability Letters. 1989;7:271–273.
  6. Agresti A. Categorical data analysis. 2nd ed. New York, NY: Wiley; 2002.
  7. Agresti A, Lang JB. Quasi-symmetric latent class models, with application to rater agreement. Biometrics. 1993;49:131–139.
  8. Aickin M. Maximum likelihood estimation of agreement in the constant predictive probability model and its relation to Cohen's kappa. Biometrics. 1990;46:293–302.
  9. Akaike H. Factor analysis and AIC. Psychometrika. 1987;52:317–332.
  10. Bartholomew DJ, Knott M. Latent variable models and factor analysis. London, UK: Arnold; 1999.
  11. Becker MP. Using association models to analyse association data: Two examples. Statistics in Medicine. 1989;8:1199–1207. doi: 10.1002/sim.4780081004.
  12. Bergan JR, Schwarz RD, Reddy LA. Latent structure analysis of classification errors in screening and clinical diagnosis: An alternative to classification analysis. Applied Psychological Measurement. 1999;23:69–86.
  13. Brett JF, Atwater LE. 360° feedback: Accuracy, reactions, and perceptions of usefulness. Journal of Applied Psychology. 2001;86:930–942. doi: 10.1037/0021-9010.86.5.930.
  14. Campbell DT, Fiske DW. Convergent and discriminant validation by the multitrait-multimethod matrix. Psychological Bulletin. 1959;56:81–105.
  15. Cheung GW. Introducing the latent congruence model for improving the assessment of similarity, agreement, and fit in organizational research. Organizational Research Methods. 2010;12:6–33.
  16. Cohen A, Doveh E, Nahum-Shani I. Testing agreement with multi-item scales with the indices rWG(J) and ADM(J). Organizational Research Methods. 2010;12:148–164.
  17. Collins LM, Wugalter SE. Latent class models for stage-sequential dynamic latent variables. Multivariate Behavioral Research. 1992;27:131–157.
  18. Croon M. Ordering the classes. In: Hagenaars JA, McCutcheon AL, editors. Applied latent class analysis. Cambridge, UK: Cambridge University Press; 2002. pp. 137–162.
  19. Darroch JN, McCloud PI. Category distinguishability and observer agreement. Australian Journal of Statistics. 1986;28:371–388.
  20. Dayton CM. π*: The two-point mixture index of model fit. Paper presented at the University of Maryland CILVR conference "Mixture Models in Latent Variable Research"; College Park, MD; May 2006.
  21. Dayton CM, Macready GB. A probabilistic model for validation of behavioral hierarchies. Psychometrika. 1976;41:189–204.
  22. Dayton CM, Macready GB. A scaling model with response errors and intrinsically unscalable respondents. Psychometrika. 1980;45:343–356.
  23. Dillon WR, Mulani N. A probabilistic latent class model for assessing inter-judge reliability. Multivariate Behavioral Research. 1984;19:438–458. doi: 10.1207/s15327906mbr1904_5.
  24. Dumenci L. A three-latent class variable model of attention problems. Poster presented at the International Meeting of the Psychometric Society; Tilburg, the Netherlands; July 2005.
  25. Edwards MR, Ewen AJ. How to manage performance and pay with 360-degree feedback. Compensation and Benefits Review. 1996;28:41–46.
  26. Eid M, Langeheine R, Diener E. Comparing typological structures across cultures by multigroup latent class analysis: A primer. Journal of Cross-Cultural Psychology. 2003;34:195–210.
  27. Flaherty BP. Assessing reliability of categorical substance use measures with latent class analysis. Drug and Alcohol Dependence. 2002;68:S7–S20. doi: 10.1016/s0376-8716(02)00210-7.
  28. Funderburgh SA, Levy P. The influence of individual and contextual variables on 360-degree feedback system attitudes. Group & Organization Management. 1997;22:210–235.
  29. Gitomer DH, Yamamoto K. Performance modeling that integrates latent trait and class theory. Journal of Educational Measurement. 1991;28:173–189.
  30. Glaros AG, Kline RB. Understanding the accuracy of tests with cutting scores: The sensitivity, specificity, and predictive value model. Journal of Clinical Psychology. 1988;44:1013–1023. doi: 10.1002/1097-4679(198811)44:6<1013::aid-jclp2270440627>3.0.co;2-z.
  31. Goodman LA. A new model for scaling response patterns: An application of the quasi-independence concept. Journal of the American Statistical Association. 1975;70:755–768.
  32. Goodman LA. Simple models for the analysis of association in cross-classifications having ordered categories. Journal of the American Statistical Association. 1979;74:537–552.
  33. Guggenmoos-Holzmann I. The meaning of kappa: Probabilistic concepts of reliability and validity revisited. Journal of Clinical Epidemiology. 1996;49:775–782. doi: 10.1016/0895-4356(96)00011-x.
  34. Guggenmoos-Holzmann I, Vonk R. Kappa-like indices of observer agreement viewed from a latent class perspective. Statistics in Medicine. 1998;17:797–812. doi: 10.1002/(sici)1097-0258(19980430)17:8<797::aid-sim776>3.0.co;2-g.
  35. Hollingshead AB. Four factor scale of social status. Unpublished paper. New Haven, CT: Yale University, Department of Sociology; 1975.
  36. Lawler EE III. The multitrait-multirater approach to measuring managerial job performance. Journal of Applied Psychology. 1967;51:369–381. doi: 10.1037/h0025095.
  37. Lazarsfeld PF, Henry NW. Latent structure analysis. Boston, MA: Houghton Mifflin; 1968.
  38. Macready GB, Dayton CM. The use of probabilistic models in the assessment of mastery. Journal of Educational Statistics. 1977;2:99–120.
  39. Maylett T. 360-degree feedback revisited: The transition from development to appraisal. Compensation and Benefits Review. 2009;41:52–59.
  40. Muthén BO. Statistical and substantive checking in growth mixture modeling: Comment on Bauer and Curran (2003). Psychological Methods. 2003;8:369–377. doi: 10.1037/1082-989X.8.3.369.
  41. Muthén LK, Muthén BO. Mplus user's guide. Los Angeles, CA: Muthén & Muthén; 1998–2006.
  42. Pasisz DJ, Hurtz GM. Testing for between-group differences in within-group interrater agreement. Organizational Research Methods. 2010;12:590–613.
  43. Pineda DA, Lopera F, Palacio JD, Ramirez D, Henao GC. Prevalence estimations of attention-deficit/hyperactivity disorder: Differential diagnoses and comorbidities in a Colombian sample. International Journal of Neuroscience. 2003;113:49–71. doi: 10.1080/00207450390161921.
  44. Rindskopf D, Rindskopf W. The value of latent class analysis in medical diagnosis. Statistics in Medicine. 1986;5:21–27. doi: 10.1002/sim.4780050105.
  45. Rupp AA, Templin J, Henson RA. Diagnostic measurement: Theory, methods, and applications. New York, NY: The Guilford Press; 2010.
  46. Schuster C. Latent-class analysis approaches to determining the reliability of nominal classifications: A comparison between the response-error and the target-type approach. In: Bergeman CS, Boker SM, editors. Methodological issues in aging research. Mahwah, NJ: Lawrence Erlbaum Associates; 2006. pp. 165–184.
  47. Schuster C, Smith DA. Indexing systematic rater agreement with a latent-class model. Psychological Methods. 2002;7:384–395. doi: 10.1037/1082-989x.7.3.384.
  48. Schuster C, Von Eye A. Models for ordinal agreement. Biometrical Journal. 2001;43:795–808.
  49. Schwarz G. Estimating the dimension of a model. Annals of Statistics. 1978;6:461–464.
  50. Sclove SL. Application of model-selection criteria to some problems in multivariate analysis. Psychometrika. 1987;52:333–343.
  51. Shoukri MM. Measures of interobserver agreement. Boca Raton, FL: Chapman & Hall/CRC; 2004.
  52. Spearman C. "General intelligence" objectively determined and measured. American Journal of Psychology. 1904;15:201–293.
  53. Tanner MA, Young MA. Modeling agreement among raters. Journal of the American Statistical Association. 1985;80:175–180.
  54. Uebersax JS, Grove WM. Latent class analysis of diagnostic agreement. Statistics in Medicine. 1990;9:559–572. doi: 10.1002/sim.4780090509.
  55. Von Eye A, Mun EY. Analyzing rater agreement: Manifest variable methods. Mahwah, NJ: Erlbaum; 2005.
