Assessing Validity and Application Scope of the Intrinsic Estimator Approach to the Age-Period-Cohort Problem

Liying Luo

doi:10.1007/s13524-013-0243-z

. Author manuscript; available in PMC: 2016 Nov 30.

Published in final edited form as: Demography. 2013 Dec;50(6):1945–1967. doi: 10.1007/s13524-013-0243-z

Assessing Validity and Application Scope of the Intrinsic Estimator Approach to the Age-Period-Cohort Problem^*

Liying Luo ¹

PMCID: PMC5129181 NIHMSID: NIHMS830472 PMID: 24072610

Abstract

In many different fields, social scientists desire to understand temporal variation associated with age, time period, and cohort membership. Among methods proposed to address the identification problem in age-period-cohort analysis, the Intrinsic Estimator (IE) is reputed to impose few assumptions and to yield good estimates of the independent effects of age, period, and cohort groups. This article assesses the validity and application scope of IE theoretically and illustrates its properties with simulations. It shows that IE implicitly assumes a constraint on the linear age, period, and cohort effects. This constraint not only depends on the number of age, period, and cohort categories but also has non-trivial implications for estimation. Because this assumption is extremely difficult, if not impossible, to verify in empirical research, IE cannot and should not be used to estimate age, period, and cohort effects.

Introduction

For over a century, social scientists have attempted to separate cohort effects from age and period effects on various social phenomena including mortality, disease rates, and inequality (e.g., Mason et al. 1973; Holford 1983; Fu 2000; O’Brien 2000; Winship and Harding 2008). Whereas age effects represent the variation associated with growing older, period effects refer to effects due to social and historical shifts such as economic recessions and prevalent unemployment that affect all age groups simultaneously. Cohort refers to a group of people who experience an event such as birth at the same age. Cohort effects are defined as the formative effects of social events on individuals at a specific period during their life course (Ryder 1965). Age-period-cohort (APC) models, where the three variables are simultaneously considered in a statistical equation, have been the conventional framework for quantifying age, period, and cohort effects. Unfortunately, such APC models suffer from a logical identification problem: once any two of the three variables (age, period, and cohort) are known, the value of the third is determined; this is because Cohort=Period-Age. Because of this exact linear dependency, there exist no valid estimates of the distinct effects of the three variables.

Various methods have been developed to address this identification problem. For example, Mason et al. (1973) introduced the APC multiple classification model and suggested the Constrained Generalized Linear Model (CGLM) as a means of estimating the independent effects of age, period, and cohort. More recently, Fu (2000) and Yang and colleagues (2004) proposed a new APC method, called the Intrinsic Estimator (IE). They recommended IE as “a general-purpose method of APC analysis with potentially wide applicability in the social sciences” (Yang et al. 2008, p1699) on the grounds that IE has desirable statistical properties such as unbiasedness and consistency.

However, in this article I show that IE cannot be used to recover the true age, period, and cohort effects because IE, like CGLM, imposes a constraint on parameter estimation that is difficult to verify using theories or empirical evidence; that is, the validity of IE relies on assumptions that are very difficult to verify in applied practice. In this sense, IE is no better than CGLM. In fact, IE is equivalent to the Principal Component Estimator, an estimator with a potential for bias that was noted by its developer (Kupper et al. 1985). Unfortunately, this has not been understood by the community of demographers, sociologists, and epidemiologists who have used IE in a wide variety of research applications. As I demonstrate below, many researchers have misunderstood what IE actually estimates and how IE estimates should be interpreted, resulting in inappropriate applications of IE in empirical research and potentially misleading substantive conclusions.

This article contributes to the literature in two ways: First, although O’Brien (2011a) clarified that IE assumes a special constraint – the null-vector constraint – on parameters, it is challenging for researchers to fully appreciate and evaluate the appropriateness of this constraint when applying IE in substantive studies. In this article, I derive an easily-understood form of IE’s constraint on the linear components of age, period, and cohort effects so the implications of using IE to estimate the true age, period, and cohort effects can be better understood.¹

Second, while scholars agree that IE is a constrained estimator, they debate whether IE can provide reliable estimates of the true age, period, and cohort trends (see Fu et al. 2011; O’Brien 2011b). I address this debate using several types of simulated data generated based on social theories. By comparing IE estimates to the true effects in various circumstances, I show that IE does not work better than CGLM for recovering the true age, period, and cohort trends in empirical research.

This paper is organized as follows. I begin with an introduction of the APC multiple classification model and the identification problem. While reviewing the methodological challenge that has hampered APC research for decades, this section establishes a framework for discussing the nature and limitations of different constrained APC estimators including IE and CGLM. I then review how IE’s developers have described IE and how applied researchers have understood and used it in substantive studies; the two are often not the same. As a result, many scholars have misunderstood IE, so that this technique has been misused in empirical research. To clarify this common misunderstanding and avoid further misuse, in the section “The Linear Constraint Implied by IE,” I derive the constraint that IE imposes on the linear components of age, period, and cohort effects. In the “Application Scope” section that follows the technical discussion of IE’s linear constraint, I use simulations to demonstrate how this constraint affects estimation of age, period, and cohort effects. Based on these mathematical derivations and simulation evidence, I conclude that IE cannot and should not be used to estimate true age, period, and cohort effects.

The Identification Problem

I first review the identification problem that IE and other constrained estimators are intended to address to develop a framework for understanding the nature of these methods. In APC analysis, researchers have conventionally used the Analysis of Variance (ANOVA) model to separate the independent age, period, and cohort effects:

g (E (Y_{i j})) = μ + α_{i} + β_{j} + γ_{k},

(1)

for age groups i = 1,2,…, a, periods j = 1,2, …, p, and cohorts k = 1,2, …,(a + p − 1), where $\sum_{i = 1}^{a} α_{i} = \sum_{j = 1}^{p} β_{j} = \sum_{k = 1}^{a + p - 1} γ_{k} = 0$ . E(Y_ij)denotes the expected value of the outcome of interest Y for the ith age group in the jth period of time; g is the “link function”; α_i denotes the mean difference from the global mean μ associated with the ith age category; β_j denotes the mean difference from μ associated with the jth period; γ_k denotes the mean difference from μ due to the membership in the kth cohort. The usual ANOVA constraint applies where the sum of coefficients for each effect is set to zero.

For a normally distributed outcome Y_ij, the ANOVA model above can also be written in a generic regression fashion:

Y = X b + ε,

(2)

where Y is a vector of outcomes; X is the design matrix; b denotes a parameter vector with elements corresponding to the effects of age, period, and cohort groups; and ε denotes random errors with distribution centered on zero. Then the estimated age, period, and cohort effects can be obtained using the ordinary least squares (OLS) method:

\hat{b} = {(X^{T} X)}^{- 1} X^{T} Y .

(3)

Unfortunately, the inverse of the matrix (X^TX)⁻¹ does not exist because of the age-period-cohort linear dependency, so the parameter vector b is inestimable. This is the identification problem in APC analysis: no unique set of coefficients can be obtained because an infinite number of solutions give identical fits to the data.

This identification problem can be shown more explicitly. For simplicity, suppose the data we have are perfect, without random or measurement errors, so that ε = 0; then the problem is mathematical rather than statistical, and the regression model is:

Y = X b .

(4)

Due to the linear dependency between age, period, and cohort, there exists a nonzero vector b₀, a linear function of the design matrix X, such that the product of the design matrix and the vector equals zero:

X b_{0} = 0.

(5)

In other words, b₀ represents the null space of the design matrix X, which has dimension equal to one. (The null space has dimension one by the specification of model (1), and the value of b₀ is given below.) It follows that the parameter vector b can be decomposed into components:

b = b_{1} + s \cdot b_{0},

(6)

where s is an arbitrary real number corresponding to a specific solution to equation (4), and b₁ is a linear function of the parameter vector b, corresponding to the projection of b on the non-null space of the design matrix X, orthogonal to the null space. b₁ and b₀ are thus orthogonal to each other. That is, b₁ is the part of b that is in the non-null space of the design matrix X, orthogonal (perpendicular) to the null space, so that b₀ is orthogonal to b₁, i.e., b₁ · b₀ = 0.

Given equations (4) and (6), the following equation must hold:

Y = X b = X (b_{1} + s \cdot b_{0}) = X b_{1} + s \cdot X b_{0} .

(7)

But Xb₀ = 0 and thus s · Xb₀ = 0, so equation (7) is true for all values of s. That is, s can be any real number, and each distinct value of s gives a distinct solution to equation (4). Therefore, an infinite number of possible solutions for b exist, and no solution can be deemed the uniquely preferred or “correct” solution without additional constraints on b.

To illustrate, suppose the data have three age groups, three periods, and five cohorts and that error is zero for ease of presentation (and without loss of generality). Table 1 presents three different parameter vectors b^T = (u, α₁, α₂, α₃,β₁, β₂, β₃, γ₁, γ₂, γ₃, γ₄, γ₅) arising from three different values of s, namely 0, 2, and 10. In Table 2’s top panel, the observed value in each cell is represented in terms of the unknown parameters α_i, β_j, and γ_k,. Table 2’s bottom panel shows the fitted values u + α_i + β_j + γ_k, based on Table 1’s three different s’s in the same tabular form as above. Note that these three sets of fitted values are identical although the parameter vectors in Table 1 differ. In fact, these parameter vectors are not just different; their age and period effects change directions depending on s, and the data cannot distinguish between different s’s.

Table 1.

Different Values of s and the Corresponding Parameters.

s	Age			Period			Cohort
s	α₁	α₂	α₃	β₁	β₂	β₃	γ₁	γ₂	γ₃	γ₄	γ₅
0	2	0	−2	−1	0	1	−1	−0.5	0	0.5	1
2	0	0	0	1	0	−1	−5	−2.5	0	2.5	5
10	−8	0	8	9	0	−9	−21	−10.5	0	10.5	21

Open in a new tab

Notes:

s is an arbitrary real number corresponding to a specific solution to equation (4).
Numbers in each row are a set of age, period, and cohort coefficients corresponding to a specific value of s.

Table 2.

Tabular Data: Unobserved parameters and fitted values from Table 1’s three different parameter vectors.

			Period
			1	2	3
Unobserved Parameters	Age	1	μ+α₁+β₁+γ₃	μ+α₁+β₂+γ₄	μ+α₁+β₃+γ₅
		2	μ+α₂+β₁+γ₂	μ+α₂+β₂+γ₃	μ+α₂+β₃+γ₄
		3	μ+α₃+β₁+γ₁	μ+α₃+β₂+γ₂	μ+α₃+β₃+γ₃

Observed Values	Age	1	11	12.5	14
		2	8.5	10	11.5
		3	6	7.5	9

Open in a new tab

Note: The bottom panel presents identical observed values produced by the three different parameter vectors in Table 1.

Taken together, Tables 1 and 2 show that for a single dataset, an infinite number of possible solutions for age, period, and cohort effects exist, and each solution corresponds to a specific value of s. Therefore, any solution, or alternatively, none of these solutions, can be viewed as reflecting the “true” effects even though different values of s give radically different age, period, and cohort effects. In social science research, data inevitably contain random and/or measurement errors so researchers will not have the perfect fit of the idealized data above; however, the fundamental identification problem remains. Various methods have been developed to address the identification problem and find a set of uniquely preferred estimates. In the section below, I will consider IE and other solutions to the identification problem that impose a constraint on b.

The Constrained Approach: IE and CGLM

A large body of literature dating back to the 1970s has addressed the identification problem. Mason et al. (1973) explicated the “identification problem” in APC analysis and proposed the Constrained Generalized Linear Model (CGLM), a coefficient-constrained approach that has been used as a conventional method for APC analysis. This method places at least one identifying restriction on the parameter vector b in equation (2). Usually the effects of the first two age groups, periods, or cohorts are constrained to be equal based on theoretical or external information. With this additional constraint, the APC model becomes just-identified and unique OLS and maximum likelihood (ML) estimators exist. However, such theoretical information often does not exist or cannot easily be verified. Also different choices of identifying constraint can produce widely different estimates for age, period, and cohort effects. That is, CGLM estimates are quite sensitive to the choice of constraints (Rodgers 1982a,b; Glenn 2005).

More recently, a group of scholars has developed a new APC estimator, called the Intrinsic Estimator (IE). They argued that IE has clear advantages over CGLM (called “CGLIM” in Yang et al. 2008) and can produce valid estimates of the true age, period, and cohort effects (see Fu 2000, 2006; Yang et al. 2004, 2008). The most compelling evidence they provided to support this claim is simulation results where IE and CGLM estimates were compared to the true effects of age, period, and cohort (see Yang et al. 2008, p1718-1719). They concluded that IE outperforms CGLM because IE estimates are closer to the true parameters that generate the data than CGLM (ibid., p1719-1722).

This evidence could easily be interpreted as confirmation that IE produces unbiased estimates of the true age, period, and cohort effects. Unfortunately, few clarifications are provided and the developers of IE are sometimes unclear about what IE actually estimates themselves. For example,

“for a finite number of time periods of data, the IE produces an unbiased estimate of the coefficient vector.”

(Yang 2008, p400)

“Because of its estimability and unbiasedness properties, the IE may provide a means of accumulating reliable estimates of the trends of coefficients across the categories of the APC accounting model.”

(Yang et al. 2008, p1711)

“[T]he IE, by its very definition and construction, satisfies the estimability condition. … If other estimators do indeed satisfy the estimability condition, then they also produce unbiased estimates of the A, P, and C effect coefficients. If not, then the estimates they produce are biased.” (ibid., p1710)

“[P]erhaps most importantly for empirical applications of APC analysis, the IE produces estimated age, period, and cohort coefficients and their standard errors in a direct way, without the necessity of choosing among a large array of possible constraints on coefficients that may or may not be appropriate for a particular analysis.”

(Yang et al. 2004, p105)

Many researchers doing substantive APC analyses have interpreted these and other statements to mean that IE produces unbiased estimates of true age, period, and cohort effects. Consequently, they have used IE in empirical research to address substantive issues including mortality, disease, and religious activity (e.g., Keyes and Miech 2013; Winkler and Warnke 2012; Schwadel 2011; Langley et al. 2011; Miech et al. 2011). These authors seem convinced that IE produces unbiased estimates of age, period, and cohort effects. For example,

“[r]ecent advances in modeling APC effects with repeated cross-sectional data allow age, period, and cohort effects to be simultaneously estimated without making subjective choices requiring constraining data or dropping age, period, or cohort indicators from the model. In particular, APC intrinsic estimator models provide unbiased estimates of regression coefficients for age groups, time periods, and birth cohorts (Fu, 2000).”

(Schwadel 2011, p183)

“[T]he intrinsic estimator provides unbiased estimates of age, period, and cohort effects.” (ibid., p184)

“The IE model has been recommended as a better alternative to the widely discussed constrained generalized linear model (CGLM) (Yang et al. 2004). We used the IE model to estimate individual effects of age, period, and cohort for males and females separately.”

(Langley et al. 2011, p106)

“The IE is an approach that places a constraint on the model, but not a constraint that affects the estimation of regression parameters for age, period, and cohort in any way. That is, the regression parameter estimates are unbiased by the constraint placed, and a unique set of regression estimates can be estimated.”

(Keyes and Miech 2013, p2)

Unfortunately, claims of this sort are incorrect; as I demonstrate below, IE does impose constraints that are as consequential as those imposed by CGLM. To help researchers better understand the constraint imposed by IE and make informed decisions in choosing an APC estimator, I will first derive an easily-understood form of IE’s constraint. Because an unbiased and consistent estimator is desirable and necessary to produce reliable and valid results, I will then address how IE’s constraint affects these key properties: unbiasedness (Is the expectation of IE the “true” age, period, and cohort effects?) and consistency (As the sample size increases, does IE converge to the “true” effects?).

The Linear Constraint Implied by IE

To understand IE’s constraint and its implications for estimation, it is helpful to review IE’s conceptual foundation and computational algorithm. IE can be viewed as an extension of Principal Component (PC) Analysis, a multi-purpose technique that can be used to deal with identification problems when explanatory variables are highly correlated. By transforming correlated explanatory variables to a set of orthogonal linear combinations of these variables, called principal components, PC analysis can be a useful tool for reducing data redundancy and developing predictive models.

In contrast, the goal of IE is neither data reduction nor prediction, but estimation of the effects of, and capturing the general trends of, age, period, and cohort.² IE’s computational algorithm includes five steps: (a) transform the design matrix X to the PC space using its eigenvector matrix; (b) in the PC space, identify the “null eigenvector” – the special eigenvector that corresponds to an eigenvalue of zero – and the corresponding null subspace (with one dimension) and non-null subspace (with m − 1 dimensions, where m denotes the number of coefficients to be estimated); (c) in the non-null subspace of m − 1 dimensions, regress the outcome of interest using OLS or ML on the m − 1 PCs to obtain m − 1 coefficient estimates; (d) extend the m − 1 coefficient estimates to the whole PC space of dimension m by adding an element corresponding to the null eigenvector direction and arbitrarily setting it to zero; and (e) use the eigenvector matrix to transform the extended coefficient vector estimated in the PC space, including the added zero element, back to the original age-period-cohort space to obtain estimates for age, period, and cohort effects (see Yang 2004; Yang et al. 2008).³

The fourth step, “extend the m − 1 coefficient estimates to the whole PC space of dimension m by adding an element corresponding to the null eigenvector direction and arbitrarily setting it to zero,” carries the key assumption of the IE approach to APC analysis. This assumption is implicit yet has major implications for the validity and application of the IE approach. Specifically, setting the “coefficient of the null eigenvector”, s, to zero is equivalent to assuming

b \cdot b_{0} = 0,

(8)

i.e., the projection of b on b₀ is zero, where b and b₀ were defined in equation (6). Kupper and colleagues (1985) provided a closed-form representation for the eigenvector b₀. Using vector notation,⁴

b_{0} = {(0, A, P, C)}^{T},

(9)

where

\begin{array}{l} A = (1 - \frac{1 + a}{2}, \dots, (a - 1) - \frac{1 + a}{2}) \\ P = - 1 \cdot (1 - \frac{1 + p}{2}, \dots, (p - 1) - \frac{1 + p}{2}) \\ C = (1 - \frac{a + p}{2}, \dots, (a + p - 2) - \frac{a + p}{2}) . \end{array}

For example, when a = 3 and p = 3, that is, for three age groups and three time periods, b₀ is

b_{0} = {(0, - 1, 0, 1, 0, - 2, - 1, 0, 1)}^{T},

(10)

where A = (−1,0), P = (1,0), and C = (−2,−1,0,1).

What does equation (8) mean? What is the specific form of this constraint for datasets with varying number of age, period, and cohort groups? To illustrate, suppose that age, period, and cohort each have effects on the outcome variable that show a linear trend. Denote these trends as k_a, k_p, and k_c, respectively, the intercepts for the three variables as i_a, i_p, and i_c, and the overall mean as μ. Thus the effects associated with the three age categories are i_a, i_a + k_a, and i_a + 2 · k_a, respectively. Similarly, the effects related to the three periods are i_p, i_p + k_p, and i_p + 2 · k_p, respectively. For the five cohorts, the effects are i_c, i_c + k_c, i_c + 2 · k_c, i_c + 3 · k_c, and i_c + 4 · k_c, respectively. Then the parameter vector, b, can be written as:

b = {(μ, i_{a}, i_{a} + k_{a}, i_{p}, i_{p} + k_{p}, i_{c}, i_{c} + k_{c}, i_{c} + 2 \cdot k_{c}, i_{c} + 3 \cdot k_{c})}^{T},

(11)

where the last category of each variable is omitted as the reference group. According to the constraint for age effects in model (1), we know that

\sum_{i = 1}^{a} α_{i} = i_{a} + (i_{a} + k_{a}) + (i_{a} + 2 \cdot k_{a}) = 3 \cdot i_{a} + 3 \cdot k_{a} = 0,

(12)

which implies that

i_{a} = - k_{a} .

(13)

Similarly, it can be shown using the constraint for period and cohort effects in model (1) that

i_{p} = - k_{p},

(14)

and

i_{c} = - 2 \cdot k_{c} .

(15)

Using equations (13), (14), and (15), equation (11) can be simplified as:

b = {(μ, - k_{a}, 0, - k_{p}, 0, - 2 \cdot k_{c}, - k_{c}, 0, k_{c})}^{T} .

(16)

Since the constraint that IE implicitly imposes is b · b₀ = 0, by equations (8), (10) and (16), the specific form of IE’s linear constraint (LC) for APC data with three age categories, three periods, and five cohorts are

b \cdot b_{0} = μ \cdot 0 + (- k_{a}) \cdot (- 1) + 0.0 + (- k_{p}) \cdot 1 + 0 \cdot 0 + (- 2 \cdot k_{c}) \cdot (- 2) + (- k_{c}) \cdot (- 1) + 0.0 + k_{c} \cdot 1 = k_{a} - k_{p} + 6 \cdot k_{c} = 0.

(17)

In other words, when age, period, and cohort show linear trends, IE’s implicit constraint is that these linear trends must satisfy equation (17). If, in fact, the true age, period, and cohort trends do not satisfy this equation, then the implicit LC imposed by IE is incorrect.

To illustrate the implications of IE’s LC, I simulate normally distributed data as follows. For those at age i in period j, the mean response is 10 + k_a · age_i + k_p · period_j + k_c · cohort_ij and the standard deviation of error ε equals 0.1. The number of age and period groups is fixed at three each. I consider three sets of true k_a, k_p, and k_c: (a)k_a = 1, k_p = 7, k_c = 1; (b)k_a = 1, k_p = 7, k_c= 10; and (c)k_a = 3, k_p = 1, k_c = 4. For each selection of true k_a, k_p, and k_c, I simulate 1,000 such data sets by drawing random errors. As shown in Table 3, for dataset 1 the true effects for the three age categories are −1,0, and 1, respectively, so k_a, the linear trend in age effects, equals 1. The period effects are −7,0, and 7, respectively, so k_p is 7. Similarly, since the cohort effects are −2,−1,0,1, and 2, k_c is 1. Note that for this dataset,

Table 3.

Simulation Results: IE estimates for three datasets.

		Dataset 1			Dataset 2			Dataset 3

		Truth	IE	Bias	Truth	IE	Bias	Truth	IE	Bias
Age	1	−1	−0.997	0.003	−1	5.747	6.747	−3	0.249	3.249
	2	0	−0.002	−0.002	0	0.002	0.002	0	0.000	0.000
	3	1	0.999	−0.001	1	−5.749	−6.749	3	−0.249	−3.249

Period	1	−7	−6.999	0.001	−7	−13.75	−6.750	−1	−4.250	−3.250
	2	0	−0.002	−0.002	0	0.002	0.002	0	−0.002	−0.002
	3	7	7.002	0.002	7	13.748	6.748	1	4.252	3.252

Cohort	1	−2	−2.001	−0.001	−20	−6.497	13.503	−8	−1.500	6.500
	2	−1	−0.998	0.002	−10	−3.253	6.747	−4	−0.750	3.250
	3	0	−0.001	−0.001	0	0.002	0.002	0	0.000	0.000
	4	1	1.004	0.004	10	3.250	−6.750	4	0.750	−3.250
	5	2	1.996	−0.004	20	6.498	−13.502	8	1.500	−6.500

Open in a new tab

Notes:

1
For each dataset, the IE estimates are averaged over 1,000 simulations.
2
The bias of IE is evaluated by the difference between the true effects and the IE estimates, averaged over 1,000 simulations.
2
Equation (17) holds for dataset 1, but does not hold for datasets 2 and 3.

k_{a} - k_{p} + 6 \cdot k_{c} = 1 - 7 + 6 \cdot 1 = 0,

(18)

i.e., the relationship between the linear trends in the true age, period, and cohort effects satisfies equation (17), the LC implicit in IE. However, for datasets 2 and 3 generated by the other sets of true k_a, k_p, and k_c in Table 3, equation (17) does not hold. Specifically, for the second set, k_a = 1, k_p = 7, and k_c = 10, so

k_{a} - k_{p} + 6 \cdot k_{c} = 1 - 7 + 6 \cdot 10 = 54 \neq 0;

(19)

And for the third set, k_a = 3, k_p = 1, and k_c = 4, so

k_{a} - k_{p} + 6 \cdot k_{c} = 3 - 1 + 6 \cdot 4 = 26 \neq 0.

(20)

Table 3 presents IE estimates, averaged over the 1,000 simulated datasets, for the three sets of age, period, and cohort effects. The bias of IE is estimated by the difference between the truth and the averaged IE estimates. Table 3 shows that for dataset 1, IE yields good estimates because the true k_a, k_p, and k_c in the data satisfy equation (17), the implicit LC that IE imposes. Specifically, the estimated slopes for age, period, and cohort are k̂_a = 0.999, k̂_p = 7.001, and k̂_c = 1.000, respectively. In contrast, IE returns highly biased estimates, very different from the true effects, for the second and third datasets because the true k_a, k_p, and k_c do not satisfy IE’s LC. For example, for datasets 2 and 3, the estimated age effects, averaged over the 1,000 simulations, show a downward trend (k̂_a = −5.750 for dataset 2 and k̂_a = −2.582 for dataset 3) when the true trend is upward (the true age slopes are k_a = 1 for dataset 2 and k_a = 3 for dataset 3).

Note that equation (17) is derived for the simplest scenario where the age, period, and cohort trends are purely linear. For more complex scenarios where these trends are not purely linear, IE’s constraint depends on the non-linear components of the age, period, and cohort effects.⁵ For example, suppose that age, period, and cohort each have effects on the outcome of interest that include a linear and a quadratic trend. Denote the quadratic trends as $k_{a}^{'}, k_{p}^{'}$ , and $k_{c}^{'}$ , respectively. Using the same derivation above, the specific form of IE’s constraint for APC data with three age categories, three periods, and five cohorts is

(k_{a} - k_{p} + 6 \cdot k_{c}) + \frac{5}{3} (k_{a}^{'} - k_{p}^{'} + 12 \cdot k_{c}^{'}) = 0.

(21)

That is, when age, period, and cohort effects include quadratic components, these effects must satisfy equation (21) in order for IE to yield good estimates. Equation (17) can be viewed as a special case of equation (21) when there are no quadratic or higher-order non-linear components in the age, period, and cohort effects. Alternatively, because the linear dependency between age, period, and cohort does not affect the identification of nonlinear effects, IE’s constraint can be said to bind only on the linear age, period, and cohort trends, and the specific value of the constraint on the linear effects is determined by the non-linear effects, which are estimable.

For any coefficient-constraint approach such as CGLM and IE, “the choice of constraint is the crucial determinant of the accuracy in the estimated age, period, and cohort effects” (Kupper et al. 1985, p822). Since the constraint assumption strongly affects estimation results, no matter what constraint a statistical method assumes, that method produces good estimates only when its assumption approximates the true structure of the data under investigation. It follows that when there are three age groups, three periods, and five cohorts and their effects are purely linear, IE can only yield accurate estimates when these linear effects of age, period, and cohort satisfy equation (17). Unfortunately, researchers usually have no a priori knowledge about true age, period, and cohort effects that would allow them to evaluate whether the constraint implied in equation (17) holds. Therefore, researchers cannot assess whether IE produces unbiased estimates of age, period, and cohort effects for their data. Thus IE is no better than CGLM in this respect.

More importantly, the exposition above indicates that the LC assumed by IE also depends on the design matrix, i.e., on the number of age, period, and cohort groups. For example, if we add one age group to our example, such that we now have four age groups, three periods, and six cohorts, then following the same derivation used above, the LC implied by IE is

b \cdot b_{0} = 2.75 \cdot k_{a} - k_{p} + 11.25 \cdot k_{c} = 0,

(22)

b \cdot b_{0} = (k_{a} - k_{p} + 6 \cdot k_{c}) + (1.75 \cdot k_{a} + 5.25 \cdot k_{c}) = 0.

(23)

Compared to equation (17) for the case of three age groups, three periods, and five cohorts, equations (22) and (23) show that adding an age group dramatically changes the constraint so that the true effects satisfying IE’s LC with three age categories no longer satisfy this LC when an age category is added. Readers can verify that increasing or reducing the number of periods or cohorts also greatly alters IE’s LC.

These examples demonstrate that not only does IE rely on a constraint like CGLM does, but unlike CGLM — where the constraint (e.g., equal effects for the first two age groups) is explicit and rationalized by theoretical account or side information — the LC of IE is implicit and varies depending on the number of age, period, and cohort groups. Although this constraint has been described as minimal (e.g., Schwadel 2011; Yang et al. 2008), in fact, as shown, it can have major implications for the quality of substantive results.

Theoretically speaking, the limitation of IE results from a misinterpretation of the constraint that IE imposes on parameter estimation. It is true that b₀, the null eigenvector, is determined by the design matrix, but it is incorrect to conclude that therefore b₀ “should not play any role in the estimation of effect coefficients” (Yang et al. 2008, p1705). Rather, both the null eigenvector and non-null eigenvectors (with nonzero eigenvalues) are determined by the design matrix, that is, by the number of age, periods, and cohort groups. To this extent, it is no less likely that the data contain a significant component in the b₀ direction than in the directions of the non-null eigenvectors. The fact that s, the coefficient for b₀, can be any real number without changing the fitted values Xb simply means that variation in Y in the direction of b₀ is not estimable. If the data have variation in this direction, IE will mistakenly attribute that variation to other columns in the design matrix, causing significant errors in estimation.

The Implications of IE’s Constraint: Is IE an unbiased and consistent estimator?

Because IE imposes a constraint on the linear age, period, and cohort trends, IE yields reliable estimates only when the true trends satisfy its constraint. However, Yang and colleagues argue that “[b]ecause of its estimability and unbiasedness properties, the IE may provide a means of accumulating reliable estimates of the trends of coefficients across the categories of the APC accounting model” (ibid., p1711). In the discussion below, I clarify that IE is not an unbiased estimator of the “true” age, period, and cohort effects. I also use concrete examples to illustrate that IE is not consistent and explain why IE appears to be converging to the truth in Yang et al. (2008)’s article. This section may be particularly helpful for non-technical researchers.

Biasedness

By definition, an estimator δ is an unbiased estimator of a parameter θ if the expectation of δ over the distribution that depends on θ is equal to θ, or E_θ (δ) = θ. It follows that, for an unbiased APC estimator, its expectation must be the true effects of age, period, and cohort.⁶ Per this definition, if IE is an unbiased estimator, the expected value of IE must be the true age, period, and cohort effects. The following mathematical computation shows, however, that the expectation of the IE estimator is not the true effects unless those true effects happen to satisfy IE’s implicit constraint.

As noted in the section above, the key computation of IE is to extend the coefficient estimates in the PC space, b′

{(b^{'})}^{T} = (b_{0}^{'}, b_{1}^{'}, b_{2}^{'}, \dots, b_{m - 1}^{'})

(24)

by adding a zero element such that

{(b_{new}^{'})}^{T} = (b_{0}^{'}, b_{1}^{'}, b_{2}^{'}, \dots, b_{m - 1}^{'}, 0),

(25)

where $b_{new}^{'}$ corresponds to the projection of the coefficient vector b in the non-null space, i.e., b₁ in equation (6). IE then transforms the extended coefficient vector $b_{new}^{'}$ including the added zero element, back to the original age-period-cohort space to obtain coefficient estimates for age, period, and cohort.

Given that OLS and ML estimators have been proven unbiased in simpler — identifiable — problems with normally distributed errors as in equation (2), and since IE uses these methods to obtain estimates for b₁, whose projection in the PC space corresponds to the extended coefficient vector $b_{new}^{'}$ , IE yields unbiased estimates for b₁. In other words,

E (b_{I E}) = b_{1} .

(26)

Based on the preceding discussion of the identification problem, the true parameter space b can be decomposed into two orthogonal subspaces corresponding to b₁ and b₀ in equation (6), which is equivalent to

b_{1} = b - s \cdot b_{0} .

(27)

Substituting equation (27) in (26) results in

E (b_{I E}) = b_{1} = b - s \cdot b_{0} .

(28)

Equation (28) means that the expectation of the IE estimator will be different from the true effects b unless s · b₀ = 0, i.e., unless s = 0. IE assumes s = 0; thus, IE is a biased estimator when the true value of s is anything but 0. The larger the absolute value of s, the more biased the IE estimates become.

For researchers who wish to investigate age, period, and cohort effects for the purposes of substantive demographic, social, or other applied research, there exists little theoretical or empirical knowledge about the value of s and what b₀, the “null eigenvector,” may imply about the outcome variable. In specific applications, then, IE must be assumed to be biased, resulting in misleading conclusions about the true age, period, and cohort effects unless proven otherwise.

Note that IE’s developers argue that IE satisfies the “estimability criterion” proposed by Kupper et al. (1985), so IE is in that sense an unbiased estimator. However, estimability of a function of b implies unbiased estimation only of the estimable function of b, not necessarily of the true parameter b itself. b₁, the projection of the parameter vector onto the non-null space, is indeed an estimable function of b, the true parameter vector, and thus IE is an unbiased estimator of b₁. But IE is a biased estimator for the true APC effects when b₁ is different from b. Therefore, it is not accurate to say that “Kupper et al. (1985) … suggested that an estimable function satisfying this condition resolves the identification problem” as claimed in Yang and associates (2008, p1703). To emphasize, estimability in the non-null space does not imply unbiasedness in estimating the true age, period, and cohort effects. Discovering a set of estimable functions is not the same as solving the identification problem.

Consistency

In statistics, for an estimator δ to be a consistent estimator of an unknown parameter space θ, δ must converge in probability to θ as the sample size grows. If δ is unbiased, consistency usually follows immediately. A biased estimator can be consistent if its bias decreases as the sample size increases. However, the bias of IE, s · b₀, does not necessarily shrink as the sample size grows. Thus, IE is not a consistent estimator of the coefficient vector b.

This theoretical argument can be illustrated with simulations. I simulate normally-distributed data using the same function as that for Dataset 1 in Table 3: For those at age i in period j, the mean response is 10 + 1 · age;_i + 7 · period_j + 1 · cohort_ij and standard deviation of error = 0.1. I begin with three age groups and three periods, and then increase the number of periods to six and 12, respectively. For each scenario, I simulate 1,000 such datasets by drawing random errors. If IE is a consistent estimator, as the number of periods increases, the resulting estimates should get closer and closer to the true effects that we know based on the simulation function.

Table 4 presents the IE estimates, averaged over 1,000 datasets, for the three scenarios in which the number of periods is set at three, six, and 12, respectively. It shows that the IE estimates are not converging to the truth and the bias appears to increase as the number of periods increases from three to 12. Specifically, when p, the number of periods, equals six and 12, although IE correctly captures the direction of the age, period, and cohort trends, there is no evidence that these estimates are converging to the truth; the estimated age, period, and cohort slopes are k̂_a = 2.144, k̂_p = 5.857, and k̂_c = 2.144, respectively, when p = 6; k̂_a = 3.017, k̂_p = 4.983, and k̂_c = 3.017 when p increases to 12. In fact, even with an unrealistically large number of periods (e.g., 100 periods), as I show in Appendix Figure 1, the IE estimates do not appear to converge to the truth.

Table 4.

Simulation Results: Inconsistent IE estimates as the number of periods increases.

		Periods=3			Periods=9			Periods=12

		Truth	IE	Bias	Truth	IE	Bias	Truth	IE	Bias
Age	1	−1	−0.997	0.003	−1	−2.144	−1.144	−1	−3.016	−2.016
	2	0	−0.002	−0.002	0	0.000	0.000	0	−0.000	−0.000
	3	1	0.999	−0.001	1	2.144	1.144	1	3.017	2.017

Period	1	−7	−6.999	0.001	−17.5	−14.642	2.858	−38.5	−27.407	11.093
	2	0	−0.002	−0.002	−10.5	−8.783	1.717	−31.5	−22.427	9.073
	3	7	7.002	0.002	−3.5	−2.931	0.569	−24.5	−17.441	7.059
	4				3.5	2.928	−0.572	−17.5	−12.462	5.038
	5				10.5	8.785	−1.715	−10.5	−7.470	3.030
	6				17.5	14.643	−2.857	−3.5	−2.493	1.007
	7							3.5	2.494	−1.006
	8							10.5	7.473	−3.027
	9							17.5	12.455	−5.045
	10							24.5	17.441	−7.059
	11							31.5	22.425	−9.075
	12							38.5	27.412	−11.088

Cohort	1	−2	−2.001	−0.001	−3.5	−7.497	−3.997	−6.5	−19.612	−13.112
	2	−1	−0.998	0.002	−2.5	−5.362	−2.862	−5.5	−16.590	−11.090
	3	0	−0.001	−0.001	−1.5	−3.214	−1.714	−4.5	−13.575	−9.075
	4	1	1.004	0.004	−0.5	−1.071	−0.571	−3.5	−10.558	−7.058
	5	2	1.996	−0.004	0.5	1.074	0.574	−2.5	−7.542	−5.042
	6				1.5	3.215	1.715	−1.5	−4.525	−3.025
	7				2.5	5.353	2.853	−0.5	−1.506	−1.006
	8				3.5	7.502	4.002	0.5	1.509	1.009
	9							1.5	4.529	3.029
	10							2.5	7.542	5.042
	11							3.5	10.559	7.059
	12							4.5	13.575	9.075
	13							5.5	16.589	11.089
	14							6.5	19.604	13.104

Open in a new tab

The developers of IE correctly note that the estimation of period and cohort effects will not improve with more time periods because “adding a period to the data set does not add information about the previous periods or about cohorts not present in the period just added” (ibid., p1718). However, when they simulated data, the IE estimates for age effects did appear to become closer and closer to the true values as the number of periods increased. They simulated data using the following function:

y_{i j} ~ Poisson {exp [0.3 + 0.1 {({age}_{i} - 5)}^{2} + 0.1 sin ({period}_{j}) + 0.1 cos ({cohort}_{i j}) + 0.1 sin (10 \cdot {cohort}_{i j})]} .

(29)

It appears that IE estimates of the age effects converge to the true effects in this simulation as the number of periods increases because IE’s implicit LC is not satisfied by the “true” age, period, and cohort effects in the simulation mechanism (29) with five periods (b · b₀ = −0.339), but the true effects do approximately satisfy the LC (b · b₀ = −0.036) when the number of periods increases to 50. In other words, IE appears to perform better as the number of periods increases not because IE is a consistent procedure but because the true effects used in the data-generating function (29) conform better to IE’s implicit LC as the number of periods increases.

For demographic or social data where the linear trends in the three variables are unknown, adding more periods or cohorts promises nothing about the accuracy of the coefficient estimation for either age or period or cohort effects. That is, even with a sufficiently large sample, researchers using IE to estimate the true age, period, and cohort effects are not guaranteed to have desirable results that are close to the true values.

Application Scope: IE vs. CGLM

The preceding discussions of IE’s linear constraint (LC) and statistical properties are fairly technical. In this section, I will use several types of simulated data to illustrate how the implicit LC of IE affects its ability to recover the underlying age, period, and cohort effects in social science research⁷. This exercise is important because scholars have debated the application scope of IE in empirical research. As Fu and associates (2011) suggested, “the important statistical issue about APC modeling is how to identify the trend that helps to resolve the real-world problem for a given APC data set” (p455). So I examine whether, compared to CGLM, IE yields better (if not unbiased) estimates of the true age, period, and cohort patterns that may be observed in empirical research.

IE’s developers provided simulations in which IE estimates are closer to the true age, period, and cohort effects than CGLM results. This, they argued, supports their conclusion that IE has clear advantages over CGLM. However, as noted above, the true age, period, and cohort effects in Yang et al.’s (2008) simulation in fact approximately satisfy the LC that IE imposes (b · b₀ = −0.036)⁸. For age, period, and cohort effects that do not satisfy IE’s implicit constraint, IE will not necessarily perform better than CGLM and may perform much worse. Thus, IE is no better than CGLM because the restriction that IE imposes is essentially no different from the constraints assumed in CGLM.

To illustrate, I show simulations, as Yang and colleagues did, to compare the CGLM and IE estimates. However, here the data-generating mechanisms satisfy the constraint assumed by CGLM but not the constraint assumed by IE. Moreover, I simulate data from four models that embody specific social theories and thus conform to empirical reality. The first dataset is simulated to represent the observation that overall health for adults deteriorates as they grow older, and that while recent development in health knowledge and technology have improved health conditions for the entire population, people born in more recent years are likely to be healthier than older cohorts. On the other hand, the demographic literature has also suggested that age, period, or cohort effects may not all exist (Alwin 1991; Winship and Harding 2008; Fabio et al. 2006; Preston and Wang 2006). Accordingly, the other three simulations approximate likely empirical situations where one of the three variables has little impact on the outcome variable.

Specifically, I fix the number of age groups at nine and periods at 50 in all of these simulations with little loss of generality. I then generate 1,000 datasets from each of the following four models:

y_{i j} ~ Normal {10 + 2 \cdot {age}_{i} - 0.5 \cdot {age}_{i}^{2} + 1 \cdot {period}_{j} - 0.015 \cdot {period}_{j}^{2} + 0.15 \cdot {cohort}_{i j} + 0.03 \cdot {cohort}_{i j}^{2}, σ = 0.1}

(30)

y_{i j} ~ Normal {10 + 1 \cdot {period}_{j} - 0.015 \cdot {period}_{j}^{2} + 0.15 \cdot {cohort}_{i j} + 0.03 \cdot {cohort}_{i j}^{2}, σ = 0.1}

(31)

y_{i j} ~ Normal {10 + 2 \cdot {age}_{i} - 0.5 \cdot {age}_{i}^{2} + 0.15 \cdot {cohort}_{i j} + 0.03 \cdot {cohort}_{i j}^{2}, σ = 0.1}

(32)

y_{i j} ~ Normal {10 + 2 \cdot {age}_{i} - 0.5 \cdot {age}_{i}^{2} + 1 \cdot {period}_{j} - 0.015 \cdot {period}_{j}^{2}, σ = 0.1}

(33)

For instance, in equation (30), the outcomes for people with age in period are normally distributed with mean ( $10 + 2 \cdot {age}_{i} - 0.5 \cdot {age}_{i}^{2} + 1 \cdot {period}_{j} - 0.015 \cdot {period}_{j}^{2} + 0.15 \cdot {cohort}_{i j} + 0.03 \cdot {cohort}_{i j}^{2}$ ) and standard deviation σ = 0.1. In equations (31), (32), and (33), one of the age, period, and cohort effects is not present while the effects for the other two variables are the same as in equation (30). Note that none of these models satisfies IE’s constraint; specifically, for the first model, b · b₀ = 115.01; for the second, third, and last model, b · b₀ = 115.72, 130.41 and 16.12, respectively.

Figure 1 compares, for the simulated data from the four models, IE estimates and CGLM estimates using two different constraints. The IE estimates, averaged over 1,000 datasets, are largely away from the true effects for all models because for all four models, the constraint that IE assumes is not satisfied. For example, in Scenario 3 in Figure 1, when there is no period effect in the data-generating mechanism (32), the IE estimates suggest a substantially positive period trend on top of inaccurate estimates for age and cohort effects. In contrast, the CGLM assuming equal age effects for the first and third age groups produces close estimates for all four models. It is equally important to note that the performance of the CGLM estimator also depends on whether its assumption approximates the truth. For instance, in Scenario 4, whereas the CGLM that assumes equal age effects for the first and third group yields good estimates, the same method with a different constraint, i.e., the age effects are the same for the first and second groups, results in biased estimates.

Simulation Results: IE vs. CGLM estimates for age, period, and cohort effects.

In sum, it must be concluded that a) if there is a priori information or theoretical justification, the constrained solution that corresponds to such information (e.g., CGLM estimates assuming equal effects for the first and third age groups in data-generating functions (30) to (33)) will yield better estimates than IE, and b) without such a priori knowledge, IE is not necessarily better than other constrained estimators including CGLM. Without such knowledge, neither IE nor CGLM results are valid.

Conclusion and Discussion

In this article, I focus on the Intrinsic Estimator (IE), a statistical method intended to separate the independent effects of age, period, and cohort on various outcomes. I have discussed the nature and application scope of IE theoretically and illustrated it with simulated data. This article has shown that IE assumes a specific constraint on the linear age, period, and cohort effects. This assumption not only depends on the number of age, period, and cohort groups, but also is extremely difficult, if not impossible, to verify in empirical research. This feature of IE is no different from the constraint assumed in CGLM except that the CGLM constraint does not change automatically as the numbers of age, period, and cohort groups change. The conclusion is that IE is not an unbiased or consistent estimator of the “true” age, period, and cohort effects. Therefore, for demographers and social scientists whose goal is to understand the “true, simultaneously independent effects” of age, period, and cohort, IE’s strategy of circumventing the identification problem can yield biased and potentially misleading estimates.

There is no doubt that Yang and associates have revitalized APC research and inspired many scholars. However, IE is nothing new in APC analysis. Kupper and his colleagues introduced the IE solution to APC analysts, calling this solution as the Principal Component Estimator (PCE) (Kupper et al. 1983, p2795-2797). As O’Brien (2011a, p420) noted, such an estimator “produces coefficients identical to those of the recently introduced intrinsic estimator.” However, instead of concluding that IE is preferable to CGLM, Kupper et al. (1983) clearly stated that PCE (that is, IE) “could lead to more bias than the use of some other constraints” (p2797). As a result, Kupper and associates did not advocate PCE/IE as a general solution, then or subsequently.

Generally speaking, PCE/IE or any other constrained estimator provides just one possible solution from the infinite number of solutions for an under-determined problem (i.e., the rank deficiency problem in APC analysis). That said, the PCE/IE solution should not be regarded as the true solution or the uniquely preferred solution without theoretical justification. In fact, the statistical literature has recognized a variety of constrained estimators including other types of generalized inverse solutions. It is important for demographers and sociologists to understand that the PCE/IE estimates are not necessarily better (i.e., closer to the true parameters) than other constrained estimators.

What should well-intentioned researchers, who wish to investigate the age, period, and cohort patterns, do? On the one hand, several alternative methods have been developed, some of which are more theoretically driven, taking external information into account⁹, while others are statistical approaches¹⁰. Although each of these methods has advantages and limitations and a thorough examination is a topic for future research, I caution that purely statistical techniques are unlikely to yield accurate estimates. The methodological problem of IE and its non-trivial implications for empirical research identified in this paper are not unique to IE. The biostatistics literature shows that use of the APC model (1), regardless of estimation technique, precludes valid estimation as well as meaningful interpretations of the linear components of age, period, and cohort effects (see, e.g., Holford 1983; Kupper et al. 1985). Therefore, my position is to encourage development of APC models that are informed by social theories and thus different from model (1) in basic structure.

On the other hand, although the statistical difficulty in quantifying independent effects of age, period, and cohort was recognized long ago, decades of effort has only resulted in unsatisfactory solutions. Thus it is not unreasonable to ask: Is this unusual challenge suggesting a problem that is not statistical but theoretical in nature? In other words, is the identification problem pointing to a more fundamental problem in the theoretical framework of APC analysis? Should the answers to these questions be positive, the identification problem inherent in model (1) “is a blessing for social science” (Heckman and Robb, 1985 p144) because it warns scientists that they want something — a general statistical decomposition of data — for nothing.

Appendix

Simulation Results: Inconsistent IE estimates.

Footnotes

I am grateful to James Hodges, John Robert Warren, Robert O’Brien, Christopher Winship, Daniel Powers, Samir Soneji, Ann Meier, Ian Ross Macmillan, Carolyn Liebler, Caren Arbeit, Julia Drew, Catherine Fitch, Julian Wolfson, and Wenjie Liao for their helpful comments. I also thank the support from the Maryland Population Research Center. A version of this paper was presented at the 2012 meeting of the Population Association of America, San Francisco, CA.

A version of this paper was presented at the 2012 meeting of the Population Association of America, San Francisco, CA.

One way to characterize the effects of an interval variable like time is to break the effect into two components: linear and non-linear (curvature or deviations from linearity) trends. It has been known at least since Holford (1983) that the linear components of age, period, and cohort effects cannot be estimated without constraints because they are not identified. In contrast, non-linear age, period, and cohort trends can be estimated without bias.

It is important to distinguish data reduction or prediction from coefficient estimation. Because the identification problem does not prevent us from obtaining a set of solutions with good fit to the data, we can still make good predictions. The PC technique treats such problems as data redundancy and allows us to obtain one solution. However, as noted above, none of these solutions is the uniquely preferred solution, the solution that APC techniques including IE aim to discover. Therefore, providing a solution for the purpose of prediction is not the same as finding a uniquely preferred solution for estimation of separate age, period, and cohort effects.

Alternatively, Yang (2008) described the computational algorithm of IE as follows: after obtaining r−1 coefficients in the PC space (w₂, …, w_r), “[s]et coefficient w1 equal to 0 and transform the coefficients vector w = (w₁, …, w_r)^T” (Appendix, p413), where w₁ corresponds to the null eigenvector direction.

⁴

Yang et al. (2004, 2008) use $b_{0}^{*} = \frac{b_{0}}{‖ b_{0} ‖}$ , where ||b₀|| is the length of b₀, so $b_{0}^{*}$ has a length of 1. b₀ is used in this paper because it is simply a multiple of $b_{0}^{*}$ and is simpler for exposition and computation.

⁵

The constraint imposed by IE depends on how model (2) is parameterized. If the model is parameterized in terms of orthogonal polynomial contrasts for each of the age, period, and cohort effects, as in Holford (1983), then IE imposes a constraint solely on the linear contrasts of age, period, and cohort effects irrespective of any non-linear trends that are present. The parameterization used here is more common, e.g., Kupper et al. (1985), and in this parameterization, the constraint on the linear components of the age, period, and cohort effects depends on the non-linear components when both components are present.

⁶

Yang and colleagues have used “unbiasedness” in a different sense to mean that the expectation of IE is equal to b₁, the projection of parameter vector b onto the non-null space of design matrix X (e.g., see ibid. p1709). This is an important distinction because thse true parameter vector b can be very different from its projection b₁ onto the non-null space, the vector that IE actually estimates. Because APC analysts are usually interested in estimating the true age, period, and cohort effects, the classic concept of unbiasedness is more relevant to APC research than that used by IE’s proponents. Thus I use “unbiasedness” in its classic sense in the following discussion.

⁷

Yang and colleagues have used empirical data, where the true effects are unknown, to assess the properties and performance of IE (see ibid., p1712-1716). However, it is logically impossible to assess the performance of an estimator when the true effects are unknown. If such a cross-model validation of IE for a specific empirical dataset were to show that IE yields reasonable estimates, this can only depend on having selected examples that are consistent with the IE’s constraint. Therefore, cross-model comparisons using empirical data are not an appropriate method to validate IE.

⁸

While Yang and colleagues correctly pointed out that IE estimates the projection of the “true” effects onto the non-null space, they compared IE estimates to the “true” parameters, not to the projection (see ibid., p1718-1722). This is key, because the true parameter vector can be very different from its projection onto the non-null space (the vector that IE actually estimates). That is, what IE actually estimates can be very different from the true APC effects if the true effects do not at least approximately satisfy the LC implicit in IE.

⁹

Examples include “Age-Period-Cohort Characteristic Models” developed by O’Brien (2000) and the “mechanism-based approach” proposed by Winship and Harding (2008).

¹⁰

E.g., “Cross-Classified Random Effects Models” created by Yang and Land (2006, 2008).

References

Alwin Duane F. Family of Origin and Cohort Differences in Verbal Ability. American Sociological Review. 1991;56(5):625–638. [Google Scholar]
Fabio Anthony, Leober Rolf, Balasubramani GK, Roth Jeffrey, Fu Wenjiang, Farrington David P. Why some Generations are More Violent than Others: Assessment of Age, Period, and Cohort Effects. American Journal of Epidemiology. 2006;164(2):151–160. doi: 10.1093/aje/kwj172. [DOI] [PubMed] [Google Scholar]
Fu Wenjiang. Ridge Estimator in Singular Design with Applications to Age-Period-Cohort Analysis of Disease Rates. Communications in Statistics Theory and Method. 2000;29:263–78. [Google Scholar]
Fu Wenjiang J, Hall Peter. Asymptotic Properties of Estimators in Age-Period-Cohort Analysis. Statistics & Probability Letters. 2006;76(17):1925–1929. [Google Scholar]
Fu Wenjiang J. A Smoothing Cohort Model in Age-Period-Cohort Analysis with Applications to Homicide Arrest Rates and Lung Cancer Mortality Rates. Sociological Methods & Research. 2008;36(3):327–361. [Google Scholar]
Fu Wenjiang J, Land Kenneth C, Yang Yang. On the Intrinsic Estimator and Constrained Estimators in Age-Period-Cohort Models. Sociological Methods & Research. 2011;40(3):453–466. [Google Scholar]
Glenn Norval D. Cohort Analysis. Thousand Oaks, Calif: Sage Publications; 2005. [Google Scholar]
Heckman James, Robb Richard. Using Longitudinal Data to Estimate Age, Period, and Cohort Effects in Earnings Equations. In: Mason WM, Fienberg SE, editors. Cohort Analysis in Social Research. New York: Springer-Verlag; 1985. pp. 137–50. [Google Scholar]
Holford Theodore R. The Estimation of Age, Period and Cohort Effects for Vital Rates. Biometrics. 1983;39(2):311–324. [PubMed] [Google Scholar]
Keyes Katherine, Miech Richard. Age, Period, and Cohort Effects in Heavy Episodic Drinking in the US from 1985 to 2009. Drug and Alcohol Dependence. 2013 doi: 10.1016/j.drugalcdep.2013.01.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kupper Lawrence L, Janis Joseph, Salama Ibrahim A, Yoshizawa Carl N, Greenberg Bernard G, Winsborough HH. Age-period-cohort analysis: an illustration of the problems in assessing interaction in one observation per cell data. Communications in Statistics. Theory and Methods. 1983;12(23):2779–2807. [Google Scholar]
Kupper Lawrence L, Janis Joseph, Karmous Azza, Greenberg Bernard G. Statistical Age-Period-Cohort Analysis: A Review and Critique. Journal of Chronic Diseases. 1985;38(10):811–830. doi: 10.1016/0021-9681(85)90105-5. [DOI] [PubMed] [Google Scholar]
Langley John, Samaranayaka Ari, Davie J, Campbell AJ. Age, Cohort and Period Effects on Hip Fracture Incidence: Analysis and Predictions from New Zealand Data 1974–2007. Osteoporosis International. 2011;22(1):105–111. doi: 10.1007/s00198-010-1205-6. [DOI] [PubMed] [Google Scholar]
Mason Karen Oppenheim, Mason William M, Winsborough HH, Kenneth Poole W. Some Methodological Issues in Cohort Analysis of Archival Data. American Sociological Review. 1973;38(2):242–258. [Google Scholar]
Miech Richard, Koester Steve, Dorsey-Holliman Brook. Increasing US Mortality due to Accidental Poisoning: The Role of the Baby Boom Cohort. Addiction. 2011;106(4):806–815. doi: 10.1111/j.1360-0443.2010.03332.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
O’Brien Robert M. Constrained Estimators and Age-Period-Cohort Models. Sociological Methods & Research. 2011a;40(3):419–452. [Google Scholar]
O’Brien Robert M. Intrinsic Estimators as Constrained Estimators in Age-Period-Cohort Accounting Models. Sociological Methods & Research. 2011b;40(3):467–470. [Google Scholar]
O’Brien Robert M. Age Period Cohort Characteristic Models. Social Science Research. 2000;29(1):123. [Google Scholar]
Preston Samuel H, Wang Haidong. Sex Mortality Differences in the United States: The Role of Cohort Smoking Patterns. Demography. 2006;43(4):631–646. doi: 10.1353/dem.2006.0037. [DOI] [PubMed] [Google Scholar]
Rodgers Willard L. Estimable Functions of Age, Period, and Cohort Effects. American Sociological Review. 1982a;47(6):774–787. [Google Scholar]
Rodgers Willard L. Reply to Comment by Smith, Mason, and Fienberg. American Sociological Review. 1982b;47(6):793–796. [Google Scholar]
Ryder Norman B. The Cohort as a Concept in the Study of Social Change. American Sociological Review. 1965;30(6):843–861. [PubMed] [Google Scholar]
Schwadel Philip. Age, Period, and Cohort Effects on Religious Activities and Beliefs. Social Science Research. 2011;40(1):181–192. [Google Scholar]
Winkler Richelle, Warnke Keith. Population Environment. Springer; 2012. The future of hunting: an age-period-cohort analysis of deer hunter decline. [DOI] [Google Scholar]
Winship Christopher, Harding David J. A Mechanism-Based Approach to the Identification of Age--Period--Cohort Models. Sociological Methods & Research. 2008;36(3):362–401. [Google Scholar]
Yang Yang, Fu Wenjiang J, Land Kenneth C. A Methodological Comparison of Age-Period-Cohort Models: The Intrinsic Estimator and Conventional Generalized Linear Models. Sociological Methodology. 2004;34(1):75–110. [Google Scholar]
Yang Yang. Trends in U.S. Adult Chronic Disease Mortality, 1960–1999: Age, Period, and Cohort Variations. Demography. 2008;45(2):387–416. doi: 10.1353/dem.0.0000. [DOI] [PMC free article] [PubMed] [Google Scholar]
Yang Yang, Schulhofer-Wohl Sam, Fu Wenjiang J, Land Kenneth C. The Intrinsic Estimator for Age-Period-Cohort Analysis: What it is and how to use it. American Journal of Sociology. 2008;113(6):1697–1736. [Google Scholar]
Yang Yang, Land Kenneth C. A Mixed Models Approach to the Age-Period-Cohort Analysis of Repeated Cross-Section Surveys, with an Application to Data on Trends in Verbal Test Scores. Sociological Methodology. 2006;36:75–97. [Google Scholar]
Yang Yang, Land Kenneth C. Age–Period–Cohort Analysis of Repeated Cross-Section Surveys: Fixed Or Random Effects? Sociological Methods & Research. 2008;36(3):297–326. [Google Scholar]

[R1] Alwin Duane F. Family of Origin and Cohort Differences in Verbal Ability. American Sociological Review. 1991;56(5):625–638. [Google Scholar]

[R2] Fabio Anthony, Leober Rolf, Balasubramani GK, Roth Jeffrey, Fu Wenjiang, Farrington David P. Why some Generations are More Violent than Others: Assessment of Age, Period, and Cohort Effects. American Journal of Epidemiology. 2006;164(2):151–160. doi: 10.1093/aje/kwj172. [DOI] [PubMed] [Google Scholar]

[R3] Fu Wenjiang. Ridge Estimator in Singular Design with Applications to Age-Period-Cohort Analysis of Disease Rates. Communications in Statistics Theory and Method. 2000;29:263–78. [Google Scholar]

[R4] Fu Wenjiang J, Hall Peter. Asymptotic Properties of Estimators in Age-Period-Cohort Analysis. Statistics & Probability Letters. 2006;76(17):1925–1929. [Google Scholar]

[R5] Fu Wenjiang J. A Smoothing Cohort Model in Age-Period-Cohort Analysis with Applications to Homicide Arrest Rates and Lung Cancer Mortality Rates. Sociological Methods & Research. 2008;36(3):327–361. [Google Scholar]

[R6] Fu Wenjiang J, Land Kenneth C, Yang Yang. On the Intrinsic Estimator and Constrained Estimators in Age-Period-Cohort Models. Sociological Methods & Research. 2011;40(3):453–466. [Google Scholar]

[R7] Glenn Norval D. Cohort Analysis. Thousand Oaks, Calif: Sage Publications; 2005. [Google Scholar]

[R8] Heckman James, Robb Richard. Using Longitudinal Data to Estimate Age, Period, and Cohort Effects in Earnings Equations. In: Mason WM, Fienberg SE, editors. Cohort Analysis in Social Research. New York: Springer-Verlag; 1985. pp. 137–50. [Google Scholar]

[R9] Holford Theodore R. The Estimation of Age, Period and Cohort Effects for Vital Rates. Biometrics. 1983;39(2):311–324. [PubMed] [Google Scholar]

[R10] Keyes Katherine, Miech Richard. Age, Period, and Cohort Effects in Heavy Episodic Drinking in the US from 1985 to 2009. Drug and Alcohol Dependence. 2013 doi: 10.1016/j.drugalcdep.2013.01.019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] Kupper Lawrence L, Janis Joseph, Salama Ibrahim A, Yoshizawa Carl N, Greenberg Bernard G, Winsborough HH. Age-period-cohort analysis: an illustration of the problems in assessing interaction in one observation per cell data. Communications in Statistics. Theory and Methods. 1983;12(23):2779–2807. [Google Scholar]

[R12] Kupper Lawrence L, Janis Joseph, Karmous Azza, Greenberg Bernard G. Statistical Age-Period-Cohort Analysis: A Review and Critique. Journal of Chronic Diseases. 1985;38(10):811–830. doi: 10.1016/0021-9681(85)90105-5. [DOI] [PubMed] [Google Scholar]

[R13] Langley John, Samaranayaka Ari, Davie J, Campbell AJ. Age, Cohort and Period Effects on Hip Fracture Incidence: Analysis and Predictions from New Zealand Data 1974–2007. Osteoporosis International. 2011;22(1):105–111. doi: 10.1007/s00198-010-1205-6. [DOI] [PubMed] [Google Scholar]

[R14] Mason Karen Oppenheim, Mason William M, Winsborough HH, Kenneth Poole W. Some Methodological Issues in Cohort Analysis of Archival Data. American Sociological Review. 1973;38(2):242–258. [Google Scholar]

[R15] Miech Richard, Koester Steve, Dorsey-Holliman Brook. Increasing US Mortality due to Accidental Poisoning: The Role of the Baby Boom Cohort. Addiction. 2011;106(4):806–815. doi: 10.1111/j.1360-0443.2010.03332.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] O’Brien Robert M. Constrained Estimators and Age-Period-Cohort Models. Sociological Methods & Research. 2011a;40(3):419–452. [Google Scholar]

[R17] O’Brien Robert M. Intrinsic Estimators as Constrained Estimators in Age-Period-Cohort Accounting Models. Sociological Methods & Research. 2011b;40(3):467–470. [Google Scholar]

[R18] O’Brien Robert M. Age Period Cohort Characteristic Models. Social Science Research. 2000;29(1):123. [Google Scholar]

[R19] Preston Samuel H, Wang Haidong. Sex Mortality Differences in the United States: The Role of Cohort Smoking Patterns. Demography. 2006;43(4):631–646. doi: 10.1353/dem.2006.0037. [DOI] [PubMed] [Google Scholar]

[R20] Rodgers Willard L. Estimable Functions of Age, Period, and Cohort Effects. American Sociological Review. 1982a;47(6):774–787. [Google Scholar]

[R21] Rodgers Willard L. Reply to Comment by Smith, Mason, and Fienberg. American Sociological Review. 1982b;47(6):793–796. [Google Scholar]

[R22] Ryder Norman B. The Cohort as a Concept in the Study of Social Change. American Sociological Review. 1965;30(6):843–861. [PubMed] [Google Scholar]

[R23] Schwadel Philip. Age, Period, and Cohort Effects on Religious Activities and Beliefs. Social Science Research. 2011;40(1):181–192. [Google Scholar]

[R24] Winkler Richelle, Warnke Keith. Population Environment. Springer; 2012. The future of hunting: an age-period-cohort analysis of deer hunter decline. [DOI] [Google Scholar]

[R25] Winship Christopher, Harding David J. A Mechanism-Based Approach to the Identification of Age--Period--Cohort Models. Sociological Methods & Research. 2008;36(3):362–401. [Google Scholar]

[R26] Yang Yang, Fu Wenjiang J, Land Kenneth C. A Methodological Comparison of Age-Period-Cohort Models: The Intrinsic Estimator and Conventional Generalized Linear Models. Sociological Methodology. 2004;34(1):75–110. [Google Scholar]

[R27] Yang Yang. Trends in U.S. Adult Chronic Disease Mortality, 1960–1999: Age, Period, and Cohort Variations. Demography. 2008;45(2):387–416. doi: 10.1353/dem.0.0000. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] Yang Yang, Schulhofer-Wohl Sam, Fu Wenjiang J, Land Kenneth C. The Intrinsic Estimator for Age-Period-Cohort Analysis: What it is and how to use it. American Journal of Sociology. 2008;113(6):1697–1736. [Google Scholar]

[R29] Yang Yang, Land Kenneth C. A Mixed Models Approach to the Age-Period-Cohort Analysis of Repeated Cross-Section Surveys, with an Application to Data on Trends in Verbal Test Scores. Sociological Methodology. 2006;36:75–97. [Google Scholar]

[R30] Yang Yang, Land Kenneth C. Age–Period–Cohort Analysis of Repeated Cross-Section Surveys: Fixed Or Random Effects? Sociological Methods & Research. 2008;36(3):297–326. [Google Scholar]

PERMALINK

Assessing Validity and Application Scope of the Intrinsic Estimator Approach to the Age-Period-Cohort Problem^*

Liying Luo

Abstract

Introduction

The Identification Problem

Table 1.

Table 2.

The Constrained Approach: IE and CGLM

The Linear Constraint Implied by IE

Table 3.

The Implications of IE’s Constraint: Is IE an unbiased and consistent estimator?

Biasedness

Consistency

Table 4.

Application Scope: IE vs. CGLM

Figure 1.

Conclusion and Discussion

Appendix

Figure 1.

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Assessing Validity and Application Scope of the Intrinsic Estimator Approach to the Age-Period-Cohort Problem*

Liying Luo

Abstract

Introduction

The Identification Problem

Table 1.

Table 2.

The Constrained Approach: IE and CGLM

The Linear Constraint Implied by IE

Table 3.

The Implications of IE’s Constraint: Is IE an unbiased and consistent estimator?

Biasedness

Consistency

Table 4.

Application Scope: IE vs. CGLM

Figure 1.

Conclusion and Discussion

Appendix

Figure 1.

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Assessing Validity and Application Scope of the Intrinsic Estimator Approach to the Age-Period-Cohort Problem^*