Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2011 Feb 18;141(7):2494–2499. doi: 10.1016/j.jspi.2011.02.014

The nonparametric maximum likelihood estimator for middle-censored data

Pao-Sheng Shen 1
PMCID: PMC7126554  PMID: 32288074

Abstract

In this note, we consider data subjected to middle censoring where the variable of interest becomes unobservable when it falls within an interval of censorship. We demonstrate that the nonparametric maximum likelihood estimator (NPMLE) of distribution function can be obtained by using Turnbull's (1976) EM algorithm or self-consistent estimating equation (Jammalamadaka and Mangalam, 2003) with an initial estimator which puts mass only on the innermost intervals. The consistency of the NPMLE can be established based on the asymptotic properties of self-consistent estimators (SCE) with mixed interval-censored data (Yu et al., 2000, Yu et al., 2001).

Keywords: Middle censoring, Mixed interval censoring, Self-consistent

1. Introduction

Middle censoring occurs when a data point falls inside a random censoring interval whereby it becomes unobservable. For some individuals the exact values are available while for others the corresponding intervals of censorship are observed. We mention two situations where middle censoring occurs. (i) In a follow-up study, if the childhood learning center where the observations are being taken, is closed for a period, due to an external emergency such as the outbreak of severe acute respiratory syndrome (SARS). (ii) In a clinical trial, where the clinic where the observations are being taken, is closed for a period, due to an external emergency such as the outbreak of war or a strike. For situations (i) and (ii) where during a fixed time interval (this fixed interval is indeed, a random interval (denoted by (U,V)) relative to individual's lifetime) the observation was not possible. If some children (or patients) develop a skill (or disease) of interest during this time, we are not able to observe the exact age T of these children (or patients) at the time of skill (or disease) development, rather only the information that the event of interest occurred during a certain time interval (U,V). At first glance, middle censoring, where a random middle part is missing, appears as complementary to the idea of double censoring in which the middle part is what is actually observed. However, a careful reflection and analysis shows them to be quite different ideas; see Jammalamadaka and Mangalam (2003) for details.

Let T i, i=1,…,n, be a sequence of i.i.d. random variables with distribution function F 0. Independent of T i's, let (U i,V i), i=1,…,n, be i.i.d. extended real-valued random variables with joint distribution function K0(x,y)=P(Uix,Viy) such that P(Ui<Vi)=1. The ith observation is said to be censored if Ti(Ui,Vi). We observe (Xi,δi), i=1,…,n, where X i=T i if δi=I[Ti(Ui,Vi)] and X i=(U i, V i) if δi=0. For indentifiability of F 0, we assume that

A0(x)=K0(x,)K0(x,x)=P(Ui<x<Vi)<1, (1.1)

on any interval [a,b], ab for which F0(b)>F0(a). Observe that if A 0(x)=1 on any interval where F 0 has a positive mass, then censoring occurs with probability 1 on such an interval. As a consequence, there will be no observations on the interval and that prevents us from distinguishing any two distributions which are identical outside [a,b] but differing only on [a,b].

In many censoring situations, if we were to try to estimate the distribution function via the EM algorithm the resulting equation takes the form

F^S(t)=EF^S[En|X], (1.2)

as described by Tsai and Crowley (1985), where E n is the empirical distribution function and X denotes the observed data. This equation was first introduced and referred to as self-consistency equation by Efron (1967). A solution F^S of (1.2) is called a self-consistent estimator (SCE) of F 0. In different types of censoring, the relationship between nonparametric maximum likelihood estimator (NPMLE) and SCE has been studied by various authors. In the case of right censoring the product-limit estimator (Kaplan and Meier, 1958) is the NPMLE, and Efron (1967) showed that it is also self-consistent. In the double-censoring case, Mykland and Ren (1996) provided a necessary and sufficient condition for an SCE to be an NPMLE. In the middle censored cases the self-consistent estimator (SCE) (see Jammalamadaka and Mangalam, 2003) F^S satisfies the following equation:

F^S(t)=n1i=1nδiI[Xit]+(1δi)I[Vit]+(1δi)I[t(Ui,Vi)]F^S(t)F^S(Ui)F^S(Vi)F^S(Ui). (1.3)

Jammalamadaka and Mangalam (2003)  showed that the NPMLE satisfies the self-consistency equation (1.3). They also pointed out that an SCE provides only a local maximum of the likelihood equation and may not be an NPMLE. Furthermore, they showed that the NPMLE will have all its mass on the uncensored observations except when it so happens that a censored interval contains no uncensored observation. The consistency of the SCE F^S was established by Jammalamadaka and Mangalam (2003) for the special case when either U i or V i is degenerate. Jammalamadaka and Iyer (2004) proposed an approximation to the distribution function F 0 (denoted by F0) for which a modified self-consistent estimator F^S was obtained. They established the asymptotic properties of F^S and provided an upper bound of the difference between F 0 and F0. Mangalam et al. (2008) showed that under condition (A) every censoring interval contains at least one uncensored observation, i.e. δi=0 implies that there exists j such that δj=1 and Xj(Ui,Vi), the solution of Eq. (1.3) will be unique, and as a consequence F^n will be equal to an NPMLE. Mangalam et al. (2008) proposed a technique for obtaining the NPMLE by dividing the original problem into subproblems.

In this note, we aim to establish connections between the middle-censoring and interval censoring by investigating the self-consistency algorithm. In Section 2, we shall demonstrate that the NPMLE of F 0 can be obtained by using the EM algorithm of Turnbull (1976) or the self-consistent estimating equation (Jammalamadaka and Mangalam, 2003) with an initial estimator which puts mass only on the innermost intervals. Furthermore, we establish the consistency and asymptotic normality of the NPMLE by using the results of Yu et al., 2000, Yu et al., 2001.

2. The NPMLE

2.1. Self-consistency

Turnbull (1976) characterized the NPMLE in the presence of interval censoring and truncation. Frydman (1994) later corrected Turnbull's characterization. Here, we consider the case when there is no truncation. Let Ai=ALi,Ri={Li} if L i=R i and Ai=ARi,Li=(Li,Ri) if Li<Ri. For i=1,…,n, let L i=R i=T i if δi=1 and (L i,R i)=(U i,V i) if δi=0. Thus, we observe the middle-censored data A 1,…,A n. Following Turnbull (1976), Frydman (1994) and Alioum and Commenges (1996), we consider nonparametric estimation of F 0 using the independent observation A i's. Since Turnbull's EM algorithm can be used to tackle the case when A i is a single point set, the connection between middle censoring and interval censoring can be established. Based on the notations defined above, the likelihood is proportional to

L(F0)=i=1nPF0(Ai), (2.1)

where PF0(Ai) denotes the probability that is assigned to the interval by F 0. We define an NPMLE as F^M=argmaxF0F{L(F0)}, where F denotes the class of distribution functions such that PF0(i=1nAi)=1. For censored data, using graph theory, Gentleman and Vandal (2001) presented methods for finding the NPMLE of F 0. For censored and truncated data, Hudgens (2005) employed a graph theoretical approach to describe the support set of the NPMLE of F 0.

Define innermost intervals H j, j=1,…,J, induced by A 1,…,A n to be all the disjoint intervals which are non-empty intersections of these A i's (e.g. Ak=AkAk is an intersection of A i's) such that AiHj= or H j for all i and j. Let the endpoints of the innermost intervals be q j and p j, j=1,…,J, where

0q1p1q2p2qJpJ.

Peto (1973) showed that the NPMLE of F 0 assigns weight, say s1,,sJ, to the corresponding innermost intervals H 1,…,H J only. Thus, it suffices to maximize

L(s)=i=1nj=1Jαijsj, (2.2)

where αij=I[HjAi] and I[·] is the usual indicator function. The goal is to maximize likelihood (2.2) subject to the constraints

j=1Jsj=1 (2.3)

and

sj0(j=1,,J). (2.4)

Thus, we can limit our search to the space given by constraints (2.3), (2.4). We shall use Ω to denote the parameter space that is given by constraints (2.3), (2.4), i.e.

Ω=sRJ:j=1Jsj=1;sj0forj=1,,J.

To find the maximum likelihood estimate of the vector s, we use an EM algorithm as follows.

E-step: Let the expected value of αij be denoted by μij(s). Then under s,

μij(s)=αijsjk=1Jαiksk. (2.5)

M-step: In the maximization step, we treat the expected values as observed. The overall proportion of failures in the interval H j is

πj(s)=i=1nμij(s)i=1nk=1Jμik(s)=n1i=1nμij(s). (2.6)

The EM algorithm iterates between Eqs. (2.5), (2.6) after selecting initial estimates sj(0)>0 such that j=1Jsj(0)=1, i.e., computes μij(s(0)), updates s by μij(s(1)), and repeats until convergence. The resulting self-consistent estimate of s, which is a solution of simultaneous equation sj=πj(s)(j=1,,J), is exactly the Turnbull's (1976) self-consistency algorithm as follows:

sj(b)=dj(s(b1))nsj(b1)(1jJ), (2.7)

where

dj(s(b1))=i=1nαij/k=1Jαiksk(b1).

Let s^j(j=1,,J) denote the estimators obtained from (2.7). Using standard convex optimization techniques, Gentleman and Geyer (1994, see p. 619) provided easily verifiable conditions, i.e. Kuhn–Tucker conditions, for the self-consistent estimator s^j to be a maximum likelihood estimator.

Based on the estimators s^j's, an estimator F^M(t) of F 0(t) can be uniquely defined for t[pj,qj+1) by F^M(pj)=F^M(qj+1)=s^1++s^j, but is not uniquely defined for t being in an open innermost interval (q j,p j) with qj<pj. To avoid ambiguity we define F^M(t)=s^1++s^j1+sj(tqj)/(pjqj) if t(qj,pj] and 0<qj<pj<. Next, we shall show that the estimator F^M satisfies self-consistent equation (1.3), i.e.

F(t)=n1i=1nδiI[Tit]+(1δi)I[Rit]+(1δi)I[t(Li,Ri)]F(t)F(Li)F(Ri)F(Li). (2.8)

Theorem 1

The NPMLE F^M satisfies Eq. (2.8).

Proof

First, notice that for each (qj,pj)(j=1,,J) either q j=p j or qj<pj and there is no uncensored observation in (q j,p j) if qj<pj. Furthermore, dj(s^) can be written as

dj(s^)=i=1nδiαij/k=1Jαiks^k+i=1n(1δi)αij/k=1Jαiks^k.

Consider an initial estimator F^(0), which puts mass only on (qj,pj)(j=1,,J). Let F^(1) denote the first step estimator. Without changing the innermost intervals and likelihood function, we can transform data by moving all right censored points between p j−1 and q j to p j−1. Similarly, move all left censored points between p j−1 and q j to q j (see Li et al., 1997). Hence, we have i=1n(1δi)I[Ri(pj1,qj)]=0. Since i=1nδiI[pj1<Ti<qj]=0, we have

F^(1)(qj)F^(1)(pj1)=n1i=1n(1δi)I[Ri(pj1,qj)]+n1i=1n(1δi)I[(pj1,qj)(Li,Ri)]F^(0)(qj)F^(0)(pj1)F^(0)(Ri)F^(0)(Li)=0.

Hence, F (1) also puts mass only on (qj,pj)(j=1,,J). Next, we consider the following two cases:

Case  1: q j=p j. When q j=p j, we have

F^M(pj)F^M(pj)=s^j=n1+n1i=1n(1δi)αij/k=1Jαiks^ks^j. (2.9)

Now

F^(1)(pj)F^(1)(pj)=n1+n1i=1n(1δi)I[pj(Li,Ri)]F^(0)(pj)F^(0)(pj)F^(0)(Ri)F^(0)(Li),

which can be written as

dF^(1)(pj)=n1+n1i=1n(1δi)αijF^(0)(Ri)F^(0)(Li)dF^(0)(pj). (2.10)

Hence, Eq. (2.10) is equivalent to Eq. (2.9).

Case  2: qj<pj. When qj<pj, since there is no uncensored observations in (q j,p j), we have

F^M(pj)F^M(qj)=n1i=1n(1δi)αij/k=1Jαiks^ks^j. (2.11)

Now,

F^(1)(pj)F^(1)(qj)=n1i=1nδiI[qj<Ti<pj]+n1i=1n(1δi)I[qj<Ri<pj]+n1i=1n(1δi)I[(qj,pj)((Li,Ri)]F^(0)(pj)F^(0)(qj)F^(0)(Ri)F^(0)(Li).

First, we have i=1nδiI[qj<Ti<pj]=0. Next, note that given an interval (L i,R i) and δi=0, we either have (qj,pj)(Li,Ri) or (qj,pj)(Li,Ri)=. Hence, we have i=1n(1δi)I[qj<Ri<pj]=0. It follows that

F^(1)(pj)F^(1)(qj)=n1i=1n(1δi)I[(qj,pj)((Li,Ri)]F^(0)(pj)F^(0)(qj)F^(0)(Ri)F^(0)(Li). (2.12)

Hence, Eq. (2.12) is equivalent to Eq. (2.11). The proof is completed.  □

This conclusion is the same as in Theorem 1 of Jammalamadaka and Mangalam (2003). However, our proof is based on the EM algorithm of Turnbull (1976). It is obvious that if condition (A) holds then q j=p j for all j, and the solution s^j's will have all its mass on the uncensored observations. Furthermore, if we start with an initial estimator which puts weight 1/J on q j=p j for uncensored observations and on (q j+p j)/2 for censored observations, we can obtain an NPMLE by using Eq. (2.8). However, similar to interval-censored data, the self-consistent NPMLE of F 0 is not uniquely defined for x(qj,pj) if qj<pj. An SCE with an initial estimator which puts weight on intervals other than (q j, p j) can lead to a less efficient estimator (not NPMLE).

2.2. Large sample properties

In this section, we shall investigate large sample properties of F^M. First, we introduce mixed interval censored (MIC) data. A data set is called a MIC data when it consists of both exact observations and case 2 interval censoring data (i.e. Li<Ri). Mixed IC data arises in clinical follow-up studies where a tumor maker (e.g., Ca125 in ovarian cancer) is available, a patient whose marker value is consistently on the high (or low) end of normal range in repeated testing is usually under close surveillance for possible relapse. If such a patient should relapse, then the time to clinical relapse can often be accurately determined. However, if a patient is not under close surveillance, and would seek assistance only after some tangible symptoms have appeared, then time to relapse would be subject to case 2 interval censoring. For MIC data, several models have been proposed, and the asymptotic properties of the NPMLE have been investigated under the assumption that either the censoring variables take on finite many values (see Huang, 1999, Yu et al., 1998, Yu et al., 2000), or the censoring and survival distributions are strictly increasing and continuous and they have “positive separation” (see Huang, 1999, Assumption (A3)). For MIC data, define (Y i,Z i) as a pair of extended random censoring times ( allowed) with P(Yi<Zi)=1, and T i is independent of (Y i,Z i). Yu et al. (2000, see (2.1)) considered a mixture interval censorship model to characterize MIC data as follows:

(Li,Ri)=(Ti,Ti)ifTi(Yi,Zi](exact),(Yi,)ifTi>YiandZi=(right censoring),(,Zi)ifTiZiandYi=(left censoring),(Yi,Zi)if<Yi<TiZi<(strictly interval censored). (2.13)

For middle censored data, since neither right censoring nor left censoring occurs, model (2.13) is reduced to

(Li,Ri)=(Ti,Ti)ifTi(Yi,Zi],(Yi,Zi)ifTi(Yi,Zi]. (2.14)

Replacing (Y i,Z i) and (Y i,Z i] in (2.14) with (U i,V i), we obtain the model for middle-censored data as follows:

(Li,Ri)=(Ti,Ti)ifTi(Ui,Vi),(Ui,Vi)ifTi(Ui,Vi).

Hence, although the sampling scheme of MIC data seems to be quite different in character from that of middle-censored data described in Section 1, the resulting observations (L i, R i) would reduce to the observations from middle-censoring data when there is no left or right censoring.

Theorem 2

Let S T denote the set of all support points of variables T i. Similarly, define S V and S U for V i and U i, respectively. Suppose that SUSVST. Then (i) The NPMLE F^M is strongly consistent, i.e. supx0|F^M(x)F0(x)|=0a.s., and (ii) n(F^M(x)F0(x)) converges in distribution to a normal variate.

Proof

Let Q denote the empirical version of the joint distribution function of (L i,R i) (i=1,…,n). It follows that Eq. (2.8) can be written as

F(t)=rtdQn(l,r)+t(l,r)F(t)F(l)F(r)F(l)dQn(l,r), (2.15)

where Q n is the empirical version of Q. In (2.15), if F(t)=F(r−)=F(l), then we encounter 00 in the integrand. In this case, we define 00=1. Notice that Eq. (2.15) is exactly the same as Eq. (2.3) of Yu et al. (2000) and Eq. (2.2) of Yu et al. (2001), which is a self-consistent equation of F 0 for the model in Yu et al., 2000, Yu et al., 2001 with mixed interval-censored (MIC) data. By Theorems 2.1, 2.2 and 3.1 of Yu et al. (2001), the strong consistency and asymptotic normality of F^M and F^S are established.  □

3. Discussion

We have demonstrated how middle-censored data relate to mixed interval-censored data. With some modification of the definition for intervals (q j, p j)'s, we can obtain the NPMLE of distribution function by using EM algorithm of Turnbull (1976) or self-consistent estimating equation (Jammalamadaka and Mangalam, 2003) with a proper initial estimator. The consistency and asymptotic normality of the NPMLE can be established based on the asymptotic properties of self-consistent estimators (SCE) with mixed interval censored data (Yu et al., 2000, Yu et al., 2001).

Acknowledgement

The author would like to thank the associate editor and the referees for their helpful and valuable comments and suggestions.

References

  1. Alioum A., Commenges D. A proportional hazards model for arbitrarily censored and truncated data. Biometrics. 1996;52:512–524. [PubMed] [Google Scholar]
  2. Efron B. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability. University of California Press; Berkeley: 1967. The two-sample problem with censored data. [Google Scholar]
  3. Frydman H. A note on nonparametric estimation of the distribution function from interval-censored and truncated data. J. Roy. Statist. Soc. Ser. B. 1994;56:71–74. [Google Scholar]
  4. Gentleman R., Geyer C. Maximum likelihood for interval censored data: consistency and computation. Biometrika. 1994;81:618–623. [Google Scholar]
  5. Gentleman R., Vandal A.C. Computational algorithm for censored-data problems using intersection graphs. J. Comput. Graphical Statist. 2001;10:403–421. [Google Scholar]
  6. Huang J. Asymptotic properties of nonparametric estimation based on partly interval-censored data. Statist. Sinica. 1999;9:501–520. [Google Scholar]
  7. Hudgens M.G. On nonparametric maximum likelihood estimation with interval censoring and truncation. J. Roy. Statist. Soc. Ser. B. 2005;67(Part 4):573–587. [Google Scholar]
  8. Jammalamadaka S.R., Mangalam V. Non-parametric estimation for middle-censored data. J. Nonparametr. Statist. 2003;15:253–265. [Google Scholar]
  9. Jammalamadaka S.R., Iyer S.K. Approximate self consistency for middle-censored data. J. Statist. Plann. Inference. 2004;124:75–86. [Google Scholar]
  10. Kaplan E.L., Meier P. Nonparametric estimation from incomplete observations. J. Amer. Statist. Assoc. 1958;53:457–481. [Google Scholar]
  11. Li L., Watkins T., Yu Q.Q. An EM algorithm for smoothing the self-consistent estimator of survival functions with interval-censored data. Scand. J. Statist. 1997;24:531–542. [Google Scholar]
  12. Mangalam V., Nair G.M., Zhao Y. On computation of NPMLE for middle-censored data. Statist. Probab. Lett. 2008;78:1452–1458. [Google Scholar]
  13. Mykland P.A., Ren J. Algorithms for computing self-consistent and maximum likelihood estimators with doubly censored data. Ann. Statist. 1996;24:1740–1764. [Google Scholar]
  14. Peto R. Experimental survival curves for interval-censored data. Appl. Statist. 1973;22:86–91. [Google Scholar]
  15. Tsai W.Y., Crowley J. A large sample study of generalized maximum likelihood estimators from incomplete data via self-consistency. Ann. Statist. 1985;13:1317–1334. [Google Scholar]
  16. Turnbull B.W. The empirical distribution with arbitrarily grouped, censored and truncated data. J. Roy. Statist. Soc. B. 1976;38:290–295. [Google Scholar]
  17. Yu Q.Q., Li L., Wong G.Y.C. Asymptotic variance of the GMLE of a survival function with interval-censored data. Sankhyã. 1998;60:184–197. [Google Scholar]
  18. Yu Q.Q., Li L., Wong G.Y.C. On consistency of the self-consistent estimator of survival function with interval censored data. Scand. J. Statist. 2000;27:35–44. [Google Scholar]
  19. Yu Q.Q., Wong G.Y.C., Li L. Asymptotic properties of self-consistent estimators with mixed interval-censored data. Ann. Inst. Statist. Math. 2001;53:469–486. [Google Scholar]

Articles from Journal of Statistical Planning and Inference are provided here courtesy of Elsevier

RESOURCES