Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Apr 3.
Published in final edited form as: J Am Stat Assoc. 2018 Jun 28;113(523):992–1002. doi: 10.1080/01621459.2017.1379402

Latent Variable Poisson Models for Assessing the Regularity of Circadian Patterns over Time

Sungduk Kim 1, Paul S Albert 1
PMCID: PMC6447315  NIHMSID: NIHMS994776  PMID: 30956371

Abstract

Many researchers in biology and medicine have focused on trying to understand biological rhythms and their potential impact on disease. A common biological rhythm is circadian, where the cycle repeats itself every 24 hours. However, a disturbance of the circadian pattern may be indicative of future disease. In this article, we develop new statistical methodology for assessing the degree of disturbance or irregularity in a circadian pattern for count sequences that are observed over time in a population of individuals. We develop a latent variable Poisson modeling approach with both circadian and stochastic short-term trend (autoregressive latent process) components that allow for individual variation in the degree of each component. A parameterization is proposed for modeling covariate dependence on the proportion of these two model components across individuals. In addition, we incorporate covariate dependence in the overall mean, the magnitude of the trend, and the phase-shift of the circadian pattern. Innovative Markov chain Monte Carlo sampling is used to carry out Bayesian posterior computation. Several variations of the proposed models are considered and compared using the deviance information criterion. We illustrate this methodology with longitudinal physical activity count data measured in a longitudinal cohort of adolescents.

Keywords: Autoregressive process, Circadian pattern, Longitudinal count data, Physical activity, Random effect, Serial correlation, Shrinkage prior, Stick-breaking prior

1. Introduction

Many researchers in biology and medicine have focused on understanding biological rhythms and their potential impact on disease. A common biological rhythm is circadian, where the cycle repeats itself every 24 hours. There has been substantial statistical methodological development that characterizes circadian patterns in biomarkers. Greenhouse, Kass, and Tsay (1987), Brown and Czeisler (1992), Wang and Brown (1996), Wang, Ke, and Brown (2003), and Albert and Hunsberger (2005) proposed statistical methodology for modeling circadian mean structure in continuous longitudinal data. Ogbagaber et al. (2012) extended the shape-invariant model proposed by Wang, Ke, and Brown (2003) to longitudinal physical activity count data. Hu et al. (2007) and Indic et al. (2012) propose the use of nonlinear dynamic approaches for studying circadian patterns in physical activity data over time. All this work, however, focuses only on characterizing the circadian patterns across a population of individuals. Specifically, the focus is on characterizing the underlying circadian shape pattern, amplitude, and phase-shift. However, abnormal biology may reflect itself in disturbed circadian patterns, and scientific interest may be on examining what factors relate to having regular or disturbed (irregular) circadian patterns. In addition to modeling the mean structure in circadian rhythms, this article focuses on understanding what factors influence the degree of circadian as compared with stochastic short-term trend across individuals. In our proposed approach, we characterize both circadian and stochastic variation, and allow for each individual to have an additive and varying component of each.

This methodology was motivated by a recent study that examined the circadian pattern in longitudinal physical activity counts among adolescent girls (Ogbagaber et al. 2012). In this study, 96 adolescent girls completing the 9th grade were asked to wear a physical activity watch (actiwatch2) for a period of 7 days (Kushida et al. 2001). When the actiwatch2 detected nonwearing of the watch at any time during the 7 days, then the participants were asked to redo the whole 7-day followup. Physical activity was characterized by counting physical motions over 15-minute intervals. Ogbagaber et al. (2012) developed a shape-invariant modeling approach for introducing covariates dependence on the mean structure. With this approach, they showed that the circadian pattern was different between obese and nonobese girls. This approach characterized the mean circadian pattern, but did not examine how covariates may affect the relative proportion of circadian versus stochastic short-term trend across participants.

A close examination of the physical activity pattern across individuals shows markedly different types of physical activity patterns in these subjects (Figure 1). Figure 1(a) shows an individual adolescent with a pronounced circadian pattern across the seven days of follow-up. Figure 1(b) shows an adolescent with a slightly disturbed circadian pattern. Furthermore, Figure 1(c) shows an individual with a severely disturbed circadian pattern. This analysis suggested that some individuals have a regular circadian pattern, while others show a much more irregular pattern in physical activity. Interestingly, the degree of a circadian versus stochastic short-term trend may be reflective of future outcomes or important demographic variables. For example, irregular physical activity may result in higher rates of obesity, or perhaps, may be higher in teenagers who are already obese. Developing a modeling strategy that allows us to examine this important public health question is of interest. Figure 2 shows the overall mean and autocorrelation function (ACF) for these longitudinal count data. The figure shows a clear overall circadian pattern across the seven days of follow-up. However, the ACF shows a dampening of the daily correlation across time, suggesting that there may be additional sources of variation in addition to the circadian pattern.

Figure 1.

Figure 1.

Individual plots of three students showing their physical activity over 7 days: the solid lines represent a lowess smooth curve. Each tick represents a four-hour period. Each point reflects a 15 minute with the first point representing the count from 12:00 am to 12:15 am on the first day.

Figure 2.

Figure 2.

Plots of the overall mean trajectory and autocorrelation. Each tick represents a 4-hour period, and each point reflects a 15 minute with the first point representing the count from 12:00 am to 12:15 am on the first day in (a). The lag in (b) is in units of 15 minutes.

Generalized linear mixed models (GLMMs) have been used to estimate the mean structure and variation for longitudinal discrete data (Karim and Zeger 1992; McCulloch et al. 2008). Various extensions of GLMMs have been proposed to incorporate serial correlation using a latent process (Harvey 1989; Smith 1979; Zeger 1988; Albert et al. 2002). Chen and Ibrahim (2000) proposed a Bayesian approach for a time series of counts by constructing a likelihood-based generalization of a model considered by Zeger (1988). Hay and Pettitt (2001) considered the Bayesian analysis of a time series of counts with a random effect having a time series correlation structure. Recently, Kim et al. (2013) proposed a hierarchical Poisson regression model to incorporate overdispersion, heterogeneity, and serial correlation as well as a semiparametric mean structure. These approaches focused on either making inference about the overall mean structure or characterizing the between- to within- subject variation in longitudinal count data. Our focus is on characterizing (through covariates and random effects) the degree of circadian versus stochastic variation in longitudinal physical activity count data. There are different approaches to the statistical analysis of periodic data. The classical time series approach is based on Fourier expansions (Huettel, Song, and McCarthy 2004). A nonparametric method is proposed by Ramsay and Silverman (2005) in which the original series is split into individual cycles and each cycle is treated as an independent curve. A more efficient method is semiparametric shape-invariant modeling (Lawton and Sylvestre 1971; Lawton, Sylvestre, and Maggio 1972; Lindstrom 1995; Wang and Brown 1996; Hurtgen and Gervini 2009). Zeger and Diggle (1994) and Zhang et al. (1998) proposed a semiparametric stochastic mixed model, which assumes parametric covariate effects and nonparametric time effect, and accounts for the within-subject correlation using random effects and a stochastic process. Wang, Ke, and Brown (2003) proposed the shape-invariant mixed effects models for circadian rhythms with smoothing Spline ANOVA decompositions to study potential covariate effects on the common shape function. Albert and Hunsberger (2005) proposed a regression-based approach for analyzing longitudinal continuous circadian rhythm data where individual variations were incorporated through random effects added to the mean, amplitude, and phase-shift. For counts data with periodicity, Ogbagaber et al. (2012) recently proposed to use a shape-invariant Poisson models for circadian rhythm count data, which is an adaptation of Albert and Hunsberger (2005) to the longitudinal physical activity example. All these approaches focus on characterizing the circadian pattern in the mean structure, while in this article, the focus is on characterizing individual differences in the degree of circadian variation across individuals.

In Section 2, we formulate a Poisson model that allows for subject-specific variation in the relative contribution of a circadian pattern and a stochastic latent process, the latter of which governs short-term trend that characterizes irregular circadian variation. The circadian pattern is modeled with a harmonic representation that allows for individual variation in the phase-shift across subjects. Section 3 discusses Bayesian estimation including the proposed prior distributions and Bayesian computations. We demonstrate the performance of the proposed methodology through a simulation study in Section 4. Section 5 presents an analysis of the longitudinal physical activity data that motivated the modeling strategy. A discussion follows in Section 6.

2. The Latent Variable Poisson models

2.1. Model Framework

Suppose that i denotes individual (i=1, … , I) and j denotes the follow-up interval (j=1, … , J). We assume that there are I individuals in the study, each contributing J time intervals. Let m denote the day of continuous activity monitoring (m = 1, … , M). Further, k denotes the time interval on day m (k = 1, … , K) where k, beginning at 12:00 am, repeats itself on each new day. Let Yij be a sequence of counts for the ith individual at the jth time interval. Also, let zi=(zi1,zi2,,zi,q1), xi=(xi1,xi2,,xi,q2), wi=(wi1,wi2,,wi,q3), and ui=(ui1,ui2,,ui,q4); denote q1, q2, q3, and q4-dimensional vectors of covariates for individual i. Note that xi, wi, and ui can share common covariates with zi. Also, α=(α1,,αq1), β=(β1,,βq2), θ=(θ1,,θq3), and ϕ=(ϕ1,,ϕq4) are the corresponding vectors of regression coefficients, respectively. We define hi(k) and bij as the circadian pattern and stochastic short-term variation components. hi(k) follows a harmonic representation, while bij is an autoregressive (AR) process. We allow for participants to have differing degrees of these two components by introducing a mixing parameter that depends on both covariates and random effects. Denote pi=exp(xiβ+σξξi)1+exp(xiβ+σξξi) as a mixing parameter, where ξi is random effect with mean 0 and variance 1, and σξ is a standard deviation (SD) of ξi. Further, denote σi=exp(wiθ) as a scale parameter. The observed count process, yij, is assumed to follow an independent Poisson distribution with mean λij,

logλij=ziα+σττi+σγγij+σi(pihi(k)+(1pi)bij), (1)

where τi is a individual-level random effect that induces exchangeable correlation between individuals, and γij is a time-level random effect that accounts for overdispersion. The term σi(pihi(k) + (1 − pi)bij) reflects the circadian/stochastic pattern where σi governs the amplitude of the combined process. The random effects are assumed to be independent of each other with τi ~ N(0, 1), γij ~ N(0, 1), and ξi ~ N(0, 1), where N(a, b) denotes a normal distribution with mean a and variance b. Furthermore, στ and σγ are the SDs of τi and γij, respectively.

We model the stochastic component, bij, as an AR(1) process,

bi1=ηi1,ηi1~N(0,1),bij=ρbi,j1+ηij,j2, (2)

where ηij ~ N(0, 1 − ρ2) for j ≥ 2. Here ρ, where 0 < ρ < 1, is an autocorrelation parameter that determines how rapidly the serial correlation decreases between bi,j−1 and bi,j. We note that Var(bij) = 1 for j = 1, … , J and I =1, … , I. We model the circadian component,hi*(k), as a harmonic function with a Fourier wavelet basis that can be parameterized as

hi*(k)=l=1Lφl*cos(2πl(k/K+ζil)), (3)

where k/K is in fractions of a day ranging from 0 to 1 starting at midnight, L is the number of harmonic terms where a large L results in a flexible underlying circadian rhythm, and 0 < ζil < 1. Furthermore, L is the number of harmonic terms with L = 1 reducing to a simple sinusoidal circadian pattern and a large value of L allowing for a very flexible pattern. The parameters φl* assess the contribution of each of the L terms to the harmonic function, while the ζil is a parameter that characterize each participant phase-shift as a percentage of a 24-hour period. Variation in the phase-shift for the lth harmonic function and ith participant, ζil, is characterized by ζil=exp(δl+uiϕ+σψψi)1+exp(δl+uiϕ+σψψi) where δl is an intercept for the phase-shift of the lth harmonic function, ψi is an individual-level random effect with mean 0 and variance 1, and σψ is a SD of ψi. Furthermore, the harmonic function in (3) can be rewritten as

hi*(k)=l=1L[ail*cos(2πlk/K)+bil*sin(2πlk/K)], (4)

where ail*=φl*cos(2πlζil), bil*=φl*sin(2πlζil), and <ail* and bil*<. In addition, ail2+bil2=φl*2 for l = 1, … , L. Note that the harmonic function in (4) is the alternative form of (3) which has been used in the literature. The random effect ψi is assumed to be independent of τi, γij, bij, and ξi with ψi ~ N(0, 1).

In order to fairly compare the relative contribution of the circadian versus the stochastic component (i.e., for pi to be a meaningful quantity), we need to standardize the harmonic function hi*(k) so that it has comparable variation across time as the unit variance stochastic process bij. Thus, we define hi(k) as the standardized version,

hi(k)=hi*(k)01hi*(t)2dt=l=1Lφl122cos(2πl(k/K+ζil)), (5)

where t=k/K,φl=φl*2l=1Lφl*2, 0 <φl < 1, and lLφl=1. Further, from (4), the harmonic function can be also rewritten as

hi(k)=l=1L[ailcos(2πlk/K)+bilsin(2πlk/K)], (6)

where ail=φl122cos(2πlζil) and bil=φl122sin(2πlζil). Note that l=1L(ail2+bil2)=2 and 2<ail and bil<2. Our formulation extends the shape-invariant circadian model (Wang, Ke, and Brown 2003; Albert and Hunsberger 2005) to incorporate the stochastic short-term variation into the longitudinal process.

2.2. Hierarchical Centering Reparametrization

Due to the complexity of the model in (1), we propose to use the hierarchical centering reparameterization technique of Ibrahim, Chen, and Ryan (2000), which is particularly suitable for the implementation of the Markov chain Monte Carlo (MCMC) sampling for our problem. Ibrahim, Chen, and Ryan (2000) considered the hierarchical centering reparameterization technique of Gelfand, Sahu, and Carlin (1996) for a Poisson regression model with a latent AR(1) process for a time series of counts. They observed that the mixing and convergence of Gibbs sampler were improved. In addition, Kim et al. (2013) also proposed a reparameterization scheme that proves to enhance the convergence of Gibbs sampler for the hierarchical Poisson regression model incorporating overdispersion, heterogeneity, and serial correlation as well as a semiparametric mean structure. Based on the proposed model described by (1), (2), (5) and random effects such as τi, γij, bij, ξi, and ψi, we first tried a Gibbs sampler without the hierarchical centering reparameterization technique, and all the parameters converged very slowly. To improve convergence, we considered the reparameterization:

bij*=ziα+σττi+σγγij+σipihi(k)+σi(1pi)bij, (7)

where E(bij*)=ziα+σττi+σγγij+σipihi(k) and Var(bij*)=σi2(1pi)2 Using the hierarchical centering in (7), the proposed model in (1) can be rewritten as

yij~Poisson(exp(bij*))andbi1*=μi1+ϵi1,ϵi1~N(0,σi2(1pi)2),bij*μij=ρ(bi,j1*μi,j1)+ϵij,j2, (8)

where μij=ziα+σττi+σγγij+σipihi(k), ϵij~N(0,(1ρ2)σi2(1pi)2) for j ≥ 2, j = (m − 1)K+k, k = 1, … , K, m = 1, … ,M. The convergence for the data analysis and (Section 4) was much improved using this reparameterization. Let y = (y11, y12yI,MK)′, Z = (z1, z2, … , zI)′, X = (x1, x2, … , xI)′, W = (w1, w2, … , wI)′, and U = (u1, u2, … , uI)′. Also let δ = (δ1, δ2,… δL)′, φ = (φ1, φ2, … , φL)′, τ = (τ1, τ2, … τI)′, ξ = (ξ1, ξ2, … , ξI)′ ψ = (ψ1, ψ2, … , ψI), b*=(b11*,b12*,,bI,MK*), and γ = (γ11, γ12, … , γI,MK)′. Then, we let Dobs (y, Z, X,W,U) and D = (y, Z, X,W, τ, ξ, ψ, b, γ) denote the observed and complete data, respectively. The complete data likelihood function can be written as

L(α,β,θ,ϕ,δ,φ,ρ,στ,σξ,σψ,σγ|D)=i=1Im=1Mk=1Kexp[yijbij*exp(bij*)log(yij!)]×i=1Im=1Mk=1KN(bij*|μij+ρ(bi,j1*μi,j1),(1ρ2)σi2(1pi)2)×i=1Im=1Mk=1KN(rij|0,1)×i=1IN(τi|0,1)×N(ξi|0,1)×N(ψi|0,1), (9)

where μij is given in (8) and j = (m − 1)K + k. Further, we obtain the observed likelihood function after integrating out all random effects such as τ, ξ, ψ, γ, and b* from the complete-data likelihood function in (9).

3. Posterior Inference

3.1. Prior and Posterior Distributions

We consider a joint prior distribution for (α, β, θ, ϕ, δ, φ, ρ, στ, σξ, σψ, σγ). We assume that the joint prior for (α, β, θ, ϕ, δ, φ, ρ, στ, σξ, σψ, σγ) is of the form π(α, β, θ, ϕ, δ, φ, ρ, στ , σξ, σψ, σγ) = π(α)π(β)π(θ)π(ϕ)π(δ|φ)π(φ) π(ρ)π(στ)π(σξ)π(σψ)π(σγ). We first specify the prior distribution for the amplitude φl using a stick-breaking representation: φl=υlh=1l1(1υh), where vl ~ Beta(a0, b0) for l < L and vL = 1, which ensures that lLφl=1. Note that stick-breaking prior is commonly used in Bayesian nonparametric modeling such as the Dirichlet process, and Beta(a, b) denotes the beta distribution with shape a and scale b. We also specify a shrinkage prior for the phase-shift, δl:δl~N(0,φlσδ2) and σδ2~Inv-Ga(a1,b1). Furthermore, we assume that

α~Nq1(0,c1Iq1),β~Nq2(0,c2Iq2),θ~Nq3(0,c3Iq3),ϕ~Nq4(0,c4Iq4),στ~ Ga (a2,b2),σξ~ Ga (a3,b3),σψ~ Ga (a4,b4),σγ~ Ga (a5,b5), and ρ~ Unif (0,1), (10)

where l = 1, … , L, Unif(a, b) denotes the uniform distribution with ranged (a, b), Ga(a, b) denotes the gamma distribution with mean a/b, Inv-Ga(a, b) denotes the inverse gamma distribution with shape a and scale b, and c1, c2, c3, c4, a0, b0, a1, b1, a2, b2, a3, b3, a4, and b5 are the prespecified hyperparameters. In this article, we used c1 = 100, c2 = 100, c3 = 100, c4 = 100, a0 = 1, b0 = 1, a1 = 1, b1 = 1, a2 = 1, b2 = 0.1, a3 =1, b3 =0.1, a4= 1, b4 = 0 1, a5 = 0.1, and b5 = 0.1 These choices ensure that the prior distribution for (α,β,θ,ϕ,δ,φ,ρ,στ,σξ,σψ,σγ,σδ2) is nearly non-informative. Let v = (v1, v2, … , vL)’. Based on the prior distributions in (10), the joint posterior distribution of α, β, θ, ϕ, δ, v, ρ, στ, σξ, σψ, σγ, and σδ2 based on the complete data D is given by

π(α,β,θ,ϕ,δ,υ,ρ,στ,σξ,σψ,σγ,σδ2|D)L(α,β,θ,ϕ,δ,υ,ρ,στ,σξ,σψ,σγ|D)×π(α)π(β)π(θ)π(ϕ)π(δ|φ,σδ2)π(υ)×π(ρ)π(στ)π(σξ)π(σψ)π(σγ)π(σδ2), (11)

where L(α, β, θ, ϕ, δ, v, ρ, στ , σξ, σψ, σγ |D) is defined in (9). We note that by using the stick-breaking prior, we sample vl from its conditional posterior distribution without constraints, thereby obtaining the posterior estimates of φl. Furthermore, we incorporate a shrinkage prior for δl that shrinks the variance of the prior distribution toward zero when the corresponding value of φl is near zero, creating stable estimation in situations where there is very little information about δl. A description of the MCMC algorithm is given in the Section 3.3.

3.2. Model Comparison

The model formulation in (1) includes a random effect γij that induces overdispersion relative to the model without this effect. We use the deviance information criterion (DIC) developed by Spiegelhalter et al. (2002) to determine whether this extra source of variation is necessary in our application. Due to the nature of the latent variables and random effects, it is not easy to integrate out τi, γij, ξi, ψi, and bij analytically from model (1). Although the analytically intractable integrals may be evaluating through numerical integration or Monte Carlo methods, these methods are computationally expensive due to the high dimension integrations. Instead of integrating out all random effects to compute the DIC, we let ςij = στ τi + σγ γij + σi(pihi(k) (1 − pi)bij), which can not be observed. Similar to an extension of a version of the DIC discussed in Huang, Chen, and Ibrahim (2005) for generalized linear models with missing covariates, we here treated the ςij as a parameter. Let = (α, ς), where ς = (ς11, ς12, … , ςI,MK)’. We define the deviance function as

Dev(Ω)=2logL*(α,ς|Dobs), (12)

where

L*(α,ς|Dobs)=i=1Im=1Mk=1Kexp[yij(ziα+ςij)exp(ziα+ςij)log(yij!)].

The DIC is given by

DIC=Dev(Ω¯)+2pD, (13)

Where pD=Dev¯(Ω)Dev(Ω¯), Dev¯(Ω) is the posterior mean of Dev(), and Ω¯ is the posterior mean of . In (13), the first term of the DIC measures the goodness-of-fit, and pD is the effective number of model parameters. Using the extension to the DIC as proposed by Huang et al. (2005) in the presence of missing covariates, we compute α¯=E(α|Dobs), ςij¯=E(ςij|Dobs), and Dev¯(Ω)=E[Dev(Ω)|Dobs]. This way of computing the DIC is possible because we have values of ςij at each MCMC iteration and that given ς, no other parameters except α are needed. The DIC in (13) is a Bayesian measure of fit or adequacy with 2pD being the respective dimension penalty term. The smaller the DIC value, the better the model fits the data. Other properties of the DIC can be found in Spiegelhalter et al. (2002) and Huang et al. (2005).

3.3. Computational Development

We present the detailed development of the MCMC sampling algorithm only for full model in (8) as the model without random effects for overdispersion have similar computational developments. Although the analytic evaluation of the joint posterior distribution of (α,β,θ,ϕ,δ,υ,ρ,στ,σξ,σψ,σγ,σδ2,τ,ξ,ψ,b*,γ) given in (11) is not possible, the proposed model allows us to develop an efficient MCMC sampling algorithm to sample from (11). We here briefly discuss how to sample from each of the conditional posterior distributions. (I) Observe that

τi|α,β,θ,ϕ,δ,υ,ρ,στ,σξ,σψ,σγ,σδ2,ξ,ψ,b*,γ,Dobs~N(Aτi1Bτi,Aτi1), (14)

where Aτi=1+mk(1ρ)στ2(1+ρ)σi2(1pi)2 and Bτi=mkστbija(1+ρ)σi2(1pi)2 with bija=bij*ρbi,j1*σγ(γijργi,j1)(1ρ)(ziα+σipihi(k)). (II) Observe that

γij|α,β,θ,ϕ,δ,υ,ρ,στ,σξ,σψ,σγ,σδ2,τ,ξ,ψ,b*,Dobs~N(Aτi1Bγij,Aγij1), (15)

where Aγij=(1+ρ2)σγ2(1ρ2)σi2(1pi)2+1 and Bγij=σγ(μγij+σγργi,j1ρ(μγi,j+1σγγi,j+1))(1ρ2)σi2(1pi)2 with μγij=bij*ρbi,j1*(1ρ)(ziα+σττi+σipihi(k)). (III) The conditional posterior density for [b*|α,β,θ,ϕ,δ,υ,ρ,στ,σξ,σψ,σγ,σδ2,τ,ξ,ψ,γ,Dobs] has the form

[bij*|α,β,θ,ϕ,δ,υ,ρ,στ,σξ,σψ,σγ,σδ2,τ,ξ,ψ,γ,D obs ]exp[yijbij*exp(bij*)]×exp[(bij*μijρ(bi,j1*μi,j1))22(1ρ2)σi2(1pi)2]×exp[(bi,j+1*μi,j+1ρ(bi,j*μi,j))22(1ρ2)σi2(1pi)2], (16)

where μij is given in (8). The expression in (16) is log-concave and we use the adaptive rejection algorithm of Gilks and Wild (1992) to sample bij* from [bij*|α,β,θ,ϕ,δ,υ,ρ,στ,σξ,σψ,σγ,σδ2,τ,ξ,ψ,γ,Dobs]. (IV) The conditional posterior density for [σγ|α,β,θ,ϕ,δ,υ,ρ,στ,σξ,σψ,σδ2,τ,ξ,ψ,b*,γ,Dobs] has the form

[σγ|α,β,θ,ϕ,δ,φ,ρ,στ,σξ,σψ,σδ2,τ,ξ,ψ,b*,γ,Dobs]i=1Im=1Mk=1Kexp[(bij*μijρ(bi,j1*μi,j1))22(1ρ2)σi2(1pi)2]×(σγ)a51exp(b5σγ), (17)

where μij is given in (8). The expression in (17) is log-concave and we use the adaptive rejection algorithm of Gilks and Wild (1992) to sample σγ from [σγ|α,β,θ,ϕ,δ,φ,ρ,στ,σξ,σψ,σδ2,τ,ξ,ψ,b*,γ,Dobs]. (V) The conditional posterior density for [στ|α,β,θ,ϕ,δ,υ,ρ,σξ,σψ,σγ,σδ2,τ,ξ,ψ,b*,γ,Dobs] has the form

[στ|α,β,θ,ϕ,δ,υ,ρ,σξ,σψ,σγ,σδ2,τ,ξ,ψ,b*,γ,Dobs]i=1Im=1Mk=1Kexp[(bij*μijρ(bi,j1*μi,j1))22(1ρ2)σi2(1pi)2]×(στ)a21exp(b2στ), (18)

where μij is given in (8). The expression in (18) is log-concave and we use the adaptive rejection algorithm of Gilks and Wild (1992) to sample στ from [στ|α,β,θ,ϕ,δ,υ,ρ,σξ,σψ,σγ,σδ2,τ,ξ,ψ,b*,γ,Dobs]. (VI) Observe that

α|β,θ,ϕ,δ,υ,ρ,στ,σξ,σψ,σγ,σδ2,τ,ξ,ψ,b*,γ,Dobs~N(Aα1bα,Aα1), (19)

where Aα=c1Iq1+imk(1ρ)zizi(1+ρ)σi2(1pi)2 and bα=imkzibijb(1+ρ)σi2(1pi)2 with bijb=bij*ρbi,j1*σγ(γijργi,j1)(1ρ)(σττi+σipihi(k)). (VII) We apply the collapsed Gibbs technique of Liu (1994) via the following identity:

[δ,σδ2|α,β,θ,ϕ,υ,ρ,στ,σξ,σψ,σγ,τ,ξ,ψ,b*,γ,Dobs]=[σδ2|δ,α,β,θ,ϕ,υ,ρ,στ,σξ,σψ,σγ,τ,ξ,ψ,b*,γ,Dobs]×[δ|α,β,θ,ϕ,υ,ρ,στ,σξ,σψ,σγ,τ,ξ,ψ,b*,γ,Dobs]. (20)

That is, we sample δ after collapsing on σδ2. Observe that

σδ2|δ,α,β,θ,ϕ,υ,ρ,στ,σξ,σψ,σγ,τ,ξ,ψ,b*,γ,Dobs~Inv-Ga(a1+L2,b1+12l=1Lδl2φl), (21)

where φl=υlh=1l1(1υh). Finally, (VIII) the conditional posterior densities for β, θ, ϕ, δ, v, ρ, σξ, σψ, ξ, and ψ do not have closed form. We use the Metropolis–Hastings algorithm (Hastings 1970) to sample β, θ, ϕ, δ, v, ρ, σξ, σψ, ξ, and ψ from their conditional posterior distributions.

4. Simulation Study

We perform a simulation study to illustrate the proposed model. The simulation is constructed to closely match the data analysis in Section 5. Specifically, we simulate data with the same number of subjects (I = 89) and follow-up times as for the application, and we fix the covariates at their observed values. Furthermore, we incorporate covariates that were significant in the application, and evaluate the performance of the model with L = 7, which corresponds to the model fit to the application data where the amplitude values for φl are close to zero for l > 7. We generate random effects as τi ~ N(0, 1), γij ~N(0, 1), ξi ~N(0, 1), and ψi ~ N(0, 1). In addition, we generate the individual stochastic variation, bij, as in (2). True parameter values were chosen close to the ones estimated in the analysis and 500 simulated datasets were generated. The prior distributions used were those specified in Section 3.1. We used 20,000 Gibbs samples to compute all the posterior estimates, including the mean and 95% highest posterior density (HPD) intervals after a burn-in of 10,000 iterations.

Table 1 presents mean of the posterior means (Mean), standard deviation of the posterior means (SD), and the coverage probability (CP) of the 95% HPD intervals for the simulation study. The estimates for all parameters in Table 1 are close to the true values, and are therefore nearly unbiased. In addition, the coverage rates for 95% HPD intervals are close to 95%.

Table 1.

Simulation results for the model with L = 7.

Component Variable Parameter True Mean SD CPof95%HPD
Mean Intercept α1 6.73 6.729 0.052 95.6
Obesity α2 −0.12 −0.120 0.055 95.2
Scale (σi) Intercept θ1 1.28 1.275 0.006 94.6
Obesity θ2 0.02 0.020 0.006 94.8
Race1 θ3 −0.03 −0.031 0.008 96.2
Race2 θ4 0.01 0.010 0.008 95.0
Family affluence θ5 0.03 0.030 0.007 96.4
Parent education θ6 −0.03 −0.030 0.008 96.2
Mixing (pi) Harmonic (hi(k)) Intercept β1 −0.03 −0.042 0.040 95.4
Intercept δ1 −0.90 −0.892 0.026 95.4
δ2 0.17 0.174 0.027 95.2
δ3 −0.48 −0.470 0.037 95.0
δ4 0.05 0.054 0.029 95.6
δ5 0.31 0.312 0.030 95.4
δ6 -0.11 −0.096 0.043 96.6
δ7 0.10 0.084 0.036 96.6
Obesity ϕ1 −0.10 −0.102 0.026 95.6
Parent education ϕ2 0.13 0.130 0.028 96.0
φ1 0.80 0.806 0.007 95.4
φ2 0.158 0.156 0.007 97.4
φ3 0.009 0.010 0.002 96.8
φ4 0.019 0.018 0.002 94.8
φ5 0.009 0.008 0.002 96.0
φ6 0.003 0.003 0.001 94.4
φ7 0.002 0.002 0.001 95.0
σγ 1.11 1.110 0.009 95.8
στ 0.47 0.472 0.043 95.6
σξ 0.39 0.397 0.032 95.2
σψ 0.25 0.253 0.022 94.2
ρ 0.79 0.790 0.004 97.8
σδ2 8.278 1.612

5. Physical Activity in Teenage Girls

We used the proposed latent variable Poisson model (1) to analyze the longitudinal physical activity count data discussed in Section 1. The analytic goal was to investigate the circadian pattern as well as the proportion of circadian versus stochastic short-term trend (degree of irregularity in the pattern across time) across participants in these data.

The response variable Yij is a physical activity count over a 15-minute period for the ith individual at the jth time interval. We consider four covariates for investigation: body mass index (BMI), race, family affluence score (FAS), and parental education (PEd). BMI is dichotomized (1 if BMI ≥ 30 kg/m2 (obese); 0 otherwise (no obese)), Race is also dichotomized ((Race1, Race2) = (0, 0) for Hispanic, (1,0) for African American, and (0,1) for Others). The covariate PEd was reported using a seven point scale (1 if less than a high school diploma; 2 if high school diploma; 3 if GED; 4 if some college or technical school; 5 if associates degree; 6 if bachelors degree; or 7 if graduate degree) from the highest level of education completed by the first guardian in the household. The adolescent provided an estimate of family socioeconomic status using the FAS, which is a validated measure of socioeconomic status. The possible scores of FAS are from 0 to 9. The covariates PEd and FAS were treated as continuous.

Two subjects were removed from the analysis that had long periods of zero or nearly zero counts which were not detected by the watch’s nonwear algorithm (See Figure S in Section S1 of the supplementary materials). In total, we have I = 89 subjects with complete covariate information, M = 7 days, K = 96 observations for each day, and J = 672 observations for each subject. Table 2 presents some descriptive statistics for the 89 participants included in the analysis: number of observations and percent for categorical variables, mean and standard deviation (SD) for continuous variables. Since all participating girls were measured in the summer of the 9th grade, it was anticipated that most individuals would have a nonconstrained schedule since they were unlikely to have summer jobs (although these data were not explicitly collected). Also, Figure 2 (a) shows little variation across the days (Monday to Sunday) suggesting that a day of the week effect is not necessary.

Table 2.

Descriptive statistics of the physical activity count data (I = 89).

Characteristic N (%) or Mean (SD)
Obesity
 0: No obese (BMI < 30) 74 (83.15)
 1: Obese (BMI ≥ 30) 15 (16.85)
Race
 1: Hispanic 33 (37.08)
 2: African American 23 (25.84)
 3: Other 33 (37.08)
Family Affluence Score 5.05 (1.57)
Parental Education 3.71 (1.78)

To help the numerical stability and to improve convergence for the MCMC sampling algorithm, we standardized all covariates. The means and standard deviations are (0.169, 0.377) for obesity, (0.258, 0.440) for race1, (0.371, 0.486) for race2, (5.045,1.566) for PAS, and (3.708, 1.779) for PEd, respectively. In all the Bayesian computations, we used 20,000 Gibbs samples, which were taken from every fifth iteration, after a burn-in of 4000 iterations for each model, to compute all the posterior estimates, including means (Estimates), SDs, 95% HPD intervals, as well as the DICs. The computer programs were written in FORTRAN 95 using IMSL subroutines with double precision accuracy. The convergence of the MCMC sampling algorithm for all the parameters was checked based on the recommendations of Cowles and Carlin (1996). All trace and autocorrelation plots showed good convergence and mixing of the MCMC sampling algorithm (See trace plots in Section S3 of the supplementary materials).

We fit the model with different numbers of harmonic functions (L = 5, 10, 15, and 20) to characterize the circadian pattern in physical activity and to examine the sensitivity of inferences to the size of L. The posterior estimates, including the means, SDs, and 95% HPD intervals of the parameters for the model with L = 10 are reported in Tables 3 and 4. The posterior estimates for the model with L = 5, 15, and 20 were similar and the results are given in Tables in Section S2 of the supplementary materials. Furthermore, the posterior estimates of φl for model with L = 10, 15, and 20 are close to zero for l > 7, suggesting that the model with L = 7 is enough to fully characterize the circadian pattern in physical activity. The model allows us to examine the differing effects of covariates on the overall mean structure (ziα), the shape-invariant scale function (σi), the phase-shift (ζil), and the mixing parameter (pi).

Table 3.

Posterior estimates of the parameters for the model with L = 10.

Component Variable Parameter Posterior
mean
Posterior
SD
95% HPD
interval
Mean Intercept α1 6.7298 0.0551 (6.6201, 6.8362)
Obese α2 −0.1244 0.0572 (−0.2379, −0.0126)
Race1 α3 0.0577 0.0662 (−0.0708, 0.1885)
Race2 α4 0.0381 0.0706 (−0.0963, 0.1813)
Family affluence α5 −0.0152 0.0626 (−0.1347, 0.1111)
Parent education α6 0.0410 0.0651 (−0.0880, 0.1655)
Scale (σi) Intercept θ1 1.2779 0.0076 (1.2625,1.2924)
Obese θ2 0.0172 0.0074 (0.0032, 0.0322)
Race1 θ3 −0.0279 0.0086 (−0.0445,−0.0108)
Race2 θ4 0.0053 0.0091 (−0.0128, 0.0228)
Family affluence θ5 0.0309 0.0080 (0.0155, 0.0468)
Parent education θ6 −0.0305 0.0082 (−0.0462, −0.0146)
Mixing (pi) Intercept β1 −0.0283 0.0442 (−0.1139, 0.0580)
Obese β2 −0.0597 0.0458 (−0.1490, 0.0313)
Race1 β3 −0.0430 0.0540 (−0.1471, 0.0642)
Race2 β4 0.0062 0.0568 (−0.1079, 0.1163)
Family affluence β5 0.0032 0.0515 (−0.0971, 0.1042)
Parent education β6 0.0193 0.0527 (−0.0825, 0.1263)
Harmonic (hi(k)) Obese ϕ1 −0.0948 0.0284 (−0.1509, −0.0387)
Race1 ϕ2 0.0148 0.0331 (−0.0491, 0.0805)
Race2 ϕ3 0.0538 0.0348 (−0.0154, 0.1209)
Family affluence ϕ4 −0.0520 0.0317 (−0.1147, 0.0098)
Parent education ϕ5 0.1248 0.0329 (0.0602, 0.1903)
σγ2 1.1109 0.0124 (1.0867,1.1349)
στ 0.4688 0.0445 (0.3865, 0.5590)
σξ 0.3920 0.0350 (0.3275, 0.4631)
σψ 0.2488 0.0214 (0.2071, 0.2900)
ρ 0.7895 0.0059 (0.7778, 0.8008)
σδ2 9.0746 7.1222 (1.8304,21.5375)

Table 4.

Posterior estimates of the parameters for the harmonic function, hi(k), for the model with L = 10.

Para-
meter
Posterior
mean
Posterior
SD
95% HPD
interval
Para-
meter
Posterior
mean
Posterior
SD
95% HPD
interval
δ1 −0.90308 0.03017 (−0.96065, −0.84191) φ1 0.80039 0.00950 (0.78096,0.81817)
δ2 0.17105 0.02982 (0.11465,0.23100) φ2 0.157 55 0.00847 (0.14109,0.17412)
δ3 −0.48327 0.03980 (−0.56168, −0.40469) φ3 0.00942 0.00220 (0.00524,0.01370)
δ4 0.04540 0.03172 (−0.01762,0.10663) φ4 0.01866 0.00331 (0.01228,0.02508)
δ5 0.30807 0.03308 (0.24583,0.37524) φ5 0.00944 0.00234 (0.00497,0.01405)
δ6 -0.11175 0.04004 (−0.19221, −0.03294) φ6 0.00249 0.00124 (0.00044,0.00499)
δ7 0.09726 0.04458 (0.00758,0.18547) φ7 0.00170 0.00109 (1.73 × 10−11,0.00374)
δ8 0.00326 0.07515 (−0.20146,0.20229) φ8 0.00018 0.00037 (9.11 × 10−11,0.00096)
δ9 0.00477 0.02816 (−0.04444,0.06882) φ9 0.00009 0.00017 (7.58 × 10−11,0.00041)
δ10 0.00283 0.03929 (−0.08496,0.08826) φ10 0.00008 0.00018 (3.01 × 10−11,0.00037)

From Table 3, mean physical activity was only influenced by obesity, where heavier teenage girls had lower overall levels. The scale parameter (σi), which reflects the amplitude or inherent variation of the circadian/stochastic process, significantly increased with obesity and family wealth (FAS), and decreased for African Americans and with parental education (PEd). The phase-shift (ζil) was shifted later for obese participants compared with their nonobese counterparts, and earlier with increasing parental education. With regard to the relative contribution of circadian to stochastic short-term trend, 95% HPD intervals include zero for all covariates, which demonstrates little or no effect of the covariates. However, the variation across individuals, although not explained by these covariates, was large (i.e., the 5th and 95th percentiles of pi are 0.31 to 0.68, respectively). This confirms the observed individual variation in the regularity of circadian patterns shown in Figure 1. The posterior estimate of ρ (a serial correlation for bij) is 0.7895, demonstrating a strong dependence structure in the stochastic variation characterizing the stochastic short-term trend. Furthermore, from Table 4, the posterior estimates of φ for the model with L = 10 are close to zero for l > 7 (the same results for L = 15 and 20), and 95% HPD intervals for δ include zero for l = 4 and l > 7, demonstrating that including addition harmonic terms beyond 7 is not necessary for this application.

As a sensitivity analysis, we also fit a model without overdispersion. We did this to (i) examine whether overdispersion is important in the modeling framework; (ii) provide a sensitivity analysis for examining regression effects under a potentially misspecified model; (iii) examine the effect of excluding this variation term on inferences about the mixing parameters pi since the overdispersion adds additional independent variation that is not included in the stochastic short-term trend bij. Based on the DIC, the model with overdispersion showed a substantially better fit than the reduced model without overdispersion, indicating the importance of including this parameter in our formulation. DIC for the model with L = 10 are reported in Table 5. Specifically, for the model that did not account for overdispersion, we found that the value of the DIC (L = 10) is 629088.97, which is higher than the DIC for the proposed model. Furthermore, the model with L = 5, 15, and 20 had similar results (data not shown). In addition, we found similar results for pi with both models. Specifically, we found that no covariates influenced pi, and that the 5th and 95th percentiles of pi were 0.32 and 0.58, respectively, for the model without overdispersion which is close to the values reported previously for the model with over-dispersion. Further, parameter estimates for the other regression coefficients were similar between the two models (data not shown).

Table 5.

DIC Values for the model with L = 10.

Model Dev (Ω¯) pD DIC
Overdispersion 513686.81 57659.52 629005.85
No overdispersion 513777.54 57655.72 629088.97

Figure 3 shows the posterior estimates of the circadian pattern, σihi(k), in physical activity over a 24-hour period and presents both the population-averages of the posterior estimates as well as individual plots of five students chosen at random for the model with overdispersion. The plots demonstrate a circadian rhythm with relatively constant physical activity during the day and decreased physical activity during the sleeping hours. Additionally, the plot demonstrates sizable between-subject variation in the circadian pattern across students, with the variation manifesting itself in amplitude as well as phase-shift variation. Figure 4(a) is a plot of residuals against the fitted values, defined as exp(λ^ij), where λ^ij is the posterior mean of λij, defined in (1). There are no discernible patterns in this residual plot, which suggests that the model fits the data well. In addition, Figure 4 (b) presents an ACF plot of the residuals and shows that the residuals are nearly independent across time, demonstrating that the proposed model describes the dependence structure well. Figure 5 shows a plot of the estimated overall mean over time, E(exp(λij)) in (1) obtained from the posterior samples of the parameters. This plot adjusts for obesity, race, FAS, and PEd in the mean and variance structure, and takes full advantage of the specification of our flexible model structure. The figure demonstrates a pronounced circadian pattern over the 7-day followup that is close to the one described in Figure 2. Figure 6 shows the posterior estimates of the mixing parameter pi. The figure shows that the participants have a large range in pi with values ranging from 0.2283 to 0.669 among the 89 participants. Interestingly, since a majority of the estimated pi’s are below0.5, these results demonstrate that, for a majority of participants, the short-term stochastic trend dominates the circadian pattern.

Figure 3.

Figure 3.

Plots of the posterior mean for five individual circadian patterns, σihi(k), for the model with L = 10. The overall curve defined by setting the random effects in hi(k) to zero, is also presented. Each tick represents a 4-hour period. Each point reflects a 15 minute with the first point representing the count from 12:00 am to 12:15 am.

Figure 4.

Figure 4.

Residual plots for the model with L = 10: (a) residuals versus fitted values (the solid line represent a lowess smooth curve). Fitted values are defined as the posterior mean of exp(λij); (b) autocorrelation where the lag term is in units of 15 minutes.

Figure 5.

Figure 5.

Estimated overall mean trajectory over 7 days for the model with L = 10. Each tick represents a 4-hour period. Each point reflects a 15 minute with the first point representing the count from 12:00 am to 12:15 am on the first day.

Figure 6.

Figure 6.

Plots of individual estimated mixing parameters with L = 10.

6. Discussion

The focus of this article was on characterizing the heterogeneity in the degree of circadian versus stochastic variation (irregular short-term trend) in longitudinal physical activity counts across time among teenage girls. This new formulation allows investigators to examine the effect of covariates on multiple features of these complex patterns including: (i) overall mean structure;(ii) scale or amplitude parameter of the circadian pattern; (iii) phase-shift of the circadian pattern; and (iv) the degree of circadian versus short-term trend manifested by a stochastic process. Others (Wang, Ke, and Brown 2003; Albert and Hunsberger 2005) have considered shape-invariant formulations for continuous longitudinal data that have addressed (i)–(iii). The assessment of the degree of circadian versus short-term trend was motivated by the analysis of physical activity data where we saw some individuals that showed pronounced circadian patterns while others showed clear departures from these patterns. Incorporating this variation presented unique methodological challenges that, for computational efficiency, were addressed from a Bayesian perspective.

The application of Bayesian methodology to this application has some novel aspects. First, we use a stick-breaking representation for estimating the amplitudes, φl. This approach, which has been primarily used in nonparametric Bayes methodology such as a Dirichlet process, allows us to avoid inherent constraints in sampling from the conditional posterior distribution. Second, in order to avoid estimation instability in estimating the phase-shifts δl associated with amplitudes that are small, we proposed a shrinkage prior with variance φlσδ2, whereby the prior distribution is heavily centered at zero in situations where the amplitudes are small and there is very little information about the phase-shift. Third, a hierarchical centering reparametrization was employed in order to improve convergence of Gibbs sampler.

The analysis demonstrated some interesting findings including that obesity was associated with a decreased overall mean structure, an increased shape or amplitude parameter, and a delay in the phase of the circadian pattern among teenage girls. Interestingly, although we saw marked variation among individuals in the proportion of stochastic variation that can be explained by the circadian pattern (circadian pattern versus stochastic short-term trend), this variation was not explained by any of the known demographic covariates. It is possible that this variation may be explained by other biologically based markers that were not available to us. It is also possible that large or small values in this proportion may be indicative of subsequent teenage or early adult behavior. This is an area of future research.

The model was developed for longitudinal physical activity count data, and to directly answer analytic questions for this physical activity study. However, the model and estimation procedures can be adapted to other generalized linear model outcomes quite simply. We assume Gaussian random effects in this formulation which describe these data well. However, for other applications more flexible random effect distributions can be used to more flexibly model individual and within subject variation across time. Although the shape-invariant parameterization is appealing in its model parsimony, it may not be appropriate in situations where individuals may have very different underlying patterns from each other. Alternative parameterizations may be needed in this case.

The model distinguishes between circadian and stochastic variation, which requires relatively long sequences of data on each individual. In fact, we could not distinguish between these two sources of variation without observing more than one daily cycle of longitudinal count data. Fortunately we can distinguish heterogeneity in the sources of variation with the seven days of follow-up observed in this study.

Supplementary Material

Supplemental

Acknowledgments

Funding

This research was supported by the intramural research program of the National Cancer Institute.

Footnotes

Supplementary materials for this article are available online. Please go to www.tandfonline.com/r/JASA.

References

  1. Albert PS, Follmann DA, Wang SA, and Suh EB (2002), “A Latent Autoregressive Model for Longitudinal Binary Data Subject to Informative Missingness,” Biometrics, 58, 631–642. [ 993] [DOI] [PubMed] [Google Scholar]
  2. Albert PS, and Hunsberger S (2005), “On Analyzing Circadian Rhythms Data Using Nonlinear Mixed Models with Harmonic Terms,” Biometrics, 61, 1115–1120. [ 992,, 993,, 995,, 1000] [DOI] [PubMed] [Google Scholar]
  3. Brown EN, and Czeisler CA (1992), “The Statistical Analysis of Circadian Phase and Amplitute in Constant-routine Core-temperature Data,” Journal of Biological Rhythms, 7, 177–202. [ 992] [DOI] [PubMed] [Google Scholar]
  4. Chen M-H, and Ibrahim JG (2000), “Bayesian Predictive Inference for Time Series Count Data,” Biometrics, 56, 678–685. [ 993] [DOI] [PubMed] [Google Scholar]
  5. Cowles C, and Carlin BP (1996), “Markov Chain Monte Carlo Convergence Diagnostics: A Comparative Review,” Journal of the American Statistical Association, 91, 883–904. [ 998] [Google Scholar]
  6. Gelfand AE, Sahu SK, and Carlin BP (1996), “Efficient Parametrisations for Generalized Linear Mixed Models” (with discussion), in Bayesian Statistics 5, eds. by Bernardo JM, Berger JO,Dawid AP and Smith AFM, Oxford, Oxford University Press; [995] [Google Scholar]
  7. Gilks WR, and Wild P (1992), “Adaptive Rejection Sampling for Gibbs Sampling,” Applied Statistics, 41, 337–348. [ 996,, 997] [Google Scholar]
  8. Greenhouse J, Kass R, and Tsay R (1987), “Fitting Nonlinear Models with ARMA Errors to Biological Rhythm Data,” Statistics in Medicine, 55, 239–244. [ 992] [DOI] [PubMed] [Google Scholar]
  9. Harvey AC (1989), Forecasting, Structural Time Series Models and the Kalman Filter, Cambridge: Cambridge University Press; [993] [Google Scholar]
  10. Hastings WK (1970), “Monte Carlo Sampling Methods Using Markov Chains and Their Applications,” Biometrika, 57, 97–109. [ 997] [Google Scholar]
  11. Hay JL, and Pettitt A (2001), “Bayesian Analysis of Time Series of Counts,” Biostatistics, 2, 433–444. [ 993] [DOI] [PubMed] [Google Scholar]
  12. Hu K, Scheer FA, Ivanov PC, Buijs RM, and Shea SA (2007), “The Suprachiasmatic Nucleus Functions Beyond Circadian Rhythm Generation,” Neuroscience, 149, 508–517. [ 992] [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Huang L, Chen M-H, and Ibrahim JG (2005), “Bayesian Analysis for Generalized Linear Models with Nonignorably Missing Covariates,” Biometrics, 61, 767–780. [ 996] [DOI] [PubMed] [Google Scholar]
  14. Huettel SA, Song AW, and McCarthy G (2004), Functional Magnetic Resonance Imaging, Sunderland, UK: Sinauer Associates; [993] [Google Scholar]
  15. Hurtgen H, and Gervini D (2009), “Semiparametric Shape-invariant Models for Periodic Data,” Journal of Applied Statistics, 36, 1055–1065. [ 993] [Google Scholar]
  16. Ibrahim JG, Chen M-H, and Ryan LM (2000), “Bayesian Variable Selection for Time Series Count Data,” Statistica Sinica, 10, 971–987. [ 995] [Google Scholar]
  17. Indic P, Murray G, Maggini C, Amore M, Meschi T, Borghi L, Baldessarini RJ, and Salvatore P (2012), “Multi-scale Motility Amplitude Associated with Suicidal Thoughts in Major Depression,” PLoS One, 7, e38761 [ 992] [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Karim MR, and Zeger SL (1992), “Generalized Linear Models with Random Effects-salamander Mating Revisited,” Biometrics, 48, 631–644. [ 993] [PubMed] [Google Scholar]
  19. Kim S, Chen Z, Zhang Z, Simons-Morton BG, and Albert PS (2013), “Bayesian Hierarchical Poisson Regression Models, An Application to a Driving Study with Kinematic Events,” Journal of the American Statistical Association, 108, 495–503. [ 993,, 995] [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Kushida C, Chang A, Gadkary C, Guilleminault C, Carrillo O, and Dement W (2001), “Comparison of Actigraphic, Polysomnographic, and Subjective Assessment of Sleep Parameters in Sleep-disordered Patients,” Sleep Medicine, 2, 389–396. [ 992] [DOI] [PubMed] [Google Scholar]
  21. Lawton WH, and Sylvestre EA (1971), “Self-Modeling Curve Resolution,” Technometrics, 12, 617–633. [ 993] [Google Scholar]
  22. Lawton WH, Sylvestre EA, and Maggio MS (1972), “Self-Modeling Linear Regression,” Technometrics, 13, 513–532. [ 993] [Google Scholar]
  23. Lindstrom MJ (1995), “Self Modeling with Random Shift and Scale Parameters and Free-knot Spline Shape Function,” Statistics in Medicine, 14, 2009–2021. [ 993] [DOI] [PubMed] [Google Scholar]
  24. Liu JS (1994), “The Collapsed Gibbs Sampler in Bayesian Computations with Applications to a Gene Regulation Problem,” Journal of the American Statistical Association, 89, 985–966. [ 997] [Google Scholar]
  25. McCulloch CE, R. S, and Neuhaus JM (2008), Generalized, Linear, andMixed Models, 2nd ed New York: Wiley; [993] [Google Scholar]
  26. Ogbagaber S, Albert P, Lewin D, and Iannotti RJ (2012), “Summer Activity Patterns Among Teenage Girls: Harmonic Shape Invariant Modeling to Estimate Circadian Cycles,” Journal of Circadian Rhythms, 10, 2 [ 992,, 994] [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Ramsay JO, and Silverman BW (2005), Functional Data Analysis (Second Edition), New York: Springer; [993] [Google Scholar]
  28. Smith JQ (1979), “A Generalization of the Bayesian Forecasting Model,” Journal of Royal Statistical Society, Series B, 41, 375–387. [ 993] [Google Scholar]
  29. Spiegelhalter DJ, Best NG, Carlin BP, and van der Linde A (2002), “Bayesian Measures of Model Complexity and Fit” (with discussion), Journal of Royal Statistical Society, Series B, 64, 583–639. [ 996] [Google Scholar]
  30. Wang Y, and Brown MB (1996), “A Flexible Model for Human Circadian Rhythms,” Biometrics, 52, 588–596. [ 992,, 993] [PubMed] [Google Scholar]
  31. Wang Y, Ke C, and Brown MB (2003), “Shape-Invariant Modeling of Circadian Rhythms with Random Effects and Smoothing Spline ANOVA Decompositions,” Biometrics, 59, 804–812. [ 992,, 993,, 995,, 1000] [DOI] [PubMed] [Google Scholar]
  32. Zeger SL (1988), “A Regression Model for Time Series of Counts,”Biometrika, 75, 621–629. [ 993] [Google Scholar]
  33. Zeger SL, and Diggle PJ (1994), “Semi-parametric Models for Longitudinal Data with Application to CD4 cell Numbers in HIV Seroconverters,” Biometrics, 50, 689–699. [ 993] [PubMed] [Google Scholar]
  34. Zhang D, Lin X, Raz J, and Sowers MF (1998), “Semiparametric Stochastic Mixed Models for Longitudinal Data,” Journal of the American Statistical Association, 93, 710–719. [ 993] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental

RESOURCES