Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Jul 25.
Published in final edited form as: Ann Appl Stat. 2015 Dec;9(4):1889–1905. doi: 10.1214/15-AOAS872

Space-Time Smoothing of Complex Survey Data: Small Area Estimation for Child Mortality

Laina D Mercer 1, Jon Wakefield 1,2, Athena Pantazis 3, Angelina M Lutambi 4, Honorati Masanja 4, Samuel Clark 3,5,6,7,8
PMCID: PMC4959836  NIHMSID: NIHMS802695  PMID: 27468328

Abstract

Many people living in low and middle-income countries are not covered by civil registration and vital statistics systems. Consequently, a wide variety of other types of data including many household sample surveys are used to estimate health and population indicators. In this paper we combine data from sample surveys and demographic surveillance systems to produce small area estimates of child mortality through time. Small area estimates are necessary to understand geographical heterogeneity in health indicators when full-coverage vital statistics are not available. For this endeavor spatio-temporal smoothing is beneficial to alleviate problems of data sparsity. The use of conventional hierarchical models requires careful thought since the survey weights may need to be considered to alleviate bias due to non-random sampling and non-response. The application that motivated this work is estimation of child mortality rates in five-year time intervals in regions of Tanzania. Data come from Demographic and Health Surveys conducted over the period 1991–2010 and two demographic surveillance system sites. We derive a variance estimator of under five years child mortality that accounts for the complex survey weighting. For our application, the hierarchical models we consider include random effects for area, time and survey and we compare models using a variety of measures including the conditional predictive ordinate (CPO). The method we propose is implemented via the fast and accurate integrated nested Laplace approximation (INLA).

Keywords and phrases: Bayesian smoothing, Infant mortality, Small area estimation, Survey sampling

1. Introduction

Over the past fifteen years the United Nations' (UN) Millennium Development Goals (MDGs) (UN, 2000) have focused the world's attention on improving key indicators of development, health and wellbeing. The requirement to monitor progress toward the MDGs has revealed a stunning absence of data with which to measure and monitor key indicators related to the MDGs in much of the developing world, and this has led to great interest in improving both the data and our ability to use it. In 2015 the UN and its partners are taking stock of experience with the MDGs and coordinating the establishment of a new set of global goals (UN, 2014d) – the Sustainable Development Goals (SDGs) (UN, 2014e). Even before the SDGs are finalized the UN Secretary General has called for a Data Revolution for Sustainable Development and appointed a high level advisory group to define what it should be (UN, 2014b). The aim is clear: to rapidly improve the coverage, quality, availability and timeliness of the data used to measure and monitor progress toward the SDGs. Simultaneously there is sustained, strong interest in improving civil registration, vital statistics (CRVS) and the functioning of statistical offices across the developing world (World Bank and World Health Organization, 2014; Paris21, 2014). The key challenges are improving coverage (UN, 2014a) and timeliness of reporting.

In this context of far-reaching interest in improving data and methods available to monitor indicators of the SDGs and improve CRVS, in this paper we develop a general approach that combines data from different sources and provides temporal, subnational-specific estimates with uncertainty that accounts for the different designs of the data collection schemes. We demonstrate the method by calculating spatio-temporal estimates of child mortality in Tanzania using data from multiple Demographic and Health Surveys (DHS) (USAID, 2014) and two health and demographic surveillance system (HDSS) sites (INDEPTH Network, 2014).

Reducing child mortality is MDG 4 (UN, 2014c), and over the past fifteen years a great deal of effort and resources have been spent in order to meet MDG 4 targets at the national level in many developing nations. This has driven work to develop better methods to estimate trends in child mortality at the national level, and two groups have produced globally comparable trends in child mortality for all nations. The United Nations Inter-agency Group for Child Mortality Estimation (UN IGME) recently developed a Bayesian B-spline Bias-reduction (B3) method (Alkema et al., 2014a,b), and the Institute for Health Metrics and Evaluation (IHME) uses a Gaussian process regression (Wang et al., 2014). All of these methods produce national estimates through time with measures of uncertainty. None are designed to reveal variation in child mortality within countries. A recent paper by Dwyer-Lindgren et al. (2014) compared many Bayesian space-time smoothing models to produced sub-national estimates of U5MR for Zambia. The major methodological limitation of this approach is that it does not incorporate area-specific sampling variability at the first stage of analysis, which we show can be quite variable for small areas.

In this paper we combine data from multiple surveys with different sampling designs, and construct subnational estimates through time with uncertainty that reflects the various data collection schemes. Data come from traditional cluster sample surveys (DHS) and two HDSS sites. HDSS sites intensively monitor everyone within a given area, typically to monitor the effects of health intervention trials of various types. Estimates of child mortality from both sources of data are useful but potentially flawed in different ways. National cluster sample surveys are generally not able to produce useful subnational estimates, and HDSS sites are not designed to be nationally representative, and are also thought to fall prey to the Hawthorne effect by which the communities of these sites have improved health outcomes because they are under observation and, more concretely, because of the trials being conducted.

We construct subnational estimates of Tanzanian child mortality through time with uncertainty intervals. This problem is challenging because in addition to requiring smoothing over space and time, we must also account for the survey design. When sampling is not simple and random and the design variables (upon which sampling was based) are not available, the complex sampling design is accounted for by constructing design weights. Inference is then carried out using design-based inference, e.g. using Horvitz-Thompson estimators (Horvitz and Thompson, 1952). In contrast, a conventional space-time random effects framework, for example, Knorr-Held (2000), is model-based, and requires an explicit likelihood to be specified. In this paper, we marry these two approaches by constructing a working likelihood based on the asymptotic distribution of a design-based estimator and then smooth using a space-time-survey hierarchical prior.

The organization of this paper is as follows. In Section 2 we describe the two data sources upon which estimation will be based. In Section 3 the calculation of child mortality estimates with an appropriate standard error is described using discrete time survival models. Hierarchical Bayesian space-time models are introduced in Section 4. The results of our modeling efforts of under five mortality rates (U5MR) within Tanzania from 1980–2010 are given in Section 5 and discussed in Section 6.

2. Data Sources

We focus on child mortality using data from five Tanzanian Demographic and Health Surveys (TDHS), one Tanzania HIV and Malaria Indicator survey (THMIS), and two health and demographic surveillance system (HDSS) sites in Tanzania, Ifakara and Rufiji. Over the period 1980–2010 estimates of child mortality from the two types of data sources (surveys, surveillance sites) are generally similar but, as described above, different in useful ways. The HDSS estimates are accurate (low bias) and precise (small variance) measurements for comparatively small, geographically-defined populations, and the household survey estimates are less accurate and much less precise but representative of large populations.

2.1. Health and Demographic Surveillance System

The Ifakara Health Institute (IHI), Tanzania runs a number of health and population research projects including two HDSS sites – Ifakara and Rufiji. We collaborated with IHI to estimate child mortality using data from the Ifakara and Rufiji HDSS sites.

The HDSS data are generated through repeated household visits. For the data we use, each household was visited three times per year at regular intervals. During each visit a ‘household roster’ was updated and all new vital and migration events for all members of the household were recorded. In addition, potentially many other questions were asked as part of both routine and ‘add-on’ studies. For our purposes we require only the basic core HDSS data that include information on dates of birth, death and migration – the information necessary to accurately identify observed person time, categorize that time by calendar period and age, and identify the outcome of interest, death. The Ifakara and Rufigi HDSS sites contribute data to the Morogoro and Pwani regions of Tanzania, respectively.

2.2. Household Surveys

Full TDHS surveys that collected data necessary for child mortality estimates were conducted in Tanzania in 2010, 2004–05, 1999, 1996, and 1991–92, in addition to the THMIS that included child mortality which was conducted in 2007–08. The 2010 TDHS, 2007–08 THMIS and 2004–05 TDHS surveys used 2-stage cluster samples. First, enumeration areas were sampled from the 2002 Tanzania census and second, a systematic sampling of households within each enumeration area was carried out. The 1999 TDHS, 1996 TDHS and 1991–92 TDHS used a 3-stage cluster design, first selecting wards and branches using the 1988 Tanzania Census as a sampling frame, second using probability proportional to size sampling to select enumeration areas from each selected ward or branch, and third selecting households from a new list of all households in each selected enumeration area. The same first and second stage units were used for all three of the surveys. For al surveys stratification by urban/rural and region was done at the first stage, with oversampling of Dar es Salaam and other urban areas. The surveys were designed to be nationally representative and to be able to provide estimates of contraceptive prevalence at the regional level. All six household surveys contributed observations to the 21 mainland regions of Tanzania.

All women age 15 to 49 who slept in the household the night before were interviewed in each selected household and response rates were high (above 95% for households in all surveys). TDHS provides sampling (design) weights, assigned to each individual in the dataset. Limited information is provided for each survey concerning the calculation of survey weights, but the general explanation indicates that raw survey weights are the inverse of the product of the 2–3 probabilities of selection from each stage. These raw weights were then adjusted to reflect household response and individual response rates. The 1991–92 Tanzania DHS final report Demographic and Health Surveys (1992) indicates that “final individual weights were calculated by normalizing them for each area so that the total number of weighted cases equals the total number of unweighted cases”, but this normalization is not discussed in later reports (Demographic and Health Surveys, 1997, 2000, 2005, 2010) or the DHS statistics manual (Rutstein and Rojas, 2006). For the purposes of our analysis of child mortality, children identified by the women who were interviewed contributed exposure time and deaths. The data were organized into child-months from birth to either death or date of the mother's interview.

3. Calculating child mortality with Discrete Time Survival Models

We modeled child mortality using discrete time survival analysis (DTSA) (Allison, 1984; Jenkins, 1995). Our main aim is to examine the change in risk as a function of age and historical period. DTSA allows us to easily estimate the predicted probabilities which can be used directly in traditional mortality analysis methods such as life tables, in our case to calculate U5MR. We wish to estimate the U5MR and define nqx = Pr(dying before x + n | lived until x) and the discrete hazards model splits the [0,5) period into J intervals [x1, x2), [x2, x3), …, [xJ, xJ+1), where xj+1 = xj + nj so that nj is the length of the interval beginning at xj, j = 1, …, J. Then U5MR is calculated as

q05=1j=1J(1njqxj). (3.1)

For our purposes, 5q0 is calculated by dividing the first 60 months into six intervals (J = 6), [0, 1), [1, 12), [12, 24), [24, 36), [36, 48), [48, 60) with (x1, …, x6) = (0, 1, 12, 24, 36, 48) and (n1, …, n6) = (1, 11, 12, 12, 12, 12). Data were organized as child-months where each child was at risk during each month observed from birth up to and including the month of their death. The observed data consist of, for each birth, a binary sequence up to length 60 with 0/1 corresponding to survival/death. For example, a child that died in their fourth month would contribute one child-month to the first age category and three to the second age category. The first three child-months would be assigned a 0 outcome and the final month would be assigned a 1.

We use logistic regression to estimate the monthly probability of dying conditional on the state of the child at the beginning of the month. The monthly probability of death for each interval, 1qx, is the probability of dying in [x, x + 1) given that x ∈ [xj, xj + nj) and can be estimated using a logistic generalized linear model (GLM) with J factors for age intervals, logit (1qx) = βj for x ∈ [xj, xj + nj). A more detailed discussion of DTSA method can be found in Clark et al. (2013).

In the complex survey context that is relevant for the Tanzanian household surveys, an important consideration is that the design weights must be acknowledged. This is achieved by solving a (design) weighted score statistic (Binder, 1983) resulting in estimates of the finite population parameter B = [B1, …, BJ] see details in the supplementary materials (Mercer et al., 2015). Once j are estimated we can calculate qx1^=exp(B^j)/[1+exp(B^j)] for xj ∈ [xj, xj + nj). The complement of surviving each month of the interval [xj, xj + nj) is used to calculate q^njxj=1(1qx1^)nj which may be substituted into (3.1) to give 50 (for additional details see the Supplementary Materials).

In Section 4 we will construct, for a generic U5MR, a working likelihood based on the asymptotic distribution

y=logit(q^05)~N(η,V^DES)

where η = log[5q0/(1 − 5 q0)] and DES is the asymptotic (design-based) variance estimate of logit (50), which is obtained via the delta method, the Supplementary Materials contain details of this calculation and a simulation study which investigates the asymptotic properties of the variance estimate compared with the jackknife variance estimate (Lohr, 2009, ch. 9) that is often used in the in the context of child mortality estimates (Pedersen and Liu, 2012).

Simulation results were much as one would expect from clustered sampling, coverage improves when there are more clusters and within a given number of clusters there is little gain in precision when increasing the sample size. Generally the performance of the delta method and jackknife intervals is very similar. We prefer the delta method as it is generally applicable (i.e., to a variety of designs) and has a far smaller computational burden. We conclude that the asymptotic normal sampling distribution and the delta method variance result in sufficiently accurate confidence interval coverage for the cluster and sample sizes considered in our application. Consequently, we will use the asymptotic distribution with the delta method variance as a working likelihood.

4. Combining Data Sources in Hierarchical Bayesian Space-Time Model

4.1. The First Stage

Let 50its represent the estimate of U5MR from survey s in region i and in period t. A model-based approach to inference with survey data may be carried out if the design variables upon which sampling were based, and associated population totals, are available (Gelman, 2007). Unfortunately, these variables are not available for the Tanzania surveys. As an alternative we summarize the data in area i at time point t from survey s via the asymptotic distribution of the estimator of the pseudo–maximum likelihood estimator (MLE):

yits=log[q^0its51q^0its5].

We define the area, period and survey summary as ηits = log [5q0its/(1 − 5q0its)]. We take as working likelihood the asymptotic distribution

yits|ηits~N(ηits,V^DES,its) (4.1)

which has been shown to perform well in the context of small area estimation from complex surveys (Mercer et al., 2014). Dwyer-Lindgren et al. (2014) also used the pseudo–MLE, but did not incorporate design effects and instead assumed a common variance across all observations. However, Figure 13 from the Supplementary Materials shows that the variance of the five year direct estimates can vary significantly by survey and region.

4.2. Second Stage Smoothing Models

We wish to smooth over time period, region and surveys, but would like as parsimonious a model as possible, to avoid overfitting. At the second stage of our model we adopt a model similar to the ‘Type I’ inseparable space-time model of Knorr-Held (2000). However, unlike Knorr-Held (2000) our data provides multiple observations for each area i and time point t through the THMIS, five TDHS and two HDSS, denoted as surveys s. Thus we consider models that allow the option of survey-specific effects. The survey effects could be constant over time and space, could vary with time, vary with space, or vary by time and space.

The six candidate models we consider are given in Table 1, with the caption containing the random effects specification. There are two temporal terms with αt being independent and identically distributed random effects that pick up short-term fluctuations with no structure, and γt being given an (intrinsic) random walk prior of order 1 or 2 (models type ‘a’ or ‘b’), to pick up local temporal smooth fluctuations, for t = 1, …, T = 6 time periods. Five-year time periods were chosen because survey-specific regional sample sizes can be quite small. The UN IGME has only recently moved to annual estimates at the national-level because the sample size of recent DHS has increased (Pedersen and Liu, 2012). We are combining recent and older DHS at a regional-level and thus sample sizes are not sufficiently large to produce reliable annual estimates.

Table 1.

Random effects models for time period t, region i and survey s. In all models μ is the intercept and αt~iidN(0,σα2),θi~iidN(0,σθ2),ϕi~ICAR(σϕ2),δit~iidN(0,σδ2). Specific models contain random effects with distributions νs~iidN(0,σν12),νis~iidN(0,σν22),νts~iidN(0,σν32),νits~iidN(0,σν42). In the ‘a’ models γt~RW1(σγ2) and in the ‘b’ models γt~RW2(σγ2).

Model Linear Predictor ηits

I μ + αt + γt + θi + ϕi + δit
II μ + αt + γt + θi + ϕi + δit + νs
III μ + αt + γt + θi + ϕi + δit + νs + νis
IV μ + αt + γt + θi + ϕi + δit + νs + νts
V μ + αt + γt + θi + ϕi + δit + νs + νts + νis
VI μ + αt + γt + θi + ϕi + δit + νs + νts + νis + νits

There are also two spatial terms, corresponding to the convolution model of Besag et al. (1991). The independent random effects are denoted θi and the intrinsic conditional autoregressive (ICAR) terms are ϕi for i = 1, …, I = 21 regions of Tanzania. The latter perform local geographical smoothing. The space-time interaction terms δit are taken to be independent, which corresponds to the Type I interaction model of Knorr-Held (2000). Type II-IV interaction models were considered, which include spatial and/or temporal structure on the prior for δit, but these models didn't not substantially modify estimates, so Type I was selected for parsimony.

There are S = 8 different surveys that are carried out over the various time periods (since mothers are surveyed on their complete birth history and so report on births from previous time periods), the five TDHS and THMIS surveys cover all 21 regions over the different time periods they were administered and the HDSS sites contribute data for one region each in the last three time periods. The independent random effects νs allow for these surveys to have a systematic displacement from the true logit of U5MR. The interactions νts and νis allow these displacements to vary with period and space, respectively, while νits allow the complete interaction between survey, period and area. Model I contains crossed random effects only, since each area is represented in each of the time periods. Models II–VI contain a combination of nested and crossed random effects. The random walk and ICAR models are described in Rue and Held (2005).

4.3. Hyperpriors

For a generic set of independent random effects we specify priors on the precision τ such that a 95% prior interval for the residual odds ratios lies in the interval [0.5,2] which leads to Gamma(aMARG, bMARG) priors for precisions (Wakefield, 2009) with aMARG = 0.5, bMARG = 0.001488. For the RW1, RW2 and ICAR models the precisions have conditional rather than marginal interpretations. Let z represent a random effect from a improper GMRF with “mean” 0 and “precision” τQ. Following the supplementary materials of Fong et al. (2010), we gain compatibility by calculating an approximate measure of the average marginal “variance” of z in the situation with τ = 1; call this average c. Then to put on the same scale we take aCOND = aMARG and bCOND = bMARG/c. In the above description, the words mean, precision, variance are written in italics to acknowledge that strictly speaking these quantities do not exist since the distribution is improper. However, one may calculate a generalized inverse using the equation given at the end of Section 4.4 of Fong et al. (2010). This method is closely related to that later described by Sørbye and Rue (2014). The Supplementary Materials contain R code for reproducing these prior specifications. For the Tanzania data this leads to gamma priors for the RW1 of τγ ∼ Gamma(0.5, 0.00153), for the RW2 of τγ ∼ Gamma(0.5, 0.00286), and for the ICAR of τϕ ∼ Gamma(0.5,0.00360).

4.4. Computation

Model fitting was carried out within the R computing environment. Weighted logistic regressions were fit using the svyglm() function from the survey package (Lumley, 2004) from which the design-based variance was extracted (see Supplementary Materials for further details). The hierarchical Bayesian space-time models were fitted using the Integrated Nested Laplace Approximation (INLA) (Rue et al., 2009) as implemented in the INLA package. INLA provides a fast alternative to MCMC for approximating the marginal posterior distributions of Markov random field (MRF) models. There is now extensive evidence that the approximations are accurate for space-time modeling, see for example Fong et al. (2010), Held et al. (2010), and Schrödle and Held (2011).

4.5. Model Selection

In Table 1 we describe twelve plausible random effects specifications (allowing for RW1 or RW2 models). A number of approaches have been described for comparing models, including the conditional predictive ordinate (CPO), the deviance information criteria (DIC) as introduced by Spiegelhalter et al. (2002) and the normalizing constants p(y|M) for the twelve models indexed by M. Let yits represent the vector of data with the observation from region i, time period t and survey s removed. The idea behind the CPO is to predict the density ordinate of the left-out observation, based on those that remain. Specifically, the CPO for observation i, t, s is defined as:

CPOits=p(yits|yits)=p(yits|θ)p(θ|yits)dθ=Eθ|yits[p(yits|θ)]

where θ represents the totality of parameters and in the U5MR setting the distribution of yits|θ is N(ηits, DES,its). The CPOs can be used to look at local fit, or one can define an overall score for each model:

LCPO=log(CPO)=i=1It=1Ts=1SlogCPOits,

and good models will have relatively high values of LCPO. Held et al. (2010) discuss shortcuts for computation (i.e. avoidance of fitting the model I×T×S times) using INLA.

We also calculate another widely-used model comparison measure, the deviance information criteria, or DIC (Spiegelhalter et al., 2002). To define the DIC with respect to a generic set of parameters θ, first define an ‘effective number of parameters’ as

pD=Eθ|y{2log[p(y|θ)]}+2log[p(y|θ¯)]=D¯+D(θ¯)

where D is the deviance, θ̄ = E[θ|y] is the posterior mean, D(θ̄) is the deviance evaluated at the posterior mean and = E[D|y]. The DIC is given by

DIC=D(θ¯)+2pD=D¯+pD,

so that we have the sum of a measure of goodness of fit and model complexity. We are wary of interpretation of DIC in our setting, since Plummer (2008) has shown that DIC is prone to inappropriately under-penalize large models such as the ones we are fitting, see also Spiegelhalter et al. (2014).

5. Applying Methods to household surveys and HDSS sites in Tanzania

We fit Models Ia–VIb (as summarized in Table 1) to the Tanzania survey data and Table 2 provides the summaries of various model comparison summaries. Model Vb is the favored model according to both the DIC, LCPO, and log of the normalizing constant criterion. Results for Models Vb and VIb are very similar, but we see from the effective number of parameters that even though the number of 3-way interaction random effects is 573, there are only 13 effective parameters due to the closeness of the interactions to zero. Hence, from this point onwards we shall report summaries with respect to Model Vb. We begin by summarizing the posterior distribution, and then describe regional trends.

Table 2.

Model comparison: pD is the effective degrees of freedom, as defined for the calculation of the deviance information criteria (DIC), which also uses the deviance evaluated at the posterior mean, ; LCPO is defined as itslog(CPOits). In the ‘a’ models γt~RW1(σγ2) and in the ‘b’ models γt~RW2(σγ2).

Model No Pars log p(y) pD DIC LCPO
Ia 181 -310.9 74.5 409.3 483.8 -294.5
IIa 189 -304.6 80.1 384.2 464.3 -287.3
IIIa 313 -257.6 118.9 221.8 340.7 -193.5
IVa 223 -302.2 88.6 367.5 456.2 -283.4
Va 347 -254.6 121.8 210.1 332.0 -183.1
VIa 920 -255.0 134.5 199.4 334.0 -183.9

Ib 181 -308.2 74.2 409.1 483.3 -293.7
IIb 189 -301.9 79.8 383.9 463.7 -286.4
IIIb 313 -255.0 118.6 221.7 340.3 -192.9
IVb 223 -299.5 88.2 367.4 455.6 -282.5
Vb 347 -252.1 121.6 209.9 331.5 -183.1
VIb 920 -252.5 133.3 200.2 333.4 -183.4

5.1. Summarizing the Posterior Distribution

Table 3 provides numerical summaries and the proportion of total variation explained by each random effect. The total variance is

Table 3.

Summaries of variance components. The proportion of variation is calculated as the contribution the relevant set of random effects makes to the total variation. In the case of the RW2 and ICAR models, the relevant contribution is evaluated empirically, since the variance parameter is conditional rather than marginal.

Variance Interpretation Median (95% Interval) Percentage Variation

σα2
Indept Time 0.002 (0.001, 0.012) 1.3
σγ2
RW2 Time 0.009 (0.002, 0.054) 46.0
σθ2
Indept Space 0.068 (0.033, 0.133) 31.3
σϕ2
ICAR Space 0.017 (0.002, 0.378) 4.9
σδ2
Indept Space-Time Interaction 0.005 (0.001, 0.013) 2.3
σνs2
Indept Survey 0.002 (0.001, 0.013) 1.4
σνst2
Indept Survey-Time Interaction 0.004 (0.001, 0.011) 2.0
σνsi2
Indept Survey-Space Interaction 0.024 (0.015, 0.038) 10.9
σα2+sγ2+σθ2+sϕ2+σδ2+σνs2+σνsi2+σνst2

where sγ2 and sϕ2 are empirical estimates of the marginal variances in the RW2 and ICAR models. The structured temporal and unstructured spatial random effects explain 77% of the total variation. Hence, there is strong temporal structure and large spatial heterogeneity, which we shall discuss subsequently. The third largest contribution to the variation is 11% from the survey-space interaction. Different survey teams are sent to different regions which explains to some extent this relatively large contribution.

5.2. Model Validation

To validate the model we removed all of the observations in area i for time point t and then generated 95% intervals around the posterior mean 50,it using the variance of the observed response, defined as Sits2=σit2+V^DES,its, where σit2 is the variance of the posterior distribution of logit(50,it) and DES,its is the design-based variance described in Section 3. This was completed for the 21 regions and 6 time points (figures shown in Supplementary Materials). Intervals contained the design-based estimates 92.5% of the time overall. Time/area-specific coverages range from 89.9– 96.9% and the coverage for the final time point is 93.2%.

5.3. Regional Estimates and Projections

For region i and 5 year period t, estimates, projections, and credible intervals of U5MR taken from posterior draws of

q0,it5=expit(μ+αt+γt+θi+ϕi+δit).

Figure 1 shows maps of the posterior median estimates of child mortality (per 1000 births) by region for the six observed 5 year time periods. Child mortality has decreased markedly over the 30 year period considered but overall more than 5% of infants still die before they turn 5, and there are strong regional differences. Figures 2 and 3 display the observed direct estimates and smoothed results for the Morgoro and Pwani regions, respectively. Additionally, each plot shows the projected U5MR for the 2010–2014 time period. The direct estimates have a great deal of variability between surveys, especially for the first four time points and design-based intervals are very wide. Smoothed rates and projections for all 21 regions are located in the Supplementary Materials.

Fig 1.

Fig 1

Smoothed regional estimates of child mortality (per 1000 births).

Fig 2.

Fig 2

Regional five-year direct and model-based smoothed of 5q0 in Pwani, TZA with 95% confidence intervals.

Fig 3.

Fig 3

Regional five-year direct and model-based smoothed estimates of 5q0 in Morogoro, TZA with 95% confidence intervals.

6. Discussion

We have described a general method for spatiotemporal smoothing of a health outcome, with the data arising from complex surveys and surveillance. The method was illustrated with child mortality in regions of Tanzania over 1980–2009 using data from household surveys and surveillance sites. A great advantage of the model is that there is a fast implementation within the R computing environment using the existing survey and INLA packages. The Supplementary Materials contain example code. As an example, fitting the most complex model for the Tanzania data took just 18.7 seconds on a Macbook Pro1.

In our hierarchical modeling approach we explicitly acknowledge the weights by taking as (pseudo-)likelihood the (design-based) sampling distribution of the estimator. In the Supplementary Materials we illustrate the effect of the weights on both the estimates and the standard errors. Another use of our model is for prediction, with the RW2 terms drawn from the relevant conditional distribution.

Our model contains a relatively complex combination of nested and crossed random effects and we described a particular approach to hyperprior selection. As with any such suggestion, it is beneficial to examine prior sensitivity, and the supplementary materials contain details of a sensitivity study that we performed for the Tanzania data.

An integral part of our method involves calculating and pooling estimates of child mortality from household surveys and demographic surveillance sites and allowing both to inform our overall estimates by region and for the country as a whole. A byproduct of this procedure is an ability to carefully compare the DHS-based and demographic surveillance-based estimates of child mortality in the regions that include HDSS sites. As Figures 2 and 3 makes clear, the central estimates from the two different data collection schemes are very similar. This adds more weight to similar findings by others (Byass et al., 2007; Fottrell et al., 2009; Hammer et al., 2006) and reduces concerns about the Hawthorne effect preventing measures of child mortality from HDSS sites from being more widely relevant, i.e. similar to surrounding populations.

Although we have demonstrated our method with a single country and outcome, it is sufficiently general to be applied to produce spatiotemporal estimates of a variety of indicators. Because this approach provides consistent, precise estimates across both time and space utilizing data from a variety of sources, including complex sample surveys, accounting for study designs, it should be considered as an approach for producing subnational estimates of child mortality and other key health, demographic and development indicators. However, countries with a substantial HIV/AIDs burden may suffer from underreporting biases. The UN IGME pre-processes data in a number of countries, including Tanzania, to take account of underreporting biases because of HIV/AIDS. We base our analysis on direct national estimates of U5MR, and so do not adjust for this bias, but our smoothed results do not differ substantially from the UN results at the national level and so we believe that any bias from this source will be small.

The world's rapidly growing appetite for timely, subnational estimates of key development indicators will continue to motivate innovative new developments in both data collection and analysis. In addition to providing a means to improve indicator estimates using different sources of data, our results also hint at the possibility of eventually creating integrated data collection and analysis schemes that build on existing infrastructure to yield some of the functionality of full-coverage CRVS. Clark et al. (2012) and Ye et al. (2012) begin to discuss ideas in this vein, e.g. how one might utilize both sample surveys and demographic surveillance to continuously provide indicators equivalent to what is normally produced by vital registration. The method and results we present in this paper encourage future development of those ideas.

Supplementary Material

Supplementary

Acknowledgments

This work was partially supported by USAID through the MEASURE DHS project. Partial support for this research came from a Shanahan Endowment Fellowship and a Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) training grant, T32 HD007543, to the Center for Studies in Demography & Ecology at the University of Washington and a 2R01CA095994-05A1 from the National Institutes of Health. Samuel Clark was partially supported by grant K01 HD057246 from the NICHD. We would like to acknowledge the hard work and commitment of the HDSS field teams and the data team and thank the residents in the surveillance areas for offering their time for interviews and sharing their valuable personal information over the years.

Footnotes

1

processor: 2.9 GHz Intel Core i7; memory: 8 GB 1,600 MHz DDR3.

References

  1. Alkema L, New JR, Pedersen J, You D, et al. Child mortality estimation 2013: an overview of updates in estimation methods by the united nations inter-agency group for child mortality estimation. PloS ONE. 2014a;9:e101112. doi: 10.1371/journal.pone.0101112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alkema L, New JR, et al. Global estimation of child mortality using a bayesian b-spline bias-reduction model. The Annals of Applied Statistics. 2014b;8(4):2122–2149. [Google Scholar]
  3. Allison P. Event History Analysis: Regression for Longitudinal Event Data. Number 46. Sage Publications Inc; 1984. [Google Scholar]
  4. Besag J, York J, Mollié A. Bayesian image restoration with two applications in spatial statistics. Annals of the Institute of Statistics and Mathematics. 1991;43:1–59. [Google Scholar]
  5. Binder D. On the variances of asymptotically normal estimators from complex surveys. International Statistical Review. 1983;51:279–292. [Google Scholar]
  6. Byass P, Worku A, Emmelin A, Berhane Y. Dss and dhs: longitudinal and cross-sectional viewpoints on child and adolescent mortality in ethiopia. Population Health Metrics. 2007;5(1):12. doi: 10.1186/1478-7954-5-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Clark SJ, Wakefield J, McCormick T, Michelle R. Technical Report. Vol. 118. Center for Statistics and the Social Sciences (CSSS), University of Washington; 2012. Hyak mortality monitoring system innovative sampling and estimation methods: Proof of concept by simulation. https://www.csss.washington.edu/Papers/wp118.pdf. [Google Scholar]
  8. Clark SJ, Kahn K, Houle B, Arteche A, Collinson MA, Tollman SM, Stein A. Young children's probability of dying before and after their mother's death: A rural South African population-based surveillance study. PLoS Medicine. 2013;10(3):e1001409. doi: 10.1371/journal.pmed.1001409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Demographic and Health Surveys. Demographic Health Survey 1991/1992. Bureau of Statistics Planning Commission; 1992. [Google Scholar]
  10. Demographic and Health Surveys. Tanzania Demographic and Health Survey 1996. Bureau of Statistics [Tanzania] and Macro International Inc; 1997. [Google Scholar]
  11. Demographic and Health Surveys. Tanzania Demographic and Health Survey 1999. Bureau of Statistics [Tanzania] and Macro International Inc; 2000. [Google Scholar]
  12. Demographic and Health Surveys. Tanzania Demographic and Health Survey 2004-05. National Bureau of Statistics (NBS) [Tanzania] and ORC Macro; 2005. [Google Scholar]
  13. Demographic and Health Surveys. Tanzania Demographic and Health Survey 2010. National Bureau of Statistics (NBS) [Tanzania] and ICF Macro; 2010. [Google Scholar]
  14. Dwyer-Lindgren L, Kakungu F, Hangoma P, Ng M, Wang H, Flaxman AD, Masiye F, Gakidou E. Estimation of district-level under-5 mortality in zambia using birth history data, 1980–2010. Spatial and spatio-temporal epidemiology. 2014;11:89–107. doi: 10.1016/j.sste.2014.09.002. [DOI] [PubMed] [Google Scholar]
  15. Fong Y, Rue H, Wakefield J. Bayesian inference for generalized linear mixed models. Biostatistics. 2010;11:397–412. doi: 10.1093/biostatistics/kxp053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Fottrell E, Enquselassie F, Byass P. The distribution and effects of child mortality risk factors in ethiopia: a comparison of estimates from dss and dhs. Ethiopian Journal of Health Development. 2009;23(2) [Google Scholar]
  17. Gelman A. Struggles with survey weighting and regression modeling. Statistical Science. 2007;22:153–164. [Google Scholar]
  18. Hammer GP, Kouyaté B, Ramroth H, Becher H. Risk factors for childhood mortality in sub-saharan africa: a comparison of data from a demographic and health survey and from a demographic surveillance system. Acta Tropica. 2006;98(3):212–218. doi: 10.1016/j.actatropica.2006.04.005. [DOI] [PubMed] [Google Scholar]
  19. Held L, Schrödle B, Rue H. Posterior and cross-validatory predictive checks: A comparison of MCMC and INLA. In: Kneib T, Tutz G, editors. Statistical Modeling and Regression Structures – Festschrift in Honour of Ludwig Fahrmeir. Physica-Verlag; 2010. pp. 91–110. [Google Scholar]
  20. Horvitz D, Thompson D. A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association. 1952;47:663–685. [Google Scholar]
  21. INDEPTH Network. Health and Demographic Surveillance Systems. 2014 http://www.indepth-network.org/index.php?option=com_content&task=view&id=1798&Itemid=501.
  22. Jenkins SP. Easy estimation methods for discrete-time duration models. Oxford Bulletin of Economics and Statistics. 1995;57(1):129–136. [Google Scholar]
  23. Knorr-Held L. Bayesian modelling of inseparable space-time variation in disease risk. Statistics in Medicine. 2000;19:2555–2567. doi: 10.1002/1097-0258(20000915/30)19:17/18<2555::aid-sim587>3.0.co;2-#. [DOI] [PubMed] [Google Scholar]
  24. Lohr S. Sampling: Design and Analysis. Cengage Learning; 2009. [Google Scholar]
  25. Lumley T. Analysis of complex survey samples. Journal of Statistical Software. 2004;9 [Google Scholar]
  26. Mercer L, Wakefield J, Chen C, Lumley T. A comparison of spatial smoothing methods for small area estimation with sampling weights. Spatial Statistics. 2014;8:69–85. doi: 10.1016/j.spasta.2013.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Mercer L, Wakefield J, Pantazis A, Lutambi A, Masanja H, Clark S. Supplement to “Space-time smoothing of complex survey data: Small area estimation for child mortality”. Annals of Applied Statistics. 2015 doi: 10.1214/15-AOAS872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Paris21. Paris21: Partnership for Statistics in Development in the 21 Century. 2014 http://www.paris21.org.
  29. Pedersen J, Liu J. Child mortality estimation: Appropriate time periods for child mortality estimates from full birth histories. PLoS Medicine. 2012;9(8) doi: 10.1371/journal.pmed.1001289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Plummer M. Penalized loss functions for Bayesian model comparison. Biostatistics. 2008;9:523–539. doi: 10.1093/biostatistics/kxm049. [DOI] [PubMed] [Google Scholar]
  31. Rue H, Held L. Gaussian Markov Random Fields: Theory and Application. Chapman and Hall/CRC Press; Boca Raton: 2005. [Google Scholar]
  32. Rue H, Martino S, Chopin N. Approximate Bayesian inference for latent Gaussian models using integrated nested Laplace approximations (with discussion) Journal of the Royal Statistical Society, Series B. 2009;71:319–392. [Google Scholar]
  33. Rutstein SO, Rojas G. Tanzania demographic and health survey 1996. Calverton, Maryland: ORC Macro; 2006. [Google Scholar]
  34. Schrödle B, Held L. Spatio-temporal disease mapping using inla. Environmetrics. 2011;22(6):725–734. [Google Scholar]
  35. Sørbye S, Rue H. Scaling intrinsic Gaussian Markov random field priors in spatial modelling. Spatial Statistics. 2014;8:39–51. [Google Scholar]
  36. Spiegelhalter D, Best N, Carlin B, Linde AVD. Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B. 2002;64:583–639. [Google Scholar]
  37. Spiegelhalter D, Best N, Carlin B, Linde AVD. The deviance information criterion: 12 years on (with discussion) Journal of the Royal Statistical Society: Series B. 2014;64:485–493. [Google Scholar]
  38. UN. Millennium Development Goals. 2000 http://www.un.org/millenniumgoals/
  39. UN. Civil Registration and Vital Statistics Coverage. 2014a http://unstats.un.org/unsd/demographic/CRVS/CR_coverage.htm.
  40. UN. Data Revolution for Sustainable Development. 2014b http://www.un.org/apps/news/story.asp?NewsID=48594#.VEVQpoctuvJ.
  41. UN. Millennium Development Goal number 4: Reduce by two thirds, between 1990 and 2015, the under-five mortality rate. 2014c http://www.un.org/millenniumgoals/childhealth.shtml.
  42. UN. The Post-2015 Development Agenda. 2014d http://www.post2015hlp.org/the-report/
  43. UN. Sustainable Development Goals. 2014e http://sustainabledevelopment.un.org/owg.html.
  44. USAID. Demographic and Health Surveys. United States Agency for International Development; 2014. http://www.dhsprogram.com. [Google Scholar]
  45. Wakefield J. Multi-level modelling, the ecologic fallacy, and hybrid study designs. International Journal of Epidemiology. 2009;38:330–336. doi: 10.1093/ije/dyp179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Wang H, Liddell CA, Coates MM, Mooney MD, Levitz CE, Schumacher AE, Apfel H, Iannarone M, Phillips B, Lofgren KT, et al. Global, regional, and national levels of neonatal, infant, and under-5 mortality during 1990–2013: a systematic analysis for the global burden of disease study 2013. The Lancet. 2014;384(9947):957–979. doi: 10.1016/S0140-6736(14)60497-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. World Bank and World Health Organization. Global Civil Registration and Vital Statistics Scaling Up Investment Plan 2015-2024. 2014 http://www.worldbank.org/en/topic/health/publication/global-civil-registration-vital-statistics-scaling-up-investment.
  48. Ye Y, Wamukoya M, Ezeh A, Emina JB, Sankoh O. Health and demographic surveillance systems: a step towards full civil registration and vital statistics system in sub-sahara africa? BMC Public Health. 2012;12(1):741. doi: 10.1186/1471-2458-12-741. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary

RESOURCES