Pairwise joint modeling of clustered and high-dimensional outcomes with covariate missingness in pediatric pneumonia care

Susan Gachau; Edmund Njeru Njagi; Geert Molenberghs; Nelson Owuor; Rachel Sarguta; Mike English; Philip Ayieko

doi:10.1002/pst.2197

Author manuscript; available in PMC: 2022 Sep 22.

Published in final edited form as: Pharm Stat. 2022 Feb 24;21(5):845–864. doi: 10.1002/pst.2197

PMCID: PMC7613603 EMSID: EMS152550 PMID: 35199938

Abstract

Multiple outcomes reflecting different aspects of routine care are a common phenomenon in health care research. A common approach to handling such outcomes is multiple univariate analyses, an approach which does not allow for answering research questions pertaining to joint inference. In this study, we sought to study associations among nine pediatric pneumonia care outcomes spanning assessment, diagnosis and treatment domains of care, while circumventing the computational challenge posed by their clustered and high-dimensional nature and incompletely recorded covariates. We analyzed data from a cluster randomized trial conducted in 12 Kenyan hospitals. There were varying degrees of missingness in the covariates of interest, and these were multiply imputed using latent normal joint modeling. We used the pairwise joint modeling strategy to fit a correlated random effects joint model for the nine outcomes. This entailed fitting 36 bivariate generalized linear mixed models and deriving inference for the joint model using pseudo-likelihood theory. We also analyzed the nine outcomes separately before and after multiple imputation. We observed joint effects of patient-, clinician- and hospital-level factors on pneumonia care indicators before and after multiple imputation of missing covariates. In both pairwise joint modeling and separate univariate analysis methods, enhanced audit and feedback improved documentation and adherence to recommended clinical guidelines over time in six and five pneumonia care indicators, respectively. Additionally, multiple imputation improved precision of parameter estimates compared to complete case analysis. The strength and direction of association among pneumonia outcomes varied within and across the three domains of pneumonia care.

Keywords: multiple imputation, pairwise joint modeling, pediatric care, pneumonia, pseudo-likelihood

1. Introduction

Multiple responses reflecting different aspects of patients’ care are a common phenomenon in routine care studies, investigating research questions such as the level of adherence to standard quality of care guidelines by clinicians in different health care facilities. Besides complexities associated with multiple outcomes spanning several domains of quality care, routine data are prone to missing information which can occur at patient-, clinician- and/or facility-level.

Despite measuring, for each patient, a correlated vector of response variables, inferences in most routine care studies are based on one primary outcome or multiple separate analyses.^1–3 Alternatively, the outcomes are combined into a single composite score,^4–7 to provide global trends and insight into the quality of patient care. While these approaches are relatively straightforward, some research questions require joint modeling of all outcomes simultaneously,^8,9 for instance, when the association among outcomes and joint effects of covariates on all outcomes are of primary research interest.^8–11

In principle, a joint model links two or more models, using random effects that capture association among outcomes of interest. Statistically, joint modeling has advantages over separate analyses of multiple outcomes. This includes efficiency gain and bias reduction, especially when data are missing at random (MAR) in some of the outcomes.^8,12–16 In addition, joint modeling allows for different types of models for the different outcomes¹⁷ (e.g., linear, non-linear, and generalized linear mixed models), while the interpretation of parameter estimates is the same as interpretation from the separate univariate models.¹³

Although joint models have been extended from the common bivariate to the multivariate cases,¹⁴ standard fitting procedures are difficult to implement with high-dimensional outcomes.^{8,14,16,18–21} The computational complexities stem from an increase in the number of parameters to be estimated, for every new outcome added into the joint model,⁸ and relatedly the increasing dimension of the random-effects vector.

To overcome these challenges, the shared random-effects model, which assumes that all outcomes share the same set of random effects, can be considered. In this case, the dimension of the random effects does not increase with the number of outcomes.^8,20 The price to pay is a sometimes restrictive, less realistic model,^8,16 for instance, when dealing with discrete outcomes (e.g., binomial and Poisson) that have a natural link between the mean and variance.

A plausible alternative is the pairwise joint modeling approach, which allows fitting of the correlated random-effects joint model, while circumventing the computational complexity associated with a full joint multivariate model.^8,9,11,14

As mentioned earlier, missing data in either outcomes or covariates is a common problem in routine data. Although joint modeling can be used to mitigate the effect of missing data among outcomes, appropriate strategies for handling missing covariates in high-dimensional joint modeling are hardly addressed in the literature. For instance, a previous joint modeling study reported deletion of case records with missing covariates to alleviate computational challenges.²² The repercussions of suboptimal missing data handling techniques include risk of biased and inefficient estimates, hence misleading inferences.²³

In the present study, we sought to jointly analyze nine binary outcomes, at the same time accounting for covariate missingness in a pediatric routine data set, from a cluster randomized trial conducted in Kenyan hospitals. Specifically, we used multiple imputation, based on the joint modeling (JM) framework to address missing covariates across two levels of the hierarchy. Thereafter, we used the pairwise approach within the pseudo-likelihood framework to estimate the joint effects of covariates on outcomes. This was in addition to estimating the strength of association among nine pneumonia outcomes. Besides joint modeling, we analyzed the nine binary outcomes separately under complete case analysis and after multiple imputation of missing covariates.

The remainder of this article is organized as follows. Section 2 introduces the joint modeling approach using mixed models and the pairwise fitting approach. Section 3 introduces the pneumonia trial data, while Section 4 presents the multilevel multiple imputation model, the univariate random-effects model and the pairwise joint model for the pneumonia trial data set. Results under complete case analysis and after multiple imputation are presented in Section 5, and we conclude with a discussion in Section 6.

2. Correlated Random-Effects Joint Model

Let Y_rij denote the r^th (r = 1,2,…,p) outcome for the i^th (i = 1,2, …,N) subject in cluster j (j = 1,2, …, n_i). The corresponding univariate random effects model for the r^th outcome can be defined as

h^{- 1} (E (Y_{r i j} | b_{ri}, X_{rij}, Z_{rij})) = {X^{'}}_{r i j} β_{r} + {Z^{'}}_{r i j} b_{r i},

(1)

where h^–1(·) is an appropriate link function depending on the type of outcome (i.e., whether continuous, binary, count, etc.),¹² X_rij is a vector of known covariates with fixed effects β_r, and Z_rij is a vector of covariates with random effects b_ri. The univariate random effects model can be extended to jointly model all the outcomes simultaneously, by imposing a joint multivariate distribution on the random effects.^14,15,17 Moreover, the number of random effects can vary among the outcomes of interest. Conditional on the vector of random effects (b_ri), the outcomes are assumed to be independent⁸ and the corresponding log-likelihood contribution for subject i equals

l_{i} (y_{1 i}, y_{2 i}, \dots, y_{p i} | Θ^{*}) = ln \int \prod_{r = 1}^{p} f_{r i} (y_{r i} | b_{ri}, θ_{r}) f (b_{i} | D) d b_{i} .

(2)

The vector Θ* contains all parameters of the full joint model (i.e., fixed parameters denoted by β* and covariance parameters denoted by Σ*), while f _ri(y_ri|b_ri,θ_r) is the density of y_ri conditional on the random effects for the r^th outcome on subject i. The vector of random-effects b_i is assumed to follow a multivariate normal distribution with mean zero and covariance matrix D, that is,

b_{i} = (\begin{matrix} b_{1 i} \\ b_{2 i} \\ ⋮ \\ b_{p i} \end{matrix}) \sim N [(\begin{matrix} 0 \\ 0 \\ ⋮ \\ 0 \end{matrix}), (\begin{matrix} D_{11} & D_{12} & \dots & D_{1 p} \\ D_{21} & D_{22} & \dots & D_{2 p} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ D_{p 1} & D_{p 2} & \dots & D_{p p} \end{matrix})] .

The elements D_rs in D correspond to blocks of random effects variance–covariance between the r^th and the s^th outcomes (r, s = 1,2,…,p). For example, assuming that each outcome has a random intercept (b₀) and a random slope (b₁), then D_rs is given by

D_{rs} = [\begin{matrix} σ_{b_{0r}}^{2} & σ_{b_{0r} b_{1r}} & σ_{b_{0r} b_{0s}} & σ_{b_{0r} b_{1s}} \\ & σ_{b_{1r}}^{2} & σ_{b_{1r} b_{0s}} & σ_{b_{1r} b_{1s}} \\ & & σ_{b_{0s}}^{2} & σ_{b_{0s} b_{1s}} \\ & & & σ_{b_{1s}}^{2} \end{matrix}] .

The elements of the variance–covariance matrix D can be used to measure the strength of association between any two outcomes of interest. As mentioned earlier, the dimension of the random effects vector b_i in the full joint model increases with the number of outcomes. This leads to computational challenges for high-dimensional vectors of outcomes.^8,10,14
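To make this growth concrete, the following minimal sketch counts the random effects and the unique variance–covariance parameters in D. It assumes that every outcome carries the same number of random effects; the function name is ours and purely illustrative.

```python
def joint_model_dimensions(n_outcomes, n_random_effects_per_outcome):
    """Return (dimension of b_i, number of unique elements in D).

    Assumes each outcome has the same number of random effects.
    """
    dim = n_outcomes * n_random_effects_per_outcome
    # A symmetric dim x dim matrix has dim * (dim + 1) / 2 unique entries.
    n_cov_params = dim * (dim + 1) // 2
    return dim, n_cov_params


# Random intercept only versus random intercept plus random slope.
for p in (2, 5, 9):
    print(p, joint_model_dimensions(p, 1), joint_model_dimensions(p, 2))
```

With nine outcomes and a single random intercept each, D already contains 45 unique elements, whereas a bivariate model with random intercepts needs only three.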

2.1. The pairwise modeling approach

In light of computational challenges highlighted above, Fieuws and Verbeke¹⁴ proposed a pairwise approach within the pseudo-likelihood framework to handle high-dimensional vectors of outcomes. With a vector of p outcomes, the pairwise approach maximizes the likelihood for all Q = p(p – 1)/2 pairwise models separately, instead of maximizing the full joint multivariate likelihood.^14,24 Precisely, this produces a so-called pseudo-likelihood (pl) of the following form:

pl(Θ) = l(Y_{1}, Y_{2} | Θ_{12}) \, l(Y_{1}, Y_{3} | Θ_{13}) \cdots l(Y_{p-1}, Y_{p} | Θ_{p-1,p}) = \prod_{r=1}^{p-1} \prod_{s=r+1}^{p} l(Y_{r}, Y_{s} | Θ_{rs}) .

(3)

For a given pair of responses (r, s = 1, 2, …, p), l(Y_r, Y_s|Θ_rs) denotes the likelihood, while Θ_rs is the vector of all parameters encountered in a pairwise joint model.¹⁴ The corresponding pseudo-log likelihood function (pll) is given by

\begin{aligned} pll(Θ) &= \sum_{r=1}^{p-1} \sum_{s=r+1}^{p} ll(Y_{r}, Y_{s} | Θ_{rs}) \\ &= \sum_{q=1}^{Q} ll(Y_{q} | Θ_{q}), \end{aligned}

where Y_q and Θ_q contain all the observations and parameters, respectively, in the q^th response pair (q = 1,2,…, Q). All Q pair-specific parameter vectors Θ_q (q = 1,2,…, Q) are stacked together into Θ with fixed parameters denoted by β. It is clear that if ${\hat{Θ}}_{q}$ maximizes l(Y_q|Θ_q), then $\hat{Θ}$ , the stacked vector combining all ${\hat{Θ}}_{q}$ , maximizes pll(Θ).²⁴ The asymptotic distribution of $\hat{Θ}$ is multivariate normal given by

\sqrt{N} (\hat{Θ} - Θ) \sim MVN(0, H^{-1} G H^{-1}),

(4)

where H⁻¹GH⁻¹ is a sandwich estimator and H and G are based on cluster-wise Hessians and gradients of the log-pseudo-likelihood function, respectively.^8,10,18,24 The vector of all parameters in the full joint model (Θ*) and stacked vector from pairwise models (Θ) are not equivalent. Specifically, some parameters in Θ* have a single counterpart in Θ, while other elements in Θ* have multiple counterparts in Θ.⁸ A set of fixed effects (β*), for the full joint model, are obtained by averaging duplicate parameter estimates from the pairwise joint models.^8,14 This can be achieved by multiplying the stacked vector of regression parameters (β) with an appropriate weight matrix A as below

β^{*} = A β .

(5)

The standard errors follow as the square root of diagonal elements of variance–covariance estimator

Σ^{*} = A (H^{-1} G H^{-1}) A^{T} .

(6)

Further details on estimation of fixed effects and corresponding standard errors are presented in the application section.
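As an illustration of the bookkeeping behind the pairwise approach, the sketch below enumerates the Q = p(p – 1)/2 outcome pairs and counts how often each outcome occurs. The function name and the 1-based outcome labels are our own choices, not part of any package.

```python
from itertools import combinations


def outcome_pairs(p):
    """Return all unordered pairs (r, s) with 1 <= r < s <= p."""
    return list(combinations(range(1, p + 1), 2))


pairs = outcome_pairs(9)
print(len(pairs))  # Q = 9 * 8 / 2 = 36 bivariate models

# Each outcome occurs in p - 1 pairs, so its parameters are estimated p - 1 times.
occurrences = {r: sum(r in pair for pair in pairs) for r in range(1, 10)}
print(occurrences)
```

Because every outcome appears in p – 1 pairs, its fixed-effect estimates are duplicated p – 1 times in the stacked vector, which is what the weight matrix A averages over.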

3. Pneumonia Trial Data

In this study, we analyzed routine pediatric data collected in a cluster randomized trial in 12 Kenyan hospitals between March and November 2016.^2,25 The trial’s objective was to investigate the level of uptake of pediatric pneumonia treatment guidelines recommended by the World Health Organization (WHO) in 2013.²⁶ Details on the trial are contained in the trial report.² In brief, hospitals were randomly allocated to the intervention arm or control arm. Six hospitals in the intervention arm received an enhanced monthly audit and feedback (A&F) report on assessment, diagnosis and treatment of pneumonia cases, together with a bi-monthly standard A&F report assessing performance and adherence to general inpatient pediatric care guidelines at facility level. Besides A&F reports, the trial intervention package contained network intervention strategies such as peer learning among clinicians across study facilities, workshops and follow-up emails and phone calls by the trial pediatrician. On the other hand, six control hospitals received a bi-monthly standard A&F report and network intervention strategies.^2,25

During the trial period, 2299 children aged 2 to 59 months were admitted in general pediatric wards with childhood pneumonia in 12 study hospitals. However, this analysis excluded 172/2299 (7.5%) case records lacking admitting clinician’s information. The remaining 2127/2299 (92.5%) patients were admitted by 378 clinicians. Of the 2127 pneumonia cases, 953 (44.8%) were admitted to six intervention hospitals. On average, there were 32 clinicians per hospital, and the number of patients per clinician ranged between 3 and 46. Data were extracted by trained data clerks from pediatric admission record (PAR) (a structured paper based medical record/form used in pediatric wards in CIN hospitals) after discharge from hospital. The data were entered into an open source data capture tool (Research Electronic Data Capture, [REDCap])²⁷ using a standard operating procedure manual.

3.1. Pediatric pneumonia care indicators

The outcomes of interest were nine pneumonia care indicators spanning three domains of care (Table 1). These are 1 = cough, 2 = difficult breathing, 3 = respiratory rate, 4 = oxygen saturation, 5 = level of consciousness measured on the ‘Alert’, ‘Verbal response’, ‘response to Pain’, and ‘Unresponsive’ (AVPU) scale, 6 = lower chest wall indrawing (signs and symptoms in the assessment domain), 7 = pneumonia severity classification (diagnosis domain), 8 = oral amoxicillin prescription to treat pneumonia, and 9 = oral amoxicillin dosage and frequency of administration (treatment domain). While these indicators were measured on different scales reflecting different aspects of care, we created a binary variable for each one of them as appropriate (Table 1). For each case record, we assessed documentation of cough and difficult breathing (primary pneumonia signs and symptoms required for identification of pneumonia cases), respiratory rate, oxygen saturation, AVPU, lower chest wall indrawing (secondary signs and symptoms required for classification of pneumonia severity).²⁶ For each sign and symptom, we created a binary variable with the value one representing documentation in pediatric admission record (PAR) (e.g., cough assessed at point of admission and marked in a check box as present or absent) and zero representing lack of documentation of a sign and symptom in the medical record by the admitting clinician (Table 1). In the diagnosis domain, we assessed whether clinical pneumonia diagnosis and the severity classification documented in a patient’s PAR by the admitting clinician was in line with the diagnosis and the severity implied by presenting signs and symptoms. Here, we created a binary variable with value one representing correct diagnosis and severity classification and zero representing misclassification of pneumonia severity (Table 1).

Table 1. Definition of binary outcomes in the assessment, diagnosis and classification, and treatment domains of pediatric pneumonia care.

Quality of care domain	Indicator	Scores in binary indicators
1. Assessment: primary signs and symptoms	Cough	1: if cough is documented; 0: if it is not documented.
	Difficult breathing	1: if difficult breathing is documented; 0: if it is not documented.
1. Assessment: secondary signs and symptoms	Respiratory rate	1: if respiratory rate is documented; 0: if it is not documented.
	Oxygen saturation	1: if oxygen saturation is documented; 0: if it is not documented.
	AVPU^a	1: if AVPU is documented; 0: if it is not documented.
	Lower chest wall indrawing	1: if indrawing is documented; 0: if it is not documented.
2. Diagnosis and classification	Correct diagnosis^*	1: if the admitting clinician documented pneumonia as the clinical diagnosis; 0: if documented clinical diagnosis is severe pneumonia or missing classification.
3. Treatment	Correct prescription	1: if oral amoxicillin was prescribed and documented in the medical record; 0: if amoxicillin was not prescribed.
	Correct oral amoxicillin dose	1: if oral amoxicillin was prescribed in correct dose and frequencies, that is, 32–48 international units/Kilogram (IU/Kg) every 12 h; 0: if oral amoxicillin prescription is an under dose (<32 IU/Kg) or overdose (>48 IU/Kg) or missing amoxicillin dose or wrong frequency or missing frequency or missing patient’s weight.

Note: ^a AVPU: Alert, Verbal response, Pain response, Unresponsive.

^* Pneumonia diagnosis for patients with history of cough and/or difficult breathing (primary signs) in combination with signs of lower chest wall indrawing and/or respiratory rate (RR) ≥50 (≥40) for patients aged 2–11 months (12–59 months), in the absence of any danger sign (inability to drink/breastfeed, cyanosis, grunting or oxygen saturation < 90% or AVPU = ‘V’, ‘P’ or ‘U’).

In the treatment domain, we had two binary indicators, one assessing adherence to prescription guidelines and the other assessing adherence to dosing guidelines. For the prescription indicator, the value one represented prescription of oral amoxicillin to treat pediatric pneumonia as per the guidelines while zero represented deviation from ideal care (i.e., lack of evidence in a patient’s medical record that oral amoxicillin was prescribed) (Table 1).

To determine correctness of dose among oral amoxicillin recipients, we considered actual dose prescribed, patient’s weight and frequency of administration as documented in a patient’s medical record. We created a binary indicator with value one representing oral amoxicillin correct dosage and correct frequency of administration (i.e., 32–48 international units per Kilogram [IU/Kg] every 12 h). Inappropriate oral amoxicillin dosing was considered as: lack of documentation of actual oral amoxicillin dose prescribed, lack of documentation of patient’s weight, undocumented/wrong frequency of oral amoxicillin administration, oral amoxicillin underdosing (<32 IU/Kg every 12 h) or overdosing (>48 IU/Kg every 12 h) (Table 1).
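The dosing rule above can be expressed as a small function. This is only a sketch of the stated definition: the function and argument names are hypothetical, `None` stands for an undocumented field, and the dose is taken to be in the same units per kilogram as reported in the text.

```python
def correct_dose_indicator(dose, weight_kg, interval_hours,
                           low=32.0, high=48.0, required_interval=12):
    """Return 1 for a correct oral amoxicillin dose and frequency, else 0.

    Any undocumented field (None) counts as inappropriate dosing,
    mirroring the definition in Table 1.
    """
    if dose is None or weight_kg is None or interval_hours is None:
        return 0
    if weight_kg <= 0:
        # Assumption: an implausible weight is treated like a missing weight.
        return 0
    dose_per_kg = dose / weight_kg
    in_range = low <= dose_per_kg <= high
    return int(in_range and interval_hours == required_interval)


print(correct_dose_indicator(400, 10, 12))   # 40 per kg every 12 h
print(correct_dose_indicator(400, None, 12)) # weight not documented
```

Treating the bounds as inclusive follows the "<32" and ">48" wording used for underdosing and overdosing.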

3.2. Covariates

In this analysis, the covariates of interest included intervention arm, follow-up time (in months) and their interaction, hospital malaria prevalence status and pediatric admission workload. Five out of 12 hospitals were drawn from high malaria endemic regions while the remaining seven hospitals were drawn from regions with low malaria endemicity in Kenya.²⁸ Hospitals with fewer than 1000 pediatric admissions per year were categorized as low admission workload while those with 1000 or more pediatric admissions per year were categorized as high admission workload hospitals. This categorization allowed us to assess the impact of admission workload on quality of inpatient pediatric pneumonia care. This is considering that public hospitals in LMICs are often characterized by a shortage in workforce, potentially impeding delivery of health care services.^29–31 At clinician level, gender and cadre were considered (here cadre refers to clinician’s level of training, that is, clinical officers with diploma-level training and medical officers with bachelor’s degree-level training). Among 295 clinicians with observed cadre, the majority were clinical officer interns at 62.4% (n = 184) followed by medical officer interns at 33.4% (n = 99). Clinical officers and medical officers accounted for 2.0% (n = 6) each. Among 296 clinicians with observed gender, 43.2% (n = 128) were females.

At patient level, we considered gender, number of comorbid illnesses and age in months at point of admission. While the WHO pediatric pneumonia treatment guidelines apply for children aged 2–59 months,²⁶ we categorized patients into two age groups (i.e., 2–11 months and 12–59 months) in order to assess whether pneumonia care administered varied between infants and older children. This is considering that older children tend to have better outcomes compared to infants.³² Approximately 42.5% (903/2127) of the patients were aged between 2 and 11 months and 57.5% (1224/2127) were aged between 12 and 59 months. Among 2114 patients with observed gender, 55.1% (n = 1164) were males. Regarding comorbidities, we determined the total number of clinical diagnoses documented in patient medical records. The diagnoses of interest in the comorbidity variables included malaria, malnutrition, asthma, tuberculosis (TB), rickets, anemia, diarrhea and dehydration. For each patient, we created separate binary variables for each diagnosis above with value 1 denoting the presence of a disease and 0 denoting absence of a disease. We then created a categorical variable which consisted of a count of comorbidities defined as (0 = none, 1 = one, 2 = two, 3 = three or more comorbid illnesses). The above categorization was to assess whether care among pneumonia patients varied with an increase in the number of comorbid illnesses. Clinically, 46.8% (995/2127) of the patients had no comorbidities, 29.8% (633/2127) had one comorbidity, 17.9% (381/2127) had two comorbidities, and 5.5% (118/2127) had at least three comorbidities.
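The derivation of the comorbidity category can be sketched as follows. The dictionary keys and the function name are illustrative, and treating an absent key as "disease not documented" is our assumption rather than something stated in the trial procedures.

```python
COMORBIDITIES = (
    "malaria", "malnutrition", "asthma", "tuberculosis",
    "rickets", "anemia", "diarrhea", "dehydration",
)


def comorbidity_category(diagnoses):
    """Map per-diagnosis 0/1 flags to the categories 0, 1, 2 or 3 (three or more).

    `diagnoses` is a dict such as {"malaria": 1, "anemia": 0}.
    Assumption: a diagnosis absent from the dict counts as 0.
    """
    count = sum(1 for name in COMORBIDITIES if diagnoses.get(name, 0) == 1)
    return min(count, 3)


print(comorbidity_category({"malaria": 1, "anemia": 1}))
```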

3.3. Missingness in the trial data

Missing data occurred in patient- and clinician level covariates. Approximately, 21.9% (83/378) and 21.7% (82/378) clinicians had missing data on the gender and cadre variables respectively, while patient’s gender was missing in 0.7% (17/2127) case records. An assessment of the missing data pattern revealed that nearly all clinicians with missing cadre had gender missing as well.

4. Application: Model Fitting and Inference

4.1. Multiple imputation

Before fitting the analysis models of interest, we imputed partially observed covariates assuming a missing at random (MAR) mechanism. MI was conducted within the joint modeling (JM) framework where imputation values are drawn from a multivariate normal distribution in a single step.^23,33 We used the latent normal approach to impute incomplete categorical variables of interest.²³ Multiple imputation was implemented in the jomo³⁴ and mitml³⁵ packages in R (version 3.5.4) which allow imputation of categorical variables with more than two levels in the second and higher levels of the multilevel structure. For the i^th (i = 1,…,2127) patient nested within the j^th clinician (j = 1,…,378) in hospital l (l = 1,…,12), we defined a two-level JM imputation model corresponding to

\begin{matrix} Y_{ijl}^{(1)} = X_{ijl}^{(1)} β^{(1)} + b_{jl}^{(1)} + e_{ijl}^{(1)} \\ Y_{jl}^{(2)} = X_{jl}^{(2)} β^{(2)} + b_{jl}^{(2)} \\ e_{ijl}^{(1)} \sim N(0, σ_{e}^{2}), \text{ and } (b_{jl}^{(1)}, b_{jl}^{(2)}) \sim N(0, Σ_{b}), \end{matrix}

(7)

where $Y_{ijl}^{(1)}$ and $Y_{jl}^{(2)}$ are vectors of partially observed level 1 variables (patient’s sex) and level two variables (clinician’s sex and cadre) respectively. Predictor variables $(X_{ijl}^{(1)})$ for missing patient’s sex were fully observed variables (i.e., follow-up time, interacted with feedback arm, hospital admission workload and hospital malaria prevalence status, patient’s age and the number of comorbid illnesses). Predictor variables $(X_{jl}^{(2)})$ for missing clinician’s sex and cadre at the second level of the imputation model included follow-up time interacted with feedback arm, hospital admission workload and hospital malaria prevalence status. We also included all the nine binary response variables in both levels of the imputation model. A random intercept (b_jl) was included to account for clustering at clinician level. Missing values were imputed 20 times with a burn-in of 500, and 500 updates between each imputed data set. Imputed values were assessed as appropriate while trace plots were used to assess convergence of the imputation model.³⁶

4.2. Separate univariate analyses

First, we analyzed the nine outcomes separately under complete case analysis and after multiple imputation of missing covariates. For each outcome (r = 1,2,…,9), we fitted a generalized linear mixed model defined by

\begin{aligned} \operatorname{logit}[P(Y_{rijl} = 1)] &= β_{r0} + β_{r1} x_{(\text{age group};\, rijl)} + β_{r2} x_{(\text{patient sex};\, rijl)} + β_{r3} x_{(\text{comorbidities} = 0;\, rijl)} + β_{r4} x_{(\text{comorbidities} = 1;\, rijl)} \\ &\quad + β_{r5} x_{(\text{comorbidities} = 2;\, rijl)} + β_{r6} x_{(\text{clinician cadre};\, rjl)} + β_{r7} x_{(\text{clinician sex};\, rjl)} + β_{r8} x_{(\text{admission workload};\, rjl)} \\ &\quad + β_{r9} x_{(\text{malaria prevalence};\, rl)} + β_{r10} x_{(\text{time in months};\, rl)} + β_{r11} x_{(\text{trial arm};\, rl)} + β_{r12} x_{(\text{time in months};\, rl)} \times x_{(\text{trial arm};\, rl)} + b_{rijl}, \end{aligned}

(8)

where β_r1,β_r2…,β_r12 are regression parameters associated with known fixed covariates for the r^th outcome. Due to relatively low numbers of clinical and medical officers, we grouped clinicians into two cadres from the initial four. That is, clinical officers (CO) combine clinical officers and clinical officer interns and medical officers (MO) combine medical officers and medical officer interns, respectively. The vector of random clinicians’ intercepts b_ijl is assumed to follow a normal distribution with mean zero and variance $σ_{b}^{2}$ .
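To show how model (8) turns a linear predictor into a probability, the sketch below applies the inverse logit to a fixed-effects part plus a clinician random intercept. The coefficient and covariate values in the usage lines are made up for illustration and are not estimates from the trial; the function names are ours.

```python
from math import exp


def inverse_logit(eta):
    """Numerically stable inverse of the logit link."""
    if eta >= 0:
        return 1.0 / (1.0 + exp(-eta))
    z = exp(eta)
    return z / (1.0 + z)


def outcome_probability(beta, x, clinician_intercept=0.0):
    """P(Y = 1) for one patient given fixed effects and a random intercept.

    `beta` and `x` must have the same length; x[0] = 1 carries the intercept.
    """
    if len(beta) != len(x):
        raise ValueError("beta and x must have the same length")
    eta = sum(b * v for b, v in zip(beta, x)) + clinician_intercept
    return inverse_logit(eta)


# Hypothetical values: intercept, time in months, trial arm, time x arm.
beta = [0.2, 0.05, 0.1, 0.15]
x = [1, 6, 1, 6]
print(outcome_probability(beta, x, clinician_intercept=-0.3))
```

The same covariate pattern yields different probabilities for different clinicians because the random intercept shifts the linear predictor on the logit scale.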

4.3. Full multivariate joint model

To analyze the nine pneumonia outcomes jointly, a full multivariate joint model was considered:

\begin{matrix} \operatorname{logit}[P(Y_{1i} = 1)] = X_{i} β_{1} + b_{1i} \\ \operatorname{logit}[P(Y_{2i} = 1)] = X_{i} β_{2} + b_{2i} \\ ⋮ \\ \operatorname{logit}[P(Y_{9i} = 1)] = X_{i} β_{9} + b_{9i}, \end{matrix}

(9)

where X_i denotes a vector of known covariates and β₁, β₂,…,β₉ are vectors of regression parameters to be estimated for each of the nine outcomes. The random clinicians’ intercepts were assumed to follow a joint zero-mean normal distribution denoted by

(\begin{array}{l} b_{1 i} \\ b_{2 i} \\ b_{3 i} \\ b_{4 i} \\ b_{5 i} \\ b_{6 i} \\ b_{7 i} \\ b_{8 i} \\ b_{9 i} \end{array}) \sim N (0, D)

where D, the covariance matrix of the random effects has the following structure:

D = [\begin{matrix} σ_{b_{1}}^{2} & σ_{b_{1} b_{2}} & σ_{b_{1} b_{3}} & σ_{b_{1} b_{4}} & σ_{b_{1} b_{5}} & σ_{b_{1} b_{6}} & σ_{b_{1} b_{7}} & σ_{b_{1} b_{8}} & σ_{b_{1} b_{9}} \\ & σ_{b_{2}}^{2} & σ_{b_{2} b_{3}} & σ_{b_{2} b_{4}} & σ_{b_{2} b_{5}} & σ_{b_{2} b_{6}} & σ_{b_{2} b_{7}} & σ_{b_{2} b_{8}} & σ_{b_{2} b_{9}} \\ & & σ_{b_{3}}^{2} & σ_{b_{3} b_{4}} & σ_{b_{3} b_{5}} & σ_{b_{3} b_{6}} & σ_{b_{3} b_{7}} & σ_{b_{3} b_{8}} & σ_{b_{3} b_{9}} \\ & & & σ_{b_{4}}^{2} & σ_{b_{4} b_{5}} & σ_{b_{4} b_{6}} & σ_{b_{4} b_{7}} & σ_{b_{4} b_{8}} & σ_{b_{4} b_{9}} \\ & & & & σ_{b_{5}}^{2} & σ_{b_{5} b_{6}} & σ_{b_{5} b_{7}} & σ_{b_{5} b_{8}} & σ_{b_{5} b_{9}} \\ & & & & & σ_{b_{6}}^{2} & σ_{b_{6} b_{7}} & σ_{b_{6} b_{8}} & σ_{b_{6} b_{9}} \\ & & & & & & σ_{b_{7}}^{2} & σ_{b_{7} b_{8}} & σ_{b_{7} b_{9}} \\ & & & & & & & σ_{b_{8}}^{2} & σ_{b_{8} b_{9}} \\ & & & & & & & & σ_{b_{9}}^{2} \end{matrix}] .

(10)

4.4. Pairwise joint modeling

To circumvent the computational burden associated with model (9), we applied the pairwise approach to jointly model the probability of documentation among nine pneumonia outcomes under complete case analysis and after multiple imputation of missing covariates. We fitted 36 pairwise models where each pairwise model was defined by

\begin{aligned} \operatorname{logit}[P(Y_{rijl} = 1)] &= β_{r0} + β_{r1} x_{(\text{age group};\, rijl)} + β_{r2} x_{(\text{patient sex};\, rijl)} + β_{r3} x_{(\text{comorbidities} = 0;\, rijl)} + β_{r4} x_{(\text{comorbidities} = 1;\, rijl)} \\ &\quad + β_{r5} x_{(\text{comorbidities} = 2;\, rijl)} + β_{r6} x_{(\text{clinician cadre};\, rjl)} + β_{r7} x_{(\text{clinician sex};\, rjl)} + β_{r8} x_{(\text{admission workload};\, rjl)} \\ &\quad + β_{r9} x_{(\text{malaria prevalence};\, rl)} + β_{r10} x_{(\text{time in months};\, rl)} + β_{r11} x_{(\text{trial arm};\, rl)} + β_{r12} x_{(\text{time in months};\, rl)} \times x_{(\text{trial arm};\, rl)} + b_{rijl}, \end{aligned}

\begin{aligned} \operatorname{logit}[P(Y_{sijl} = 1)] &= β_{s0} + β_{s1} x_{(\text{age group};\, sijl)} + β_{s2} x_{(\text{patient sex};\, sijl)} + β_{s3} x_{(\text{comorbidities} = 0;\, sijl)} + β_{s4} x_{(\text{comorbidities} = 1;\, sijl)} \\ &\quad + β_{s5} x_{(\text{comorbidities} = 2;\, sijl)} + β_{s6} x_{(\text{clinician cadre};\, sjl)} + β_{s7} x_{(\text{clinician sex};\, sjl)} + β_{s8} x_{(\text{admission workload};\, sjl)} \\ &\quad + β_{s9} x_{(\text{malaria prevalence};\, sl)} + β_{s10} x_{(\text{time in months};\, sl)} + β_{s11} x_{(\text{trial arm};\, sl)} + β_{s12} x_{(\text{time in months};\, sl)} \times x_{(\text{trial arm};\, sl)} + b_{sijl}, \end{aligned}

(11)

where Y_rijl and Y_sijl denote the r^th and the s^th outcomes, r ≠ s for (r,s = 1,2,…,9) for patient i admitted by clinician j in hospital l. Each outcome occurred in eight specific pairs and we included a random clinicians’ intercept in each model. For each pairwise joint model, the random effects were assumed to follow a bivariate normal distribution denoted by

(\begin{matrix} b_{r} \\ b_{s} \end{matrix}) \sim N [0, (\begin{matrix} σ_{b_{r}}^{2} & σ_{b_{r} b_{s}} \\ & σ_{b_{s}}^{2} \end{matrix})] .

(12)

We fitted all pairwise joint models using the JMbayes package³⁷ on a server with the following specification: 40 GB memory, Intel Xeon E5-4650 (2.70 GHz) processor (12 cores/24 threads), GNU/Linux Ubuntu 14.04 OS, and R (version 3.4.4). For verification purposes, complete case analysis was also conducted in SAS version 9.4 using a previously published SAS macro.¹⁰

Under complete case analysis, regression estimates and standard errors were averaged across the 36 pairwise models using the pseudo-likelihood approach presented in Section 2. Likewise, regression parameters were averaged across the pairwise models for each imputed data set. Variance–covariance estimators (Equation 6) were also obtained for each imputed data set. This step resulted in 20 sets of averaged regression parameters and variance–covariance estimators. Thereafter, Rubin’s rules³⁸ were applied to obtain final estimates while accounting for within- and between-imputation variability. More details on the two-step procedure are as follows.
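Before turning to those details, the pooling step can be sketched for a single scalar parameter. The code below applies the standard form of Rubin's rules (pooled estimate as the mean across imputations, total variance as the within-imputation variance plus (1 + 1/M) times the between-imputation variance). The function name and the toy numbers are illustrative and are not taken from the trial.

```python
from math import sqrt
from statistics import mean, variance


def pool_rubin(estimates, variances):
    """Pool M point estimates and their squared standard errors.

    Returns (pooled estimate, pooled standard error).
    """
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two imputations and matching lengths")
    pooled = mean(estimates)
    within = mean(variances)       # average within-imputation variance
    between = variance(estimates)  # sample variance across imputations (divisor M - 1)
    total = within + (1.0 + 1.0 / m) * between
    return pooled, sqrt(total)


# Toy example with three imputations of one regression coefficient.
print(pool_rubin([1.0, 2.0, 3.0], [0.5, 0.5, 0.5]))
```

In the setting described here, the inputs for each parameter would be the 20 averaged estimates and the corresponding diagonal elements of the 20 variance–covariance estimators.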

4.4.1. Inference for fixed regression parameters

Each bivariate model in the m^th imputed dataset had a vector of 26 regression coefficients (i.e., 13 regression coefficients for each outcome) denoted by ${\hat{β}}_{q m}, q = 1, 2, \dots, 36$ , m = 1,2,…,20. We stacked the 36 pairwise parameter estimate vectors, resulting in a column vector with 936 rows, that is,

{\hat{β}}_{m} = {[\begin{matrix} {\hat{β}}_{1 m} \\ {\hat{β}}_{2 m} \\ ⋮ \\ {\hat{β}}_{36 m} \end{matrix}]}_{936 \times 1}, m = 1, 2, \dots, 20 .

Any two pairwise joint models with a common outcome (e.g., l(Y_r, Y_s) and l(Y_r, Y_s′) s ≠ s′) shared the parameters for the r^th outcome.^8–10,24 To account for duplicate parameter estimates, we pre-multiplied ${\hat{β}}_{m}$ with an appropriate weight matrix A as follows,

{\hat{β}}_{m}^{*} = A {\hat{β}}_{m}, m = 1, 2, \dots, 20 .

(13)

The weight matrix A had 117 rows (i.e., 13 regression parameters for each of the nine outcomes) and 936 columns, and it was constructed so that it averaged all duplicate parameter estimates of an outcome across the eight pairwise models in which that outcome occurred. The resulting vector, ${\hat{β}}_{m}^{*}$, was a stacked column vector of 117 parameter estimates for all nine outcomes, with the 13 regression parameters of the r^th outcome denoted by ${\hat{β}}_{m r}^{*}$. This step was repeated for all 20 imputed data sets.
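The construction of A can be illustrated with a minimal sketch. It assumes, purely for illustration, that the 36 pairs are stacked in the order (1,2), (1,3), …, (8,9) and that within each pair the 13 coefficients of the first outcome precede those of the second; the function name `build_weight_matrix` is hypothetical and not taken from the authors' code.

```python
import itertools
import numpy as np

N_OUTCOMES = 9   # outcomes in the joint model
N_COEF = 13      # fixed-effect coefficients per outcome


def build_weight_matrix(n_outcomes=N_OUTCOMES, n_coef=N_COEF):
    """Return the averaging matrix A and the pair ordering it assumes."""
    # Assumed ordering of the pairwise models: (0,1), (0,2), ..., (7,8).
    pairs = list(itertools.combinations(range(n_outcomes), 2))
    pair_len = 2 * n_coef                      # 26 coefficients per pair
    A = np.zeros((n_outcomes * n_coef, len(pairs) * pair_len))
    weight = 1.0 / (n_outcomes - 1)            # each outcome is in 8 pairs
    for q, (r, s) in enumerate(pairs):
        for position, outcome in enumerate((r, s)):
            for k in range(n_coef):
                row = outcome * n_coef + k                   # slot in beta*
                col = q * pair_len + position * n_coef + k   # slot in stacked beta
                A[row, col] = weight
    return A, pairs


A, pairs = build_weight_matrix()
print(A.shape, len(pairs))
```

Each row of A sums to one, so pre-multiplying the stacked 936-vector by A returns the simple average of the eight duplicate estimates of every coefficient.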

4.4.2. Inference for standard errors

The corresponding standard errors were obtained using the pseudo-likelihood approach introduced above. For each bivariate pair, Y_mq, q = 1,2,…,36, in the m^th imputed dataset, we estimated the variance–covariance matrix, H^–1GH^–1. Since H and G depend on the unknown parameters in Θ,^8,24 they were estimated in the four steps below, where N denotes the total number of subjects.

Step 1: We obtained ${\hat{J}}_{m q}$ and ${\hat{K}}_{m q}$ for each pairwise model using
${\hat{J}}_{m q} = \sum_{i = 1}^{N} X_{i m q}^{T} {\hat{T}}_{i m q} X_{i m q}$ and ${\hat{K}}_{m q} = [X_{1 m q}^{T} {\hat{T}}_{1 m q}, X_{2 m q}^{T} {\hat{T}}_{2 m q}, \dots, X_{N m q}^{T} {\hat{T}}_{N m q}]$,
where X_imq is the i^th subject’s contribution to the design matrix for the fixed effects and ${\hat{T}}_{i m q} = (Z_{i m q} {\hat{D}}_{m q} Z_{i m q}^{T})$, with Z_imq the i^th subject’s contribution to the design matrix for the random effects²⁴ and ${\hat{D}}_{m q}$ the estimated variance–covariance matrix of the random effects for the q^th pair in the m^th imputed data set.
Step 2: We combined the ${\hat{J}}_{m q}$ and ${\hat{K}}_{m q}$ estimated across all 36 pairs, (i.e., $({\hat{J}}_{m 1}, {\hat{K}}_{m 1}), ({\hat{J}}_{m 2}, {\hat{K}}_{m 2}), \dots, ({\hat{J}}_{m 36}, {\hat{K}}_{m 36})$), as follows:
${\hat{J}}_{m} = {[\begin{matrix} {\hat{J}}_{m 1} & 0 & \dots & 0 \\ 0 & {\hat{J}}_{m 2} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & {\hat{J}}_{m 36} \end{matrix}]}_{936 \times 936}$ and ${\hat{K}}_{m} = {[\begin{matrix} {\hat{K}}_{m 1} \\ {\hat{K}}_{m 2} \\ ⋮ \\ {\hat{K}}_{m 36} \end{matrix}]}_{936 \times N}$.
Step 3: We estimated H_m and G_m as
${\hat{H}}_{m} = \frac{1}{N} {\hat{J}}_{m}$ and ${\hat{G}}_{m} = \frac{1}{N} {\hat{K}}_{m} {\hat{K}}_{m}^{T}$,
where N is defined above.
Step 4: We obtained a variance–covariance matrix, ${\hat{Σ}}_{m}^{*}$, for each imputed dataset using
${\hat{Σ}}_{m}^{*} = A {\hat{Ω}}_{m} A^{T}, m = 1, 2, \dots, 20,$ (14)
where ${\hat{Ω}}_{m} = {\hat{H}}_{m}^{- 1} {\hat{G}}_{m} {\hat{H}}_{m}^{- 1}$ and A is the weight matrix defined above. Each ${\hat{Σ}}_{m}^{*}$ was a 117 × 117 covariance matrix whose diagonal elements corresponded to the variances of the fixed regression parameters in ${\hat{β}}_{m}^{*}$.
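Steps 2 to 4 can be sketched in a few lines of NumPy. The sketch below follows the formulas exactly as written above; the helper names (`block_diagonal`, `pooled_sandwich`) and the toy dimensions (three outcomes with one coefficient each, hence three pairs and a 3 × 6 weight matrix) are illustrative assumptions, and the random inputs are stand-ins rather than trial data.

```python
import numpy as np


def block_diagonal(blocks):
    """Place square blocks on the diagonal of an otherwise zero matrix."""
    size = sum(b.shape[0] for b in blocks)
    out = np.zeros((size, size))
    start = 0
    for b in blocks:
        stop = start + b.shape[0]
        out[start:stop, start:stop] = b
        start = stop
    return out


def pooled_sandwich(J_blocks, K_blocks, A):
    """Combine pairwise J and K pieces, then average the sandwich with A."""
    n_subjects = K_blocks[0].shape[1]
    J = block_diagonal(J_blocks)        # Step 2: block-diagonal J_m
    K = np.vstack(K_blocks)             # Step 2: stacked K_m
    H = J / n_subjects                  # Step 3: H_m
    G = K @ K.T / n_subjects            # Step 3: G_m
    H_inv = np.linalg.inv(H)
    omega = H_inv @ G @ H_inv           # Omega_m = H^-1 G H^-1
    return A @ omega @ A.T              # Step 4, Equation (14)


# Toy inputs (hypothetical): pairs (1,2), (1,3), (2,3), one coefficient each.
rng = np.random.default_rng(0)
n_subjects, pair_len, n_pairs = 50, 2, 3
K_blocks = [rng.normal(size=(pair_len, n_subjects)) for _ in range(n_pairs)]
J_blocks = [Kq @ Kq.T for Kq in K_blocks]   # positive definite stand-ins
A_toy = 0.5 * np.array([[1, 0, 1, 0, 0, 0],
                        [0, 1, 0, 0, 1, 0],
                        [0, 0, 0, 1, 0, 1]], dtype=float)
sigma_star = pooled_sandwich(J_blocks, K_blocks, A_toy)
print(np.sqrt(np.diag(sigma_star)))
```

Because G is built from the full stacked K, the off-diagonal blocks of the sandwich carry the dependence between estimates from different pairs, which is what the averaging with A needs in order to produce valid variances.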

4.4.3. Pooling final estimates

In the final step, we pooled the estimates for each of the nine outcomes using Rubin’s rules.³⁸ Pooling was based on the set of averaged pairwise regression parameters and the variance–covariance matrices ${\hat{Σ}}_{m}^{*}$ obtained as described in Section 4.4.2.¹⁰ The pooled MI estimator for β is given by

{\bar{β}}_{r}^{*} = \frac{1}{M} \sum_{m = 1}^{M} {\hat{β}}_{m r}^{*},

(15)

with variance estimator

{\hat{V}}_{r} = {\hat{W}}_{r} + (\frac{M + 1}{M}) \times {\hat{B}}_{r},

where

{\hat{W}}_{r} = \frac{1}{M} \sum_{m = 1}^{M} {\hat{σ}}_{m r}^{2}

is the average within-imputation variance, ${\hat{σ}}_{m r}^{2}$ are the corresponding diagonal elements of ${\hat{Σ}}_{m}^{*}$, and

{\hat{B}}_{r} = \frac{1}{M - 1} \sum_{m = 1}^{M} {({\hat{β}}_{m r}^{*} - {\bar{β}}_{r}^{*})}^{2}

is the between-imputation variance. Final MI estimates were compared to those obtained under complete case analysis.
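As a small worked illustration of Equation (15) and the variance estimator above, the following sketch pools per-parameter estimates across M imputations. The function name `rubin_pool` and the toy numbers are assumptions made for the example only; they are not results from the trial.

```python
import numpy as np


def rubin_pool(estimates, variances):
    """Pool per-imputation estimates and variances with Rubin's rules.

    estimates, variances: array-likes of shape (M, p), one row per imputation.
    """
    estimates = np.asarray(estimates, dtype=float)
    variances = np.asarray(variances, dtype=float)
    M = estimates.shape[0]
    pooled = estimates.mean(axis=0)               # Equation (15)
    within = variances.mean(axis=0)               # W: within-imputation variance
    between = estimates.var(axis=0, ddof=1)       # B: between-imputation variance
    total = within + (M + 1) / M * between        # V = W + ((M + 1) / M) B
    return pooled, total


# Toy example with M = 3 imputations and a single parameter.
pooled, total = rubin_pool([[1.0], [2.0], [3.0]], [[0.5], [0.5], [0.5]])
print(pooled, total)
```

In this toy case the pooled estimate is 2.0, W = 0.5 and B = 1.0, so the total variance is 0.5 + (4/3) × 1.0.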

4.4.4. Wald test for joint covariates effects under complete case analysis and after multiple imputation

To test for the joint effects of covariates on the outcomes, we used a Wald-type test under complete case analysis and after multiple imputation of missing covariates. The general linear hypothesis was

H_{0} : L β = 0 vs H_{A} : L β \neq 0 .

(16)

Systems of linear equations were defined as appropriate for different parameter vectors. For illustration, the joint null hypothesis for the interaction effect between the intervention arm and follow-up time on the nine outcomes (i.e., β_1,12 = β_2,12 = β_3,12 = β_4,12 = β_5,12 = β_6,12 = β_7,12 = β_8,12 = β_9,12 = 0) was defined using the system of linear equations below:

{\left(\begin{array}{cccc|cccc|c|cccc} 0 & \cdots & 0 & 1 & 0 & \cdots & 0 & 0 & \cdots & 0 & \cdots & 0 & 0 \\ 0 & \cdots & 0 & 0 & 0 & \cdots & 0 & 1 & \cdots & 0 & \cdots & 0 & 0 \\ ⋮ & & ⋮ & ⋮ & ⋮ & & ⋮ & ⋮ & ⋱ & ⋮ & & ⋮ & ⋮ \\ 0 & \cdots & 0 & 0 & 0 & \cdots & 0 & 0 & \cdots & 0 & \cdots & 0 & 1 \end{array}\right)}_{9 \times 117} {\begin{pmatrix} β_{1, 0} \\ β_{1, 1} \\ ⋮ \\ β_{1, 12} \\ β_{2, 0} \\ β_{2, 1} \\ ⋮ \\ β_{2, 12} \\ ⋮ \\ β_{9, 0} \\ β_{9, 1} \\ ⋮ \\ β_{9, 12} \end{pmatrix}}_{117 \times 1} = {\begin{pmatrix} 0 \\ 0 \\ ⋮ \\ 0 \end{pmatrix}}_{9 \times 1} .

The alternative hypothesis stated that at least one of the parameters differs from zero. Under complete case analysis, the test statistic for the joint interaction term was calculated using

F = \frac{1}{9} {(L_{12} {\hat{β}}^{*} - 0)}^{'} {(L_{12} {\hat{Σ}}^{*} L_{12}^{'})}^{- 1} (L_{12} {\hat{β}}^{*} - 0),

(17)

where L₁₂ is a matrix of zeros and ones defined to eliminate all parameter estimates except those associated with the interaction term (i.e.,β_r,12 (r = 1,2,…,9)), ${\hat{β}}^{*}$ is a stacked vector of parameter estimates averaged across 36 pairwise models and ${\hat{Σ}}^{*}$ is the variance–covariance matrix estimated using pseudo-likelihood. Wald-test statistics for the other variables were calculated in a similar manner but adjusting the L matrix appropriately.

For imputed datasets, the joint null hypotheses were tested using linear systems of equations like those defined under complete case analysis. Nonetheless, the test statistics were calculated differently. For instance, the test statistic for the joint interaction term effect on the nine outcomes after multiple imputation was calculated using

F = \frac{1}{9} {(L_{12} {\bar{β}}_{m}^{*} - 0)}^{'} {(L_{12} {\hat{V}}_{m} L_{12}^{'})}^{- 1} (L_{12} {\bar{β}}_{m}^{*} - 0),

(18)

where ${\bar{β}}_{m}^{*}$ is a stacked vector of pooled parameter estimates for all nine outcomes and ${\hat{V}}_{m}$ is the variance–covariance matrix based on Rubin’s rules. Wald-test statistics for the other variables were calculated in a similar manner, with the L matrix adjusted appropriately. In each case, the test statistic was multiplied by nine (removing the fraction in front), giving a test statistic that follows a chi-squared distribution with nine degrees of freedom. A 5% level of significance was used in all statistical tests.
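The selection matrix and the chi-squared form of the statistic (9 × F) can be sketched as follows. The sketch assumes the coefficient ordering shown in the 117 × 1 parameter vector above, so that the interaction term is the thirteenth coefficient of each outcome; `selection_matrix`, `wald_chi2` and the toy inputs are hypothetical and chosen only so the arithmetic is easy to follow.

```python
import numpy as np


def selection_matrix(n_outcomes=9, n_coef=13, coef_index=12):
    """Build L with one row per outcome, picking coefficient `coef_index`."""
    L = np.zeros((n_outcomes, n_outcomes * n_coef))
    for r in range(n_outcomes):
        L[r, r * n_coef + coef_index] = 1.0
    return L


def wald_chi2(L, beta, cov):
    """Return (L beta)' (L cov L')^-1 (L beta), i.e. 9 * F in the text."""
    contrast = L @ beta
    middle = L @ cov @ L.T
    return float(contrast @ np.linalg.solve(middle, contrast))


L12 = selection_matrix()

# Toy inputs (not trial estimates): every interaction coefficient is 0.5
# and every parameter has variance 0.25 with no covariances.
beta_toy = np.zeros(117)
beta_toy[12::13] = 0.5
cov_toy = 0.25 * np.eye(117)
stat = wald_chi2(L12, beta_toy, cov_toy)
print(stat)
```

With these toy inputs each outcome contributes 0.5² / 0.25 = 1 to the statistic. The same function applies after multiple imputation by passing the pooled estimates and the Rubin's-rules covariance matrix, and the resulting value is referred to a chi-squared distribution with nine degrees of freedom.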

4.4.5. Association among pneumonia outcomes

The strength of association among documentation of pneumonia care indicators was evaluated using the variance–covariance matrix of the random effects. Since the covariance matrix D defined in (10) was not estimated directly at the analysis stage, we constructed it from the blocks of random-effects variance–covariance matrices in (12) estimated in the pairwise joint models. Under multiple imputation, we first averaged duplicate variances across the 36 pairwise random intercept models for each of the 20 imputed data sets. Specifically, we extracted the random intercepts variance–covariance matrix for all 36 pairwise joint models, that is,

D_{m 1} = [\begin{matrix} σ_{b_{m 1}}^{2} & σ_{b_{m 1} b_{m 2}} \\ σ_{b_{m 1} b_{m 2}} & σ_{b_{m 2}}^{2} \end{matrix}], D_{m 2} = [\begin{matrix} σ_{b_{m 1}}^{2} & σ_{b_{m 1} b_{m 3}} \\ σ_{b_{m 1} b_{m 3}} & σ_{b_{m 3}}^{2} \end{matrix}], \dots, D_{m 36} = [\begin{matrix} σ_{b_{m 8}}^{2} & σ_{b_{m 8} b_{m 9}} \\ σ_{b_{m 8} b_{m 9}} & σ_{b_{m 9}}^{2} \end{matrix}] .

We then created an overall variance-covariance matrix D_m for each imputed data set accounting for overlapping information. For example, in each imputed data set, (m = 1,2,…,20), documentation of cough occurred in the variance-covariance matrices of the first eight pairs, that is,

D_{m 1} = [\begin{matrix} σ_{b_{m 1}}^{2} & σ_{b_{m 1} b_{m 2}} \\ σ_{b_{m 1} b_{m 2}} & σ_{b_{m 2}}^{2} \end{matrix}], D_{m 2} = [\begin{matrix} σ_{b_{m 1}}^{2} & σ_{b_{m 1} b_{m 3}} \\ σ_{b_{m 1} b_{m 3}} & σ_{b_{m 3}}^{2} \end{matrix}], \dots, D_{m 8} = [\begin{matrix} σ_{b_{m 1}}^{2} & σ_{b_{m 1} b_{m 9}} \\ σ_{b_{m 1} b_{m 9}} & σ_{b_{m 9}}^{2} \end{matrix}] .

We extracted the random intercept variances of each outcome from the pairs in which it occurred and averaged them into a single random intercept variance estimate for Y_r (e.g., $σ_{b_{1}}^{2}$ denoting the random intercept variance for cough). The unique off-diagonal elements, corresponding to the covariance between any two outcomes, were mapped directly into D_m. Thereafter, we averaged the 20 D_m matrices, m = 1,2,…,20, to obtain the overall 9 × 9 variance–covariance matrix D for the nine outcomes. We used the same procedure to construct the random-intercepts variance–covariance matrix under complete case analysis, where we averaged duplicate variances across the 36 pairwise random intercept models. The strength of association between any two outcomes, say Y_r and Y_s, was calculated using

\operatorname{corr}(b_{r}, b_{s}) = \frac{\operatorname{Cov}(b_{r}, b_{s})}{\sqrt{\operatorname{Var}(b_{r}) \times \operatorname{Var}(b_{s})}} = \frac{σ_{b_{r} b_{s}}}{\sqrt{σ_{b_{r}}^{2} \times σ_{b_{s}}^{2}}} .

(19)
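The assembly of D_m from the pairwise blocks and the correlation in Equation (19) can be sketched as below. The dictionary layout of the pairwise blocks, the function names and the three-outcome toy values are assumptions for illustration; in the analysis described here there would be 36 blocks and a 9 × 9 matrix.

```python
import numpy as np


def assemble_covariance(pair_blocks, n_outcomes):
    """Build the overall random-intercept covariance matrix from 2 x 2 blocks.

    pair_blocks maps (r, s), with r < s, to the block of Equation (12):
    [[var_r, cov_rs], [cov_rs, var_s]].
    """
    D = np.zeros((n_outcomes, n_outcomes))
    counts = np.zeros(n_outcomes)
    for (r, s), block in pair_blocks.items():
        block = np.asarray(block, dtype=float)
        D[r, r] += block[0, 0]            # accumulate duplicate variances
        D[s, s] += block[1, 1]
        counts[r] += 1
        counts[s] += 1
        D[r, s] = D[s, r] = block[0, 1]   # each covariance appears only once
    D[np.diag_indices(n_outcomes)] = np.diag(D) / counts   # average variances
    return D


def to_correlation(D):
    """Apply Equation (19) to every pair of outcomes."""
    sd = np.sqrt(np.diag(D))
    return D / np.outer(sd, sd)


# Toy blocks for three outcomes (hypothetical values).
blocks = {
    (0, 1): [[1.0, 0.5], [0.5, 4.0]],
    (0, 2): [[3.0, -1.0], [-1.0, 1.0]],
    (1, 2): [[4.0, 1.0], [1.0, 1.0]],
}
D_toy = assemble_covariance(blocks, 3)
R_toy = to_correlation(D_toy)
print(D_toy)
print(R_toy)
```

Under multiple imputation, the same function would be applied to each imputed data set and the resulting matrices averaged elementwise before converting to correlations.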

We performed principal component analysis (PCA) on random clinicians’ intercepts variance–covariance matrices obtained under complete case analysis and after multiple imputation. This was to help visualize factor loadings among pneumonia outcomes of interest and how they correlated with one another.
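One way to carry out such a PCA is through an eigendecomposition of the correlation matrix of the random intercepts. The sketch below is a generic illustration under that assumption, with a hypothetical 3 × 3 correlation matrix; it is not the authors' code and does not reproduce the reported figures.

```python
import numpy as np


def pca_from_correlation(R):
    """Return explained-variance proportions and component loadings."""
    eigenvalues, eigenvectors = np.linalg.eigh(R)   # ascending eigenvalues
    order = np.argsort(eigenvalues)[::-1]           # largest component first
    eigenvalues = eigenvalues[order]
    eigenvectors = eigenvectors[:, order]
    explained = eigenvalues / eigenvalues.sum()
    # Loadings scale each eigenvector by the square root of its eigenvalue.
    loadings = eigenvectors * np.sqrt(np.clip(eigenvalues, 0.0, None))
    return explained, loadings


# Hypothetical correlations: outcomes 1 and 2 positively correlated,
# outcome 3 negatively correlated with both.
R_example = np.array([[1.0, 0.8, -0.5],
                      [0.8, 1.0, -0.4],
                      [-0.5, -0.4, 1.0]])
explained, loadings = pca_from_correlation(R_example)
print(explained[:2])
print(loadings[:, :2])
```

In a loading plot of the first two columns, outcomes with a positive correlation point in similar directions, while negatively correlated outcomes point apart.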

5. Results

The level of documentation and adherence to recommended pneumonia care varied within and across domains of care. Specifically, most signs and symptoms in the assessment domain were well documented, except for oxygen saturation and respiratory rate, which had documentation rates of 60.9% (1297/2127) and 88.8% (1889/2127), respectively. By contrast, documentation and adherence to recommended guidelines in the diagnosis and treatment domains were poorer than documentation of signs and symptoms in the assessment domain. Of all 2127 syndromic pneumonia cases, only 1473 (69.3%) had a correct clinical pneumonia diagnosis and severity classification documented in the medical record. In the treatment domain, about 48.7% (1036/2127) were prescribed oral amoxicillin as per the guidelines. However, only 25% (523/2127) of all pneumonia patients received the right oral amoxicillin dose at the right frequency of administration, that is, 32–48 international units/Kilogram (IU/Kg) every 12 h.

6. Wald-Type Tests for Joint Covariates Effects

After multiple imputation of missing clinician- and patient-level covariates, the Wald-type test suggested a significant joint interaction effect between trial arm and follow-up time on documentation and adherence to recommended clinical guidelines across the nine pediatric pneumonia outcomes of interest (P-value <0.05). Likewise, pediatric admission workload and malaria prevalence status at hospital level had significant joint effects on the nine outcomes (Table 2). At clinician level, gender and cadre had significant joint effects on documentation and adherence to recommended pediatric pneumonia care guidelines (Table 2). At patient level, age and comorbidity had significant joint effects on the nine outcomes, whereas patient’s gender did not (Table 2). The Wald-type test results under complete case analysis were consistent with those from the imputed datasets for all covariates. That is, all the covariates had significant joint effects on the nine outcomes except patient’s gender (Table 2).

Table 2. Wald-type test results for joint effects of covariates on nine pneumonia outcomes.

Effect	Test statistic (complete case analysis)	p value (complete case analysis)	Test statistic (after multiple imputation)	p value (after multiple imputation)
Patient’s age	19.62	0.02	21.81	0.01
Patient’s gender	12.20	0.21	13.16	0.15
Comorbidity	20.54	0.01	23.48	0.01
Clinician’s gender	20.91	0.01	22.47	0.007
Clinician’s cadre	19.94	0.02	17.96	0.03
Admission workload	25.56	0.002	24.73	0.003
Malaria prevalence	17.89	0.04	19.01	0.02
Time in months	19.26	0.02	18.16	0.03
Enhanced A&F^a arm	17.98	0.04	16.76	0.04
Enhanced A&F arm x follow-up time	18.13	0.03	23.11	0.005

Note: A&F^a, Audit and feedback.

Figure 1 and Supplementary Table A1-A2 present the odds ratios and their 95% confidence intervals estimated from the pairwise joint model under complete case analysis and after multiple imputation of missing covariates. Separate univariate analyses results are presented in Figure 2 and Supplementary Table A3-A4.

Figure 1. Odds ratios (dots) and 95% confidence intervals (horizontal bars) under complete case analysis and after multiple imputation of missing covariates: pairwise joint modeling of nine pneumonia care outcomes.

Under pairwise joint modeling, the magnitude and direction of covariate effects varied among the pneumonia outcomes of interest. Over time, documentation and adherence to recommended clinical guidelines improved in six out of nine pneumonia care indicators among children admitted to the six intervention hospitals (i.e., enhanced audit and feedback arm). That is, for a unit increase in follow-up month, the change in the adjusted odds of documentation of oxygen saturation, respiratory rate and lower chest wall indrawing (in the assessment domain), correct pneumonia diagnosis, oral amoxicillin prescription and correct dosage among patients admitted to intervention hospitals (i.e., enhanced A&F arm) was significantly more positive than the change among patients admitted to control hospitals. These observations were made under complete case analysis and after multiple imputation of missing clinician- and patient-level covariates (Figure 1). Nevertheless, the estimated 95% confidence intervals were consistently narrower after multiple imputation. On the other hand, there was no significant difference in the documentation of cough, difficult breathing and AVPU over time between patients admitted to the six intervention hospitals (enhanced A&F arm) and patients admitted to the six control hospitals (standard A&F arm).

We also observed a few instances of contrasting results. For example, after multiple imputation, the adjusted odds of AVPU documentation were significantly lower among patients admitted to hospitals with low pediatric admission workload (Figure 1 and supplementary Table A1). Under complete case analysis, however, there was no significant difference (Figure 1, supplementary Table A2). Similarly, after multiple imputation, the adjusted odds of documentation of difficult breathing and of correct oral amoxicillin dose were lower among patients admitted to low malaria prevalence hospitals than among patients admitted to high malaria prevalence hospitals. Under complete case analysis, however, the difference was not statistically significant (Figure 1, supplementary Table A2).

With regard to separate univariate analysis, the direction and magnitude of the effects of most covariates across the nine outcomes were largely consistent with those observed under the pairwise joint model. Documentation and adherence to recommended clinical guidelines improved over time in five out of nine pneumonia care indicators among children admitted to the six hospitals in the enhanced A&F arm (intervention arm). Specifically, for a unit increase in follow-up month, the change in the adjusted odds of oxygen saturation documentation, respiratory rate documentation, correct pneumonia diagnosis, oral amoxicillin prescription and correct dosage among patients admitted to intervention hospitals was significantly more positive than the change among patients admitted to control hospitals. These observations were made under complete case analysis and after multiple imputation. However, multiple imputation improved the precision of the estimated odds ratios compared to complete case analysis (Figure 2, Supplementary Tables A3-A4). The estimated variance among admitting clinicians (i.e., variance between random clinicians’ intercepts) varied across the nine pneumonia outcomes, both under complete case analysis and after multiple imputation of missing covariates (Supplementary Tables A3-A4).

Tables 3 and 4 present variance-correlation matrices of random clinicians’ intercepts among 9 pneumonia outcomes under complete case analysis and after multiple imputation, respectively. Generally, the magnitude of correlation estimated among outcomes was consistently larger under multiple imputation compared to complete case analysis. Moreover, the strength and direction of association among outcomes varied within and across domains of care. For instance, the strength of association between documentation of oxygen saturation and respiratory rate was somewhat high, compared to association with other indicators in the assessment domain. To be specific, correlation between oxygen saturation and respiratory rate documentation increased from 0.69 (Table 3) under complete case analysis to 0.89 (Table 4) after multiple imputation of missing covariates. In the treatment domain, prescription of oral amoxicillin and correct dosage exhibited a strong positive association with a correlation coefficient of 0.73 under complete case analysis (Table 3) and 0.80 after multiple imputation of missing covariates (Table 4).

Table 3. Variance-correlation matrix for random clinicians' intercepts under complete case analysis.

	Cough	Difficult breathing	Respiratory rate	Oxygen saturation	AVPU^a	Indrawing	Correct diagnosis	Correct treatment	Correct dose
Cough	1.49
Difficult breathing	0.07	1.92
Respiratory rate	−0.29	−0.43	2.71
Oxygen saturation	−0.17	−0.47	0.63	7.38
AVPU	−0.14	−0.19	−0.20	0.09	2.26
Indrawing	−0.22	−0.11	−0.54	−0.39	−0.19	2.33
Correct diagnosis	−0.49	−0.53	0.48	0.29	−0.06	0.04	2.64
Correct treatment	−0.48	−0.42	0.07	0.16	−0.38	0.66	0.64	1.81
Correct dose	−0.54	−0.64	0.57	0.69	−0.21	0.19	0.62	0.73	1.30

Note: AVPU^a: Alert, verbal response, pain response, unresponsive.

Table 4. Variance-correlation matrix for random clinicians' intercepts after multiple imputation.

	Cough	Difficult breathing	Respiratory rate	Oxygen saturation	AVPU^a	Indrawing	Correct diagnosis	Correct treatment	Correct dose
Cough	1.05
Difficult breathing	0.17	0.71
Respiratory rate	−0.29	−0.60	2.47
Oxygen saturation	−0.30	−0.78	0.89	2.23
AVPU	−0.12	−0.24	−0.12	0.22	1.76
Indrawing	−0.30	0.06	−0.52	−0.50	−0.26	1.82
Correct diagnosis	−0.54	−0.65	0.40	0.24	−0.07	0.35	2.14
Correct treatment	−0.45	−0.55	0.23	0.26	−0.22	0.52	0.77	0.56
Correct dose	−0.47	−0.76	0.63	0.64	−0.18	0.15	0.74	0.80	0.67

Note: AVPU^a, Alert, verbal response, pain response, unresponsive.

Across domains of care, correct pneumonia diagnosis was strongly associated with prescription of oral amoxicillin and correct dosage, both in the treatment domain. We also observed that documentation of oxygen saturation, respiratory rate, and lower chest wall indrawing in the assessment domain was positively associated with correct pneumonia diagnosis, amoxicillin prescription and correctness of the dose. These observations were made under complete case analysis (Table 3) and after multiple imputation (Table 4). On the other hand, documentation of cough and difficult breathing (primary pneumonia signs and symptoms) and AVPU in the assessment domain was negatively associated with documentation of other pneumonia care indicators.

Under complete case analysis, a principal component analysis (PCA) on the correlation matrix of the random intercepts showed that the first and second principal components explained 57.6% and 24.6% of the variation respectively (Figure 3, panel a). After multiple imputation, the first and second principal components explained 60.3% and 26.2% of the variation respectively (Figure 3, panel b). Vectors of two positively correlated outcomes in the loading plots were close, forming a small angle between them (e.g., oxygen saturation and respiratory rate). On the other hand, vectors of negatively correlated outcomes (e.g., cough and treatment) were diverging forming a large angle between them. The direction of vectors for all the outcomes was consistent under complete case analysis and after multiple imputation.

Figure 3. Component loadings for the first and second principal components from a principal component analysis on the correlation matrix of the random intercepts under complete case analysis (panel a) and after multiple imputation (panel b).

7. Discussion

In this study, we sought to estimate the joint and separate effects of covariates on nine pediatric pneumonia outcomes from a routine data set collected during a cluster randomized trial conducted in Kenyan hospitals. We also estimated the strength of association among the outcomes using a correlated random-effects joint model.^8,14 Missing covariate data at two levels of the hierarchy were handled using multiple imputation.

During the trial period, documentation and adherence to recommended pediatric pneumonia guidelines by clinicians depended on the individual quality of care indicator. For instance, indicators that did not require much cognitive effort (e.g., cough, difficult breathing) were documented more often than indicators that required more cognitive effort on the part of the clinician (e.g., prescribing the right treatment in the right dosage). These variations in delivery of quality care could also be due to hospital-level factors that impede delivery of recommended care, such as missing or broken medical devices (e.g., a pulse oximeter to measure oxygen saturation).

From the Wald-type tests, we observed significant joint effects of all covariates of interest except patient’s gender, and these observations were consistent between complete case analysis and analysis after multiple imputation of missing patient- and clinician-level covariates. Under the pairwise joint model, documentation and adherence to recommended clinical guidelines improved over time in six out of nine pneumonia care indicators among children admitted to the six hospitals in the intervention arm. In separate univariate analysis, the corresponding improvement was observed in five out of nine indicators.

In both analysis approaches (i.e., pairwise joint modeling and separate univariate analysis), multiple imputation led to more precise estimates compared to those from complete case analysis. These observations were attributed to loss of information under complete case analysis resulting in larger standard errors hence wider 95% confidence intervals.

Further results revealed that the strength and direction of association among pneumonia outcomes varied within and across domains of care. Thus, an assumption of common random-effects among all outcomes would be too restrictive and unrealistic for pneumonia trial data analyzed in this study.

In the pairwise modeling approach, estimates obtained by averaging over several auxiliary estimates (from the various pairs) do not maximize the full multivariate likelihood. However, Fieuws and Verbeke³⁹ demonstrated with simulations that the loss of efficiency of the pairwise approach is small relative to a full maximum-likelihood approach. Moreover, the averaged estimates are consistent and asymptotically normal,⁸ a property that also holds within each imputed data set, thus ensuring valid within-imputation estimates. Validity of within-imputation estimates is a prerequisite for the application of Rubin’s rules, which then account for between-imputation variability.³⁸

Although we did not evaluate computational complexity explicitly, combining pairwise joint model fitting and multiple imputation is computationally expensive, as demonstrated in this study. At the imputation stage, complexity is compounded when missing data occur at more than one level of clustering. In such cases, the imputation model must account for the hierarchical structure present in the analysis model of interest. This is because incompatibility between the imputation and analysis models may lead to biased estimates, underestimated cluster-level variances and overestimated individual-level variances.^23,33 In the current study, missing covariates were imputed using the latent normal approach within the joint model imputation framework while accounting for clustering at clinician level. Additionally, the outcomes of interest, all fully observed, were included in the imputation model as auxiliary variables. Nonetheless, further research is needed, possibly through a simulation study, to evaluate compatibility (or the lack thereof) between the imputation and substantive models in the high-dimensional joint modeling context.

At the analysis stage, complexity stems from calculating the parameters of interest (e.g., obtaining variance–covariance matrices for each imputed data set using the pseudo-likelihood approach before applying Rubin’s rules). In addition, constructing the overall variance–covariance matrix for the random effects is not straightforward, so care is needed to avoid incorrect inferences due to miscalculations. Future studies could therefore consider developing generic functions and packages for standard statistical software that handle missing data and other computational aspects (e.g., Wald-type tests for joint covariate effects after multiple imputation) more efficiently when the substantive model of interest entails joint modeling of clustered and high-dimensional vectors of outcomes.

The correlated random-effects joint model fitted using the pairwise approach has previously been used to jointly analyze clustered binary data¹⁰ as well as continuous longitudinal outcomes.¹⁴ However, there is essentially no example in the literature of how to account for missing covariates in a high-dimensional joint modeling context. Additionally, we extended and exemplified Wald-type tests for joint covariate effects after multiple imputation in this context. To our knowledge, there are no examples in the literature demonstrating the application of such tests in high-dimensional joint modeling after multiple imputation, hence the novelty of this study.

Besides estimating the joint effects of covariates after multiple imputation, we estimated the strength of association among quality-of-care outcomes, an aspect that is largely ignored in routine pediatric care studies. In a previous analysis of the trial data, for instance, diagnosis and classification of pneumonia cases was the primary outcome of interest.² In another study, pneumonia quality of care indicators were combined into a single ordinal composite outcome known as the pediatric admission quality of care (PAQC) score.⁵ Therefore, when joint inference is needed, we recommend this study as a practical example of handling a high-dimensional vector of outcomes using a pairwise fitting approach while performing multiple imputation to account for missing covariates. However, if the research question does not necessitate joint inference, univariate mixed models suffice as tools for analysis.¹⁴

Evidently, this study has several limitations. Firstly, we imputed missing covariates assuming a missing at random (MAR) mechanism, an assumption that cannot be verified using the observed data alone.^8,23,40 Therefore, sensitivity analysis is recommended to explore the robustness of the inferences to the MAR assumptions.

As already noted, fitting pairwise joint models on multiply imputed data sets was time intensive. Future studies may consider multiple outputation, a previously proposed approach,¹⁸ as an alternative to pairwise joint modeling with a sandwich-type robust variance estimator.

In conclusion, there were significant joint effects of covariates on nine pneumonia outcomes before and after multiple imputation of missing covariates. In the pairwise joint modeling and separate univariate analysis approaches, enhanced audit and feedback improved documentation and adherence to recommended clinical guidelines over time in six and five out of nine pneumonia care outcomes of interest, respectively. Irrespective of the analysis approach, multiple imputation of missing covariates improved the precision of parameter estimates compared to complete case analysis. The strength and direction of association among pneumonia outcomes, estimated using clinicians’ random intercepts from the pairwise joint model, varied within and across the three domains of pneumonia care. Across domains of care, pneumonia diagnosis was strongly correlated with oral amoxicillin prescription and dosage.

Supplementary Material

Appendix S1: Supplement Appendix Tables


Acknowledgments

We would like to thank the Ministry of Health who gave permission for this work to be developed and have supported the implementation of the CIN together with the county health executives and all hospital management teams. We are grateful to the Kenya Pediatric Association for promoting the aims of the CIN and the support they provide through their officers and membership. We also thank the hospital teams involved in service delivery for the sick child. This work is published with the permission of the Director of KEMRI.

The Clinical Information Network team who contributed to the design of the data collection tools, conduct of the work, collection of data and data quality assurance that form the basis of this report and who saw and approved the report’s findings include: Grace Irimu, Samuel Akech, Ambrose Agweyu, Michuki Maina, Jacquie Oliwa, David Gathara, Lucas Malla, Morris Ogero, Basil Okola, George Mbevi, Mercy Chepkirui (KEMRI-Wellcome Trust Research Programme); Samuel N’garng’ar (Vihiga County Hospital), Ivan Muroki (Kakamega County Hospital), David Kimutai & Loice Mutai (Mbagathi County Hospital), Caren Emadau & Cecilia Mutiso (Mama Lucy Kibaki Hospital), Charles Nzioki (Machakos Level 5 Hospital), Francis Kanyingi & Agnes Mithamo (Nyeri County Hospital), Margaret Kuria (Kisumu East County Hospital), Samuel Otido (Embu County Hospital), Grace Wachira & Alice Kariuki (Karatina County Hospital), Peris Njiiri (Kerugoya County Hospital), Rachel Inginia & Melab Musabi (Kitale County Hospital), Hilda Odeny (Busia County Hospital), Grace Ochieng & Lydia Thuranira (Kiambu County Hospital); Priscilla Oweso (Vihiga County Hospital), Ernest Namayi (Mbale Rural Health and Demonstration Centre), Benard Wambani, Samuel Soita (Kakamega Provincial General Hospital), Joseph Nganga (Mbagathi District Hospital), Margaret Waweru, John Karanja (Kiambu County Hospital), Susan Owano (Mama Lucy Kibaki Hospital), Esther Muthiani (Machakos Level 5 Hospital), Alfred Wanjau (Nyeri Level 5 hospital), Larry Mwallo (Kisumu East District Hospital), Lydia Wanjiru (Embu Provincial General Hospital), Consolata Kinyua (Karatina District Hospital), Mary Nguri (Kerugoya District Hospital) and Dorothy Munjalu (Kitale District Hospital).

Funding information

This work was supported through the DELTAS Africa Initiative Grant No. 107754/Z/15/Z-DELTAS Africa SSACAB. The DELTAS Africa Initiative is an independent funding scheme of the African Academy of Sciences (AAS)’s Alliance for Accelerating Excellence in Science in Africa (AESA) and supported by the New Partnership for Africa’s Development Planning and Coordinating Agency (NEPAD Agency) with funding from the Wellcome Trust (Grant No. 107754/Z/15/Z) and the UK government. The views expressed in this publication are those of the author(s) and not necessarily those of AAS, NEPAD Agency, Wellcome Trust or the UK government. Funds from the Wellcome Trust (Grant No. 207522) awarded to Prof. Mike English as a senior Fellow together with additional funds from a Wellcome Trust core grant awarded to the KEMRI-Wellcome Trust Research Programme (Grant No. 092654) supported CIN data collection.

Footnotes

Conflict of Interest

The authors have declared that no competing interests exist.

Data Availability Statement

The dataset analysed in this study is not publicly available because it is the property of the Ministry of Health, and we do not have the authority to share it on their behalf.

References

1. Gachau S, Ayieko P, Gathara D, et al. Does audit and feedback improve the adoption of recommended practices? Evidence from a longitudinal observational study of an emerging clinical network in Kenya. BMJ Glob Health. 2017;2(4):e000468. doi: 10.1136/bmjgh-2017-000468.
2. Ayieko P, Irimu G, Ogero M, et al. Effect of enhancing audit and feedback on uptake of childhood pneumonia treatment policy in hospitals that are part of a clinical network: a cluster randomized trial. Implementation Sci. 2019;14(1):20. doi: 10.1186/s13012-019-0868-4.
3. Ogero M, Ayieko P, Boniface Makone TJ, et al. An observational study of monitoring of vital signs in children admitted to Kenyan hospitals: an insight into the quality of nursing care? J Glob Health. 2018;8(1):010409. doi: 10.7189/jogh.08.010409.
4. Opondo C, Allen E, Todd J, English M. The Paediatric admission quality of care (PAQC) score: designing a tool to measure the quality of early inpatient paediatric care in a low-income setting. Trop Med Int Health. 2016;21(10):1334–1345. doi: 10.1111/tmi.12752.
5. Gachau S, Owuor N, Njagi EN, Ayieko P, English M. Analysis of hierarchical routine data with covariate missingness: effects of Audit & Feedback on clinicians’ prescribed paediatric pneumonia care in Kenyan hospitals. Front Public Health. 2019;7:198. doi: 10.3389/fpubh.2019.00198.
6. Ogero M, Akech S, Malla L, Agweyu A, Irimu G, English M. Examining which clinicians provide admission hospital care in a high mortality setting and their adherence to guidelines: an observational study in 13 hospitals. Arch Dis Child. 2020;105:648–654. doi: 10.1136/archdischild-2019-317256.
7. Profit J, Kowalkowski MA, Zupancic JA, et al. Baby-MONITOR: a composite indicator of NICU quality. Pediatrics. 2014;134(1):74–82. doi: 10.1542/peds.2013-3552.
8. Molenberghs G, Verbeke G. Models for Discrete Longitudinal Data. Springer; 2005.
9. Fieuws S, Verbeke G, Molenberghs G. Random-effects models for multivariate repeated measures. Stat Methods Med Res. 2007;16(5):387–397. doi: 10.1177/0962280206075305.
10. Fieuws S, Verbeke G, Boen F, Delecluse C. High dimensional multivariate mixed models for binary questionnaire data. J R Stat Soc Ser C Appl Stat. 2006;55(4):449–460.
11. Verbeke G, Fieuws S, Molenberghs G, Davidian M. The analysis of multivariate longitudinal data: a review. Stat Methods Med Res. 2014;23(1):42–59. doi: 10.1177/0962280212445834.
12. Fitzmaurice G, Davidian M, Verbeke G, Molenberghs G. Longitudinal data analysis. Chapman & Hall / CRC Press; 2008.
13. Fieuws S, Verbeke G. Joint modelling of multivariate longitudinal profiles: pitfalls of the random-effects approach. Stat Med. 2004;23(20):3093–3104. doi: 10.1002/sim.1885.
14. Fieuws S, Verbeke G. Pairwise fitting of mixed models for the joint modeling of multivariate longitudinal profiles. Biometrics. 2006;62(2):424–431. doi: 10.1111/j.1541-0420.2006.00507.x.
15. Gueorguieva R. A multivariate generalized linear mixed model for joint modelling of clustered outcomes in the exponential family. Stat Model. 2001;1(3):177–193.
16. McCulloch C. Joint modelling of mixed outcome types using latent variables. Stat Methods Med Res. 2008;17(1):53–73. doi: 10.1177/0962280207081240.
17. Faes C, Aerts M, Molenberghs G, Geys H, Teuns G, Bijnens L. A high-dimensional joint model for longitudinal outcomes of different nature. Stat Med. 2008;27(22):4408–4427. doi: 10.1002/sim.3314.
18. Nassiri V, Ivanova A, Molenberghs G, Verbeke G. Fast precision estimation in high-dimensional multivariate joint models. Biom J. 2017;59(6):1221–1231. doi: 10.1002/bimj.201600241.
19. Catalano PJ. Bivariate modelling of clustered continuous and ordered categorical outcomes. Stat Med. 1997;16(8):883–900. doi: 10.1002/(sici)1097-0258(19970430)16:8<883::aid-sim542>3.0.co;2-e.
20. Jaffa MA, Gebregziabher M, Jaffa AA. A joint modeling approach for right censored high dimensional multivariate longitudinal data. J Biometric Biostat. 2014;5(4):1000203. doi: 10.4172/2155-6180.1000203.
21. Hickey GL, Philipson P, Jorgensen A, Kolamunnage-Dona R. Joint modelling of time-to-event and multivariate longitudinal outcomes: recent developments and issues. BMC Med Res Methodol. 2016;16(1):117. doi: 10.1186/s12874-016-0212-5.
22. Long JD, Mills JA. Joint modeling of multivariate longitudinal data and survival data in several observational studies of Huntington’s disease. BMC Med Res Methodol. 2018;18(1):1–15. doi: 10.1186/s12874-018-0592-9.
23. Carpenter JR, Kenward MG. Multiple Imputation and its Applications. John Wiley & Sons; 2013.
24. Kundu MG. Implementation of pairwise fitting technique for analyzing multivariate longitudinal data in SAS. 2011:1–12.
25. Ayieko P, Irimu G, English M. Effect of enhanced feedback to hospitals that are part of an emerging clinical information network on uptake of revised childhood pneumonia treatment policy: study protocol for a cluster randomized trial. Trials. 2017;18(1):416. doi: 10.1186/s13063-017-2152-8.
26. World Health Organization. Pocket Book of Hospital Care for Children: Guidelines for the Management of Common Childhood Illnesses. World Health Organization; 2013.
27. Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap)—a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. 2009;42(2):377–381. doi: 10.1016/j.jbi.2008.08.010.
28. Ayieko P, Ogero M, Makone B, et al. Characteristics of admissions and variations in the use of basic investigations, treatments and outcomes in Kenyan hospitals within a new clinical information network. Arch Dis Child. 2015;101:223–229. doi: 10.1136/archdischild-2015-309269.
29. Miseda MH, Were SO, Murianki CA, Mutuku MP, Mutwiwa SN. The implication of the shortage of health workforce specialist on universal health coverage in Kenya. Hum Resour Health. 2017;15(1):1–7. doi: 10.1186/s12960-017-0253-9.
30. English M, Strachan B, Esamai F, et al. The paediatrician workforce and its role in addressing neonatal, child and adolescent healthcare in Kenya. Arch Dis Child. 2020;105(10):927–931. doi: 10.1136/archdischild-2019-318434.
31. Arsenault C, English M, Gathara D, Malata A, Mandala W, Kruk ME. Variation in competent and respectful delivery care in Kenya and Malawi: a retrospective analysis of national facility surveys. Trop Med Int Health. 2020;25(4):442–453. doi: 10.1111/tmi.13361.
32. Wang H, Liddell CA, Coates MM, et al. Global, regional, and national levels of neonatal, infant, and under-5 mortality during 1990–2013: a systematic analysis for the global burden of disease study 2013. The Lancet. 2014;384(9947):957–979. doi: 10.1016/S0140-6736(14)60497-9.
33. Grund S, Lüdtke O, Robitzsch A. Multiple imputation of missing data for multilevel models: simulations and recommendations. Org Res Method. 2018;21(1):111–149. doi: 10.1177/1094428117703686.
34. Quartagno M, Grund S, Carpenter J. Jomo: a flexible package for two-level joint modelling multiple imputation. R J. 2019;9(1):205–228.
35. Grund S, Robitzsch A, Luedtke O, Grund MS. Package ‘mitml’. 2019. Retrieved from https://cran.rproject.org/web/packages/mitml.
36. Gelman A, Rubin DB. Inference from iterative simulation using multiple sequences. Stat Sci. 1992;7(4):457–472. http://digitalassets.lib.berkeley.edu/sdtr/ucb/text/307.pdf
37. Rizopoulos D. The R package JMbayes for fitting joint models for longitudinal and time-to-event data using MCMC. J Stat Software. 2016;72(7):1–46. doi: 10.18637/jss.v072.i07.
38. Rubin DB. Inference and missing data. Biometrika. 1976;63(3):581–592.
39. Fieuws S, Verbeke G. Evaluation of the Pairwise Approach for Fitting Joint Linear Mixed Models: A Simulation Study. Technical Report TR0527. Biostatistical Centre, Katholieke Universiteit Leuven; Belgium: 2005.
40. Verbeke G, Molenberghs G. Arbitrariness of models for augmented and coarse data, with emphasis on incomplete data and random effects models. Stat Model. 2010;10(4):391–419.

Supplementary Materials

Appendix S1: Supplement Appendix Tables

EMS152550-supplement-Appendix_S1__Suplement_Appendix_Tables.docx (41.1 KB, docx)
