Skip to main content
. Author manuscript; available in PMC: 2021 Sep 3.
Published in final edited form as: J Expo Sci Environ Epidemiol. 2021 Mar 3;31(4):683–698. doi: 10.1038/s41370-021-00309-5

Table 1.

Approaches to estimating the effect of intervention on exposure1

# Approach Data Objective Equation and Terms
1 Across Study Arms All Post-Intervention estimates the difference in study arm mean 48-hour exposure as compared to mean in Control study arm (β1) log(yij)= β0ij+ β1StudyArmj+εij
yij = mean 48-hour air pollution exposure (either CO or PM2.5) for participant i in study arm j,
StudyArmj = participant’s assigned study arm in two dummy variables for improved biomass and LPG study arms (ref: Control),
εij= error term for participant i in study arm j.
2 Before and After All Data, study arm by study arm estimates the difference in mean 48-hour exposure as compared to the baseline period (β1) log(yik)= β0ik+ β1PostInterventionk+εik
yik = mean 48-hour CO exposure for participant i in intervention period k,
PostInterventionk = dummy variable for exposure estimate being before or after the intervention (ref: Pre-Intervention), and
εik= error term for participant i in intervention period k.
3 Difference-in-Differences (primary specification) All Data estimates the difference in mean 48-hour exposure observed in the post-intervention period from the baseline period as compared to the same difference occurring in the Control arm (β3) log(yijk)= β0ijk+ β1StudyArmj+β2PostInterventionk+β3(StudyArmj×PostInterventionk)+εijk
yijk = mean 48-hour CO exposure for participant i in study arm j in intervention period k,
StudyArmj = participants assigned cluster in two dummy variables for improved biomass and LPG study arms (ref: Control),
PostInterventionk = dummy variable for exposure estimate being before or after the intervention (ref: Pre-Intervention),
StudyArmj × PostInterventionk = dummy variables for interaction terms between StudyArmj and PostInterventionk dummy variables (ref: Control and Pre-Intervention), and
εijk= error term for participant i in study arm j and intervention period k.
4 Session-specific Difference-in-Differences All Data estimates change in sessions compared to change from baseline in Control study arm (β3) log(yijl)= β0ijl+ β1StudyArmj+β2MonitoringSessionl+β3(StudyArmj×MonitoringSessionl)+εijl
yijl = mean 48-hour CO exposure for participant i in study arm j during monitoring session l,
StudyArmj = participants assigned cluster in two dummy variables for improved biomass and LPG study arms (ref: Control),
MonitoringSessionl = dummy variable for monitoring session of exposure estimate, e.g., Session 1, Session 2, Session 3 (ref: Session 1),
StudyArmj × MonitoringSessionl = dummy variables for interaction terms between StudyArmj and each MonitoringSessionl dummy variable (ref: Control and Session 1), and
εijl= error term for participant i in study arm j and monitoring session l.
1

This table describes the approach we took to estimate the effect of clean cooking interventions on personal air pollution exposure and is intended to illustrate the dependent and independent variable specifications, highlighting the coefficients of interest. However, all analyses are conducted utilizing generalized estimating equations, a non-parametric estimation framework that estimates population averaged effects. Standard errors in the GEEs account for multiple observations per participant and the village-clustered nature of the intervention deployment.