Abstract
Previous research in the US has found negative health effects of contamination when it triggers regulatory violations. An important question is whether levels of contamination that do not trigger a health-based violation impact health. We study the impact of drinking water contamination in community water systems on birth outcomes using drinking water sampling results data in Pennsylvania. We focus on the effects of water contamination for births not exposed to regulatory violations. Our most rigorous specification employs mother fixed effects and finds changing from the 10th to the 90th percentile of water contamination (among births not exposed to regulatory violations) increases low birth weight by 12% and preterm birth by 17%.
Keywords: drinking water contamination, community water systems, birth outcomes, low birth weight, I18, Q53, Q52
INTRODUCTION
Most residential settings in the U.S., including homes, apartments, and mobile park homes, are served by community water systems (CWS), representing 94% of the nation’s population (United States Environmental Protection Agency [EPA], 2019a). The Safe Drinking Water Act (SDWA) was enacted in 1974 to protect communities from unsafe drinking water in CWS. The act currently regulates over 90 contaminants (e.g., lead, arsenic, disinfection byproducts). Regulatory drinking water violationsi under SDWA occur when a maximum contaminant level (MCL), action level, or maximum residual disinfectant level (MRDL) is exceeded (EPA, 2018).ii Despite successfully reducing drinking water contamination in CWS, up to 45 million people were affected by health-based violations from 1982 to 2015 (Allaire et al., 2018). In 2017, 22 million people, representing 7% of the U.S. population, relied on a CWS with a health-based drinking water violation (EPA, 2019a). Recent lead-related drinking water crises in Newark, New Jersey (Dave & Yang, 2022) and Flint, Michigan (D. S. Grossman & Slusky, 2019) have highlighted the importance of these regulations to protect public health.
The vast majority of the literature studying SDWA has focused on non-compliant drinking water. SDWA violations disproportionately impact minority and low-income communities (Allaire et al., 2018; McDonald & Jones, 2018); egregious cases of water contamination, such as the case of Flint, have major community and health impacts (Christensen et al., 2023; Danagoulian et al., 2022; Danagoulian & Jenkins, 2021; D. Grossman & Slusky, 2019), and health-based violations adversely impact infant health (Currie et al., 2013). Public reporting of these health-based violations is an important mechanism to encourage compliance within the law (Baker et al., 2022; Bennear et al., 2009; Bennear & Olmstead, 2008; Grooms, 2016). Prior research has shown that this public reporting has resulted in fewer violations over time (Baker et al., 2022; Bennear & Olmstead, 2008), however, other work using sampling concentrations has suggested that systems may not return to compliance following health-based violations (Grooms, 2016). Consumers appear to respond to public reporting of violations with avoidance behavior, such as bottled water purchases (Allaire et al., 2019; Marcus, 2022; Zivin et al., 2011). In this paper, we provide new evidence of the health impacts of drinking water quality that is not subject to a health-based violation, which we call compliant drinking water contamination.iii
A large and growing literature focused on the health impacts of air pollution that is compliant with the Clean Air Act has shown that even low levels of air quality can still have meaningful impacts on health, labor market outcomes, and cognition (Currie & Walker, 2019). Much like air quality, where particulate matter is actually a grouping of many air pollutants and not isolating particular pollutant constituents, we propose to measure this compliant contamination using a water quality index. Using an index addresses the multiple comparison issues we would face if we separately estimated the effects of each contaminant, or the multicollinearity issues we would face if we attempted to include the over 90 contaminants that are regulated under SDWA in the same regression.iv Furthermore, while current toxicological and epidemiological studies of the associations between water quality and health generally reflect a narrow focus on single contaminants (Murphy et al., 2012), researchers have noted that low concentrations of contaminants may not be associated with health outcomes when considered individually but may produce effects when examined as mixtures (Gennings et al., 2018). For these reasons, we create an overall water quality index using contaminant concentrations and regulatory standards, creating a single measure for testing our hypothesis. Following prior literature, we also construct an aggregate “chemical only” measure (Currie et al., 2013). Our aggregation strategy is discussed further in the section “Water Quality Index Construction Details.”
Scientific literature assessing the association between each of the more than 90 contaminants regulated by SDWA and birth outcomes is lacking beyond a subset of contaminants.v Still, our overall water quality index may not be specific to our outcomes of interest (e.g., low birth weight, preterm birth), and so we create a second index specific to five contaminants with the most associational evidence linking exposure to these contaminants with the outcomes studied in this paper.vi While a reproductive-specific index can be supportive of the theoretical link between water quality and birth outcomes, lack of empirical evidence for the other contaminants regulated by SDWA is not indicative of no biological effect. Our overall index can be used for hypothesis generation regarding other contaminants impacting birth outcomes. Furthermore, and we consider this most important, the overall measure can be validated on other populations (children, older adults) and health outcomes (cancer), and allow for a useful general index that can be used across systems and susceptible populations, analogous to the EPA’s Air Quality Index (AQI).
We employ the universe of address-specific birth records in Pennsylvania from 2003 to 2014 geocoded to CWS boundaries, and match births to water samples measured at the CWS level that underlie regulatory violations during gestation.vii Using these novel data on contamination concentrations summarized in an omnibus index, we find evidence of precise harmful effects of water contamination on birth outcomes after removing regulatory violations. Our empirical strategy relies on within-CWS variation (CWS fixed effects) such that our approach ignores correlations between water quality and infant health driven by cross-sectional differences. We also include birth year-month fixed effects to control for seasonality and state-wide time shocks. Using fixed-effects models that isolate only within-mother across-sibling variation, we find that changes in overall contamination from the 10th to the 90th percentile—conditional on no violation—leads to an increase in low-birth weight (LBW) of 12% and preterm birth (PTB) of 17%. Using a reproductive-specific index, we find slightly larger impacts of water quality on our birth outcomes—a 19% increase in LBW and 22% increase in PTB. And in sensitivity analyses, we remove births with samples above the regulatory threshold, which is more stringent than how regulatory violations are determined and continue to find a strong impact on birth outcomes.viii
Given the recent findings of fertility effects of contamination and the consequent issues of selection (e.g., for lead, see D. S. Grossman & Slusky, 2019, and Clay et al., 2021), we additionally fit CWS panel models of births per person served by the system on various measures of recent water contamination. Our estimates suggest small negative effects of contamination on fertility. We also analyze whether mothers move out of their current CWS in response to contamination. While maternal response would be of concern for our identification strategy, we find no significant effects on mobility, which is consistent with mothers not being aware of variation in contamination that does not trigger regulatory violations. Moreover, our results are robust to excluding mothers who move across pregnancies. Finally, we use the bounding methods of Altonji et al. (2005) and Oster (2016) to assess the importance of selection on unobservables, finding in most specifications that it would have to be greater than selection on observables to conclude that there are no damaging effects of water contamination.
Our work contributes to a well-established fetal origins literature in economics that links shocks, such as pollution, experienced in utero to poorer birth outcomes (Almond et al., 2018). Extensive research in economics establishes that poor birth outcomes lead to worse short- and long-term health and human capital outcomes (see, e.g., Almond et al., 2005; Black et al., 2007; Currie & Moretti, 2007; Oreopoulos et al., 2008; Royer, 2009). Air pollution has attracted particular attention, as findings of harmful effects appear robust, even at levels compliant with air advisory standards (Beatty & Shimshack, 2014; Currie et al., 2015; Currie & Walker, 2011; Knittel et al., 2016; Yang & Chou, 2018). Although federal and state regulation of public drinking water in the U.S. has recently come under scrutiny, in part because of the highly publicized Flint water crisis (Allaire et al., 2018; D. S. Grossman & Slusky, 2019), few papers have studied how the quality of drinking water in the U.S. affects health outcomes using quasi-experimental designs (Keiser & Shapiro, 2019b).ix The most closely related study to our work is Currie et al. (2013), who found that exposure to a water quality violation during gestation increases the chance of low birth weight and preterm birth, based on 1997 to 2007 data from NJ.x Importantly, their estimates suggested that non-compliant contamination has negative health consequences. Our analysis addresses health impacts of compliant contamination levels.
Consumers currently learn about the quality of their water either by reviewing the annual Consumer Confidence Reports (CCR), which contain the average concentrations for contaminants in their water system, or through notifications about extreme contamination events that constitute regulatory violations. We argue that, based on the reports as currently designed, consumers are unlikely to respond to concentrations that are compliant with current regulations. Given our findings of adverse birth outcomes from overall contamination concentrations removing regulatory violations, using our index to describe these risks for consumers could greatly improve the available information. Specifically, we propose using our overall water quality index to create a letter grade as a summative measure of water quality that could be included in CCRs to provide more actionable information to consumers. In the section “Discussion,” we evaluate this potential policy, finding that it could lead to economically meaningful improvements in public health as measured by birth outcomes. We find that a policy improving water quality for births served by systems with letter grades B to F to the average quality of systems with letter grade A would reduce the number of LBW infants by 3,960 and reduce the number of PTB infants by 6,930 over 11 years. Such a policy could save around $280 million in medical costs for these avoided adverse birth outcomes. This analysis could serve as a foundation for future refinements to providing a summary measure of water quality for CCRs.
DATA AND SUMMARY STATISTICS
To conduct our analysis, we assembled data on birth outcomes from confidential birth records, temperature, weather, and pollution data derived from weather stations and Toxics Release Inventory (TRI) locations and emissions, and water quality and service areas from community water systems (CWS). We describe each of these in turn. We join these three data sources together using spatial and temporal attributes.
Birth records and control variables
Data on birth certificates spanning the years 2003 to 2014 were provided by the Pennsylvania Department of Health. These data contain detailed information on birth outcomes, clinical characteristics, and demographic characteristics. We describe in detail our control variables obtained from these birth records in the section “Empirical Methods.” Our two main outcomes of interest are binary indicators for low birth weightxi and preterm birth.xii In addition, we study two birth outcome measures often used by epidemiologists that better capture growth rates during pregnancy (see e.g., Govarts et al., 2018). These measures are small for gestational age (SGA) and term birth weight (TBW; in grams).xiii
We obtained daily weather statistics from the website of Schlenker (Schlenker & Roberts, 2009). These data provide estimates of daily precipitation and maximum and minimum temperatures for a grid of points across the United States. We match mothers to their closest grid point using exact location of birth and calculate a set of temperature and weather controls over the mom’s pregnancy. We use these as control variables in our regression specifications described in the section “Empirical Methods.” Note that weather and temperature have been found to be important in previous studies relating health to drinking water contamination (Currie et al., 2013), as well as to air pollution (Beatty & Shimshack, 2014; Knittel et al., 2016). Finally, we also obtained Toxics Release Inventory (TRI) controls (locations and emissions) from the U.S. Environmental Protection Agency (2013). Using maternal address, we calculate the number of TRI facilities operating within 1, 3, and 5 kilometers in the year the infant was born, and the reported total onsite, offsite, and overall releases for TRI facilities weighted by the squared inverse distance between the facility and the mother’s residential location.xiv
Community water system data
Our data on CWS comes from two sources: the U.S. EPA and the Pennsylvania Department of Environmental Protection (PADEP). We obtained Safe Drinking Water Information System (SDWIS) data from the EPA through a Freedom of Information Act (FOIA) request, which contains information on CWS drinking water violations as well as system characteristics such as recently reported population served. These data include violation reports 2000 to 2014. We observe intervals of time (“compliance periods”) for these violations that are primarily reported quarterly; we define a birth to have been exposed to a violation if its gestation period overlaps with the compliance period of a violation for at least one day.xv Further discussion of these data is provided in the Online Appendix Section “Data Construction Details.” For our fertility analysis, we make use of the system population served measure from SDWIS, both to normalize the dependent variable across systems (we divide births per month by system population served to calculate a monthly fertility rate), and as weights for the regression model.
Most importantly for our study, we obtained public drinking water sampling information for CWS from PADEP.xvi These data contain the sample results underlying Pennsylvania’s SDWA violations recorded in the SDWIS data. This drinking water sampling data contains the date of the sample, the contaminant tested, and the result of the test. These data are recorded at a much higher frequency than water quality violations, and they provide information on water contamination at levels that do not trigger a violation.xvii
These data were augmented with a shapefile obtained from the Pennsylvania Spatial Data Access (PASDA) website containing the service area boundaries of CWS.xviii Figure 1 is a map of the CWS service area boundaries for the systems in our data. As is apparent, CWS vary substantially in size. There are 1,768 CWS in the boundary file; the average system reports serving about 5,700 people, but the smallest systems (10th percentile) serve only around 60 each, while the top 10 largest systems serve 41% of people served in this data (the Philadelphia Water Department alone serves 16%, reporting about 1.6 million people served).
Figure 1:

Map of Pennsylvania and 1,768 CWS service areas.
Most water quality samples are measured at entry points to the distribution system, or somewhere within the distribution system (e.g., at the tap for lead and copper samples).xix In the simplest case, a treatment facility pipes water from a single source through its operations, then out into a distribution system. In more complicated cases, a distribution system can be served by several source locations.xx Analysis requires us to aggregate up to the level of a complete CWS, as that is the level at which we can match the water sample results to birth records using the water system boundary shapefile and geocoded mother addresses.xxi
Water quality index construction details
In this subsection, we describe how we construct our measure of drinking water quality. For each CWS, contaminant, and day, we use the samples of the contaminant on that day to calculate the average contamination in the CWS’s distributed water. This measure of average daily contamination is in the units in which the contaminant is measured, typically parts-per-billion (ppb) or parts-per-million (ppm). Then, for each birth and contaminant, we average this daily measure of contamination for the CWS where the mother’s residence was at the time of birth. The average is taken across all samples within the birth’s gestation period.xxii
The raw Pennsylvania sampling data contains the 94 contaminants regulated under the SDWA.xxiii We also obtained the health based regulatory violations from EPA by FOIA request which includes the maximum contaminant level (MCL), action level or maximum residual disinfectant level (MRDL) for each contaminant and whether the system received a violation during each compliance period. Instead of studying each contaminant separately, in our main analysis we aggregate by normalizing and averaging. Our goal is to construct a simple composite measure of water contamination, in order to avoid the statistical pitfalls of very noisy data combined with multiple hypothesis testing. The measure we choose to employ for this purpose is the average “result relative to MCL” (RRMCL).
| #(1) |
This approach accommodates the fact that what is atypically large for one contaminant may not be for another contaminant. For example, the MCL for nitrate and nitrite are 10 ppm and 1ppm, respectively. In order to create a single composite measure of contamination, we divide nitrate samples by 10, divide nitrite samples by 1, and divide all other contaminants by their respective MCL.
We then average over all contaminants sampled during pregnancy for each infant to have a measure of “overall” contamination based upon contaminants regulated by the SDWA, weighting contaminants (not samples) equally. Given the large number of chlorine samples (over 4.8 million samples; the next most common in our data is lead at 93,000 samples), we take a 10% random subsample of the chlorine samples before we build the index, to ensure computational feasibility.
Sample nondetects, i.e., samples with results that cannot be distinguish from zero, are recorded as zero in the data and included in our index calculations. This will introduce measurement error into our index measure. In general, when the contamination in the water sample is small, methods that estimate the level of contamination can have substantial relative error. We take the position expressed in the EPA internal document (EPA, 2006) that measured samples below the MCL still contain (albeit noisy) information. We further discuss this issue in Online Appendix Section “Non-Detection and Limits of Measurement.”
We make two notes about our “average RRMCL” construction process. First, it maximizes sample size, but the measure is not composed of the same contaminants across births. For example, one birth may only be matched to total coliform samples, while another birth may include nitrate and arsenic samples. Each birth will have a measure of average RRMCL, but the contaminant used in the first birth is just total coliform, while the RRMCL in the second birth is a simple average of the RRMCL for nitrate and RRMCL for arsenic. To be clear, this is just a stylized example for illustration: 2.47% of births in our data have only one or two unique contaminants measured. Indeed, across births, an average of 39.31 unique contaminants are used to calculate our overall contamination index. Second, and as this example makes clear, the step in this process that likely makes the strongest assumptions is weighting the contaminants available for each birth equally. Moreover, while maximum contaminant level goals (MCLG) may be a more natural denominator given the focus on protecting health, MCLG are often zero, making them not as useful for aggregation. In addition to our main overall measure of contamination, we repeat this exercise, but removing coliform, heterotrophic bacteria, and turbidity. This additional measure we call a measure of “chemical” (i.e., non-bacterial) contamination. Splitting between overall and chemical contamination is consistent with previous literature (Currie et al., 2013). On average, 38.50 unique contaminants are used to calculate this chemical-only index; 3.05% of these births have only one or two contaminants in the index. Finally, we also construct a “reproductive index” measure, using five contaminants that have been found linked to reproductive health outcomes; further discussion of its construction is postponed to Section “Reproductive Health Water Quality Index.”
There is one important contaminant that we treat as an exception to Equation 1: total coliform. Specifically, we define the Result Relative to MCL (RRMCL) for total coliform, for each birth, as
| #(2) |
In this expression, the ratio (Number of Tests) / ((273/30)) gives the approximate number of tests per month (per 30 days) over the gestation period, assuming full gestation length (273 days = 39 weeks). We use this expression for the RRMCL for this contaminant, since the effective MCL for coliform is 2 tests when the number of tests per month is less than 40, and 5% of tests when the number of tests per month is greater than 40. This implies that the MCL is the maximum in the denominator of the above expression. Indeed, this Equation can be interpreted as the generalization of Equation 1 for total coliform, given that the MCL for total coliform varies based on the number of tests conducted by the system (the number of tests required to be conducted depends on system size; see Bennear et al., 2009 for a detailed analysis of the Total Coliform Rule).
Taken together, our approach to defining a water quality index is the “sampling data analogue” of the drinking water violations measure employed in Currie et al. (2013). To see this, consider that we are combining across contaminants with very different properties and health effects by normalizing by their regulatory thresholds. This is very similar to Currie et al. (2013), who also grouped all contaminants into a single summary measure, weighting equally, but instead defined treatment as exposure to a regulatory violation during gestation. The advantage of our approach is that it allows us to use variation in contamination that does not trigger a health-based violation, which is important for providing policy-relevant evidence.
Throughout our analysis, we refer to and use what we call our “analytic sample,” which restricts births as follows. First, births in our analytic sample are not exposed to any MCL violations; second, they have some measure of contamination within their gestation period; third, they are not plural births; and fourth, their measured contamination does not exceed the 99th percentile across all births in the data for the specific contamination measure under study.xxiv In addition, when we provide summary statistics, we remove singleton observations based on CWS and birth year-month fixed-effects, or mother and birth year-month fixed-effects, in order to calculate statistics for the exact samples used in our regressions.xxv
Table 1 describes the distribution of our average RRMCL measures across births; throughout, births exposed to MCL violations are excluded. In preliminary analyses, we noticed that our RRMCL index measures are very right-skewed, even after removing violation-exposed births. Thus, we decided to trim out the top 1% of births based on RRMCL when we formed our analytic sample.xxvi In panel A, we show the distribution of RRMCL across births, by contaminant group, after our trimming. To illustrate why we trimmed outliers, Panel B shows the distribution before trimming. Finally, since our main specification for identifying causal effects is a within-mother design, Panel C shows the distribution of these index measures after removing mother fixed-effects. We see that the standard deviation in Panel C is about two-thirds the size, as is the spread between the 10th and 90th percentiles.To facilitate comparing the overall, chemical, and reproductive index contamination index measures, in our regressions, we standardize such that a 1-unit change is equivalent to a change from the 10th to the 90th percentile of the average RRMCL, for the RRMCL measure of interest (overall, chemical, or reproductive). These movements are equivalent to about 1.875 standard deviations for the overall measure, about 1.625 standard deviations for the chemical measure, and about 1.889 standard deviations for the reproductive index (five contaminant) measure (Table 1 contains the quantiles of the distribution). We chose the 10th and 90th percentiles to standardize in order to simulate a large, though not impossible, change in contamination exposure.xxvii For comparison, a movement from the 25th to the 75th percentile is about 33% of the 10th-to-90th percentile movement for the overall contamination measure, and about 31% for the chemical-only measure. The reader can rescale our estimates to represent effects of these movements by multiplying by 0.33 or 0.31, respectively.xxviii Finally, our most rigorous specifications use within-mother across-sibling variation only; Table 1 includes statistics describing the spread of our contamination indices across births within mother in Panel C, in case the reader wishes to rescale our estimates based on that distribution instead.xxix
Table 1.
Distribution of drinking water contamination experienced by Pennsylvanian births during gestation.
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | |
|---|---|---|---|---|---|---|---|---|---|
| Contaminant | Mean | SD | 10th | 25th | 50th | 75th | 90th | Max | Births |
|
| |||||||||
| Panel A: Trimmed sample (births used in analysis) | |||||||||
|
|
|||||||||
| All | 0.08 | 0.08 | 0.02 | 0.03 | 0.05 | 0.08 | 0.17 | 0.50 | 1,216,132 |
| Chemical | 0.07 | 0.08 | 0.02 | 0.03 | 0.04 | 0.07 | 0.15 | 0.87 | 1,214,709 |
| Repro. index | 0.08 | 0.09 | 0.00 | 0.02 | 0.06 | 0.11 | 0.17 | 2.80 | 1,147,893 |
| Panel B: Raw sample (including births not used) | |||||||||
|
|
|||||||||
| All | 0.09 | 2.24 | 0.02 | 0.03 | 0.05 | 0.09 | 0.18 | 1425.00 | 1,228,531 |
| Chemical | 0.09 | 2.24 | 0.02 | 0.03 | 0.04 | 0.07 | 0.17 | 1425.00 | 1,227,035 |
| Repro. index | 0.08 | 0.59 | 0.00 | 0.02 | 0.06 | 0.11 | 0.18 | 193.80 | 1,151,025 |
| Panel C: Trimmed sample, after removing mother fixed effects | |||||||||
|
|
|||||||||
| All | −0.00 | 0.05 | −0.05 | −0.02 | −0.00 | 0.01 | 0.05 | 0.37 | 700,951 |
| Chemical | −0.00 | 0.05 | −0.05 | −0.01 | −0.00 | 0.01 | 0.04 | 0.54 | 699,656 |
| Repro. index | −0.00 | 0.05 | −0.05 | −0.02 | 0.00 | 0.02 | 0.05 | 1.26 | 645,353 |
Notes: Table displays the average sample result relative to MCL (RRMCL) over gestational periods for births in Pennsylvania, 2003 to 2014. Throughout, the table restricts to non-plural births not exposed to MCL violations. Panel A trims the sample of births at the 99th percentile (separately for each contaminant group). This is the sample we use throughout the paper. Panel B shows the raw statistics for comparison. Panel C shows the distribution after removing mother fixed effects, which is relevant given our causal design. “All” and “Chemical” are our indexes described in the section “Water Quality Index Construction Details”; “Repro. Index” is our five-contaminant index described in the section “Reproductive Health Water Quality Index.”
In the Online Appendix Section “Discussion of Our Main Index Measures,” we further describe the correlation between our index measures and the samples for selected contaminants of interest in Appendix Table A1, as well as provide a full table listing all the contaminants included in each of our index measures (94 for the all index, 91 for the chemical index) in Appendix Table A2.
Reproductive health water quality index
EPA lists 13 contaminants associated with “reproductive difficulties” and another 6 associated with harms to “infants” (EPA, 2018). See Appendix Table A3 for the specific contaminants. We performed a literature review (see Appendix Section “An Alternative Reproductive Health Index of Five Contaminants Based on Data Availability and Prior Research” for discussion of this literature) and the epidemiology literature primarily focuses on arsenic, nitrate, atrazine, and disinfectant byproducts (Haloacetic Acids [HAA5], Total Trihalomethanes [TTHM]) and only nitrate and atrazine overlap with the EPA’s list. We also found some epidemiological literature associating tetrachloroethylene (PERC) and Di(2-ethylhexyl)phthalate (DEHP) and a recent study in Virgina included these plus lead, radon, and fecal bacteria (Young et al., 2022).xxx We assessed the availability of samples for these contaminants to create an index that would have more uniform coverage in our sample population and limited to five with frequent sampling in Pennsylvania for our reproductive-health specific water quality index: arsenic, atrazine, nitrate, Di(2-ethylhexyl) Phthalate (DEHP), and Tetrachloroethene (PCE or PERC). On average, 3.27 unique contaminants are used in this more specific water quality index. This index could be used to target water alerts communications to pregnant persons and other studies could create water quality indices for specific health outcomes to target water alerts to specifically vulnerable populations.
Summary statistics
Summary statistics for births are provided in Table 2. In column (1) we display statistics for all non-plural births matched to CWS without exposure to MCL violations; in column (2) we restrict to our analytic sample for overall contamination.xxxi These births include those to mothers who switch CWS between some of their births in the panel. There is very little difference in mother characteristics between columns (1) and (2), suggesting that samples are consistent after addressing data availability and trimming. Columns (3) and (4) examine the births in column (2), split by the tails of average contamination by CWS. These columns suggest that mothers of lower socioeconomic status, and worse birth outcomes on average, tend to give birth in places with higher levels of water contamination (that does not trigger a health-based violation). Column (5) restricts the sample to the mothers with multiple births that are included in our mother fixed-effects regressions. In general, differences between column (5) and column (2) are not large, suggesting that sample composition does not substantially change when we restrict to mothers with multiple births (i.e., siblings).xxxii
Table 2.
Descriptive statistics of births in Pennsylvania, 2003–2014.
| (1) | (2) | (3) | (4) | (5) | |
|---|---|---|---|---|---|
| Characteristic | Non-plural | Analytic | Tails of contamination distribution | Mom FE | |
|
|
|||||
| No MCL | sample | ≤ 10th | ≥ 90th | Sample | |
|
| |||||
| Sample Sizes | |||||
| No. of births | 1,241,656 | 1,216,132 | 122,856 | 110,523 | 700,815 |
| No. of moms | 832,674 | 820,173 | 112,209 | 103,069 | 304,934 |
| No. of CWSs | 1,537 | 1,459 | 1,302 | 885 | 1,343 |
| Outcomes | |||||
| Low birth weight | 0.067 | 0.067 | 0.057 | 0.079 | 0.065 |
| Preterm birth | 0.082 | 0.082 | 0.074 | 0.102 | 0.081 |
| Small for gestational age | 0.095 | 0.096 | 0.083 | 0.095 | 0.091 |
| Term birth weight (g) | 3,396 (468.9) | 3,396 (468.8) | 3,423 (465.0) | 3,389 (467.5) | 3,402 (467.2) |
| CWS births per 100k per month | 113.4 (107.4) | 112.7 (103.3) | 141.0 (223.4) | 109.6 (108.1) | 112.1 (98.59) |
| Other characteristics | |||||
| Mom age (years) | 27.87 (6.054) | 27.87 (6.057) | 28.01 (5.854) | 27.80 (6.023) | 27.69 (5.858) |
| Mom Black | 0.210 | 0.213 | 0.074 | 0.185 | 0.220 |
| Mom Hispanic | 0.068 | 0.069 | 0.038 | 0.060 | 0.074 |
| Mom white, not Hispanic | 0.722 | 0.718 | 0.889 | 0.757 | 0.713 |
| HS or less | 0.401 | 0.401 | 0.377 | 0.402 | 0.406 |
| Mom smokes | 0.224 | 0.223 | 0.262 | 0.238 | 0.216 |
| Mom married | 0.569 | 0.568 | 0.642 | 0.581 | 0.578 |
| WIC/Medicaid | 0.477 | 0.478 | 0.422 | 0.459 | 0.482 |
| Different CWS next birth | 0.369 | 0.368 | 0.456 | 0.356 | 0.368 |
| Mean RRCML, all contaminants | 0.095 (2.237) | 0.078 (0.078) | 0.014 (0.006) | 0.285 (0.082) | 0.078 (0.077) |
| Mean RRCML, chemical only | 0.085 (2.238) | 0.069 (0.080) | 0.013 (0.006) | 0.281 (0.112) | 0.069 (0.079) |
Notes: Sample consists of births matched to a public water system. Means (for indicator variables, proportions) are displayed, with standard deviations in parentheses when appropriate. Column (1) provides statistics for non-plural births without MCLs. Column (2) restricts to our analytic sample, keeping only births with an overall measure of contamination and for which that measure is below the 99th percentile. Columns (3) and (4) describe births in the tails of the overall contamination distribution (among births in column (2)). Column (5) describes births in our mother fixed-effects specifications.
To reiterate, column (1) of Table 2 already restricts to births matched to CWS; in Online Appendix Table A8 we compare this restricted sample to the sample of all births, including those not matched to CWS. The most notable difference is that the sample matched to CWS is more urban, which is expressed most strongly by being of greater share Black (as rural Pennsylvania is predominantly non-Black).
EMPIRICAL METHODS
We employ two regression models to study the effects of drinking water contamination on birth outcomes, one with community water system (CWS) fixed effects (FE) and the other with mother FE (within-mother across-birth variation).
The first model takes the following form:
| #(3) |
In this model, denotes birth to mother living in CWS in year-month at the time of the birth; is a birth outcome; the birth outcomes we consider are: low birth weight (LBW), preterm birth (PTB), small for gestational age (SGA) and term birth weight (TBW). See the section “Birth Records and Control Variables” for information about how these outcomes are defined.
This regression specification employs a fixed effects estimator, where and are water system and birth year-month fixed-effects, respectively. The independent variable of interest is , “mean result relative to MCL,” a measure of overall or chemical drinking water contamination which we construct from the sample concentrations as described in subsection “Water Quality Index Construction Details.” The identification assumption is that, after adjusting for an array of individual mother, weather and pollution characteristics in and , trends in birth outcomes across water systems would be the same on average except for differences due to differences in water contamination trends.
We control rigorously for a wide array of mother, birth, and weather measures. Whenever a variable is missing, we set it to zero and include an indicator for missingness. We include a full set of mother demographic and socioeconomic controls, and risk factors for the pregnancy, in .xxxiii These control variables were curated from the birth records.xxxiv The variables in are weather and temperature controls obtained from the daily weather statistics provided by Schlenker (Schlenker & Roberts, 2009), as well as Toxics Release Inventory (TRI) facility statistics from the EPA (EPA, 2013).xxxv
The CWS fixed-effects model also naturally lends itself to an aggregate water-system level fertility regression,
| (4) |
This model is motivated by the concern that observed fertility rates may respond to drinking water contamination, either due to behavior such as mobility or selection into pregnancy, or due to health issues, such as infertility, miscarriage, or stillbirth. In this model, is the birth rate for the CWS in the given year-month, defined as the number of births divided by the system’s population served. We also estimate a version using as the dependent variable to alleviate concerns that we are not accurately measuring CWS-population. Our population served measure is time-invariant, so we cannot ignore the fact that variations in birth rates may be due to mothers leaving or entering CWS over our panel; we address maternal mobility below. The variable of interest is a measure of mean contamination in the water system over the last 9 months (by aggregating the RRMCL measures described in the section “Data and Summary Statistics,” weighting months equally); is a measure of average birth characteristics listed earlier taken over the last 9 months weighting births equally; and is the same battery of weather and TRI facility-based controls we use in models at the individual birth level, except again averaged over the last 9 months, weighting births equally.xxxvi In these regressions, we weight systems by their population served.
In light of the possibility that any fertility effects we identify may be due to mothers simply moving out of CWS in response to contamination, it is useful to repeat regression (3), but with a different outcome variable: whether the mom is in a different CWS in her next birth (restricting to mothers with at least two births). These models directly calculate whether mothers can be seen to switch systems more often when there is greater water contamination in their first birth. These estimates indirectly speak to whether any fertility effects we see are due to mother mobility.
The second birth-level regression model we employ is a within-mother analysis, which uses variation only across the births of the same mother. The regression is identical to the CWS fixed- effects model (equation 3) but mother fixed-effects are added, in addition to the current fixed-effects. Controlling for all time-invariant characteristics of mothers is a rigorous way to determine whether water quality variation affects birth outcomes, rather than being confounded with other mother characteristics.
As before, in addition to fitting mother fixed-effects models with birth outcomes as the dependent variable, we can also fit these with mother mobility (switching CWS) as the outcome. We estimate mobility effects within CWS and within mothers. The effects of contamination on mobility are identified off relating within-mother differences in contamination that mothers face across births to within-mother differences in whether the mother has switched CWS between births.xxxvii
The recent literature on difference-in-differences (DD) and two-way fixed effects (TWFE) estimators have made significant advances in a staggered treatment setting (Callaway et al., 2021; Callaway & SantAnna, 2021; De Chaisemartin & dHaultfoeuille, 2020; Goodman-Bacon, 2021). While we do employ a panel model that leverages within water system or within mother variation over time, we are not explicitly using a difference-in-differences model due to this being a non-standard setting with no clear event. Contamination can steadily increase over time, stay relatively constant, reduce abruptly following a change in technology or following a regulatory violation/remediation, and it is not clear that it follows a standard pattern commonly used in a DD model exploiting pre/post treatment (with continuous treatment). Here, there are also unlikely to be “never treated” units.
While the literature on TWFE DD and event studies with continuous treatment suggests this could still be an issue (Callaway et al., 2021), the literature does not at this time have a solution to employ. The primary issue that the continuous treatment in a TWFE design presents is a selection issue. The decomposition in Callaway et al. (2021) showed that the presence of this selection bias without an easy fix will require credibly arguing that the average treatment effects are equal across all dosage groups at the same dosage level. The identification assumption is strong—that there is no selection into the intensity of treatment. However, we are studying dosages of water contamination that are compliant with current regulations and does not trigger a violation (i.e., information to the consumer). The water systems do not have an incentive to manipulate concentrations except near the regulatory threshold. We show that mothers are not changing water systems in response, nor are there large changes in fertility. Finally, our “dose” is defined by a water quality index that is not observed by the consumer, the operator, or the policymaker, and it is unlikely that any systems or mothers are selecting on the level of treatment.
RESULTS
Tables 3 and 4 display our main estimates of the overall effects of drinking water contamination in community water systems (CWS) on birth outcomes. Tables 5 and 6 present the results for the reproductive-specific water quality index. Table 7 displays our estimates of the effects of water contamination on mother mobility or overall CWS-level fertility. Throughout, we restrict to our analytic sample. In particular, we have removed all births that have a gestation period which overlaps with any MCL violation compliance period. Thus, our estimates are of the effects of water contamination on birth outcomes, mobility, or fertility, for variation in water contamination which does not coincide with MCL violations.
Table 3.
Effects of drinking water contamination on low birth weight and preterm birth.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
|
|
|
|
||||
| All | Chemical | All | Chemical | All | Chemical | |
|
| ||||||
| Panel A: low birth weight (LBW) | ||||||
|
|
||||||
| CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.01356*** (0.00302) | 0.01456** (0.00133) | 0.01249*** (0.00285) | 0.01349** (0.00124) | 0.01059*** (0.00240) | 0.01050** (0.00140) |
| Observations | 1,216,132 | 1,214,688 | 1,216,132 | 1,214,688 | 1,216,132 | 1,214,688 |
| Adj R2 | 0.0050 | 0.0052 | 0.0456 | 0.0458 | 0.1058 | 0.1059 |
| AETO’s δ | 32.59 | 34.50 | 10.78 | 7.82 | ||
| Mom & CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.00957*** (0.00245) | 0.01048** (0.00120) | 0.00932*** (0.00232) | 0.01003** (0.00115) | 0.00819*** (0.00198) | 0.00774** (0.00133) |
| Observations | 700,815 | 699,546 | 700,815 | 699,546 | 700,815 | 699,546 |
| Adj R2 | 0.1506 | 0.1508 | 0.1689 | 0.1691 | 0.2103 | 0.2104 |
| AETO’s δ | 2.29 | 1.43 | 1.13 | 0.55 | ||
| Panel B: preterm birth (PTB) | ||||||
|
|
||||||
| CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.01997*** (0.00405) | 0.02134** (0.00178) | 0.01866*** (0.00375) | 0.02001** (0.00164) | 0.01626*** (0.00306) | 0.01623** (0.00175) |
| Observations | 1,216,132 | 1,214,688 | 1,216,132 | 1,214,688 | 1,216,132 | 1,214,688 |
| Adj R2 | 0.0036 | 0.0040 | 0.0467 | 0.0471 | 0.1316 | 0.1318 |
| AETO’s δ | 40.16 | 41.35 | 13.38 | 9.72 | ||
| Mom & CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.01571*** (0.00363) | 0.01716** (0.00166) | 0.01533*** (0.00345) | 0.01658** (0.00160) | 0.01405*** (0.00276) | 0.01377** (0.00170) |
| Observations | 700,815 | 699,546 | 700,815 | 699,546 | 700,815 | 699,546 |
| Adj R2 | 0.1486 | 0.1489 | 0.1663 | 0.1665 | 0.2252 | 0.2253 |
| AETO’s δ | 2.44 | 1.79 | 2.00 | 0.99 | ||
Notes: Standard errors in parentheses
p < 0.10
p < 0.05
p < 0.01.
Standard errors are two-way clustered on public water system and mother. Each cell is a separate regression. Observations (number of births) and adjusted R2s (for the full model, i.e., including the fixed-effects) are reported. In each regression, the sample is restricted to births in our analytic sample for whom we observe at least one water quality sample of the given contaminant group during gestation in the public water system of residence at the time of birth. Control variables used vary over supercolumns; independent variable, i.e. the contaminant group studied, vary across columns; the outcomes varies over panels; the specification used (either public water system fixed-effects or mom fixed-effects) varies over sub-panels; temp & TRI ctrls = weather and toxics release inventory (air pollution) controls.
Table 4.
Effects of contamination on small for gestational age and term birth weight.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
|
|
|
|
||||
| All | Chemical | All | Chemical | All | Chemical | |
|
| ||||||
| Panel A: small for gestational age (SGA) | ||||||
|
|
||||||
| CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.00341*** (0.00077) | 0.00268** (0.00064) | 0.00282*** (0.00074) | 0.00217** (0.00061) | 0.00258*** (0.00079) | 0.00183** (0.00062) |
| Observations | 1,214,205 | 1,212,766 | 1,214,205 | 1,212,766 | 1,214,205 | 1,212,766 |
| Adj R2 | 0.0054 | 0.0054 | 0.0381 | 0.0381 | 0.0390 | 0.0390 |
| AETO’s δ | 13.15 | 11.59 | 8.30 | 5.86 | ||
| Mom & CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.00279*** (0.00100) | 0.00179** (0.00083) | 0.00276*** (0.00096) | 0.00165** (0.00081) | 0.00290*** (0.00099) | 0.00173** (0.00083) |
| Observations | 698,761 | 697,487 | 698,761 | 697,487 | 698,761 | 697,487 |
| Adj R2 | 0.1386 | 0.1393 | 0.1542 | 0.1548 | 0.1544 | 0.1550 |
| AETO’s δ | 6.36 | 0.65 | −1.90 | 1.26 | ||
| Panel B: term birth weight (TBW) | ||||||
|
|
||||||
| CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | −10.61*** (1.39) | −10.04*** (1.25) | −9.75*** (1.55) | −9.44*** (1.21) | −9.54*** (1.69) | −8.77*** (1.32) |
| Observations | 1,116,570 | 1,115,266 | 1,116,570 | 1,115,266 | 1,116,570 | 1,115,266 |
| Adj R2 | 0.0169 | 0.0169 | 0.0943 | 0.0944 | 0.0987 | 0.0988 |
| AETO’s δ | 30.56 | 42.24 | 22.73 | 17.70 | ||
| Mom & CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | −9.05*** (1.73) | −8.82*** (1.34) | −9.15*** (1.84) | −8.58*** (1.37) | −9.33*** (1.87) | −8.25*** (1.46) |
| Observations | 612,960 | 611,893 | 612,960 | 611,893 | 612,960 | 611,893 |
| Adj R2 | 0.3437 | 0.3437 | 0.3827 | 0.3828 | 0.3848 | 0.3849 |
| AETO’s δ | −10.33 | 3.87 | −5.48 | 1.47 | ||
Notes: Standard errors in parentheses
p < 0.10
p < 0.05
p < 0.01.
Standard errors are two-way clustered on public water system and mother. Each cell is a separate regression. Observations (number of births) and adjusted R2s (for the full model, i.e., including the fixed-effects) are reported. In each regression, the sample is restricted to births in our analytic sample for whom we observe at least one water quality sample of the given contaminant group during gestation in the public water system of residence at the time of birth. Control variables used vary over supercolumns; independent variable, i.e. the contaminant group studied, vary across columns; the outcomes varies over panels; the specification used (either public water system fixed-effects or mom fixed-effects) varies over sub-panels; temp & TRI ctrls = weather and toxics release inventory (air pollution) controls.
Table 5.
Effects of drinking water contamination on low birth weight and preterm birth, using five- contaminant reproductive index.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
|
|
|
|
||||
| All | 5 Contam | All | 5 Contam | All | 5 Contam | |
|
| ||||||
| Panel A: low birth weight | ||||||
|
|
||||||
| CWS FEs | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.01356*** (0.00302) | 0.01936*** (0.00639) | 0.01249*** (0.00285) | 0.01833*** (0.00580) | 0.01059*** (0.00240) | 0.01406** (0.00576) |
| Observations | 1,216,132 | 1,139,430 | 1,216,132 | 1,139,430 | 1,216,132 | 1,139,430 |
| Adj R2 | 0.0050 | 0.0049 | 0.0456 | 0.0452 | 0.1058 | 0.1049 |
| Mom & CWS FEs | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.00957*** (0.00245) | 0.01709** (0.00405) | 0.00932*** (0.00232) | 0.01637** (0.00381) | 0.00819*** (0.00198) | 0.01282** (0.00411) |
| Observations | 700,815 | 637,615 | 700,815 | 637,615 | 700,815 | 637,615 |
| Adj R2 | 0.1506 | 0.1507 | 0.1689 | 0.1686 | 0.2103 | 0.2094 |
| Panel B: preterm birth | ||||||
|
|
||||||
| CWS FEs | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.01997*** (0.00405) | 0.02743** (0.00863) | 0.01866*** (0.00375) | 0.02621** (0.00798) | 0.01626*** (0.00306) | 0.02096** (0.00788) |
| Observations | 1,216,132 | 1,139,430 | 1,216,132 | 1,139,430 | 1,216,132 | 1,139,430 |
| Adj R2 | 0.0036 | 0.0034 | 0.0467 | 0.0460 | 0.1316 | 0.1306 |
| Mom & CWS FEs | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.01571*** (0.00363) | 0.02370** (0.00525) | 0.01533*** (0.00345) | 0.02279** (0.00497) | 0.01405*** (0.00276) | 0.01834** (0.00543) |
| Observations | 700,815 | 637,615 | 700,815 | 637,615 | 700,815 | 637,615 |
| Adj R2 | 0.1486 | 0.1487 | 0.1663 | 0.1659 | 0.2252 | 0.2247 |
Notes: Standard errors in parentheses
p < 0.10
p < 0.05
p < 0.01.
Standard errors are two-way clustered on public water system and mother. Each cell is a separate regression. Observations (number of births) and adjusted R2s (for the full model, i.e., including the fixed-effects) are reported. In each regression, the sample is restricted to births in our analytic sample for whom we observe at least one water quality sample of the given contaminant group during gestation in the public water system of residence at the time of birth. Control variables used vary over super-columns; independent variable, i.e. the contaminant group studied, vary across columns; the outcomes varies over panels; the specification used (either public water system fixed-effects or mom fixed-effects) varies over sub-panels; temp & TRI ctrls = weather and toxics release inventory (air pollution) controls. Results for the overall index included for comparison, which are the same as the “All” columns in Table 3.
Table 6.
Effects of drinking water contamination on small for gestational age and term birth weight, using five-contaminant reproductive index.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
|
|
|
|
||||
| All | 5 Contam | All | 5 Contam | All | 5 Contam | |
|
| ||||||
| Panel A: small for gestational age | ||||||
|
|
||||||
| CWS FEs | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.00341*** (0.00077) | 0.00511*** (0.00143) | 0.00282*** (0.00074) | 0.00470*** (0.00125) | 0.00258*** (0.00079) | 0.00390*** (0.00138) |
| Observations | 1,214,205 | 1,137,738 | 1,214,205 | 1,137,738 | 1,214,205 | 1,137,738 |
| Adj R2 | 0.0054 | 0.0053 | 0.0381 | 0.0380 | 0.0390 | 0.0390 |
| Mom & CWS FEs | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.00279*** (0.00100) | 0.00349** (0.00164) | 0.00276*** (0.00096) | 0.00350** (0.00167) | 0.00290*** (0.00099) | 0.00317* (0.00168) |
| Observations | 698,761 | 635,832 | 698,761 | 635,832 | 698,761 | 635,832 |
| Adj R2 | 0.1386 | 0.1405 | 0.1542 | 0.1559 | 0.1544 | 0.1561 |
| Panel B: term birth weight | ||||||
|
|
||||||
| CWS FEs | ||||||
|
|
||||||
| 10th to 90th RRMCL | −10.61*** (1.39) | −13.86*** (3.94) | −9.75*** (1.55) | −13.92*** (3.57) | −9.54*** (1.69) | −12.58*** (3.91) |
| Observations | 1,116,570 | 1,047,149 | 1,116,570 | 1,047,149 | 1,116,570 | 1,047,149 |
| Adj R2 | 0.0169 | 0.0167 | 0.0943 | 0.0945 | 0.0987 | 0.0989 |
| Mom & CWS FEs | ||||||
|
|
||||||
| 10th to 90th RRMCL | −9.05*** (1.73) | −13.74*** (2.79) | −9.15*** (1.84) | −13.86*** (2.82) | −9.33*** (1.87) | −12.59*** (2.99) |
| Observations | 612,960 | 558,355 | 612,960 | 558,355 | 612,960 | 558,355 |
| Adj R2 | 0.3437 | 0.3483 | 0.3827 | 0.3867 | 0.3848 | 0.3887 |
Notes: Standard errors in parentheses
p < 0.10
p < 0.05
p < 0.01.
Standard errors are two-way clustered on public water system and mother. Each cell is a separate regression. Observations (number of births) and adjusted R2s (for the full model, i.e., including the fixed-effects) are reported. In each regression, the sample is restricted to births in our analytic sample for whom we observe at least one water quality sample of the given contaminant group during gestation in the public water system of residence at the time of birth. Control variables used vary over super-columns; independent variable, i.e. the contaminant group studied, vary across columns; the outcomes varies over panels; the specification used (either public water system fixed-effects or mom fixed-effects) varies over sub-panels; temp & TRI ctrls = weather and toxics release inventory (air pollution) controls. Results for the overall index included for comparison, which are the same as the “All” columns in Table 4.
Table 7.
Effects of drinking water contamination on mobility and fertility.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother controls | Adding weather controls | ||||
|
|
|
|
||||
| All | Chemical | All | Chemical | All | Chemical | |
|
| ||||||
| Panel A: Mobility – CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | −0.00108 (0.00189) | −0.00014 (0.00149) | −0.00165 (0.00178) | −0.00068 (0.00151) | −0.00069 (0.00152) | −0.00100 (0.00152) |
| Observations | 413,135 | 412,785 | 413,135 | 412,785 | 413,135 | 412,785 |
| Adj R2 | 0.1271 | 0.1271 | 0.1470 | 0.1470 | 0.1472 | 0.1472 |
| B: Mobility – Mom FEs | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.00255 (0.00315) | 0.00348 (0.00260) | 0.00165 (0.00309) | 0.00297 (0.00251) | 0.00298 (0.00252) | 0.00280 (0.00257) |
| Observations | 172,651 | 172,498 | 172,651 | 172,498 | 172,651 | 172,498 |
| Adj R2 | 0.2494 | 0.2486 | 0.2694 | 0.2689 | 0.2696 | 0.2691 |
| Panel C: Fertility (birth rate per 100,000 people served) – CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | −0.07271** (0.03491) | −0.08445** (0.03351) | −0.06937** (0.02934) | −0.07647** (0.03040) | −0.07472** (0.02939) | 0.07040** (0.02799) |
| Observations | 59,763 | 59,406 | 59,763 | 59,406 | 59,763 | 59,406 |
| Adj R2 | 0.7397 | 0.7393 | 0.7420 | 0.7417 | 0.7422 | 0.7418 |
| Panel D: Fertility (log number of births) – CWS FEs | ||||||
| 10th to 90th RRMCL | −0.00114*** (0.00041) | −0.00095*** (0.00033) | −0.00114*** (0.00036) | −0.00089*** (0.00031) | - (0.00037) | - (0.00031) |
| Observations | 33,122 | 32,912 | 33,122 | 32,912 | 33,122 | 32,912 |
| Adj R2 | 0.9924 | 0.9925 | 0.9925 | 0.9926 | 0.9925 | 0.9926 |
Notes: Standard errors in parentheses
p < 0.10
p < 0.05
p < 0.01.
Standard errors are two-way clustered on public water system and mother in panels A and B, and clustered on public water system in panel C. Observations (number of births in panel A and B; public water system months in panel C) and adjusted R2s (inclusive of fixed-effects) are reported. Each cell is a separate regression. In panels A and B, the sample is our analytic sample; in panel A, mothers must have two or more births in the data (to measure mobility), with at least one having a measure of contamination, to be included. Panel B requires mothers to have three or more births in our sample due to the addition of mom fixed-effects. In both panels A and B, the dependent variable (measured for each birth) is whether the mother is in a different public water system at the time of the next birth (being in no public water system in the next birth is considered being in a different water system for this purpose). Observations in panel C are public water system quarters, and the dependent variable is the number of births per 100,000 people served by the public water system. Regressions in panel C are weighted by public water system population served. Regressions in panel D are the same as panel C, except the outcome variable is the natural log of the number of births (community water system quarters with zero births are dropped). Control variables used vary over super-columns; independent variables, i.e., the contaminant group studied, vary across columns.
Birth outcomes
Fitting the model described by Equation 3, we find that the chance of low birth weight (LBW) and preterm birth (PTB) are both significantly positively related to contamination overall and chemical contamination specifically (Table 3). Reading across columns, we find that the addition of our extensive set of control variables does not change inference, despite raising adjusted R2 substantially. Note, however, that point estimates are attenuated by 10% to 28% in the fully adjusted specifications (columns 5 and 6) relative to the unadjusted specifications (columns 1 and 2). Generally, effect estimates are larger in the unadjusted models. Reading across rows, we find that our results are more sensitive to whether we use a CWS fixed-effects (CWS FE) specification or a mother fixed-effects (MFE) specification (that also includes CWS FE). In general estimates are larger in the CWS FE specifications. All coefficients are estimated effects of a change from the 10th percentile to the 90th percentile in the Pennsylvanian distribution of exposure to CWS drinking water contamination.xxxviii Thus they are comparable across specifications. In both cases, effects are economically meaningful: based on columns (5) and (6), in the MFE specification, a 10-to-90th percentile move leads to about a 0.8 percentage point increase in low birth weight (LBW). From the base LBW rate for non-plural births of 6.7% (Table 2), this is about a 12% increase. Analogously for PTB, we find about a 17% increase.
In order to formally calculate the degree to which the addition of control variables affects our inference, we report “AETO’s δ.” By the acronym AETO we refer to the two papers Altonji et al. (2005) and Oster (2016); the latter work substantially builds on the former and provides us the Stata implementation we use.xxxix The interpretation of δ is the minimum ratio of selection on unobservables to selection on observables that would be needed to make our estimated coefficients zero.xl A ratio of at least 1 is a plausible rule for saying an effect estimate is “robust to controls.”xli In our case, AETO’s δ tends to be large in the CWS FE models, but much smaller in the mother (and CWS) fixed-effects models (Table 3). This is partly by construction: the mom fixed-effects absorb many potential unobservables, making it unlikely that the inclusion of control variables will appreciably change the R2 in the model. Indeed, consistent with this idea, when we compare for example, columns (1) and (3) in the mom fixed-effects specifications, the control variables do not substantially increase adjusted R2. That said, most of our estimates have AETO’s δ’s greater than one. This also lends support for our earlier claim that there is not a clear indication of selection into a particular dose of treatment.
In Table 4, we report the estimated effects on more direct measures of intrauterine growth restriction: small for gestational age (SGA) and term birth weight (TBW). These estimates are consistent with our main findings. In other words, after applying at least some approaches to removing the effects of contamination on gestation length, we continue to find effects of contamination on birth outcomes, suggesting that effects on gestation length are not the only channels by which water contamination affects birth outcome, and intrauterine growth restriction (IUGR) is a possible consequence of contamination.
While our main analysis focuses on indices of water quality that include a large number of contaminants, as discussed in the section “Reproductive Health Water Quality Index” and Appendix section “An Alternative Reproductive Health Index of Five Contaminants Based on Data Availability and Prior Research,” an alternative approach is to restrict to contaminants which are commonly sampled and for which we could find research literature suggesting deleterious health effects for infants. We ultimately chose five contaminants to include in a reproductive health specific water quality index. Estimates using this five-contaminant index to measure water quality are included in Tables 5 and 6 (the tables include estimates for our all index of contamination for comparison). Across the various specifications, we see slightly stronger effects for the reproductive-specific index. In particular, focusing on columns (5) and (6) and mother fixed-effects regressions, we find that a 10th-to-90th percentile movement in RRMCL for our five-contaminant index leads to a 1.3 percentage point increase in low birth weight (about 19% relative to the mean), a 1.8 percentage point increase in preterm birth (about 22% relative to the mean), 0.3 percentage point increase in small for gestational age (significant at only the 10% level), and a 12.59 gram decrease in term birth weight.
For interested readers, in Appendix Section “Effects of MCL Violations on Birth Outcomes” we estimate models studying the effects of MCL violations on health, following Currie et al. (2013). We find that 3% of our sample has a regulatory violation during gestation, compared to 8% in NJ in the prior study. We find similar effects in magnitude and statistical precision for any MCL violation, but not for chemical violations, which is in contrast to Currie et al. (2013).xlii
Avoidance via migration
Table 7 Panels A and B display the estimated effects of contamination on mother mobility. Mobility is defined for a given birth as an indicator for the mother being in a different CWS at the time of her next birth. Regressions in this table are similar to equation 3, except with mobility as the dependent variable. As before, we restrict to our analytic sample as described in the section “Data and Summary Statistics.” In these models, the independent variable of interest is CWS drinking water contamination, as measured during the gestation period of the current birth.
Mothers would exhibit avoidance behavior via migration if, when they experience higher levels of water contamination during this gestation period, they are more likely to be found in a different CWS at the time of their next birth. We find little evidence for this type of avoidance behavior, which is consistent with mothers not behaviorally responding to contamination levels that do not trigger MCL violations. In Table 2 about 37% of births have their mother in a different CWS at the next birth (this is consistent with statistics from New Jersey in Currie et al., 2013, so the point estimates from Table 7 are also economically small).
While we have evaluated the degree of avoidance behavior through very costly migration decisions, this is not the only possible and probably not the most likely form of avoidance behavior. It is possible that very attentive mothers purchase bottled water instead of using tap water, or they filter their tap water, when contamination rises. This avoidance behavior is less likely in our setting where we study water quality that doesn’t trigger a health-based violation unless their system previously experienced violations. However, mobility looms large as a potential avoidance behavior that could lead to bias in our estimated effects of contamination on birth outcomes, and especially fertility, discussed in the next section, so it is helpful to note little evidence for this particular behavior.
Fertility
Table 7 Panel C displays the estimated effects of contamination on fertility rates from the regression described in equation 4. These regressions are weighted by CWS population served. CWS fertility rates are defined as monthly births per 100,000 people served.xliii Our findings are consistent with poor water quality reducing fertility. A typical birth is in a CWS with about 113 births per 100,000 served per month (Table 2). We find that changes in overall contamination from the 10th to the 90th percentile reduces this number by about 0.07 births per 100,000 per month, or approximately a 0.06% reduction in fertility. Effects for chemical contamination are similar. Comparing these effects to those found for lead contamination, D. S. Grossman & Slusky (2019) found that the Flint water crisis led to a 12% reduction in fertility, while Clay et al. (2021) found, for topsoil lead, that bringing a typical above-median county to the lead levels of a typical below-median county would lead to approximately an 11% increase in the birth rate. By contrast, we find that going from the 10th to the 90th percentile of water contamination in our sample has a much smaller, though still statistically significant, negative effect on fertility. Lead is known to induce miscarriage, whereas the more general contamination studied in our paper does not have strong evidence for causing pregnancy loss. Given that mobility effects appear to be limited, the effects on fertility are likely due to selection out of pregnancy, either through choice or through contamination leading to reduced fertility.
Our fertility regressions in Panel C of Table 7 use time invariant CWS population served in the denominator of the dependent variable. As a robustness check, in Panel D of the same table we use log births as the outcome variable, thereby avoiding the use of the time-invariant population served variable. This approach is commonly used (Ananat & Hungerman, 2012; González, 2013; Guldi, 2008). Effects in Panel D are larger in magnitude, at about a 0.1% reduction in fertility, but they are of the same sign and statistical significance as our main fertility results in Panel C.
SENSITIVITY ANALYSIS
In this section, we discuss the sensitivity of our findings to different subsamples. For all results presented thus far, we have restricted to non-plural births, and have included all mothers, including those who switch CWS. We show robustness to these sample selection choices in this section. All tables referenced in this section are included in Online Appendix section “Sensitivity Analyses.”
Robustness to sample restrictions based on plurality and mobility
In the Online Appendix, we estimate our models with two alternative samples: first, all births (i.e., including plural births), and second, sub-setting to never-moving mothers. Including plural births does little to change our findings (Appendix Tables A9 and A10). Our findings are also robust to excluding all mothers with multiple births who we ever observe in different CWS across births (Appendix Tables A11 and A12). This is surprising since this sample restriction removes a quarter to a half of the births in our data, depending on specification; moreover, one component of our variation in water quality from the mother fixed-effects analyses was due to mothers switching water systems with different average contamination. The robustness of our results in this very restrictive sample provides additional support for our within-water system estimates.
Table 8:
Largest systems by water quality letter grades based on our index measure.
| (1) | (2) | (3) | |
|---|---|---|---|
| Water system name | Population served | Mean RRMCL | Letter grade |
|
| |||
| Ambler Boro Water Dept | 20,000 | 0.02 | A |
| Lansford Coaldale Jt Water Aut | 9,300 | 0.02 | A |
| Paw Bangor District | 9,008 | 0.02 | A |
| Erie City Water Authority | 180,000 | 0.03 | B |
| York Water Co | 159,623 | 0.04 | B |
| North Penn Water Authority | 82,822 | 0.03 | B |
| Pittsburgh Water & Sewer Auth | 250,000 | 0.06 | C |
| West View Boro Muni Auth | 200,000 | 0.05 | C |
| Westmd Mun Auth-Sweeney Plant | 140,000 | 0.06 | C |
| Philadelphia Water Department | 1,600,000 | 0.07 | D |
| Pa Amer Water Co-Pittsburgh | 507,675 | 0.08 | D |
| City Of Lancaster | 120,000 | 0.09 | D |
| Aqua Pa Main System | 784,939 | 0.06 | F |
| Easton Area Water System | 93,400 | 0.05 | F |
| Capital Region Water | 66,540 | 0.06 | F |
Notes: Letter grades assigned based on average result relative to MCL for all contaminants available since 2010 (reported in the mean RRMCL column): the cleanest 25% public water systems are assigned grade “A,” the next 25% are assigned grade “B,” and so forth.
Table 9:
Possible effects of an information provision intervention - assigning letter grades to systems on consumer confidence reports.
| (1) | (2) | (3) | |
|---|---|---|---|
| A or B | C, D, or F | Overall | |
|
| |||
| Panel A: Low birth weight | |||
| Actual rates | 5.73 | 6.90 | 6.67 |
| After policy change | 5.73 | 6.60 | 6.43 |
|
| |||
| Panel B: Pre-term birth | |||
| Actual rates | 7.28 | 8.33 | 8.13 |
| After policy change | 7.28 | 7.81 | 7.71 |
|
| |||
| Number of births | 242,388 | 1,014,850 | 1,257,238 |
Notes: We assume that people in systems assigned to letter grades C, D, or F engage in avoidance behavior and obtain water quality at the average for letter grade A systems. Estimates give an idea of the magnitude of public health gains that might be obtained from adopting a consumer confidence report simplification policy.
Robustness to sample restrictions based on MCL violations
Recall that throughout our analysis, we excluded births exposed to MCL violations. In the Online Appendix, we also re-estimate our models with two additional alternative sample restrictions: we run the analysis keeping births that are exposed to MCL violations, and we run the analysis removing births with any samples exceeding the MCL threshold. The latter analysis is motivated by the complexity of the regulatory environment, in that in many cases violations are triggered only when a particular average of samples exceeds the regulatory threshold (e.g., TTHM and HHA5 use running averages), and a concern that the MCL violation data may be missing some violations. See the Online Appendix sections “Including Births Exposed to MCL Violations” and “Removing Births With Any Samples Exceeding MCL” for further details on these alternative samples. We also include models estimating the effects of MCL violations on birth outcomes in the Appendix section “Effects of MCL Violations on Birth Outcomes.”
Our findings are robust to including births exposed to MCL violations (Online Appendix Tables A13 and A14). This may not be surprising since our sample size increases by only about 3 to 5% (depending on specification) by adding these births. More surprisingly, our findings are robust to removing births exposed to samples exceeding MCL thresholds (Online Appendix Tables A15 through A20). In conducting these further restrictions, we document in the Appendix section “Removing Births With Any Samples Exceeding MCL” that four contaminants very often have single samples that exceed their respective MCL (Coliform, TTHM, HAA5, and lead), and this makes sense because none of these contaminants have violations driven by single-sample exceedances. Our findings are robust to several different ways to treating these four contaminants, which is an important consideration since if we drop births exposed to any samples with exceedances for these contaminants, we lose about 85% of our data. In any case, this robustness check is intended to simulate a stricter definition of MCL violation and reduces sample sizes by 25% to 85% depending on specification. That our results are robust to these stricter definitions of “compliant drinking water” suggests that there are indeed statistically significant effects of water contamination on birth outcomes, even when systems are in compliant with regulations, or all sample concentrations are below the regulatory threshold.
Robustness to restricting to MFE sample using CWS FE
In the main specifications, using mother fixed-effects (MFE) rather than community water system fixed-effects (CWS FE) often appreciably and significantly changed the effect estimates (e.g., the differences between the CWS FE and MFE subpanels in Table 3). In fact, this is the main way in which our results are sensitive, since the addition of mother and birth control variables does little to change our estimates.
In the Online Appendix, we present CWS FE and MFE estimates sub-setting throughout to the sample used in the MFE specifications. Our results are robust in Appendix Tables A21 and A22. Thus, the differences between the CWS FE and MFE specifications are due to differences in the models we fit, not due to differences in the samples we employ.
Study limitations
Our estimates may be affected by a few forms of exposure misclassification, which is common in environmental health research. First, our index is created from the measured contamination during gestation periods. The regulations provide monitoring schedules that vary by size of system,xliv season, and contaminant (among others). For some contaminants, systems are required to measure monthly, quarterly, annually, every 3 years, and in some cases every 6 years. Unfortunately, this temporal sparsity of the data leads to unobserved contamination levels during gestation for some contaminants that make up our index for any individual pregnancy. We chose not to impute contamination because we do not believe we have adequate information to predict contamination over time across systems. Despite this limitation, we believe using the sampling data provides more temporal frequency than the MCL violations data as they only report violations or not in each monitoring period (approx. quarterly) but when there is no violation, the researcher doesn’t observe whether it is a true zero (measured) or just no violation in the monitoring period (which leads to censoring). Second, our estimates are dependent on the accuracy of the technology measuring the concentrations for CWS. In particular, in the Pennsylvania data, non-detect values are treated as zero, when in fact, the technology may have a minimum detection limit (MDL) such that we cannot be confident that the concentrations are in fact zero. In Appendix section “Non-Detection and Limits of Measurement,” we examine how replacing non-detects with the MDL changes the distribution of our index, and furthermore show that Pennsylvania MDL are typically lower than California’s, suggesting Pennsylvania is a decent setting to conduct this study (Appendix Tables A4 through A6). Third, we rely on water system service areas to define exposure to CWS and assign water quality during gestation at the system-level to each mother residing in that water system. There could be exposure misclassification if the water ingested by an individual mom is more or less contaminated than what is measured prior to the water being released into the distribution system. Ideally, we would have distribution maps and be able to assign samples for sub-areas of the service area. Unfortunately, these distribution maps are not available and this is a data limitation.xlv Despite this limitation, we believe that this is analogous to what is used in air quality studies extrapolating monitor or satellite data (Currie & Walker, 2019) or water quality studies at the county or city level (Flynn & Marcus, 2023; D. S. Grossman & Slusky, 2019). However, we are using system boundaries providing sub-county variation in water quality, which should improve upon existing approaches that use county-level variation (McDonald et al., 2022). Finally, in our analysis, we used a singular vintage of the water system boundaries. In the Appendix, we checked to see if there was any correlation between systems that changed over time and the water quality index. See Appendix section “Community Water System Boundaries and Linking Births to Water Quality” for details. We found that while there were some changes over time, they are minimally associated with our measure of water quality (Appendix Table A7). We hold the boundaries constant over our study period to ensure that factors that might influence changing systems (e.g., population growth, population decline, water contamination, etc.) are not influencing our assignment of treatment.
DISCUSSION
Policymakers have increasing begun using summary indices to regulate or improve communication of quality in various sectors for consumers. For example, Centers for Medicaid and Medicare Services (CMS) has promoted Hospital Compare, Nursing Home Compare, and Medicare Care Compare as mechanisms for improving health care quality. Most related to our context, the EPA has long provided the U.S. Air Quality Index (AQI; updates by metro area daily) and the AQI is now reported in AirAlerts communicating to the public when air quality could be risky for at-risk individuals. State departments of education provide school report cards and provide summary measures of quality using “grades.” Researchers have studied responses by consumers and suppliers (e.g., providers, schools) to use of these indices for communicating quality. For example, AQI and communicating poor air quality has been shown to change participation in outdoor activities (Wen et al., 2009; Zivin & Neidell, 2009) and following star-ratings for Nursing Home Compare, low scoring nursing homes lost demand and high scoring nursing homes experienced an increase in demand (Werner et al., 2016). With recent scrutiny on water quality, use of a water quality index to communicate more frequent water quality information to society could be beneficial for avoiding poor outcomes for the most sensitive groups.xlvi
Figure 2 displays an excerpt of Philadelphia’s 2017 Consumer Confidence Report (CCR), which was provided to consumers of Philadelphia water to inform them of water quality in the prior year. Sample result data in the report is displayed by contaminant, for example, the highest level of Barium reported in Philadelphia’s water in 2017 was 0.044ppm. Except for reported regulatory violations, which are rare, consumers are unlikely to extract actionable information from current Consumer Confidence Reports. Motivated by our results, we propose that regulators consider providing a summative, index measure of water quality for CWS, to be displayed on annual CCRs or to be used more frequently as water alerts to community members.
Figure 2:

Philadelphia Water Consumer Confidence Report summary results, 2017.
We run a simple simulation to provide an example policy counterfactual, if systems were to adopt our overall index measure. Our example accountability policy takes our index measure, measured at the birth level, and aggregates to the CWS level by averaging the measure for all births (below the 99th percentile of contamination) since the start of 2010 through the end of 2014.xlvii This gives us a “mean result relative to MCL” or “mean RRMCL” measure. We then split systems into letter grades, A through F, as follows. We begin by splitting up the CWS-level mean RRMCL measure into four quartiles and assigning letter grades A through D to those quartiles. Then, if a system receives any MCL violation (or for lead and copper, any trigger of the action level) over the 2010 through 2014 period, we assign the system to grade F. Several large water systems with their letter grades assigned according to our method are displayed in Table 8.
In Table 9, we display results from a counterfactual simulation, where we assume that consumers in systems with letter grades C, D, or F engage in avoidance behavior or lobby their water systems for better water treatment, and these responses ultimately move their effective water quality to the mean quality for systems of letter grade A. The table displays back-of-the-envelope estimated effects on birth outcomes.xlviii To put these numbers in context, there are about 150,000 singleton births in Pennsylvania each year, and about 10,005 are low birth weight (6.67 out of 100); the proposed policy could reduce this to about 9,645 low birth weight infants, thus, about 360 infants per year would avoid being low birth weight (3,960 over our 11-year period). Similarly, for preterm birth, there are about 12,195 such infants born in Pennsylvania each year; the policy could reduce this to 11,565, thus preventing 630 preterm births annually (6,930 over 11 years). This policy could save around $290 million in total costs for PTB and LBW associated hospitalizations and medical services before age 9.xlix Given the large human capital costs of poor birth outcomes for later life education outcomes, labor market outcomes, and life expectancy (Almond et al., 2018; Black et al., 2007), our estimates clearly have economic and policy significance.
Based on this simulation, we argue that providing a grade for water systems could be an improvement on the current information provided to consumers. We must acknowledge, however, that while this provides more information for the consumer, it puts the burden of preventing exposure on the consumer and may disproportionately affect low-income families. Research on water systems in California, however, suggests that water systems may not be removing contaminants even after regulatory violations (Grooms, 2016). Given this imperfect enforcement, we do believe that adding a summary index (after further validation) to CCR or used in water quality alerts could further protect public health at the very least by informing consumers of water quality in their local systems. For example, the EPA’s Air Alerts indicate levels of concern as well as which populations might be most at risk and should avoid exposure. The actual implementation of this information would need to take into account equity and environmental justice concerns, much like air quality alerts where low-income families may not have air conditioning, access to air filters or be able to avoid exposure due to employment that requires spending time outdoors.
The index that we used in this study is but one possible summary measure, and we encourage researchers and policymakers to consider alternatives. Our index, or an alternative water quality index, could be used to study other health outcomes and other populations that could be at risk of adverse outcomes from exposure to drinking water contamination (e.g., children, older adults). As research causally links various water quality indices that may be relevant for particular populations (such as our reproductive index described in the section “Reproductive Health Water Quality Index”) or how our overall index affects the health of other populations, “water alerts” could be targeted to specific populations at risk with enough research and validation to support such implementation.
We view our letter grading policy proposal as a way to consider the potential benefits of improving water quality, and our study is not a formal cost-benefit analysis of safe drinking water policies. The cost-benefits literature on water contamination is relatively new, and a recent evaluation of the Clean Water Act suggested that its benefits did not exceed its costs (Keiser & Shapiro, 2019a). One approach to assess costs in our context might be to assume that households make bottled water purchases to replace tap water with higher quality water, but it is unclear whether bottled water is a sufficient substitute for quality tap water. In particular, the persistent and large negative house price effects of the Flint water crisis found in Christensen et al. (2023), despite 3 years of free bottled water provision and several programs to distribute free water filters and filter cartridges, suggests that private goods are imperfect substitutes for quality tap water.
Our results using a water quality index have two additional implications for water regulation. The scientific literature has suggested that certain co-occurring contaminants that are not regulated by SDWA and therefore are unmeasured in our data could be explaining some of the effects we observe (Bradley et al., 2018). Using this index could be helpful, given regulators cannot monitor all contaminants in drinking water. Furthermore, as briefly discussed in the introduction, SDWA regulates contaminants on a contaminant-by-contaminant basis. Toxicology literature suggests that mixtures of contamination may be more problematic for human health (Braun et al., 2016; Gennings et al., 2018), and people are exposed to mixtures in real-world settings. Our approach of using a summary index could be a first step towards having drinking water policy take into account mixtures.
In this paper, we define compliance as not having a health-based violation as reported to the EPA. Recent work (Baker et al., 2022) has argued that while there is some evidence that violations go unreported to the national EPA, the majority (79%) are accurately reported.l Given that we condition on no health-based violations as reported to the EPA, we join other studies (Allaire et al., 2019) that suggest that improving this reporting could benefit communities health through communicating water quality accurately.li Furthermore, if violations do not result in improved compliance, as Grooms (2016) suggested, our findings could reflect this lack of enforcement resulting in poor birth outcomes.
We address the question of “missed” or unreported violations by crudely removing samples that are above the regulatory threshold for each contaminant (Appendix section “Removing Births With Any Samples Exceeding MCL”). The current regulations do not treat these thresholds as binding for all contaminants, and so our removal of concentrations above any regulatory threshold is much more stringent than the current regulations. We find similar results when we treat thresholds as binding for all contaminants in Appendix Tables A19 and A20. While this suggests that even when concentrations are below the regulatory threshold, water contamination could be impacting infant health, we lose substantial sample and caution that more research is needed to confirm these findings.
For analyses such as those reported in this paper, the lack of regular monitoring for certain contaminants that may be very important for reproductive health outcomes (e.g., lead, uranium) limited our ability to create a reproductive health-specific water quality index containing the full list of contaminants that the EPA has indicated could harm reproductive health or infants.lii Frequent monitoring has costs and the EPA has already considered these costs when defining monitoring schedules for each contaminant. Still, our analysis has limitations (see the section “Robustness to Restricting to MFE Sample Using CWS FE” for discussion) due to the limited frequency of sampling, and this limitation applies to other analyses that may wish to inform regulators.
CONCLUSIONS
In this paper, we studied the effects of drinking water contamination on birth outcomes, using panel data on births within community water systems (CWS) for the state of Pennsylvania. In order to provide the most policy-relevant estimates possible, we restricted the sample to births that were not exposed to a water quality regulatory violation. We found that when we measure water contamination using either an omnibus measure of water quality or a measure removing microbial contaminants, birth outcomes respond negatively to water contamination, even in this no-violation-exposure sample. We find consistent evidence when using a reproductive-health-specific index and when we remove samples below the regulatory threshold (despite a substantial loss in sample size). In addition, we identify small negative effects on fertility. We find little evidence of mobility in response to contamination. This is consistent with our variation in water contamination not triggering significant avoidance behavior, which is intuitive since we have removed births exposed to regulatory violations that trigger public notification. Finally, we argue that current consumer confidence reports provided to consumers by CWS do not provide much information that can be easily understood by the consumer. Our index could be used to characterize the level of contamination in a simpler way and help consumers avoid exposure.
Although our findings are of short-run health consequences of poor water quality, even at levels that do not trigger regulatory violations, long-run effects are not a distant possibility. Our results call for further research on these important environmental and health policy issues. Future studies could repeat our analysis on different samples and fit models that critique and refine our index measures. In addition, previous research has shown that consumer confidence report policy changes can influence water system behavior (Bennear & Olmstead, 2008), while there is weaker evidence that violations can (Grooms, 2016); our findings motivate developing and adding omnibus water quality measures to these consumer confidence reports, which could be similar to letter grading in K–12 education (e.g., Figlio & Lucas (2004)). Innovative accountability policies such as this may be able to improve water quality even for systems that are not in violation of regulatory standards, an outcome that our results suggest could improve public health.
Acknowledgements
For generous feedback, we thank Andrew Boslett, Alina Denham, Dan Grossman, Lala Ma, Michelle Marcus, Steven Martin, David Slusky, Devin Sonne, Mary Willis, Alexis Zavez, participants at the iHEA 2017 and the APPAM 2017 conferences, and participants at the applied reading group seminar at University of Rochester. We thank Karla Zhendejas, Evan Volkin, Mira Chaskes, and Grace Sventek for excellent research assistance. These data were supplied by the Bureau of Health Statistics & Research, Pennsylvania Department of Health, Harrisburg, Pennsylvania. The Pennsylvania Department of Health specifically disclaims responsibility for any analyses, interpretations or conclusions. Thank you to Amy Farrell and James Rubertone of Pennsylvania Department of Health for facilitating access to the data. This research was supported by the Office of the Director of the National Institutes of Health under award number DP5OD021338 (PI Hill). We also gratefully acknowledge funding from the University of Rochester Environmental Health Sciences Center (EHSC), an NIH/NIEHS-funded program (P30 ES001247). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or the New York City Independent Budget Office. The study was reviewed and provided expedited determination by the University of Rochester RSRB. The authors have no relevant or material financial interests that relate to the research described in this paper.
Appendix for Drinking Water Contaminant Concentrations and Birth Outcomes
DATA CONSTRUCTION DETAILS
While the drinking water regulations are often complicated, in general the maximum contaminant level (MCL) can be seen as the maximum amount of contamination allowed in public drinking water before a regulatory intervention takes place. The intervention may be a violation, which generally includes public reporting requirements (e.g. in the case of coliform or nitrate); or it may be a requirepament for a set of precautionary actions (“treatment techniques”) to be taken to reduce the level of contamination in the water for end-users (e.g. in the case of lead and copper). On the other hand, the maximum contaminant level goals (MCLG) are levels below which there is no known or expected health risks (US Environmental Protection Agency, 2018). As shown in Table A1 for ten selected contaminants, in some instances the MCLG are lower than the MCL.
The main two reasons for MCL to exceed MCLG are that the EPA believes that it is excessively costly for community water systems (CWS) to reduce contamination to the MCLG, or that measurement technology is unable to accurately measure contamination closer to the MCLG (US Environmental Protection Agency (2016b)). Since for some contaminants, the reason MCL are not reduced toward MCLG is due to concerns over imprecise measurements for low levels of contamination, this may raise concerns over the quality of our water sample data. We take the position espoused in the EPA internal document (US Environmental Protection Agency (2006)), describing how measures below MCL still contain (albeit noisy) information: “[i]t is recommended that all values between the PQL and MDL be reported. They are real, the concentration is fuzzy, but their values can give indications or trends and should be reported.” Note that in this quote, both the PQL and MDL are below the MCL: the PQL (practical quantitative limit) is the level at which the EPA believes the samples can be used for administrative purposes while the MDL (method detection level) is a lower level below which the sample reading cannot reliably be distinguished from zero. More detail on how MCL are limited by noise in the water quality measures is available in the EPA six-year review technical report (US Environmental Protection Agency (2016a)).
Discussion of Our Main Index Measures
To illustrate our index measures, in Appendix Table A1, we display ten example contaminants in our data, the number of births with that contaminant measured in our analytic sample (column 1), and the across-birth correlation of that contaminant with our measure of overall contamination (column 2) and chemical-only contamination (column 3). Of course, the correlations reported are conditional on the set of births for which the contaminant is indeed measured, with counts reported in column 1. We find that the disinfectant byproducts trihalomethanes, haloacetic acids, and chlorite, are positively correlated with our measure of overall contamination, as is lead, total coliform (an important microbial measure) and turbidity (a general measure of the cloudiness of water). Note that chlorite is listed by the US EPA as potentially having negative impacts on reproductive health, but it is sampled relatively rarely in our data; yet, for the births for which it is sampled, our index appears to be strongly related to this measure. By contrast, however, nitrate and arsenic, two popular measures in the water policy literature (e.g. Grooms, 2016), are not highly correlated with our measures. Atrazine, a controversial herbicide (Rohr and McCoy, 2010) and as of 2001 one of the most common herbicides detected in streams and ground water (Gilliom and Hamilton, 2006), is only weakly positively correlated with our measures.
Table A1:
Ten selected contaminants, correlation with our two contaminant groups, and regulatory and health standards.
| (1) | (2) | (3) | (4) | (5) | |
|---|---|---|---|---|---|
| Name | # of births | Cor w/ all | Cor w/ chem | MCL | MCLG |
| TRIHALOMETHANES | 1,144,050 | 0.179 | 0.209 | 0.080 | – |
| HALOACETIC ACIDS (FIVE) | 1,130,519 | 0.134 | 0.150 | 0.060 | – |
| NITRATE | 1,041,320 | 0.040 | 0.094 | 10.000 | 10.000 |
| CHLORINE | 872,825 | 0.213 | – | 4.000 | 4.000 |
| ARSENIC (IOC) | 741,404 | −0.035 | 0.059 | 0.010 | 0.000 |
| TOTAL COLIFORM | 723,596 | 0.424 | – | 0.050 | 0.000 |
| ATRAZINE (SOC) | 590,628 | 0.120 | 0.140 | 0.003 | 0.003 |
| LEAD | 421,453 | 0.172 | 0.468 | 0.015 | 0.000 |
| TURBIDITY | 106,765 | 0.427 | – | 2.000 | – |
| CHLORITE | 23,247 | 0.394 | −0.037 | 1.000 | 0.800 |
Notes: Table displays the number of births in our analytic sample with a measure on each contaminant (column 1), and the correlation of that measure with the two groups we use in analysis: all in column (2), and chemical-only in column (3). Maximum contaminant limits (MCL) are in column (4) while health goals (MCLG) are in column (5). Dashes (–) indicate that the contaminant is not included in that group, or that there is no MCLG for that contaminant.
For a full list of the contaminants used in our analysis, refer to Appendix Table A2. This table lists all 94 contaminants studied in our analysis, and shows whether it is part of our All or Chem (chemical-only) index measures. It also provides the number of births for which we have non-missing contamination measures (at least one sample during the birth’s gestation period for the CWS at birth) and the total number of drinking water samples for the contaminant in our input data.
Table A2:
Contaminants by group.
| Row | ID | Name | All | Chem | Births | Samples |
|---|---|---|---|---|---|---|
| 1 | 999 | chlorine | ✓ | ✓ | 938671 | 4816645 |
| 2 | 1030 | lead | ✓ | ✓ | 450922 | 93545 |
| 3 | 1022 | copper | ✓ | ✓ | 465965 | 93488 |
| 4 | 2950 | trihalomethanes | ✓ | ✓ | 1239417 | 82338 |
| 5 | 3100 | total coliform | ✓ | 785817 | 77823 | |
| 6 | 1040 | nitrate | ✓ | ✓ | 1117361 | 76439 |
| 7 | 2456 | haloacetic acids (five) | ✓ | ✓ | 1224600 | 74626 |
| 8 | 1006 | chloramine | ✓ | ✓ | 234710 | 44909 |
| 9 | 100 | turbidity | ✓ | 111782 | 40059 | |
| 10 | 2987 | tetrachloroethylene | ✓ | ✓ | 1039128 | 38903 |
| 11 | 2984 | trichloroethylene | ✓ | ✓ | 1039602 | 38891 |
| 12 | 2991 | toluene | ✓ | ✓ | 1039130 | 38751 |
| 13 | 2980 | 1,2-dichloroethane | ✓ | ✓ | 1039073 | 38747 |
| 14 | 2981 | 1,1,1-trichloroethane | ✓ | ✓ | 1039126 | 38743 |
| 15 | 2990 | benzene | ✓ | ✓ | 1038953 | 38736 |
| 16 | 2979 | trans-1,2-dichloroethylene | ✓ | ✓ | 1039541 | 38720 |
| 17 | 2992 | ethylbenzene | ✓ | ✓ | 1037610 | 38695 |
| 18 | 2378 | 1,2,4-trichlorobenze | ✓ | ✓ | 1039042 | 38691 |
| 19 | 2380 | cis-1,2-dichloroethylene | ✓ | ✓ | 1039068 | 38674 |
| 20 | 2977 | 1,1-dichloroethylene | ✓ | ✓ | 1039169 | 38673 |
| 21 | 2964 | dichloromethane | ✓ | ✓ | 1038983 | 38666 |
| 22 | 2982 | carbon tetrachloride | ✓ | ✓ | 1038126 | 38666 |
| 23 | 2985 | 1,1,2-trichloroethane | ✓ | ✓ | 1038971 | 38662 |
| 24 | 2989 | chlorobenzene | ✓ | ✓ | 1038483 | 38661 |
| 25 | 2968 | o-dichlorobenzene | ✓ | ✓ | 1038524 | 38657 |
| 26 | 2969 | para-dichlorobenzene | ✓ | ✓ | 1038518 | 38650 |
| 27 | 2983 | 1,2-dichloropropane | ✓ | ✓ | 1039005 | 38649 |
| 28 | 2955 | xylenes (total) | ✓ | ✓ | 1037830 | 38640 |
| 29 | 2996 | styrene | ✓ | ✓ | 1039347 | 38616 |
| 30 | 1041 | nitrite | ✓ | ✓ | 529308 | 32221 |
| 31 | 2976 | vinyl chloride | ✓ | ✓ | 643919 | 27665 |
| 32 | 2941 | chloroform (thm) | ✓ | ✓ | 311636 | 25217 |
| 33 | 2944 | chlorodibromomethane (thm) | ✓ | ✓ | 309861 | 25145 |
| 34 | 2943 | bromodichloromethane (thm) | ✓ | ✓ | 311525 | 25136 |
| 35 | 2942 | bromoform (thm) | ✓ | ✓ | 308017 | 24730 |
| 36 | 1005 | arsenic (ioc) | ✓ | ✓ | 787789 | 23956 |
| 37 | 2050 | atrazine (soc) | ✓ | ✓ | 635751 | 18226 |
| 38 | 2039 | di (2-ethyl) phthalate (soc) | ✓ | ✓ | 698366 | 17202 |
| 39 | 2037 | simazine (soc) | ✓ | ✓ | 639790 | 16551 |
| 40 | 2051 | alachlor (soc) | ✓ | ✓ | 592562 | 16424 |
| 41 | 1025 | fluoride (ioc) | ✓ | ✓ | 450789 | 16402 |
| 42 | 2042 | hexachlorocyclopentadiene(soc) | ✓ | ✓ | 647018 | 16276 |
| 43 | 2035 | di (2-eth) adipate (soc) | ✓ | ✓ | 654311 | 16184 |
| 44 | 2946 | ethylene dibromide (edb) (soc) | ✓ | ✓ | 596121 | 16124 |
| 45 | 2306 | benzo(a)pyrene (soc) | ✓ | ✓ | 619395 | 16090 |
| 46 | 2326 | pentachlorophenol (soc) | ✓ | ✓ | 623120 | 16069 |
| 47 | 2105 | 2,4 - d (soc) | ✓ | ✓ | 391691 | 15890 |
| 48 | 2931 | 1,2-dribomo-3-chloroprop (soc) | ✓ | ✓ | 582020 | 15752 |
| 49 | 2010 | lindane (soc) | ✓ | ✓ | 566031 | 15515 |
| 50 | 2015 | methoxychlor (soc) | ✓ | ✓ | 542303 | 15421 |
| 51 | 2959 | chlordane (soc) | ✓ | ✓ | 543058 | 15175 |
| 52 | 2040 | piclorem (soc) | ✓ | ✓ | 537392 | 15072 |
| 53 | 2046 | carbofuran (soc) | ✓ | ✓ | 530220 | 14833 |
| 54 | 2036 | oxymal (vydate) (soc) | ✓ | ✓ | 530677 | 14805 |
| 55 | 2033 | endothall (soc) | ✓ | ✓ | 527098 | 14608 |
| 56 | 1010 | barium (ioc) | ✓ | ✓ | 451330 | 13815 |
| 57 | 1015 | cadmium (ioc) | ✓ | ✓ | 452156 | 13624 |
| 58 | 1085 | thallium (ioc) | ✓ | ✓ | 451158 | 13595 |
| 59 | 1075 | beryllium (ioc) | ✓ | ✓ | 450796 | 13540 |
| 60 | 1074 | antimony (ioc) | ✓ | ✓ | 451315 | 13536 |
| 61 | 1035 | mercury (ioc) | ✓ | ✓ | 425068 | 13520 |
| 62 | 1020 | chromium (ioc) | ✓ | ✓ | 452054 | 13499 |
| 63 | 1045 | selenium (ioc) | ✓ | ✓ | 451787 | 13484 |
| 64 | 2034 | glyphosate (soc) | ✓ | ✓ | 264887 | 13452 |
| 65 | 2005 | endrin (soc) | ✓ | ✓ | 314336 | 13449 |
| 66 | 2274 | hexachlorobenzene (soc) | ✓ | ✓ | 314975 | 13440 |
| 67 | 2031 | dalapon (soc) | ✓ | ✓ | 338562 | 13421 |
| 68 | 2065 | heptachlor (soc) | ✓ | ✓ | 310391 | 13402 |
| 69 | 2020 | toxaphene (soc) | ✓ | ✓ | 297146 | 13396 |
| 70 | 1024 | cyanide (free) (ioc) | ✓ | ✓ | 438139 | 13388 |
| 71 | 2110 | 2,4,5 - tp silvex (soc) | ✓ | ✓ | 309427 | 13358 |
| 72 | 2067 | heptachlor epoxide (soc) | ✓ | ✓ | 307172 | 13347 |
| 73 | 2041 | dinoseb (soc) | ✓ | ✓ | 306771 | 13267 |
| 74 | 3001 | heterotrophic bacteria - enumeration | ✓ | 169026 | 13123 | |
| 75 | 2032 | diquat (soc) | ✓ | ✓ | 277141 | 13014 |
| 76 | 2383 | pcbs (soc) | ✓ | ✓ | 252770 | 12321 |
| 77 | 1008 | chlorine dioxide | ✓ | ✓ | 8177 | 12041 |
| 78 | 1009 | chlorite | ✓ | ✓ | 24122 | 11888 |
| 79 | 2063 | 2,3,7,8-tcdd (dioxin) (soc) | ✓ | ✓ | 185862 | 10309 |
| 80 | 4030 | radium-228 | ✓ | ✓ | 256112 | 8483 |
| 81 | 4020 | radium-226 | ✓ | ✓ | 250946 | 7979 |
| 82 | 4006 | combined uranium | ✓ | ✓ | 216562 | 7073 |
| 83 | 4002 | alpha/incl. radon & uranium | ✓ | ✓ | 209737 | 5128 |
| 84 | 2451 | dichloroacetic acid | ✓ | ✓ | 20452 | 2783 |
| 85 | 2452 | trichloroacetic acid | ✓ | ✓ | 20452 | 2781 |
| 86 | 2454 | dibromoacetic acid | ✓ | ✓ | 20452 | 2779 |
| 87 | 2450 | monochloroacetic acid | ✓ | ✓ | 20410 | 2775 |
| 88 | 2453 | monobromoacetic acid | ✓ | ✓ | 20408 | 2774 |
| 89 | 4100 | gross beta particle activity | ✓ | ✓ | 123706 | 2447 |
| 90 | 1094 | asbestos | ✓ | ✓ | 62226 | 2157 |
| 91 | 1011 | bromate | ✓ | ✓ | 26251 | 1839 |
| 92 | 2044 | aldicarb sulfone | ✓ | ✓ | 89377 | 1568 |
| 93 | 2043 | aldicarb sulfoxide | ✓ | ✓ | 89274 | 1567 |
| 94 | 2047 | aldicarb | ✓ | ✓ | 81614 | 1511 |
Notes: All contaminants used in our analysis are included. The “Births” column reports the number of births for which we observe at least one sample of the given contaminant during its gestation.
Mechanisms for effects of water contamination on health
In stark contrast to air pollution, where research generally focuses on a handful of key contaminants that impact health,1 there are over 90 drinking water contaminants covered by the National Primary Drinking Water Regulations (NPDWRs), each contaminant may have different potential health effects (US Environmental Protection Agency, 2018), and the research literature appears to provide little guidance on which contaminants are likely to affect birth outcomes.
Bacterial contaminants are especially likely to have short-term effects such as gastrointenstinal illness. While most bacteria that occur in drinking water are harmless, some, such as those from human or animal wastes, may cause short-term, and potentially severe, illnesses. By contrast, US EPA expects most chemical (non-bacterial) contaminants to have health consequences only after long periods of exposure. These health effects may include cancer, liver and kidney damage, reproductive problems, and/or nervous system effects (US Environmental Protection Agency, 2018).
There is a sizeable epidemiological literature on the effects of water pollution on infant health. A detailed summary of the epidemiology literature up to the mid 2000s is provided in Wigle et al. (2008). Wigle et al. (2008) concludes that there is “limited” evidence that prenatal lead exposure leads to greater chance of preterm birth, while there is “inadequate” evidence that arsenic, cadmium, nitrate, and disinfectant byproducts (DBPs) lead to preterm birth. Moreover, they assess that there is “limited” evidence that lead, DBPs, and nitrate exposure will lead to greater chance of low birth weight, while for arsenic and cadmium there is “inadequate” evidence. All contaminants listed here, and many more, are present in our water quality data.
More recently, epidemiologists have found that arsenic exposure in drinking water (Kile et al. (2016)) and, more generally, exposure to toxic metals (Sanders et al. (2014)) is associated with poorer birth outcomes; epidemiologists continue to debate whether nitrate contamination in drinking water affects birth outcomes or infant health (Manassaram et al. (2006)). While useful contributions to our understanding, these studies as a ruledo notuse rigorous econometric methods toisolate exogenous variation in water quality to identify causal effects; one plausible reason for this is small sample sizes. We describe other studies for the five contaminants we include in the reproductive-specific index in Section “An alternative reproductive health index of five contaminants based on data availability and prior research”.
In summary, we do not have a randomized toxicological study for each covered contaminant on human health, so it is unclear whether some of the chemical contaminants may have short-run effects either alone or when combined with other chemical or bacterial contaminants. Indeed, multipollutant exposure analysis is an emerging subfield in epidemiology, especially in the context of air pollution (e.g. see Oakes et al. (2014), Snowden et al. (2015), and Davalos et al. (2017)). These issues motivate exposure analyses that study the effects of pollution sources (e.g. threats to drinking water, such as industrial facilities near source intakes), rather than pollutants per se. In this sense, the approach we take in this paper is traditional: we study the relationship between measured contaminants and health. We leave studies of the effects of drinking water intake exposures to potential point pollution sources as a direction for future research.
In addition, in spite of the NPDWRs regulating a broad range of contaminants, there are still more potentially harmful chemicals or bacteria that could be present in drinking water. Thus, the measures of contamination we observe may be correlated with other, unobserved measures of contamination, which themselves affect fetal health. For example, the EPA currently has over 100 unregulated water contaminants on their “contaminant candidate list,” and in their six-year reviews, US EPA evaluates whether some of these should be added to currently regulated contaminants (US Environmental Protection Agency (2016a)).
For these reasons, in our main analysis we aggregate across contaminants, construct omnibus measures of water quality, and estimate the effects of these overall measures on birth outcomes.
An alternative reproductive health index of five contaminants based on data availability and prior research
The National Primary Drinking Water Regulation provides a brief overview of the main health-based concerns for each contaminant regulated by SDWA. They identify two indicators that could be used to create an index that is specific to birth outcomes: reproductive difficulties, and health effects for infants. There are also studies within epidemiology and economics that have looked at birth outcomes (such as those studied in this paper, i.e., preterm birth, low birth weight) associated with exposure to contamination in drinking water. Appendix Table A3 summarizes the contaminants identified by the EPA and epidemiological/economic literature.
Our goal was to create a summary index of a smaller subset of contaminants that would be more specific to reproductive health and that would be frequently measured in the CWS in Pennsylvania (see Appendix Table A2 for number of births overlapping with contaminant samples). We identified 5 contaminants that had decent overlap between what the NPDWR identifies and the epidemiology literature: arsenic, atrazine, Di(2-ethylhexyl) Phthalate (DEHP), nitrate, and tetrachloroethylene (PCE or PERC). For arsenic, the epidemiological literature is quite large, yet the EPA does not identify it as a contaminant of concern for pregnancy, reproduction, or infants. We choose to include it due to the large associational epi literature (Myers et al., 2010; Saha et al., 2012; Rahman et al., 2017; Bozack et al., 2018).2 For atrazine, both the EPA and the epi literature showed concerns for birth outcomes (Rinsky et al., 2012; Migeot et al., 2013; Stayner et al., 2017; Almberg et al., 2018; Porpora et al., 2019).3 DEHP is implicated in reproductive difficulties as well as associated with poor birth outcomes such as preterm birth in drinking water, soil, and food (Ferguson et al., 2014)4. Nitrate can be problematic for infants due to blue baby syndrome and has a growing set of studies both in epidemiology and economics suggesting it can be problematic for birth outcomes (Migeot et al., 2013; Stayner et al., 2017; Sherris et al., 2021l; Coffman et al., 2022). 5 Finally, we include tetrachloroehtylene due to a growing epi literature suggesting an association with birth outcomes (Sonnenfeld et al., 2001; Aschengrau et al., 2018, 2020).6. Much of the literature supporting the NPDWR are over 20 years old, and so we decided to include contaminants that had recent support for a possible effect on birth outcomes (Porpora et al., 2019).
It is important to note that while the EPA and epidemiological literature (including economic literature) suggests strong impacts for lead exposure and infant health (Dave and Yang, 2022; Grossman and Slusky, 2019), lead is measured every 3 years per the regulation and so we decided not to include it in this index due to also wanting an index made up of contaminants that are frequently sampled. We chose not to include disenfectant byproducts in our sub-index due to a growing literature suggesting that early research that found associations between DBP and birth outcomes may not have been replicable with recent data (Savitz et al., 2006; Horton et al., 2011; Mashau et al., 2018). Lead and DBPs are in our omnibus index that is used in the main specification.
Non-detection and Limits of Measurement
Recall that non-detect values are reported in the Pennsylvania sampling data we use as zeroes, and we treat them as zeroes throughout our analysis. One concern is that treating these as zeroes inflates the distance between the 10th and the 90th percentile, which we use to standardize our estimates. It is possible that the 10th percentile is a zero and the 90th percentile is very large, due to the nondetects being zeroes. However, while many states have non-detection levels for reporting that are relatively high, Pennsylvania Department of Environmental Protection’s (PA DEP) policy is to have testing laboratories report their sample results as long as they are above the EPA’s designated Method Detection Limit (MDL), which varies depending on contaminant and method of analysis. As a result, Pennsylvania provides an ideal case to study variation in water quality conditional on regulatory compliance, since less data is suppressed than in many other states. For supporting evidence we compare PA DEP’s reporting thresholds with California’s for a subset of contaminants for which we can do this, and find that PA DEP’s reporting thresholds are much lower. See Tables A4 and A5. PA DEP’s reporting thresholds can be found in documents posted online at http://www.depgreenport.state.pa.us/elibrary/GetFolder?FolderID=690145. California’s detection limits for purposes of reporting can be found at https://www.waterboards.ca.gov/drinking_water/certlic/drinkingwater/documents/mclreview/mcls_dlrs_phgs.pdf.
Table A3:
Drinking water contaminants associated with birth outcomes and reproductive difficulties.
| NPDWR Indicated Health Risks | Epi Lit | ||
|---|---|---|---|
| Contaminant | Reproductive | Infant | Birth Outcomes |
| 1,2-Dibromo-3- chloropropane (DBCP) | x | ||
| Arsenic | x | ||
| Atrazine | x | x | |
| Benzo(a)pyrene (PAHs) | x | ||
| Carbofuran | x | ||
| Chlorine dioxide | x | ||
| Chlorite | x | ||
| Di(2-ethylhexyl) adipate | x | ||
| Di(2-ethylhexyl) phthalate (DEHP) | x | x | |
| Dinoseb | x | ||
| Dioxin (2,3,7,8-TCDD) | x | ||
| Disenfectant Byproducts (DBP) | x | ||
| Ethylene dibromide | x | ||
| Fecal coliform and E. coli | x | ||
| Glyphosate | x | ||
| Hexachlorobenzene | x | ||
| Lead | x | x | |
| Methoxychlor | x | ||
| Nitrate | x | x | |
| Nitrite | x | ||
| Polychlorinated biphenyls (PCBs) | x | ||
| Tetrachloroethylene | x | ||
| Trichloroethylene | x | ||
Notes: This table lists contaminants identified in the National Primary Drinking Water Regulation Table as having either an impact on reproduction or infants. We also identify some contaminants that have a large literature within epidemiology (or economics, i.e., lead) studying the association between exposure in drinking water and birth outcomes. Some of these contaminants have mixed results in this literature. We bold the 5 contaminants that we include in our 5-contaminant reproductive-specific water quality index.
Table A4:
Comparing non-detect levels in PA and CA.
| ID | Name | Method | PA DLR = EPA MDL | CA DLR | % Difference |
|---|---|---|---|---|---|
| All VOCs | All VOCs | All Methods | 0.0005 | 0.0005 | 0 |
| 1074 | Antimony | ASTM D3697 | 0.001 | 0.006 | −83.33 |
| 1074 | Antimony | SM 3113 B | 0.003 | 0.006 | −50.00 |
| 1074 | Antimony | EPA 200.9 | 8e-04 | 0.006 | −86.67 |
| 1074 | Antimony | EPA 200.8 | 4e-04 | 0.006 | −93.33 |
| 1005 | Arsenic | SM 3114 B; ASTM D1972 | 0.001 | 0.002 | −50.00 |
| 1005 | Arsenic | SM 3113 B; ASTM D2972 | 0.001 | 0.002 | −50.00 |
| 1005 | Arsenic | EPA 200.9 | 5e-04 | 0.002 | −75.00 |
| 1005 | Arsenic | EPA 200.8 | 0.0014 | 0.002 | −30.00 |
| 1010 | Barium | SM 3111 D | 0.1 | 0.1 | 0.00 |
| 1010 | Barium | SM 3113 B | 0.002 | 0.1 | −98.00 |
| 1010 | Barium | EPA 200.7; SM 3120 B | 0.002 | 0.1 | −98.00 |
| 1075 | Beryllium | SM 3113 B; ASTM D3645 | 2e-04 | 0.001 | −80.00 |
| 1075 | Beryllium | EPA 200.9 | 2e-05 | 0.001 | −98.00 |
| 1075 | Beryllium | EPA 200.7; SM 3120 B | 3e-04 | 0.001 | −70.00 |
| 1075 | Beryllium | EPA 200.8 | 3e-04 | 0.001 | −70.00 |
| 1015 | Cadmium | SM 3113 B | 1e-04 | 0.001 | −90.00 |
| 1015 | Cadmium | EPA 200.7 | 0.001 | 0.001 | 0.00 |
| 1020 | Chromium | SM 3113 B | 0.001 | 0.01 | −90.00 |
| 1020 | Chromium | EPA 200.7; SM 3120 B | 0.007 | 0.01 | −30.00 |
| 1024 | Cyanide, free | SM 4500-CN- C, E; ASTM D2036A | 0.02 | 0.1 | −80.00 |
| 1024 | Cyanide, free | EPA 335.4 | 0.005 | 0.1 | −95.00 |
| 1024 | Cyanide, free | SM 4500-CN- C, F | 0.05 | 0.1 | −50.00 |
| 1024 | Cyanide, free | SM 4500-CN- C, G; ASTM D2036B | 0.02 | 0.1 | −80.00 |
| 1024 | Cyanide, free | Kelada-01 | 5e-04 | 0.1 | −99.50 |
| 1024 | Cyanide, free | QuikChem 10-204-00-1-X | 6e-04 | 0.1 | −99.40 |
| 1024 | Cyanide, free | ASTM D6888; OIA-1677, DW (ALPKEM) | 5e-04 | 0.1 | −99.50 |
| 1035 | Mercury | EPA 245.1; SM 3112 B; ASTM D3223 | 2e-04 | 0.001 | −80.00 |
| 1035 | Mercury | EPA 245.2 | 2e-04 | 0.001 | −80.00 |
| 1036 | Nickel | SM 3113 B | 0.001 | 0.01 | −90.00 |
| 1036 | Nickel | EPA 200.9 | 6e-04 | 0.01 | −94.00 |
| 1036 | Nickel | EPA 200.7; SM 3120 B | 0.005 | 0.01 | −50.00 |
| 1036 | Nickel | EPA 200.9 | 5e-04 | 0.01 | −95.00 |
| 1045 | Selenium | SM 3114 B; ASTM D3859 | 0.002 | 0.005 | −60.00 |
| 1045 | Selenium | SM 3113 B; ASTM D3859 | 0.002 | 0.005 | −60.00 |
| 1085 | Thallium | EPA 200.9 | 7e-04 | 0.001 | −30.00 |
| 1085 | Thallium | EPA 200.8 | 3e-04 | 0.001 | −70.00 |
| 1094 | Asbestos | EPA 100.1,100.2 | 0.01 | 0.2 | −95.00 |
| 1040 | Nitrate | SM 4500-NO3- E; ASTM D3867 | 0.01 | 0.4 | −97.50 |
| 1040 | Nitrate | EPA 300.0, 300.1; SM 4110 B; ASTM D4327; B-1011 (Waters) | 0.01 | 0.4 | −97.50 |
| 1040 | Nitrate | SM 4500-NO3- D; 601 (ATI Orion) | 1 | 0.4 | 150.00 |
| 1040 | Nitrate | EPA 353.2; SM 4500-NO3- F; ASTM D3867 | 0.05 | 0.4 | −87.50 |
| 1040 | Nitrate | ASTM D6508 | 0.076 | 0.4 | −81.00 |
| 1041 | Nitrite | SM 4500-NO3- E; ASTM D3867 | 0.01 | 0.4 | −97.50 |
| 1041 | Nitrite | EPA 300.0, 300.1; SM 4110 B; ASTM D4327; B-1011 (Waters) | 0.004 | 0.4 | −99.00 |
| 1041 | Nitrite | EPA 353.2; SM 4500-NO3- F; ASTM D3867 | 0.05 | 0.4 | −87.50 |
| 1041 | Nitrite | SM 4500-NO2- B | 0.01 | 0.4 | −97.50 |
| 1041 | Nitrite | ASTM D6508 | 0.103 | 0.4 | −74.25 |
However, notwithstanding PA DEP’s data advantages, it is still theoretically possible that the currently-used 10th to 90th percentile difference is inflated due to the “jump to zero” effect of detection limits. In order to investigate whether this concern is empirically relevant, we provide below our index measure when we use the zeroes provided in the data (which are below EPA MDL) versus when we replace cases below the MDLs with the corresponding EPA MDL (for the contaminant and method combination). We merged the contaminant method detection limits provided by PA with our data, obtaining data on 69 contaminants. To be specific, to construct the following table, for simplicity we (1) took the maximum MDL by contaminant (because contaminants have multiple possible methods of analysis, there are multiple MDLs for each contaminant), then (2) for every birth that is exposed to an average level of contamination (for a particular contaminant) below the MDL, we replace that birth’s average level of contamination with the MDL (since their low level of contamination must be driven by having some nondetects, which are treated as zero). Furthermore, note that after we replace these birth-specific contamination measures in this way, our index measures will increase for every birth, since our index measures are averaged over contaminants including those with replaced-with-MDL values, and those without replaced-with-MDL values. As a result, not only does the 10th percentile rise, but also the 90th percentile will rise, as the entire distribution of contamination shifts to the right (based on how many nondetects each birth experiences during gestation).
Appendix Table A6 shows our findings from this exercise. For these 69 contaminants, the 10th-to-90th percentile difference (for these contaminants) was 0.03 (0.03 – 0) prior to the replacewith-MDL (Panels A1 and B1), but rises to 2.19 (2.25 – 0.06) after replace-with-MDL (because Coliform is not one of the contaminants on the list, there is no difference between the All and Chemical rows in this table). Our interpretation is that, were we to try to adjust our analysis by replacing non-detects with the detection limit, this would in fact increase the 10th-to-90th difference that we use, so our estimates are currently reported in a way that makes them look conservative.
Community Water System Boundaries and Linking Births to Water Quality
Recall that we calculate the average exposure to contamination for each birth during gestation for the system where the mother resides on the birth certificate. Except for differences in gestation length, this means that the contamination measure is the same for two births born on the same date in the same water system.
This raises the concern that there could be within water system heterogeneity in water quality. The water samples we have are matched to water system, but not at any more granular level, and there are a few very large (population) water systems in Pennsylvania. Consider two of the larger systems: the Philadelphia Water Department and the Pittsburgh Water and Sewer Authority.
The Philadelphia Water Department is, roughly-speaking, two water distribution systems, one that intakes from the Delaware River (eastern side of city) and one that intakes from the Schuylkill River (western side of the city). See page 6 of https://water.phila.gov/pool/files/2019-pwd-water-quality-report.pdf for details; we reproduced the map from that page below in Figure A1.
While we analyze this water system as one system, there are indeed two systems involved here, and, actually, three treatment plants (two sourcing from the Schuylkill), although there are places of overlap that would complicate analysis even if we had more granular data. Our sampling data, which we average over the samples reported by the system (at different and unknown to us distribution plants or, in some cases, different points in the distribution) thus represents only a proxy for the water quality experienced by each individual, and not a perfect measure.
Table A5:
Comparing non-detect levels in PA and CA, continued.
| ID | Name | Method | PA DLR = EPA MDL | CA DLR | % Difference |
|---|---|---|---|---|---|
| 4002 | Gross Alpha particle | All methods | 3 | 3 | 0.00 |
| 4020 | Radium-226 | All methods | 1 | 1 | 0.00 |
| 4030 | Radium-228 | All methods | 1 | 1 | 0.00 |
| 4006 | Uranium | All methods | 1 | 1 | 0.00 |
| 4100 | Gross beta | All methods | 4 | 4 | 0.00 |
| 4174 | Strontium-90 | All methods | 2 | 2 | 0.00 |
| 4102 | Tritium | All methods | 1000 | 1000 | 0.00 |
| 2051 | Alachlor | All methods | 0.00044 | 0.001 | −56.00 |
| 2050 | Atrazine | All methods | 0.00022 | 5e-04 | −56.00 |
| 2306 | Benzo(a)pyrene | All methods | 4.4e-05 | 1e-04 | −56.00 |
| 2046 | Carbofuran | All methods | 0.000198 | 0.005 | −96.04 |
| 2959 | Chlordane | All methods | 0.00044 | 1e-04 | 340.00 |
| 2105 | 2,4-D | All methods | 0.00022 | 0.01 | −97.80 |
| 2031 | Dalapon | All methods | 0.0022 | 0.01 | −78.00 |
| 2931 | Dibromochloropropane (DBCP) | All methods | 4.4e-05 | 1e-05 | 340.00 |
| 2035 | Di(2-ethylhexyl)adipate | All methods | 0.00132 | 0.005 | −73.60 |
| 2041 | Dinoseb | All methods | 0.00044 | 0.002 | −78.00 |
| 2032 | Diquat | All methods | 0.00088 | 0.004 | −78.00 |
| 2033 | Endothall | All methods | 0.0198 | 0.045 | −56.00 |
| 2005 | Endrin | All methods | 2.2e-05 | 1e-04 | −78.00 |
| 2946 | Ethylene dibromide (EDB) | All methods | 2.2e-05 | 2e-05 | 10.00 |
| 2034 | Glyphosate | All methods | 0.0132 | 0.025 | −47.20 |
| 2065 | Heptachlor | All methods | 8.8e-05 | 1e-05 | 780.00 |
| 2067 | Heptachlor epoxide | All methods | 4.4e-05 | 1e-05 | 340.00 |
| 2274 | Hexachlorobenzene | All methods | 0.00022 | 5e-04 | −56.00 |
| 2042 | Hexachlorocyclopentadiene | All methods | 0.00022 | 0.001 | −78.00 |
| 2010 | Lindane | All methods | 4.4e-05 | 2e-04 | −78.00 |
| 2015 | Methoxychlor | All methods | 0.00022 | 0.01 | −97.80 |
| 2036 | Oxymal (Vydate) | All methods | 0.0044 | 0.02 | −78.00 |
| 2326 | Pentachlorophenol | All methods | 8.8e-05 | 2e-04 | −56.00 |
| 2040 | Picloram | All methods | 0.00022 | 0.001 | −78.00 |
| 2037 | Simazine | All methods | 0.000154 | 0.001 | −84.60 |
| 2063 | 2,3,7,8-TCDD (Dioxin) | All methods | 1.1e-08 | 5e-09 | 120.00 |
| 2020 | Toxaphene | All methods | 0.0022 | 0.001 | 120.00 |
| 2110 | 2,4,5-TP (Silvex) | All methods | 0.00044 | 0.001 | −56.00 |
| 2383 | PCBs | All methods | 0.00022 | 5e-04 | −56.00 |
| 1009 | Chlorite | All methods | 0.02 | 0.02 | 0.00 |
Table A6:
Distribution of drinking water contamination experienced by Pennsylvanian births during gestation – 69 contaminants for which we observe thresholds for reporting by the PA DEP.
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | |
|---|---|---|---|---|---|---|---|---|---|
| Contaminant | Mean | SD | 10th | 25th | 50th | 75th | 90th | Max | Births |
| Panel A1: Trimmed sample - nondetects are zero (births used in analysis) | |||||||||
| All | 0.01 | 0.03 | 0.00 | 0.00 | 0.01 | 0.02 | 0.03 | 0.28 | 1,148,370 |
| Chemical | 0.01 | 0.03 | 0.00 | 0.00 | 0.01 | 0.02 | 0.03 | 0.28 | 1,148,370 |
| Panel B1: Raw sample - nondetects are zero (including births not used) | |||||||||
| All | 0.02 | 0.32 | 0.00 | 0.00 | 0.01 | 0.02 | 0.03 | 230.00 | 1,160,108 |
| Chemical | 0.02 | 0.32 | 0.00 | 0.00 | 0.01 | 0.02 | 0.03 | 230.00 | 1,160,108 |
| Panel A2: Trimmed sample - nondetects replaced with nondetect limit (births used in analysis) | |||||||||
| All | 0.44 | 0.86 | 0.06 | 0.08 | 0.10 | 0.13 | 2.25 | 4.57 | 1,148,271 |
| Chemical | 0.44 | 0.86 | 0.06 | 0.08 | 0.10 | 0.13 | 2.25 | 4.57 | 1,148,271 |
| Panel B2: Raw sample - nondetects replaced with nondetect limit | |||||||||
| All | 0.52 | 2.14 | 0.06 | 0.08 | 0.10 | 0.13 | 2.28 | 230.00 | 1,160,108 |
| Chemical | 0.52 | 2.14 | 0.06 | 0.08 | 0.10 | 0.13 | 2.28 | 230.00 | 1,160,108 |
Figure A1:

Philadelphia water distribution system map
Turning to a second example, the Pittsburgh Water and Sewer Authority sources its water entirely from the Allegheny River, and has one water treatment plant (https://www.pgh2o.com/your-water/water-quality-treatment). In this case the concern appears less warranted.
We thus recognize inherent measurement limitations in linking mother exposure to water quality using CWS of residence. To address this concern, we include a sensitivity analysis where we run our main analysis, dropping large systems. We include the results in Appendix Tables A23 and A24 (later in this Appendix). We did this by sorting births by system size (number of people served by the CWS) and then dropping the 50% of births in our sample in the largest systems (those in the bottom of the dataset). The largest system in this restricted sample serves 118,000 people (contrast with the largest system, Philadelphia Water Department, which serves 1,600,000 people), and thus this is a meaningful restriction. As the tables show, our main results are robust to this sample restriction.
A second concern about the births to CWS linkage we conduct is that the boundaries of water systems change over time. As mentioned in the main text, the vintage of the community water service map we use as the 2016, month 4, version of the data. We used this vintage since it was the latest available at the start of our research. However, PennState University’s Pennsylvania Spatial Data Access (PASDA), the source of the data, has versions of this data from 2011 to 2022 (https://www.pasda.psu.edu/download/dep/historic/PublicWaterSupply/).
This suggests the possibility of using multiple versions of the boundary data to better measure spatial relationship between mothers and water providers. But since 2011 is near the end of our sample frame (2003–2014), the range of data that is available is of limited use for longitudinal analysis in our study. An additional concern we have about studying multiple years of these service boundaries is it is unclear to us whether the changes from year-to-year are corrections of service boundaries reported, or whether they are true additions or subtractions. With this in mind, the following two maps, in Appendix Figures A2 and A3, compare the 2011/04 vintage with 2016/04 (first map), and 2016/04 with 2022/10 (second map). Blue areas are CWS coverage gains, while orange areas are CWS losses.
Clearly there are some significant changes in these boundaries, but it is less clear whether these are true changes or measurement error (corrections of the maps, or mistakes in reporting).
To shed further light on these issues, we argue that concerns regarding boundary changes could take two forms: (1) if boundaries change randomly (measurement error for example), then that will add noise into our independent variable as births will not be being correctly matched to contamination. This classical measurement error would attenuate our effect estimates. (2) Boundary changes could be an omitted variable (confounder), that is, they could be systematically correlated with contamination and outcomes, and could thus bias our estimates. The two maps we provided above suggest that boundary changes are likely to be correlated with the demographics of the system, as it is the suburban and rural areas that expand or contract, not the major cities. As a result, we admit the possibility that boundary changes may be correlated with outcomes (through socioeconomic status, for example). We instead focus on the question of whether boundary changes are correlated with levels of CWS contamination.
For this auxiliary analysis, we examined CWS level average contamination (taken over the last nine months) for our two omnibus measures, all and chemical-only RRMCL, averaged over the contaminants for which we have samples. We then regressed the change in CWS contamination from 2011/04 to 2015/12 (the first and last month available in our sampling data) on (a) whether the CWS had any meaningful net increase in area (i.e., above the very trivial threshold of 100 square meters) or decrease (i.e. below the threshold of 100 square meters lost) (columns (1) and (3) in the table below), with the excluded group of no meaningful change; and (b) the net change in square kilometers (columns (2) and (3) in the table below). The results of these four regressions we include in the table below. As the table shows, there is no statistically significant relationship between the change in CWS contamination over this period, and CWS boundary changes (though, note that column 4 is statistically significant at the 10% level). This suggests that boundary changes are unlikely to be an omitted variable, since they are not strongly related to CWS contamination levels.
Figure A2:

Changes in PWS boundaries 2011 compared to 2016.
Notes: Blue areas are CWS coverage gains, while orange areas are CWS losses.
SENSITIVITY ANALYSES
This section repeats our main analysis using different sample restrictions and different indices of contamination. Regressions estimating the effects of water contamination on birth outcomes are repeated for each subsample or index definition.
Figure A3:

Changes in PWS boundaries 2016 compared to 2022.
Since much of our sensitivity analysis rests on sample restrictions, we begin by more fully comparing our analytic sample to the full population of births (including those not matched to water systems, plural, and exposed to MCL). To do so, we add an additional column to our summary statistics table (Table 2 in the main text) which includes all births. This is included as Appendix Table A8. (We dropped some columns from this version of the summary statistics table to conserve space.)
Note that in the main text, we exclude plural births. The first change in sample we explore is including plural births. Results are in Appendix Tables A9 and A10.
Concern over to what extent our results are driven by mother mobility motivates the second change in sample: we re-run our models, excluding all mothers (with multiple births) that ever switch a CWS across births. Results are shown in Appendix Tables A11 and A12.
Interest in how our results would change if we keep births exposed to MCL violations, or remove births with high samples more rigorously, motivates our third and fourth sets of sensitivity analyses. We re-run the models including births exposed to MCL, and re-run the models excluding births linked to any samples exceeding the corresponding MCL. For this last sensitivity analyses, four contaminants present a challenge due to samples very often exceeding their MCLs, so we further estimate models that drop these four contaminants from our index construction, and models that keep these four contaminants (with the concomitant substantial reduction in sample size). Our results are included in Appendix Tables A15 through A20.
Table A7:
Change in contamination correlated with changes in CWS boundaries.
| (1) | (2) | (3) | (4) | |
|---|---|---|---|---|
| ΔAll | ΔAll | ΔChemical | ΔChemical | |
| Meaningful Increase | 0.0353 (0.0247) | 0.00839 (0.00755) | ||
| Meaningful Decrease | 0.00406 (0.0238) | −0.00868 (0.0120) | ||
| Net Δ Square Km | 0.000225 (0.000610) | 0.000436* (0.000251) | ||
| Constant | −0.0108 (0.00852) | −0.00688 (0.00763) | 0.00314 (0.00311) | 0.00349 (0.00278) |
| N | 1546 | 1546 | 1546 | 1546 |
Notes: Standard errors in parentheses
p < 0.10
p < 0.05
p < 0.01
Concern over our sample changing between the mom FE models and the CWS FE models motivates our next analysis, where we force these samples to be identical. We saw in the main text that estimates were sensitive to which fixed-effects structure we employed, in spite of mother and birth characteristics being similar across the two samples; it is interesting to see whether the effect estimates remain different after forcing the samples to be identical. Results are in Appendix Tables A21 and A22.
Finally, concern over mismeasurement of the link between water contamination and exposure, that may be especially acute in large CWS (with complicated distribution systems), motivates a sensitivity analysis estimating our models for smaller CWS only. Results are in Appendix Tables A23 and A24.
We present each of these sensitivity analyses in the following appendix subsections.
Table A8:
Sample means and, when appropriate, standard deviations, for selected statistics and subsamples of births.
| (1) | (2) | (3) | (4) | |
|---|---|---|---|---|
| Characteristic | All Births | NP No MCL | Analytic Sample | Mom FE Sample |
| Sample Sizes | ||||
| No. of births | 1,608,619 | 1,241,656 | 1,216,132 | 700,815 |
| No. of moms | 992,451 | 832,674 | 820,173 | 304,934 |
| No. of CWSs | 1,542 | 1,537 | 1,459 | 1,343 |
| Outcomes | ||||
| Low birth weight | 0.083 | 0.067 | 0.067 | 0.065 |
| Preterm birth | 0.099 | 0.082 | 0.082 | 0.081 |
| Small for gestational age | 0.099 | 0.095 | 0.096 | 0.091 |
| Term birth weight (g) | 3,398 (475.2) | 3,396 (468.9) | 3,396 (468.8) | 3,402 (467.2) |
| CWS births per 100k per month | 117.7 (110.8) | 113.4 (107.4) | 112.7 (103.3) | 112.1 (98.59) |
| Other characteristics | ||||
| Mom age (years) | 28.09 (6.032) | 27.87 (6.054) | 27.87 (6.057) | 27.69 (5.858) |
| Mom Black | 0.177 | 0.210 | 0.213 | 0.220 |
| Mom Hispanic | 0.058 | 0.068 | 0.069 | 0.074 |
| Mom white, not Hispanic | 0.765 | 0.722 | 0.718 | 0.713 |
| HS or less | 0.403 | 0.401 | 0.401 | 0.406 |
| Mom smokes | 0.219 | 0.224 | 0.223 | 0.216 |
| Mom married | 0.606 | 0.569 | 0.568 | 0.578 |
| WIC/Medicaid | 0.442 | 0.477 | 0.478 | 0.482 |
| Different CWS next birth | 0.406 | 0.369 | 0.368 | 0.368 |
| Mean RRCML, all contaminants | 0.104 (2.978) | 0.095 (2.237) | 0.078 (0.078) | 0.078 (0.077) |
| Mean RRCML, chemical only | 0.095 (3.201) | 0.085 (2.238) | 0.069 (0.080) | 0.069 (0.079) |
Notes: Sample consists of all births. Means (for indicator variables, proportions) are displayed, with standard deviations in parentheses when appropriate; NP = non-plural birth. Column (1) provides statistics for for all births, including those not matched to water systems. Column (2) restricts to non-plural births without MCLs that are matched to water systems. Column (3) restricts to our analytic sample, keeping only births with an overall measure of contamination and for which that measure is below the 99th percentile. Column (4) describes births in our mother fixed-effects specifications.
All births (including plural)
Table A9:
Effects of contamination on low birth weight and preterm birth, all births (including plural).
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
|
|
|
|
||||
| All | Chemical | All | Chemical | All | Chemical | |
|
| ||||||
| Panel A: low birth weight | ||||||
|
|
||||||
| CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.01817*** (0.00393) | 0.01732*** (0.00165) | 0.01363*** (0.00309) | 0.01308*** (0.00122) | 0.01137*** (0.00259) | 0.01009*** (0.00137) |
| Observations | 1,262,235 | 1,260,752 | 1,262,235 | 1,260,752 | 1,262,235 | 1,260,752 |
| Adj R2 | 0.0043 | 0.0047 | 0.1549 | 0.1551 | 0.2079 | 0.2080 |
| Mom & CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.01242*** (0.00308) | 0.01214*** (0.00137) | 0.01010*** (0.00266) | 0.00983*** (0.00120) | 0.00884*** (0.00230) | 0.00764*** (0.00136) |
| Observations | 754,689 | 753,372 | 754,689 | 753,372 | 754,689 | 753,372 |
| Adj R2 | 0.2738 | 0.2741 | 0.3455 | 0.3456 | 0.3752 | 0.3753 |
| Panel B: preterm birth | ||||||
|
|
||||||
| CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.02489*** (0.00506) | 0.02355*** (0.00208) | 0.02004*** (0.00402) | 0.01905*** (0.00160) | 0.01721*** (0.00329) | 0.01531*** (0.00170) |
| Observations | 1,262,235 | 1,260,752 | 1,262,235 | 1,260,752 | 1,262,235 | 1,260,752 |
| Adj R2 | 0.0034 | 0.0039 | 0.1453 | 0.1457 | 0.2209 | 0.2212 |
| Mom & CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.01808*** (0.00415) | 0.01774*** (0.00178) | 0.01553*** (0.00365) | 0.01524*** (0.00156) | 0.01407*** (0.00297) | 0.01255*** (0.00165) |
| Observations | 754,689 | 753,372 | 754,689 | 753,372 | 754,689 | 753,372 |
| Adj R2 | 0.3016 | 0.3021 | 0.3681 | 0.3684 | 0.4125 | 0.4127 |
Notes: see notes to Table 3 in the main text.
Table A10:
Effects of contamination on small for gestational age and term birth weight, all births (including plural).
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
|
|
|
|
||||
| All | Chemical | All | Chemical | All | Chemical | |
|
| ||||||
| Panel A: small for gestational age | ||||||
|
|
||||||
| CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.00444*** (0.00080) | 0.00380*** (0.00069) | 0.00236*** (0.00075) | 0.00170*** (0.00060) | 0.00209*** (0.00078) | 0.00137** (0.00060) |
| Observations | 1,259,696 | 1,258,220 | 1,259,696 | 1,258,220 | 1,259,696 | 1,258,220 |
| Adj R2 | 0.0050 | 0.0050 | 0.0569 | 0.0568 | 0.0575 | 0.0575 |
| Mom & CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | 0.00388*** (0.00101) | 0.00294*** (0.00087) | 0.00306*** (0.00095) | 0.00195** (0.00082) | 0.00319*** (0.00097) | 0.00205** (0.00084) |
| Observations | 751,908 | 750,588 | 751,908 | 750,588 | 751,908 | 750,588 |
| Adj R2 | 0.1627 | 0.1633 | 0.1886 | 0.1892 | 0.1887 | 0.1893 |
| Panel B: term birth weight | ||||||
|
|
||||||
| CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | −11.08*** (1.46) | −10.66605*** (1.30313) | −9.66*** (1.51) | −9.36292*** (1.20495) | −9.46*** (1.65) | −8.69350*** (1.31206) |
| Observations | 1,135,217 | 1,133,896 | 1,135,217 | 1,133,896 | 1,135,217 | 1,133,896 |
| Adj R2 | 0.0157 | 0.0157 | 0.1199 | 0.1200 | 0.1241 | 0.1242 |
| Mom & CWS FE | ||||||
|
|
||||||
| 10th to 90th RRMCL | −9.93*** (1.68) | −9.43433*** (1.32991) | −9.15*** (1.80) | −8.56855*** (1.34776) | −9.31*** (1.84) | −8.23830*** (1.42809) |
| Observations | 634,768 | 633,676 | 634,768 | 633,676 | 634,768 | 633,676 |
| Adj R2 | 0.3579 | 0.3580 | 0.4151 | 0.4152 | 0.4171 | 0.4171 |
Notes: see notes to Table 3 in the main text.
Only non-plural births to non-moving moms
In this subsection, we repeat our main specifications providing evidence of the effects of contamination on birth outcomes, except we remove moms who switch CWS at any point across births. Of course, this sample restriction only applies to mothers with multiple births in our data. Our intent is to test whether our results are driven by movers.
Table A11:
Effects of contamination on low birth weight and preterm birth, no movers.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: low birth weight | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.01370*** (0.00320) | 0.01502*** (0.00137) | 0.01260*** (0.00303) | 0.01395*** (0.00126) | 0.01130*** (0.00246) | 0.01131*** (0.00145) |
| Observations | 885,494 | 884,502 | 885,494 | 884,502 | 885,494 | 884,502 |
| Adj R2 | 0.0053 | 0.0056 | 0.0464 | 0.0466 | 0.1066 | 0.1067 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.00882*** (0.00307) | 0.01034*** (0.00136) | 0.00841*** (0.00288) | 0.00963*** (0.00128) | 0.00835*** (0.00228) | 0.00808*** (0.00147) |
| Observations | 374,367 | 373,867 | 374,367 | 373,867 | 374,367 | 373,867 |
| Adj R2 | 0.2021 | 0.2023 | 0.2240 | 0.2242 | 0.2577 | 0.2578 |
| Panel B: preterm birth | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.01948*** (0.00443) | 0.02155*** (0.00192) | 0.01819*** (0.00409) | 0.02023*** (0.00176) | 0.01653*** (0.00325) | 0.01690*** (0.00193) |
| Observations | 885,494 | 884,502 | 885,494 | 884,502 | 885,494 | 884,502 |
| Adj R2 | 0.0037 | 0.0042 | 0.0467 | 0.0472 | 0.1314 | 0.1317 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.01321*** (0.00463) | 0.01616*** (0.00204) | 0.01254*** (0.00444) | 0.01518*** (0.00194) | 0.01260*** (0.00341) | 0.01324*** (0.00202) |
| Observations | 374,367 | 373,867 | 374,367 | 373,867 | 374,367 | 373,867 |
| Adj R2 | 0.1917 | 0.1920 | 0.2178 | 0.2180 | 0.2658 | 0.2658 |
Notes: see notes to Table 3 in the main text.
Table A12:
Effects of contamination on small for gestational age and term birth weight, no movers.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: small for gestational age | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.00432*** (0.00090) | 0.00330*** (0.00076) | 0.00364*** (0.00085) | 0.00278*** (0.00071) | 0.00344*** (0.00091) | 0.00247*** (0.00075) |
| Observations | 884,128 | 883,138 | 884,128 | 883,138 | 884,128 | 883,138 |
| Adj R2 | 0.0059 | 0.0059 | 0.0386 | 0.0386 | 0.0396 | 0.0397 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.00323*** (0.00122) | 0.00160 (0.00103) | 0.00325*** (0.00123) | 0.00149 (0.00104) | 0.00365*** (0.00126) | 0.00194* (0.00105) |
| Observations | 373,142 | 372,638 | 373,142 | 372,638 | 373,142 | 372,638 |
| Adj R2 | 0.1887 | 0.1890 | 0.1984 | 0.1988 | 0.1985 | 0.1989 |
| Panel B: term birth weight | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | −11.41*** (1.56) | −10.70*** (1.37) | −10.34*** (1.66) | −10.09*** (1.30) | −10.38*** (1.82) | −9.56821*** (1.46) |
| Observations | 812,779 | 811,873 | 812,779 | 811,873 | 812,779 | 811,873 |
| Adj R2 | 0.0182 | 0.0182 | 0.0942 | 0.0944 | 0.0986 | 0.0988 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | −8.53*** (2.30) | −8.78*** (1.69) | −8.67*** (2.34) | −8.72*** (1.73) | −9.27*** (2.41) | −8.86*** (1.83) |
| Observations | 327,108 | 326,672 | 327,108 | 326,672 | 327,108 | 326,672 |
| Adj R2 | 0.4672 | 0.4674 | 0.4941 | 0.4942 | 0.4955 | 0.4957 |
Notes: see notes to Table 3 in the main text.
Including births exposed to MCL violations
In this subsection, we repeat our main specifications except we do not exclude births exposed to MCL violations.
Table A13:
Effects of contamination on low birth weight and pre-term birth, including births exposed to MCL.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: Low birth weight | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.01306*** (0.00277) | 0.01449*** (0.00129) | 0.01200*** (0.00263) | 0.01342*** (0.00120) | 0.01016*** (0.00217) | 0.01064*** (0.00130) |
| Observations | 1,257,166 | 1,255,522 | 1,257,166 | 1,255,522 | 1,257,166 | 1,255,522 |
| Adj R2 | 0.0050 | 0.0053 | 0.0452 | 0.0455 | 0.1050 | 0.1053 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.00918*** (0.00225) | 0.01059*** (0.00114) | 0.00896*** (0.00212) | 0.01013*** (0.00111) | 0.00780*** (0.00180) | 0.00795*** (0.00126) |
| Observations | 738,790 | 737,372 | 738,790 | 737,372 | 738,790 | 737,372 |
| Adj R2 | 0.1482 | 0.1487 | 0.1664 | 0.1668 | 0.2077 | 0.2080 |
| Panel B: Preterm birth | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.01948*** (0.00373) | 0.02118*** (0.00170) | 0.01818*** (0.00346) | 0.01987*** (0.00158) | 0.01584*** (0.00277) | 0.01635*** (0.00162) |
| Observations | 1,257,166 | 1,255,522 | 1,257,166 | 1,255,522 | 1,257,166 | 1,255,522 |
| Adj R2 | 0.0037 | 0.0042 | 0.0464 | 0.0469 | 0.1308 | 0.1311 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.01515*** (0.00333) | 0.01688*** (0.00158) | 0.01482*** (0.00316) | 0.01631*** (0.00152) | 0.01347*** (0.00252) | 0.01362*** (0.00160) |
| Observations | 738,790 | 737,372 | 738,790 | 737,372 | 738,790 | 737,372 |
| Adj R2 | 0.1465 | 0.1469 | 0.1641 | 0.1644 | 0.2227 | 0.2229 |
Notes: see notes to Table 3 in the main text.
Table A14:
Effects of contamination on small for gestational age and term birth weight, including births exposed to MCL.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: small for gestational age | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.00326*** (0.00073) | 0.00291*** (0.00058) | 0.00266*** (0.00071) | 0.00237*** (0.00054) | 0.00242*** (0.00075) | 0.00206*** (0.00055) |
| Observations | 1,255,208 | 1,253,568 | 1,255,208 | 1,253,568 | 1,255,208 | 1,253,568 |
| Adj R2 | 0.0054 | 0.0054 | 0.0378 | 0.0378 | 0.0388 | 0.0388 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.00222** (0.00099) | 0.00164** (0.00081) | 0.00220** (0.00096) | 0.00149* (0.00079) | 0.00231** (0.00099) | 0.00155* (0.00081) |
| Observations | 736,689 | 735,266 | 736,689 | 735,266 | 736,689 | 735,266 |
| Adj R2 | 0.1376 | 0.1382 | 0.1532 | 0.1538 | 0.1535 | 0.1541 |
| Panel B: term birth weight | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | −10.19*** (1.35) | −10.05*** (1.18) | −9.30*** (1.47) | −9.38*** (1.14) | −9.04*** (1.58) | −8.78*** (1.24) |
| Observations | 1,154,988 | 1,153,455 | 1,154,988 | 1,153,455 | 1,154,988 | 1,153,455 |
| Adj R2 | 0.0168 | 0.0168 | 0.0941 | 0.0941 | 0.0985 | 0.0985 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | −9.15*** (1.69) | −9.19*** (1.29) | −9.28*** (1.79) | −8.93*** (1.35) | −9.40*** (1.83) | −8.58*** (1.43) |
| Observations | 647,223 | 645,961 | 647,223 | 645,961 | 647,223 | 645,961 |
| Adj R2 | 0.3400 | 0.3400 | 0.3795 | 0.3795 | 0.3816 | 0.3816 |
Notes: see notes to Table 3 in the main text.
Removing births with any samples exceeding MCL
In this subsection, we repeat our main specifications except not only do we exclude births exposed to reported MCL violations, we also exclude births exposed to any samples that were in excess of MCL. Independent audits of EPA’s reported MCL violations around the year 2000 found that a large share of drinking water violations were not reported accurately (see the references and discussion in Bennear & Olmstead, 2008, who note that for some contaminants around 85% of MCL violations were not accurately reported), so there is concern that the MCL violations data provided by the US EPA is imperfect. However, by 2011, the middle of our study period, close to 75% of health-based violations were accurately reported across states to EPA (Baker et al., 2023). To provide additional statistics for this question, we requested the MCL violations from Pennsylvania and compared them to the data on health-based violations we FOIA’d the EPA for. We combined the EPA and PA violations from 2000–2015 and found that 80% of them were in both files, 6% were in PA only and 14% were in EPA only. Our main specifications used the EPA violations as the gold standard, making up 94% of the total reported violations.
While our analysis and more recent literature suggests the EPA violations data is of reasonable quality, due to lingering concerns that violations might not be accurately reported or measured, we use the sampling data to identify individual sampling concentrations (results) that are above the MCL. The alternative regressions displayed in Tables A15 through A20 test whether our findings are sensitive to a stronger definition of “compliant drinking water” where we exclude births exposed to any samples in excess of the MCL.7
There are four contaminants where births very often have samples that exceed the MCL: Total Coliform, TTHM, HAA5, or lead. Note that none of these contaminants have MCL violations generated using single samples: the Total Coliform rule uses monthly aggregates, TTM and HAA5 use running annual averages, and lead uses 90th percentiles. It is therefore perfectly consistent that samples often exceed their corresponding MCL, but do not trigger regulatory violations, in these cases. Due to these contaminants, we estimate three versions of this sensitivity analysis. In our first version, included in Appendix Tables A15 and A16, we construct our water quality index measure the same way as we do in the text, including these four contaminants, but we do not use these four contaminants to drop births exposed to samples that exceed their MCL. This first version maximizes our sample size for this robustness check, while keeping our index consistent with what we calculate in the main text. In our second version, included in Appendix Tables A17 and A18, we simply do not include these contaminants in the analysis at all: not in the index, nor in the sample restriction. In our third version, included in Appendix Tables A19 and A20, we include these in the index and drop births exposed to exceedences for any and all contaminants (including hese four contaminants), which leads to a substantial reduction in sample size.
The results from these sensitivity analyses are displayed in the following tables. Our findings are, for the most part, robust to these changes.
Table A15:
Effects of contamination on low birth weight and preterm birth, removing births with any samples exceeding MCL, except for four contaminants with widespread exceedances (total coliform, TTHM, HAA5, and lead).
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: low birth weight | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.01461*** (0.00280) | 0.01456*** (0.00155) | 0.01345*** (0.00262) | 0.01345*** (0.00143) | 0.01652*** (0.00175) | 0.01035*** (0.00174) |
| Observations | 1,004,287 | 1,002,929 | 1,004,287 | 1,002,929 | 1,004,287 | 1,002,929 |
| Adj R2 | 0.0052 | 0.0054 | 0.0460 | 0.0462 | 0.1065 | 0.1066 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.01180*** (0.00194) | 0.01116*** (0.00133) | 0.01130*** (0.00182) | 0.01060*** (0.00125) | 0.01302*** (0.00154) | 0.00818*** (0.00152) |
| Observations | 511,624 | 510,569 | 511,624 | 510,569 | 511,624 | 510,569 |
| Adj R2 | 0.1533 | 0.1532 | 0.1715 | 0.1714 | 0.2122 | 0.2119 |
| Panel B: preterm birth | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.02186*** (0.00382) | 0.02154*** (0.00210) | 0.02045*** (0.00351) | 0.02016*** (0.00192) | 0.02476*** (0.00235) | 0.01617*** (0.00225) |
| Observations | 1,004,287 | 1,002,929 | 1,004,287 | 1,002,929 | 1,004,287 | 1,002,929 |
| Adj R2 | 0.0040 | 0.0043 | 0.0472 | 0.0475 | 0.1314 | 0.1315 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.01865*** (0.00304) | 0.01816*** (0.00191) | 0.01806*** (0.00287) | 0.01751*** (0.00180) | 0.02150*** (0.00221) | 0.01451*** (0.00207) |
| Observations | 511,624 | 510,569 | 511,624 | 510,569 | 511,624 | 510,569 |
| Adj R2 | 0.1567 | 0.1568 | 0.1740 | 0.1740 | 0.2299 | 0.2296 |
Notes: see notes to Table 3 in the main text. In this analysis, we do not include births exposed to water quality samples exceeding their respective MCL for any contaminants except four with widespread exceedences (Total Coliform, TTHM, HAA5, and Lead).
Table A16:
Effects of contamination on small for gestational age and term birth weight, removing births with any samples exceeding MCL, except for four contaminants with widespread exceedances (total coliform, TTHM, HAA5, and lead).
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: small for gestational age | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.00309*** (0.00078) | 0.00247*** (0.00063) | 0.00249*** (0.00075) | 0.00197*** (0.00059) | 0.00242*** (0.00073) | 0.00160*** (0.00058) |
| Observations | 1,002,592 | 1,001,241 | 1,002,592 | 1,001,241 | 1,002,592 | 1,001,241 |
| Adj R2 | 0.0052 | 0.0052 | 0.0378 | 0.0379 | 0.0389 | 0.0389 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.00219* (0.00117) | 0.00101 (0.00107) | 0.00194* (0.00112) | 0.00075 (0.00103) | 0.00092* (0.00127) | 0.00077 (0.00103) |
| Observations | 510,015 | 508,959 | 510,015 | 508,959 | 510,015 | 508,959 |
| Adj R2 | 0.1378 | 0.1382 | 0.1535 | 0.1538 | 0.1537 | 0.1541 |
| Panel B: term birth weight | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | −10.49*** (1.34) | −9.18398*** (1.19) | −9.73*** (1.39) | −8.73*** (1.16) | −10.72*** (1.43) | −8.16*** (1.37) |
| Observations | 921,294 | 920,064 | 921,294 | 920,064 | 921,294 | 920,064 |
| Adj R2 | 0.0162 | 0.0162 | 0.0935 | 0.0937 | 0.0979 | 0.0981 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | −9.67*** (1.83) | −8.01*** (1.55) | −9.33*** (1.83) | −7.57*** (1.54) | −9.30*** (1.89) | −7.27*** (1.65) |
| Observations | 445,586 | 444,713 | 445,586 | 444,713 | 445,586 | 444,713 |
| Adj R2 | 0.3422 | 0.3423 | 0.3812 | 0.3813 | 0.3833 | 0.3834 |
Table A17:
Effects of contamination on low birth weight and preterm birth, removing births with any samples exceeding MCL, excluding four contaminants (total coliform, TTHM, HAA5, and lead) from the analysis entirely.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: low birth weight | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | 0.00983*** (0.00108) | 0.00904*** (0.00100) | 0.00903*** (0.00100) | 0.00832*** (0.00092) | 0.00723*** (0.00080) | 0.00663*** (0.00093) |
| Observations | 980,718 | 979,158 | 980,718 | 979,158 | 980,718 | 979,158 |
| Adj R2 | 0.0052 | 0.0053 | 0.0461 | 0.0461 | 0.1067 | 0.1066 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | 0.00793*** (0.00089) | 0.00735*** (0.00083) | 0.00738*** (0.00084) | 0.00681*** (0.00078) | 0.00591*** (0.00068) | 0.00545*** (0.00080) |
| Observations | 491,834 | 490,505 | 491,834 | 490,505 | 491,834 | 490,505 |
| Adj R2 | 0.1523 | 0.1522 | 0.1702 | 0.1702 | 0.2112 | 0.2110 |
| Panel B: preterm birth | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | 0.01442*** (0.00139) | 0.01322*** (0.00128) | 0.01341*** (0.00129) | 0.01230*** (0.00118) | 0.01068*** (0.00102) | 0.01019*** (0.00113) |
| Observations | 980,718 | 979,158 | 980,718 | 979,158 | 980,718 | 979,158 |
| Adj R2 | 0.0041 | 0.0041 | 0.0472 | 0.0473 | 0.1316 | 0.1316 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | 0.01244*** (0.00117) | 0.01156*** (0.00109) | 0.01177*** (0.00111) | 0.01091*** (0.00103) | 0.00947*** (0.00089) | 0.00924*** (0.00099) |
| Observations | 491,834 | 490,505 | 491,834 | 490,505 | 491,834 | 490,505 |
| Adj R2 | 0.1556 | 0.1556 | 0.1725 | 0.1725 | 0.2288 | 0.2287 |
Notes: see notes to Table 3 in the main text. In these specifications we include all births except those exposed to any samples that exceed their respective MCL. We discard the four contaminants with widespread exceedences (total coliform, TTHM, HAA5, and lead) from the analysis entirely. Since the indices used in this table do not have the same contaminants as the all or chemical indices used elsewhere (due to the exclusion of the four aforementioned contaminants), the distribution for the indices in this table is different. We do not provide detailed statistics for these distributions, but to aid comparison: the distance between the 10th to 90th RRMCL (a one-unit change in the independent variable) is 0.0611 for the All contaminant index used in this table, and it is 0.0551 for the Chemical index used in this table.
Table A18:
Effects of contamination on small for gestational age and term birth weight, removing births with any samples exceeding MCL, excluding four contaminants (total coliform, TTHM, HAA5, and lead) from the analysis entirely.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: small for gestational age | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | 0.00139*** (0.00050) | 0.00120*** (0.00044) | 0.00108** (0.00049) | 0.00092** (0.00043) | 0.00080* (0.00037) | 0.00070 (0.00043) |
| Observations | 979,100 | 977,545 | 979,100 | 977,545 | 979,100 | 977,545 |
| Adj R2 | 0.0052 | 0.0052 | 0.0378 | 0.0378 | 0.0388 | 0.0388 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | 0.00045 (0.00072) | 0.00041 (0.00065) | 0.00033 (0.00071) | 0.00026 (0.00064) | 0.00023 (0.00056) | 0.00025 (0.00064) |
| Observations | 490,311 | 488,990 | 490,311 | 488,990 | 490,311 | 488,990 |
| Adj R2 | 0.1368 | 0.1366 | 0.1525 | 0.1524 | 0.1528 | 0.1527 |
| Panel B: term birth weight | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | −5.35*** (0.79) | −4.99*** (0.70) | −5.08*** (0.79) | −4.79*** (0.71) | −4.15*** (0.61) | −4.59*** (0.82) |
| Observations | 899,901 | 898,475 | 899,901 | 898,475 | 899,901 | 898,475 |
| Adj R2 | 0.0161 | 0.0161 | 0.0936 | 0.0936 | 0.0980 | 0.0980 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | −5.28*** (1.06) | −4.90*** (0.97) | −5.21*** (1.05) | −4.71*** (0.95) | −4.09*** (0.82) | −4.65*** (0.97) |
| Observations | 428,339 | 427,191 | 428,339 | 427,191 | 428,339 | 427,191 |
| Adj R2 | 0.3396 | 0.3396 | 0.3788 | 0.3788 | 0.3810 | 0.3809 |
Table A19:
Effects of contamination on low birth weight and preterm birth, removing births with any samples exceeding MCL, including the four contaminants with widespread exceedances (total coliform, TTHM, HAA5, and lead).
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: low birth weight | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | 0.01971*** (0.00236) | 0.01698*** (0.00201) | 0.01840*** (0.00223) | 0.01586*** (0.00190) | 0.01989*** (0.00239) | 0.01232*** (0.00191) |
| Observations | 344,101 | 343,470 | 344,101 | 343,470 | 344,101 | 343,470 |
| Adj R2 | 0.0077 | 0.0077 | 0.0477 | 0.0476 | 0.1176 | 0.1174 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | 0.01311*** (0.00270) | 0.01190*** (0.00227) | 0.01277*** (0.00256) | 0.01148*** (0.00216) | 0.01440*** (0.00271) | 0.00933*** (0.00211) |
| Observations | 106,463 | 106,100 | 106,463 | 106,100 | 106,463 | 106,100 |
| Adj R2 | 0.1798 | 0.1790 | 0.2007 | 0.2000 | 0.2419 | 0.2411 |
| Panel B: preterm birth | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | 0.02857*** (0.00294) | 0.02466*** (0.00248) | 0.02677*** (0.00279) | 0.02315*** (0.00235) | 0.02904*** (0.00295) | 0.01855*** (0.00236) |
| Observations | 344,101 | 343,470 | 344,101 | 343,470 | 344,101 | 343,470 |
| Adj R2 | 0.0081 | 0.0081 | 0.0512 | 0.0513 | 0.1419 | 0.1419 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | 0.02241*** (0.00334) | 0.02082*** (0.00281) | 0.02161*** (0.00311) | 0.02000*** (0.00263) | 0.02509*** (0.00329) | 0.01720*** (0.00231) |
| Observations | 106,463 | 106,100 | 106,463 | 106,100 | 106,463 | 106,100 |
| Adj R2 | 0.1807 | 0.1802 | 0.2019 | 0.2013 | 0.2557 | 0.2548 |
Notes: see notes to Table 3 in the main text. In this analysis, we do not include births exposed to any water quality sample with a result that exceeds the respective MCL. We include the four contaminants with widespread exceedences in this analysis. This leads to a substantial reduction in sample sizes, as observed in the table.
Table A20:
Effects of contamination on small for gestational age and term birth weight, removing births with any samples exceeding MCL, including the four contaminants with widespread exceedances (total coliform, TTHM, HAA5, and lead).
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: small for gestational age | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | 0.00292** (0.00115) | 0.00241** (0.00100) | 0.00262** (0.00113) | 0.00215** (0.00097) | 0.00270* (0.00122) | 0.00183* (0.00098) |
| Observations | 343,395 | 342,771 | 343,395 | 342,771 | 343,395 | 342,771 |
| Adj R2 | 0.0040 | 0.0040 | 0.0364 | 0.0364 | 0.0377 | 0.0377 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | −0.00092 (0.00242) | −0.00115 (0.00206) | −0.00050 (0.00242) | −0.00080 (0.00206) | −0.00100 (0.00258) | −0.00083 (0.00206) |
| Observations | 106,009 | 105,647 | 106,009 | 105,647 | 106,009 | 105,647 |
| Adj R2 | 0.1702 | 0.1698 | 0.1824 | 0.1821 | 0.1826 | 0.1823 |
| Panel B: term birth weight | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | −10.16*** (2.04) | −8.66*** (1.70) | −10.47*** (1.97) | −8.97*** (1.66) | −11.26*** (2.09) | −8.76*** (1.79) |
| Observations | 314,836 | 314,283 | 314,836 | 314,283 | 314,836 | 314,283 |
| Adj R2 | 0.0130 | 0.0130 | 0.0883 | 0.0884 | 0.0929 | 0.0929 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | −5.71 (3.97) | −5.38 (3.44) | −7.59** (3.79) | −7.09** (3.27) | −8.89* (4.10) | −7.03** (3.38) |
| Observations | 92,157 | 91,851 | 92,157 | 91,851 | 92,157 | 91,851 |
| Adj R2 | 0.4015 | 0.4010 | 0.4357 | 0.4352 | 0.4380 | 0.4375 |
Using the mom FE sample in the CWS FE regressions also
In the main specifications, using mother fixed-effects rather than CWS fixed-effects often appreciably and significantly changes the effect estimates. In fact, this is the main way in which our results are sensitive. In this subsection, we present results subsetting throughout to the sample used in the mother fixed-effects specifications. Note that sample sizes are slightly larger in this subsection than in the main text because, in order to maintain exact balance across CWS fixed-effects and mother fixed-effects regressions, we do not iteratively remove singletons (Correia (2015)); this also means that standard errors in this subsection are slightly understated.
Table A21:
Effects of contamination on low birth weight and preterm birth, using the mother fixed-effects sample throughout.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: low birth weight | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.01291*** (0.00289) | 0.01414*** (0.00140) | 0.01180*** (0.00274) | 0.01298*** (0.00131) | 0.00997*** (0.00232) | 0.01002*** (0.00146) |
| Observations | 701,006 | 699,750 | 701,006 | 699,750 | 701,006 | 699,750 |
| Adj R2 | 0.0056 | 0.0058 | 0.0481 | 0.0483 | 0.1092 | 0.1094 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.00957*** (0.00245) | 0.01049*** (0.00120) | 0.00932*** (0.00232) | 0.01003*** (0.00115) | 0.00819*** (0.00198) | 0.00775*** (0.00133) |
| Observations | 701,006 | 699,750 | 701,006 | 699,750 | 701,006 | 699,750 |
| Adj R2 | 0.1505 | 0.1507 | 0.1689 | 0.1690 | 0.2102 | 0.2103 |
| Panel B: preterm birth | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.01990*** (0.00412) | 0.02139*** (0.00184) | 0.01854*** (0.00384) | 0.01998*** (0.00170) | 0.01632*** (0.00307) | 0.01630*** (0.00176) |
| Observations | 701,006 | 699,750 | 701,006 | 699,750 | 701,006 | 699,750 |
| Adj R2 | 0.0042 | 0.0046 | 0.0502 | 0.0506 | 0.1355 | 0.1357 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.01571*** (0.00363) | 0.01715*** (0.00166) | 0.01533*** (0.00345) | 0.01658*** (0.00160) | 0.01405*** (0.00276) | 0.01377*** (0.00170) |
| Observations | 701,006 | 699,750 | 701,006 | 699,750 | 701,006 | 699,750 |
| Adj R2 | 0.1486 | 0.1489 | 0.1663 | 0.1665 | 0.2252 | 0.2252 |
Notes: standard errors in parentheses
p < 0.10
p < 0.05
p < 0.01.
Standard errors are twoway clustered on public water system and mother. Each cell is a separate regression. Observations (number of births) and adjusted R2s (for the full model, i.e. including the fixed-effects) are reported. In each regression, the sample is restricted to births in our analytic sample for whom we observe at least one water quality sample of the given contaminant group during gestation in the public water system of residence at the time of birth. Control variables used vary over super-columns; independent variable, i.e. the contaminant group studied, vary across columns; the outcomes varies over panels; the specification used (either public water system fixed-effects or mom fixed-effects) varies over sub-panels; temp & TRI ctrls = weather and toxics release inventory (air pollution) controls.
Table A22:
Effects of contamination on small for gestational age and term birth weight, using the mother fixed-effects sample throughout.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: small for gestational age | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | 0.00294** (0.00116) | 0.00225*** (0.00087) | 0.00228** (0.00109) | 0.00164** (0.00083) | 0.00212* (0.00114) | 0.00141* (0.00085) |
| Observations | 699,702 | 698,443 | 699,702 | 698,443 | 699,702 | 698,443 |
| Adj R2 | 0.0062 | 0.0062 | 0.0386 | 0.0387 | 0.0394 | 0.0394 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | 0.00279*** (0.00100) | 0.00180** (0.00084) | 0.00276*** (0.00096) | 0.00166** (0.00081) | 0.00290*** (0.00099) | 0.00174** (0.00083) |
| Observations | 699,702 | 698,443 | 699,702 | 698,443 | 699,702 | 698,443 |
| Adj R2 | 0.1391 | 0.1397 | 0.1546 | 0.1552 | 0.1548 | 0.1554 |
| Panel B: term birth weight | ||||||
| CWS FE | ||||||
| 10th to 90th RRMCL | −9.46*** (1.55) | −9.03*** (1.45) | −8.40*** (1.69) | −8.22*** (1.41) | −8.25*** (1.85) | −7.60*** (1.53) |
| Observations | 644,014 | 642,882 | 644,014 | 642,882 | 644,014 | 642,882 |
| Adj R2 | 0.0194 | 0.0194 | 0.0997 | 0.0998 | 0.1037 | 0.1039 |
| Mom & CWS FE | ||||||
| 10th to 90th RRMCL | −9.05*** (1.73) | −8.81*** (1.33) | −9.15*** (1.84) | −8.57*** (1.37) | −9.33*** (1.87) | −8.24*** (1.45) |
| Observations | 644,014 | 642,882 | 644,014 | 642,882 | 644,014 | 642,882 |
| Adj R2 | 0.3488 | 0.3488 | 0.3875 | 0.3876 | 0.3896 | 0.3896 |
Notes: see notes to Table A21.
Estimating on Smaller CWS Only
Concern over mismeasurement of the link between water contamination and exposure, that may be especially acute in large CWS (with complicated distribution systems), motivates a sensitivity analysis estimating our models for smaller CWS only. To conduct this analysis, we sorted births by CWS size (number of people served by the CWS) and then dropped the 50% of births in our sample in the largest systems (those in the bottom of the dataset). The largest system in this restricted sample serves 118,000 people (contrast with the largest system, Philadelphia Water Department, which serves 1,600,000 people), and thus this is a meaningful restriction. The results of estimating on this subsample are included below.
Table A23:
Effects of drinking water contamination on low birth weight and preterm birth, small systems only.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: low birth weight | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | 0.01280*** (0.00124) | 0.01161*** (0.00108) | 0.01185*** (0.00119) | 0.01080*** (0.00104) | 0.01259*** (0.00121) | 0.00789*** (0.00094) |
| Observations | 597,900 | 596,418 | 597,900 | 596,418 | 597,900 | 596,418 |
| Adj R2 | 0.0040 | 0.0042 | 0.0376 | 0.0378 | 0.0911 | 0.0914 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | 0.00863*** (0.00129) | 0.00801*** (0.00113) | 0.00844*** (0.00124) | 0.00783*** (0.00110) | 0.00913*** (0.00128) | 0.00555*** (0.00107) |
| Observations | 282,179 | 281,162 | 282,179 | 281,162 | 282,179 | 281,162 |
| Adj R2 | 0.1584 | 0.1585 | 0.1755 | 0.1756 | 0.2094 | 0.2097 |
| Panel B: pre-term birth | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | 0.01907*** (0.00166) | 0.01732*** (0.00143) | 0.01782*** (0.00159) | 0.01621*** (0.00136) | 0.01890*** (0.00159) | 0.01240*** (0.00124) |
| Observations | 597,900 | 596,418 | 597,900 | 596,418 | 597,900 | 596,418 |
| Adj R2 | 0.0035 | 0.0037 | 0.0400 | 0.0403 | 0.1167 | 0.1171 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | 0.01543*** (0.00166) | 0.01454*** (0.00141) | 0.01519*** (0.00159) | 0.01430*** (0.00135) | 0.01666*** (0.00157) | 0.01138*** (0.00126) |
| Observations | 282,179 | 281,162 | 282,179 | 281,162 | 282,179 | 281,162 |
| Adj R2 | 0.1659 | 0.1661 | 0.1842 | 0.1844 | 0.2328 | 0.2330 |
Notes: see notes to Table 3 in the main text.
Table A24:
Effects of drinking water contamination on small for gestational age and term birth weight, small systems only.
| (1) | (2) | (3) | (4) | (5) | (6) | |
|---|---|---|---|---|---|---|
| No controls | Adding mother ctrls | Adding temp & TRI ctrls | ||||
| All | Chemical | All | Chemical | All | Chemical | |
| Panel A: small for gestational age | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | 0.00133* (0.00069) | 0.00139** (0.00057) | 0.00088 (0.00066) | 0.00105* (0.00055) | 0.00122 (0.00064) | 0.00069 (0.00055) |
| Observations | 597,188 | 595,707 | 597,188 | 595,707 | 597,188 | 595,707 |
| Adj R2 | 0.0043 | 0.0043 | 0.0357 | 0.0357 | 0.0366 | 0.0366 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | 0.00026 (0.00128) | 0.00052 (0.00107) | 0.00016 (0.00126) | 0.00046 (0.00105) | 0.00054 (0.00123) | 0.00029 (0.00104) |
| Observations | 281,525 | 280,500 | 281,525 | 280,500 | 281,525 | 280,500 |
| Adj R2 | 0.1568 | 0.1576 | 0.1697 | 0.1705 | 0.1700 | 0.1708 |
| Panel B: term birth weight | ||||||
| CWS FEs | ||||||
| 10th to 90th RRMCL | −8.18*** (1.23) | −6.83*** (1.07) | −7.66*** (1.19) | −6.49*** (1.02) | −7.57*** (1.19) | −5.74*** (1.06) |
| Observations | 552,670 | 551,295 | 552,670 | 551,295 | 552,670 | 551,295 |
| Adj R2 | 0.0148 | 0.0148 | 0.0894 | 0.0894 | 0.0935 | 0.0936 |
| Mom & CWS FEs | ||||||
| 10th to 90th RRMCL | −7.90*** (1.98) | −6.53*** (1.72) | −7.82*** (1.89) | −6.47*** (1.64) | −7.54*** (1.91) | −6.08*** (1.65) |
| Observations | 249,414 | 248,510 | 249,414 | 248,510 | 249,414 | 248,510 |
| Adj R2 | 0.3879 | 0.3877 | 0.4231 | 0.4230 | 0.4248 | 0.4247 |
Notes: see notes to Table 3 in the main text.
EFFECTS OF MCL VIOLATIONS ON BIRTH OUTCOMES
In this section, we closely follow Currie et al. (2013) using our data. We believe this comparison is important since our data represents a more recent time period and a different state. Overall, we find similar results, although ours are less precise. In general, the estimated effect of a reported health-based MCL violation on birth outcomes is less robust than the effects of our continuous measures of water contamination that we use in the main text. On the one hand, this may be surprising since violations indicate contamination in exceedence of regulatory thresholds; on the other hand, the very fact that there was a violation raises the possibility of significant avoidance behavior, such as through bottled water purchases (Zivin et al. (2011)).
As in the main text, we obtained Safe Drinking Water Information System (SDWIS) data on CWS drinking water violations from the US Environmental Protection Agency (US EPA). This SDWIS data records the compliance period, contaminant, and CWS for each drinking water violation received by each CWS in the United States over our sample period. We focus on “MCL violations”, which are cases where the CWS is observed to have exceeded a “maximum contaminant level,” a threshold over which contamination cannot cross without regulatory repercussions. Note that a drinking water compliance period is a period of time that varies based on contaminant and CWS, but is typically measured in quarters (3 months).
We begin by presenting summary statistics in Appendix Table A25, similar to those in Currie et al. (2013). Note that, following Currie et al. (2013), in Table A25 we restrict to mothers with multiple births, and we do not remove plural births. There are more mothers in this sample than in the mom fixed-effects sample in the main text since in the main text we removed plural births, all births exposed to an MCL violation, and births that had no samples during their gestation period (note that even births with no water quality samples during gestation can be flagged as being exposed or not exposed to an MCL violation).
Studying these statistics, in particular, comparing column (1) with column (2) in Appendix Table A25, we find that only about 3% of births in our sample are exposed to any MCL violations; contrast this with Currie et al. (2013)’s finding of about 8% of births exposed in New Jersey from 1997 to 2007. As a result, while our sample of births is about 50% larger than theirs, as would be expected given that we have an additional year and Pennsylvania is a larger state, the number of births exposed to an MCL violation is actually over 40% smaller in our sample. For chemical contamination, these differences are even larger; we have about 75% fewer births exposed to chemical contamination than Currie et al. (2013). We should keep this in mind when interpreting our results, especially for chemical MCL, as we have much less variation in the independent variables as Currie et al. (2013).
Moreover, we find that births that are exposed to an MCL violation are more likely to be from mothers that are advantaged on multiple measures. For example, mothers tend to be older for births exposed to MCL violations, and mothers exposed to MCL violations are less likely to be racial minorities, are more likely to have bachelors degrees or higher education, and are much more likely to be married. These are all opposite of the findings of Currie et al. (2013), who find that mothers exposed to MCL violations tend to be much more disadvantaged on average. For example, focusing on teen birth, they found that, overall, 3.3% of births were to mothers below the age of 19, while among births exposed to any MCL violation, this rate was about 4.6%; by contrast, in our sample these statistics are practically the reverse, at 5% and 2.6%, respectively.
We use the exact model from their paper (Currie et al. (2013)), which is the same regression model as our mom fixed-effects specifications in the main text, except for three differences. First, we use the same controls as used in Currie et al. (2013). Second, the independent variable is an indicator for at least one day of overlap between the gestation period and an MCL violation.8 Third, we include plural births (Currie et al. (2013) makes no mention of excluding plural births). For comparison, we also present additional results (a) adding additional controls (for plurality, WIC, and payment type) like in our water sample models in the main text, and (b) removing plural births.
Following Currie et al. (2013), we run the model for all MCL violations and then separately for just chemical violations. Chemical violations include all MCL violations except Total Coliform Rule violations. Note that Total Coliform violations are by far the most common MCL violation in our data (at over 70% of MCL violations).
To account for the fact that in the MCL-violation-based specifications, births with longer gestation periods are more likely to have an intersection with an MCL violation (and these birth tend to have better outcomes on average), in our specifications that have MCL exposure as the independent variable, we instrument for contamination using the same “full term gestation” instrument that was used in Currie et al. (2013). To construct this full term gestation instrument, we fix the date of conception and pretend that birth lasted exactly 39 weeks.9 We then construct “full gestation contamination” using this pretend gestation period, and use this constructed variable as an instrument for actual MCL violation exposure. Note that for our water sample regressions in the main text, this instrumental variables approach was unnecessary because there is no mechanical correlation between average water quality and gestational length.
Appendix Tables A26, A27, and A28 report the estimated impacts of MCL violations on low birth weight and prematurity. First, consider Table A26, which reports our attempt at estimating the exact model of Currie et al. (2013) on our sample. We find in this case that the estimated effects are similar to those in Currie et al. (2013), although they are less precisely estimated. In the preferred IV and FE specification (column 4), we find that exposure to an MCL violation for any contaminant increases the probability of low birth weight by about 0.38pp; the analogous result in Currie et al. (2013) was 0.34pp. It is interesting to note that our OLS estimates in column (1) are negative and over four times larger in magnitude than the comparable estimates in Currie et al. (2013), suggesting that the mechanical correlation in our data is more severe than in Currie et al. (2013). While our estimated effects of exposure to a chemical MCL violation on low birth weight has a point estimate of around 0.25pp, compared to Currie et al. (2013) estimate of 0.37pp, it is statistically insignificant. This may have been expected, given the very few births that can be connected to chemical contamination in this data (see Appendix Table A25). Appendix Tables A26 also contains our estimated impacts of MCL violations on pre-term birth, using the same model as Currie et al. (2013). Effects on pre-term birth of “any” MCL violation is about a 0.56pp increase, about twice as large as the estimate in Currie et al. (2013), which was about a 0.25pp increase.
In Appendix Table A27, we add several additional control variables to the Currie et al. (2013) specification, including WIC and Medicaid indicators, Toxics Release Inventory facility measures (as used in the main text), and plurality indicators. From our explorations (not shown), we found that the inclusion of plurality indicators changes our inference the most. While the point estimates for chemical MCL violations are stable (although the standard errors increase), the estimates for “any” violations are cut in half with the inclusion of these controls, and are insignificant in the new specification.
Finally, Appendix Table A28 report results from specifications that include the additional controls and remove plural births from the sample. This is the sample and specification that we use in all tables in this paper, except for tables A26 and A27 which we have just discussed. With this sample selection, the estimated effect of “any” MCL violations on pre-term birth rises back to roughly where it was earlier, and while the effects of “any” MCL violations on low birth weight rise, they remain statistically insignificant.
Table A25:
Sample means for all mothers and those exposed to MCL violations during gestation.
| (1) | (2) | (3) | (4) | (5) | |
|---|---|---|---|---|---|
| With MCL | Switchers | ||||
| Characteristic | All | Any | Chem | Any | Chem |
| No. of observations | 826218 | 26144 | 8243 | 56284 | 17138 |
| Low birth weight (≤ 2500g) | 0.096 | 0.082 | 0.080 | 0.074 | 0.071 |
| Preterm (≤ 36 weeks) | 0.112 | 0.096 | 0.097 | 0.087 | 0.086 |
| Mom’s age | |||||
| < 19 | 0.049 | 0.024 | 0.033 | 0.038 | 0.045 |
| 19 – 24 | 0.259 | 0.193 | 0.246 | 0.233 | 0.275 |
| 25 – 34 | 0.554 | 0.596 | 0.594 | 0.582 | 0.567 |
| 35+ | 0.139 | 0.186 | 0.127 | 0.148 | 0.113 |
| Mom’s race | |||||
| African-American | 0.212 | 0.127 | 0.092 | 0.155 | 0.128 |
| Hispanic | 0.070 | 0.034 | 0.034 | 0.048 | 0.042 |
| White, not hispanic | 0.723 | 0.821 | 0.878 | 0.792 | 0.839 |
| Mom’s education | |||||
| Less than highschool | 0.140 | 0.081 | 0.102 | 0.109 | 0.130 |
| Highschool only | 0.255 | 0.210 | 0.270 | 0.227 | 0.265 |
| Some college | 0.178 | 0.177 | 0.191 | 0.176 | 0.186 |
| College or more | 0.420 | 0.527 | 0.432 | 0.482 | 0.415 |
| Mother smokes | 0.215 | 0.191 | 0.249 | 0.203 | 0.246 |
| Mother is married | 0.589 | 0.700 | 0.671 | 0.655 | 0.633 |
| Mother on WIC/Medicaid | 0.471 | 0.350 | 0.418 | 0.401 | 0.451 |
| Mother moved | 0.467 | 0.570 | 0.495 | 0.633 | 0.584 |
Notes: Observations are births. Only mothers with multiple births are included. Switcher means births among the set of mothers with at least one birth experiencing and at least one not experiencing an MCL violation. “With MCL” contains births exposed to an MCL violation (i.e., at least one day overlap between the gestation period and an MCL violation in the water system at the time of birth).
Table A26:
Effects of MCL violations during gestation on birth outcomes – Replication of Currie et al. (2013).
| (1) | (2) | (3) | (4) | |
|---|---|---|---|---|
| OLS | IV | FE | IV and FE | |
| Panel A: Any MCL, Low Birth Weight | ||||
| Any MCL | −0.0025 (−1.3436) | 0.0051*** (2.6142) | −0.0020 (−0.9393) | 0.0035 (1.6122) |
| Panel B: Any MCL, Pre-term Birth | ||||
| Any MCL | −0.0044** (−2.1996) | 0.0058*** (2.6891) | −0.0029 (−1.2721) | 0.0047** (1.9725) |
| Panel C: Chem MCL, Low birth weight | ||||
| Chem MCL | −0.0061* (−1.9233) | −0.0018 (−0.5492) | −0.0009 (−0.2562) | 0.0020 (0.5456) |
| Panel D: Chem MCL, Pre-term Birth | ||||
| Chem MCL | −0.0081** (−2.3066) | −0.0014 (−0.3957) | −0.0007 (−0.1734) | 0.0042 (1.0172) |
| No. observations | 826218 | 826218 | 826218 | 826218 |
| Full-gestation IV | ✓ | ✓ | ||
| Mom FE | ✓ | ✓ | ||
Notes: t-statistics in parentheses
p < 0.10
p < 0.05
p < 0.01.
Each cell is a separate regression. Sample consists of mothers with multiple births. The independent variables “Any MCL” and “Chem MCL” are indicators for whether an MCL of the specified type occurred during the gestation period. Standard errors are clustered at the mother. “Full-gestation IV” instruments for actual exposure to an MCL violation with predicted exposure given a gestation period of exactly 39 weeks. All regressions include the control variables used in Currie et al. (2013).
Table A27:
Effects of MCL violations during gestation on birth outcomes – Adding additional controls (incl. plurality) to the Currie et al. (2013) model.
| (1) | (2) | (3) | (4) | |
|---|---|---|---|---|
| OLS | IV | FE | IV and FE | |
| Panel A: Any MCL, Low Birth Weight | ||||
| Any MCL | −0.0005 (−0.3015) | 0.0054*** (3.1734) | −0.0014 (−0.6822) | 0.0033 (1.6257) |
| Panel B: Any MCL, Pre-term Birth | ||||
| Any MCL | −0.0022 (−1.2310) | 0.0063*** (3.2887) | −0.0020 (−0.9140) | 0.0048** (2.1166) |
| Panel C: Chem MCL, Low birth weight | ||||
| Chem MCL | −0.0023 (−0.8163) | 0.0012 (0.4195) | 0.0002 (0.0586) | 0.0030 (0.8464) |
| Panel D: Chem MCL, Pre-term Birth | ||||
| Chem MCL | −0.0042 (−1.3469) | 0.0016 (0.4866) | 0.0005 (0.1364) | 0.0053 (1.3343) |
| No. observations | 826218 | 826218 | 826218 | 826218 |
| Full-gestation IV | ✓ | ✓ | ||
| Mom FE | ✓ | ✓ | ||
Notes: See notes to Table A26.
Table A28:
Effects of MCL violations during gestation on birth outcomes – Removing all plural births.
| (1) | (2) | (3) | (4) | |
|---|---|---|---|---|
| OLS | IV | FE | IV and FE | |
| Panel A: Any MCL, Low Birth Weight | ||||
| Any MCL | 0.0000 (0.0220) | 0.0048*** (3.2193) | −0.0006 (−0.3058) | 0.0039** (1.9761) |
| Panel B: Any MCL, Pre-term Birth | ||||
| Any MCL | −0.0009 (−0.5614) | 0.0063*** (3.7555) | 0.0000 (0.0164) | 0.0063*** (2.8921) |
| Panel C: Chem MCL, Low birth weight | ||||
| Chem MCL | −0.0022 (−0.8821) | 0.0009 (0.3669) | 0.0003 (0.0835) | 0.0033 (0.9750) |
| Panel D: Chem MCL, Pre-term Birth | ||||
| Chem MCL | −0.0040 (−1.4103) | 0.0015 (0.5061) | 0.0007 (0.2001) | 0.0055 (1.4425) |
| No. observations | 768228 | 768228 | 768228 | 768228 |
| Full-gestation IV | ✓ | ✓ | ||
| Mom FE | ✓ | ✓ | ||
Notes: See notes to table A26.
Footnotes
For example, in Beatty and Shimshack (2014), only three air pollutants were studied: carbon monoxide, ozone, and particulate matter (PM10).
Some papers have found null associations or mixed results for the association between arsenic and birth outcomes (Saha et al., 2012; Bozack et al., 2018).
Some papers have found null associations or mixed results for the association between atrazine and birth outcomes (Almberg et al., 2018).
Some papers have found null associations or mixed results for the association between DEHP and birth outcomes (Ferguson et al., 2019)
Some papers have found null associations or mixed results for the association between nitrate and birth outcomes (Albouy-Llaty et al., 2016; Ebdrup et al., 2022).
Some papers have found null associations or mixed results for the association between tetrachloroethylene and birth outcomes (Forand et al., 2012). Sonnenfeld et al. (2001) found at most a weak association between tetrachloroethylene exposure and birth outcomes. However, they found an association between exposure and birth weight for infants of older mothers and mothers with a history of fetal loss. Aschengrau et al. (2020) found tetrachloroethylene-contaminated drinking water to be associated with delayed time-to-pregnancy, and increased risks of placental abruption, stillbirths stemming from placental dysfunction, and certain brith defects. No associations were observed with pregnancy loss, birth weight, and gestational duration.
It is important to note that this is not how the regulations are written or enforced. Many violations are determined using aggregated distributional cut-offs, as opposed to the stringent approach we are using here limiting by individual samples that exceed the MCL.
It is unclear in Currie et al. (2013) how much overlap between an MCL violation and the gestation period is required for their definition of contamination; we use at least one day of overlap.
A full gestation length is considered to be 39 weeks. This is not to be confused with the minimum gestation length for the birth to not be classified as pre-term, which is 37 weeks.
Throughout this paper, we use “regulatory violation,” “MCL,” and “MCL violation” interchangeably to mean any action level or health-based violation, respectively, even for those where the violation is based upon an action level or MRDL.
These thresholds are based on single contaminant concentrations, as opposed to mixtures of contaminants or co- occurring contaminants. Most contaminants have a MCL, but lead has an action level, and disinfectants have MRDL. Moreover, regulations vary by contaminant over whether and how samples are averaged to determine violations. Treatment technique (TT) violations also occur when the water system does not implement corrective actions in response to contamination.
We discuss in the section “Discussion” issues that have been described in the literature regarding compliance, enforcement, and how our results shed light on particular policy responses.
We do include all 94 contaminants in this summary measure of water quality. See Appendix Table A2 for the full list of contaminants included. All appendices are available at the end of this article as it appears in JPAM online. Go to the publisher’s website and use the search engine to locate the article at http://onlinelibrary.wiley.com.
We discuss in the section “Reproductive Health Water Quality Index” and the Appendix “An Alternative Reproductive Health Index of Five Contaminants Based on Data Availability and Prior Research” the literature on drinking water and birth outcomes. Quasi-experimental literature is virtually non-existent except in the case of lead (Dave & Yang, 2022; D. S. Grossman & Slusky, 2019) or any MCL violation as in (Currie et al., 2013).
We discuss in the section “Reproductive Health Water Quality Index” and the Appendix section “An Alternative Reproductive Health Index of Five Contaminants Based on Data Availability and Prior Research” the creation of this index. The five contaminants include arsenic, atrazine, nitrate, Di(2-ethylhexyl) Phthalate (DEHP), and Tetrachloroethene (PERC).
Samples are taken at the system level prior to distribution and are not taken at the residence. Exceptions include lead and copper which are measured at a random selection of residences; however, it is a small subsample within the system and we do not have information on which residences.
See Appendix section “Removing Births With Any Samples Exceeding MCL.”
Exceptions include (Currie et al., 2013; Dave & Yang, 2022; Flynn & Marcus, 2023; D. S. Grossman & Slusky, 2019; Hill & Ma, 2022; Marcus, 2021).
In the Online Appendix section “Effects of MCL Violations on Birth Outcomes,” we estimate similar models to Currie et al. (2013) using our data in Pennsylvania from 2003 to 2014 and find consistent, but not statistically significant, effects (Appendix Tables A25 through A28).
I.e., the baby’s weight at birth was less than 2,500 grams.
I.e., gestation length of 36 weeks or less.
SGA is an indicator for birth weight being below the 10th percentile of the infant’s gestational week, while TBW is the birth weight (in grams) of babies that are not preterm, i.e., babies who “came to term.”
Currie et al. (2015) showed evidence that the impacts of TRI facilities are strongest within 1 km, however, emissions likely persist beyond 1 km. Hill and Ma (2022) included these additional distances to account for air pollution near the maternal residence. We follow this literature here given we are concerned about air pollution confounding our estimates of the impact of drinking water contamination.
Compliance periods vary by contaminant with more than 2/3 being 90 days. About 20% are 30 days and a few are annual or longer (min 27 days and max 1,095 days). Importantly, we do not have the date of the violation.
We received the data in database form from the department. Interested readers can download data from the PA Drinking Water Reporting System (DWRS): http://www.drinkingwater.state.pa.us/dwrs/HTM/Welcome.html
The frequency of these data are dependent on the monitoring requirements for the systems and contaminants of interest. Some contaminants are measured monthly, quarterly or annually (lead and copper are measured every 3 years) based upon the regulations in the Safe Drinking Water Act (US Environmental Protection Agency, 2019b). These data provide exact date of sample to match to pregnancy gestation periods.
xviii: Find these data here: https://www.pasda.psu.edu/download/dep/historic/PublicWaterSupply/. We used vintage 2016 month 4 for these analyses. We discuss concerns about boundaries changing over time in the section “Study Limitations” and Appendix section “Community Water System Boundaries and Linking Births to Water Quality.”
Entry points are locations in the CWS after treatment or chemical addition, if any, but prior to the distribution system. About 1% of the reported samples are reported as being of raw water; we remove those from the analysis.
We discuss data limitations and this issue in the section “Study Limitations” and Appendix section “Community Water System Boundaries and Linking Births to Water Quality.”
Other work has aggregated to the county and then controlled for the percent of the population served by public water (Almberg et al., 2018). While there remains potential measurement error in our approach, we believe it is an improvement.
Our method for measuring the water contamination faced by a particular birth is similar to an earlier epidemiological study on disinfectant byproducts (Bove et al., 1995), who measured contamination monthly and then take averages over the gestation period, weighting months equally. Our approach is the same, except we average over daily averages.
xxiii: The original Pennsylvania water sample data has 248 unique contaminant identifiers. We restrict the data to the 94 contaminants that are regulated under the Safe Drinking Water Act and for which we have regulatory MCL. These are listed in Appendix Table A2.
The last restriction depends on the contamination measure we are using as our independent variable, e.g., when we take overall contamination as the independent variable of interest, we trim at the 99th percentile for that measure.
See Correia (2015) for a discussion of the importance of eliminating singleton observations in order to obtain the correct standard errors in high-dimensional fixed-effects regressions.
Trimming the top 1% substantially improves the precision of our estimates, and without this trimming we would generally conclude that there is no significant effect of water contamination on birth outcomes, in contrast to our present conclusion of a precise effect (estimates without trimming are not shown).
xxvii: Note that it is a large movement in the conditional distribution of birth exposure after removing MCL violations and after trimming outliers, which are both data manipulations that limit this difference. See Table 1 for the quantiles of the underlying RRMCL variable.
xxviii: The regression models we use are linear, so the reader can multiply our estimates by these ratios (or different ones based on Table 1) to obtain the estimated effects for smaller changes as desired. To obtain the effect of a change in our overall contamination index of Δ units of RRMCL, simply multiply our estimates by Δ/0.15; similarly for chemical, multiply our estimates by Δ/0.13. For instance, to consider the effects of a change from 10th-to-90th within mother for the overall contamination index, multiply our estimates by 0.10/0.15 = 66%.
Based on this panel, the movement from the 10th to the 90th within-mother is about 0.10 RRMCL for our overall index (the difference between the 90th percentile of 0.05 and the 10th percentile of −0.05; note that since Panel C is the distribution of residuals after removing mother effects, the distribution is centered around zero but the units are still in RRMCL), or about 66% of the 10th-to-90th percentile movement we use, and 0.09 for our chemical index, or about 69% of the movement we use in that case. See prior footnote for scaling instructions.
They only find an association between MCL violations for nitrate-nitrite in this study.
We keep only births with contamination measures, trim at the 99th percentile for overall contamination, and drop singletons based on CWS and birth year-month fixed-effects (these restrictions were discussed in the previous subsection).
xxxii: In exploratory regressions (not shown), we found that systems with a greater share of non-Hispanic white mothers, and lower share served by WIC/Medicaid, are more likely to have violations. This is surprising and differs from findings of Currie et al. (2013). However, our sampling data-based measure of contamination exhibits different relationships: systems with lower share non-Hispanic White experience higher average levels of contamination by these measures, as do larger systems. These last results hold both before and after removing births exposed to violations.
xxxiii: These control variables include: mother’s age categories (19 to 24, 25 to 34, 35+); mother race (non-Hispanic White, non-Hispanic Black, Hispanic); mother’s education (less than high school, some college, college or more); risk factors for the pregnancy (including pre-pregnancy diabetes, gestational diabetes, pre-pregnancy hypertension, gestational hypertension, previous pre-term birth, previous poor pregnancy outcomes, vaginal bleeding, infertility treatment, previous cesarean); maternal smoking (counts of cigarettes smoked by term); parity indicators (number of previous successful live births and unsuccessful live births); mother is married; child is male; WIC participation; and payment type (Medicaid, private, self-pay).
xxxiv: These covariates are commonly used in the birth outcome literature. See Almond et al. (2018) for a review and Currie et al. (2013, 2015) and Hill and Ma (2022) for specific examples.
We are concerned about other environmental confounders that may be co-occurring with drinking water contamination, such as weather, temperature and air pollution. Our weather and temperature controls include maximum and minimum daily temperature, percentage of days in which the daily maximum is above 29.4◦C, percentage of days in which daily minimum is below 0◦C, average daily precipitation, percentage of days in which precipitation is over 0, percentage of days in which precipitation is over 0.25 millimeters, and maximum and minimum daily precipitation. Our TRI facility statistics, which proxy for air pollution (a potential confounder, even in our mother fixed-effects models), include the number of TRI facilities operating within 1, 3, and 5 kilometers of the maternal address in the year the infant was born, and the reported total onsite, offsite, and overall releases for TRI facilities weighted by the squared inverse distance between the facility and the mother’s residential location. We include these weather and air pollution controls following Currie et al. (2015) and Knittel et al. (2016).
xxxvi: An exception to using the same controls is that we use the overall share of plural births (twins or greater) rather than the shares of each plurality type.
xxxvii: Note that to be included in mother FE regressions we need mothers to have three or more births, with at least two of them with contamination measures: for example, for a mother with exactly three births, we observe our switching indicator for the first two births only, and then the mother’s fixed-effect absorbs another degree of freedom. The CWS FE model only requires two births.
xxxviii: For the distribution of the underlying result relative to MCL (RRMCL) measure, see Table 1. The section “Water Quality Index Construction Details” and footnote 29 provide details for alternative scaling.
xxxix: Specifically, we use Oster’s code available from the Boston College Statistical Software Components (SSC) archive under the Stata command -psacalc-. We modify this code to work with the Stata command -reghdfe-.
We use the recommendation of Oster (2016), based on an empirical analysis of randomized controlled trials published in prestigious economics journals, and assume a maximum R2 (the R2 we would obtain if all potential confounders were included) of 1.3 times the R2 estimate from the model with controls.
Note that a negative δ means that selection on unobservables would have to be the reverse of selection on observables to bring the effect to zero.
More details and discussion of our findings are in the Appendix given the focus of this paper on contamination compliant with regulatory standards.
xliii: Analogous to our approach at the individual birth level, we remove CWS-months for which an MCL violation occurred in the last 9 months, and we remove CWS-months above the 99th percentile of contamination (among all CWS-months in the data). However, the units of contamination used in this model are the same as in the previous regressions. Specifically, a one unit increase of the contamination measure reflects a change from the 10th to the 90th percentile in the overall index of drinking water contamination among births (rather than among CWS-months) after removing births above the 99th percentile.
In addition, we estimate our model with a limited sample of smaller CWS to address concerns about mismeasurement in large, complicated distribution systems (Appendix Tables A23 and A24).
More details about this issue are discussed in Appendix section “Community Water System Boundaries and Linking Births to Water Quality” with maps showing changes in CWS boundaries (Appendix Figures A1 through A3).
This is the stated purpose of the EPA’s Air Alerts: “The AQI informs the public about air quality in the area, tells who may be affected, and provides steps to take to reduce exposure when pollution levels are unhealthy” (EPA, 2023).
xlvii: We use births below the 99th percentile to ensure consistency with the rest of the paper; naturally, the letter grades would be different were we to include these outliers.
xlviii: Consistent with our approach elsewhere in the paper, we remove births exposed to contamination above the 99th percentile before conducting these counterfactual exercises. In this analysis, we kept births exposed to MCL violations, however.
Russell et al. (2007) calculated that each instance of LBW costs $14,500, on average, due to increased hospitalization and related medical services use ages 0 to 9 years. For each instance of PTB, Behrman et al., (2007) calculated the hospital costs to be $33,284. Back-of-the-envelope estimates suggest a savings of $57.4 million for LBW and $230.7 million for avoided PTB. These estimates are likely overstated because PTB babies are also more likely to be LBW and therefore could double count savings.
We found 94% of violations were reported to EPA. See Appendix section “Removing Births With Any Samples Exceeding MCL” for more details.
Allaire et al. (2019) suggested using bottled water purchases for surveillance to identify emerging water quality problems that might go undetected or unreported.
See the section “Reproductive Health Water Quality Index” and Appendix section “An Alternative Reproductive Health Index of Five Contaminants Based on Data Availability and Prior Research” for more details.
REFERENCES
- Albouy-Llaty M, Limousi F, Carles C, Dupuis A, Rabouan S, & Migeot V (2016). Association between exposure to endocrine disruptors in drinking water and preterm birth, taking neighborhood deprivation into account: A historic cohort study. International Journal of Environmental Research and Public Health, 13(8), 796. 10.3390/ijerph13080796 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allaire M, Mackay T, Zheng S, & Lall U (2019). Detecting community response to water quality violations using bottled water sales. Proceedings of the National Academy of Sciences, 116(42), 20917–20922. 10.1073/pnas.1905385116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allaire M, Wu H, & Lall U (2018). National trends in drinking water quality violations. Proceedings of the National Academy of Sciences of the United States of America, 115(9) 2078–2083. 10.1073/pnas.1719805115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Almberg KS, Turyk ME, Jones RM, Rankin K, Freels S, & Stayner LT (2018). Atrazine contamination of drinking water and adverse birth outcomes in community water systems with elevated atrazine in Ohio, 2006–2008. International Journal of Environmental Research and Public Health, 15(9), 1889. 10.3390/ijerph15091889 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Almond D, Chay KY, & Lee DS (2005). The costs of low birth weight. The Quarterly Journal of Economics, 120(3), 1031–1083. 10.1093/qje/120.3.1031 [DOI] [Google Scholar]
- Almond D, Currie J, & Duque V (2018). Childhood circumstances and adult outcomes: Act II. Journal of Economic Literature, 56(4), 1360–1446. 10.1257/jel.20171164 [DOI] [Google Scholar]
- Altonji JG, Elder TE, & Taber CR (2005). An evaluation of instrumental variable strategies for estimating the effects of Catholic Schooling. The Journal of Human Resources, 40(4), 791–821. http://www.jstor.org/stable/4129541 [Google Scholar]
- Ananat EO, & Hungerman DM (2012). The power of the pill for the next generation: Oral contraception’s effects on fertility, abortion, and maternal and child characteristics. Review of Economics and Statistics, 94(1), 37–51. 10.1162/FREST_a_00230 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aschengrau A, Gallagher LG, Winter M, Butler L, Patricia Fabian M, & Vieira VM (2018). Modeled exposure to tetrachloroethylene-contaminated drinking water and the occurrence of birth defects: A case-control study from Massachusetts and Rhode Island. Environmental Health, 17(1), 75. 10.1186/s12940-018-0419-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baker J, Bennear L, & Olmstead S (2022). Does information disclosure reduce drinking water violations in the United States? Journal of the Association of Environmental and Resource Economists, 10(3). 10.1086/722619 [DOI] [Google Scholar]
- Beatty TKM, & Shimshack JP (2014). Air pollution and children’s respiratory health: A cohort analysis. Journal of Environmental Economics and Management, 67(1), 39–57. 10.1016/j.jeem.2013.10.002 [DOI] [Google Scholar]
- Behrman RE, Butler AS, & others. (2007). Preterm birth: Causes, consequences, and prevention. Consensus Study Report, National Academies of Sciences, Engineering, and Medicine. https://nap.nationalacademies.org/catalog/11622/preterm-birth-causes-consequences-and-prevention [PubMed] [Google Scholar]
- Bennear LS, Jessoe KK, & Olmstead SM (2009). Sampling out: Regulatory avoidance and the total coliform rule. Environmental Science & Technology, 43(14), 5176–5182. 10.1021/es803115k [DOI] [PubMed] [Google Scholar]
- Bennear LS, & Olmstead SM (2008). The impacts of the “right to know”: Information disclosure and the violation of drinking water standards. Journal of Environmental Economics and Management, 56, 117–130. 10.1016/j.jeem.2008.03.002 [DOI] [Google Scholar]
- Black SE, Devereux PJ, & Salvanes KG (2007). From the cradle to the labor market? The effect of birth weight on adult outcomes. The Quarterly Journal of Economics, 122(1), 409–439. 10.1162/qjec.122.1.409 [DOI] [Google Scholar]
- Bove FJ, Fulcomer MC, Klotz JB, Esmart J, Dufficy EM, & Savrin JE (1995). Public drinking water contamination and birth outcomes. American Journal of Epidemiology, 141(9), 850–862. 10.1093/oxfordjournals.aje.a117521 [DOI] [PubMed] [Google Scholar]
- Bozack AK, Cardenas A, Quamruzzaman Q, Rahman M, Mostofa G, Christiani DC, & Kile ML (2018). DNA methylation in cord blood as mediator of the association between prenatal arsenic exposure and gestational age. Epigenetics, 13(9), 923–940. 10.1080/15592294.2018.1516453 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bradley PM, Kolpin DW, Romanok KM, Smalling KL, Focazio MJ, Brown JB, Cardon MC, Carpenter KD, Corsi SR, DeCicco LA, Dietze JE, Evans N, Furlong ET, Givens CE, Gray JL, Griffin DW, Higgins CP, Hladik ML, Iwanowicz LR, … Wilson VS (2018). Reconnaissance of mixed organic and inorganic chemicals in private and public supply tapwaters at selected residential and workplace sites in the United States. Environmental Science & Technology, 52(23), 13972–13985. 10.1021/acs.est.8b04622 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Braun JM, Gennings C, Hauser R, & Webster TF (2016). What can epidemiological studies tell us about the impact of chemical mixtures on human health? Environmental Health Perspectives, 124(1), A6–A9. 10.1289/Fehp.1510569 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Callaway B, Goodman-Bacon A, & SantAnna PH (2021). Difference-in-differences with a continuous treatment. ArXiv Preprint ArXiv:2107.02637. 10.48550/arXiv.2107.02637 [DOI] [Google Scholar]
- Callaway B, & SantAnna PH (2021). Difference-in-differences with multiple time periods. Journal of Econometrics, 225(2), 200–230. 10.1016/j.jeconom.2020.12.001 [DOI] [Google Scholar]
- Christensen P, Keiser DA, & Lade GE (2023). Economic effects of environmental crises: Evidence from Flint, Michigan. American Economic Journal: Economic Policy, 15(1), 196–232. 10.1257/pol.20190391 [DOI] [Google Scholar]
- Clay K, Portnykh M, & Severnini E (2021). Toxic truth: Lead and fertility. Journal of the Association of Environmental and Resource Economists, 8(5), 975–1012. 10.1086/714351 [DOI] [Google Scholar]
- Coffman VR, Søndergaard Jensen A, Trabjerg BB, Pedersen CB, Hansen B, Sigsgaard T, Olsen J, Schullehner J, Pedersen M, & Stayner LT (2022). Prenatal exposure to nitrate from drinking water and the risk of preterm birth: A Danish nationwide cohort study. Environmental Epidemiology (Philadelphia, Pa.), 6(5), e223. 10.1097/EE9.0000000000000223 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Correia S (2015). Singletons, cluster-robust standard errors and fixed effects: A bad mix. Duke University. [Google Scholar]
- Currie J, Davis L, Greenstone M, & Walker R (2015). Environmental health risks and housing values: Evidence from 1,600 toxic plant openings and closings. American Economic Review, 105(2), 678–709. 10.1257/aer.20121656 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Currie J, Graff Zivin J, Meckel K, Neidell M, & Schlenker W (2013). Something in the water: Contaminated drinking water and infant health. Canadian Journal of Economics/Revue Canadienne d’économique, 46(3), 791–810. 10.1111/caje.12039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Currie J, & Moretti E (2007). Biology as destiny? Short- and long-run determinants of intergenerational transmission of birth weight. Journal of Labor Economics, 25(2), 231–264. 10.1086/511377 [DOI] [Google Scholar]
- Currie J, & Walker R (2011). Traffic congestion and infant health: Evidence from E-ZPass. American Economic Journal: Applied Economics, 3(1), 65–90. 10.1257/app.3.1.65 [DOI] [Google Scholar]
- Currie J, & Walker R (2019). What do economists have to say about the Clean Air Act 50 Years after the establishment of the Environmental Protection Agency? Journal of Economic Perspectives, 33(4), 3–26. 10.1257/jep.33.4.3 [DOI] [Google Scholar]
- Danagoulian S, Grossman D, & Slusky D (2022). Health care following environmental disasters: Evidence from Flint. Journal of Policy Analysis and Management, 41(4), 1060–1089. 10.1002/pam.22391 [DOI] [Google Scholar]
- Danagoulian S, & Jenkins D (2021). Rolling back the gains: Maternal stress undermines pregnancy health after Flint’s water switch. Health Economics, 30(3), 564–584. 10.1002/hec.4210 [DOI] [PubMed] [Google Scholar]
- Davalos AD, Luben TJ, Herring AH, & Sacks JD (2017). Current approaches used in epidemiologic studies to examine short-term multipollutant air pollution exposures. Annals of Epidemiology, 27(2), 145–153.e1. 10.1016/j.annepidem.2016.11.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dave DM, & Yang M (2022). Lead in drinking water and birth outcomes: A tale of two water treatment plants. Journal of Health Economics, 84, 102644. 10.1016/j.jhealeco.2022.102644 [DOI] [PubMed] [Google Scholar]
- De Chaisemartin C, & dHaultfoeuille X (2020). Two-way fixed effects estimators with heterogeneous treatment effects. American Economic Review, 110(9), 2964–2996. 10.1257/aer.20181169 [DOI] [Google Scholar]
- Ebdrup NH, Schullehner J, Knudsen UB, Liew Z, Thomsen AML, Lyngsø J, Bay B, Arendt LH, Clemmensen PJ, Sigsgaard T, Hansen B, & Ramlau-Hansen CH (2022). Drinking water nitrate and risk of pregnancy loss: A nationwide cohort study. Environmental Health: A Global Access Science Source, 21(1), 87. 10.1186/s12940-022-00897-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ferguson KK, McElrath TF, & Meeker JD (2014). Environmental phthalate exposure and preterm birth. JAMA Pediatrics, 168(1), 61–67. 10.1001/jamapediatrics.2013.3699 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ferguson KK, Rosen EM, Rosario Z, Feric Z, Calafat AM, McElrath TF, Vega CV, Cordero JF, Alshawabkeh A, & Meeker JD (2019). Environmental phthalate exposure and preterm birth in the PROTECT birth cohort. Environment International, 132, 105099. 10.1016/j.envint.2019.105099 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Figlio DN, & Lucas ME (2004). What’s in a grade? School report cards and the housing market. American Economic Review, 94(3), 591–604. 10.1257/0002828041464489 [DOI] [Google Scholar]
- Flynn P, & Marcus M (2023). A watershed moment: The Clean Water Act and birth weight. Journal of Human Resources. 10.3368/jhr.0622-12369R2 [DOI] [Google Scholar]
- Forand SP, Lewis-Michl EL, & Gomez MI (2012). Adverse birth outcomes and maternal exposure to trichloroethylene and tetrachloroethylene through soil vapor intrusion in New York State. Environmental Health Perspectives, 120(4), 616–621. 10.1289/ehp.1103884 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gennings C, Shu H, Rudén C, Öberg M, Lindh C, Kiviranta H, & Bornehag C-G (2018). Incorporating regulatory guideline values in analysis of epidemiology data. Environment International, 120, 535–543. 10.1016/j.envint.2018.08.039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gilliom RJ, Barbash JE, Crawford CG, Hamilton PA, Martin JD, Nakagaki N, Nowell LH, Scott JC, Stackelberg PE, Thelin GP, & Wolock DM (2006). Pesticides in the Nation’s Streams and Ground Water, 1992–2001. In Circular (1291). U.S. Geological Survey. 10.3133/cir1291 [DOI] [Google Scholar]
- González L (2013). The effect of a universal child benefit on conceptions, abortions, and early maternal labor supply. American Economic Journal: Economic Policy, 5(3), 160–188. 10.1257/pol.5.3.160 [DOI] [Google Scholar]
- Goodman-Bacon A (2021). Difference-in-differences with variation in treatment timing. Journal of Econometrics, 225(2), 254–277. 10.1016/j.jeconom.2021.03.014 [DOI] [Google Scholar]
- Govarts E, Iszatt N, Trnovec T, de Cock M, Eggesbø M, Palkovicova Murinova L, van de Bor M, Guxens M, Chevrier C, Koppen G, Lamoree M, Hertz-Picciotto I, Lopez-Espinosa M-J, Lertxundi A, Grimalt JO, Torrent M, Goñi-Irigoyen F, Vermeulen R, Legler J, & Schoeters G (2018). Prenatal exposure to endocrine disrupting chemicals and risk of being born small for gestational age: Pooled analysis of seven European birth cohorts. Environment International, 115, 267–278. 10.1016/j.envint.2018.03.017 [DOI] [PubMed] [Google Scholar]
- Grooms KK (2016). Does water quality improve when a safe drinking water act violation is issued? A study of the effectiveness of the SDWA in California. BE Journal of Economic Analysis & Policy, 16(1), 1–23. 10.1515/bejeap-2014-0205 [DOI] [Google Scholar]
- Grossman DS, & Slusky DJ (2019). The impact of the Flint water crisis on fertility. Demography, 56(6), 2005–2031. 10.1007/s13524-019-00831-0 [DOI] [PubMed] [Google Scholar]
- Guldi M (2008). Fertility effects of abortion and birth control pill access for minors. Demography, 45, 817–827. 10.1353/Fdem.0.0026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hill EL, & Ma L (2022). Drinking water, fracking, and infant health. Journal of Health Economics, 82, 102595. 10.1016/j.jhealeco.2022.102595 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keiser DA, & Shapiro JS (2019a). Consequences of the Clean Water Act and the demand for water quality. The Quarterly Journal of Economics, 134(1), 349–396. 10.1093/qje/qjy019 [DOI] [Google Scholar]
- Keiser DA, & Shapiro JS (2019b). US water pollution regulation over the past half century: Burning waters to crystal springs? Journal of Economic Perspectives, 33(4), 51–75. 10.1257/jep.33.4.51 [DOI] [Google Scholar]
- Kile ML, Cardenas A, Rodrigues E, Mazumdar M, Dobson C, Golam M, Quamruzzaman Q, Rahman M, & Christiani DC (2016). Estimating effects of arsenic exposure during pregnancy on perinatal outcomes in a Bangladeshi cohort. Epidemiology, 27(2), 173–181. 10.1097/EDE.0000000000000416 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Knittel CR, Miller DL, & Sanders NJ (2016). Caution, drivers! Children present: Traffic, pollution, and infant health. Massachusetts Institute of Technology Press. https://dspace.mit.edu/handle/1721.1/113913 [Google Scholar]
- Manassaram DM, Backer LC, & Moll DM (2006). A review of nitrates in drinking water: Maternal exposure and adverse reproductive and developmental outcomes. Environmental Health Perspectives, 114(3), 320–327. 10.1289/ehp.8407 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marcus M (2021). Going beneath the surface: Petroleum pollution, regulation, and health. American Economic Journal: Applied Economics, 13(1), 1–37. 10.1257/app.20190130 [DOI] [Google Scholar]
- Marcus M (2022). Testing the water: Drinking water quality, public notification, and child outcomes. The Review of Economics and Statistics, 104(6), 1289–1303. 10.1162/rest_a_01006 [DOI] [Google Scholar]
- Mashau F, Ncube EJ, & Voyi K (2018). Drinking water disinfection by-products exposure and health effects on pregnancy outcomes: A systematic review. Journal of Water and Health, 16(2), 181–196. 10.2166/wh.2018.167 [DOI] [PubMed] [Google Scholar]
- McDonald YJ, Anderson KM, Caballero MD, Ding KJ, Fisher DH, Morkel CP, & Hill EL (2022). A systematic review of geospatial representation of United States community water systems. AWWA Water Science, 4(1), e1266. 10.1002/aws2.1266 [DOI] [Google Scholar]
- McDonald YJ, & Jones NE (2018). Drinking water violations and environmental justice in the United States, 2011–2015. American Journal of Public Health, 108(10), 1401–1407. 10.2105/ajph.2018.304621 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Migeot V, Albouy-Llaty M, Carles C, Limousi F, Strezlec S, Dupuis A, & Rabouan S (2013). Drinking-water exposure to a mixture of nitrate and low-dose atrazine metabolites and small-for-gestational age (SGA) babies: A historic cohort study. Environmental Research, 122, 58–64. 10.1016/j.envres.2012.12.007 [DOI] [PubMed] [Google Scholar]
- Murphy EA, Post GB, Buckley BT, Lippincott RL, & Robson MG (2012). Future challenges to protecting public health from drinking-water contaminants. Annual Review of Public Health, 33, 209–224. 10.1146/Fannurev-publhealth-031811-124506 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Myers SL, Lobdell DT, Liu Z, Xia Y, Ren H, Li Y, Kwok RK, Mumford JL, & Mendola P (2010). Maternal drinking water arsenic exposure and perinatal outcomes in inner Mongolia, China. Journal of Epidemiology and Community Health, 64(4), 325–329. 10.1136/jech.2008.084392 [DOI] [PubMed] [Google Scholar]
- Oakes M, Baxter L, & Long TC (2014). Evaluating the application of multipollutant exposure metrics in air pollution health studies. Environment International, 69, 90–99. 10.1016/j.envint.2014.03.030 [DOI] [PubMed] [Google Scholar]
- Oreopoulos P, Stabile M, Walld R, & Roos L (2008). Short-, medium-, and long-term consequences of poor infant health an analysis using siblings and twins. Journal of Human Resources, 43(1), 88–138. https://www.jstor.org/stable/40057340 [Google Scholar]
- Oster E (2016). Unobservable selection and coefficient stability: Theory and evidence. Journal of Business & Economic Statistics, 0(0), 1–18. 10.1080/07350015.2016.1227711 [DOI] [Google Scholar]
- Porpora MG, Piacenti I, Scaramuzzino S, Masciullo L, Rech F, & Benedetti Panici P (2019). Environmental contaminants exposure and preterm birth: A systematic review. Toxics, 7(1), 11. 10.3390/toxics7010011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rahman ML, Valeri L, Kile ML, Mazumdar M, Mostofa G, Qamruzzaman Q, Rahman M, Baccarelli A, Liang L, Hauser R, & Christiani DC (2017). Investigating causal relation between prenatal arsenic exposure and birthweight: Are smaller infants more susceptible? Environment International, 108, 32–40. 10.1016/j.envint.2017.07.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rinsky JL, Hopenhayn C, Golla V, Browning S, & Bush HM (2012). Atrazine exposure in public drinking water and preterm birth. Public Health Reports, 127(1), 72–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rohr JR, & McCoy KA (2010). A qualitative meta-analysis reveals consistent effects of atrazine on freshwater fish and amphibians. Environmental Health Perspectives, 118(1), 20–32. 10.1289/ehp.0901164 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Royer H (2009). Separated at girth: US twin estimates of the effects of birth weight. American Economic Journal: Applied Economics, 1(1), 49–85. 10.1257/app.1.1.49 [DOI] [Google Scholar]
- Russell RB, Green NS, Steiner CA, Meikle S, Howse JL, Poschman K, Dias T, Potetz L, Davidoff MJ, Damus K, & others. (2007). Cost of hospitalization for preterm and low birth weight infants in the United States. Pediatrics, 120(1), e1–e9. 10.1542/peds.2006-2386 [DOI] [PubMed] [Google Scholar]
- Saha KK, Engström A, Hamadani JD, Tofail F, Rasmussen KM, & Vahter M (2012). Pre- and postnatal arsenic exposure and body size to 2 years of age: A cohort study in rural Bangladesh. Environmental Health Perspectives, 120(8), 1208–1214. 10.1289/ehp.1003378 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sanders AP, Desrosiers TA, Warren JL, Herring AH, Enright D, Olshan AF, Meyer RE, & Fry RC (2014). Association between arsenic, cadmium, manganese, and lead levels in private wells and birth defects prevalence in North Carolina: A semi-ecologic study. BMC Public Health, 14, 955. 10.1186/1471-2458-14-955 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Savitz DA, Singer PC, Herring AH, Hartmann KE, Weinberg HS, & Makarushka C (2006). Exposure to drinking water disinfection by-products and pregnancy loss. American Journal of Epidemiology, 164(11), 1043–1051. 10.1093/aje/kwj300 [DOI] [PubMed] [Google Scholar]
- Schlenker W, & Roberts MJ (2009). Nonlinear temperature effects indicate severe damages to US crop yields under climate change. Proceedings of the National Academy of Sciences, 106(37), 15594–15598. 10.1073/pnas.0906865106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sherris AR, Baiocchi M, Fendorf S, Luby SP, Yang W, & Shaw GM (2021). Nitrate in drinking water during pregnancy and spontaneous preterm birth: A retrospective within-mother analysis in California. Environmental Health Perspectives, 129(5), 57001. 10.1289/EHP8205 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Snowden JM, Reid CE, & Tager IB (2015). Framing air pollution epidemiology in terms of population interventions, with applications to multi-pollutant modeling. Epidemiology, 26(2), 271–279. 10.1097/EDE.0000000000000236 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sonnenfeld N, Hertz-Picciotto I, & Kaye WE (2001). Tetrachloroethylene in drinking water and birth outcomes at the US Marine Corps Base at Camp Lejeune, North Carolina. American Journal of Epidemiology, 154(10), 902–908. 10.1093/aje/154.10.902 [DOI] [PubMed] [Google Scholar]
- Stayner LT, Almberg K, Jones R, Graber J, Pedersen M, & Turyk M (2017). Atrazine and nitrate in drinking water and the risk of preterm delivery and low birth weight in four Midwestern states. Environmental Research, 152, 294–303. 10.1016/j.envres.2016.10.022 [DOI] [PubMed] [Google Scholar]
- United States Environmental Protection Agency. (2006). IDL-MDL-PQL “What the “L” is going on? What does all this alphabet soup really mean? https://www.epa.gov/sites/production/files/2015-06/documents/whatthel.pdf
- United States Environmental Protection Agency. (2013). Toxics Release Inventory (TRI) program. https://www.epa.gov/toxics-release-inventory-tri-program
- United States Environmental Protection Agency. (2018). National primary drinking water regulations. https://www.epa.gov/ground-water-and-drinking-water/national-primary-drinking-water-regulations
- United States Environmental Protection Agency. (2019a). Population served by community water systems with no reported violations of health-based standards. https://cfpub.epa.gov/roe/indicator.cfm?i=45
- United States Environmental Protection Agency. (2019b). Title XIV of The Public Health Service Act: Safety of public water systems (Safe Drinking Water Act). https://www.epa.gov/sdwa/title-xiv-public-health-service-act-safety-public-water-systems-safe-drinking-water-act-0
- United States Environmental Protection Agency. (2023, June 26). Wildfire smoke and your patients’ health: The Air Quality Index. https://www.epa.gov/wildfire-smoke-course/wildfire-smoke-and-your-patients-health-air-quality-index
- Wen X-J, Balluz L, & Mokdad A (2009). Association between media alerts of air quality index and change of outdoor activity among adult asthma in six states, BRFSS, 2005. Journal of Community Health, 34(1), 40–46. 10.1007/s10900-008-9126-4 [DOI] [PubMed] [Google Scholar]
- Werner RM, Konetzka RT, & Polsky D (2016). Changes in consumer demand following public reporting of summary quality ratings: An evaluation in nursing homes. Health Services Research, 51, 1291–1309. 10.1111/1475-6773.12459 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wigle DT, Arbuckle TE, Turner MC, Bérubé A, Yang Q, Liu S, & Krewski D (2008). Epidemiologic evidence of relationships between reproductive and child health outcomes and environmental chemical contaminants. Journal of Toxicology and Environmental Health. Part B, Critical Reviews, 11(5–6), 373–517. 10.1080/10937400801921320 [DOI] [PubMed] [Google Scholar]
- Yang M, & Chou S-Y (2018). The impact of environmental regulation on fetal health: Evidence from the shutdown of a coal-fired power plant located upwind of New Jersey. Journal of Environmental Economics and Management, 90, 269–293. 10.1016/j.jeem.2018.05.005 [DOI] [Google Scholar]
- Young HA, Kolivras KN, Krometis L-AH, Marcillo CE, & Gohlke JM (2022). Examining the association between safe drinking water act violations and adverse birth outcomes in Virginia. Environmental Research, 114977. 10.1016/j.envres.2022.114977 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zivin JG, & Neidell M (2009). Days of haze: Environmental information disclosure and intertemporal avoidance behavior. Journal of Environmental Economics and Management, 58(2), 119–128. 10.1016/j.jeem.2009.03.001 [DOI] [Google Scholar]
- Zivin JG, Neidell M, & Schlenker W (2011). Water quality violations and avoidance behavior: Evidence from bottled water consumption. American Economic Review, 101(3), 448–453. 10.1257/aer.101.3.448 [DOI] [Google Scholar]
