Abstract
This study quantitatively reviews the hedonic literature examining surface water quality to assess how attributes of the commodity, housing market, and methodological choices lead to variation in the significance and expected sign of the estimated property value effects (i.e., elasticities). We conduct a meta-analysis of 29 studies with 290 unique estimates, published or released between 1985 and 2017, and find evidence based on probit meta-regression models that some of the definitions and decisions made in primary studies do influence the estimated relationship between water quality and home prices. Our most robust evidence suggests that methodological choices (e.g., accounting for spatial dependence, or if the water quality measure was based on something other than in situ measurement) have a critical role in determining the likelihood of finding a significant and theoretically expected result; and perhaps most importantly, it is not always selections that reflect best practices that lead to this finding. This study can help identify potential concerns with data and modeling choices in the collective hedonic literature focused on water quality.
Keywords: Hedonic model, Meta-analysis, Property value, Water pollution, Water quality
1. Introduction
The number of hedonic property value studies continues to grow as data on sales and house attributes, including surrounding environmental conditions, become more detailed and readily accessible (Bishop et al., 2020; Guignet and Lee, 2021; Petrolia et al., 2021). This growth in the literature provides an opportunity to analyze how key decisions made by researchers, including the definition of the environmental commodity, housing market, and methodological choices affect the results from hedonic property value models. This paper focuses on hedonic models that examine the capitalization of water quality in housing values. Although several reviews of the hedonic property value literature exist (e.g., Boyle and Kiel, 2001; Kiel, 2006; Wilde et al., 2012), only Nicholls and Crompton (2018) review the hedonic literature focused specifically on water quality. Our study goes beyond their narrative review and empirically analyzes how primary study choices can affect the capitalization of water quality into home values. The resulting quantitative generalizations from our meta-analysis provide useful insights regarding the collective hedonic literature focused on water quality.
Meta-analysis uses a variety of statistical approaches to analyze previously reported scientific results and draw broader conclusions (Stanley, 2001; Nelson and Kennedy, 2009; Stanley and Doucouliagos, 2012). It can be used for purposes of benefit transfer, and/or to draw inferences from the collective body of literature (Boyle and Wooldridge, 2018). Our prior analysis of these meta-data focused on 18 hedonic studies that examined the relationship between house prices and water clarity (see Guignet et al., 2022). The expressed purpose was for benefit transfer. The focus for the current study is to draw inferences from the hedonic literature focused on water quality more broadly, using a larger meta-dataset of 29 studies, and that includes numerous measures of water quality (not just clarity).
The capitalization of water quality into waterfront home values may vary not only due to variations in the environmental commodity itself, but also variation in study design and methodological choices. A meta-regression model allows one to systematically examine how such variation impacts the primary study results. Often the dependent variable in a meta-regression model is the estimated effect size or summary statistic of interest from the primary studies (e.g., willingness to pay or elasticities). The dependent variable can also represent whether the effect size is significant and has a theoretically consistent or expected sign (e.g., Smith and Huang, 1993; Wehkamp et al., 2018; Ohlendorf et al., 2021). The inclusion of various independent variables representing different study characteristics such as methodology, data, and functional form, allow one to identify the effect of these choices on the primary study results (Stanley, 2001).
There are numerous applications of meta-analysis in the environmental economics literature, and in particular in the realm of non-market valuation (Nelson and Kennedy, 2009). Previous meta-analyses related to economic valuation also consider methodological choices, but many still focus on improving benefit transfer (e.g., Rosenberger and Johnston, 2009; Schütt, 2021; Guignet et al., 2022). Including methodological attributes in valuation meta-analyses have been shown to avoid problems of omitted variable bias (e.g., Johnston et al., 2006a; Johnston et al., 2006b; Stapler and Johnston, 2009).
Insight for our paper was provided by Smith and Huang (1993), who estimated meta-regressions examining the effects of air pollution on property values, Kuminoff et al. (2010) who examined how omitted variables affect hedonic models, and three recent “best practices” articles (Taylor, 2017; Bishop et al., 2020; Guignet and Lee, 2021). Smith and Huang (1993), who had the same objectives as this paper, evaluated how study characteristics related to finding a negative and theoretically consistent relationship between air pollution and property sales. They found that using actual sales price (as opposed to census data or assessed values), assuming a linear hedonic model, and using more than one air pollution measure, all decreased the likelihood of finding a significant and theoretically expected result. Market conditions, as measured by home vacancy rates, were also included in their meta-analysis. Kuminoff et al. (2010: p. 157) ran an internal meta-analysis with their set of simulations to assess how omitted variables affect marginal implicit price estimates. They suggest controlling for functional form, sample size, whether the primary study controls for omitted variables, and whether the study estimates time-constant implicit prices, are important when estimating a meta-regression.
The three “best practices” articles play an important role in ensuring we account for recommended modeling choices in our meta-analysis and allow us to compare the differences in results to those recommended practices. The objective of our meta-analysis is not to necessarily promote hedonic modeling choices that lead to significant and expected results. Rather, we aim to identify how commodity, market, and methodological choices made by researchers affect the collective results found in the hedonic property value literature.
We find that, all else constant, the decisions made by primary study researchers can greatly influence the estimated relationship between water quality and home prices. In particular, the results reveal robust evidence that methodological choices – like whether spatial dependence is accounted for, whether actual transaction prices are used, or if the water quality measure is based on in situ measurements – have a critical role in determining the likelihood of finding a significant and theoretically expected result; and perhaps most importantly, these findings do not always align with study decisions that reflect best practices. Our study, therefore, provides information that identifies potential concerns with data and modeling choices in hedonic studies focused on water quality.
2. Meta-dataset
2.1. Meta-dataset development and structure
The complete meta-dataset contains a comprehensive set of hedonic property value studies that examined objective measures of surface water quality in the US and were published or released between 1979–2017.1 We developed the meta-dataset based on guidelines by Stanley et al. (2013), and the subsequent update by Havránek et al. (2020). Studies were not limited to a single or “preferred” observation (Boyle and Wooldridge, 2018); there are sometimes numerous observations from a single study. Such a clustered (or panel) structure and within study variation of specification choices is advantageous given our objective of identifying how primary study assumptions affect the hedonic results.
For the current study, we include property value studies examining a variety of water quality measures, not just water clarity (i.e., Secchi disk depth). This means we cannot focus on a common effect size, such as elasticity,2 because the different water quality measures make it difficult to define a consistent metric. A 1% change in Secchi disk depth can mean something very different compared to a 1% change in chlorophyll a concentrations or fecal coliform counts. Therefore, price elasticities with respect to these different water quality metrics cannot necessarily be pooled and directly compared. At the same time, we lack enough observations (outside of water clarity) to constrain this analysis to only a single water quality measure.
Our protocol for identifying primary studies for inclusion in the meta-dataset is described in the supplementary material. In total, 65 studies in the peer-reviewed and gray literature were identified for potential inclusion. Focus was then drawn to the subset of studies that used an objective water quality measure, were inside the US, were not redundant with a later peer-reviewed publication (e.g., an earlier working paper), and were a primary hedonic study (i.e., not a literature review or another meta-analysis). These screening criteria led to the exclusion of 29 studies. The remaining 36 primary studies in the meta-dataset include both waterfront and non-waterfront observations. From the 36, we select the 29 primary studies and the 290 unique house price elasticity estimates that correspond to waterfront homes.3 This final set of studies includes results from both the published and gray literature. Twenty-one studies are from published peer-reviewed journals, three are master’s or PhD theses, two are government reports, one is a book chapter, one is from a presentation, and one is a working paper.
A variety of water quality measures have been examined in the hedonic literature; 17 different metrics are observed in our meta-data. For each measure, we have an estimate of the house price elasticity with respect to water quality, as well as an estimated standard error. Hedonic studies use a variety of functional forms, and hence report their findings in different ways (e.g., elasticities versus semi-elasticities). We convert the primary study coefficient estimates to common elasticities based on study specific model-by-model derivations. The corresponding standard errors are then computed based on Monte Carlo simulations. These simulations are performed separately for each of the 290 meta-observations across the 29 included studies.4
Some of the elasticities represent improvements in water quality, while the majority of elasticities relate to a degradation in quality (e.g., higher fecal coliform indicates lower water quality). The elasticities for Secchi disk depth, percent water visibility, and dissolved oxygen are all generally expected to be positive because increases in these three measures are often considered improvements. Table 1 lists each of the 17 water quality measures used in the primary studies, the expected sign of the relationships with house prices, the corresponding primary studies, and the number of observations pertaining to each water quality measure in our final meta-dataset.
Table 1.
Water quality measures, expected sign, list of primary studies, and number of meta-observations.
Expected Sign is based on the original study’s water quality measure and whether increases in the water quality measure generally denote a degradation or improvement.
Using the computed standard errors, we calculate whether the estimated elasticity for each meta-observation is significantly different from zero. With the expected sign of the elasticity based on the water quality measure and the estimated standard errors, we characterize whether each elasticity estimate is significant and of the theoretically expected sign (Smith and Huang, 1993; Wehkamp et al., 2018; Ohlendorf et al., 2021). Both the estimation of the standard errors and the critical value rely on the original study’s sample size. The conversion of water quality elasticities into a binary variable, Y, does admittedly result in a loss of information about the size of the effect. However, as discussed earlier, such a conversion is necessary in order to pool meta-observations based on very different measures of water quality, and ultimately to draw general conclusions from the broader literature.
The dependent variable in our main model uses p < 0.05 from a standard two-tailed t-test to determine if the estimated elasticity is statistically significant.5 About 52% of the estimated elasticities are considered significant and have the expected sign (151 observations). There are 129 observations that are statistically equal to zero (83 of those insignificant observations have the expected sign). The primary objective of our quantitative review is to examine under what conditions the hedonic model tends to yield the expected results, focusing specifically on the commodity attributes, market characteristics, and methodological choices discussed next.
2.2. Commodity, market, and methodological characteristics
While the definition of the water quality commodity and the market definition are often chosen by the researcher to match the primary study objectives, and are sometimes constrained by the availability of both water quality and property sales data, the methodological and estimation choices made by researchers can have an important influence on a given study’s results. Including methodological characteristics in the meta-analysis is motivated in part to help understand the importance of these decisions. The role of the publication process and potential associations with the likelihood of observing a statistically significant and expected result are also discussed.
Descriptive statistics can be found in Table 2. Many of the variables are binary indicators or dummy variables. The 29 studies included in the meta-data were published between 1985 and 2017 and provided elasticity estimates ranging from −2.64 to 8.32.
Table 2.
Definition of variables and summary statistics.
| Variable | Mean | Standard Deviation | Minimum | Maximum |
|---|---|---|---|---|
| Elasticity | 0.0876 | 0.7002 | −2.6376 | 8.3202 |
| Dependent variable: | ||||
| Y (=1 if elasticity is significant with a p-value <0.05 and has the theoretically expected sign) | 0.5207 | 0.5004 | 0 | 1 |
| Environmental commodity variables: | ||||
| Lake or Reservoira (=1 if observation is for lake or reservoir) | 0.5724 | 0.4956 | 0 | 1 |
| Estuary (=1 if observation is for estuary) | 0.3379 | 0.4738 | 0 | 1 |
| River (=1 if observation is for river) | 0.0897 | 0.2862 | 0 | 1 |
| Claritya (=1 if water quality measure in hedonic study relates to clarity, see Table 1) | 0.6172 | 0.4869 | 0 | 1 |
| Nutrients (=1 if water quality measure in hedonic study relates to nutrients, see Table 1) | 0.1276 | 0.3342 | 0 | 1 |
| Sediment (=1 if water quality measure in hedonic study relates to sediment, see Table 1) | 0.0517 | 0.2219 | 0 | 1 |
| Bacteria (=1 if water quality measure in hedonic study relates to bacteria, see Table 1) | 0.1414 | 0.3490 | 0 | 1 |
| Biochemical (=1 if water quality measure in hedonic study relates to biochemical, see Table 1) | 0.0621 | 0.2417 | 0 | 1 |
| Study area and market characteristics: | ||||
| Northeasta (=1 if observation is from northeast regionb) | 0.3034 | 0.4605 | 0 | 1 |
| Midwest (=1 if observation is from midwest regionb) | 0.1793 | 0.3843 | 0 | 1 |
| West (=1 if observation is from west regionb) | 0.1241 | 0.3303 | 0 | 1 |
| South (=1 if observation is from south regionb) | 0.3931 | 0.4893 | 0 | 1 |
| Mean House Price (thousands, 2018$)c (average house price updated to 2018$) | 317.9502 | 220.6630 | 8.0196 | 1245.9600 |
| Single Subcountya (=1 if observation uses a sample constrained to county, but does not use all property sales) | 0.7276 | 0.4460 | 0 | 1 |
| Multiple Counties/Subcounties (=1 if observation uses a sample from multiple counties or multiple subcounties) | 0.2724 | 0.4460 | 0 | 1 |
| Sample Yearsc (number of years in the sample) | 8.8586 | 5.0435 | 1 | 24 |
| Bubble (=1 if sample includes any of the years 2006–2009) | 0.3655 | 0.4824 | 0 | 1 |
| Methodological variables: | ||||
| Double-loga (=1 if observation comes from a double-log specification) | 0.3172 | 0.4662 | 0 | 1 |
| Log-lineara (=1 if observation comes from a log-linear specification) | 0.2966 | 0.4575 | 0 | 1 |
| Linear-loga (=1 if observation comes from a linear-log specification) | 0.2759 | 0.4477 | 0 | 1 |
| Log-quadratica (=1 if observation comes from a log-quadratic specification) | 0.0448 | 0.2073 | 0 | 1 |
| Linear (=1 if observation comes from a linear specification) | 0.0655 | 0.2479 | 0 | 1 |
| No Spatial Methoda (=1 if observation was derived from a model using no spatial method) | 0.4138 | 0.4934 | 0 | 1 |
| Spatial Fixed Effects (=1 if observation was derived from a model that included spatial fixed effects) | 0.2897 | 0.4544 | 0 | 1 |
| Spatial Lag (=1 if observation was derived from a spatial autoregressive model) | 0.3276 | 0.4701 | 0 | 1 |
| Spatial Autocorrelation (=1 if observation was derived from a spatial error model or allowed for clustered errors within a spatially defined group) | 0.3276 | 0.4701 | 0 | 1 |
| Assessed Housing Values (=1 if observation used assessed housing values) | 0.0828 | 0.2760 | 0 | 1 |
| Not In Situ (=1 if water quality measure was based on something other than in situ measurement) | 0.3483 | 0.4772 | 0 | 1 |
| More than One WQ Variable (=1 if observation included more than one water quality variable in the primary hedonic model) | 0.2448 | 0.4307 | 0 | 1 |
| Sample Sizec (original study sample size) | 13,066.1 | 36,002.8 | 21 | 281,951 |
| Publication characteristics: | ||||
| Time Trend (year published)c (0 = 1985 to 32 = 2017) | 24.1379 | 7.5817 | 0 | 32 |
| Journal (=1 if observation was from a peer-reviewed journal article) | 0.7310 | 0.4442 | 0 | 1 |
Unweighted descriptive statistics presented for n = 290 unique elasticity estimates in meta-dataset. Variables are binary indicator variables unless otherwise noted.
Denotes reference category.
Regions of the US are defined following the US Census Bureau’s four census regions. Accessed on April 14, 2023, at: https://www2.census.gov/geo/pdfs/maps-data/maps/reference/us_regdiv.pdf.
Denotes independent variables that are continuous.
2.2.1. Environmental commodity
To synthesize the water quality hedonic literature, we begin with the environmental commodity (i.e., the type of waterbody and water quality measure examined in the primary studies). The hedonic property value studies focusing on surface water quality tend to examine lakes or reservoirs (57%) as a group, identified as our reference category, compared to the variables Estuary or River.
For tractability, we organized the 17 different objective water quality measures in the original studies into five broader categories or variables – Clarity, Nutrients, Sediment, Bacteria, and Biochemical (see Table 1). Some categories are more likely to be directly observed by homebuyers, and others could be considered proxies for perceived water quality. Our variables are intended to group measures that reflect similar water quality issues and processes, and that also may be perceived similarly by homebuyers and sellers. The reference variable Clarity is the most common measure in this meta-dataset (62%), followed by Bacteria (14%) and then Nutrients (13%).
2.2.2. Study area and housing market characteristics
Most elasticities were estimated for housing markets and waterbodies in the south (39%), followed by the northeast region, midwest and west. Indicator variables for Midwest, West, and South are included in the meta-regression (Northeast is the reference category). In addition to the broad US regions, the characteristics defining the assumed housing market in a primary study may play an important role in whether a study yields the expected result. Taylor (2017), Bishop et al. (2020), and others emphasize the need to define a market, both geographically and temporally, by the “law of one price” – meaning that within the assumed market, identical housing bundles will sell for the same price. In other words, a single hedonic equilibrium price surface should apply throughout the entire housing market, and that equilibrium is not changing over the assumed time period and spatial definition of that market.
The housing market characteristics considered in our meta-analysis are the average house price, the spatial and temporal definitions of the market (i.e., whether multiple counties were pooled together and the number of years in the study period, respectively), and whether the sample years include the 2006–2009 housing market bubble burst. Using the consumer price index, we update the Mean House Price to 2018$ in each study based on the reported year or, if not reported, the last year of the sample.6 To assess the effects of spatial definitions on elasticities, we initially identify whether observations defined a market as a subcounty area, multiple subcounties, or multiple counties.7 Most observations were at the subcounty level (73%). We combined multiple counties and multiple subcounties (Multiple Counties/Subcounties) as the variable to represent market definition (Table 2). In order to examine the temporal definition of the market, we use the number of years in the sample. Our interest lies in whether hedonic results vary when estimated from sales data over longer study periods, and hence where the “law of one price” assumption is less plausible. The variable Sample Years has noticeable variation, ranging from 1 to 24 years.
Economists have discussed the implications of the 2006–2009 housing market bubble and burst on both hedonic methods and the interpretation of results (e.g., Boyle et al., 2012; Taylor, 2017; Bishop et al., 2020). The structural shifts that occurred during the housing market bubble and subsequent burst clearly affected market equilibriums. Hedonic models that are estimated with samples that included transactions both pre- and post-the market bubble burst, and that do not properly allow the entire hedonic surface to shift with that new equilibrium in their models, violate the “law of one price” and are theoretically invalid (Bishop et al., 2020). Defining the burst from 2006 to 2009, which matches Taylor’s (2017) definition, we create a dummy variable (Bubble) that is equal to 1 if any of the sample years include the bubble burst (about 37% of the observations).
2.2.3. Methodological characteristics
Independent variables that characterize methodological decisions can also lead to variation in the significance and the expected sign of elasticity estimates. The methodological variables examined include choices of functional form, methods to account for spatial dependence, whether actual transaction prices are used (as opposed to assessed values), and decisions about the water quality data and variables included in the hedonic models.
The choice of functional form leads to different interpretations of the coefficients, but there is still relatively little guidance on the appropriate functional form assumptions for hedonic price models. Cropper et al.’s (1988) seminal study suggested that simpler functional forms (e.g., log-linear) outperform more complex models in the presence of omitted variables, but more recently Kuminoff et al. (2010) found that flexible functional forms may perform better when combined with spatial and temporal fixed effects, and quasi-experimental methods. Otherwise, the only firm guidance is that a linear specification is generally not theoretically appropriate (Bockstael and McConnell, 2007; Taylor, 2017; Bishop et al., 2020). Most observations in the meta-data are based on a double-log specification, but linear-log and log-linear are also commonly used. The fewest observations come from linear and log-quadratic specifications (Table 2). We only include the variable Linear in our later meta-regressions because this is the only specification where there is clear consensus that it goes against best practices.
Spatial dependence occurs when primary study observations are correlated with geographically nearby observations. Depending on the nature of the spatial correlation, such dependence can lead to less precise, and even biased estimates from the primary study regression models (LeSage and Pace, 2009). Spatial dependence is a long-standing concern in the hedonic property value literature (Kuminoff et al., 2010; Guignet and Lee, 2021). A variety of approaches have emerged to address the issue, such as spatial econometric specifications (Anselin and Lozano-Gracia, 2009), quasi-experimental methods (Parmeter and Pope, 2013), and spatial fixed effects (Taylor, 2017; Guignet and Lee, 2021). With the exception of Olden and Tamayo (2014), who use an instrumental variable approach to address endogeneity concerns, no studies in our meta-data utilized quasi-experimental methods (e.g., difference-in-difference, regression discontinuity). About 41% of the observations did not use any approach to explicitly address spatial dependence (see Table 2), while 170 observations (59%) used some combination of spatial fixed effects, spatial lag models, and/or approaches to account for spatial autocorrelation.
Eighty-four observations were derived from models that included spatial fixed effects (e.g., neighborhood and watershed, town, city, or lake level). While no observation was based only on a spatial autoregressive model, which includes a spatial lag of price (LeSage and Pace, 2009), 95 observations were estimated from these models in combination with the other approaches. In addition, 95 observations accounted for spatial autocorrelation either through a formal spatial error model (LeSage and Pace, 2009) or allowing for clustered errors within a spatially defined group.8 The variables Spatial Fixed Effects, Spatial Lag, and Spatial Autocorrelation are compared to the reference No Spatial Method in our models.
Sales price data have become more accessible, either through private companies (e.g., CoreLogic, Zillow) or directly from county and state property assessor offices. However, data of assessed or predicted property values are sometimes easier to acquire and are available for a larger sample of homes. Although more comprehensive by not just reflecting homes that are sold, assessed or predicted values do not directly reflect market transactions, and hence revealed preferences. As identified by Bishop et al. (2020), the assessed values may have measurement error that in turn could affect the results from subsequent hedonic models. At the same time, Smith and Huang (1993) found that the use of actual sales prices reduced the likelihood of finding a significant and theoretically consistent elasticity. They suggested that this may occur because of higher variability or noise in actual sales data. Most observations in our meta-dataset are based on actual sales prices, but roughly 8% of the observations are based on assessed values (Assessed Housing Values = 1).
The hedonic literature uses a variety of approaches for acquiring measures of water quality, including: in situ measurements, spatial interpolation, model prediction, and satellite imagery (e.g., remote sensing). Most of the observations in the meta-data are from studies that used in situ measurement. Water quality measures based on spatial interpolated data were the second most common, followed by predicted measurements from water quality models. A relatively new approach for hedonic models that will certainly become increasingly common is the use of satellite-based measures (e.g., Wolf and Kemp, 2021; Zhang et al., 2022). During the period of our meta-data, however, Horsch and Lewis (2009) are the only ones to use water quality measures based on remote sensing data. We represent these approaches in the meta-regressions with the variable, Not In Situ.
Another methodological decision is how many water quality variables to include in the primary hedonic model. For our study, most observations included just one water quality variable. The remaining 24% used up to seven water quality variables in a single model, with most using either two or five (e.g., Bin and Czajkowski, 2013; Netusil et al., 2014; Walsh and Milon, 2016). We represent this decision in our meta-analysis with an indicator variable – More than One WQ Variable. In their meta-analysis of hedonic studies on air pollution, Smith and Huang (1993) found that as the number of air pollutant variables included in the hedonic model increased, the results were less likely to yield a statistically significant relationship with house prices.
Finally, studies with larger sample sizes will yield more precise estimates, all else constant. And so, if there is in fact a true effect, then as sample size increases, a primary study would be more likely to yield a statistically significant estimate of the expected sign. To account for this effect, we include the square root of the sample size (Sample Size) as an independent variable (see Stanley, 2001; Card et al., 2010; Wehkamp et al., 2018). The sample sizes across meta-observations range from just 21 to over 281,000 home transactions.
2.2.4. Publication characteristics
In addition to the study choices described above, we want to control for two publication characteristics. The first is a linear time trend variable that reflects the year of publication. As seen in Table 2, the variable Time Trend ranges 0 = 1985 through 32 = 2017. Such a trend variable may partially capture changes in methods, data availability and quality (e.g., water quality monitoring), the publication process and selection criteria, preferences, and perceptions of water quality over time that may affect significance and theoretical consistency of the primary study estimates. To complete the set of independent variables, we differentiate peer-reviewed journal articles from book chapters and gray literature with the dummy variable, Journal. This variable helps account for study quality, at least partially, because the peer-review process will generally result in improved methodologies and screen out subpar studies. On the other hand, this variable may also help account for possible publication selection bias. All else constant, reviewers and editors may be more likely to accept a study where a statistically significant effect is found, compared to a study yielding null results. We have attempted to minimize the publication selection bias through our approach for identifying relevant studies, by casting a wide net and identifying studies that are and are not published in the peer-reviewed literature.9 Following earlier studies, both publication variables also should help control for potential selection biases (e.g., Smith and Huang, 1993; Wehkamp et al., 2018; Ohlendorf et al., 2021).
3. Meta-regression analysis
Meta-regression analysis allows us to investigate how researcher decisions (x), such as the commodity and housing market to analyze, data choices, and methodological assumptions, influence the subsequent hedonic results. Our interest is in whether a primary study result is significant and of the expected sign based on intuition and theory – i.e., . A meta-regression model can estimate the probability of as: , where denotes a cumulative distribution function. Estimates of the coefficients directly align with our study objectives by revealing how attributes of the commodity, housing market, and methodological choices lead to variation in the significance and expected sign of the estimated property value effects.
We start with the framework for Generalized Linear Models (GLM, McCullagh and Nelder, 1989; Wilson and Lorenz, 2015) where the dependent variable does not have to follow a normal distribution. Many common models fit under the GLM framework, and it provides an approach for estimating a linear relationship even if a dependent variable has a nonlinear relationship with its independent variables. An important assumption for the GLM is that all observations are independent. Because our meta-data sometimes contains observations from the same study or dataset, we use an extension of the GLM called the Generalized Estimating Equation which can account for the correlation within clusters (GEE; see Liang and Zeger, 1986; Zeger and Liang, 1986; Cameron and Miller, 2011).
Following Guignet et al. (2022), clusters are defined as unique study and housing market combinations, leading to a total of J = 98 clusters in the meta-data. Each cluster has a total number of observations defined as . is the binary outcome variable denoting whether the corresponding primary study elasticity estimate is significant and of the expected sign for observation in cluster . We define as the probability that is equal to 1 and define a function that connects to the linear predictor variables . The inverse of this function defines the link function. For this study, the function is the standard normal cumulative distribution function for the population-averaged probit model (Cameron and Miller, 2011).
| (1) |
Estimating a standard probit model that ignores the clustered structure of our meta-data provides a simple approach for estimating , provided that cluster-robust standard errors are also estimated (Cameron and Miller, 2011). However, other approaches exist to address the clustered nature of our data.
One approach is to estimate a cluster-specific model that uses the standard probit but adds a cluster-specific variable, , such that
| (2) |
where can be estimated as a random or fixed effect (Cameron and Miller, 2011). However, we are less interested in results for specific clusters and more interested in the average, or population, effects of hedonic study choices on significance and theoretical consistency.
The GEE, using a quasi-likelihood methodology that requires few assumptions about the distribution of , provides greater flexibility in identifying the correlation structure within clusters, and provides population-averaged results (Liang and Zeger, 1986; Zeger and Liang, 1986; Cameron and Miller, 2011). The family of GEE models have rarely been used in environmental economics, but there are a few examples (e.g., Johnston et al., 2002; King and Anderson, 2004). With GEE models, a cluster-specific variable is not specified (as in Eq. (2)), so we estimate (Eq. (1)), but do not ignore . Instead, expectations are defined for the jth cluster
| (3) |
where is the marginal expectation of . Using the quasi-likelihood method, the set of GEE parameters, , solves (Pendergast et al., 1996; Cameron and Miller, 2011):
| (4) |
If this were the GLM with independent observations within a cluster, we would have the variance matrix , where is a diagonal matrix of variances of as the jth diagonal element (Liang and Zeger, 1986; Pendergast et al., 1996).
Unlike the GLM, Liang and Zeger (1986) broadened the choices of correlation possibilities within clusters with , defined as the working correlation matrix. is again the diagonal matrix of variances and is a scale parameter.
| (5) |
The matrix for cluster is what differentiates the GEE model from the GLM (Pendergast et al., 1996).
Although no specific approach exists for identifying the correct correlation structure, a number of choices can be tested (Zorn, 2001). The working correlation matrices typically used include: independence, exchangeable, unstructured, and user-defined matrices (see Wilson and Lorenz, 2015). The independence correlation matrix assumes observations within the same cluster are not correlated. The exchangeable matrix assumes observations within the same cluster have the same correlation. If an unstructured matrix is chosen, each pairwise correlation is estimated, but having too many observations in a cluster or having unbalanced clusters can cause problems with the model (e.g., Shults et al., 2009). The more representative the working correlation structure is of the data, the more efficient the estimators. An incorrect choice of the working correlation structure does not affect the asymptotic consistency of the estimators as J→∞, but it can affect the consistency of the variance estimate (Zorn, 2001). Therefore, a cluster-robust estimate of the variance-covariance matrix is almost always recommended because it is consistent as long as J→∞ is met (Liang and Zeger, 1986; Cameron and Miller, 2011). A potential limitation of the GEE, then, is when the number of clusters is small (Ballinger, 2004).
Another potential limitation of the GEE is testing model fit (Zorn, 2001; Ballinger, 2004). Because the GEE uses quasi-likelihood, and not maximum likelihood like the GLM, an alternative approach for evaluating relative model performance is needed. Pan (2001) developed the quasi-likelihood for independence criterion (QIC) which is similar to the Akaike Information Criterion (see also Hardin and Hilbe, 2003). The smallest QIC can help identify the appropriate working correlation structure and best model fit. A simplification of the QIC, the QICu – which substitutes in a penalty for the number of parameters – can also be used, but only to identify the appropriate set of variables (Pan, 2001). We use a probit link function and test independence and exchangeable working correlation matrices using the QIC. Once the appropriate correlation structure is identified, we choose the preferred set of variables using the QIC and QICu.
As described earlier, is the probability that is equal to one. Similar to Rosenberger and Johnston (2009), the linear predictor variables described above can be divided into four vectors, , , , and , along with the corresponding coefficients, , so that we have the function:
| (6) |
The vector denotes variables describing the commodity (i.e., type of waterbody and water quality measure). The vector represents market characteristics such as the average home price, study region and spatial and temporal definitions of the market. The vector represents methodological choices in the primary study model (e.g., use of assessed housing values, functional form, accounting for spatial dependence, approach for acquiring measures of water quality). The last vector denotes the publication characteristics, type of publication and time trend for the year published. Estimates of , , ), and will provide insight as to how primary study choices regarding the environmental commodity, housing market, methodology, and publication characteristics, respectively, affect the probability that a hedonic study yields results that are statistically significant and consistent with expectations. Table 2 displays the variables and definitions for each category.
4. Results
4.1. Probit Generalized Estimating Equation results
We estimate separate probit GEE meta-regressions for the independence and exchangeable correlation structures using SAS 9.4 (SAS Institute 2013). Our first GEE model tests independence correlation structure which assumes there is no correlation within each cluster but estimates cluster-robust standard errors. We use the 98 unique study-by-housing market combinations to define the clusters across the 290 observations. The results are presented in Table 3. Model (1) includes the vector qij that represents the variables describing the type of water body and water quality measures examined, but none of the variables are significant.
Table 3.
GEE probit meta-regression results (independence correlation structure).
| Empty Cell | Variable | (1) | (2) | (3) | (4) | (5) |
|---|---|---|---|---|---|---|
| Intercept | 0.16 | 0.18 | 0.44 | 0.58 | 0.92 | |
| (0.204) | (0.253) | (0.365) | (0.444) | (0.708) | ||
| Environmental commodity variables | ||||||
| River | −0.25 | 0.06 | 0.21 | 3.41*** | 3.49*** | |
| (0.497) | (0.651) | (0.630) | (1.186) | (1.235) | ||
| Estuary | −0.39 | −1.62*** | −1.05** | −0.97 | −1.01 | |
| (0.340) | (0.359) | (0.501) | (0.741) | (0.765) | ||
| Nutrients | 0.46 | 0.44 | 0.90 | 2.27*** | 2.06*** | |
| (0.472) | (0.369) | (0.651) | (0.798) | (0.769) | ||
| Sediment | −0.49 | −0.26 | −0.78 | −0.62 | −0.56 | |
| (0.577) | (0.613) | (0.544) | (0.653) | (0.644) | ||
| Bacteria | 0.33 | 0.43 | 0.09 | 0.38 | 0.39 | |
| (0.641) | (0.515) | (0.394) | (0.268) | (0.315) | ||
| Biochemical | −0.64 | −0.50 | −0.96 | −0.24 | −0.22 | |
| (0.605) | (0.629) | (0.806) | (0.684) | (0.697) | ||
| Study area and market characteristics | ||||||
| Midwest | −0.35 | 0.11 | −0.48 | −0.85* | ||
| (0.405) | (0.447) | (0.628) | (0.505) | |||
| West | −0.59 | −0.21 | −1.97*** | −2.16*** | ||
| (0.595) | (0.590) | (0.614) | (0.721) | |||
| South | 1.35*** | 1.51*** | 0.17 | 0.16 | ||
| (0.355) | (0.524) | (0.700) | (0.752) | |||
| Mean House Price (thousands, 2018$) | −1.00E-04 | 1.90E-03** | 1.90E-03** | |||
| (0.001) | (0.001) | (0.001) | ||||
| Sample Years | −0.08** | −0.15** | −0.18** | |||
| (0.040) | (0.061) | (0.073) | ||||
| Multiple Counties/Subcounties | 0.48 | 0.36 | 0.39 | |||
| (0.405) | (0.477) | (0.452) | ||||
| Methodological variables | ||||||
| Linear | 0.10 | 0.33 | ||||
| (0.346) | (0.312) | |||||
| Spatial Fixed Effects | −1.35*** | −1.34*** | ||||
| (0.380) | (0.376) | |||||
| Spatial Lag | −0.73** | −0.74** | ||||
| (0.308) | (0.319) | |||||
| Spatial Autocorrelation | −0.71** | −0.67** | ||||
| (0.339) | (0.335) | |||||
| Assessed Housing Values | 1.90*** | 2.02*** | ||||
| (0.639) | (0.769) | |||||
| Not In Situ | 1.88*** | 1.97*** | ||||
| (0.455) | (0.502) | |||||
| More than One WQ Variable | −1.29** | −1.29** | ||||
| (0.568) | (0.571) | |||||
| Sample Size (square root) | 0.01*** | 0.01*** | ||||
| (0.002) | (0.002) | |||||
| Publication characteristics | ||||||
| Time Trend (year published) | 0.01 | |||||
| (0.031) | ||||||
| Journal | −0.46 | |||||
| (0.603) | ||||||
| Observations | 290 | 290 | 290 | 290 | 290 | |
| QIC | 436.12 | 398.19 | 391.38 | 334.81 | 340.95 | |
| QICu | 395.99 | 369.06 | 361.42 | 318.40 | 321.37 | |
Dependent variable: Y based on p < 0.05.
p < 0.01
p < 0.05
p < 0.1.
Cluster-robust standard errors in parentheses; clustered according to the J = 98 study-housing market combinations.
Model (2) adds variables denoting the study region of the US. For studies that estimated hedonic models for estuaries, the coefficient is now negative and significant. This suggests that compared to lakes/reservoirs, hedonic studies of estuaries are less likely to yield statistically significant results with the expected sign. Studies of waterbodies and housing markets in the south tend to be more likely to yield significant and theoretically expected results, relative to studies in the northeast region (the omitted category). None of the variables representing the categories of water quality are statistically significant, suggesting that, at least in this model, hedonic studies of the various water quality categories are equally as likely to yield the expected result (or not), all else constant.
Model (3) in Table 3 adds the remaining market definition variables from vector mij. The previous findings regarding estuaries and the south region remain robust, as does the finding that the types of water quality variable are statistically insignificant. The market characteristic variables – Mean House Price and Multiple Counties/Subcounties – are insignificant. However, Sample Years is significant and negative, suggesting that as the study period (and hence the duration of the assumed hedonic equilibrium) increases in length, the likelihood of the estimated elasticity being insignificant and/or theoretically inconsistent increases. This result is robust in subsequent meta-regression models that control for methodological features of the primary study.
Model (4) in Table 3 adds the vector representing methodological choices. We see large variability in the results. For example, the coefficient corresponding to River is now positive and significant, suggesting that hedonic studies examining rivers are more likely to yield the expected results, all else constant. The variable Nutrients is also now significant, suggesting that studies examining the impact of nutrient pollution on house prices have a higher likelihood of yielding an elasticity estimate that is significant and of the expected sign, compared to studies of water clarity (the omitted category). In Model (4), the evidence suggests that hedonic studies of waterbodies and housing markets in the west are less likely to yield the expected results (compared to studies of the northeast). In addition to Sample Years being negative and significant, another market characteristic, Mean House Price, is positive and significant – perhaps suggesting that studies of higher-end housing markets are more likely to find the expected results.
Seven of the eight methodological variables in Model (4) are significant, emphasizing the importance of primary researchers’ data decisions and modeling assumptions. For example, not using in situ water quality data (Not In Situ) and not using actual sales prices (Assessed Housing Values) lead to a higher probability of a study yielding statistically significant results that are of the expected sign. These methodological choices may reduce variability in the data, possibly facilitating more precise estimates in the primary hedonic studies (Smith and Huang, 1993). We emphasize, however, that finding the expected result does not necessarily imply the correct result. Including more than one water quality variable in the model leads to a higher likelihood that the elasticity will be insignificant and/or have an unexpected sign. This is also in line with Smith and Huang’s (1993) meta-analysis of hedonic studies on air quality. Our initial modeling approach for examining the role of controlling for spatial dependence on primary study results was to include a dummy variable that was equal to 1 if any spatial modeling approach was used. Because the coefficient was negative and significant, we were interested to see how the different approaches for modeling spatial dependence (i.e., Spatial Fixed Effects, Spatial Lag, and Spatial Autocorrelation) affect significance and the sign of the elasticity estimates. Somewhat surprisingly, the corresponding coefficients for all variables representing spatial dependence modeling approaches are significant and negative. Again, this suggests that studies are less likely to find the expected result if they control for spatial dependence. Sample Size is positive and significant, suggesting that the larger the sample size, the more likely it is that a study will yield statistically significant results that are of the expected sign. This is again consistent with Smith and Huang (1993).
Comparing Model (4) to Model (5), we see very little change in the results. The coefficients on the added publication time trend and Journal variables in Model (5) are insignificant, suggesting that, all else constant, the likelihood of hedonic studies yielding significant results of the expected sign is not different for peer-reviewed journal articles compared to other publication types, and has not changed over time. Although not perfect, perhaps this alleviates some concern regarding potential publication biases in this branch of the hedonic literature.
The same five models are re-estimated using the probit link function and exchangeable correlation structure (Table S1 in online supporting information). When comparing results that use different correlation structures, the model yielding the smallest QIC is the one that best fits the meta-data. Across the board, the independence correlation structure models in Table 3 yield the lowest QIC.10 When identifying the most appropriate model within a correlation structure, the smallest QICu can also be used. For those models using the independence structure, Model (4) (in Table 3) appears to best fit the meta-data.
Although not reported, we also ran Models (3), (4), and (5) with the dummy variable equal to 1 if any of the sample years included the housing market bubble burst (2006–2009). The results were very similar to those already presented in Table 3, but Bubble was statistically insignificant.
4.2. Robustness checks
Robustness checks on our results for an elasticity that is significant at the 5% level can be found in the online supplementary material. The robustness models use alternative dependent variable definitions. The first is a more stringent definition, where only if the elasticity was significant with a p-value less than 0.01 and had a sign consistent with economic theory, and the second is less stringent, where if the primary study elasticity estimate is of the expected sign and statistically significant based on a p-value less than 0.10. For the more stringent dependent variable we have 112 observations where , and for the less stringent dependent variable we have 173 observations where . These can be compared to our main dependent variable definition, where 151 observations had (see Table S2 for summary statistics). However, we drop the robustness model based on the 10% critical significance level because two of the independent variables, Nutrients and Assessed Housing Values, become nearly perfect predictors.
We identify the results from Model 4 in Table 3 as most appropriate and compare Y to Y0.01 (see Table S3 in online supplementary material). Out of the eight methodological variables, only Linear, Spatial Lag, and Spatial Autocorrelation differ. For the study area and market characteristics variables, only Sample Years is significant across both models. For the environmental commodity variables, the results are not robust across different definitions of the dependent variable.
5. Discussion
The primary objective of this study was to answer the following question:
What does the hedonic literature examining surface water quality generally reveal about how the type of commodity, market characteristics, methodological decisions, and publication characteristics affect the significance and theoretical consistency of the estimated property value impacts?
The results from the GEE meta-regression models provide evidence that several of the definitions and decisions made in primary studies do affect the estimated relationship between water quality and property value impacts. In particular, our meta-analysis indicates that methodological choices play a critical role in whether a hedonic study will produce results that suggest that water quality impacts home values in the expected way.
For many of the independent variables, we were unsure a priori as to what the estimated impact would be on the probability of the expected price effect. For those variables where we were able to hypothesize the effect, we did not always find the expected result. Smith and Huang (1993) hypothesized and confirmed that both linear specifications and hedonic models that included more than one air pollution measure would lead to insignificant or inconsistent results. In contrast, in the context of water quality we find a null effect with respect to assuming a linear model; although this may be due to the limited number of studies assuming this subpar functional form. On the other hand, when more than one water quality measure was included in a hedonic model, we find a negative and significant result, which agrees with Smith and Huang’s (1993) meta-analysis.
When considering the commodity definition (i.e., waterbody type and water quality category), we find that the results are somewhat sensitive across the different meta-regressions. Limited observations for some waterbody types and specific water quality measures may be one reason. Publication selection bias also may play a role in the results related to the waterbody types. For example, holding all else constant, a study about a river may only be published if it has a significant result.
One market factor that we thought might be important was the 2006–2009 housing market bubble burst. Although we find no significant effect, how housing market expansions and contractions might impact implicit price estimates of interest should be examined more closely in future research.
Generally, methodological choices appear to have a very important and consistent role in determining the estimated relationship between water quality and housing prices. The use of assessed housing values and predicted or modeled water quality data lead to a similar, higher likelihood of finding a significant estimated price impact that is of the expected sign. Actual housing prices and in situ measurements may have more random variation in the data, which could obscure the true effects researchers are trying to estimate. On the other hand, assessed values do not directly reflect market behavior, and modeled water quality values can introduce prediction error.
Nonetheless, assessed values and predicted water quality measurements may sometimes be necessary, especially with recent trends in the hedonic property value literature to analyze increasingly large, even national, study areas (e.g., Moore et al., 2020; Zhang et al., 2022; Mamun et al., 2023). Some states do not disclose or only provide limited housing market data (e.g., Nolte et al., 2023), and in such cases assessed values may be needed to fill data gaps. In addition, finding consistent water quality measures across the country is difficult, meaning that studies going forward will likely rely heavily on modeled water quality measures or data generated from algorithms and satellite imagery, rather than in situ measurements. Although best practices for hedonic modeling should be followed (Bishop et al., 2020), data limitations may require solutions for missing observations. When feasible, primary study researchers could consider comparing assessed housing values with market prices or comparing in situ water quality measurements with predicted or modeled water quality data (Wolf and Kemp, 2021). Such within-study comparisons would provide a firmer understanding of whether such data choices affect the estimated property value effects, as our meta-analysis suggests.
Given the importance of trying to minimize spatially correlated omitted variable bias in hedonic property value models, we paid particular attention to methodological choices meant to account for spatial dependence – e.g., spatial econometric specifications (Anselin and Lozano-Gracia, 2009), quasi-experimental methods (Parmeter and Pope, 2013), and spatial fixed effects (Taylor, 2017; Guignet and Lee, 2021). As noted above, our meta-dataset did not include some of the more novel types of analyses that account for spatial dependence. We find that controlling for spatial dependence is actually associated with a decrease in the likelihood of a primary study yielding a significant result of the expected sign. Although speculative, one possible explanation is that the true water quality price effects are often relatively small and controlling for spatially correlated confounders better identifies that near zero effect. On the other hand, if the role of spatial dependence in the true data generating process is minimal, then spatial fixed effects and other spatial modeling approaches may be over-parameterizing the models, making it less likely that a study would identify a significant effect if there is one. It is also possible that some approaches to address spatial dependence, such as spatial fixed effects, may be absorbing much of the price variation of interest (Abbot and Klaiber, 2011). Variation in housing prices could be more due to spatial, rather than temporal, variation in water quality (Kung et al., 2022). In such cases, it is more difficult to isolate variation due to water quality from the spatially correlated omitted variables; making it less likely that one would find the expected result.
For the 120 observations that were derived from models where no spatial methods were implemented, 78 (65%) were significant and matched expectations. It is possible that no spatial dependence was found in some of these cases; in such instances finding the expected result is reasonable. Two studies (out of the 15 that do not use spatial methods in some or all of their models) test for spatial autocorrelation and do not find it in their data (Feather et al., 1992; Liao et al., 2016). Ten studies do not mention spatial dependence in their papers, suggesting that the elasticity estimates could potentially be biased, or at least inefficient depending on the nature of the spatial dependence (e.g., Nelson, 2008; Chi and Zhu, 2020). The last three studies use spatial methods in some of their observations as a comparison to observations that do not. The negative sign in our meta-regressions that corresponds to estimates from models that did account for spatial autocorrelation suggests that standard errors could be underestimated when primary studies do not account for spatial dependence in the hedonic model. Testing for spatial dependence and using spatial methods, when appropriate, is generally considered best practice.
Estimating the effects of water quality on house prices can be challenging because researchers are attempting to statistically isolate relatively small effects from numerous other correlated factors that affect house prices. As such, a greater sample size is generally desirable in order to obtain more precise estimates. Consistent with this notion, we find that sample size is associated with an increased likelihood of estimating a statistically significant price effect that is of the expected sign. At the same time, obtaining a larger sample by expanding the study period, in and of itself, may not be desirable. All else constant, our meta-regression results suggest that an increase in sample years is associated with a decrease in the probability of estimating a significant and theoretically consistent result. This finding is in line with research emphasizing the importance of accounting for shifts in the hedonic price equilibrium over time (e.g., Kuminoff and Pope, 2014; Bishop et al., 2020; Banzhaf, 2021).
6. Conclusion
Hedonic property value methods represent a large and growing branch of the nonmarket valuation literature. As we move forward and continue to apply and advance the methodology, it is important to look back and take stock on what has been done and the empirical implications of past analyses and modeling decisions. Our meta-analysis attempts to do just that, by systematically and quantitatively reviewing the hedonic property value literature on the price effects of water quality.
With the intention of providing information to assess hedonic models and the estimated price effects, we highlight three key points. First, our meta-regression results are limited by the existing literature, so we encourage researchers to fill in areas where the literature is scarce. In the context of the US, more hedonic property value studies examining water quality in the west and midwest regions, and for rivers and estuaries, are needed. Similarly, more studies including different water quality measures will allow separate meta-regressions to examine whether the impact of methodological choices on estimated property value effects vary across those measures.
Second, we demonstrate, across all meta-regressions, that study design choices and modeling assumptions have a large influence in determining the estimated price effects. To help assess the methodological decisions made in past studies, our study included observations that did not follow current best practices. As more studies are published or made available as working papers, more novel techniques such as quasi-experiments should be added to the meta-dataset.
The fact that methodological choices can have a greater and more systematic influence on the results than characteristics of the housing market and commodity itself is eye-opening and leads to our third and final point. We unequivocally recommend that researchers continue to follow contemporary guidance in hedonic modeling (Bishop et al., 2020). Our meta-regression results suggest that practices currently considered to be subpar in many applications, like using assessed housing values and not accounting for spatial dependence, may increase the tendency for a hedonic analysis to yield the hypothesized result. We caution that the expected result is not necessarily the correct result, and researchers should continue to assess the robustness of their findings against their study design and methodological choices.
Supplementary Material
Acknowledgments
The views expressed in this article are those of the authors and do not necessarily reflect the views or policies of the US Environmental Protection Agency (US EPA). The research described in this article has been funded wholly or in part by the US EPA under contract EP–C–13–039 to Abt Associates. Any mention of trade names, products, or services does not imply an endorsement by the US Government or the US EPA. The authors declare they have no actual or potential competing financial or other conflicts of interests. The authors thank Elena Besedin, Joel Corona, Olivia Griot, Ben Holland, Matthew Ranson, and Patrick Walsh for helpful feedback early in the development of this project. We also acknowledge Marisa Mazzotta, Hale Thurston, and anonymous reviewers for helpful comments to improve the interpretation of our results and the paper.
Footnotes
The full meta-dataset is publicly accessible at US EPA’s Environmental Dataset Gateway: “Meta-dataset for property values and water quality.” Access at https://doi.org/10.23719/1518489.
For this study, the elasticity represents the percent change in home price for a 1% change in water quality.
A primary study was dropped when the necessary information to infer the price elasticity or a relevant independent variable for our meta-regression models was not provided. We also drop observations from Walsh et al. (2017) and Guignet et al. (2017) that were based on Secchi disk depth, because such observations are redundant with the original measures used in those primary studies – light attenuation (which are maintained for our meta-analysis). We also drop ten observations from studies that analyzed the effect of pH on home prices. We exclude pH because of waterbody-specific nonlinearities associated with what is a degradation versus an improvement in water quality, and because the logarithmic scale often resulted in extreme outliers.
Details can be found in Guignet et al. (2022).
Although a p-value of 0.05 tends to be standard convention, we do examine the robustness of our results to alternative cutoffs for determining statistical significance. See section 4.2 and Supplementary Table 3 for details.
We used the last year of the sample for 98 of the 290 observations.
No study in the meta-dataset used all property sales from a single county. Hedonic property value studies examining water quality almost always focus on homes within some distance of the waterbody and do not utilize the entire set of transactions in a county. We characterize such market definitions as subcounty (Single Subcounty).
Observations from two studies in the meta-dataset were originally coded as using all three spatial dependence modeling approaches. For this methodological meta-analysis, after reviewing the papers, we recoded Netusil et al. (2014) to not using spatial autocorrelation approaches, and Liu et al. (2017) was recoded as not using spatial lag models (depending on the specific observation). Olden and Tamayo (2014) was originally coded as not using spatial methods. This study, however, uses two-stage least squares and instrumental variables to deal with endogeneity. Although not a spatial econometric approach per se, it does set out to address the same spatially correlated omitted variable issue, and so we recoded this study to using spatial methods (i.e., No Spatial Method = 0), but leave the spatial fixed effects, spatial lag, and spatial autocorrelation variables in the meta-regressions equal to 0.
As the literature continues to grow, it may be important to focus on whether a theoretically consistent and significant relationship exists, or if the translation of the elasticities to a binary dependent variable artificially creates the relationship, as suggested by Stanley and Doucouliagos (2012: p. 16), through the publication selection process.
The exchangeable correlation structure model has the same significant moderator variables, but some of the significance levels differ slightly with the results in Model (4) in Table 3.
CRediT authorship contribution statement
Matthew T. Heberling: Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Writing - original draft, Writing - review & editing. Dennis Guignet: Conceptualization, Investigation, Methodology, Writing - review & editing. Michael Papenfus: Conceptualization, Methodology, Writing - review & editing.
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Appendix A. Supplementary data
Multimedia component 1.
Data availability
The authors have shared the link to their data in the manuscript.
References
- Abbot JK, Klaiber HA, 2011. An embarrassment of riches: Confronting the omitted variable bias and multi-scale capitalization in hedonic price models. Rev. Econ. Stat 93 (4), 1331–1342. 10.1162/REST_a_00134. [DOI] [Google Scholar]
- Anselin L, Lozano-Gracia N, 2009. Spatial hedonic models. In: Mills TC, Patterson K (Eds.), Palgrave Handbook of Econometrics Palgrave Macmillan, London, United Kingdom, pp. 1213–1250. [Google Scholar]
- Ara S, 2007. The Influence of Water Quality on the Demand for Residential Development Around Lake Erie. Doctoral Dissertation The Ohio State University. [Google Scholar]
- Ballinger GA, 2004. Using generalized estimating equations for longitudinal data analysis. Organ. Res. Methods 7 (2), 127–150. 10.1177/1094428104263672. [DOI] [Google Scholar]
- Banzhaf HS, 2021. Difference-in-Differences hedonics. J. Polit. Econ 129 (8), 2385–2414. 10.1086/714442. [DOI] [Google Scholar]
- Bejranonda S, Hitzhusen FJ, Hite D, 1999. Agricultural sedimentation impacts on lakeside property values. Agric. Resour. Econ. Rev 28 (2), 208–218. 10.1017/S1068280500008200. [DOI] [Google Scholar]
- Bin O, Czajkowski J, 2013. The impact of technical and non-technical measures of water quality on coastal waterfront property values in South Florida. Mar. Resour. Econ 28 (1), 43–63. 10.5950/0738-1360-28.1.43. [DOI] [Google Scholar]
- Bishop KC, Kuminoff NV, Banzhaf S, Boyle KJ, von Gravenitz K, Pope JC, Smith VK, Timmins CD, 2020. Best practices in using hedonic property value models for welfare measurement. Rev. Environ. Econ. Pol 14 (2), 260–281. 10.1093/reep/reaa001. [DOI] [Google Scholar]
- Bockstael NE, McConnell KE, 2007. Environmental and Resource Valuation with Revealed Preferences: A Theoretical Guide to Empirical Methods Springer, Dordrecht, The Netherlands. [Google Scholar]
- Boyle M, Kiel K, 2001. A survey of house price hedonic studies of the impact of environmental externalities. J. R. Estate Lit 9 (2), 117–144. 10.1080/10835547.2001.12090098. [DOI] [Google Scholar]
- Boyle KJ, Poor PJ, Taylor LO, 1999. Estimating the demand for protecting freshwater lakes from eutrophication. Am. J. Agric. Econ 81 (5), 1118–1122. 10.2307/1244094. [DOI] [Google Scholar]
- Boyle K, Lewis L, Pope J, Zabel J, 2012. Valuation in a bubble: hedonic modeling pre-and post-housing market collapse. AERE Newsletter 32 (2), 24–31. [Google Scholar]
- Boyle KJ, Taylor LO, 2001. Does the measurement of property and structural characteristics affect estimated implicit prices for environmental amenities in a hedonic model? J. R. Estate Finance Econ 22 (2–3), 303–318. 10.1023/A:1007855901029. [DOI] [Google Scholar]
- Boyle KJ, Wooldridge JM, 2018. Understanding error structures and exploiting panel data in meta-analytic benefit transfers. Environ. Resour. Econ 69 (3), 609–635. 10.1007/s10640-017-0211-y. [DOI] [Google Scholar]
- Brashares E, 1985. Estimating the Instream Value of Lake Water Quality in Southeast Michigan. Doctoral Dissertation, University of Michigan. [Google Scholar]
- Cameron AC, Miller DL, 2011. Robust inference with clustered data. In: Ullah A, Giles DE (Eds.), Handbook of Empirical Economics and Finance Chapman and Hall/CRC, Boca Raton, FL, pp. 1–28. [Google Scholar]
- Card D, Kluve J, Weber A, 2010. Active labour market policy evaluations: a meta-analysis. Econ. J 120 (548), F452–F477. [Google Scholar]
- Chi G, Zhu J, 2020. Spatial Regression Models for the Social Sciences SAGE Publications, Inc. 10.4135/9781544302096. [DOI] [Google Scholar]
- Cropper ML, Deck LB, McConnell KE, 1988. On the choice of funtional form for hedonic price functions. Rev. Econ. Stat 668–675. 10.2307/1935831. [DOI]
- Feather TD, Pettit EM, Ventikos P, 1992. Valuation of Lake Resources through Hedonic Pricing (No. IRW-92-R-8) Army Engineer Inst for Water Resources, Alexandria, VA. [Google Scholar]
- Gibbs JP, Halstead JM, Boyle KJ, Huang JC, 2002. A hedonic analysis of the effects of lake water clarity on New Hampshire lakefront properties. Agric. Resour. Econ. Rev 31 (1), 39–46. 10.1017/S1068280500003464. [DOI] [Google Scholar]
- Guignet D, Griffiths C, Klemick H, Walsh PJ, 2017. The implicit price of aquatic grasses. Mar. Resour. Econ 32 (1), 21–41. 10.1086/689201. [DOI] [Google Scholar]
- Guignet D, Heberling M, Papenfus M, Griot O, 2022. Property values, water quality, and benefit transfer: a nationwide meta-analysis. Land Econ 98 (2), 191–218. 10.3368/le.98.2.050120-0062R1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guignet D, Lee J, 2021. State of the Art of hedonic pricing. In: Oxford Research Encyclopedia of Environmental Science 10.1093/acrefore/9780199389414.013.555. [DOI] [Google Scholar]
- Hardin JW, Hilbe JM, 2003. Generalized Estimating Equations Chapman and Hall/ CRC, Boca Raton, FL. [Google Scholar]
- Havranek T, Stanley TD, Doucouliagos H, Bom P, Geyer-Klingeberg J, Iwasaki I, Reed WR, Rost K, Van Aert RCM, 2020. Reporting guidelines for meta-analysis in economics. J. Econ. Surv 34 (3), 469–475. 10.1111/joes.12363. [DOI] [Google Scholar]
- Horsch EJ, Lewis DJ, 2009. The effects of aquatic invasive species on property values: evidence from a quasi-experiment. Land Econ 85 (3), 391–409. 10.3368/le.85.3.391. [DOI] [Google Scholar]
- Hsu TI, 2000. A Hedonic Study of the Effects of Lake-Water Clarity and Aquatic Plants on Lakefront Property Prices in Vermont. Doctoral Dissertation, University of Maine. [Google Scholar]
- Johnston RJ, Besedin EY, Ranson MH, 2006a. Characterizing the effects of valuation methodology in function-based benefits transfer. Ecol. Econ 60 (2), 407–419. 10.1016/j.ecolecon.2006.03.020. [DOI] [Google Scholar]
- Johnston RJ, Ranson MH, Besedin EY, Helm EC, 2006b. What determines willingness to pay per fish? A meta-analysis of recreational fishing values. Mar. Resour. Econ 21 (1), 1–32. 10.1086/mre.21.1.42629492. [DOI] [Google Scholar]
- Johnston RJ, Swallow SK, Bauer DM, 2002. Spatial factors and stated preference values for public goods: considerations for rural land use. Land Econ 78 (4), 481–500. 10.2307/3146848. [DOI] [Google Scholar]
- Kashian R, Eiswerth ME, Skidmore M, 2006. Lake rehabilitation and the value of shoreline real estate: evidence from Delavan, Wisconsin. Rev. Reg. Stud 36 (2), 221. [Google Scholar]
- Kiel K, 2006. Environmental Contamination and house values. In: Carruthers JI, Mundy B (Eds.), Environmental Valuation: Interregional and Intraregional Perspectives Ashgate, Burlington, VT, pp. 121–146. [Google Scholar]
- King JR, Anderson CM, 2004. Marginal property tax effects of conservation easements: a Vermont case study. Am. J. Agric. Econ 86 (4), 919–932. 10.1111/j.0002-9092.2004.00643.x. [DOI] [Google Scholar]
- Krysel C, Boyer EM, Parson C, Welle P, 2003. Lakeshore Property Values and Water Quality: Evidence from Property Sales in the Mississippi Headwaters Region Minnesota Resources by the Mississippi Headwaters Board and Bemidji State University. Submitted to the Legislative Commission on. [Google Scholar]
- Kuminoff NV, Parmeter CF, Pope JC, 2010. Which hedonic models can we trust to recover the marginal willingness to pay for environmental amenities? J. Environ. Econ. Manag 60 (3), 145–160. 10.1016/j.jeem.2010.06.001. [DOI] [Google Scholar]
- Kuminoff NV, Pope JC, 2014. Do ‘capitalization effects’ for public goods reveal the Public’s willingness to pay. Int. Econ. Rev 55 (4), 1227–1250. 10.1111/iere.12088. [DOI] [Google Scholar]
- Kung M, Guignet D, Walsh P, 2022. Comparing pollution where you live and play: a hedonic analysis of enterococcus in the Long Island Sound. Mar. Resour. Econ 37 (1), 65–89. 10.1086/717265. [DOI] [Google Scholar]
- Leggett CG, Bockstael NE, 2000. Evidence of the effects of water quality on residential land prices. J. Environ. Econ. Manag 39 (2), 121–144. 10.1006/jeem.1999.1096. [DOI] [Google Scholar]
- LeSage J, Pace RK, 2009. Introduction to Spatial Econometrics Chapman and Hall/ CRC. [Google Scholar]
- Liang KY, Zeger SL, 1986. Longitudinal data analysis using generalized linear models. Biometrika 73, 13–22. 10.1093/biomet/73.1.13. [DOI] [Google Scholar]
- Liao FH, Wilhelm FM, Solomon M, 2016. The effects of ambient water quality and Eurasian watermilfoil on lakefront property values in the Coeur d’Alene area of northern Idaho, USA. Sustainability 8 (1), 44. 10.3390/su8010044. [DOI] [Google Scholar]
- Liu H, Gopalakrishnan S, Browning D, Herak P, Sivandran G, 2014. Estimating the impact of water quality on surrounding property values in Upper Big Walnut Creek Watershed in Ohio for dynamic optimal control In: 2014 AAEA Annual Meeting, July 27–29, Minneapolis, MN. [Google Scholar]
- Liu T, Opaluch JJ, Uchida E, 2017. The impact of water quality in Narragansett Bay on housing prices. Water Resour. Res 53 (8), 6454–6471. 10.1002/2016WR019606. [DOI] [Google Scholar]
- Mamun S, Castillo-Castillo A, Swedberg K, Zhang J, Boyle KJ, Cardoso D, Kling CL, Nolte C, Papenfus M, Phaneuf D, Polasky S, 2023. Valuing water quality in the United States using a national dataset on property values. Proc. Natl. Acad. Sci. USA 120 (15), e2210417120. 10.1073/pnas.2210417120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McCullagh P, Nelder JA, 1989. Generalized Linear Models, second ed. Chapman and Hall, New York. [Google Scholar]
- Michael HJ, Boyle KJ, Bouchard R, 2000. Does the measurement of environmental quality affect implicit prices estimated from hedonic models? Land Econ 72 (2), 283–298. 10.2307/3147229. [DOI] [Google Scholar]
- Moore MR, Doubek JP, Xu H, Cardinale BJ, 2020. Hedonic price estimates of lake water quality: valued attribute, instrumental variables, and ecological-economic benefits. Ecol. Econ 176, 106692 10.1016/j.ecolecon.2020.106692. [DOI] [Google Scholar]
- Nelson JP, 2008. Hedonic property value studies of Transportation noise: Aircraft and Road Traffic. In: Baranzini A, Ramirez J, Schaerer C, Thalmann P (Eds.), Hedonic Methods in Housing Markets Springer, New York, NY, pp. 57–82. 10.1007/978-0-387-76815-1_4. [DOI] [Google Scholar]
- Nelson JP, Kennedy P, 2009. The use (and abuse) of meta-analysis in environmental and natural resource economics: an assessment. Environ. Resour. Econ 42 (3), 345–377. 10.1007/s10640-008-9253-5. [DOI] [Google Scholar]
- Netusil NR, Kincaid M, Chang H, 2014. Valuing water quality in urban watersheds: a comparative analysis of Johnson Creek, Oregon, and Burnt Bridge Creek, Washington. Water Resour. Res 50 (5), 4254–4268. 10.1002/2013WR014546. [DOI] [Google Scholar]
- Nicholls S, Crompton J, 2018. A comprehensive review of the evidence of the impact of surface water quality on property values. Sustainability 10 (500), 1–30. 10.3390/su10020500. [DOI] [Google Scholar]
- Nolte C, Boyle KJ, Chaudhry A, Clapp C, Guignet D, Hennighausen H, Kushner I, Liao Y, Mamun S, Pollack A, Richardson J, 2023. Data practices for studying the impacts of environmental amenities and Hazards with nationwide property data. Land Econ 14, 2023. 10.3368/le.100.1.102122-0090R. Published online before print, June. [DOI] [Google Scholar]
- Ohlendorf N, Jakob M, Minx JC, Schroder C, Steckel JC, 2021. Distributional impacts of carbon pricing: a meta-analysis. Environ. Resour. Econ 78, 1–42. 10.1007/s10640-020-00521-1. [DOI] [Google Scholar]
- Olden JD, Tamayo M, 2014. Incentivizing the public to support invasive species management: Eurasian milfoil reduces lakefront property values. PLoS One 9 (10), e110458. 10.1371/journal.pone.0110458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pan W, 2001. Akaike’s information criterion in generalized estimating equations. Biometrics 57 (1), 120–125. 10.1111/j.0006-341X.2001.00120.x. [DOI] [PubMed] [Google Scholar]
- Parmeter CF, Pope JC, 2013. Quasi-experiments and hedonic property value methods. In: List JA, Price MK (Eds.), Handbook on Experimental Economics and the Environment Edward Elgar Publishing, Cheltenham, United Kingdom, pp. 3–66. [Google Scholar]
- Pendergast JF, Gange SJ, Newton MA, Lindstrom MJ, Palta M, Fisher MR, 1996. A survey of methods for analyzing clustered binary response data. International Statistical Review/Revue Internationale de Statistique 89–118. 10.2307/1403425. [DOI]
- Petrolia DR, Guignet D, Whitehead J, Kent C, Caulder C, Amon K, 2021. Nonmarket valuation in the environmental Protection Agency’s Regulatory process. Appl. Econ. Perspect. Pol 43 (3), 952–969. 10.1002/aepp.13106. [DOI] [Google Scholar]
- Poor PJ, Boyle KJ, Taylor LO, Bouchard R, 2001. Objective versus subjective measures of water clarity in hedonic property value models. Land Econ 77 (4), 482–493. 10.2307/3146935. [DOI] [Google Scholar]
- Poor PJ, Pessagno KL, Paul RW, 2007. Exploring the hedonic value of ambient water quality: a local watershed-based study. Ecol. Econ 60 (4), 797–806. 10.1016/j.ecolecon.2006.02.013. [DOI] [Google Scholar]
- Rosenberger RS, Johnston RJ, 2009. Selection effects in meta-analysis and benefit transfer: avoiding unintended consequences. Land Econ 85 (3), 410–428. 10.3368/le.85.3.410. [DOI] [Google Scholar]
- SAS Institute Inc, 2013. SAS® 9.4 Statements: Reference. SAS Institute Inc, Cary, NC. [Google Scholar]
- Schütt M, 2021. Systematic variation in waste site effects on residential property values: a meta-regression analysis and benefit transfer. Environ. Resour. Econ 78 (3), 381–416. 10.1007/s10640-021-00536-2. [DOI] [Google Scholar]
- Shults J, Sun W, Tu X, Kim H, Amsterdam J, Hilbe JM, Ten-Have T, 2009. A comparison of several approaches for choosing between working correlation structures in generalized estimating equation analysis of longitudinal binary data. Stat. Med 28 (18), 2338–2355. 10.1002/sim.3622. [DOI] [PubMed] [Google Scholar]
- Smith VK, Huang JC, 1993. Hedonic models and air pollution: twenty-five years and counting. Environ. Resour. Econ 3 (4), 381–394. 10.1007/BF00418818. [DOI] [Google Scholar]
- Stanley TD, 2001. Wheat from chaff: meta-analysis as quantitative literature review. J. Econ. Perspect 15 (3), 131–150. 10.1257/jep.15.3.131. [DOI] [Google Scholar]
- Stanley TD, Doucouliagos H, 2012. Meta-regression Analysis in Economics and Business Routledge, New York, NY. [Google Scholar]
- Stanley TD, Doucouliagos H, Giles M, Heckemeyer JH, Johnston RJ, Laroche P, Nelson JP, Paldam M, Poot J, Pugh G, Rosenberger RS, 2013. Meta-analysis of economics research reporting guidelines. J. Econ. Surv 27 (2), 390–394. 10.1111/joes.12008. [DOI] [Google Scholar]
- Stapler RW, Johnston RJ, 2009. Meta-analysis, benefit transfer, and methodological covariates: implications for transfer error. Environ. Resour. Econ 42 (2) 10.1007/s10640-008-9230-z. [DOI] [Google Scholar]
- Taylor LO, 2017. Hedonics. In: Champ P, Boyle K, Brown T (Eds.), A Primer on Nonmarket Valuation. The Economics of Non-market Goods and Resources Springer, Dordrecht, pp. 235–292. 10.1007/978-94-007-7104-8_7. [DOI] [Google Scholar]
- Walsh P, Griffiths C, Guignet D, Klemick H, 2017. Modeling the property price impact of water quality in 14 Chesapeake Bay Counties. Ecol. Econ 135, 103–113. 10.1016/j.ecolecon.2016.12.014. [DOI] [Google Scholar]
- Walsh PJ, Milon JW, 2016. Nutrient standards, water quality indicators, and economic benefits from water quality regulations. Environ. Resour. Econ 64 (4), 643–661. 10.1007/s10640-015-9892-2. [DOI] [Google Scholar]
- Walsh PJ, Milon JW, Scrogin DO, 2011a. The spatial extent of water quality benefits in urban housing markets. Land Econ 87 (4), 628–644. 10.3368/le.87.4.628. [DOI] [Google Scholar]
- Walsh PJ, Milon JW, Scrogin DO, 2011b. The property-price effects of abating nutrient pollutants in urban housing markets. In: Thurston HW (Ed.), Economic Incentives for Stormwater Control CRC Press - Taylor & Francis Group, LLC, Boca Raton, FL. [Google Scholar]
- Wehkamp J, Koch N, Lübbers S, Fuss S, 2018. Governance and deforestation—a meta-analysis in economics. Ecol. Econ 144, 214–227. 10.1016/j.ecolecon.2017.07.030. [DOI] [Google Scholar]
- Wilde L, Loos C, Williamson J, 2012. Pipelines and property values: an eclectic review of the literature. J. R. Estate Lit 20 (2), 245–259. 10.1080/10835547.2014.12090339. [DOI] [Google Scholar]
- Wilson JR, Lorenz KA, 2015. Modeling binary correlated responses using SAS, SPSS and R Springer; 9. 10.1007/978-3-319-23805-0. [DOI] [Google Scholar]
- Wolf D, Kemp T, 2021. Convergent validity of satellite and Secchi disk measures of water clarity in hedonic models. Land Econ 97 (1), 39–58. 10.3368/wple.97.1.0508190062R1. [DOI] [Google Scholar]
- Yoo J, Simonit S, Connors JP, Kinzig AP, Perrings C, 2014. The valuation of off-site ecosystem service flows: deforestation, erosion and the amenity value of lakes in Prescott, Arizona. Ecol. Econ 97, 74–83. 10.1016/j.ecolecon.2013.11.001. [DOI] [Google Scholar]
- Zeger SL, Liang KY, 1986. Longitudinal Data Analysis for Discrete and Continuous Outcomes. Biometrics, pp. 121–130. 10.2307/2531248. [DOI] [PubMed]
- Zhang C, Boyle KJ, 2010. The effect of an aquatic invasive species (Eurasian watermilfoil) on lakefront property values. Ecol. Econ 70 (2), 394–404. 10.1016/j.ecolecon.2010.09.011. [DOI] [Google Scholar]
- Zhang C, Boyle KJ, Kuminoff NV, 2015. Partial identification of amenity demand functions. J. Environ. Econ. Manag 71, 180–197. 10.1016/j.jeem.2015.02.002. [DOI] [Google Scholar]
- Zhang J, Phaneuf DJ, Schaeffer BA, 2022. Property values and cyanobacterial algal blooms: evidence from satellite monitoring of Inland Lakes. Ecol. Econ 199, 107481 10.1016/j.ecolecon.2022.107481. [DOI] [Google Scholar]
- Zorn CJ, 2001. Generalized estimating equation models for correlated data: a review with applications. Am. J. Polit. Sci 470–490. 10.2307/2669353. [DOI]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The authors have shared the link to their data in the manuscript.
