Abstract
Background:
Extreme ambient heat is unambiguously associated with a higher risk of illness and death. The Optum Labs Data Warehouse (OLDW), a database of medical claims from US-based patients with commercial or Medicare Advantage health insurance, has been used to quantify heat-related health impacts. Whether results for the insured subpopulation are generalizable to the broader population has, to our knowledge, not been documented. We sought to address this question, for the US population in California from 2012 to 2019.
Methods:
We examined changes in daily rates of emergency department encounters and in-patient hospitalization encounters for all-causes, heat-related outcomes, renal disease, mental/behavioral disorders, cardiovascular disease, and respiratory disease. OLDW was the source of health data for insured individuals in California, and health data for the broader population were gathered from the California Department of Health Care Access and Information. We defined extreme heat exposure as any day in a group of 2 or more days with maximum temperatures exceeding the county-specific 97.5th percentile and used a space-time-stratified case-crossover design to assess and compare the impacts of heat on health.
Results:
Average incidence rates of medical encounters differed by dataset. However, rate ratios for emergency department encounters were similar across datasets for all causes [ratio of incidence rate ratios (rIRR) = 0.989; 95% confidence interval (CI) = 0.969, 1.009], heat-related causes (rIRR = 1.080; 95% CI = 0.999, 1.168), renal disease (rIRR = 0.963; 95% CI = 0.718, 1.292), and mental health disorders (rIRR = 1.098; 95% CI = 1.004, 1.201). Rate ratios for inpatient encounters were also similar.
Conclusions:
This work presents evidence that OLDW can continue to be a resource for estimating the health impacts of extreme heat.
Keywords: Generalizability, Heat-related health impacts, Optum Labs Data Warehouse, Time-stratified case-crossover
Extremely hot days are unambiguously associated with a higher risk of illness and death.1,2 In the US, this association has been documented using various large healthcare claims datasets. Among the US Medicare population (adults 65 and older), extreme heat has been associated with increased hospital admissions for a range of diagnoses, including heat stroke, fluid and electrolyte disorders, and renal failure.3–5 Claims datasets for specific US states have been used to associate extreme heat with emergency room visits for all-cause and mental-health-related outcomes.6–8 Hess et. al.9 used a nationally representative dataset of annualized summertime emergency department (ED) visits to show the adverse impact of heat on health across the US. Bernstein et. al. (2022) used daily counts of individuals admitted to children’s hospitals in the US to show the impact of heat on adolescent health.10 Commercial insurance claims datasets, like those in the Truven Health MarketScan Research dataset or Optum Labs Data Warehouse (OLDW), have also been used to document the association between high warm-season ambient temperatures (i.e., “heat”) and risk of ED visits for a range of physical and mental health conditions.11–15
Although the conclusions of these heat and insurance claims studies are similar, studies vary in terms of exposure metrics, temporality, and the strengths and potential limitations of their health outcome data. This leads to heterogeneity in the interpretation and magnitude of effect estimates and raises several overarching questions. The common underlying population-level association between days of extreme heat and changes in human health remains unknown. This association may be difficult to discern in the absence of a nationally representative dataset. Large insurance claims datasets can stand in as proxies for a national dataset but may over- or under-represent different segments of the population. It is therefore essential to assess whether heat–health associations generated using insurance claims datasets are generalizable to the general population.
In this study, we address the question of the generalizability of heat and health associations created using commercial claims data by leveraging the overlap in space and time across two datasets of hospital-patient encounters. The first dataset—the OLDW—contains healthcare claims data for millions of individuals of all ages with commercial or Medicare advantage health insurance, representing an estimated 6.4% of the US population as of July 2015.15 Notable strengths of the OLDW include its very large size, inclusion of patients of all ages living almost anywhere in the contiguous US, geographic resolution down to the zip code level, ability to follow individuals over time, detailed information on all medical encounters, and a high refresh rate (data are considered complete with a 6-month lag) that facilitates timely event study health impact assessments.13 The principal limitation of the OLDW in the context of environmental health research is its restriction to individuals with commercial or Medicare Advantage health insurance. Individuals with health insurance typically have higher rates of healthcare utilization but are also likely healthier than individuals without health insurance16 and potentially more protected from extreme weather,17 raising the possibility that heat–health estimates derived from the OLDW may underestimate the health impacts of heat in the general population.
The second health outcomes dataset utilized in this study was hospital discharge data for the general population obtained from all licensed hospitals in California reporting data to the state’s Department of Health Care Access and Information (HCAI).18 This dataset has the benefit of containing information from both insured and noninsured individuals in the state of California and serves to represent heat–health impacts among the general population.
We used a space-time stratified case–crossover study design to compare heat–health associations estimated from the OLDW dataset residing in the state of California from 2012 to 2019 with associations estimated from the HCAI discharge dataset for the same time period. California is an apposite environment to test the generalizability of OLDW-derived heat–health associations because of the within-state heterogeneity in contextual factors, climate zones, sociodemographics, and urban/rural populations. We first compared the demographic characteristics of patients and incidence rates of ED visits and inpatient hospitalizations in the two datasets. Next, we standardized daily counts of ED visits and inpatient hospitalizations in the OLDW dataset according to the age, sex, and county distribution of the population of California for the same years.18 Finally, we calculated incidence rate ratios (IRRs) between markers of extreme heat and healthcare utilization in the OLDW and the HCAI datasets and compared IRRs via a ratio of incidence rate ratios (rIRR) to assess the generalizability of the former results.
METHODS
Study Populations
For this study, we focused on persons aged 18 years or older residing in California during the warm seasons (May 1 through September 30) from 2012 to 2019. Daily person-time at risk for individuals in the OLDW and HCAI datasets was estimated in strata by age group (18–24, 25–34, 35–44, 45–54, 55–64, 65–74, 75+), sex (male or female), and county of residence. For OLDW, we assigned the daily person-time at risk in each stratum as the average number of OLDW enrollees in each stratum during that day’s month. The population reflected in the HCAI hospital record data were inferred based on population count data for the state of California obtained from the Centers for Disease Control and Prevention (CDC).19 CDC data were available in 5-year age groups and did not include a split at age 18. Therefore, for the HCAI dataset, counts for persons aged 18 or 19 years were approximated in each county by taking a random sample of 40% of persons with ages between 15 and 19 years (2 of 5 years between 15 and 19) and assigning them to the age group for 18–25.
Outcomes
We analyzed patterns in ED visits and inpatient hospitalizations for all causes, and those known to be exacerbated by heat exposure, including renal disease, heat-related illnesses, mental/behavioral disorders, cardiovascular disease, and respiratory disease (eTable 1; http://links.lww.com/EDE/C171). Patient records were retrieved from OLDW and a comparison dataset was obtained from the HCAI, which includes outcomes from all patients who received care at general acute care service centers in California (which includes essentially all hospitals and freestanding ambulatory clinics, and excludes military facilities).18 Although both datasets derive from the underlying population of California residents, anonymization and data licensing restrictions precluded their linkage (see explanation in eText 1; http://links.lww.com/EDE/C171). In the remainder of this article, we refer to ED visits and inpatient hospitalizations as ED and inpatient “encounters,” and the OLDW and HCAI as “datasets” (rather than as “populations”). Analyses of the OLDW data were approved by the Boston University Medical Campus Institutional Review Board (IRB). Analyses of the HCAI data were approved by the Boston University IRB and the CA Committee for the Protection of Human Subjects.
In the OLDW dataset, claims for ED and inpatient encounters were identified by revenue codes, current procedural terminology (CPT) codes, and place of service code (eFigure 1; http://links.lww.com/EDE/C171). We grouped claims into encounters by first claim date: any claims that occurred within 1 day of a previous claim for the same person were treated as the same encounter. We aggregated encounters into groups defined by encounter start date, encounter type (ED or inpatient), age group, sex, and county of patient residence. Information on other individual-level demographic characteristics such as income, race, or ethnicity, was not available. We categorized encounters containing only ED claims as an ED encounter; we categorized encounters including both ED and inpatient claims as inpatient encounters—we chose this categorization to align with how ED and inpatient encounters are recorded in the HCAI dataset.
HCAI data contain records of ED and inpatient encounters at essentially all hospitals in California. Individual-level encounters included encounter start date, county of patient residence, patient age and sex, and primary and secondary diagnosis codes; these were aggregated in a manner similar to that described for OLDW. We did not have access to expected payer records in this dataset, which precluded stratification of the HCAI dataset into individuals with and without health insurance.
Exposure
We estimated exposure to ambient extreme heat using population-weighted temperature metrics from the Parameter-elevation Regressions on Independent Slopes Model (PRISM),20 a spatiotemporal gridded meteorological dataset with approximately 4 km horizontal grid spacing.21 Drawing on prior research,22 we defined a county-specific heatwave as a sequence of 2 or more consecutive days during which maximum temperatures exceeded the year-round 97.5th percentile of county-specific daily maxima over a baseline period of 1999–2010 (method further described in eText 2; http://links.lww.com/EDE/C171). Within every heatwave spell that met this criterion, all constituent days were assigned as heatwave days and were matched to patient-hospital encounters according to the date of the first claim.22 The latter constituted our main exposure variable, which accounted for between 4.3 and 10.9% of days during each summer (eTable 2; http://links.lww.com/EDE/C171). In sensitivity analyses, we compared these results to models using a simpler exposure metric: a binary indicator of whether daily maximum temperature exceeded the local 97.5th percentile; by comparison 4.6–11.1% of days met this criterion.
Analytical Approach
For both the OLDW and HCAI datasets, we created a time series of daily ED or inpatient encounters within each age-sex-county-year stratum. For the OLDW data, we used CDC data to standardize these daily encounters by re-weighting the counts in each stratum so that the OLDW dataset more closely reflected the population of the state of California in each year (see sample calculation in eText 3; http://links.lww.com/EDE/C171), following recent approaches.23
We compared incidence rates (IR) expressed as encounters per million person–days. We then applied a space-time-stratified case–crossover design to empirically estimate the association between markers of extreme heat and rates of ED or inpatient encounters expressed as IRR.
Heat–health Associations
We used a conditional quasi-Poisson regression,24,25 with stratum () for county, year, month, and day-of-week. Conditional Poisson models provide computational gains over traditional unconditional models in regressions with many strata,26,27 and have been used in many case-crossover studies of the health impacts of heat.26,28–30 See Armstrong et al.24 for an elaborated discussion of the conditional Poisson modeling setup. A quasi-Poisson distribution was chosen to account for overdispersion. The outcome was the daily number of ED or inpatient encounters, for day t in strata s. Our predictor of interest was a binary heat wave day indicator (). Additional statistical controls included a binary variable indicating US federal holidays () and a continuous metric of daily average relative humidity (). Empty strata were removed from the dataset. The resulting regression equation for each dataset was:
The estimated parameter of interest () was used to calculate IRRs for heat exposure response under various dataset-outcome pairings. We performed sensitivity analyses under various definitions of exposure and within widened age strata (to ensure each stratum had more than 100 encounters total over the 4 years) (For sample estimation code, see eText 4; http://links.lww.com/EDE/C171).
We assessed the similarity of IRRs estimated from each dataset (HCAI, OLDW), by encounter type (ED, inpatient), and outcome definition. The exposures and outcomes for both datasets are not independent. Accordingly, the standard approach of taking the difference of regression coefficients and utilizing independent standard errors to construct a confidence interval (CI)31,32 would lead to either type I or type II errors depending on the covariance of the correlated coefficients (see explanation in eText 5; http://links.lww.com/EDE/C171). Another common method for judging significant differences between regression coefficients, that is, nonoverlapping 95% CIs, has been shown to underestimate differences in regression coefficients.33 Owing to the limitations of these approaches, we combined aggregated versions of each dataset and used a dummy variable for the dataset to directly capture the ratio of the two IRRs of interest. We present this rIRR in addition to graphical depictions of independently estimated IRRs. Our approach was informed by methods for seemingly unrelated regression,34 and a discussion of the interpretation of interaction terms in Poisson regression.35 We also were informed by studies investigating generalizability and IRRs: one that compared heart failure incidence rates across cohorts,36 and others that compared utilization rates of healthcare forms and vaccine safety during the early months of the COVID-19 pandemic in the United States.37,38 We created a combined dataset of encounters, added the dataset label to the strata definition, introduced a dummy variable for dataset (dt), where if dataset = HCAI, and created interaction terms between this dummy variable and each other covariate ( itself was not included as there were no strata containing data from both datasets by design):
The regression coefficient for the interaction term between the heatwave day and the dataset () therefore represents the rIRR between the two datasets (see eText 6; http://links.lww.com/EDE/C171 for sample code showing the equivalence of this setup to the traditional unconditional Poisson approach in a case with fewer strata). We formatted the rIRR so that rIRR >1 represents a stronger (i.e., more positive) association observed in the OLDW data, and rIRR <1 represents a stronger association in the HCAI data. All analyses were performed using R version 4.2.1,39 and all conditional Poisson models were run using the gnm package.40
RESULTS
Datasets
The total number of individuals in the OLDW dataset represented approximately 3.4% of the total in the California CDC dataset, with some key differences in terms of age and geographic distribution (Table 1). For example, the OLDW dataset included relatively more people than expected in Santa Clara County and relatively fewer people than expected in San Bernadino and Los Angeles counties. OLDW enrollees were present in all 58 counties in California, with ED or inpatient encounters recorded in 56 counties. We observed minimal temporal trends by age-sex strata (eFigure 2; http://links.lww.com/EDE/C171) and county (eFigure 3; http://links.lww.com/EDE/C171); only the smallest strata had large shifts in total population over time. As expected, standardizing the OLDW dataset eliminated differences in terms of age, sex, county, or year versus the California CDC dataset.
TABLE 1.
Annual Average Population Demographics and Between-dataset Ratios, Averaged Across Years
| OLDW enrollees in CA | CDC population data for CA | Ratio | |
|---|---|---|---|
| Total population | 1,005,765 | 31,347,773 | 0.03 |
| % Female | 50.1 | 50.6 | 0.99 |
| Age (year) categories | |||
| % in 18 to 24 | 11.3 | 17.3 | 0.66 |
| % in 25 to 34 | 20.5 | 18.6 | 1.10 |
| % in 35 to 44 | 20.8 | 16.5 | 1.26 |
| % in 45 to 54 | 19.0 | 16.4 | 1.16 |
| % in 55 to 64 | 14.7 | 14.6 | 1.01 |
| % in 65 to 74 | 7.5 | 9.5 | 0.79 |
| % in 75+ | 6.1 | 7.1 | 0.86 |
| Countya | |||
| % in Los Angeles (06037) | 18.7 | 26.1 | 0.72 |
| % in San Diego (06073) | 11.4 | 8.5 | 1.34 |
| % in Orange (06059) | 9.4 | 8.2 | 1.15 |
| % in Riverside (06065) | 4.6 | 5.9 | 0.77 |
| % in San Bernardino (06071) | 3.6 | 5.3 | 0.69 |
| % in Santa Clara (06085) | 10.7 | 4.9 | 2.18 |
| % in Alameda (06001) | 5.9 | 4.3 | 1.38 |
| % in Sacramento (06067) | 3.3 | 3.8 | 0.85 |
| % in Contra Costa (06013) | 3.6 | 2.9 | 1.26 |
| % in Other | 28.7 | 30.1 | 0.95 |
Large counties are defined by CDC population counts greater than 1 million.
CA indicates California; CDC, Centers for Disease Control and Prevention; OLDW, Optum Labs Data Warehouse.
Incidence Rates
Incidence rates of ED encounters for any cause and for heat, mental and/or behavioral disorders, cardiovascular disease, and respiratory disease were substantially higher for HCAI versus OLDW (Table 2). For example, rates of ED encounters for any cause were 335.8 encounters/million person–days in the crude OLDW dataset versus 827.3 encounters/million person-days in HCAI. Rates in OLDW were only slightly higher with versus without age-sex-county-year standardization, suggesting that differences in these factors did not explain the differences in rates observed between the crude OLDW and HCAI datasets. Only ED encounters for renal disease were of similar magnitude in the OLDW and HCAI datasets.
TABLE 2.
Incidence rates (Encounters/1M Person–days) by Outcome
| ED encounters | Inpatient encounters | |||||
|---|---|---|---|---|---|---|
| Outcome | OLDW crude | OLDW standardized |
HCAI | OLDW crude | OLDW standardized |
HCAI |
| All-cause | 335.8 | 362.7 | 827.3 | 153.8 | 163.4 | 276.3 |
| Heat-related | 16.4 | 17.9 | 30.4 | 30.1 | 34.6 | 67.1 |
| Renal disease | 0.8 | 1.0 | 0.9 | 9.0 | 10.9 | 4.3 |
| Mental/behavioral disorders | 12.9 | 13.9 | 40.5 | 11.7 | 12.9 | 19.3 |
| Cardiovascular disease | 16.1 | 17.6 | 25.9 | 41.1 | 47.7 | 37.8 |
| Respiratory disease | 17.3 | 19.5 | 44.3 | 24.3 | 28.1 | 14.8 |
ED indicates emergency department; HCAI, California Department of Health Care Access and Information; OLDW, Optum Labs Data Warehouse.
Rates of inpatient encounters for any cause, and for heat and mental/behavioral disorders were also higher in HCAI versus OLDW, while inpatient encounter rates for renal disease, cardiovascular disease, and respiratory disease were lower in the HCAI dataset. Standardization of the OLDW dataset had only a modest impact on all-cause or cause-specific inpatient incidence rates.
Heat–health Association Comparisons
Finally, we estimated the association between days of extreme heat and ED and inpatient encounters in each dataset (Figure 1). Dispersion parameters for all but one model were below 1.5, indicating adequate control for overdispersion (eTable 3; http://links.lww.com/EDE/C171). For ED encounters, the associations were typically in the same direction and broadly of similar magnitudes, regardless of the dataset. For example, the association between heat wave day and all-cause ED encounters had IRRs of 1.015 in the HCAI dataset, 1.007 in the OLDW crude dataset, and 1.004 in the OLDW standardized dataset. For heat-related ED encounters, while all IRRs were positive, OLDW IRRs were slightly higher, and with wider CIs. This suggests that enrollees included in the OLDW dataset were marginally more likely than individuals in the HCAI dataset to experience a heat-related ED encounter on days of extreme heat. ED encounters for renal disease and mental and behavioral disorders followed a similar pattern: IRRs were positive in both datasets but marginally higher in the OLDW and with wider CIs. Calculated rIRR were aligned with the visual comparison of IRRs (Tables 3 and 4). Specifically, most crude and standardized rIRR were similar across outcomes, although mental health and heat-related rIRRs indicated marginally stronger associations in OLDW data than in HCAI. IRRs for inpatient encounters were similar to those for ED encounters but with consistently wider CIs in OLDW (Figure 1B).
FIGURE 1.
Incidence rate ratios (IRR) and 95% confidence intervals by outcome for (A) emergency department (ED) encounters and (B) inpatient encounters on any day of the heatwave compared with a nonheatwave day in the same county, year, month, and day of week. Datasets are California Department of Health Care Access and Information (HCAI) and crude and standardized Optum Labs Data Warehouse (OLDW). The x-axis is log-scaled in both panels.
TABLE 3.
Coefficients and confidence Intervals of the Conditional Quasi-Poisson Model by Outcome for ED and Inpatient Encounters, Using the HCAI Dataset and Standardized OLDW Dataset
| ED encounters | Inpatient encounters | |||
|---|---|---|---|---|
| Outcome | rIRR | 95% CI | rIRR | 95% CI |
| All-cause | 0.989 | (0.969, 1.009) | 0.980 | (0.953, 1.008) |
| Heat-related | 1.080 | (0.999, 1.168) | 0.982 | (0.927, 1.040) |
| Renal disease | 0.963 | (0.718, 1.292) | 0.943 | (0.850, 1.047) |
| Mental/behavioral disorders | 1.098 | (1.004, 1.201) | 0.958 | (0.872, 1.052) |
| Cardiovascular disease | 0.995 | (0.917, 1.080) | 0.982 | (0.933, 1.033) |
| Respiratory disease | 1.047 | (0.964, 1.137) | 1.016 | (0.950, 1.086) |
rIRR > 1 represents a stronger association observed in the OLDW data, and rIRR < 1 represents a stronger association in the HCAI data.
ED indicates emergency department; HCAI, California Department of Health Care Access and Information; OLDW, Optum Labs Data Warehouse.
TABLE 4.
Coefficients and Confidence Intervals of the Conditional Quasi-Poisson Model by Outcome for ED and Inpatient Encounters, Using the HCAI Dataset and Crude OLDW Dataset
| ED encounters | Inpatient encounters | |||
|---|---|---|---|---|
| Outcome | rIRR | 95% CI | rIRR | 95% CI |
| All-cause | 0.992 | (0.973, 1.011) | 1.014 | (0.987, 1.042) |
| Heat-related | 1.059 | (0.977, 1.148) | 1.030 | (0.971, 1.092) |
| Renal disease | 0.913 | (0.657, 1.269) | 1.055 | (0.945, 1.176) |
| Mental/behavioral disorders | 1.108 | (1.012, 1.214) | 1.049 | (0.952, 1.156) |
| Cardiovascular disease | 0.996 | (0.916, 1.083) | 1.026 | (0.974, 1.081) |
| Respiratory disease | 1.048 | (0.963, 1.140) | 1.035 | (0.966, 1.109) |
rIRR > 1 represents a stronger association observed in the OLDW data, and rIRR < 1 represents a stronger association in the HCAI data.
ED indicates emergency department; HCAI, California Department of Health Care Access and Information; OLDW, Optum Labs Data Warehouse.
There was some evidence of effect measure modification by age group within each dataset (eFigures 4 and 5; http://links.lww.com/EDE/C171). For mental-health-related ED encounters, IRR in HCAI remained relatively similar across age groups, but the OLDW IRR was higher for adults aged 18–64 than for adults aged 65 and older. This may suggest that some OLDW patients (perhaps preferentially high-risk patients) left the OLDW system when they became eligible for coverage under Medicare. For inpatient encounters, the strongest effect modification by age was seen in HCAI for renal disease; IRR for adults aged 18–64 was much lower than for adults aged 65 and older.
Similarly, we observed some minor differences in IRR for each dataset depending on heat-wave definition (eFigures 6 and 7; http://links.lww.com/EDE/C171). IRRs for all-cause, heat-related, and mental-health-related ED encounters were similar regardless of heat-wave definition, but the IRR for renal disease in OLDW (but not HCAI) was highest on the 2nd+ day of heatwaves. For inpatients, the IRR for all-cause, heat-related, and renal disease conditions were similar regardless of heatwave definition, but the relative rate of mental-health-related outcomes in OLDW (but not HCAI) was lower on the first day of a heatwave.
DISCUSSION
The substantial health dangers posed by days of extreme heat are well documented. Recent heat–health studies have leveraged OLDW owing to its large spatial and temporal span and the wide age range of individuals covered.11–15 But being enrolled in a commercial health insurance plan may itself serve as a marker of lower risk of injury due to climate hazards, potentially limiting the generalizability of results derived from OLDW and similar databases of insured individuals. This article addresses the question of the generalizability of heat–health associations derived from individuals with commercial health insurance, specifically individuals with commercial or Medicare Advantage health insurance enrolled in OLDW in California. For ED encounters, we found that the associations between extreme heat and health were largely consistent in direction and magnitude across datasets. Specifically, the IRRs for heat-related ED encounters and mental-health ED encounters were positive in both datasets, albeit with a marginally larger magnitude and wider CIs in OLDW. For inpatient encounters, IRRs were similar between the two datasets, but rate ratio estimates for OLDW had wider CIs. Thus, if results were only available from the OLDW among enrollees in California, some associations between extreme heat and health harm may be missed. While the OLDW dataset may differ from the HCAI dataset in important ways, the direction and magnitude of the associations on the ratio scale between extreme heat and ED and inpatient encounters in OLDW are broadly consistent with those found in the HCAI dataset. Beyond issues of statistical precision, the similarity of IRRs for key health outcomes supports the use of OLDW data in quantifying the effect of extreme heat on human health, especially for ED encounters.
These results are contextualized by an interrogation of dataset composition and the underlying factors that drive healthcare utilization of the individuals in each dataset. The age and geographic distribution of the OLDW and HCAI datasets differ appreciably, with the crude OLDW data skewing younger and preferentially located in Santa Clara, San Diego, Orange, and Contra Costa Counties. Moreover, average incidence rates of ED and inpatient encounters were also substantially higher in the HCAI versus OLDW dataset, suggesting that the individuals in the OLDW dataset tended to be healthier than the HCAI dataset. However, standardizing the OLDW dataset to match the state population in terms of age, sex, county, and year had minimal impact on average incidence rates, suggesting that these differences do not appreciably contribute to the observed differences in incidence rates. The HCAI dataset was much larger, reflecting the ED and inpatient encounters of approximately 29 million residents of California. In comparison, the California population of OLDW reflects the medical claims of just under 1 million California residents, or about 3.4% of the state’s population during this time. As a result, estimated IRRs in the OLDW dataset have a larger variance (i.e., wider CIs) versus those derived from the HCAI dataset and a higher probability of missing some important associations of small magnitude.
Differences in incidence rates also emphasize a central question of this article—do the factors that lead to these differences affect the ability to use OLDW data to characterize changes in population-level health resulting from extreme heat? In the absence of a comprehensive population-based dataset that includes insurance status and type, we can address this question through discussion of the unmeasured individual factors that may lead to differences in ED and inpatient encounter rates by insurance status. Individuals without health insurance may, on average, have high healthcare utilization due to greater vulnerability, susceptibility, and other factors.41–43 Conversely, large systematic reviews have shown that having insurance is associated with both better health status and higher levels of healthcare utilization.16 Relevant to healthcare utilization studies in California, a state with approximately 2 million undocumented individuals at high risk for adverse health effects,44 undocumented status was not associated with higher healthcare utilization45 or ED encounter rates compared with insured populations.46 Research findings on the relationship between insurance status and inpatient encounters have shown that uninsured patients have shorter length of stay compared to publicly insured patients,47 and higher odds of discharge or transfer as compared with privately insured patients.48 However, research findings on the relationship between insurance status and ED encounter rates are mixed: recent studies have reported higher rates of ED encounters among both the insured42 and uninsured.49,50 Specific to climate hazards, uninsured individuals may be at greater risk of suffering the worst impacts and thus can have higher utilization during periods of extreme weather.16,17 A patient’s insurance status has also been associated with the differential assignment of diagnosis codes by providers (perhaps given implications for billing),51–53 which could impact the results of studies in which outcomes are identified using diagnosis codes. Considering these factors, the likelihood of a trip to the emergency room (and any resulting diagnosis) is a function of financial status, underlying health conditions, access to healthcare facilities and other community resources, and individual behavior and preferences, all of which are not neatly stratified by insurance status alone.
Given the various determinants of healthcare utilization among the insured and uninsured and our present results, we believe that OLDW can continue to be a resource for understanding the impacts of environmental exposures on human health. In both datasets, several key markers of the health impacts of extreme heat, including ED encounters for heat-related health, mental health, and renal disease, were in the same direction and had similar magnitudes. Although some of these associations were marginally stronger in OLDW than in HCAI, ultimately key outcomes were similarly and positively identified. The OLDW (and HCAI) results in this study align with others in California that characterized the relationship between extreme heat and human health, with the strongest associations seen for rates of mental health outcomes, renal disease, and dehydration/heat illness.54–57 As well, the OLDW is a unique resource in that it contains individuals of all ages with wide spatial and temporal spans in the contiguous US. A similar analysis at the national scale characterized the association between heat and ED encounters for a variety of causes.15
We have several recommendations for others using OLDW data. Whenever possible, OLDW results should be contextualized using a population-based dataset, even for a geographic subset of the total dataset. Second, population standardization can be considered but may not be necessary for future analysis, owing to the many factors that drive differences in ED and inpatient encounter rates that cannot be standardized given the available data. Finally, a strength of the OLDW is its coverage of individuals of all ages, across time and space in the contiguous United States; thus, ideal research questions will leverage the availability of data across these dimensions.
Finally, our results should be interpreted considering the following limitations: First, we did not have access to many individual-level variables in OLDW that would be useful in stratifying results, for example, income or race/ethnicity. As such, we cannot demonstrate the degree to which the OLDW California population differs from that of California on these dimensions, nor can we standardize estimates accordingly. Second, the HCAI data lacked information on the expected payer, limiting our ability to directly compare the effects of heat among the publicly insured, privately insured, and uninsured. Third, neither dataset included information on people who died during or before admission, potentially systematically excluding the small fraction of individuals most severely impacted on days of extreme heat. Fourth, our comparisons were performed for OLDW enrollees and residents in California, and the extent of generalizability may vary in other US states.
CONCLUSIONS
In this study of the impact of extreme heat on healthcare utilization, we compared IRR for ED and in-patient hospitalization encounters among adult residents of California included in a dataset of commercially insured individuals (OLDW) and a dataset representing essentially all individuals in California (HCAI). Rate ratios for days of extreme heat and ED encounters were largely of similar magnitudes and directions despite differences in baseline incidence rates. Rate ratios for days of extreme heat and inpatient encounters in OLDW and HCAI had overlapping CIs, but the CIs for OLDW were much wider. Population standardization did not meaningfully change the ratio of IRRs between these two populations. Despite some important limitations, a dataset of commercially insured individuals (OLDW) appears to be appropriate for use in quantifying the effect of extreme heat on rates of related ED encounters and, to a lesser extent, inpatient encounters.
Supplementary Material
Footnotes
Supported by the BU URBAN program via the National Science Foundation’s NRT DGE 1735087 grant, the US Department of Energy’s Office of Science (DESC0016162), the National Institute of Environmental Health Sciences (NIEHS R01-ES029950), the Wellcome Trust (Grant 216033-Z-19-Z), and the US Department of Energy, Office of Science, Biological and Environmental Research Program, Earth and Environmental Systems Modeling, MultiSector Dynamics under Cooperative Agreement DE-SC0022141.
The authors report no conflicts of interest.
The data are not available for replication because of the confidential nature of the individual-level health data used to create the aggregated counts in this research. A simplified example of the computing code is provided in the supplemental materials.
Supplemental digital content is available through direct URL citations in the HTML and PDF versions of this article (www.epidem.com).
REFERENCES
- 1.Watts N, Adger WN, Agnolucci P, et al. Health and climate change: policy responses to protect public health. Lancet. 2015;386:1861–1914. [DOI] [PubMed] [Google Scholar]
- 2.Hajat S, O’Connor M, Kosatsky T. Health effects of hot weather: from awareness of risk factors to effective health protection. Lancet. 2010;375:856–863. [DOI] [PubMed] [Google Scholar]
- 3.Bobb JF, Obermeyer Z, Wang Y, Dominici F. Cause-specific risk of hospital admission related to extreme heat in older adults. JAMA. 2014;312:2659–2667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Gronlund CJ, Zanobetti A, Schwartz JD, Wellenius GA, O’Neill MS. Heat, heat waves, and hospital admissions among the elderly in the United States, 1992–2006. Environ Health Perspect. 2014;122:1187–1192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Liss A, Naumova EN. Heatwaves and hospitalizations due to hyperthermia in defined climate regions in the conterminous USA. Environ Monit Assess. 2019;191:394. [DOI] [PubMed] [Google Scholar]
- 6.Fuhrmann CM, Sugg MM, Konrad CE, Waller A. Impact of extreme heat events on emergency department visits in North Carolina (2007-2011). J Community Health. 2016;41:146–156. [DOI] [PubMed] [Google Scholar]
- 7.Yoo EH, Eum Y, Gao Q, Chen K. Effect of extreme temperatures on daily emergency room visits for mental disorders. Environ Sci Pollut Res Int. 2021;28:39243–39256. [DOI] [PubMed] [Google Scholar]
- 8.Basu R, Gavin L, Pearson D, Ebisu K, Malig B. Examining the association between apparent temperature and mental health-related emergency room visits in California. Am J Epidemiol. 2018;187:726–735. [DOI] [PubMed] [Google Scholar]
- 9.Hess JJ, Saha S, Luber G; Summertime Acute Heat Illness in U.S. Emergency departments from 2006 through 2010: analysis of a Nationally Representative Sample. Environ Health Perspect. 2014;122:1209–1215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bernstein AS, Sun S, Weinberger KR, Spangler KR, Sheffield PE, Wellenius GA. Warm season and emergency department visits to U.S. Children’s Hospitals. Environ Health Perspect. 2022;130:17001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Saha S, Brock JW, Vaidyanathan A, Easterling DR, Luber G. Spatial variation in hyperthermia emergency department visits among those with employer-based insurance in the United States – a case–crossover analysis. Environ Health. 2015;14:20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Stowell JD, Sun Y, Spangler KR, et al. Warm-season temperatures and emergency department visits among children with health insurance. Environ Res Health. 2022;1:015002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Nori-Sarma A, Milando C, Weinberger KR, Hess JJ, Errett NA, Wellenius GA. Association between the 2021 heat wave in Portland, Oregon, and Seattle, Washington, and emergency department visits. JAMA. 2022;328:2360–2362. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Nori-Sarma A, Sun S, Sun Y, et al. Association between ambient heat and risk of emergency department visits for mental health among US adults, 2010 to 2019. JAMA Psychiatry. 2022;79:341–349. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Sun S, Weinberger KR, Nori-Sarma A, et al. Ambient heat and risks of emergency department visits among adults in the United States: time stratified case crossover study. BMJ. 2021;375:e065653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Freeman JD, Kadiyala S, Bell JF, Martin DP. The causal effect of health insurance on utilization and outcomes in adults: a systematic review of US studies. Med Care. 2008;46:1023–1032. [DOI] [PubMed] [Google Scholar]
- 17.Benevolenza MA, DeRigne L. The impact of climate change and natural disasters on vulnerable populations: a systematic review of literature. J Hum Behav Soc Environ. 2019;29:266–281. [Google Scholar]
- 18.California Office of Statewide Health Planning and Development (OSHPD). 2019. 2016-2019 Emergency Room Visit and Patient Discharge Data File Documentation.; 2019. [Google Scholar]
- 19.National Center for Health Statistics. Vintage 2020 postcensal estimates of the resident population of the United States (April 1, 2010, July 1, 2010-July 1, 2020), by year, county, single-year of age (0, 1, 2,.., 85 years and over), bridged race, Hispanic origin, and sex. Prepared under a collaborative arrangement with the U.S. Census Bureau. Available from: /nchs/nvss/bridged_race.html as of September 22, 2021, following the release by the U.S. Census Bureau of the unbridged Vintage 2020 postcensal estimates by 5-year age group on June 17, 2021. 2023.
- 20.Spangler KR, Weinberger KR, Wellenius GA. Suitability of gridded climate datasets for use in environmental epidemiology. J Expo Sci Environ Epidemiol. 2019;29:777–789. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Daly C, Halbleib M, Smith JI, et al. Physiographically sensitive mapping of climatological temperature and precipitation across the conterminous United States. Int J Climatol. 2008;28:2031–2064. [Google Scholar]
- 22.Hajat S, Armstrong B, Baccini M, et al. Impact of high temperatures on mortality: is there an added heat wave effect? Epidemiology. 2006;17:632–638. [DOI] [PubMed] [Google Scholar]
- 23.Zuvekas SH. Comparing MEPS Use and Expenditure Estimates for the Privately Insured to Truven MarketScan® and OptumLabsTM Claims Data, 2008-2013.; 2017. [Google Scholar]
- 24.Armstrong BG, Gasparrini A, Tobias A. Conditional poisson models: a flexible alternative to conditional logistic case cross-over analysis. BMC Med Res Methodol. 2014;14:122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wu Y, Li S, Guo Y. Space-Time-Stratified Case-Crossover Design in Environmental Epidemiology Study. Health Data Sci. 2021;2021:9870798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Remigio RV, Jiang C, Raimann J, et al. Association of extreme heat events with hospital admission or mortality among patients with end-stage renal disease. JAMA Netw Open. 2019;2:e198904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ragettli MS, Vicedo-Cabrera AM, Schindler C, Röösli M. Exploring the association between heat and mortality in Switzerland between 1995 and 2013. Environ Res. 2017;158:703–709. [DOI] [PubMed] [Google Scholar]
- 28.Zhao Q, Li S, Coelho MSZS, et al. The association between heatwaves and risk of hospitalization in Brazil: a nationwide time series study between 2000 and 2015. PLoS Med. 2019;16:e1002753. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kephart JL, Sánchez BN, Moore J, et al. City-level impact of extreme temperatures and mortality in Latin America. Nat Med. 2022;28:1700–1705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Weinberger KR, Wu X, Sun S, et al. Heat warnings, mortality, and hospital admissions among older adults in the United States. Environ Int. 2021;157:106834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Altman DG, Bland JM. Interaction revisited: the difference between two estimates. BMJ. 2003;326:219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Greenland S, Senn SJ, Rothman KJ, et al. Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations. Eur J Epidemiol. 2016;31:337–350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Schenker N, Gentleman JF. On judging the significance of differences by examining the overlap between confidence intervals. Am Stat. 2001;55:182–186. [Google Scholar]
- 34.Zellner A. An efficient method of estimating seemingly unrelated regressions and tests for aggregation bias. J Am Stat Assoc. 1962;57:348–368. [Google Scholar]
- 35.Shang S, Nesson E, Fan M. Interaction terms in Poisson and log-linear regression models. Bull Econ Res. 2018;70:E89–E96. [Google Scholar]
- 36.Kociol RD, Hammill BG, Fonarow GC, et al. Generalizability and longitudinal outcomes of a national heart failure clinical registry: comparison of Acute Decompensated Heart Failure National Registry (ADHERE) and non-ADHERE Medicare beneficiaries. Am Heart J. 2010;160:885–892. [DOI] [PubMed] [Google Scholar]
- 37.Mafi JN, Craff M, Vangala S, et al. Trends in US ambulatory care patterns during the COVID-19 pandemic, 2019-2021. JAMA. 2022;327:237–247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Xu S, Hong V, Sy LS, et al. Changes in incidence rates of outcomes of interest in vaccine safety studies during the COVID-19 pandemic. Vaccine. 2022;40:3150–3158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.R Core Team. R: A language and environment for statistical computing. Vienna, Austria; R Foundation for Statistical Computing. [Google Scholar]
- 40.Heather T, Davi F. Generalized nonlinear models in R: An overview of the gnm package. Available at: https://cran.r-project.org/web/packages/gnm. Accessed 5 June 2023. [Google Scholar]
- 41.Newton MF, Keirns CC, Cunningham R, Hayward RA, Stanley R. Uninsured adults presenting to US emergency departments: assumptions vs data. JAMA. 2008;300:1914–1924. [DOI] [PubMed] [Google Scholar]
- 42.Zhou RA, Baicker K, Taubman S, Finkelstein AN. The uninsured do not use the emergency department more—they use other care less. Health Aff (Millwood). 2017;36:2115–2122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Klein EY, Levin S, Toerper MF, et al. The effect of Medicaid expansion on utilization in Maryland emergency departments. Ann Emerg Med. 2017;70:607–614.e1. [DOI] [PubMed] [Google Scholar]
- 44.Hayes J, Hill L. Undocumented Immigrants in California. Public Policy Institute of California. 2017. Available at: https://www.ppic.org/wp-content/uploads/JTF_UndocumentedImmigrantsJTF.pdf. [Google Scholar]
- 45.Chavez LR. Undocumented immigrants and their use of medical services in Orange County, California. Soc Sci Med. 2012;74:887–893. [DOI] [PubMed] [Google Scholar]
- 46.Pourat N, Wallace SP, Hadler MW, Ponce N. Assessing health care services used by California’s undocumented immigrant population In 2010. Health Aff (Millwood). 2014;33:840–847. [DOI] [PubMed] [Google Scholar]
- 47.Englum BR, Hui X, Zogg CK, et al. Association between insurance status and hospital length of stay following trauma. Am Surg. 2016;82:281–288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Venkatesh AK, Chou SC, Li SX, et al. Association between insurance status and access to hospital care in emergency department disposition. JAMA Intern Med. 2019;179:686–693. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.McConville S, Lee H. Emergency department care in California. Hans PJ, ed. 2008:10. Available at: https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=42d9449c42486d99d6034ad055e922a948592d2e.
- 50.Tang N, Stein J, Hsia RY, Maselli JH, Gonzales R. Trends and characteristics of US emergency department visits, 1997-2007. JAMA. 2010;304:664–670. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Hersh AL, Shapiro DJ, Pavia AT, Shah SS. Antibiotic prescribing in ambulatory pediatrics in the United States. Pediatrics. 2011;128:1053–1061. [DOI] [PubMed] [Google Scholar]
- 52.Carlson JN, Menegazzi JJ, Callaway CW. Magnitude of national ED visits and resource utilization by the uninsured. Am J Emerg Med. 2013;31:722–726. [DOI] [PubMed] [Google Scholar]
- 53.Mannix R, Stack AM, Chiang V. Insurance status and the care of adult patients 19 to 64 years of age visiting the emergency department. Acad Emerg Med. 2012;19:808–815. [DOI] [PubMed] [Google Scholar]
- 54.Green RS, Basu R, Malig B, Broadwin R, Kim JJ, Ostro B. The effect of temperature on hospital admissions in nine California counties. Int J Public Health. 2010;55:113–121. [DOI] [PubMed] [Google Scholar]
- 55.Basu R, Pearson D, Malig B, Broadwin R, Green R. The effect of high ambient temperature on emergency room visits. Epidemiology. 2012;23:813–820. [DOI] [PubMed] [Google Scholar]
- 56.Guirguis K, Gershunov A, Tardy A, Basu R. The impact of recent heat waves on human health in California. J Appl Meteorol Climatol. 2014;53:3–19. [Google Scholar]
- 57.Sherbakov T, Malig B, Guirguis K, Gershunov A, Basu R. Ambient temperature and added heat wave effects on hospitalizations in California from 1999 to 2009. Environ Res. 2018;160:83–90. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.

