Sign in Register Submit Manuscript

Hapres Home

Location: Home >> Detail

J Sustain Res. 2026;8(2):e260035. https://doi.org/10.20900/jsr20260035

Article

Lookout Below! The Downstream Infant Health Impacts of Mining Disasters

Benjamin A. Jones

Department of Economics, University of New Mexico, Albuquerque, NM 87106, USA

Received: 03 Feb 2026; Accepted: 10 Apr 2026; Published: 15 Apr 2026

ABSTRACT

Limited causal evidence exists on the acute infant health effects of environmental disasters, especially among mining accidents in developed countries. To address this gap, this paper exploits a natural experiment provided by the 2015 US EPA’s accidental release of mine waste water into nearby river systems at the Gold King Mine in Colorado, USA to investigate the acute birth weight and gestation effects of a short-duration shock to water quality. Using both the synthetic control method and a difference-in-differences approach and information on the precise timing of the spill, I show that instances of low birth weight and premature births increased by 0.6 and 0.71 percentage points, or 7.7% and 9.1%, respectively, in downstream counties during the year of and year after the spill. These results highlight the policy importance of funding sustainable approaches for cleaning-up legacy metal mining operations in order to minimize their impacts on adjacent populations.

KEYWORDS: mine wastewater; river contamination; infant health; environmental disasters; sustainable coexistence

ABBREVIATIONS

US EPA, US Environmental Protection Agency; SCM, synthetic control method; DID, difference-in-differences

INTRODUCTION

There is a sizable literature linking water pollution to human health outcomes (e.g., [1–3]). However, much of the focus and attention of this literature has gravitated towards chronic health impacts associated with prolonged exposure to contaminated water supplies (e.g., [4,5]) and weather-related acute impacts (e.g., [6,7]), in predominately developing counties (e.g., [8–10]). While such works make novel contributions in their own right, concerns were recently raised by The Lancet Commission on Pollution and Health that insufficient attention is being given to the water quality health impacts associated with industrial and mining industries, including accidental releases [11]. Even in developed countries, it has recently been shown that mining water quality accidents can have meaningful health and economic impacts on local communities [12].

This paper contributes new causal evidence to the nascent literature on mining disaster related acute health impacts of water pollution by exploiting a natural experiment produced by the US Environmental Protection Agency’s (US EPA) accidental release of approximately 3 million gallons of metal-rich mine waste water at the Gold King Mine in Colorado, USA in 2015. The deluge of metals and toxic compounds from the spill flowed into the nearby Animas River, which is a major source of municipal drinking water and recreation throughout southwestern Colorado and northwestern New Mexico. I investigate how releases of mine waste by-products such as arsenic, lead, and mercury into a river system affects infant health. Additionally, I investigate two states in the Western US where there is a historical legacy of abandoned mines that are in close proximity to major river systems (or their tributaries) such as the Colorado and Rio Grande rivers. This increases the generalizability of my study to other parts of the US where abandoned mines also exist (e.g., California, the mountain West, Appalachia, etc.).

There are several reasons why sustainability practitioners may be interested in this work. First, large population growth in former or current mining communities or along river systems that flow downstream from mining communities (e.g., along the Front Range of Colorado, in the Wasatch Range of Utah, etc.) means that there are increased opportunities for mine-related disasters to adversely affect individuals, families, and communities (Between 2000-2018, population growth rates in traditionally mining heavy states have outpaced the rest of the US: Utah (14.4%), Colorado (13.2%), Nevada (12.4%), Arizona, (12.2%), California (6.2%). Nevada, Arizona, and California are among the top five mineral producing states in 2016 [13]. Most of these states also rely on surface water supplies for municipal water, which mining waste water disasters can potentially contaminate.). Thus, there is a need to learn to sustainably co-exist with former and current mining operations since safe and reliable water supplies are necessary inputs into the economy [14]. Second, there are increasing concerns that early-in-life exposure to pollutants and contaminants is causally linked to long-term labor market and human capital outcomes—the “fetal origins hypothesis” [15]. Hence, in utero or neonatal exposure to contaminated water from an environmental disaster could therefore have adverse consequences on many later-in-life economic variables of interest, reducing society’s ability to sustainably co-exist with abandoned mines. Finally, given the deteriorating physical support infrastructure in many abandoned mines across the US, there is a need to better understand the potential health impacts associated with water breaches at these sites. Knowing about the magnitude of potential health impacts would be helpful when setting mine clean-up priorities and in establishing appropriate financing of such ventures along sustainability dimensions.

My primary research question is: “To what extent did the temporary shock to river water quality caused by the release of mine waste water from the Gold King Mine adversely impact infant health in downstream counties in Colorado and New Mexico during the months immediately following the spill?” Modern causal inference techniques are used to address this question, and I focus my attention on impacts to infant health outcomes. To increase the credibility of my empirical strategy, I employ the synthetic control method (SCM) of [16,17]. Importantly, synthetic control and treated counties are match-weighted to have not only similar pre-spill infant health outcomes, but are also matched on many socio-demographic and flexible weather characteristics that are known to influence infant health. In addition to the SCM model, a more traditional difference-in-differences (DID) model is also estimated, which allows me to also exploit the rich, individual birth data available to me.

Since I have data on the precise timing of the spill and subsequent movement of the pollutants downstream, I focus my analysis on a short period of time right after the spill (up to 16 months post-spill). For identification purposes, this increases the likelihood that other exogenous confounding factors affecting infant health are constant over the analysis period. However, several falsification and robustness checks are also performed in order to boost the causal interpretation of the results: (i) an internal validity test shows no significant changes in mother and infant characteristics, county socio-demographic indicators, or weather outcomes concurrent with the river water quality shock; (ii) the timing of impacts found in my empirical models are greatest during the first few months after the spill and then diminish to insignificance within one calendar year, consistent with an acute health impact environmental disaster story; (iii) falsification tests show no impact on infant health prior to the actual date of the spill, and no impact is observed on birth outcomes known to be unaffected by environmental exposures; (iv) results are robust to alternative specifications of the SCM donor pool, including alternative geographic and population-based specifications, and; (v) results are robust to alternative measures of birth outcomes and whether I look at singleton or multiple births.

This work contributes to the sustainability and environmental health literature in at least two ways. First, it provides new causal evidence that short-duration shocks to water quality can adversely affect infant health, which adds to the literature on water pollution (e.g., [18,19]) and water pollution and health (e.g., [8,20,21]). I improve upon the extant literature by showing adverse outcomes in a developed country where water quality concerns are often thought to have more marginal impacts on health (due to stringent Federal regulations, implementation of advanced water treatment technology, widespread availability of bottled water, etc.) and by looking at multiple measures of adverse health outcomes along both the intensive and extensive margins. Second, this work adds a new dimension to our understanding of the economic costs of mining-related environmental disasters, thereby complementing the small set of economic studies in this area (e.g., [12,22]). Importantly, this is the first work to causally link mine waste water (as opposed to other types of water pollution) to downstream infant health outcomes.

Environmental Disasters, Water Quality, and Human Health

There is an important distinction in the literature between the impacts of industrial environmental disasters on water quality and health and the general literature on water pollution and health outcomes. Disaster events result in a one-time shock to water quality that may cause a temporary shock to acute health outcomes. By contrast, continuous prolonged exposure to contaminated or polluted water supplies may result in different health outcomes, likely of a more chronic nature (e.g., [4,8]). The focus of this work is specifically on disaster impacts and there are surprisingly few published studies on the health impacts of shocks to water quality caused by industrial environmental disasters. Even in the epidemiology literature, few disaster-related papers have been published on water and health, and of these, an even smaller number are causally interpretable (see, for example, the discussions in [23,24]). This is troubling given that human-caused industrial environmental disasters can be significant sources of toxic releases into waterways and into the environment more generally. Recent industrial water disaster examples in the US, include: the 2014 Duke Energy spill of 27 million gallons of coal ash into the Dan River in North Carolina; the 2014 Freedom Industries spill of approximately 10,000 gallons of 4-methycyclohexane methanol (MCHM) into the Elk River in West Virginia, and; the 2010 BP Deepwater Horizon blowout that resulted in a discharge of 210 million gallons of oil into the Gulf of Mexico.

Studies in this area have generally found that water quality shocks after environmental disasters result in acute physical and mental health impacts, from the relatively benign—headaches, scratchy throats, watery eyes, and worse birth outcomes [12,25]—to the more severe—cancer, mercury poisoning, PTSD, generalized anxiety disorder, and even mortality [26–28]. Exposure pathways include direct ingestion of contaminated water, indirect consumption through contaminated food products, or through recreational exposure in polluted waterways. However, most prior work in this area, including most of the studies referenced above, are from studies of oil spills, which is just one subset of water quality-related disaster events.

While health may be impacted by many types of environmental disasters, there is a particular concern about mining and industrial accidents, which tend to be major sources of environmental releases of compounds such as arsenic, lead, and mercury; compounds which extensive prior evidence has shown can adversely affect human health (e.g., [29,30]). The Lancet Commission on Pollution and Health recently echoed this concern by specifically noting the meaningful role that mining and industrial releases of toxins can have on health outcomes, while simultaneously highlighting a dearth of research in this area [11]. To sustainability practitioners, there is therefore a concern that the health externalities of mining and industrial disasters are not fully understood due to of a lack of credible empirical analyses in this area. One policy consequence of this knowledge gap is that damage assessments of mining and industrial disasters are likely to be downward biased due to an omission of health costs directly resulting from exposure to contaminated water supplies. This could negatively affect efforts to sustainably co-exist with current and former mining sites.

Of course, this is not the first paper to propose that the sustainability and environmental health literatures be extended to include investigations of mining water quality accidents. In [12], the synthetic control method was used to study the adverse health impacts and economic costs of a 2014 mining accident that resulted in the chemical MCHM being discharged into the Elk River, West Virginia. The authors found significant deteriorations to post-birthing infant outcomes as measured by 5-minute Apgar Scores after the spill, but did not observe significant changes in either birth weight or length of gestation. To the best of my knowledge, [12] are the first to rigorously apply modern causal inference techniques to study health outcomes in the aftermath of a mining water quality accident.

In this paper, I add to the literature by studying a mine waste water disaster, rather than a chemical spill, where large quantities of metals such as arsenic, lead, mercury, manganese, chromium, etc. were released into a nearby river system. Since there is strong prior evidence that these compounds can adversely impact health, especially among children and infants, I hypothesize that birth weight and gestation length will be affected, at least temporarily, by the mine water accident investigated here. Furthermore, my results are likely more generalizable than prior work because the source of our environmental disaster is an accidental release of waste water from an abandoned underground mine. Given that hundreds, if not thousands of similarly abandoned mines exist across the US, many that may have similar waste water build-ups as found at the Gold King Mine [31], the possibility of future accidents that are similar in size and scope cannot be ruled out (Federal investigators from the US Bureau of Reclamation found that conditions that led to the Gold King disaster “are not isolated or unique and in fact are surprisingly prevalent” [32].). In light of this, the results of this work are generally informative of the human health externalities associated with potential mine waste water disasters in places across the Western US (e.g., Colorado, California, Utah, Nevada) and in Appalachia (e.g., West Virginia, Virginia, North Carolina).

The Gold King Mine Spill

On 2015 Aug 5, during routine US EPA reclamation work at the abandoned Gold King Mine near Silverton, CO, workers accidently loosened some of the rocky material in one of the main mine tunnels, thereby allowing pressurized water that had built up to flow out of the mine entrance and into nearby Cement Creek, which is a tributary of the Animas River. Over a several days period, approximately 3 million gallons of mine waste water were released into the Animas River, causing the river to turn an eerily bright orange color. The Animas River serves as a major source of drinking water for the downstream communities of Silverton, CO (2010 population of 645), Durango, CO (16,815 pop.), Aztec, NM (6765 pop.), and Farmington, NM (46,046 pop.)—see Figure 1. Near Farmington, NM, the Animas merges with the San Juan River, which is a major tributary of the Colorado River.

FIGURE 1
Figure 1. Map of the Gold King Mine Area and the Animas River. Shaded areas are the three downstream treated counties of (from north to south): San Juan, CO, La Plata, CO, and San Juan, NM.

The waste water released from the Gold King Mine contained a diverse array of heavy metals and other compounds including: aluminum, arsenic, barium, beryllium, cadmium, chromium, copper, iron, lead, manganese, mercury, silver, vanadium, and zinc (By weight, the top ten most prevalent metals in the mine waste water were, in order from highest to lowest: iron, aluminum, calcium, magnesium, potassium, lead, magnesium, zinc, copper, and sodium [33].). Using US EPA water quality data at two Animas River monitoring sites located downstream from the mine entrance: (i) the A72 site, located near Silverton, CO, and; (ii) the Animas-Rotary Park site, located in Durango, CO, I have plotted measured concentrations of various water pollutants in Figure 2, along with US EPA recommended water quality criteria for human health, or, if health criteria are unavailable, for aquatic life [34]. Concentrations are plotted immediately before and after the main plume of waste water flowed through each monitoring site: August 5–10 for site #1 (A72) and August 6–8 for site #2 (Animas-Rotary Park). As can be seen in Figure 2, pollutant concentrations during the main plume pass-by event exceeded US EPA recommendations for aluminum, arsenic, cadmium, lead, and manganese (at both sites), and copper and mercury at site #1. At peak impact, the US EPA reported that arsenic levels in the river water running through Durango were approximately 300 times the normal level, and lead levels were approximately 3500 times the normal level [35].

Coincidently, the mine accident occurred during the peak summer recreation season, and the Animas is a large source of recreational and fishing opportunities in the region; over 58,000 user days were logged on the Animas River in 2005, making it the second most recreationally active river in all of Colorado [36]. While some local residents of southern Colorado and northern New Mexico would have been recreating in the Animas River during the spill period (this is a local destination hotspot in the summer, including by this author), many non-local residents would have also likely visited the area during the spill. The birth data I use only allows me to capture localized effects and not impacts from pregnant mothers who may have visited the animas and then returned home (e.g., to a different city, state, etc.). to the extent that these non-local mothers are in my counterfactual group, this would bias us against observing an impact of the spill on infant health (because some mothers in the counterfactual would have undergone degrees of treatment). Thus, my results may potentially be downward biased as a result. However, it is unknown how many non-local residents (or pregnant mothers, to be specific) were in the area during the spill event. A few days after the spill, public officials closed off access to the river in and around Durango and Silverton, CO (lasting until August 14) and residents with wells were advised to have their water tested before any use due to possible contamination [37]. However, photographs appearing in The New York Times showed rafters and kayakers still recreating in the bright orange-colored river even after the spill [37]. The City of Durango also preemptively stopped pumping water from the Animas River into their municipal supply, in order to prevent possible contamination, and people were urged to not catch or eat any fish from the river. However, it is unknown how effective these public advisories were at preventing possible exposure to contaminants.

FIGURE 2
Figure 2. Concentrations of Water Pollution in the Animas River Before and After the Mine Spill Source: [34]. Notes: recommended criteria from US EPA National Recommended Water Quality Criteria—Human Health Criteria Table. Exceptions are for aluminum, lead, and cadmium where the Aquatic Life Criteria Table is used instead (due to the absence of a human health criterion for these pollutants).

During and immediately after the spill event, there were widespread concerns about adverse health impacts resulting from direct contact with contaminated water or from consuming crops that were irrigated by contaminated river water (e.g., [37,38]). Concerns were especially pronounced within the Navajo Nation, the largest Native American reservation in the US, who rely heavily on river water located downstream from the Gold King Mine as a source of drinking water, irrigation, and recreation [39]. Despite assurances from the Colorado Department of Public Health and Environment, among other government agencies, that any health impacts caused by the spill would be “low” [40], this is actually an open empirical question.

Of course not all residents in areas downstream from the Gold King Mine would have been necessarily exposed to contaminated water, however, based on the above discussion, there are several potential exposure pathways, including: (i) from recreation in the Animas River and its connected waterbodies; (ii) from the consumption of any marine life in the Animas or connected waterbodies, and; (iii) from possible contamination of municipal water supplies or domestic wells in the region.

There is particular concern about acute health effects immediately after the spill, when water pollution concentrations were elevated above baseline levels. Given that in utero and infant health are especially sensitive to small changes in exposure to environmental pollutants, it is possible that the spill may have resulted in temporary adverse impacts among this sensitive population. A 2018 joint Federal, state, and county agency report was released that found no long-term impacts of the Gold King Mine spill on water quality, and, moreover, the report states (without providing empirical justification) that the mine waste water poses “no risk to human health if common-sense precautions are followed”. However, this study provides no information regarding acute health impacts, which, on the basis of the prior environmental disaster literature [23], we would anticipate would be the primary source of any health impacts observed (rather than long-term or chronic health effects). The biological mechanisms linking heavy metals to fetal development are not precisely known, though there are several plausible hypotheses. For instance, arsenic in maternal blood can cross the placenta barrier and reach the fetus during the second and third trimesters. This can lead to decreased expression of adipose tissue-derived ENPP2, a phospholipase that regulates adipose tissue growth, thereby affecting fetal fat development and hence birth weight [41]. Furthermore, arsenic, even at low exposure levels, increases general inflammation in newborns exposed in utero, which can affect gestation length through cervical ripening and rupture of membranes [42]. Arsenic may also have direct toxic effects on DNA repair enzymes, which could affect fetal growth. Additionally, other heavy metals such as lead, cadmium, and mercury can reduce the thickness of the mother’s placenta and its overall functionality, with associated effects on birth weight and gestation length [43]. Heavy metals can also damage the fetus through oxidative cell stress, neurological damage, DNA injury, and uptake of calcium and glucose [44]. Birth abnormalities and fetal development issues have been observed even at heavy metal exposure levels that are virtually non-toxic to the adult mother herself and with no overt clinical signs [45], thus reinforcing the possibility that even low exposure to contaminants from the Gold King Mine spill (such as through municipal water supplies, recreation, etc.) could have significant birth outcome impacts.

MATERIALS AND METHODs

Data

Restricted-use data on the near-universe of infant health outcomes and mother characteristics were obtained for Colorado and New Mexico, with permission, from the US CDC National Center for Health Statistics (NCHS) for the years 2011–2016. The analysis is limited to only those downstream counties in Colorado and New Mexico (and not areas of Utah or Arizona) since US EPA measurements showed small to negligible effects on water quality once the river reached the Utah state line (the metals had been heavily diluted by that point) [34]. These data contain individual birth records for >99% of all live births and include identification of each mother’s county of residence. I focus on extensive and intensive margin measures of infant health: low birth weight (<2500 g at birth), prematurity (birth occurring before the 37th week of pregnancy), birth weight (in grams), and gestation length (in weeks). In addition, the data contain rich information on mother demographic characteristics (e.g., age, race, education) and behavioral actions taken during pregnancy (e.g., smoking status, prenatal care visits).

To account for potential confounders of infant health that are unrelated to the Gold King Mine spill, I combined the infant health data with annual data on household income and poverty rates (from the US Census Bureau Small Area Income and Poverty Estimates Program), unemployment rates (from the US Bureau of Labor Statistics), and demographic and educational attainment data (from the American Community Survey), all at the county-level. The specific variables used are presented in Table 1.

Since weather patterns are a known determinant of both health and pollution, I obtained detailed daily weather data on precipitation and temperature (maximum and minimum) from the NOAA National Centers for Environmental Information (NCEI). Given the presence of multiple weather monitoring sites in the counties in the study area, I follow [46] and take a population-weighted average of the weather stations within a 25-mile radius of the population centroid of the county. The weights are equal to the inverse of the square root of the station’s distance to the population-weighted county centroid. Minimum and maximum temperature are averaged to obtain mean temperature in each county. I also use daily data on wind speed and direction from NCEI’s North American Regional Reanalysis. Data consist of 32 km × 32 km grid cells of East-West and North-South wind vectors. The inverse distance weighting approach was again used to construct wind conditions at the county-level. In the models that follow, I include flexible controls for both weather and wind. Five daily temperature bins (<25°, 25°–45°F, 45°–65°F, 65°–85°F, and >85°F), three daily wind direction bins (0–120°, 120–240°, and 240–360°) and three daily wind speed bins (0–2 m/s, 2–4 m/s, >4 m/s) are used. The results are robust to the exclusion of the wind controls. However, they are included in the main analysis to eliminate any confounding effects from odor from the river caused by the mine waste water spill that could have adversely affected human health during the treatment period. The bins capture the number of days during a given pregnancy that fall into each temperature or wind direction/speed range.

TABLE 1
Table 1. Summary statistics. Sources: US CDC NCHS, US Census Bureau, US BLS, and NOAA NCEI.

Mothers who reside in either San Juan County, CO, La Plata County, CO, or San Juan County, NM, are considered to be in the treated group—i.e., they live downstream from the Gold King Mine spill along or near the Animas River (see the shaded counties in Figure 1). These counties contain the population centers of Silverton, Durango, Aztec, and Farmington, which have a combined population of more than 70,000 people. While not all mothers in these counties live along the Animas River, there is a high probability that many of them do since all the major population centers in this part of Colorado and New Mexico lie directly along the river’s path.

The donor pool of counties from which the synthetic control will be constructed consists of all other counties in Colorado and New Mexico, not including the three treated counties listed above. Specifically, the donor pool consists of 94 counties with a combined 2010 US Census population of 6,906,298. By comparison, the 2010 combined population of the three treated counties is 182,077. Health impacts of the mine accident in these counties are expected to be negligible given that the Animas River does not extend to these counties, though, intrastate travel effects cannot be completely ruled out. However, as part of my specification checks later in the paper, alternative definitions of the donor pool are explored, including more limited geographic and population-based specifications.

My identifying assumption is that water contamination from the Gold King Mine spill only impacted mothers living downstream from the mine and not mothers in other counties in Colorado and New Mexico. Additionally, I also assume that any observed changes to infant health immediately after the water quality shock can be directly attributable to the mine disaster and are not due to changes in other factors or circumstances, unrelated to the disaster event. In the results section, I provide evidence supporting this assumption. Furthermore, as previously mentioned, given the short time period that is investigated in this work, it seems unlikely that any other predictors of infant health would have significantly changed at the same time that the spill occurred. I’m looking no more than 16 months after the spill event, and, as will be shown later, peak impacts occur within the first three months after the spill, thus virtually eliminating any non-spill-related environmental confounding concerns.

The analysis period is 2011–2016, inclusive, and data are averaged at the county-year level for use with the synthetic control method. Table 1 presents the summary statistics. 7.8% of births are of low weight or are premature in the treated counties, while the comparable figures for low birth weight and prematurity in the donor pool counties are 9.4% of 11.2%, respectively. Birth weight and gestation length are very similar between treated and donor pool counties, but there are notable differences between some of the socio-demographic characteristics.

While Table 1 only presents county-level averaged data, I note that later in the paper I also take advantage of the individual birth data available to me. Additionally, in the extended analysis, I also explore models using a monthly, rather than annual, timestep. In total, three different units of analysis are investigated in the paper: county-year (baseline), county-month, and individual birth-year. Given the rich data available to me, in addition to the short-term nature of the health impacts I’m seeking to capture, it is necessary to use more than one data aggregation type.

Synthetic Control Method

To estimate the impact of the spill on infant health, the counterfactual path of birth outcomes in the downstream counties needs to be credibly identified. An increasingly preferred approach for constructing counterfactuals in causal inference settings is the synthetic control method or SCM of [16,17]. There are several advantages of the SCM over more traditional difference-in-differences (DID) approaches: (i) SCM is data driven, meaning that the subjectivity inherent in arbitrarily selecting a counterfactual is removed from the hands of the researcher; (ii) SCM moves away from using a single counterfactual unit or a simple average of counterfactual units, and instead uses a weighted average of units, and; (iii) SCM controls for time-varying unobserved confounders, thus helping to eliminate endogeneity resulting from omitted variables [47].

In this particular case, the SCM will be used to develop a synthetic control for the three downstream treated counties where water quality shocks occurred due to the Gold King Mine spill. The synthetic control will consist of a weighted combination of the 94 donor pool counties in Colorado and New Mexico that did not concurrently experience water quality shocks. Importantly, weights will be selected by the SCM such that the weighted synthetic control resembles the treated counties in terms of pre-spill observable infant health outcomes and relevant socio-demographic and weather characteristics.

Let J be the number of available donor pool counties and define the J × 1 weighting vector W = (w1,…,wJ)' such that Σ j=1 j w j =1 and wj0 for j = (1,…,J). Each scalar wj represents the nonnegative weight placed on the jth county in the synthetic control. Let H0 be a K × 1 vector of K birth outcomes and observable covariates for the downstream treated counties prior to the mine waste water spill. Let H1 be a K × J matrix of comparable data vectors for each of the J counties in the donor pool. Following [16], the vector of weights W is chosen to minimize,

W * =arg min W ( H 0 H 1 W)V( H 0 H 1 W) s.t. w j 0, j=1 J w j =1forj=(1,...,j) (1)

where V is a positive-definite matrix whose diagonal elements reflect the relative importance of the variables in H0 and H1 ([48] also suggest an alternative approach where the synthetic control is constructed based solely on pre-treatment trends in the dependent variable of interest, thus ignoring all observable covariates. The results from this alternative approach are very similar to the results when the covariates are included, consistent with the findings in [48].). Variables in H0 and H1 include pre-disaster infant health outcomes, mother’s age, Hispanic mother, African American mother, mother’s educational attainment, mother is a high school dropout, mother smoked during pregnancy, teenage mother, birth order, whether the birth was a multiple birth, sex of the child, and several county-level socio-demographic and weather indicators: poverty rate, median household income, unemployment rate, college graduation rate, percent African American, percent Hispanic, annual average precipitation, five daily mean temperature bins, three daily wind direction bins, and three daily wind speed bins. Minimizations are performed separately for each birth outcome (low birth weight, prematurity, birth weight, and gestation length) and the resulting weighting vectors are provided in Table S1 (Weights are estimated separately for each birth outcome since the common trends assumption must be met for each dependent variable investigated.).

To compare infant health outcomes between the downstream treated counties and the synthetic control, I follow [48] and use the following estimator,

SC M effect =(Healt h post treated Healt h post syntnetic )(Healt h pre treated Healt h pre syntnetic ) (2)

where Healthpost (Healthpre) is the average infant health outcome after (before) the mine waste water accident.

Characterization of uncertainty around SCMeffect is based on the results from placebo or falsification tests, wherein each donor pool county is (wrongly) assumed to have actually experienced treatment and the SCM is used to determine the magnitude of the placebo treatment effect [17]. This process is repeated for every donor pool county, thereby creating a distribution of placebo effect estimates: SCMplacebo. A two-sided p-value can be obtained from this process, following [49],

pvalue=pr(|SC M placebo ||SC M effect |)= Σ1(|SC M placebo,j ||SC M effect |) J (3)

where j is the jth placebo county SCM estimate for j = (1, ..., J). Since placebo effect estimates in cases where the pre-treatment fit is poor (as defined by the root mean squared predictor error, RMSPE) do not provide credible information to measure the relative rarity of estimating a large post-treatment effect for a unit that was well-fitted prior to treatment [17], I weight each SCMplacebo,j by its pre-treatment RMSPE. This will give more (less) weight to placebo units with good (poor) pre-treatment fit with the downstream treated counties (Effectively, I’m downweighing SCM placebo estimates that have large pre-treatment RMSPE (meaning the SCM method didn’t find a good synthetic control for these units during the pre-treatment period). This means that the p-value I calculate is based on placebo results from units with good pre-treatment fit (as defined by a low RMSPE), which allows for a more credible calculation of uncertainty since one is comparing good pre-treatment fit units from the placebo to a good pre-treatment fit from the treated unit.). In the robustness checks later in the paper, unweighted results are also reported.

Traditional Difference-In-Differences Approach

The SCM uses county-level data to determine the infant health effect estimates. However, given the individual birth data available in the NCHS dataset, I would also like to employ a secondary approach that allows me to take advantage of this rich data. To do so, a more traditional DID approach is used, but with the caveat that I still use the previously obtained synthetic control weights. Specifically, the county-level weights generated by the SCM are used to re-weight the contribution of each individual NCHS observation in a given county by the SCM-derived county weight. In other words, individual observations are weighted by their county-level SCM weight in an attempt to construct a credible control group. This follows similar approaches in [48,50] and means that I can combine some of the advantages of the SCM (in terms of obtaining similar pre-treatment trends) with the advantages afforded by the traditional DID regression approach using individual birth observations.

The empirical model takes the following form,

Healt h ict = β 0 + β 1 DownstreamAfte r ct + X it β 4 + M ct β 5 + W ct β 6 +Yea r t +Count y c + ε i ct (4)

where Healthict is the infant health outcome for mother i living in county c at year t, DownstreamAfterct is an indicator variable for downstream county treatment status, X it is a vector of mother and birth characteristics, M ct is a vector of county socio-demographic characteristics, W ct is a vector of county weather characteristics, Yeart is the year fixed effect, Countyc is the county fixed effect, and εict is the idiosyncratic error term. The same set of mother, socio-demographic, and weather variables are used here as in the SCM (Later in the paper, I also explore a monthly version of Equation (4), where month, in addition to year, fixed effects are included.). The DID estimator is β1, which provides an estimate of the changes in infant health outcomes in downstream treated counties relative to similar nearby counties that did not experience a concurrent change in river water quality (I also explored a version of the model using state fixed effects (Colorado and New Mexico), but the results were little changed as a result.). Standard errors are clustered by county.

RESULTS

Internal Validity Test

I begin by presenting evidence showing no significant changes in mother and infant characteristics, socio-demographics, or weather outcomes concurrent with the mine waste water disaster. This internal validity test was performed by using the previously obtained synthetic control weights for low birth weight to calculate SCMeffect, separately, for each observable covariate (Findings are similar to those reported here if the synthetic weights for the other infant health outcomes were used instead of the weights for low birth weight (e.g., prematurity, gestation, etc.).). In other words, I’m using the same synthetic control to calculate how observable covariates across treatment and control groups changed from before and after the Gold King Mine spill. As shown in Table 2, none of the SCMeffect estimates are statistically significant, indicating that changes in the composition of mothers or changes in county socio-demographic or weather characteristics are not driving any mine-related infant health outcomes that might be found below.

TABLE 2
Table 2. Internal Validity Test (Synthetic Control Results).
SCM Results

Results of using the SCM to estimate changes in low birth weight and prematurity are presented in Figure 3. Shown are the standardized gaps in annual outcomes between the treated and synthetic control groups (black lines) and similar gaps for each placebo effect (gray lines). The near zero gaps during the years prior to the mine spill indicate nearly identical pre-treatment levels and trends in low birth weight and prematurity—the necessary common trends assumption. In the year of the spill, gaps in the instances of low birth weight and prematurity spike, by 0.8 and 0.9 percentage points, respectively (each from a base of 7.8%), suggesting that there was an increase in adverse infant health outcomes in the downstream counties that were not simultaneously observed in the synthetic control. Low birth weight and prematurity effects linger into the year after the spill, though are diminished in magnitude.

FIGURE 3
Figure 3. Standardized Gaps in Infant Health Outcomes between Downstream Counties and the Synthetic Control. Notes: plotted are the standardized gaps in infant health outcomes between downstream counties and the synthetic control (black line) and the standardized gaps between each county in the donor pool and its unique synthetic control for 94 placebo permutations (gray lines) for low birth weight (A) and prematurity (B). Gaps standardized by pre-treatment RMSPE. Different synthetic controls used in each panel. Gaps are defined as treated minus synthetic control.

Visually, the treatment gap is a relative outlier compared to the placebo effect gaps. For the year of the mine spill, this is clearly the case for both low birth weight and prematurity; both black lines lie above all the placebo gaps. Additionally, in the year after the spill, the downstream gap is still mostly an outlier for prematurity, but begins to converge towards the placebo gaps for low birth weight. These results should be interpreted as providing evidence that the magnitude of increases in instances of low birth weight and prematurity cannot be replicated by applying the SCM to counties that are not located downstream from the Gold King Mine spill.

The same analysis was conducted for the intensive margin infant health outcomes birth weight and gestation length, with results provided in Figure 4. During the year of the spill, birth weight is lower by 130 g (from a base of 3185 g) and gestation length is lower by 0.22 weeks (from a base of 38.99 weeks). The evidence is mixed for the year after the spill since the gaps begin reverting back to their pre-disaster levels. This suggests that the mine disaster, at least temporarily, resulted in lower weight infants and in infants born earlier than those in the synthetic control group that otherwise had nearly identical pre-treatment levels and trends along the intensive margin.

To numerically present the prior graphical results, I calculated SCMeffect using Equation (2) and the corresponding p-values for each infant outcome (Table 3). Panel A (panel B) reports the extensive margin (intensive margin) results. Along the extensive margin, the Gold King Mine spill is associated with a 0.6 percentage point average two-year increase in instances of low birth weight (p = 0.01) and a 0.71 percentage point average two-year increase in instances of prematurity (p = 0.00) in the downstream Animas River counties relative to the synthetic control. These represent percentage increases of 7.7% (low birth weight) and 9.1% (prematurity), relative to the mean. Intensive margin outcomes birth weight and gestation length are, on average, lower by 98.31 g (p = 0.05) and 0.25 weeks (p = 0.09) in downstream counties in the year of and year after the mine disaster. That is, relative to the mean, birth weight and gestation length are lower by 3.1% and 0.64%, respectively, over the two-year post-spill period (As a specification check, I additionally estimated a version of Table 3 that used a single synthetic control (with one set of weights) for all four infant health outcomes, rather than separate controls for each outcome. While doing so led to some violations of the common trends assumption, especially for the intensive margin outcomes, the results continue to show significant adverse infant health outcomes by using this alternative specification, though the magnitudes of effects were different from those presented here.).

FIGURE 4
Figure 4. Standardized Gaps in Birth Weight and Gestation Outcomes between Downstream Counties and the Synthetic Control (Intensive Margin Infant Health Outcomes). Notes: plotted are the standardized gaps in intensive margin birth outcomes between downstream counties and the synthetic control (black line) and the standardized gaps between each county in the donor pool and its unique synthetic control (gray lines) for birth weight (A) and gestation length (B). Gaps standardized by pre-treatment RMSPE.
TABLE 3
Table 3. Impact of the Gold King Mine Spill on Infant Health (Synthetic Control Results).
DID Results

The results up until now have used county-level averaged data. Here, I use individual birth data to estimate the traditional DID model in Equation (4) using the synthetic control weights, as previously described. Results of these regressions are presented in Table 4. Note first that the sample sizes have increased tremendously; from 564 in the SCM results to over 490,000 now. Consistent with the earlier story, I continue to observe adverse impacts of the mine water disaster on both extensive and intensive margin infant health outcomes. DID coefficients suggest that instances of low birth weight are higher by 0.64 percentage points (p < 0.01) and prematurity cases are higher by 0.69 percentage points (p < 0.01). Birth weight is lower by 101.49 g (p < 0.05) and gestation length is lower by 0.251 weeks (p < 0.10), though with only marginal significance. These effect estimates are very similar in magnitude and significance to what was observed in the SCM results, suggesting that results are robust to: (i) different empirical models (DID vs. SCM), and; (ii) using either county-level averaged data or individual birth data.

TABLE 4
Table 4. Traditional Regression Difference-in-Differences Results.
Monthly Analysis Results

Perhaps the most convincing evidence to support an acute infant health impact story can be provided by using higher frequency monthly data, rather than the annual data used up until now. I again employ the SCM to construct synthetic counterfactuals for the downstream counties for low birth weight and prematurity, using the same donor pool and the same set of observable characteristics as before. However, a monthly calendar timestep is now employed. Figure 5 presents the results and associated p-values of this analysis. Gaps between the treated and synthetic control counties spike in the first three months immediately following the mine spill. After the immediate post-spill spikes, there is a gradual reversion to pre-spill outcomes, and by 12 calendar months after the disaster, both the low birth weight and prematurity gaps are near zero (and insignificant), indicating no difference between the downstream and synthetic control counties. At peak impact, instances of low birth weight are elevated by 1.1 percentage points, or 14.1% above the mean (two months after the spill) and instances of prematurity are elevated by 1.2 percentage points, or 15.4% above the mean (also two months after the spill). Placebo test p-values indicate that the observed post-spill outcomes are significantly different from zero.

The overall shape and timing of the effect curves are strongly suggestive of an acute health impact story consistent with a short-term change in environmental exposures. Clearly, an abrupt change in infant health outcomes occurred at the same time as the Gold King Mine spill that did not also occur in the donor pool counties. These results provide perhaps the strongest evidence yet to support a causal story.

FIGURE 5
Figure 5. Monthly Impacts of the Mine Water Disaster on Extensive Margin Infant Health Outcomes (Synthetic Control Results). Notes: monthly impacts of the mine waste water disaster on gaps in extensive margin infant health outcomes shown (black lines) along with placebo test gaps (gray lines). p-value from donor pool placebo tests. Synthetic control constructed by matching treated and donor pool counties on pre-treatment infant health outcomes, mother and infant characteristics, county socio-demographic characteristics, and county weather characteristics, as before.
Alternative Donor Pool Specification Results

I also performed many robustness checks to determine if the main specifications are robust to different constructions of the donor pool. Results from these alternative specifications are provided in Table 5. In panel A, the donor pool has been restricted to those counties immediately adjacent to the three downstream counties. Panel B restricts the donor pool to counties with a population centroid within 100 miles (161 km) of the population centroids of the downstream counties. In panels C and D, rather than constructing the donor pool based on geographic distance, I instead use a population-based metric. In panel C, the top 10 highest population counties in Colorado and New Mexico are included in the donor pool, whereas in panel D, the top 10 lowest population counties in these states constitute the donor pool. Finally, panel E drops from the donor pool all counties adjacent to the downstream counties (in case there were spillover effects to these counties).

TABLE 5
Table 5. Alternative Donor Pool Specifications (Synthetic Control Results).

With very few exceptions, the SCMMeffect estimates remain statistically significant across all five specifications and also have similar magnitudes both to each other and to the baseline results previously presented. Two notable exceptions are for gestation length in panel C (the highest population specification) and again in panel D (the lowest population specification). In these cases, the SCM results are no longer statistically significant, though it is important to note that the gestation length results were only marginally significant before, thus, the change in significance observed here is not especially meaningful. Despite these two isolated cases, I conclude from this specification check that the main findings are generalizable across alternative constructions of the donor pool, suggesting that the results are not being driven by the original donor pool choice.

Falsification Test Results

Two falsification tests are performed on the baseline results. For the first test, I show no effect of the spill on outcomes measured before the spill actually occurred. Specifically, the SCM was again employed (using the same set of matching covariates as before), but now using the years 2011 and 2012 as the pre-treatment period and the years 2013 and 2014 as the treatment period. Recall that the actual mine disaster occurred in August 2015. The results from this test are presented in Figure S1. In short, none of the infant health outcome gaps are significantly different from zero at conventional levels and all the gaps are fairly close to zero. Hence, as expected, there is no impact of the mine spill prior to when it actually occurred, consistent with a causal story.

For the second falsification test, I show no impact of the mine disaster on birth outcomes that are known to be unaffected by temporary environmental shocks: instances of large for gestational age (LGA) births and instances of postterm pregnancy. LGA describes infants who weigh more than the 90th percentile of babies of the same gestational age. In the US, this commonly translates into weighing ≥4000 g at birth [51]. A postterm pregnancy is a pregnancy that extends to 42 weeks of gestation or beyond. Prior literature has suggested that genetic factors and maternal diabetes are the most likely causes of both of these birth outcomes [51,52]. As expected, I show in Figure S2 that there is indeed no impact of the Gold King Mine spill on these birth outcomes, thereby further boosting the causal interpretation of the earlier findings.

Additional Robustness Check Results

Several additional robustness checks were performed on the types of births included in the analysis and the variables used in the SCM weighting process. These results are reported in Table S2 and are briefly summarized here. First, I limit the sample to singleton births (i.e., the birth of only one child during a single delivery) rather than both multiple and singleton births as was previously done. This check is performed because multiple births are more likely to have poor health outcomes independent of exposures to environmental pollution [18]. Second, I add controls for health insurance status (Medicaid, private insurance, and self-pay) and an indicator for receipt of WIC (Women, Infants, and Children) to the set of variables included in the SCM weighting process. Third, I do not weight the placebo effect estimates by their pre-treatment RMSPE as was done in the main specifications. Overall, as reported in Table S2, I find that the main results are largely robust to these checks.

DISCUSSION

This work provides new causal evidence that temporary shocks to water quality caused by a major mining disaster can adversely affect infant health outcomes in counties located immediately downstream from the spill site. I provide evidence that concentrations of heavy metals and toxic compounds such as arsenic, lead, and mercury were above US EPA water quality guidelines after the spill in the downstream counties investigated, making them the most probable exposure pathways for the effects observed. However, additional data and analysis would be needed to fully investigate the actual exposure mechanisms, which is outside the scope of this paper. Several possible mechanisms could stem from the fact that the Animas River is a major source of drinking water in the counties investigated, in addition to being a significant source of recreational opportunities and economic livelihoods in Colorado and New Mexico.

The main finding in this work is that instances of low birth weight and prematurity were elevated by an average of 0.6 and 0.71 percentage points, respectively, in the two years after the August 2015 Gold King Mine spill, or, by 7.7% and 9.1% relative to the mean. Based on the empirical relationship reported in [53], the observed percentage increase in low birth weight is approximately equivalent to what would be expected from a mother smoking 4.7 cigarettes per day for the duration of her pregnancy, or, alternatively, being exposed to an additional 0.14 parts per million (ppm) of carbon monoxide (CO) during the last trimester of pregnancy. For comparison, US EPA AirData reports that average CO concentrations near Durango, CO (located in our treatment area) were 0.3 ppm in 2015. Thus, the 0.14 ppm increase required to obtain similar low birth weight outcomes as caused by the Gold King Mine spill is significant by comparison. Hence, this reinforces the significance of the low birth weight finding, as well as the importance of accounting for infant health externalities when assessing the damages of mine waste water disasters.

However, while the results are significant, they are not unreasonably large when compared with other investigations of infant health impacts associated with pollution exposures. For instance, [54] found that the closure of a coal-fired power plant in New Jersey led to a 15% change in low birth weight and a 28% change in prematurity among downwind mothers. Similarly, [55] found that reduced vehicle emissions surrounding toll plazas in the northeastern US improved low birth weight by 10.8% and prematurity by 11.8% among nearby mothers. By contrast, the 7.7% (low birth weight) and 9.1% (prematurity) average two-year findings, and the peak impact findings of 14.1% (low birth weight) and 15.1% (prematurity) within the first two months after the spill are slightly smaller by comparison, yet are still meaningful.

CONCLUSIONS

Given the legacy of abandoned underground mines both in the US and globally, the potential for future mine waste water disasters is likely. The US House Committee on Natural Resources estimates that there are approximately 500,000 abandoned hard rock mines in the US as of 2016, and in Colorado alone, the site of the Gold King Mine disaster, there are an estimated 23,000 abandoned mines. Since the 1970s, there have been a total of 32 uncontrolled releases of mine waste water in the US, including in the states of California, Colorado, Kentucky, Montana, New Jersey, Pennsylvania, Virginia, and West Virginia [32]. As noted by the US Bureau of Reclamation, in their incident report on the Gold King Mine disaster, there are significant risks that future mine waste water disasters similar to the Gold King Mine incident will occur in the near future due to the sheer number of similar abandoned mines in the US, the lack of sufficient resources for proper long-term maintenance and study, and due to inconsistencies in abandoned mine management from agency to agency, which shows “little appreciation for the engineering complexity” of abandoned mine reclamation projects [32]. Thus, the results of this study are broadly informative of the potential short-term infant health risks associated with abandoned mine water releases across the US. Understanding the acute health impacts of these environmental disaster events is therefore necessary and can aid in the management of scarce reclamation budgets along sustainability dimensions.

Several potential policy recommendations emerge from this work. First, urgent Federal and state funding is needed to clean-up and sustainably manage abandoned and legacy metal mines in the US. Second, considerations of abandoned mines’ impacts (i.e., in clean-up benefit-cost analyses) needs to extend to downstream communities in instances of close proximity of mines to waterbodies. As this work shows, downstream communities far from abandoned mines can be affected by mine waste water disasters. Finally, issues of liability and responsibility in instances of mine waste water disasters needs to be formally established in law in order to properly compensate affected individuals and communities, such as for negative health impacts.

More broadly, this work illustrates that temporary water quality shocks, even in a developed country, can have meaningful impacts on human health. To the extent that people continue to migrate to abandoned industrial and mining communities, or toward cities located downstream from former industrial mining operations, then this work, and others like it, will continue to rise in importance and significance. This is especially true in cases where infant health outcomes are impacted, given their important role as determinants of long-term human capital and labor market outcomes.

SUPPLEMENTARY MATERIALS

The following supplementary materials are available online, Figure S1: Pre-Mine Disaster Falsification Test (Synthetic Control Results), Figure S2: Falsification Tests on Alternative Birth Outcomes Not Known to Be Impacted by Acute Environmental Exposures (Synthetic Control Results), Table S1: Synthetic Control Weights by Birth Outcome, Table S2: Additional Robustness Checks (Synthetic Control Results).

DATA AVAILABILITY

The dataset from the study is not available because it is restricted use under a data use agreement.

CONFLICTS OF INTEREST

The author declares that they have no conflicts of interest.

FUNDING

No funding was received for this work.

REFERENCES

1.

2.

3.

4.

5.

6.

7.

8.

9.

10.

11.

12.

13.

14.

15.

16.

17.

18.

19.

20.

21.

22.

23.

24.

25.

26.

27.

28.

29.

30.

31.

32.

33.

34.

35.

36.

37.

38.

39.

40.

41.

42.

43.

44.

45.

46.

47.

48.

49.

50.

51.

52.

53.

54.

55.

How to Cite This Article

Jones BA. Lookout Below! The Downstream Infant Health Impacts of Mining Disasters. J Sustain Res. 2026;8(2):e260035. https://doi.org/10.20900/jsr20260035

Copyright © Hapres Co., Ltd. Privacy Policy | Terms and Conditions