Oil Spills Impacting US Waterways

Data

The data came from the Bureau of Transportation Statistics. Only the pollution incidents where the Coast Guard investigated as the lead agency are included in this dataset. Data on spills where the Environmental Protection Agency or any of the state authorities are the lead agency is not included. These are mostly offshore spills as the EPA usually handles inshore spills. These statistics cover yearly gallons spilled, number of incidents, and source type from 1985, 1990, 1995-2020.

Vessel, Non-Vessel, and Mystery Sources

Gallons Spilled

oilSpills %>% filter(Source == "Vessel sources, total" | 
                     Source == "Nonvessel sources, total"|
                     Source == "Mysteryc") %>%
  plot_ly(x = ~Year, y =~`Gallons spilled`, color = ~Source, type="bar", colors = 
            c("#a6cee3", "#1f78b4", "#b2df8a"),
           width = 1000) %>%
  layout(barmode = 'dodge', yaxis = list(fixedrange = FALSE))

“On April 20, 2010, the oil drilling rig Deepwater Horizon, operating in the Macondo Prospect in the Gulf of Mexico, exploded and sank resulting in the death of 11 workers on the Deepwater Horizon and the largest spill of oil in the history of marine oil drilling operations.”

Incidents

oilSpills %>% filter(Source == "Vessel sources, total"| 
                     Source == "Nonvessel sources, total"|
                     Source == "Mysteryc")  %>%
  drop_na() %>%
  plot_ly(width = 1000) %>%
  add_trace(x = ~Year, y = ~Incidents, type = 'scatter', color =~Source, mode = 'lines+markers', 
            colors = c("#a6cee3", "#1f78b4", "#b2df8a"))

Source, Incidents, and Gallons Spilled

y2 <- list(
  overlaying = "y",
  side = "right",
  title = "Number of Incidents",
  automargin = T)

oilSpills %>% filter(Source != "TOTAL all spills", 
                     Source != "Vessel sources, total", 
                     Source != "Nonvessel sources, total")  %>%
  drop_na() %>%
  mutate(Source = fct_reorder(Source, desc(`Gallons spilled`))) %>%
  plot_ly(x = ~Year, y =~`Gallons spilled`, color = ~Source, type="bar", colors = "Accent",
          width = 1000, height = 500) %>%
  group_by(Year) %>%
  summarise(n = sum(Incidents)) %>%
  add_trace(x = ~Year, y = ~n, type = 'scatter',  mode = 'markers', 
            name = "Incidents", yaxis = "y2", color = I("black")) %>%
  layout(barmode = 'stack', yaxis2 = y2,
         yaxis = list(fixedrange = FALSE))

References

Reflections

Wilke: stacked vs grouped bar charts and the implications of connecting dots with lines
wrangling - double header excel data (code)
Improvements: double y-axis? tree-map or ribbon plot, more user interactivity - button that switches from stacked bar plot to grouped bar plot