Dataset context

The dataset came from (http://ergast.com/mrd/db/#csv). The data was compiled by Ergast Developer API, who provides a historical record of motor racing data for non-commercial purposes.

I loaded in 3 csv files with the raw datasets. Each row represents an entry. There are 1035 rows in “races,” 136 rows in status, and 24840 rows in results. The data that I use is the status, sum of statuses, and years. The statuses I look at are if the driver finished, had a mechanical failure, or an on-road accident.

Visualization

Reflection

What ideas/suggestions from Claus Wilke’s helped shape your visualization? Claus’ ideas about labeling axis/legends helped me choose to put my labels next to my lines instead of having an answer key. Also, Claus’ direction on when to use color helped my decisions with color. I chose not to distinguish the lines with different colors because the lines are already labeled; color would not be giving us any new information. I kept the blue color of the lines in order for them stand out when looking at the plot.

Is there anything more you wish you could do with this data? I think that it would be interesting to see what lap of the race has the most accidents/mechanical failures.

What were the most interesting or frustrating technical aspects of doing this? A key was not generated when I made the line plot. I thought it was odd that no labels for the lines were generated either. I had to do it manually.