Branden DuPont, Data Analyst
- Not just the first step; it is iterative
- Encompasses a truly breathtaking number of tasks
- Data viz and statistical modeling (e.g. hypothesis testing) get the most focus in teaching but represent 20% of the time
- Giving you a comprehensive overview of concepts/tools could easily fill 1-2 graduate courses.
- Tool agnostic
Technical Work:
Projects:
Source: xkcd
“[Of statistics/data analysis] Total disconnect between what people need to actually understand data and what was being taught.” - Hadley Wickham
1. Each variable forms a column.
2. Each observation forms a row.
3. Each type of observational unit forms a table.
Hadley Wickham - "Like families, tidy datasets are all alike but every messy dataset is messy in its own way."
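The three tidy-data rules listed above can be illustrated with a small reshaping sketch. The table, its column names (`country`, `year`, `cases`), and the helper `melt` are all hypothetical and made up for illustration; the point is only to show a "wide" table, where a variable (the year) is spread across column headers, being turned into one row per observation.

```python
# Hypothetical "messy" wide table: the year variable is hiding in the column names.
wide_rows = [
    {"country": "A", "2019": 10, "2020": 12},
    {"country": "B", "2019": 7, "2020": 9},
]


def melt(rows, id_key, var_name, value_name):
    """Reshape wide rows so each variable is a column and each observation is a row.

    Illustrative helper only; not taken from any library.
    """
    tidy = []
    for row in rows:
        for column, value in row.items():
            if column == id_key:
                continue  # the identifier is carried along, not melted
            tidy.append({id_key: row[id_key], var_name: column, value_name: value})
    return tidy


tidy_rows = melt(wide_rows, id_key="country", var_name="year", value_name="cases")
for r in tidy_rows:
    print(r)
```

In the tidy result every row is one country-year observation with the same three columns, which is the shape most plotting and modeling tools expect. In practice a library function such as `pandas.melt` does the same reshaping; the hand-written version is used here only to keep the sketch dependency-free.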
(Besides the feeling of superiority it gives you.)
You only need one: "Python as Super Glue for the Modern Scientific Workflow" Data analysis requires you to port data to and from different programs and tools. Python can (almost) do it all.
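A minimal sketch of the "glue" idea, using only Python's standard library: data arrives as CSV text, is loaded into a SQL database for querying, and leaves as JSON for another tool. The sample data, the `scores` table, and the score threshold are hypothetical and exist only for illustration.

```python
import csv
import io
import json
import sqlite3

# Hypothetical CSV export from some other program.
csv_text = "name,score\nada,90\ngrace,85\n"

# 1. Read CSV into Python dictionaries.
rows = list(csv.DictReader(io.StringIO(csv_text)))

# 2. Load the rows into an in-memory SQLite database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE scores (name TEXT, score INTEGER)")
conn.executemany(
    "INSERT INTO scores VALUES (?, ?)",
    [(r["name"], int(r["score"])) for r in rows],
)

# 3. Query with SQL, then hand the result to the next tool as JSON.
top = conn.execute(
    "SELECT name, score FROM scores WHERE score >= 88 ORDER BY name"
).fetchall()
json_text = json.dumps([{"name": n, "score": s} for n, s in top])
conn.close()

print(json_text)
```

The same script could just as easily read a real file or write to a different format; the design point is that one language sits between the tools instead of manual copy and paste.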
Excel Hell: "A place of torment and misery caused by using Excel as your primary data manipulation tool."