Exploratory Data Analysis with

Maria Schreiber

Grace Hopper Program

April 20, 2017

 

Exploratory Data Analysis with

What is EDA/ Why R?

A basic R live code

Parts of 'big data' analysis

How is 'big data' analyzed?

  • Formulating questions/hypotheses
  • Data collection/identification
  • Data wrangling/munging
  • Exploratory data analysis
  • Formal modeling/prediction/analysis
  • Drawing and communicating conclusions

Exploratory Data Analysis

EDA happens between you and the data and isn't about proving anything to anyone else yet.    

 

 

- Cathy O'Neil

Here's where R comes in!

Wait, what is R?

  • Open source statistical programming language
  • Initially released in 1995, but is gaining in popularity
  • Interpreted language - can be used in the CLI, but there are also many delightful and free GUIs
  • Enhanced by user-written packages

Live Code!

Resources

Udacity EDA course/Data visualization course

R-bloggers - https://www.r-bloggers.com/

R Cookbook - http://www.cookbook-r.com/

Ggplot 2 - http://ggplot2.tidyverse.org/index.html

NYC Open Data - https://opendata.cityofnewyork.us/

 

Check out my upcoming blogpost on Medium!

https://medium.com/@meschreiber3

Thank you!

Exploratory Data Analysis with R

By mschreiber

Exploratory Data Analysis with R

The basics of R and big data analysis

  • 683