The ability to take data—to be able to understand it, to process it, to extract value from it, to visualize it, to communicate it—that’s going to be a hugely important skill in the next decades - Hal Varian
2016
this is actually just the tip of the iceberg
Big data: The next frontier for innovation, competition, and productivity - McKinsey (May 2011)
Data Scientist: the sexiest job of the 21st century - Harvard Business Review (Oct 2012)
The number of vendors providing big data analytics will triple over the next three years - IDC (Dec 2013)
Get Familiar With Big Data Now—or Face 'Permanent Pink Slip' Demand Rises for Analytics Professionals, Data Scientists - The Wall Street Journal (April 2014)
Less Noise but More Money in Data Science - The New York Times (April 2015)
Students Flocking to Analytics & Data Science – A Good Thing? - Business 2 Community (Feb 2016)
The 25 Skills That Can Get You Hired in 2016 - Linkedin
Data Scientist: 25 Best jobs in America For 2016 - Glassdoor
https://www.kaggle.com/wiki/Software
Packages - cowsay
Graphs
Data Analysis
head(sales) cust_id sales_total num_of_orders gender 1 100001 800.64 3 F 2 100002 217.53 3 F 3 100003 74.58 2 M 4 100004 498.60 3 M 5 100005 723.11 4 F 6 100006 69.43 2 F summary(sales) cust_id sales_total num_of_orders gender Min. :100001 Min. : 30.02 Min. : 1.000 F:5035 1st Qu.:102501 1st Qu.: 80.29 1st Qu.: 2.000 M:4965 Median :105001 Median : 151.65 Median : 2.000 Mean :105001 Mean : 249.46 Mean : 2.428 3rd Qu.:107500 3rd Qu.: 295.50 3rd Qu.: 3.000 Max. :110000 Max. :7606.09 Max. :22.000
Shiny App (interactive Web Apps)
World Bank data Visualisation
library(wbstats) library(data.table) library(googleVis) myDT <- data.table( # Download World Bank data and turn into data.table wb(indicator = c("SP.POP.TOTL", "SP.DYN.LE00.IN", "SP.DYN.TFRT.IN"), mrv = 60) ) countries <- data.table(wbcountries()) # Download country mappings setkey(myDT, iso2c) # Set keys to join the data sets setkey(countries, iso2c) myDT <- countries[myDT][ ! region %in% "Aggregates"]# Add regions to the data set, but remove aggregates wDT <- reshape( # Reshape data into a wide format myDT[, list( country, region, date, value, indicator)], v.names = "value", idvar=c("date", "country", "region"), timevar="indicator", direction = "wide") wDT[, date := as.integer(date)] # Turn date, here year, from character into integer setnames(wDT, names(wDT), c("Country", "Region", "Year", "Population", "Fertility", "LifeExpectancy")) M <- gvisMotionChart(wDT, idvar = "Country", timevar = "Year", xvar = "LifeExpectancy", yvar = "Fertility", sizevar = "Population", colorvar = "Region") plot(M) # Ensure Flash player is available an enabled
Justice challa kondaiah knowledge centre
Justice challa kondaiah knowledge centre
Internship offers the opportunity to acquire a range of
for big data & data science career
For College Graduates Only
***The internship is unpaid***