what is

BIG DATA!?

presented by:

Faranak Sadeghi

Hamid Salehian

How big is BIG?

  • byte:          one grain of rice
  • kilobyte:    cup of rice
  • megabyte: 8 bags of rice
  • gigabyte:   3 semi trucks
  • terabyte:    2 container ships
  • petabyte:   blankets Manhattan
  • exabyte:    blankets the West Coast states
  • zettabyte:  fill the Pacific Ocean
  • yottabyte:  EARTH SIZE RICE BALL
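
To put numbers behind the rice analogy, here is a minimal Python sketch. It assumes decimal (SI) prefixes, where each unit is 1000 times the previous one; the names `UNITS` and `bytes_in` are illustrative and not from the slides.

```python
# Decimal (SI) storage units, smallest to largest.
# Assumption: each step up is a factor of 1000 (not 1024).
UNITS = ["byte", "kilobyte", "megabyte", "gigabyte", "terabyte",
         "petabyte", "exabyte", "zettabyte", "yottabyte"]


def bytes_in(unit: str) -> int:
    """Return how many bytes one `unit` holds under the SI assumption."""
    return 1000 ** UNITS.index(unit)


# Print each unit as a power of ten.
for position, unit in enumerate(UNITS):
    print(f"{unit:>10}: 10^{3 * position} bytes")
```

Under this convention a petabyte is a million gigabytes, which is why the jump from "3 semi trucks" to "blankets Manhattan" is so steep.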

so...where is

Big Data??

byte one grain of rice
kilobyte cup of rice
megabyte 8 bags of rice
gigabyte 3 semi trucks
terabyte 2 container ships
petabyte blankets Manhattan
exabyte blankets the West Coast states
zettabyte fills the Pacific Ocean
yottabyte EARTH SIZE RICE BALL

The Future

Internet

Big Data

Desktop

Hobbyist

....let's see what is

BIG DATA?

“Big Data in general is defined as high volume, velocity and variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making.”

-- Forrester

“Big data is data that exceeds the processing capacity of conventional database systems. The data is too big, moves too fast, or doesn't fit the strictures of your database architectures. To gain value from this data, you must choose an alternative way to process it.”

-- O’Reilly

let's talk about the

3 V's of big data

Volume

I have too much data

"From the dawn of civilization until 2003, humankind generated five exabytes of data. Now we produce five exabytes every two days…and the pace is accelerating."

-- Eric Schmidt, Executive Chairman, Google

  • It is expected that by 2020 the amount of digital information in existence will have grown from 3.2 zettabytes today to 40 zettabytes.

Twitter generates approximately 12 TB of data per day

Facebook stores, accesses, and analyzes 30+ petabytes of user-generated data

LHC - Large Hadron Collider

The world's biggest machine

Generated 30 petabytes in 2012, > 100 PB in total!

Airbus A380

Each engine generates 10 TB every 30 min

640 TB per flight
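
As a quick consistency check on the A380 figures, here is a small Python sketch. It assumes all four of the A380's engines produce data at the quoted rate for the whole flight; the flight duration is derived from the slide's numbers, not stated in the slides, and the function name is illustrative.

```python
ENGINES = 4                      # assumption: all four A380 engines report data
TB_PER_ENGINE_PER_30_MIN = 10    # figure from the slide
TB_PER_FLIGHT = 640              # figure from the slide


def implied_flight_hours(tb_per_flight: float,
                         engines: int,
                         tb_per_half_hour: float) -> float:
    """Flight length (hours) at which the per-engine rate adds up to the total."""
    tb_per_hour = engines * tb_per_half_hour * 2  # two half-hours per hour
    return tb_per_flight / tb_per_hour


hours = implied_flight_hours(TB_PER_FLIGHT, ENGINES, TB_PER_ENGINE_PER_30_MIN)
print(f"Implied flight duration: {hours} hours")
```

Under these assumptions the two figures agree for a flight of about eight hours, which is plausible for a long-haul route.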

Velocity

It's coming at me too fast

let's see...

what happens on the Internet in 60 seconds

Variety

It's coming at me from too many places in too many formats

  • Relational data (tables, transactions, legacy data)
  • Text data (web)
  • Semi-structured data (XML)
  • Graph data: social networks, Semantic Web (RDF), …
  • Streaming data: you can only scan the data once
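
The "scan once" constraint on streaming data can be illustrated with a single-pass computation. The sketch below uses Welford's online update to track count, mean and variance without storing the stream; the `sensor_readings` generator and its values are hypothetical stand-ins for a real data source.

```python
from typing import Iterable, Iterator, Tuple


def running_stats(stream: Iterable[float]) -> Tuple[int, float, float]:
    """Return (count, mean, population variance) from a single pass.

    Uses Welford's online update, so no reading is kept after it is seen.
    """
    count, mean, m2 = 0, 0.0, 0.0
    for x in stream:
        count += 1
        delta = x - mean
        mean += delta / count
        m2 += delta * (x - mean)  # uses the already-updated mean
    variance = m2 / count if count else 0.0
    return count, mean, variance


def sensor_readings() -> Iterator[float]:
    """Hypothetical stream; a generator can be consumed only once."""
    yield from (2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0)


count, mean, variance = running_stats(sensor_readings())
print(count, mean, variance)
```

Memory use stays constant no matter how long the stream runs, which is the property that matters when the data cannot be replayed.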

 

The boom of the Internet of Things will mean that the number of devices connected to the Internet will rise from about 13 billion today to 50 billion by 2020.

12 million RFID tags (used to capture data and track the movement of objects in the physical world) had been sold by 2011. By 2021, it is estimated that number will have risen to 209 billion as the Internet of Things takes off.

HP Labs estimate that by 2030, 1 trillion sensors will be in use.

Pure text, photo, audio, video, web, GPS data, sensor data, relational databases, documents, SMS, PDF, Flash, etc.

usage of big data

prediction for the 2012 US election

Nate Silver’s FiveThirtyEight blog
Predicted Obama had an 86% chance of winning
Predicted all 50 states correctly

- predictive modeling
- mybarackobama.com
- drive traffic to other campaign sites:
  Facebook page (33 million "likes"),
  YouTube channel (240,000 subscribers and 246 million page views)
- a contest to dine with Sarah Jessica Parker
- every single night, the team ran 66,000 computer simulations
- Reddit!!!
- Amazon Web Services
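
The slides do not describe how the campaign's nightly simulations worked. As a generic illustration of what running many election simulations can look like, here is a minimal Monte Carlo sketch. Everything in it is hypothetical: the toy states, their electoral votes and win probabilities are made up, and states are treated as independent, which real forecasting models do not assume.

```python
import random

# Hypothetical toy map: state -> (electoral votes, candidate's win probability).
STATES = {
    "A": (10, 0.9),
    "B": (20, 0.5),
    "C": (15, 0.3),
    "D": (5, 0.7),
}
TOTAL_VOTES = sum(votes for votes, _ in STATES.values())


def simulate_once(rng: random.Random) -> int:
    """Draw one election outcome and return the candidate's electoral votes."""
    return sum(votes for votes, p in STATES.values() if rng.random() < p)


def win_probability(n_sims: int, seed: int = 0) -> float:
    """Estimate the chance of winning a strict majority of electoral votes."""
    rng = random.Random(seed)  # seeded so repeated runs give the same estimate
    wins = sum(simulate_once(rng) > TOTAL_VOTES / 2 for _ in range(n_sims))
    return wins / n_sims


estimate = win_probability(66_000)
print(f"Estimated win probability: {estimate:.3f}")
```

For this toy map the exact answer can be enumerated by hand (0.5595), so the simulation is only a demonstration; the approach becomes useful when the number of states and their correlations make enumeration impractical.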

Moneyball: The Art of Winning an Unfair Game

Oakland Athletics baseball team and its general manager Billy Beane

- The Oakland A's front office took advantage of more analytical gauges
of player performance to field a team that could compete
successfully against richer competitors in MLB

- Oakland had approximately $41 million in salary, while the
New York Yankees had $125 million in payroll that same season,
so Oakland was forced to find players undervalued by the market

- Moneyball had a huge impact on other teams in MLB

And there is a Moneyball movie!

Election 2016

CONCLUSION

     Data is the new Oil. Data is just like crude. It’s valuable, but if unrefined it cannot really be used. 

 

– Clive Humby, DunnHumby


 

Questions?
