what is
BIG DATA!?
presented by:
Faranak Sadeghi
Hamid Salehian
How big is BIG?

- byte: one grain of rice
- kilobyte: cup of rice
- megabyte: 8 bags of rice
- gigabyte: 3 semi trucks
- terabyte: 2 container ships
- petabyte: blankets Manhattan
- exabyte: blankets the West Coast states
- zettabyte: fills the Pacific Ocean
- yottabyte: EARTH-SIZED RICE BALL
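
The ladder of units above can be sketched in a few lines of Python. This is a minimal illustration, assuming decimal (SI) prefixes where each step is a factor of 1000; the binary convention (factors of 1024) would give slightly larger numbers. The names `UNITS`, `bytes_in`, and `humanize` are made up for this example.

```python
# Decimal (SI) byte units: each step up the ladder is a factor of 1000.
# Assumption: SI prefixes, not the binary (1024-based) convention.
UNITS = ["byte", "kilobyte", "megabyte", "gigabyte", "terabyte",
         "petabyte", "exabyte", "zettabyte", "yottabyte"]


def bytes_in(unit: str) -> int:
    """Return the number of bytes in one `unit`."""
    return 1000 ** UNITS.index(unit)


def humanize(n_bytes: float) -> str:
    """Express a byte count in the largest unit that keeps the value below 1000."""
    value = float(n_bytes)
    unit = UNITS[0]
    for unit in UNITS:
        if value < 1000 or unit == UNITS[-1]:
            break
        value /= 1000
    suffix = "" if value == 1 else "s"
    return f"{value:g} {unit}{suffix}"


# Print the whole ladder as powers of ten.
for name in UNITS:
    print(f"1 {name} = 10^{3 * UNITS.index(name)} bytes")

print(humanize(5 * bytes_in("exabyte")))
```

Going from a byte to a yottabyte is eight steps of 1000, which is a factor of 10^24.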

so...where is
Big Data??
| Unit | Rice equivalent |
| --- | --- |
| byte | one grain of rice |
| kilobyte | cup of rice |
| megabyte | 8 bags of rice |
| gigabyte | 3 semi trucks |
| terabyte | 2 container ships |
| petabyte | blankets Manhattan |
| exabyte | blankets the West Coast states |
| zettabyte | fills the Pacific Ocean |
| yottabyte | EARTH-SIZED RICE BALL |
The Future
Internet
Big Data
Desktop
Hobbyist
....let's see what is
BIG DATA?
“Big Data in general is defined as high volume, velocity and variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making.”
-- Forrester
“Big data is data that exceeds the processing capacity of conventional database systems. The data is too big, moves too fast, or doesn't fit the strictures of your database architectures. To gain value from this data, you must choose an alternative way to process it.”
-- O’Reilly
let's talk about the 3 V's of big data
Volume
I have too much data

"From the dawn of civilization until 2003, humankind generated five exabytes of data. Now we produce five exabytes every two days…and the pace is accelerating."
-- Eric Schmidt, Executive Chairman, Google
- It is expected that by 2020 the amount of digital information in existence will have grown from 3.2 zettabytes today to 40 zettabytes.
Twitter generates approximately 12 TB of data per day
Facebook stores, accesses, and analyzes 30+ petabytes of user-generated data

LHC - Large Hadron Collider
The world's biggest machine
Generated 30 petabytes in 2012; more than 100 PB in total!

Airbus A380
Each engine generates 10 TB every 30 min
640 TB per flight
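
The Airbus A380 figures above can be cross-checked with a little arithmetic. The sketch below is only a sanity check: it assumes four engines (the slide does not state the engine count) and that every engine produces data at the quoted rate for the whole flight. Under those assumptions, 640 TB corresponds to a flight of about eight hours.

```python
# Figures quoted on the slide.
TB_PER_ENGINE_PER_HALF_HOUR = 10
TB_PER_FLIGHT = 640

# Assumption (not on the slide): the A380 has four engines.
ENGINES = 4

# Two half-hour periods per hour, across all engines.
tb_per_hour = TB_PER_ENGINE_PER_HALF_HOUR * 2 * ENGINES

# Flight duration implied by the per-flight total.
implied_flight_hours = TB_PER_FLIGHT / tb_per_hour

print(f"Data rate: {tb_per_hour} TB per hour")
print(f"Implied flight length: {implied_flight_hours} hours")
```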
Velocity

It's coming at me too fast
let's see...
what happens on the Internet in 60 seconds
Variety
It's coming at me from too many places in too many formats

- Relational data (tables, transactions, legacy data)
- Text data (web)
- Semi-structured data (XML)
- Graph data: social networks, Semantic Web (RDF), …
- Streaming data: you can only scan the data once
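
The "scan the data once" constraint on streaming data can be illustrated with a single-pass computation. The sketch below uses Welford's online algorithm to keep a running mean and variance without storing the stream. It is one common technique chosen for illustration, not something the slides prescribe. The `sensor_readings` generator and its values are hypothetical.

```python
from typing import Iterable, Iterator, Tuple


def running_stats(stream: Iterable[float]) -> Tuple[int, float, float]:
    """Consume `stream` once and return (count, mean, population variance)."""
    count = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the current mean
    for x in stream:
        count += 1
        delta = x - mean
        mean += delta / count
        m2 += delta * (x - mean)
    variance = m2 / count if count else 0.0
    return count, mean, variance


def sensor_readings() -> Iterator[float]:
    """Hypothetical stream; a generator can be consumed only once."""
    for value in [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]:
        yield value


count, mean, variance = running_stats(sensor_readings())
print(count, mean, variance)
```

Memory use stays constant no matter how long the stream runs, which is the point when the data cannot be stored and rescanned.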
The boom of the Internet of Things will mean that the number of devices connected to the Internet will rise from about 13 billion today to 50 billion by 2020.
12 million RFID tags (used to capture data and track the movement of objects in the physical world) had been sold by 2011. By 2021, it is estimated that number will have risen to 209 billion as the Internet of Things takes off.
HP Labs estimates that by 2030, 1 trillion sensors will be in use.
Plain text, photos, audio, video, web, GPS data, sensor data, relational databases, documents, SMS, PDF, Flash, etc.

usage of big data
Prediction for the 2012 US election
Nate Silver’s FiveThirtyEight blog
- Predicted Obama had an 86% chance of winning
- Predicted all 50 states correctly
- predictive modeling
- mybarackobama.com
- drive traffic to other campaign sites: Facebook page (33 million "likes"), YouTube channel (240,000 subscribers and 246 million page views)
- a contest to dine with Sarah Jessica Parker
- Every single night, the team ran 66,000 computer simulations, Reddit!!!
- Amazon Web Services
Moneyball: The Art of Winning an Unfair Game

Oakland Athletics baseball team and its general manager Billy Beane
- The Oakland A's front office took advantage of more analytical gauges of player performance to field a team that could compete successfully against richer competitors in MLB
- Oakland had approximately $41 million in salary; the New York Yankees had $125 million in payroll that same season. Oakland was forced to find players undervalued by the market
- Moneyball had a huge impact on other teams in MLB
And there is a Moneyball movie!!!!!
Election 2016
CONCLUSION
Data is the new Oil. Data is just like crude. It’s valuable, but if unrefined it cannot really be used.
– Clive Humby, DunnHumby
Questions!?
