presented by:
Faranak Sadeghi
Hamid Salehian
| byte | one grain of rice |
| killobyte | cup of rice |
| megabyte | 8 bags of rice |
| gigabyte | 3 semi trucks |
| terabyte | 2 container ships |
| petabyte | blankets Manhattan |
| exabyte | blankets of West Cost States |
| zettabyte | fills the pacific ocean |
| yottabyte | EARTH SIZE RICE BALL |
| byte | one grain of rice |
| killobyte | cup of rice |
| megabyte | 8 bags of rice |
| gigabyte | 3 semi trucks |
| terabyte | 2 container ships |
| petabyte | blankets Manhattan |
| exabyte | blankets of West Cost States |
| zettabyte | fills the pacific ocean |
| yottabyte | EARTH SIZE RICE BALL |
| byte | one grain of rice |
| killobyte | cup of rice |
| megabyte | 8 bags of rice |
| gigabyte | 3 semi trucks |
| terabyte | 2 container ships |
| petabyte | blankets Manhattan |
| exabyte | blankets of West Cost States |
| zettabyte | fills the pacific ocean |
| yottabyte | EARTH SIZE RICE BALL |
| byte | one grain of rice |
| killobyte | cup of rice |
| megabyte | 8 bags of rice |
| gigabyte | 3 semi trucks |
| terabyte | 2 container ships |
| petabyte | blankets Manhattan |
| exabyte | blankets of West Cost States |
| zettabyte | fills the pacific ocean |
| yottabyte | EARTH SIZE RICE BALL |
| byte | one grain of rice |
| killobyte | cup of rice |
| megabyte | 8 bags of rice |
| gigabyte | 3 semi trucks |
| terabyte | 2 container ships |
| petabyte | blankets Manhattan |
| exabyte | blankets of West Cost States |
| zettabyte | fills the pacific ocean |
| yottabyte | EARTH SIZE RICE BALL |
“Big Data in general is defined as high volume, velocity and variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making.”
“Big data is data that exceeds the processing capacity of conventional database systems. The data is too big, moves too fast, or doesn't fit the strictures of your database architectures. To gain value from this data, you must choose an alternative way to process it.”
-- Forrester
O’Reilly --
let's talk about
's of big data
"From the dawn of civilization until 2003, humankind generated five exabytes of data. Now we produce five exabytes every two days…and the pace is accelerating."
Executive Chairman, Google
Twitter Generate approximately 12 TB of data per day
Facebook stores, accesses, and analyzes 30+ Petabytes of user generated data
The worlds biggest machine
Generated 30 Petabytes in 2012 > 100 PB in total!
Each engine generate 10 TB every 30 min
640TB per Flight
Air Bus A380
LHC - Large Hadron Collider
Relational Data (Tables/Transaction/Legacy Data)
Text Data (Web)
Semi-structured Data (XML)
Graph Data
Social Network, Semantic Web (RDF), …
Streaming Data
You can only scan the data once
The boom of
the Internet of Things
will mean that the amount of devices connected to the Internet will rise from about 13 billion today to
50 billion by 2020
12 million RFID tags
– used to capture data and track movement of objects in the physical world – had been sold in by 2011. By 2021, it is estimated that number will have risen to 209 billion as the Internet of Things takes off.
HP Labs estimate that by 2030, 1 trillion sensors will be in use.
Pure text, photo, audio, video, web, GPS data, sensor data, relational data bases, documents, SMS, pdf, flash, etc etc etc.
prediction for US 2012 Election
Nate Silver’s, Five thirty eight blog
Predict Obama had a 86% chance of winning
Predicted all 50 state correctly
- predictive modeling
- mybarackobama.com
- drive traffic to other campaign sites
Facebook page (33 million "likes")
YouTube channel (240,000 subscribers
and 246 million page views).
- a contest to dine with Sarah Jessica Parker
- Every single night, the team ran 66,000
computer simulations, Reddit!!!
- Amazon web services
Moneyball: The Art of Winning an Unfair Game
Oakland Athletics baseball team and its general manager Billy Beane
- Oakland A's' front office took advantage of more analytical gauges
of player performance to field a team that could compete
successfully against richer competitors in MLB
- Oakland approximately $41 million in salary,
New York Yankees, $125 million in payroll that same season.
Oakland is forced to find players undervalued by the market,
- Moneyball had a huge impact in other teams in MLB
And there is a moneyball movie!!!!!
Election 2016
Data is the new Oil. Data is just like crude. It’s valuable, but if unrefined it cannot really be used.
– Clive Humby, DunnHumby
Resources :