2
2
2
Every day, these users contribute to billions of images, posts, videos, tweets
etc. You can now imagine the insanely large amount -or Volume- of data that
is generated every minute and every hour.
Twitter alone generates more than 7 terabytes (TB) of data every day,
Facebook 10 TB
1-VOLUME
(Application View)
New Model: all of us are generating data, and all of us are consuming data
1
9
Big Data sources
Scientific instruments
Social media and networks (collecting all sorts of data)
(all of us are generating data)
With Velocity we refer to the speed with which data are being generated.
Staying with our social media example,
“Velocity we refer to the speed with which data are being generated
and need to handled by application”
3-VARIETY
(Generic View)
Different Types:
✓ Relational Data (Tables/Transaction/Legacy Data)
✓ Text Data (Web)
✓ Semi-structured Data (XML)
✓ Graph Data
– Social Network, Semantic Web (RDF), …
✓ Streaming Data
– You can only scan the data once
30
3-VARIETY
(Application View)
Big data is meaningless if it does not provide value toward some meaningful goal
33
5-VALUE
38