Unit I: Chapter 1: Introduction To Big Data
Unit I: Chapter 1: Introduction To Big Data
Unit I: Chapter 1: Introduction To Big Data
• Source: “Big Data Infographic and Gartner 2012 Top 10 Strategic Tech Trends.” Business Analytics 3.0 (blog)
(November 11, 2011). http://practicalanalytics.
Semi-Structured Data
• Also known as having a schema-less or self describing
structure refers to a form of structured data that contains tags or
markup elements in order to separate elements and generate
hierarchies of records and fields in the given data.
• Such type of data does not follow the proper structure of data
models as in relational databases
• Data is stored inconsistently in rows and columns of a database
• Some sources for semi-structured data includes:
– File systems such as Web Data in the form of cookies
– Data exchange formats such as JavaScript Object Notation(JSON)
data
Elements of Big Data
• According to Gartner, data is growing at the rate of
59% every year.
• This growth can be depicted in terms of the
following four Vs:
– Volume
– Velocity
– Variety
– Veracity
Volume
• Is the amount of data generated by the
organizations or individuals
• Today the volume of data in most organizations is
approaching exabytes
• Some experts predict the volume of data to reach
zettabytes in the coming years