
Advanced DataBase Assignment

Big data is large and complex data that cannot be processed by traditional databases. It includes structured, unstructured, and semi-structured data from sources like social media, stock exchanges, and jet engines. Big data is characterized by its high volume, variety, velocity, and variability. Analyzing big data can provide benefits like improved customer service, better operational efficiency, and enabling businesses to utilize outside intelligence for decision making.

Uploaded by

Ibrahem Ramadan

ADVANCED DATABASE ASSIGNMENT

2020
Lamia Omran
Ibrahem Ramadan Ahmed // ID : 18107958
Big Data

1- What is Big Data?

Big Data is a collection of data that is huge in volume and still growing
exponentially with time. Its size and complexity are so great that no
traditional data management tool can store or process it efficiently.
In short, Big Data is still data, but at an enormous scale.

Big Data calls for a different, more advanced approach than the
standard database. Standard relational databases are efficient for
storing and processing structured data: they use tables to store the
data and Structured Query Language (SQL) to access and retrieve it.
Big Data also includes unstructured and semi-structured data. A
specific class of databases, known as NoSQL databases, addresses this
kind of data, and several types of NoSQL databases and tools are
available to store and process Big Data. NoSQL databases are optimized
for analytics over Big Data such as text, images, logos, and other
data formats such as XML and JSON. Big Data is helpful for developing
data-driven intelligent applications.
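
The contrast between table-based and semi-structured storage can be sketched in a few lines. This is only an illustration and not part of the original assignment: the `customers` table, the field names, and the sample records are hypothetical, and Python's standard `sqlite3` and `json` modules stand in for a relational database and a document-oriented NoSQL store.

```python
import json
import sqlite3

# Structured data: every row must fit the fixed columns of the table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO customers (id, name, city) VALUES (?, ?, ?)",
    [(1, "Amira", "Cairo"), (2, "Omar", "Alexandria")],
)
# SQL is used to access and retrieve the data.
cairo_names = [
    row[0]
    for row in conn.execute("SELECT name FROM customers WHERE city = ?", ("Cairo",))
]
conn.close()

# Semi-structured data: each JSON document may carry different fields.
raw_documents = [
    '{"id": 1, "name": "Amira", "tags": ["vip"]}',
    '{"id": 2, "name": "Omar", "last_login": "2020-05-01", "devices": {"phone": "android"}}',
]
documents = [json.loads(text) for text in raw_documents]
# Collect the field names of each document; they do not have to match.
fields_per_document = [sorted(doc.keys()) for doc in documents]

print(cairo_names)
print(fields_per_document)
```

The table rejects any row that does not match its columns, while the two JSON documents share only `id` and `name`. That flexibility is what lets document stores absorb semi-structured data.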
2- Examples Of Big Data

• Social Media

Statistics show that more than 500 terabytes of new data are ingested
into the databases of the social media site Facebook every day. This
data is mainly generated through photo and video uploads, message
exchanges, comments, and so on.

• The New York Stock Exchange generates about one terabyte of new
trade data per day.

• A single jet engine can generate more than 10 terabytes of data in
30 minutes of flight time. With many thousands of flights per day, the
data generated reaches many petabytes.
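
A rough back-of-the-envelope calculation shows how the figures above add up to petabytes. The 10 TB per 30 minutes and 500 TB per day rates come from the text. The number of flights, the flight duration, the single monitored engine per flight, and the decimal unit conversion are assumptions chosen purely for illustration.

```python
TB_PER_PB = 1000  # decimal units: 1 petabyte = 1000 terabytes

# Figures quoted in the text (both are stated as lower bounds).
jet_tb_per_half_hour = 10
facebook_tb_per_day = 500

# Hypothetical assumptions, not from the source.
flights_per_day = 5000   # "many thousand flights per day"
hours_per_flight = 2     # one monitored engine per flight

# Jet engines: TB per flight, then per day across all flights.
tb_per_flight = jet_tb_per_half_hour * 2 * hours_per_flight
jet_pb_per_day = tb_per_flight * flights_per_day / TB_PER_PB

# Facebook: daily ingest accumulated over one year.
facebook_pb_per_year = facebook_tb_per_day * 365 / TB_PER_PB

print(f"Jet engines: {jet_pb_per_day:.0f} PB per day")
print(f"Facebook: {facebook_pb_per_year:.1f} PB per year")
```

Under these assumptions the jet-engine example alone reaches about 200 petabytes per day, which is consistent with the "many petabytes" claim.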
3- What is big data analytics?

Big data analytics is the use of advanced analytic techniques against
very large, diverse big data sets that include structured,
semi-structured and unstructured data, from different sources, and in
different sizes from terabytes to zettabytes.

What is big data exactly? It can be defined as data sets whose size or
type is beyond the ability of traditional relational databases to capture,
manage and process the data with low latency. Characteristics of big
data include high volume, high velocity and high variety. Sources of
data are becoming more complex than those for traditional data
because they are being driven by artificial intelligence (AI), mobile
devices, social media and the Internet of Things (IoT). For example, the
different types of data originate from sensors, devices, video/audio,
networks, log files, transactional applications, web and social media —
much of it generated in real time and at a very large scale.

The amount of data in today’s world is staggering. But big data offers
vast opportunities for businesses, whether used independently or with
existing traditional data. Data scientists, analysts, researchers and
business users can leverage these new data sources for advanced
analytics that deliver deeper insights and power innovative big data
applications. Some common techniques include data mining, text
analytics, predictive analytics, data visualization, AI, machine learning,
statistics and natural language processing.
4- Characteristics Of Big Data

Big data can be described by the following characteristics:

• Volume – The name Big Data itself refers to an enormous size. The
size of data plays a crucial role in determining the value that can be
drawn from it. Whether a particular data set can actually be considered
Big Data also depends on its volume. Hence, 'Volume' is one
characteristic that needs to be considered when dealing with Big Data.

• Variety – The next aspect of Big Data is its variety. Variety refers
to heterogeneous sources and the nature of data, both structured and
unstructured. In earlier days, spreadsheets and databases were the only
sources of data considered by most applications. Nowadays, data in the
form of emails, photos, videos, monitoring devices, PDFs, audio, etc.
is also considered in analysis applications. This variety of
unstructured data poses certain issues for storing, mining and
analyzing data.

• Velocity – The term 'velocity' refers to the speed at which data is
generated. How fast the data is generated and processed to meet demand
determines the real potential in the data.

• Variability – This refers to the inconsistency that the data can show
at times, which hampers the ability to handle and manage the data
effectively.
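
The four characteristics above can be made concrete by measuring them on a small batch of events. The sketch below is illustrative only: the event log is invented, and the measure chosen for each characteristic (total bytes, distinct formats, events per second, spread of the arrival rate) is one possible proxy, not a standard definition.

```python
from collections import Counter
from statistics import pstdev

# Hypothetical event log, sorted by arrival time:
# (arrival time in seconds, format, size in bytes).
events = [
    (0, "json", 1200), (0, "jpeg", 250000), (1, "json", 900),
    (1, "text", 300), (1, "json", 1100), (2, "mp4", 4000000),
    (4, "json", 1000), (4, "text", 500),
]

# Volume: total bytes received.
volume_bytes = sum(size for _, _, size in events)

# Variety: which formats appear, and how often.
formats = Counter(fmt for _, fmt, _ in events)

# Velocity: average number of events arriving per second.
first, last = events[0][0], events[-1][0]
duration = last - first + 1  # seconds covered, inclusive
velocity = len(events) / duration

# Variability (proxy): how unevenly events arrive from second to second.
per_second = Counter(t for t, _, _ in events)
counts = [per_second[t] for t in range(first, last + 1)]
variability = pstdev(counts)

print(volume_bytes, dict(formats), velocity, counts, round(variability, 2))
```

In this toy log, second 3 receives no events at all while second 1 receives three. Real systems have to absorb the same kind of inconsistency at far larger scale.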
5- Importance of Big Data Processing

The ability to process Big Data brings multiple benefits, such as:

• Businesses can utilize outside intelligence when making decisions

Access to social data from search engines and sites like Facebook and
Twitter enables organizations to fine-tune their business strategies.

• Improved customer service

Traditional customer feedback systems are being replaced by new systems
designed with Big Data technologies. In these new systems, Big Data and
natural language processing technologies are used to read and evaluate
consumer responses.

• Early identification of risk to the product/services, if any

• Better operational efficiency

Big Data technologies can be used to create a staging area or landing
zone for new data before identifying what data should be moved to the
data warehouse. In addition, such integration of Big Data technologies
and the data warehouse helps an organization offload infrequently
accessed data.
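
The customer-service benefit listed above, where software reads and evaluates consumer responses, can be illustrated with a deliberately simple scoring sketch. The word lists, the sample responses, and the `score` function are hypothetical. A production system would rely on a trained natural language processing model rather than a fixed vocabulary.

```python
import re

# Hypothetical word lists; a real system would use a trained NLP model.
POSITIVE = {"great", "fast", "helpful", "love"}
NEGATIVE = {"slow", "broken", "rude", "late"}


def score(response: str) -> int:
    """Return the count of positive words minus the count of negative words."""
    words = re.findall(r"[a-z']+", response.lower())
    return sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)


responses = [
    "Great service, fast delivery!",
    "The app is slow and support was rude.",
    "Order arrived late but the staff were helpful.",
]
scores = [score(r) for r in responses]

# Responses with a negative score are flagged for a human follow-up.
needs_follow_up = [r for r, s in zip(responses, scores) if s < 0]

print(scores)
print(needs_follow_up)
```

Even this crude approach shows the idea: feedback is evaluated automatically as it arrives, and only the responses that look negative are routed to staff.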
Summary
• Big Data definition: Big Data is a term used to describe a
collection of data that is huge in size and yet growing exponentially
with time.
• Examples of Big Data sources include stock exchanges, social media
sites, jet engines, etc.
• Big Data can be 1) structured, 2) unstructured, or 3) semi-structured.
• Volume, Variety, Velocity, and Variability are a few Big Data
characteristics.
• Improved customer service, better operational efficiency, and better
decision making are a few advantages of Big Data.
