CS712 Assignment 1
CS712 Assignment 1
01
The purpose of assignments is to provide you with hands on practice on database design. It is
expected that students will solve the assignments themselves. Following rules will apply during
the evaluation of assignment.
Cheating from any source will result in zero marks in the assignment.
Any student found cheating in any two of the assignments submitted will be awarded "F"
grade in the course.
In case of question No. 03 direct copy and paste from the research paper will be awarded
zero marks.
a) Consider a company ABC, having different Employees with different Skills and Hobbies
according to the below given table,
1. 1st NF:
Its in 1st Normal Form because it’s contain single values in its cells.
2. 2nd NF:
Now table is in 1st Normal Form.
Now we will remove Partial Dependency.
Make separate tables for them.
Now its in 3rd Normal Form because its contain no transitive dependency.
Question No.2 [Marks: 15]
Consider the above example and consider the below table as Employee_Personal;
Answer:
1st NF:
The Given Table is already in 1st Normal Form because it’s contain atomic/single values in its
cells.
2ndNF:
As we now Primary key is only contain one Attribute/column so It contain no Partial
Dependency.
Given table is in already 2nd Normal Form.
3rd NF:
Remove Transitive Dependency.
U_email -> Fname
Zip -> City , Province
Decompose it in sub tables
U_email Fname
rafi@ymail.com Rafiq Zi City Province
ashfaq@gmail.com Ashfaq p
dany@hotmail.co Dany 123 Peshwar KPK
m 456 Karachi Sindh
Liza@yahoo.com Liza 789 Mirpur KPK
Ch85@gmail.com Chaudry 042 Lahore Punjab
Primary Key = E_ID
Foreign Key = U_email, Zip
Read the paper entitled as “SQL: From Traditional Database to Big Data” available in the zip
folder.
While Map Reduce is a powerful framework to build highly distributed and scalable programs, it
is also complex and difficult to learn. In fact, even simple data operations, like joining two
datasets or identifying the Top-K records, require relatively complex Map Reduce programs.
This is the case because Map Reduce requires users to build a program using a procedural
language that needs a detailed specification of how a processing task should be carried out. Hive
QL supports many of the features of SQL but it does not strictly follow a full SQL standard.
Hive supports multiple DDL and DML commands such as CREATE TABLE, SELECT,
INSERT, UPDATE and DELETE. Moreover, starting with Hive0.13, it is possible to support
transactions with full ACID semantics at the row (record) level. While originally most of the
NoSQL systems did not adhere to the relational database model and did not support SQL for data
manipulation. SQL with VoltDB and it can not run on window and run only on linux
b) Suggest any other efficient framework/ method for handling big data analytics.
ANS:
A- HD-MALA structure have a great deal of focal points than exemplary and current
information digging apparatuses and items for huge examination. Speedier data to-
decision taking care of speed for nonstop military and business applications
because our consistent parallel and coursed preparing computations are made
by using the Storm that don't require questions and data advancement;
B- Results in capable enrolling considering the way that there is irrelevant data
advancement and correspondence traffic over frameworks (tallying virtual
machines), and no rough data is moved for data mining around frameworks to
basically improve preparing capability and system security;
The applications which can leverage the usage of mobile analytics can be classified as mobile
crowd-sourcing or crowd-sensing. These crowd-sourcing applications depend on multiple
sensors integrated within smart phones to collect data from a very large group of mobile
users and provide contextual information by giving priority to users' operating requirements.
It requires continuous sensing, processing and uploading or transferring sensed/processed
data to the cloud or remote servers. These energy expensive functions may in certain cases
lead to battery depletion (Gaber et al., 2013). In (Gaber et al., 2013), mobile analytics is
classified into push-based independent systems with local sensing and processing, push/pull-
based independent/collaborated systems with local sensing and cloud processing/storage and
push/pull-based collaborated system with distributed processing and load balancing between
cloud and mobile device. (Castro et al., 2014) present a collaborative extension to InCense, a
mobile phone sensing toolkit to enable behavioral data gathering from populations of mobile
phone users during mobile phone sensing campaigns A Spark-based framework (Alsheikh et
al., 2016) for learning deep models was proposed for mobile data analytics within large-scale
mobile systems A Spark-based framework (Alsheikh et al., 2016) for learning deep models
was proposed for mobile data analytics within large-scale mobile systems A Spark-based
framework (Alsheikh et al., 2016) for learning deep models was proposed for mobile data
analytics within large-scale mobile systems A Spark-based framework (Alsheikh et al., 2016)
for learning deep models was proposed for mobile data analytics within large-scale mobile
systems A Spark-based framework (Alsheikh et al., 2016) for learning deep models was
proposed for mobile data analytics within large-scale mobile systems