BDA Merged
___________
Q.3 (a) Discuss the role of the DataNode and NameNode in HDFS. 03
(b) Explain the features and advantages of NoSQL. 04
(c) What is data stream mining? Explain the stream data model and its 07
architecture in detail.
OR
Q.3 (a) What do you mean by job scheduling in Hadoop? List different 03
schedulers in Hadoop.
(b) What is NoSQL database? List the differences between NoSQL and 04
relational databases.
(c) Explain Counting Ones in a Window and the Decaying Window in 07
mining data streams in detail.
*************
Seat No.: ________ Enrolment No.___________
Q.3 (a) Explain the following HDFS commands with syntax and at least one 03
example of each.
(i) copyFromLocal (ii) mv (iii) cat
(b) Write a basic word count program using PySpark. 04
(c) What are transformations and actions in Apache Spark? Discuss the various 07
commands available for these operations in Apache Spark.
OR
Q.3 (a) Discuss various benefits of stream processing. 03
(b) What is an RDD? Explain the role of RDDs in Spark. 04
(c) Describe the different types of NoSQL databases with the help of examples. 07
Q.2 (a) List various configuration files used in Hadoop Installation. What is use 03
of mapred.
(b) Explain the difference between structured and unstructured data. 04
(c) Discuss Big Data in Healthcare, Transportation & Medicine. 07
OR
(c) Discuss Hadoop YARN in detail with failures in classic MapReduce. 07
*************
Seat No.: ________ Enrolment No.___________
MARKS
Q.2 (a) Explain the basic components of analyzing data with Hadoop. 03
(b) What is MapReduce? Explain how MapReduce works. 04
(c) Briefly describe the anatomy of a MapReduce job run and its failures. 07
OR
(c) Describe MapReduce types and formats. 07
Q.3 (a) Explain NoSQL data architecture. 03
(b) Elaborate on key-value stores, graph stores, column-family stores & 04
document stores.
(c) Describe analyzing big data with a shared-nothing architecture. 07
OR
Q.3 (a) Differentiate between master-slave and peer-to-peer distribution models. 03
(b) What is a big data NoSQL? Explain in detail. 04
(c) What are the four ways in which NoSQL systems handle big data problems? 07
Q.4 (a) What are Stream Computing and Sampling Data in a Stream? 03
(b) Write the applications of RTAP. 04
(c) Explain Stream Data Model and Architecture. 07
OR
Q.4 (a) How is Graph Analytics used in Big Data? 03
(b) Write a short note on Decaying Window. 04
(c) Explain with an example how to perform Real Time Sentiment Analysis of 07
any product.
Q.5 (a) What is the use of Pig and Hive in Big Data? 03
(b) Describe data processing operators in Pig. 04
(c) Describe HBase and ZooKeeper in detail. 07
OR
Q.5 (a) Explain Hive services. 03
(b) Write the applications of Spark. 04
(c) Describe any application you know of that uses big data to enhance a 07
particular business, and explain why it is important from a business
perspective.
*******