Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

BDA Merged

Download as pdf or txt
Download as pdf or txt
You are on page 1of 7

Seat No.: ________ Enrolment No.

___________

GUJARAT TECHNOLOGICAL UNIVERSITY


BE - SEMESTER–VII (NEW) EXAMINATION – SUMMER 2024
Subject Code: 3170722 Date:20-05-2024
Subject Name: Big Data Analytics
Time:02:30 PM TO 05:00 PM Total Marks:70
Instructions:
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full marks.
4. Simple and non-programmable scientific calculators are allowed.
MARKS
Q.1 (a) Give difference between traditional data and big data. 03
(b) What is Big data Analytics? Explain any one case study of it. 04
(c) Explain 4 V’s of Big data. 07

Q.2 (a) List various configuration files used in Hadoop Installation 03


(b) How HDFS is different from traditional NFS? 04
(c) What is Hadoop? Briefly explain the core components of it. 07
OR
(c) What is MapReduce? Explain working of various phases of 07
MapReduce with appropriate example and diagram.

Q.3 (a) Discuss role of Data node and Name node in HDFS. 03
(b) Explain features and advantages of NoSQL. 04
(c) What is data stream mining? Explain stream data model and its 07
architecture in detail.
OR
Q.3 (a) What do you mean by job scheduling in Hadoop? List different 03
schedulers in Hadoop.
(b) What is NoSQL database? List the differences between NoSQL and 04
relational databases.
(c) Explain Counting oneness in a Window and Decaying Window of 07
mining data stream in detail.

Q.4 (a) Explain the Data Model of HBase. 03


(b) How HBase uses Zookeeper to Build Applications? Explain in 04
detail.
(c) Explain Pig data Model in detail and Discuss how it will help for 07
effective data flow.
OR
Q.4 (a) Explain the HiveQL-Select-Order By with suitable example. 03
(b) Compare Hive and Pig query language. 04
(c) Explain briefly on Hbase architecture with neat diagram. 07

Q.5 (a) Justify spark is faster than mapReduce. 03


(b) Define various Actions in Spark. 04
(c) Generalize how is data analysis used in 07
a) Stock market predictions.
b) Weather forecasting predictions.
OR
Q.5 (a) What are the components of spark? Also state the features of 03
spark.
(b) What does a Spark Engine do? What is GraphX? 04
(c) What is RDD? State and Explain RDD operations. 07

*************
Seat No.: ________ Enrolment No.___________

GUJARAT TECHNOLOGICAL UNIVERSITY


BE - SEMESTER–VII (NEW) EXAMINATION – WINTER 2023
Subject Code:3170722 Date:06-12-2023
Subject Name: Big Data Analytics
Time: 10:30 AM TO 01:00 PM Total Marks:70
Instructions:
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full marks.
4. Simple and non-programmable scientific calculators are allowed.
MARKS

Q.1 (a) Explain various challenges of conventional system. 03


(b) What is big data? Explain four V’s of big data. 04
(c) Discuss structured, unstructured and semi-structured big data types. Give 07
proper examples.
Q.2 (a) Explain name node, data node and secondary node. 03
(b) Explain NoSQL business drivers. 04
(c) Write a short note on Hadoop ecosystem. 07
OR
(c) Explain working of following phases of Map Reduce with one common 07
example.
(i) Map Phase (ii) Combiner Phase
(iii) Shuffle and Sort Phase (iv) Reducer Phase

Q.3 (a) Explain following commands of HDFS with syntax and at least one 03
example of each.
(i) copyFromLocal (ii) mv (iii) cat
(b) Write basic wordcount program using PySPARK. 04
(c) What is transformation and actions in Apache Spark? Discuss various 07
commands available for this activity in Apache Spark?
OR
Q.3 (a) Discuss various benefits of stream processing. 03
(b) What is RDD? Explain role of RDD in Spark. 04
(c) Describe different types of NoSQL databases with the help of example. 07

Q.4 (a) Write differences between NoSQL and SQL. 03


(b) Define Hadoop. List and explain advantages of Hadoop. 04
(c) Explain Bloom’s Filter with the help of example. 07
OR
Q.4 (a) Explain NoSQL base property. 03
(b) Differentiate between Pig vs Hive. 04
(c) Discuss Flajolet-Martin Algorithm to count Distinct elements in a stream 07
with proper example.

Q.5 (a) Describe important features of Apache Spark. 03


(b) Write a short note: Stock Market Predictions 04
(c) Explain working of Hive with proper steps and diagram. 07
OR
Q.5 (a) Explain any three HiveQL DDL command with its syntax and example. 03
(b) How samples are collected from data stream? 04
(c) Discuss four ways that NoSQL system handles big data problems. 07
***********
1
Seat No.: ________ Enrolment No.___________

GUJARAT TECHNOLOGICAL UNIVERSITY


BE – SEMESTER- VII EXAMINATION-SUMMER 2023
Subject Code: 3170722 Date: 23/06/2023
Subject Name: Big Data Analytics
Time: 10:30 AM TO 01:00 PM Total Marks: 70
Instructions:
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full marks.
4. Simple and non-programmable scientific calculators are allowed.

Q.1 (a) List various application of big data. 03


(b) Explain the difference between structure and unstructured data. 04
(c) Define Big Data and also Explain Characteristicis of Big Data. 07

Q.2 (a) List various configuration files used in Hadoop Installation. What is use 03
of mapred.
(b) Explain the difference between structure and unstructured data. 04
(c) Discuss Big Data in Healthcare,Trasportation & Medicine. 07
OR
(c) Discuss Hadoop YARN in detail with failures in classic MapReduce. 07

Q.3 (a) Explain advantages and disadvantages of big data analytics. 03


(b) Discuss role of Data node and Name node in HDFS. 04
(c) Explain working of Hive with proper steps and diagram. 07
OR
Q.3 (a) Compare Raw oriented and Column Oriented database structures. 03
(b) Differentiate: Apache pig Vs Map Reduce. 04
(c) Define join and explain types of join. 07

Q.4 (a) Difference between master-slave versus peer-to-peer distribution 03


models.
(b) What is Stream Computing and Sampling Data in a Stream. 04
(c) What is Apache Spark? What are the advantages of using Apache Spark 07
over Hadoop? Explain in brief four major libraries of Apache Spark.
OR
Q.4 (a) Write a short note on Decaying Window 03
(b) Write a short note on RTAP. 04
(c) Explain the HiveQL-Select-Order By with suitable example. 07

Q.5 (a) Write application of writing Spark. 03


(b) Write difference between MangoDB and Hadoop. 04
(c) Define NoSQL and where is it used? (b) i) Document Oriented Database 07
ii) Graph based Database.
OR
Q.5 (a) Describe data processing operators in Pig. 03
(b) Write differences between NoSQL and SQL. 04
(c) What is Zookeeper? List the benefits of it. 07

*************
1
Seat No.: ________ Enrolment No.___________

GUJARAT TECHNOLOGICAL UNIVERSITY


BE - SEMESTER–VII (NEW) EXAMINATION – WINTER 2022
Subject Code:3170722 Date:20-01-2023
Subject Name:Big Data Analytics
Time:10:30 AM TO 01:00 PM Total Marks:70
Instructions:
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full marks.
4. Simple and non-programmable scientific calculators are allowed.
MARKS
Q.1 (a) What is Big data? Explain sources of big data. 03
(b) Explain types of Big data. 04
(c) Write a short note on Hadoop Architecture. 07

Q.2 (a) Explain usage of Map reduce. 03


(b) Explain methods of Map reduce of Mapper class. 04
(c) Write a short note on HDFS (Hadoop Distributed File System). 07
OR
(c) Define Social networks and Social Network Mining. 07

Q.3 (a) Give difference between SQL and NoSQL. 03


(b) Explain features and advantages of NoSQL. 04
(c) Explain different types of NoSQL. 07
OR
Q.3 (a) Give difference between master-slave model and peer-to-peer model. 03
(b) Give difference between DBMS and DSMS (Data Science Management 04
System).
(c) Explain filtering a stream in detail. 07

Q.4 (a) What is Stock Market Prediction? 03


(b) What is Hbase and Hive? 04
(c) Discuss about How E-Commerce is Using Big Data to Improve Business 07
in detail.
OR
Q.4 (a) Explain 5V's of Big Data. 03
(b) Give list of HBase Commands. 04
(c) What is a Zookeeper? What are the benefits of using a zookeeper? 07

Q.5 (a) Why is big data analytics important? 03


(b) Explain HIVE data types in brief. 04
(c) Explain modules of Hadoop. 07
OR
Q.5 (a) Explain advantages of Big Data Processing. 03
(b) Explain different challenges of Big Data. 04
(c) Explain Spark Architecture. 07

*************

1
Seat No.: ________ Enrolment No.___________

GUJARAT TECHNOLOGICAL UNIVERSITY


BE - SEMESTER–VII (NEW) EXAMINATION – SUMMER 2022
Subject Code:3170722 Date:18/06/2022
Subject Name:Big Data Analytics
Time:02:30 PM TO 05:00 PM Total Marks: 70
Instructions:
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full marks.
4. Simple and non-programmable scientific calculators are allowed.

Q.1 (a) What is Big data Analytics? List applications of it. 03


(b) Explain 4 V’s of Big data. 04
(c) What is Hadoop? Briefly explain the core components of it. 07

Q.2 (a) How is Big data and Hadoop related? 03


(b) How HDFS is different from traditional NFS? 04
(c) Draw and Explain HDFS architecture. How can you restart 07
NameNode and all the daemons in Hadoop?
OR
(c) What is MapReduce? Explain working of various phases of 07
MapReduce with appropriate example and diagram.
Q.3 (a) What do you mean by job scheduling in Hadoop? List different 03
schedulers in Hadoop.
(b) What are WAL, MemStore, Hfile and Hlog in HBase? 04
(c) Explain the architecture of HBase. 07
OR
Q.3 (a) What is Zookeeper? 03
(b) Differentiate between HIVE and HBASE. 04
(c) What is NoSQL database? List the differences between NoSQL and 07
relational databases. Explain in brief various types of NoSQL
databases in practice.

Q.4 (a) What is Pig? 03


(b) What are the features of MongoDB? 04
(c) Explain the concept of regions in HBase and storing Big data with 07
HBase.
OR
Q.4 (a) How MongoDB is better than SQL database? 03
(b) Explain the Data Model of HBase. 04
(c) Explain Pig data Model in detail and Discuss how it will help for 07
effective data flow.

Q.5 (a) Write a short note on Spark. 03


(b) Write difference between MongoDB and Hadoop. 04
(c) Explain CRUD operations in MongoDB. 07
OR
Q.5 (a) What are the features of MongoDB? 03
(b) Explain Replication and scaling feature of MongoDB. 04
(c) What is RDD? State and Explain RDD operations. 07
***********

1
Seat No.: ________ Enrolment No.___________

GUJARAT TECHNOLOGICAL UNIVERSITY


BE - SEMESTER–VII (NEW) EXAMINATION – WINTER 2021
Subject Code:3170722 Date:23/12/2021
Subject Name:Big Data Analytics
Time:10:30 AM TO 01:00 PM Total Marks: 70
Instructions:
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full marks.
4. Simple and non-programmable scientific calculators are allowed.

MARKS

Q.1 (a) Explain types of Big Data 03


(b) Describe Traditional vs. Big Data business approach. 04
(c) What is Big Data? Explain Challenges of Conventional System. 07

Q.2 (a) Explain basic Components of Analyzing the Data with Hadoop. 03
(b) What is Map Reduce and explain How Map Reduce Works? 04
(c) Brief Anatomy of a Map Reduce Job run and Failures. 07
OR
(c) Describe Map Reduce Types and Formats. 07
Q.3 (a) Explain NoSQL data architecture. 03
(b) Elaborate Key-value stores, Graph stores, Column family stores & 04
Document stores.
(c) Describe analyzing big data with a shared-nothing architecture. 07
OR
Q.3 (a) Difference between master-slave versus peer-to-peer distribution models. 03
(b) What is a big data NoSQL? Explain in details. 04
(c) Which Four ways that NoSQL systems handle big data problems. 07
Q.4 (a) What is Stream Computing and Sampling Data in a Stream. 03
(b) Write the application of RTAP. 04
(c) Explain Stream Data Model and Architecture. 07
OR
Q.4 (a) How Graph Analytics used in Big Data. 03
(b) Write a short note on Decaying Window. 04
(c) Explain with example: How to perform Real Time Sentiment Analysis of 07
any product.
Q.5 (a) What is the use of Pig and Hive in Big Data? 03
(b) Describe data processing operators in Pig. 04
(c) Describe HBase and ZooKeeper in details. 07
OR
Q.5 (a) Explain HIVE services. 03
(b) Write application of writing Spark. 04
(c) Describe any application that you know related to enhance particular 07
business using big data and explain how it is important as a business
prospective.

*******

You might also like