Bda

This document is an exam for the subject "Big Data Analytics" taken at Gujarat Technological University. It contains 5 questions assessing various topics in big data and distributed systems. Question 1 asks about big data processing vs distributed processing, applications of big data for business, and the Hadoop architecture. Question 2 covers Avro data serialization, big data characteristics, and the Hadoop ecosystem. Question 3 involves HDFS commands, MapReduce phases, and writing MapReduce programs. Question 4 is about Zookeeper, HDFS architecture, and Apache Pig. Question 5 discusses MongoDB concepts and NoSQL databases or alternately scaling in MongoDB, RDDs in Spark, and why RDDs are better than MapReduce storage.

Uploaded by

Jigar

Original Title

171804-2171607-BDA

Seat No.: ________ Enrolment No. ___________

GUJARAT TECHNOLOGICAL UNIVERSITY

BE – SEMESTER 7 (NEW SYLLABUS) EXAMINATION- SUMMER 2018

Subject Code: 2171607 Date: 28-04-2018

Subject Name: BIG DATA ANALYTICS (Department Elective-II)
Time: 02:30 pm to 05:00 pm Total Marks: 70
Instructions:
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full marks.

Q.1 (a) What is Big Data? Explain how big data processing differs from 03
distributed processing.
(b) List various applications of big data. How can it be used to improve 04
business for a superstore?
(c) Explain core architecture of Hadoop with suitable block diagram. Discuss 07
role of each component in detail.

Q.2 (a) Explain Avro data serialization technique in MapReduce. 03

(b) Explain characteristics of Big Data. 04
(c) What is Hadoop Ecosystem? Discuss various components of Hadoop 07
Ecosystem.
OR
(c) What is data serialization? With proper examples discuss and differentiate 07
structured, unstructured and semi-structured data. Make a note on how
type of data affects data serialization.
Q.3 (a) Explain the following commands with syntax and at least one example of 03
each: (1) copyFromLocal (2) showing the content of the output file.
(b) Explain “Map Phase” and “Combiner Phase” in MapReduce. 04
(c) Write Map Reduce steps for counting occurrences of specific numbers in 07
the input text file(s). Also write the commands to compile and run the
code.
OR
Q.3 (a) List various configuration files used in Hadoop installation. What is the 03
use of mapred-site.xml?
(b) Explain “Shuffle & Sort” phase and “Reducer Phase” in MapReduce. 04
(c) Write Map Reduce steps for counting sum of numbers in the input text 07
file(s). Also write the commands to compile and run the code.
Q.4 (a) What is Zookeeper? What are the benefits of Zookeeper? 03
(b) Draw architecture of APACHE PIG and explain in short. 04
(c) Define HDFS. Discuss the HDFS Architecture and HDFS Commands in 07
brief.
OR
Q.4 (a) What is HBase? Write a query to create a table in HBase. 03
(b) Discuss role of Data node and Name node in HDFS. 04

(c) Draw and explain Architecture of APACHE HIVE. Explain various data 07
insertion techniques in HIVE with example.
Q.5 (a) Explain the following in brief with respect to MongoDB: 03
1) Collections and documents
2) Indexing and retrieval
(b) Write the differences between MongoDB and Hadoop. 04
(c) What is NoSQL database? List the differences between NoSQL and 07
relational databases. Explain in brief various types of NoSQL databases
in practice.
OR
Q.5 (a) Explain scaling in MongoDB. 03
(b) Explain CRUD operations in MongoDB. 04
(c) What is Resilient Distributed Dataset in Apache Spark? Explain in detail. 07
Make a note on why RDD is better than MapReduce data storage.

*************
