
Gujarat Technological University


Seat No.: ________ Enrolment No.: ___________

GUJARAT TECHNOLOGICAL UNIVERSITY


BE - SEMESTER–V (NEW) EXAMINATION – SUMMER 2021
Subject Code: 3150713 Date: 07/09/2021
Subject Name: Python for Data Science
Time: 10:30 AM TO 01:00 PM Total Marks: 70
Instructions:
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full marks.
4. Simple and non-programmable scientific calculators are allowed.

MARKS
Q.1 (a) Differentiate between the list and dictionary data types of Python by their 03
characteristics, with a brief example of each.
(b) What is meant by the slicing operation on strings in Python? Write an 04
example of slicing to fetch the first name and last name from the full name
of a person and display them.
(c) What are the basic activities performed as part of the data science 07
pipeline? Summarize and explain in brief.

Q.2 (a) What are the core competencies needed to become a data scientist? 03
Explain in brief.
(b) Compare and summarize the four different coding styles supported by 04
the Python language.
(c) Summarize the characteristics of NumPy, Pandas, Scikit-Learn and 07
matplotlib libraries along with their usage in brief.
OR
(c) What do you mean by prototyping? List the phases of the prototyping 07
and experimentation process and explain them in brief.
Q.3 (a) Compare NumPy and pandas on the basis of their characteristics 03
and usage.
(b) For what purpose is sampling used? Demonstrate random sampling 04
with an example.
(c) What is the need for streaming data? Explain data uploading and 07
data streaming with an example.
OR
Q.3 (a) How is XPath useful for the analysis of HTML data? Explain in brief. 03
(b) Define the term n-gram. Explain the TF-IDF technique. 04
(c) List the techniques to handle missing data. Explain the various 07
techniques with examples.

Q.4 (a) List the various types of graphs/charts available in the pyplot module of the matplotlib 03
library for data visualization. Explain any two of them in brief.
(b) What kind of data is analyzed with the bag-of-words model? Explain it with 04
an example.
(c) What do you mean by time series data? How can we plot it? Explain 07
with an example that plots a trend over time.

OR

Q.4 (a) Compare bar graph, box-plot and histogram with respect to their 03
applicability in data visualization.
(b) Define stemming. Explain the concept of stemming with an example. 04
(c) What is the use of a scatter plot in data visualization? Can we draw 07
a trendline on a scatter plot? Explain with an example.

Q.5 (a) Define the term data wrangling. Explain the steps needed to perform 03
data wrangling.
(b) Why do we need to perform Z-score standardization in EDA? Justify it 04
with an example.
(c) What is the use of a hash function in EDA? Describe the various hashing 07
tricks with examples.
OR
Q.5 (a) What do you mean by Exploratory Data Analysis (EDA)? How is the t-test 03
useful for EDA?
(b) What do you mean by covariance? What is the importance of 04
covariance in data analysis? Explain it with an example.
(c) List the different ways of defining descriptive statistics for 07
numeric data. Explain them in brief.

*************
