Python Interview Prep Doc

The document serves as a Python interview preparation guide, focusing on data visualization libraries Matplotlib and Seaborn, highlighting their differences in syntax, aesthetics, and statistical visualization capabilities. It includes practical questions and answers about customizing plots, handling categorical data, and visualizing distributions and relationships. Additionally, it compares Pandas and NumPy, explaining their functionalities and use cases in data manipulation and analysis.

Uploaded by

deepali


PYTHON INTERVIEW PREP DOC

1) LIBRARIES
2) Matplotlib vs Seaborn (Seaborn is a data visualization library built on top of Matplotlib)
1) Ease of syntax
- Matplotlib - requires more code to create a visualization.
- Seaborn - simplifies the process of creating appealing plots.
import matplotlib.pyplot as plt
import pandas as pd

# Sample data (illustrative values) used by the scatter plots
df = pd.DataFrame({'x': [-2, -1, 0, 1, 2], 'y': [4, 1, 0, 1, 4],
                   'category': ['a', 'a', 'b', 'b', 'b']})

# Matplotlib Scatter Plot

plt.figure(figsize=(8, 6)) # Set the figure size
plt.scatter(df['x'], df['y'], color='blue', s=100) # Scatter points
plt.title('Matplotlib Scatter Plot') # Title
plt.xlabel('X-axis') # X-axis label
plt.ylabel('Y-axis') # Y-axis label
plt.grid(True) # Add a grid
plt.axhline(0, color='black', linewidth=0.5, ls='--') # Horizontal line
plt.axvline(0, color='black', linewidth=0.5, ls='--') # Vertical line
plt.show() # Show plot

import seaborn as sns

# Seaborn Scatter Plot

plt.figure(figsize=(8, 6)) # Set the figure size
sns.scatterplot(x='x', y='y', hue='category', data=df, s=100) # Scatter points with hue
plt.title('Seaborn Scatter Plot') # Title
plt.grid(True) # Add a grid
plt.show() # Show plot

Matplotlib requires explicit commands for setting the figure size, scatter points, title,
labels, grid, and lines, so more code and customization steps are involved.
Seaborn takes the axis labels from the column names and handles the color coding of
categories through the hue parameter in a single function call, leading to more
concise and readable code.

2) Default Aesthetics

- Matplotlib - basic default color schemes.

- Seaborn - more visually appealing plots by default, with built-in themes.

3) Statistical Visualizations
- Matplotlib - a limited set of built-in statistical plots (for example plt.hist() and
plt.boxplot()).
- Seaborn - specializes in statistical visualizations and offers a wide array of built-in
statistical plotting functions.
Matplotlib's plt.boxplot() computes the quartiles itself, but it expects the values already
split into one array per group, so grouping by a category is a manual step.
In Seaborn, a grouped box plot takes one function call on a DataFrame, for example
sns.boxplot(x='category', y='y', data=df).
Matplotlib is powerful for general plotting, but it has few high-level statistical functions
(such as regression fits or pair plots), so those need more manual calculation and code.
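
To make the box plot comparison concrete, here is a minimal sketch that draws the same grouped box plot with both libraries. The DataFrame `df` and its `category` and `y` columns are invented for illustration and are not part of the original notes.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; drop this line in a notebook
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Hypothetical sample data, invented for illustration
df = pd.DataFrame({
    "category": ["A", "A", "A", "B", "B", "B"],
    "y": [1.0, 2.0, 3.0, 4.0, 6.0, 8.0],
})

fig, (ax_mpl, ax_sns) = plt.subplots(1, 2, figsize=(8, 4))

# Matplotlib: split the values per category yourself, then plot the groups
groups = [grp["y"].to_numpy() for _, grp in df.groupby("category")]
ax_mpl.boxplot(groups)
ax_mpl.set_xticks([1, 2])
ax_mpl.set_xticklabels(["A", "B"])
ax_mpl.set_title("Matplotlib")

# Seaborn: the grouping is derived from the column names
sns.boxplot(x="category", y="y", data=df, ax=ax_sns)
ax_sns.set_title("Seaborn")
```

In an interactive session, finish with plt.show() to display the figure.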

4) Integration with Pandas

- Both integrate well with Pandas: Seaborn takes a DataFrame through its data parameter, and Matplotlib accepts DataFrame columns directly.

5) Customization

- Matplotlib - extensive customization options (axes, titles, etc.).

- Seaborn - offers customization options, but they are generally more limited than
Matplotlib's and less flexible for detailed adjustments.

Can you explain a scenario where you would choose Seaborn over Matplotlib?

Answer: I would choose Seaborn when I need statistical plots, such as pair plots or
violin plots, to visualize relationships and distributions quickly. Seaborn
simplifies the process with built-in themes and better default aesthetics, allowing me to
focus on the analysis rather than customization.

What are some common plot types available in Matplotlib?

Answer: Common plot types in Matplotlib include line plots, scatter plots, bar charts,
histograms, pie charts, box plots, and error-bar plots. These cover a wide range of
visualization needs, from simple trends to complex distributions.

How can you customize the appearance of a plot in Matplotlib?

Answer: Customization in Matplotlib is done through various functions. You can change
the color and style of lines, adjust marker types, set titles and labels, customize axis limits,
and modify ticks. For example, plt.title(), plt.xlabel(), and plt.ylabel() set the
title and axis labels of a plot.

Matplotlib-Specific Questions

5. How do you save a plot created with Matplotlib?

o Answer: You can save a plot with plt.savefig("filename.png"). The file extension
selects the format, such as PNG, JPG, PDF, or SVG. You can also pass dpi (for example
dpi=300) for a higher-resolution raster image. Call savefig() before plt.show(), since the
figure may be empty once the plot window has been closed.
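
As a minimal sketch of saving a figure, the snippet below writes the same plot as a PNG and as a PDF. The temporary directory and the file names are hypothetical choices for illustration.

```python
import os
import tempfile

import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no window is needed
import matplotlib.pyplot as plt

fig, ax = plt.subplots()
ax.plot([0, 1, 2], [0, 1, 4])
ax.set_title("Saved figure")

# Hypothetical output location inside a temporary directory
out_dir = tempfile.mkdtemp()
png_path = os.path.join(out_dir, "plot.png")
pdf_path = os.path.join(out_dir, "plot.pdf")

# The extension selects the format; dpi matters for raster formats such as PNG
fig.savefig(png_path, dpi=300, bbox_inches="tight")
fig.savefig(pdf_path)
```

Calling savefig() on the Figure object rather than through plt makes it explicit which figure is written when several are open.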

6. What are subplots, and how do you create them in Matplotlib?

o Answer: Subplots allow you to create multiple plots in a single figure. You can create
them with plt.subplot(nrows, ncols, index), which adds one Axes at a time at the given grid
position, or with plt.subplots(nrows, ncols), which returns a figure and an array of Axes in
one call. For example, plt.subplots(2, 2) creates a 2x2 grid of subplots.

7. How can you display multiple plots in one figure?

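o Answer: You can display multiple plots in one figure using subplots. For example:

import numpy as np
import matplotlib.pyplot as plt

# Sample data (illustrative values)
x = np.arange(5)
y1, y2, y3 = x, x ** 2, x + 1
y4 = [1, 2, 2, 3, 3, 3]

fig, axs = plt.subplots(2, 2) # 2x2 grid of Axes
axs[0, 0].plot(x, y1) # Line plot
axs[0, 1].scatter(x, y2) # Scatter plot
axs[1, 0].bar(x, y3) # Bar chart
axs[1, 1].hist(y4) # Histogram
plt.show()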

8. What is the purpose of the figure() function in Matplotlib?

o Answer: plt.figure() creates a new Figure object, the top-level container that holds one
or more Axes. Its arguments set the size (figsize), resolution (dpi), and background color
(facecolor) of the figure. It is useful for organizing multiple plots in one window and for
setting figure-wide properties.
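
A small sketch of the figure-level settings mentioned above; the specific size, dpi, and color values are arbitrary examples.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no window is needed
import matplotlib.pyplot as plt

# Create a Figure with explicit size (inches), resolution, and background color
fig = plt.figure(figsize=(6, 4), dpi=100, facecolor="lightgray")

# A Figure is only a container; plots are drawn on an Axes added to it
ax = fig.add_subplot(1, 1, 1)
ax.plot([1, 2, 3], [2, 4, 8])

width_in, height_in = fig.get_size_inches()
```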

Seaborn-Specific Questions

9. What is a pair plot, and when would you use it?

o Answer: A pair plot is a grid of plots covering every pair of numeric variables in a
dataset: scatter plots off the diagonal and each variable's own distribution on the
diagonal. In Seaborn it is created with sns.pairplot(). It is useful for spotting
correlations and comparing distributions across several variables at once.

10. How do you handle categorical data in Seaborn?

o Answer: Seaborn provides several functions to visualize categorical data, such as
sns.boxplot(), sns.violinplot(), and sns.countplot(). These functions allow for visual
comparisons across different categories, making it easy to understand distributions and
relationships.

11. What is the purpose of the hue parameter in Seaborn?

o Answer: The hue parameter in Seaborn colors the data points according to the values of a
variable, usually a categorical one. This lets you differentiate between groups within the
same plot, making it easier to observe relationships.

12. How can you create a heatmap in Seaborn?

o Answer: You can create a heatmap in Seaborn using the sns.heatmap() function. It draws
a 2D matrix of values as a color-coded grid, which is useful for displaying correlations
or frequencies.
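
As an illustration of the correlation use case, this sketch computes a correlation matrix with Pandas and passes it to sns.heatmap(). The data and column names are invented for the example.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no window is needed
import numpy as np
import pandas as pd
import seaborn as sns

# Hypothetical numeric data
rng = np.random.default_rng(1)
df = pd.DataFrame({
    "a": rng.normal(size=100),
    "b": rng.normal(size=100),
    "c": rng.normal(size=100),
})

# Pairwise Pearson correlations between the columns (a 3x3 matrix)
corr = df.corr()

# annot=True writes each value into its cell; vmin/vmax fix the color scale
ax = sns.heatmap(corr, annot=True, cmap="coolwarm", vmin=-1, vmax=1)
```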

Practical Questions

13. Given a dataset, how would you visualize the distribution of a numeric variable?
o Answer: I would use a histogram to visualize the distribution. In Matplotlib, I would
use plt.hist(data), or in Seaborn, I would use sns.histplot(data) to quickly plot the
distribution, passing kde=True to overlay a density curve if needed.

14. How would you visualize the relationship between two continuous variables?

o Answer: I would use a scatter plot for this purpose. In Matplotlib, I would use
plt.scatter(x, y), or in Seaborn, I could use sns.scatterplot(x='variable1', y='variable2',
data=data) to visualize the relationship and observe patterns.

15. Can you write code to create a bar chart using either library?

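import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

# Build a DataFrame so the column names can be passed to Seaborn
data = pd.DataFrame({'categories': ['A', 'B', 'C'], 'values': [10, 20, 15]})

sns.barplot(x='categories', y='values', data=data) # One bar per category
plt.title('Bar Chart Example')
plt.show()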

Scenario-Based Questions

16. If your plots are cluttered and hard to read, what steps would you take to improve them?

o Answer: I would simplify the plot by reducing the number of elements, using fewer
colors, and ensuring adequate spacing. I would also adjust the size of the plot, add
labels and legends for clarity, and consider using faceting to break down the data
into smaller visualizations.

17. How would you visualize time series data?

o Answer: For time series data, I would typically use a line plot to visualize trends over
time. In Matplotlib, I would use plt.plot(x_dates, y_values), or in Seaborn, I could use
sns.lineplot(x='date', y='value', data=data) to visualize the data and include error
bands if necessary.

PANDAS & NUMPY

Difference between Pandas & NumPy

19. What are the differences between a Python list and a NumPy array?

o Answer: Key differences include:

- NumPy arrays are homogeneous (all elements of the same type), while lists
can contain mixed types.
- NumPy arrays use memory more efficiently and run element-wise operations faster,
thanks to an optimized C implementation.
- NumPy offers a wide range of mathematical operations that are not available
for lists.
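
The differences listed above can be seen in a few lines. This is an illustrative sketch with made-up values.

```python
import numpy as np

py_list = [1, 2, 3]
arr = np.array([1, 2, 3])

# The same operator means different things for the two types
doubled_list = py_list * 2   # repetition: [1, 2, 3, 1, 2, 3]
doubled_arr = arr * 2        # element-wise arithmetic: array([2, 4, 6])

# Lists keep mixed types as they are; arrays convert to one common dtype
mixed_list = [1, 2.5, "three"]
mixed_arr = np.array([1, 2.5, 3])   # the integers are upcast to float64
```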

Intermediate Questions

18. What are some common functions in NumPy?

o Answer: Common functions include:

1. np.mean(): Computes the average.

2. np.median(): Computes the median.

3. np.std(): Computes the standard deviation (the population value by default; pass ddof=1 for the sample value).

4. np.sum(): Sums the elements.

5. np.dot(): Computes the dot product of two arrays.
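
A quick sketch of these functions on two small arrays, with values chosen so the results are easy to check by hand.

```python
import numpy as np

a = np.array([1.0, 2.0, 3.0, 4.0])
b = np.array([4.0, 3.0, 2.0, 1.0])

mean = np.mean(a)               # (1 + 2 + 3 + 4) / 4 = 2.5
median = np.median(a)           # midpoint of 2 and 3 = 2.5
std_pop = np.std(a)             # population std: sqrt(5 / 4)
std_sample = np.std(a, ddof=1)  # sample std: sqrt(5 / 3)
total = np.sum(a)               # 10.0
dot = np.dot(a, b)              # 1*4 + 2*3 + 3*2 + 4*1 = 20.0
```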

4. What is broadcasting in NumPy?

o Answer: Broadcasting is the mechanism that lets NumPy perform arithmetic operations on
arrays of different shapes. Shapes are compared from the last dimension backwards, and two
dimensions are compatible when they are equal or one of them is 1. NumPy then treats the
smaller array as if it were stretched across the larger one, without actually copying the
data.
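
The sketch below shows two shapes that broadcast against a 2x3 matrix and one that does not. The arrays are illustrative.

```python
import numpy as np

matrix = np.array([[1, 2, 3],
                   [4, 5, 6]])        # shape (2, 3)
row = np.array([10, 20, 30])          # shape (3,)
col = np.array([[100], [200]])        # shape (2, 1)

# (2, 3) + (3,): the row is applied to every row of the matrix
plus_row = matrix + row   # [[11, 22, 33], [14, 25, 36]]

# (2, 3) + (2, 1): the column is applied to every column of the matrix
plus_col = matrix + col   # [[101, 102, 103], [204, 205, 206]]

# (2, 3) + (2,): trailing dimensions 3 and 2 differ and neither is 1
try:
    matrix + np.array([1, 2])
    incompatible = False
except ValueError:
    incompatible = True
```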

Advanced Questions

7. How do you handle missing data in a NumPy array?

o Answer: You can represent missing values with np.nan. This requires a floating-point
array, because integer arrays cannot store NaN. Functions like np.nanmean() compute the
mean while ignoring NaN values, whereas np.mean() returns nan if any element is NaN.

o To check for missing values: missing_mask = np.isnan(data)
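
A minimal sketch of the NaN handling described above. Filling the gaps with the mean is just one possible strategy, shown here as an assumption rather than a recommendation.

```python
import numpy as np

data = np.array([1.0, np.nan, 3.0, np.nan, 5.0])

missing_mask = np.isnan(data)        # True where a value is missing
n_missing = int(missing_mask.sum())  # 2

plain_mean = np.mean(data)           # nan, because NaN propagates
nan_mean = np.nanmean(data)          # (1 + 3 + 5) / 3 = 3.0

# Replace the missing entries with the NaN-aware mean
filled = np.where(missing_mask, nan_mean, data)
```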

NumPy vs Pandas

NumPy:

- Primarily uses arrays (ndarray), which are homogeneous (all elements of the same type).

- Primarily used for numerical computations.

- Generally faster for purely numerical computations.

Pandas:

- Built on top of NumPy.

- Uses Series (1D) and DataFrames (2D); a DataFrame can hold a different data type in each
column (e.g., integers, floats, strings).

- Designed for data manipulation and analysis, particularly for tabular data (like spreadsheets
or SQL tables).

- May be slower than NumPy for purely numerical operations.
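
To illustrate the contrast, this sketch builds one homogeneous array and one DataFrame with mixed column types. The column names and values are invented.

```python
import numpy as np
import pandas as pd

# NumPy: one dtype for the whole array
arr = np.array([[1, 2], [3, 4]])

# Pandas: each column keeps its own dtype and has a label
df = pd.DataFrame({
    "name": ["a", "b"],
    "score": [1.5, 2.5],
    "count": [1, 2],
})

col_mean = df["score"].mean()   # (1.5 + 2.5) / 2 = 2.0

# Numeric columns can be handed to NumPy as a plain array
as_array = df[["score", "count"]].to_numpy()
```

When the float and integer columns are combined into one array, the integers are upcast so that the array again has a single dtype.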

SciPy is a library that extends NumPy's capabilities with routines for scientific and
numerical computing, such as optimization, integration, interpolation, and linear algebra.

Statsmodels is a Python module that provides classes and functions for the estimation of many
different statistical models, as well as for conducting statistical tests, and statistical data exploration.

SERIES

What is a Pandas Series?

Answer: A Pandas Series is a one-dimensional labeled array that can hold data of any type
(integers, floats, strings, etc.) and is associated with an index. Like a list it is ordered,
and like a dictionary its values can be looked up by label, but it comes with additional
features for data manipulation and analysis.
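
A brief sketch of creating and querying a Series; the labels and values are made up for illustration.

```python
import pandas as pd

# A Series with an explicit index of labels
s = pd.Series([10, 20, 30], index=["a", "b", "c"])

by_label = s["b"]          # dictionary-style lookup: 20
by_position = s.iloc[0]    # list-style positional lookup: 10
above_15 = s[s > 15]       # boolean filtering keeps labels "b" and "c"

# A dictionary converts directly: keys become the index
from_dict = pd.Series({"x": 1.5, "y": 2.5})
```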

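If you have a Series with duplicate indices, how would you handle it?

Answer: First check for them with s.index.has_duplicates. You can then aggregate the values
that share a label with s.groupby(level=0), or keep only the first entry per label with
s[~s.index.duplicated()]. Note that drop_duplicates() removes duplicate values, not
duplicate index labels. If the labels carry no meaning, reset_index(drop=True) replaces
them with a fresh integer index.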
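
The options for duplicate index labels can be sketched as follows, using a small invented Series in which the label "a" appears twice.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4], index=["a", "b", "a", "c"])

has_dupes = s.index.has_duplicates   # True: "a" occurs twice

# Option 1: aggregate the values that share a label
summed = s.groupby(level=0).sum()    # a -> 4, b -> 2, c -> 4

# Option 2: keep only the first occurrence of each label
first_only = s[~s.index.duplicated(keep="first")]   # a -> 1, b -> 2, c -> 4

# Option 3: discard the labels and renumber from 0
renumbered = s.reset_index(drop=True)
```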

How would you handle missing values in a Pandas Series?

Answer: You can handle missing values using methods like fillna() to fill them with a specific
value, dropna() to remove them, or interpolate() to estimate them from neighboring values.
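
The three methods can be compared on one small Series. The values are illustrative, and the fill value of 0 is an arbitrary choice.

```python
import numpy as np
import pandas as pd

s = pd.Series([1.0, np.nan, 3.0, np.nan, 5.0])

filled = s.fillna(0)          # missing entries become 0
dropped = s.dropna()          # missing entries are removed
interpolated = s.interpolate()  # linear by default: gaps become 2.0 and 4.0
```

Which method fits depends on the data: interpolation assumes neighboring values are related, which suits ordered data such as time series.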
