0% found this document useful (0 votes)

98 views

Python Pandas Presentation

Uploaded by

prakharsharma2208

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

98 views

Python Pandas Presentation

Uploaded by

prakharsharma2208

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 32

PANDAS FOR

PYTHON
By:
Aayushi Pathak, Bhauk Yadav, Abhijeet, Srishti Jain, Praveen Shahani
Table of contents

01 Basic Understaning of Pandas

02 Pandas for Data Analysis

03 Broadway Theatre Example Using Pandas

Introduction

• Pandas is a popular open-source Python

library for data manipulation and analysis. It
provides data structures and functions that
enable users to work with structured data
efficiently
01
Basics of Pandas
• Pandas can be easily installed
using pip, which is a package
manager for Python. To install
pandas, you can run the
following command in the
terminal

• Once installed, you can import

the library into your Python
program using the following
code
Working with Series in Pandas

• A Series is a one-
dimensional labeled array
that can hold any data type. OUTPUT
• We can also provide
custom labels for the
Series using the index OUTPUT
parameter
Working with DataFrames in Pandas

• A Data Frame is a two-

dimensional table-like data
structure with labeled rows
and columns. Here is an
example of how to create a
DataFrame in Pandas
OUTPUT
Pivoting Data Frame

It is used to reshape a
given data frame
organized by given index/
column values. It does not
support data aggregation,
multiple values will result
in a multi index in the
columns. OUTPUT
Descriptive Statistics Using Pandas
• Descriptive statistics are
brief informational
coefficients that summarize
a given data set
• They are broken down into
measures of central
tendency and measures of
variability (spread)
• Measures of central
tendency include the mean,
median, and mode, while
measures of variability
include standard deviation,
variance, minimum and
maximum variables.
We can use df.describe() it will also give all the measures mentioned

OUTPUT
02
Pandas for Data Analysis
Steps Covered

 Importing the Data

 Data Manipulation
 Data Exploration
 Data Reindexing and Altering
 Data Cleaning
Importing Data

• The first step in data cleaning is to import the data into Pandas. Pandas provides several functions to
read different types of data, such as CSV, Excel, SQL, and more.
Data Exploration

• Before cleaning the data,

it is important to explore
the data and identify any
potential issues. Pandas
provides several functions
to explore the data, such
as head(), tail(), info(),
describe(), and more.
Data Cleaning

• Once we have explored

the data and identified any
potential issues, we can
start cleaning the data.
Pandas provides several
functions for data
cleaning, such as
dropna(), fillna(),
replace(), and more.
01 02
Filtering Sorting

Data Manipulation

03 04
Merging Grouping
Data Filtering

• Filtering is the process of

selecting a subset of data
based on specific
conditions. Pandas
provides several functions
for filtering data, such as
loc(), iloc(), and query().
Data Sorting

• Sorting is the process of

arranging the data in a
specific order based on one
or more columns. Pandas
provides a sort_values()
function for sorting data.
Data Merging

• Merging is the process of

combining two or more
Data Frames into a single
DataFrame based on a
common column. Pandas
provides a merge()
function for merging data.
Data Grouping

• Grouping is the process of

grouping the data based
on one or more columns
and then applying a
function to each group.
Pandas provides a
groupby() function for
grouping data.
Data Reindexing
Reindexing in Pandas can be used to change the index of rows and columns of a DataFrame.

Step: Firstly, make a data table in python

OUTPUT

– Here (name, marks and course are the column names and (1,2,3,4,5) are the rows name.
Reindexing the Rows
One can reindex a single row or multiple rows by using reindex() method. Default values in the
new index that are not present in the dataframe are assigned NaN.

– Here in reindexing the rows only the place is being changed here from (1st position to 2nd or 3rd).
Don’t think you can change the row name using it.
– We can reindex a single column or multiple columns by using reindex() method and by specifying
the axis we want to reindex. Default values in the new index that are not present in the dataframe are
assigned NaN.
Reindexing the Columns
We can reindex a single column or multiple columns by using reindex() method and by specifying
the axis we want to reindex. Default values in the new index that are not present in the dataframe
are assigned NaN.

– Use ffill() function to fill the missing values along the index axis.
– When ffill() is applied across the index then any missing value is filled based on the corresponding
value in the previous row.
– Here we just make a DataFrame –Firstly, we will fill this
with some missing values and missing value using the –Now we will fill the NaN
these values is denoted by NaN. index axis. value using column axis
Altering/Rename Column Labels
Using Rename() Function: One way of renaming the columns in a Pandas Dataframe is by
using the rename() function. This method is quite useful when we need to rename some selected
columns because we need to specify information only for the columns which are to be renamed.
– Rename Column name using
– By Assigning a list of new column names
DataFrameset_axis() Function
03
Broadway Theatre Example Using
Pandas
https://colab.research.google.com/drive/1HDKICQU0foyTdIHHkFyNQTUxrNzlfK9l?
usp=sharing
THANK YOU

Fiber Optic Cable Plant Documentation PDF
No ratings yet
Fiber Optic Cable Plant Documentation PDF
6 pages
Accenture Pov Manufacturing Digital Final
No ratings yet
Accenture Pov Manufacturing Digital Final
20 pages
Chapter 1 ITSM
No ratings yet
Chapter 1 ITSM
22 pages
CRM
No ratings yet
CRM
4 pages
Microwave Presentation
No ratings yet
Microwave Presentation
12 pages
8DAX in Power BI For Market Basket Analysis and Sales Data Analysis
No ratings yet
8DAX in Power BI For Market Basket Analysis and Sales Data Analysis
29 pages
Python: An Introduction Python: An Introduction
100% (1)
Python: An Introduction Python: An Introduction
82 pages
Vedant Metaverse
No ratings yet
Vedant Metaverse
12 pages
Anomaly Detection: Course: Data Mining II
No ratings yet
Anomaly Detection: Course: Data Mining II
12 pages
Supply Chain Management
No ratings yet
Supply Chain Management
2 pages
Python + MongoDB
No ratings yet
Python + MongoDB
12 pages
Library Management - A Survey
No ratings yet
Library Management - A Survey
6 pages
of Restaurant
100% (1)
of Restaurant
14 pages
Metaverse
No ratings yet
Metaverse
15 pages
Extensible Markup Language
No ratings yet
Extensible Markup Language
38 pages
Blockchain and IoT Based Food Traceability For Smart Agriculture
No ratings yet
Blockchain and IoT Based Food Traceability For Smart Agriculture
6 pages
ML Lab Session 06 - VGG16-CNN
No ratings yet
ML Lab Session 06 - VGG16-CNN
15 pages
Parking Occupancy Detection Using cOMPUTER VISION
No ratings yet
Parking Occupancy Detection Using cOMPUTER VISION
14 pages
Netops
No ratings yet
Netops
81 pages
Instant Download 5G Mobile Core Network Design Deployment Automation and Testing Strategies 1st Edition Rajaneesh Sudhakar Shetty PDF All Chapters
100% (4)
Instant Download 5G Mobile Core Network Design Deployment Automation and Testing Strategies 1st Edition Rajaneesh Sudhakar Shetty PDF All Chapters
62 pages
Beaglebone Black
No ratings yet
Beaglebone Black
63 pages
Meta Search: Chilika Pujari
No ratings yet
Meta Search: Chilika Pujari
14 pages
Cryptography and Information Security
No ratings yet
Cryptography and Information Security
309 pages
Multi Gpu Programming With Mpi
No ratings yet
Multi Gpu Programming With Mpi
93 pages
DWDM Lecture Notes
No ratings yet
DWDM Lecture Notes
139 pages
IT543 - Tran Nguyen Quynh Tram - Project 6.1
0% (1)
IT543 - Tran Nguyen Quynh Tram - Project 6.1
5 pages
Fragmentation: Univ.-Prof. Dr. Peter Brezany Institut Für Scientific Computing Universität Wien
No ratings yet
Fragmentation: Univ.-Prof. Dr. Peter Brezany Institut Für Scientific Computing Universität Wien
17 pages
Lung Disease Detection Using X Rays: Under The Mentorship of
No ratings yet
Lung Disease Detection Using X Rays: Under The Mentorship of
39 pages
Project Management Professional (PMP) Certification Training
No ratings yet
Project Management Professional (PMP) Certification Training
10 pages
CH 1 Python Revision Tour - I
No ratings yet
CH 1 Python Revision Tour - I
60 pages
Python Guide PDF
100% (1)
Python Guide PDF
82 pages
ddb03 2
No ratings yet
ddb03 2
62 pages
Machine Learning
No ratings yet
Machine Learning
2 pages
EE698Z Machine Learning For Wireless Communication
No ratings yet
EE698Z Machine Learning For Wireless Communication
16 pages
17CS834 - SMS-Module2-Queueing Models (Chapter 2) Notes
No ratings yet
17CS834 - SMS-Module2-Queueing Models (Chapter 2) Notes
20 pages
(eBook PDF) Introduction to Data Mining 2nd Edition by Pang-Ning Tanpdf download
100% (8)
(eBook PDF) Introduction to Data Mining 2nd Edition by Pang-Ning Tanpdf download
51 pages
Writing Simple Automation Scripts With Python
No ratings yet
Writing Simple Automation Scripts With Python
3 pages
Real Time Analytics With Apache Kafka and Spark: @rahuldausa
No ratings yet
Real Time Analytics With Apache Kafka and Spark: @rahuldausa
54 pages
ITIL Management Overview
No ratings yet
ITIL Management Overview
23 pages
Sentiment Analysis
No ratings yet
Sentiment Analysis
30 pages
Python Coding by Solving African Problem Regis Nguessan
100% (1)
Python Coding by Solving African Problem Regis Nguessan
55 pages
Unit 1 Full Notes
No ratings yet
Unit 1 Full Notes
52 pages
DBMS - LAB Manual
No ratings yet
DBMS - LAB Manual
22 pages
Introduction To IT Project Management
No ratings yet
Introduction To IT Project Management
72 pages
Distributed System
100% (1)
Distributed System
119 pages
C-Api in Python
No ratings yet
C-Api in Python
162 pages
Data Warehouse Week 1
No ratings yet
Data Warehouse Week 1
78 pages
ISWA Unit1pptx 2023 08 28 19 47 11
No ratings yet
ISWA Unit1pptx 2023 08 28 19 47 11
47 pages
Multithreaded Programming Using Java Threads
No ratings yet
Multithreaded Programming Using Java Threads
33 pages
DWDM Unit 4
No ratings yet
DWDM Unit 4
22 pages
Data Mining Query Language
0% (1)
Data Mining Query Language
7 pages
Python Setup and Usage: Release 3.7.4rc1
No ratings yet
Python Setup and Usage: Release 3.7.4rc1
80 pages
5G Technology
No ratings yet
5G Technology
26 pages
Lecture 1 - Introduction To Information Systems
No ratings yet
Lecture 1 - Introduction To Information Systems
48 pages
Beyond 5G - Security in 6G Era-v2 Mr.Saro Velrajan
No ratings yet
Beyond 5G - Security in 6G Era-v2 Mr.Saro Velrajan
40 pages
AI Program-Simplilearn
No ratings yet
AI Program-Simplilearn
27 pages
Coordination and Agreement Distributed Systems Designs and Concept
No ratings yet
Coordination and Agreement Distributed Systems Designs and Concept
63 pages
Honours in Artificial Intelligence and Machine Learning: Board of Studies (Computer Engineering)
No ratings yet
Honours in Artificial Intelligence and Machine Learning: Board of Studies (Computer Engineering)
16 pages
Case Study eKYC Solution New PDF
No ratings yet
Case Study eKYC Solution New PDF
2 pages
Emerging Technologies in Information and Communications Technology
From Everand
Emerging Technologies in Information and Communications Technology
Fouad Sabry
No ratings yet
Drive testing The Ultimate Step-By-Step Guide
From Everand
Drive testing The Ultimate Step-By-Step Guide
Gerardus Blokdyk
No ratings yet
LESSON 10 Current and Future Trends of Media and Information
No ratings yet
LESSON 10 Current and Future Trends of Media and Information
9 pages
Giving Technical and Operational Definition
No ratings yet
Giving Technical and Operational Definition
3 pages
6MD66xx Manual PIXIT A5 V047100 en PDF
No ratings yet
6MD66xx Manual PIXIT A5 V047100 en PDF
110 pages
Programming in C: Pointers and Arrays
No ratings yet
Programming in C: Pointers and Arrays
16 pages
CG Question BANK PDF
No ratings yet
CG Question BANK PDF
5 pages
Tesda Portfolio
No ratings yet
Tesda Portfolio
107 pages
Chapter 3 Network Design and VPN Technologies
No ratings yet
Chapter 3 Network Design and VPN Technologies
9 pages
HP Festive OFFER 2020: WWW - Redeemnow.in/hpfestiveoffer
No ratings yet
HP Festive OFFER 2020: WWW - Redeemnow.in/hpfestiveoffer
12 pages
6.3.1.8 Packet Tracer - Exploring Internetworking Devices
100% (8)
6.3.1.8 Packet Tracer - Exploring Internetworking Devices
6 pages
DBMS Practicals Sem 3 Mca Idol - Shree Ram College
100% (3)
DBMS Practicals Sem 3 Mca Idol - Shree Ram College
34 pages
Entity-Relationship Diagram (ERD)
No ratings yet
Entity-Relationship Diagram (ERD)
40 pages
The Applications of Chemical Engineering Simulation Software
100% (1)
The Applications of Chemical Engineering Simulation Software
9 pages
Class 12 Computer Science Solved Sample Paper 1 - 2012
No ratings yet
Class 12 Computer Science Solved Sample Paper 1 - 2012
18 pages
Reset Guide
No ratings yet
Reset Guide
2 pages
Active Directory Architecture
No ratings yet
Active Directory Architecture
3 pages
BTPT ONU Configuration Guide: This Guidance Document Is Applicable To BT-BCM6838 Series Models ONU Equipment of BTPT
No ratings yet
BTPT ONU Configuration Guide: This Guidance Document Is Applicable To BT-BCM6838 Series Models ONU Equipment of BTPT
10 pages
Environment Specific Extensions Activity Guide - 042024
No ratings yet
Environment Specific Extensions Activity Guide - 042024
19 pages
Netbox
No ratings yet
Netbox
22 pages
Spark Concept
No ratings yet
Spark Concept
18 pages
Ultimate How To Bluetooth Swift With Hardware in 20 Minutes
No ratings yet
Ultimate How To Bluetooth Swift With Hardware in 20 Minutes
47 pages
5G: New Air Interface and Radio Access Virtualization: Huawei White Paper Ȕ April 2015
No ratings yet
5G: New Air Interface and Radio Access Virtualization: Huawei White Paper Ȕ April 2015
11 pages
MC-10181336-0001 Mclaren 12C Reprograming
No ratings yet
MC-10181336-0001 Mclaren 12C Reprograming
12 pages
1646 PDF
No ratings yet
1646 PDF
244 pages
Embedded I/O Modules: Leverage Proven Design Technology at A Lower Cost
No ratings yet
Embedded I/O Modules: Leverage Proven Design Technology at A Lower Cost
3 pages
Timer Bts BSC
No ratings yet
Timer Bts BSC
50 pages
DDL and DML Commands in SQL
No ratings yet
DDL and DML Commands in SQL
14 pages
SQF NS2
No ratings yet
SQF NS2
6 pages
SEMDISPLAY LP SubhamDas
No ratings yet
SEMDISPLAY LP SubhamDas
13 pages
CON - Q1 - Weekly Plan & Report - ALEM - W43
No ratings yet
CON - Q1 - Weekly Plan & Report - ALEM - W43
90 pages

Python Pandas Presentation

Uploaded by

Python Pandas Presentation

Uploaded by

PANDAS FOR

01 Basic Understaning of Pandas

02 Pandas for Data Analysis

03 Broadway Theatre Example Using Pandas

• Pandas is a popular open-source Python

• Once installed, you can import

• A Data Frame is a two-

 Importing the Data

• Before cleaning the data,

• Once we have explored

• Filtering is the process of

• Sorting is the process of

• Merging is the process of

• Grouping is the process of

Step: Firstly, make a data table in python

You might also like