Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
7 views

Python Libraries For Data Science

The document discusses several popular Python libraries for data science including NumPy, SciPy, Pandas, Scikit-Learn, matplotlib, and Seaborn. It provides brief descriptions and links for each library.

Uploaded by

Meetanshi Gor
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views

Python Libraries For Data Science

The document discusses several popular Python libraries for data science including NumPy, SciPy, Pandas, Scikit-Learn, matplotlib, and Seaborn. It provides brief descriptions and links for each library.

Uploaded by

Meetanshi Gor
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

Python Libraries for

Data Science

MD Arshad Ahmad
15 Years+ Experience in Data Science
Mentored 100+ people
Many popular Python
toolboxes/libraries:

• NumPy

• SciPy

• Pandas
• SciKit-Learn

Visualization libraries

• matplotlib• Seaborn

3
Python Libraries for Data
Science

NumPy:
▪ introduces objects for
multidimensional arrays and matrices,
as well as functions that allow to
easily perform advanced mathematical
and statistical operations on those
objects

▪ provides vectorization of
mathematical operations on arrays
and matrices which significantly
improves the performance

▪ many other python libraries are built on


NumPy

Link: http://www.numpy.org/

Python Libraries for Data


Science
SciPy:
▪ collection of algorithms for linear
algebra, differential equations,
numerical integration, optimization,
statistics and more

▪ part of SciPy Stack

▪ built on NumPy
Link: https://www.scipy.org/scipylib/

Python Libraries for Data


Science
Pandas:
▪ adds data structures and tools
designed to work with table-like data
(similar to Series and Data Frames in
R)

▪ provides tools for data


manipulation: reshaping, merging,
sorting, slicing, aggregation etc.

▪ allows handling missing data


Link: http://pandas.pydata.org/

Python Libraries for Data


Science
SciKit-Learn:
▪ provides machine learning
algorithms: classification, regression,
clustering, model validation etc.

▪ built on NumPy, SciPy and matplotlib

Link: http://scikit-learn.org/
7

Python Libraries for Data


Science
matplotlib:
▪ python 2D plotting library which
produces publication quality figures
in a variety of hardcopy formats

▪ a set of functionalities similar to those of


MATLAB

▪ line plots, scatter plots, barcharts, histograms,


pie charts etc.

▪ relatively low-level; some effort


needed to create advanced visualization

Link: https://matplotlib.org/

Python Libraries for Data


Science
Seaborn:
▪ based on matplotlib

▪ provides high level interface for

drawing attractive statistical

graphics ▪ Similar (in style) to the


popular ggplot2 library in R

Link: https://seaborn.pydata.org/

You might also like