Data Science Using Python
Introduction
Python Libraries
What is Python
• Python is an open-source, interpreted, high-level language with strong support for object-oriented
programming. It is one of the languages most widely used by data scientists for data science
projects and applications.
• For data science applications, Python offers an extensive set of libraries to work with, and it is open source.
• Python is a general-purpose programming language that is becoming increasingly popular for data science.
• Python is widely used in the scientific and research communities because it is easy to use and has a simple syntax,
which makes it easy to adopt for people who do not have a software engineering background.
Python Data Structures
Python Libraries For Data Science
• NumPy
• Keras
• SciPy
• Pandas
• scikit-learn
• Matplotlib
• Seaborn
• TensorFlow
NumPy
NumPy (Numerical Python) is the fundamental package for numerical computation in Python; it contains a powerful
N-dimensional array object. It has around 18,000 comments on GitHub and an active community of 700 contributors. It is a
general-purpose array-processing package that provides high-performance multidimensional objects called arrays and tools for
working with them. NumPy also partly addresses the slowness of pure-Python numerical code by providing these
multidimensional arrays together with functions and operators that work efficiently on them.
Features:
•Provides fast, precompiled functions for numerical routines
•Array-oriented computing for better efficiency
•Supports an object-oriented approach
•Compact and faster computations with vectorization
Applications:
•Extensively used in data analysis
•Creates powerful N-dimensional arrays
•Forms the base of other libraries, such as SciPy and scikit-learn
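As a rough illustration of the vectorization and N-dimensional array points above, the sketch below computes a sum of squares once with an explicit Python loop and once with a single NumPy expression. The function names and sample values are made up for this example and are not part of NumPy.

```python
import numpy as np


def squared_sum_loop(xs):
    """Sum of squares with an explicit Python loop (one element at a time)."""
    total = 0.0
    for x in xs:
        total += x * x
    return total


def squared_sum_vectorized(xs):
    """Same result, but the loop runs inside NumPy's precompiled routines."""
    return float(np.sum(xs * xs))


# Hypothetical sample data for illustration only.
small = np.array([1.0, 2.0, 3.0])
loop_result = squared_sum_loop(small)
vec_result = squared_sum_vectorized(small)

# A 2 x 3 array (two dimensions); axis=0 sums down each column.
matrix = np.arange(6).reshape(2, 3)
col_sums = matrix.sum(axis=0)

print(loop_result, vec_result)
print(col_sums)
```

Both functions return the same value; on large arrays the vectorized form is usually much faster because the per-element work is not done by the Python interpreter.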
SciPy
SciPy (Scientific Python) is another free and open-source Python library for data science, used mainly for
high-level computations. SciPy has around 19,000 comments on GitHub and an active community of about 600 contributors. It is
widely used for scientific and technical computing because it extends NumPy and provides many user-friendly and
efficient routines for scientific calculations.
Features:
•Collection of algorithms and functions built on the NumPy extension of Python
•High-level commands for data manipulation and visualization
•Multidimensional image processing with the SciPy ndimage submodule
•Includes built-in functions for solving differential equations
Applications:
•Multidimensional image operations
•Solving differential equations and computing Fourier transforms
•Optimization algorithms
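Two of the applications listed above, solving a differential equation and running an optimization, can be sketched with `scipy.integrate.solve_ivp` and `scipy.optimize.minimize`. The equation, the objective function, and the tolerances below are arbitrary choices made for illustration.

```python
import numpy as np
from scipy.integrate import solve_ivp
from scipy.optimize import minimize


# Differential equation dy/dt = -2*y with y(0) = 1.
# Its exact solution is y(t) = exp(-2*t), which lets us check the result.
def decay(t, y):
    return -2.0 * y


solution = solve_ivp(decay, t_span=(0.0, 1.0), y0=[1.0], rtol=1e-8, atol=1e-10)
y_end = solution.y[0, -1]  # numerical value of y at t = 1


# Optimization: find the x that minimizes (x - 3)^2 + 1 (the minimum is at x = 3).
def objective(x):
    return (x[0] - 3.0) ** 2 + 1.0


result = minimize(objective, x0=[0.0])
x_min = result.x[0]

print(y_end, np.exp(-2.0))
print(x_min)
```

The printed value of `y_end` should agree closely with `exp(-2)`, and `x_min` should be close to 3.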
TensorFlow
TensorFlow is a library for high-performance numerical computation, with around 35,000 comments and a vibrant
community of around 1,500 contributors. It is used across various scientific fields. TensorFlow is essentially a
framework for defining and running computations that involve tensors, which are partially defined computational
objects that eventually produce a value.
Features:
•Better computational graph visualizations
•Reduces error by 50 to 60 percent in neural machine learning
•Parallel computing to execute complex models
•Seamless library management backed by Google
•Quicker updates and frequent new releases to provide you with the latest features
Some Additional Libraries
• Data exploration: finding out more about the data we hold.
• Data munging: cleaning the data and transforming it so that it suits statistical modelling.
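As one possible illustration of these two steps, the sketch below uses Pandas (listed earlier among the libraries) on a small made-up table. The column names, the values, and the choice of filling missing ages with the median are assumptions for this example only.

```python
import numpy as np
import pandas as pd

# Hypothetical toy dataset with one missing value, one duplicated row,
# and inconsistent text formatting.
raw = pd.DataFrame({
    "age": [25, 32, np.nan, 32],
    "city": ["Pune", "delhi ", "Mumbai", "delhi "],
})

# Data exploration: size of the table, missing values, summary statistics.
n_rows, n_cols = raw.shape
missing_per_column = raw.isna().sum()
summary = raw.describe()

# Data munging: remove duplicates, tidy the text, fill in missing ages.
clean = raw.drop_duplicates().copy()
clean["city"] = clean["city"].str.strip().str.title()
clean["age"] = clean["age"].fillna(clean["age"].median())

print(missing_per_column)
print(clean)
```

Whether to fill, drop, or flag missing values depends on the modelling task; the median fill here is only one common option.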
Data Science vs Data Analytics
Conclusion
• Python is an excellent tool for data science and is becoming an increasingly popular language among
data scientists. The reason is that it is easy to read and integrates well with databases and with tools such as
Spark and Hadoop. Above all, it offers strong computational power and a rich set of data analytics
libraries.
Submitted By:-
20AR1A0592