0% found this document useful (0 votes)

18 views

Module4 DataAnalyticsLanguages

Uploaded by

Bhumika Kukade

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views

Module4 DataAnalyticsLanguages

Uploaded by

Bhumika Kukade

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 30

Module 4:

Data Analytics Languages--

Python

31/07/2024 Slide 1
History

• Python created by Guido van Rossum in the

Netherlands in 1990
• Popular programming language
• Widely used in industry and academia
• Simple, intuitive syntax
• Rich library
• Two versions in existence today Python 2 and
Python 3
eLahe Technologies 2020
31/07/2024 2
www.elahetech.com
Interpreted Language
• Python is an interpreted language as opposed
to being compiled
• An interpreter reads a high level program and
executes it
• A compiler translates the program into an
executable object code first which is
subsequently executed

eLahe Technologies 2020

31/07/2024 3
www.elahetech.com
Numpy

• NumPy is the fundamental package for scientific

computing with Python. It contains among other
things:
• a powerful N-dimensional array object
• sophisticated (broadcasting) functions
• tools for integrating C/C++ and Fortran code
• useful linear algebra, Fourier transform, and random
number capabilities

eLahe Technologies 2020

31/07/2024 4
www.elahetech.com
Matplotlib

• Matplotlib is a Python 2D plotting library

which produces publication quality figures in
a variety of hardcopy formats and interactive
environments across platforms.

eLahe Technologies 2020

31/07/2024 5
www.elahetech.com
pandas

• pandas is an open source, BSD-licensed

library providing high-performance, easy-to-
use data structures and data analysis tools
for Python

eLahe Technologies 2020

31/07/2024 6
www.elahetech.com
Python Regex

31/07/2024 Slide 7
Regular Expressions

In computing, a regular expression, also referred to as

"regex" or "regexp", provides a concise and flexible
means for matching strings of text, such as particular
characters, words, or patterns of characters. A regular
expression is written in a formal language
that can be interpreted by a regular expression
processor.

http://en.wikipedia.org/wiki/Regular_expression

31/07/2024 8
Python Regular Expressions
^ Matches the beginning of a line
$ Matches the end of the line
. Matches any character
\s Matches whitespace
\S Matches any non-whitespace character
* Repeats a character zero or more times
*? Repeats a character zero or more times (non-greedy)
+ Repeats a chracter one or more times
+? Repeats a character one or more times (non-greedy)
[aeiou] Matches a single character in the listed set
[^XYZ] Matches a single character not in the listed set
[a-z0-9] The set of characters can include a range
( Indicates where string extraction is to start
) Indicates where string extraction is to end

31/07/2024 9
The Regular Expression Module
• Before you can use regular expressions in your
program, you must import the library using
"import re"
• You can use re.search() to see if a string matches a
regular expression similar to using the find()
method for strings
• You can use re.findall() extract portions of a string
that match your regular expression similar to a
combination of find() and slicing: var[5:10]

31/07/2024 10
Wild-Card Characters

• The dot character matches any character

• If you add the asterisk character, the character is
"any number of times"
X-Sieve: CMU Sieve 2.3
X-DSPAM-Result: Innocent
X-DSPAM-Confidence: 0.8475 ^X.*:
X-Content-Type-Message-Body: text/plain

31/07/2024 11
Wild-Card Characters

• The dot character matches any character

• If you add the asterisk character, the character is
"any number of times"
Match the start of the line Many times
X-Sieve: CMU Sieve 2.3
X-DSPAM-Result: Innocent
X-DSPAM-Confidence: 0.8475 ^X.*:
X-Content-Type-Message-Body: text/plain
Match any character

31/07/2024 12
Wild-Card Characters

• Depending on how "clean" your data is and the

purpose of your application, you may want to
narrow your match down a bit
Match the start of the line Many times
X-Sieve: CMU Sieve 2.3
X-DSPAM-Result: Innocent
X-DSPAM-Confidence: 0.8475 ^X.*:
X-Content-Type-Message-Body: text/plain
Match any character

31/07/2024 13
Greedy Matching

• The repeat characters (* and +) push outward in both

directions (greedy) to match the largest possible string
One or more
>>> import re characters
>>> x = 'From: Using the : character'
>>> y = re.findall('^F.+:', x)
>>> print y
^F.+:
['From: Using the :']
First character in the Last character in the
Why not 'From:'? match is an F match is a :

31/07/2024 14
Non-Greedy Matching

• Not all regular expression repeat codes are greedy!

If you add a ? character - the + and * chill outOne
a bit...
or more
>>> import re characters but
>>> x = 'From: Using the : character' not greedily
>>> y = re.findall('^F.+?:', x)
>>> print y
^F.+?:
['From:']
First character in the Last character in the
match is an F match is a :

31/07/2024 15
Python Slicing

31/07/2024 Slide 16
String Slices
• >>>fruit = “apple”
• >>>fruit[1:3]
• >>>’pp’
• >>>fruit[1:]
• >>>’pple’
• >>>fruit[:4]
• >>>’appl’
• >>>fruit[:]
• >>>’apple’

31/07/2024 17
List Slices
• >>>b
• [3, 4, 5, 6]
• >>>b[0:3]
• [3,4,5]
• b[0:j] with j > 3 and b[0:] are same
• >>>b[:2]
• [3,4]

31/07/2024 18
List Slices
• >>>b[2:2]
• []
• b[i:j:k] is a subset of b[i:j] with elements
picked in steps of k
• >>>b=[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
• >>>b[0:10:3]
• [1, 4, 7]

31/07/2024 19
NumPy array slicing
• 1-d array slicing and indexing is similar to
Python lists
• import numpy as np
• arr1=np.array([1,2,5,6,4,3])
• arr1[2:4]=99

• arr1
• Out[8]: array([ 1, 2, 99, 99, 4, 3])
eLahe Technologies 2020
31/07/2024 20
www.elahetech.com
NumPy array slicing

• Slicing in ndarrays is different from Python lists in that

data is not copied
• Slices are views on the original array!
• arr2=arr1[2:4]

• arr2[0]=88

• arr1
• Out[13]: array([ 1, 2, 88, 99, 4, 3])

eLahe Technologies 2020

31/07/2024 21
www.elahetech.com
Sets

31/07/2024 Slide 22
in and notin
• >>>setA= {1,3,5,7}
• >>>3 in setA
• True
• >>>3 not in setA
• False
• >>>4 not in setA
• True

31/07/2024 23
Subset
• >>>setA= {1,3,5,7}
• >>>setB= {1, 3, 5, 7, 9}
• >>>setC = {1,3,5,9,10}
• >>>setA issubset setB
• True
• >>> setA issubset setC
• False

31/07/2024 24
Superset
• >>>setA= {1,3,5,7}
• >>>setB= {1, 3, 5, 7, 9}
• >>>setC = {1,3,5,9,10}
• >>>setA issuperset setB
• False
• >>> setB issuperset setA
• True
• >>> setC issuperset setA
• False

31/07/2024 25
Set Union

• >>>setA= {1,3,5,7}
• >>>setB= {7, 5, 9}
• >>>setA.union(setB)
• {1,3,5,7,9}
• >>>setA | setB
• {1, 3, 5, 7, 9}

31/07/2024 26
Set Intersection

• >>>setA= {1,3,5,7}
• >>>setB= {7, 5, 9}
• >>>setA.intersection(setB)
• {5,7}
• >>>setA & setB
• {5, 7}

31/07/2024 27
Dictionaries

31/07/2024 Slide 28
Dictionaries

>>>
• Lists index their entries >>> purse = dict() >>>purse['money'] =
12
based on the position >>> purse['candy'] = 3
in the list >>> purse['tissues'] = 75
>>> print(purse)
• Dictionaries are like {'money': 12, 'tissues': 75, 'candy': 3}
bags - no order >>> print(purse['candy'])
3
• So we index the things >>> purse['candy'] = purse['candy'] + 2
we put in the dictionary >>> print(purse)
{'money': 12, 'tissues': 75, 'candy': 5}
with a “lookup tag”
Comparing Lists and
Dictionaries
Dictionaries are like lists except that they use keys instead of
numbers to look up values

>>> lst = list() >>> ddd = dict()

>>> lst.append(21) >>> ddd['age'] = 21
>>> lst.append(183) >>> ddd['course'] = 182
>>> print(lst) >>> print(ddd)
[21, 183] {'course': 182, 'age': 21}
>>> lst[0] = 23 >>> ddd['age'] = 23
>>> print(lst) >>> print(ddd)
[23, 183] {'course': 182, 'age': 23}

CS-1039-Velocity Radar Systems
100% (1)
CS-1039-Velocity Radar Systems
16 pages
Sap-Taw12 Questions
100% (2)
Sap-Taw12 Questions
27 pages
Advanced C++ Interview Questions You'll Most Likely Be Asked
From Everand
Advanced C++ Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Python 201 - (Slightly) Advanced Python Topics
No ratings yet
Python 201 - (Slightly) Advanced Python Topics
69 pages
Python Day- 10
No ratings yet
Python Day- 10
11 pages
Python For Data Science
100% (1)
Python For Data Science
4 pages
Python Cheat Sheet Dataquest PDF
No ratings yet
Python Cheat Sheet Dataquest PDF
5 pages
Python Cheat Sheet Intermediate
No ratings yet
Python Cheat Sheet Intermediate
1 page
Python Cheatsheet
100% (1)
Python Cheatsheet
1 page
Day2.2 DataAnalyticsLanguages
No ratings yet
Day2.2 DataAnalyticsLanguages
100 pages
Python Programming Language
No ratings yet
Python Programming Language
10 pages
Learn Python and Automate Network Tasks: Build Your Own Apps
100% (1)
Learn Python and Automate Network Tasks: Build Your Own Apps
10 pages
Python Progr Module 3 - 6th EC by 21EC643
No ratings yet
Python Progr Module 3 - 6th EC by 21EC643
24 pages
Python Course Notes
No ratings yet
Python Course Notes
10 pages
Py Regex
No ratings yet
Py Regex
50 pages
MLPA 1-7 Chapter
No ratings yet
MLPA 1-7 Chapter
117 pages
Regular Expressions: Python For Everybody
No ratings yet
Regular Expressions: Python For Everybody
34 pages
Lecture 3-4 Regex
No ratings yet
Lecture 3-4 Regex
33 pages
17_Regular Expression
No ratings yet
17_Regular Expression
20 pages
Untitled
No ratings yet
Untitled
53 pages
Python For Economists
No ratings yet
Python For Economists
34 pages
Introduction to Python Programming
No ratings yet
Introduction to Python Programming
9 pages
Regular Expressions: Python For Everybody
No ratings yet
Regular Expressions: Python For Everybody
34 pages
Python Module-3 Notes (21EC646)_final
No ratings yet
Python Module-3 Notes (21EC646)_final
37 pages
Sundeep Agarwal Understanding Python Re Gex
No ratings yet
Sundeep Agarwal Understanding Python Re Gex
228 pages
Python Re
No ratings yet
Python Re
101 pages
5A - Regex
No ratings yet
5A - Regex
32 pages
X - Table of Contents
No ratings yet
X - Table of Contents
5 pages
Day3.3 StringManipulation
No ratings yet
Day3.3 StringManipulation
43 pages
Module 3 Regular Expressions
No ratings yet
Module 3 Regular Expressions
8 pages
UNIT4
No ratings yet
UNIT4
67 pages
Module3 RegularExpressions
No ratings yet
Module3 RegularExpressions
8 pages
Unit7_RegularExpressionpdf__2023_10_17_09_16_29
No ratings yet
Unit7_RegularExpressionpdf__2023_10_17_09_16_29
17 pages
Python Ultimate Guide
100% (1)
Python Ultimate Guide
10 pages
Slicing and Indexing
No ratings yet
Slicing and Indexing
16 pages
Data Science Report
No ratings yet
Data Science Report
126 pages
BITypes Notes
No ratings yet
BITypes Notes
7 pages
Python Refcard
100% (1)
Python Refcard
2 pages
Python Refcard
100% (5)
Python Refcard
2 pages
Unit-3 Python
No ratings yet
Unit-3 Python
72 pages
9Python-Simple-Character-Matches
No ratings yet
9Python-Simple-Character-Matches
19 pages
Regular Expressions
100% (1)
Regular Expressions
15 pages
Session22 To 24 PYTHON COLAB
No ratings yet
Session22 To 24 PYTHON COLAB
128 pages
PP_Module-3 Notes
No ratings yet
PP_Module-3 Notes
56 pages
A Winter Training Report On Automation Using Python
No ratings yet
A Winter Training Report On Automation Using Python
29 pages
Pythoncheatsheet: Dunder Methods
No ratings yet
Pythoncheatsheet: Dunder Methods
14 pages
Unit 2
No ratings yet
Unit 2
69 pages
Lec 2
No ratings yet
Lec 2
58 pages
13B RegExp
No ratings yet
13B RegExp
38 pages
Regular Exp
No ratings yet
Regular Exp
6 pages
Python - Slide 5
No ratings yet
Python - Slide 5
42 pages
A Winter Training Report On Automation Using Python
No ratings yet
A Winter Training Report On Automation Using Python
30 pages
Python Mid 1 Scheme
No ratings yet
Python Mid 1 Scheme
12 pages
Lec 06 - Regular Expression
No ratings yet
Lec 06 - Regular Expression
19 pages
Python Basics: Course Overview
No ratings yet
Python Basics: Course Overview
7 pages
SLIDES 1 Escp - Python - Ds - 2020
100% (1)
SLIDES 1 Escp - Python - Ds - 2020
50 pages
Python presentation 2
No ratings yet
Python presentation 2
27 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Simplifying Data Science With Python
From Everand
Simplifying Data Science With Python
Billy David millican
No ratings yet
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
From Everand
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
Tenko
No ratings yet
FAQ11
No ratings yet
FAQ11
5 pages
SQL Server DBA Interview Questions
No ratings yet
SQL Server DBA Interview Questions
7 pages
Erin Loan Repayment API
No ratings yet
Erin Loan Repayment API
6 pages
SAP Multi-Bank Connectivity - Solution Brief PDF
50% (2)
SAP Multi-Bank Connectivity - Solution Brief PDF
5 pages
1st Sem Finals Com Arch
No ratings yet
1st Sem Finals Com Arch
8 pages
ICT Project Proposal
No ratings yet
ICT Project Proposal
13 pages
LAb 5 Windows Forensics
No ratings yet
LAb 5 Windows Forensics
7 pages
Ariba Network IntegrationGuide11s1
No ratings yet
Ariba Network IntegrationGuide11s1
72 pages
5-Module 4-Cloud Environments - Case study_ One cloud service provider per service model-02-09-2024
No ratings yet
5-Module 4-Cloud Environments - Case study_ One cloud service provider per service model-02-09-2024
98 pages
Pricing - Condition Base Value
100% (1)
Pricing - Condition Base Value
5 pages
How To Install SCT Software
100% (2)
How To Install SCT Software
18 pages
Interactive Restaurant Website Project
No ratings yet
Interactive Restaurant Website Project
33 pages
Module 1
No ratings yet
Module 1
78 pages
VijayPolamarasetti FOR Telugu Calling and Tech Support
No ratings yet
VijayPolamarasetti FOR Telugu Calling and Tech Support
2 pages
Cyber Crime PDF
No ratings yet
Cyber Crime PDF
13 pages
Database design assignment 1 seemester esofta metro campus
No ratings yet
Database design assignment 1 seemester esofta metro campus
118 pages
Pencil User Manual 2.0
No ratings yet
Pencil User Manual 2.0
18 pages
SS7 Media Gateway: SVI - MG 1000
No ratings yet
SS7 Media Gateway: SVI - MG 1000
2 pages
C3_WordProcessor (1)
No ratings yet
C3_WordProcessor (1)
2 pages
Assignment No.: - 2 Ques:1 List The Principles of OOSE With Its Concepts. Ans:1
No ratings yet
Assignment No.: - 2 Ques:1 List The Principles of OOSE With Its Concepts. Ans:1
11 pages
Integrating Requirements Engineering Into Software Engineering Processes
No ratings yet
Integrating Requirements Engineering Into Software Engineering Processes
71 pages
JD - VMWare Systems Engineer I
No ratings yet
JD - VMWare Systems Engineer I
2 pages
How To Understand Salesforce Commerce Cloud Platform - by Oleg Sapishchuk - Medium
No ratings yet
How To Understand Salesforce Commerce Cloud Platform - by Oleg Sapishchuk - Medium
17 pages
Lusca Setting Game
No ratings yet
Lusca Setting Game
47 pages
PEOs, POs and PSOs - IIIT Hyderabad
No ratings yet
PEOs, POs and PSOs - IIIT Hyderabad
3 pages
Session 6 How To Write A Progress Report
No ratings yet
Session 6 How To Write A Progress Report
3 pages
Module 2 Point 4
No ratings yet
Module 2 Point 4
18 pages
Digital
No ratings yet
Digital
68 pages

Module4 DataAnalyticsLanguages

Uploaded by

Module4 DataAnalyticsLanguages

Uploaded by

Module 4:

Data Analytics Languages--

• Python created by Guido van Rossum in the

eLahe Technologies 2020

• NumPy is the fundamental package for scientific

eLahe Technologies 2020

• Matplotlib is a Python 2D plotting library

eLahe Technologies 2020

• pandas is an open source, BSD-licensed

eLahe Technologies 2020

In computing, a regular expression, also referred to as

• The dot character matches any character

• The dot character matches any character

• Depending on how "clean" your data is and the

• The repeat characters (* and +) push outward in both

• Not all regular expression repeat codes are greedy!

• Slicing in ndarrays is different from Python lists in that

eLahe Technologies 2020

>>> lst = list() >>> ddd = dict()

You might also like