0% found this document useful (0 votes)

2 views

Lecture 7 Re Part2 Split

Uploaded by

mudassirsabri45

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Lecture 7 Re Part2 Split

Uploaded by

mudassirsabri45

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

re.

split()
Split string by the occurrences of a character or a pattern, upon finding that pattern, the
remaining characters from the string are returned as part of the resulting list.

• Syntax : re.split(pattern, string, maxsplit=0, flags=0)

The First parameter, pattern denotes the regular expression, string is the given string in which
pattern will be searched for and in which splitting occurs, maxsplit if not provided is considered
to be zero ‘0’, and if any nonzero value is provided, then at most that many splits occur. If
maxsplit = 1, then the string will split once only, resulting in a list of length 2. The flags are very
useful and can help to shorten code, they are not necessary parameters, eg: flags =
re.IGNORECASE, in this split, the case, i.e. the lowercase or the uppercase will be ignored.

l = [1,3 ,4]
k = ['a','b','c']
print(l+k)

[1, 3, 4, 'a', 'b', 'c']

from re import split

# '\W+' denotes Non-Alphanumeric Characters

# or group of characters Upon finding ','
# or whitespace ' ', the split(), splits the
# string from that point
print(split('\w+', 'Words, words , Words'))
print(split('\W', "Word's words Words"))

# Here ':', ' ' ,',' are not AlphaNumeric thus,

# the point where splitting occurs
print(split('\W+', 'On 12th Jan 2016, at 11:02 AM'))

# '\d+' denotes Numeric Characters or group of

# characters Splitting occurs at '12', '2016',
# '11', '02' only
print(split('\d+', 'On 12th Jan 2016, at 11:02 AM'))

['', ', ', ' , ', '']

['Word', 's', 'words', 'Words']
['On', '12th', 'Jan', '2016', 'at', '11', '02', 'AM']
['On ', 'th Jan ', ', at ', ':', ' AM']

from IPython.display import display, Image

display(Image(filename='flags.png'))
import re

# Splitting will occurs only once, at

# '12', returned list will have length 2
print(re.split('\d+', 'On 12th Jan 2016, at 11:02 AM', 1))

# 'Boy' and 'boy' will be treated same when

# flags = re.IGNORECASE
print(re.split('[a-f]+', 'Aey, Boy oh boy, come here',
flags=re.IGNORECASE))
print(re.split('[a-f]+', 'Aey, Boy oh boy, come here'))

['On ', 'th Jan 2016, at 11:02 AM']

['', 'y, ', 'oy oh ', 'oy, ', 'om', ' h', 'r', '']
['A', 'y, Boy oh ', 'oy, ', 'om', ' h', 'r', '']

re.sub()
The ‘sub’ in the function stands for SubString, a certain regular expression pattern is searched in
the given string(3rd parameter), and upon finding the substring pattern is replaced by repl(2nd
parameter), count checks and maintains the number of times this occurs.

• Syntax: re.sub(pattern, repl, string, count=0, flags=0)

import re

# Regular Expression pattern 'ub' matches the

# string at "Subject" and "Uber". As the CASE
# has been ignored, using Flag, 'ub' should
# match twice with the string Upon matching,
# 'ub' is replaced by '~*' in "Subject", and
# in "Uber", 'Ub' is replaced.
print(re.sub('ub', '~*', 'Subject has Uber booked already',
flags=re.IGNORECASE))
# Consider the Case Sensitivity, 'Ub' in
# "Uber", will not be replaced.
print(re.sub('ub', '~*', 'Subject has Uber booked already'))

# As count has been given value 1, the maximum

# times replacement occurs is 1
print(re.sub('ub', '~*', 'Subject has Uber booked already',
count=1, flags=re.IGNORECASE))

# 'r' before the pattern denotes RE, \s is for

# start and end of a String.
print(re.sub(r'\sAND\s', ' & ', 'Baked Beans And Spam',
flags=re.IGNORECASE))

S~ject has ~er booked already

S~*ject has Uber booked already
S~*ject has Uber booked already
Baked Beans & Spam

re.subn()
subn() is similar to sub() in all ways, except in its way of providing output. It returns a tuple with
count of the total of replacement and the new string rather than just the string.

• Syntax: re.subn(pattern, repl, string, count=0, flags=0)

import re

print(re.subn('ub', '~*', 'Subject has Uber booked already'))

t = re.subn('ub', '~*', 'Subject has Uber booked already',

flags=re.IGNORECASE)
print(t)
print(len(t))

# This will give same output as sub() would have

print(t[0])

('S~*ject has Uber booked already', 1)

('S~*ject has ~*er booked already', 2)
2
S~*ject has ~*er booked already
re.escape()
Returns string with all non-alphanumerics backslashed, this is useful if you want to match an
arbitrary literal string that may have regular expression metacharacters in it.

• Syntax: re.escape(string)
import re

# escape() returns a string with BackSlash '\',

# before every Non-Alphanumeric Character
# In 1st case only ' ', is not alphanumeric
# In 2nd case, ' ', caret '^', '-', '[]', '\'
# are not alphanumeric
print(re.escape("Awesome even"))
print(re.escape("I Asked what is this [a-9], he said \t ^WoW"))

Awesome\ even
I\ Asked\ what\ is\ this\ \[a\-9\],\ he\ said\ \ \ \^WoW

re.search()
This method either returns None (if the pattern doesn’t match), or a re.MatchObject contains
information about the matching part of the string. This method stops after the first match, so
this is best suited for testing a regular expression more than extracting data.

import re

regex = r"([a-zA-Z]+) (\d+)"

match = re.search(regex, "I was born on June 24")

if match != None:

print ("Match at index %s, %s" % (match.start(), match.end()))

print ("Full match: %s" % (match.group(0)))

# So this will print "June"

print ("Month: %s" % (match.group(1)))

# So this will print "24"

print ("Day: %s" % (match.group(2)))

else:
print ("The regex pattern does not match.")
Match at index 14, 21
Full match: June 24
Month: June
Day: 24

import re

s = "Welcome to Artificial intelligence "

# here x is the match object

res = re.search(r"\bA", s)

print(res.re)
print(res.string)

re.compile('\\bA')
Welcome to Artificial intelligence

Getting matched substring

group() method returns the part of the string for which the patterns match. See the below
example for a better understanding.

import re

s = "Welcome to Artificial Intelligence"

# here x is the match object

res = re.search(r"\D{3} t", s)

print(res.group())

ome t

# A Python program to demonstrate working of re.match().

import re
regex = r"([a-zA-Z]+) (\d+)"
match = re.search(regex, "I was born on June 24")
if match != None:

print ("Match at index %s, %s" % (match.start(), match.end()))

print ('date = ', match.group(0))

# So this will print "June"

print ("Month: %s" % (match.group(1)))
# So this will print "24"
print ("Day: %s" % (match.group(2)))

else:
print ("The regex pattern does not match.")

Match at index 14, 21

date = June 24
Month: June
Day: 24

Matching a Pattern with Text

re.match() : This function attempts to match pattern to whole string. The re.match function
returns a match object on success, None on failure.

re.match(pattern, string, flags=0)

• pattern : Regular expression to be matched.
• string : String where pattern is searched
• flags : We can specify different flags using bitwise OR (|).
# A Python program to demonstrate working
# of re.match().
import re

# a sample function that uses regular expressions

# to find month and day of a date.
def findMonthAndDate(string):

regex = r"([a-zA-Z]+) (\d+)"

match = re.match(regex, string)

if match == None:
print ("Not a valid date")
return

print ("Given Data: %s" % (match.group()))

print ("Month: %s" % (match.group(1)))
print ("Day: %s" % (match.group(2)))

# Driver Code
findMonthAndDate("Jun 24")
print("")
findMonthAndDate("I was born on June 24")
Given Data: Jun 24
Month: Jun
Day: 24

Not a valid date

Finding all occurrences of a pattern

re.findall() : Return all non-overlapping matches of pattern in string, as a list of strings. The
string is scanned left-to-right, and matches are returned in the order found

# A Python program to demonstrate working of

# findall()
import re

# A sample text string where regular expression

# is searched.
string = """Hello my Number is 123456789 and
my friend's number is 987654321"""

# A sample regular expression to find digits.

regex = '\d+'

match = re.findall(regex, string)

print(match)

# This example is contributed by Ayush Saluja.

['123456789', '987654321']

print([x for x in range(10) if x <2])

[0, 1]

a = "Hello, World!"
print(a[2:4])

len(a)

print(a[-5:-2])

orl
fruits = ["apple","banana","cherry"]
print(fruits[-2])

banana

The C# Player's Guide - 5th Edition - 5.0.0
83% (18)
The C# Player's Guide - 5th Edition - 5.0.0
497 pages
Corce
70% (46)
Corce
206 pages
Neetcode 150 Solution
No ratings yet
Neetcode 150 Solution
74 pages
Ap Computer Science Principles Practice Exam and Notes 2021
100% (4)
Ap Computer Science Principles Practice Exam and Notes 2021
108 pages
The Ethical Slut PDF
55% (69)
The Ethical Slut PDF
298 pages
Hacking The Art of Exploitation 2nd Edition Jon Erickson
100% (19)
Hacking The Art of Exploitation 2nd Edition Jon Erickson
492 pages
50 Phone Hacks DR - Brad
58% (19)
50 Phone Hacks DR - Brad
29 pages
2000 Iptv
100% (2)
2000 Iptv
70 pages
C# Cheat Sheet
100% (5)
C# Cheat Sheet
12 pages
BitCoin White Paper
100% (4)
BitCoin White Paper
9 pages
Guitar Center Guitar
No ratings yet
Guitar Center Guitar
1 page
PDF
100% (1)
PDF
568 pages
GM M11GM-Ia-3 Q1
100% (1)
GM M11GM-Ia-3 Q1
6 pages
FSA Guidance Document - As Published 08.03.2019 1.0
No ratings yet
FSA Guidance Document - As Published 08.03.2019 1.0
31 pages
Regular Expression
No ratings yet
Regular Expression
21 pages
Summary Python 1
No ratings yet
Summary Python 1
36 pages
Regular Expression l
No ratings yet
Regular Expression l
20 pages
Unit-3 Python
No ratings yet
Unit-3 Python
72 pages
unit 4 Regular expression
No ratings yet
unit 4 Regular expression
16 pages
UNIT - 4 REGEX
No ratings yet
UNIT - 4 REGEX
28 pages
Regular Expressions
No ratings yet
Regular Expressions
104 pages
Nuevo Documento de Texto
No ratings yet
Nuevo Documento de Texto
6 pages
Jobanpy
No ratings yet
Jobanpy
47 pages
Regular Expression
No ratings yet
Regular Expression
18 pages
Manipulating Text with Regular Expression in python
No ratings yet
Manipulating Text with Regular Expression in python
4 pages
This Is The Course Script
No ratings yet
This Is The Course Script
9 pages
String R
No ratings yet
String R
6 pages
R Imp Funtions
No ratings yet
R Imp Funtions
10 pages
Module 6 NumPY and Pandas
No ratings yet
Module 6 NumPY and Pandas
12 pages
Compute The Greatest Common Divisor and Least Common Multiple of Two Integers
No ratings yet
Compute The Greatest Common Divisor and Least Common Multiple of Two Integers
26 pages
Notes on CS - python
No ratings yet
Notes on CS - python
10 pages
mod-3-PATTERN MATCHING WITH REGULAR EXPRESSIONS
No ratings yet
mod-3-PATTERN MATCHING WITH REGULAR EXPRESSIONS
21 pages
Python Code Practice
No ratings yet
Python Code Practice
11 pages
Import: Datetime Is - Valid - Date (Date - STR)
No ratings yet
Import: Datetime Is - Valid - Date (Date - STR)
4 pages
41 Questions To Test Your Knowledge of Python Strings PDF
No ratings yet
41 Questions To Test Your Knowledge of Python Strings PDF
13 pages
RegEx 1
No ratings yet
RegEx 1
48 pages
Data_Types.ipynb - Colaboratory
No ratings yet
Data_Types.ipynb - Colaboratory
14 pages
A Short List of The Most Useful R Commands
No ratings yet
A Short List of The Most Useful R Commands
8 pages
1012 Quick Reference
No ratings yet
1012 Quick Reference
2 pages
regexpresion
No ratings yet
regexpresion
11 pages
Python Regular Expression
100% (1)
Python Regular Expression
31 pages
4 5843859483645716285
No ratings yet
4 5843859483645716285
6 pages
regular exp
No ratings yet
regular exp
10 pages
DSA Turing
No ratings yet
DSA Turing
8 pages
2021 Uam 2107
No ratings yet
2021 Uam 2107
8 pages
Advanced Python Programming Practical Manual
No ratings yet
Advanced Python Programming Practical Manual
29 pages
Lab Manual Python 2023-Final
No ratings yet
Lab Manual Python 2023-Final
48 pages
Week 7
No ratings yet
Week 7
30 pages
Eneral Definitions
No ratings yet
Eneral Definitions
7 pages
A Short List of The Most Useful R Commands
No ratings yet
A Short List of The Most Useful R Commands
11 pages
Regular Exp
No ratings yet
Regular Exp
6 pages
Prac1.ipynb (Auto-R) - JupyterLab
No ratings yet
Prac1.ipynb (Auto-R) - JupyterLab
3 pages
Lecture3_DynamicLanguage
No ratings yet
Lecture3_DynamicLanguage
24 pages
Strings
No ratings yet
Strings
57 pages
R Notes For Data Analysis and Statistical Inference
No ratings yet
R Notes For Data Analysis and Statistical Inference
10 pages
String GGG
No ratings yet
String GGG
47 pages
Python_Lab_Record_final (1)
No ratings yet
Python_Lab_Record_final (1)
79 pages
44DBD3C517136789270
No ratings yet
44DBD3C517136789270
21 pages
Python Practical File
No ratings yet
Python Practical File
38 pages
NUMPY
No ratings yet
NUMPY
8 pages
Numpy
No ratings yet
Numpy
9 pages
9.RegEx (1)
No ratings yet
9.RegEx (1)
57 pages
string python
No ratings yet
string python
8 pages
8B Recap of Array Programming With Loops and Map, Reduce, Filter
No ratings yet
8B Recap of Array Programming With Loops and Map, Reduce, Filter
49 pages
PYTHONBOOK
No ratings yet
PYTHONBOOK
32 pages
CSCI Final Cheat Sheet
No ratings yet
CSCI Final Cheat Sheet
14 pages
Regular Expressions: Python For Everybody
No ratings yet
Regular Expressions: Python For Everybody
34 pages
Program Design Notes
No ratings yet
Program Design Notes
6 pages
Day 1
No ratings yet
Day 1
13 pages
Raunakmalkani 20BIT032 Assignment RegularExpressions
No ratings yet
Raunakmalkani 20BIT032 Assignment RegularExpressions
14 pages
(Python) Regex Cheat Sheet
100% (1)
(Python) Regex Cheat Sheet
1 page
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Coding With JavaScript For Dummies Everything To Know About JavaScript (2020) - 40153
100% (1)
Coding With JavaScript For Dummies Everything To Know About JavaScript (2020) - 40153
247 pages
AI Tools and Prompts
100% (4)
AI Tools and Prompts
94 pages
Simple Sabotage Field Manual
100% (2)
Simple Sabotage Field Manual
16 pages
FORScan 2015-2018 F150s
0% (1)
FORScan 2015-2018 F150s
34 pages
Eat That Frog
100% (10)
Eat That Frog
124 pages
Introductory Algebra
100% (4)
Introductory Algebra
214 pages
Learn Javascript in A DAY!
100% (8)
Learn Javascript in A DAY!
192 pages
Java Programming Cheatsheet
100% (1)
Java Programming Cheatsheet
14 pages
The JavaScript Beginner's Handbook
90% (10)
The JavaScript Beginner's Handbook
76 pages
Computing Command Line Linux
No ratings yet
Computing Command Line Linux
203 pages
Coffee Break Python Workbook Mayer
No ratings yet
Coffee Break Python Workbook Mayer
297 pages
PDF
100% (1)
PDF
192 pages
Hacking in Detail
0% (3)
Hacking in Detail
24 pages
The Linux Command Line
100% (4)
The Linux Command Line
537 pages
Pats Relearn
No ratings yet
Pats Relearn
2 pages
A Comprehensive Analysis of Package Hallucinations by Code Generating LLMs [2406.10279]
100% (1)
A Comprehensive Analysis of Package Hallucinations by Code Generating LLMs [2406.10279]
20 pages
Introduction To Computer Science
100% (6)
Introduction To Computer Science
202 pages
Linux Cheat Sheet
No ratings yet
Linux Cheat Sheet
4 pages
Learn To Code HTML and CSS Develop Style Websites PDF
100% (2)
Learn To Code HTML and CSS Develop Style Websites PDF
595 pages
Virginia Science Third Grade Plants 2
No ratings yet
Virginia Science Third Grade Plants 2
2 pages
Unable To Save Application Information During Test Application or MER Creation
No ratings yet
Unable To Save Application Information During Test Application or MER Creation
6 pages
A 90 Crash Awaits Kaspa (KAS) Token Price - CoinChapter
No ratings yet
A 90 Crash Awaits Kaspa (KAS) Token Price - CoinChapter
1 page
Megger Grounding Bonding
No ratings yet
Megger Grounding Bonding
118 pages
Derrick 5T FOR DISMANTLE TC1
No ratings yet
Derrick 5T FOR DISMANTLE TC1
1 page
Quick Guide VIO 200 S: Necessary Operating Steps
No ratings yet
Quick Guide VIO 200 S: Necessary Operating Steps
2 pages
Chap 07 Spreadsheet Models
No ratings yet
Chap 07 Spreadsheet Models
43 pages
Rear End Module
100% (1)
Rear End Module
103 pages
Masina Taiat Iarba - Electrolux Bernard Loisirs 605350 DATASHEET
No ratings yet
Masina Taiat Iarba - Electrolux Bernard Loisirs 605350 DATASHEET
3 pages
Smart Bridge - Automatic Height Increase During Flooding
No ratings yet
Smart Bridge - Automatic Height Increase During Flooding
12 pages
CR 975 System
No ratings yet
CR 975 System
129 pages
Robot API RCDesign
No ratings yet
Robot API RCDesign
14 pages
Project MIC(1)
No ratings yet
Project MIC(1)
11 pages
Shear Wall Notes:: Ac Pad Must Meet Minimum Design Flood Elevation in Flood Zone Locations 1'-0" 3'-0"
No ratings yet
Shear Wall Notes:: Ac Pad Must Meet Minimum Design Flood Elevation in Flood Zone Locations 1'-0" 3'-0"
1 page
Data-Intensive Computing
No ratings yet
Data-Intensive Computing
88 pages
Databeat CheatSheet OMNIpro GettingStarted
100% (1)
Databeat CheatSheet OMNIpro GettingStarted
1 page
Blockchain Notes
No ratings yet
Blockchain Notes
3 pages
TOS in Math 7
No ratings yet
TOS in Math 7
2 pages
Bsbins 601 Project Portfolio
No ratings yet
Bsbins 601 Project Portfolio
21 pages
Semiconductor KRC101S KRC106S: Technical Data
No ratings yet
Semiconductor KRC101S KRC106S: Technical Data
7 pages
CCP6214-Algorithm Design and Analysis Approved
No ratings yet
CCP6214-Algorithm Design and Analysis Approved
3 pages
Resume Marjorie Turingan v2021-2
No ratings yet
Resume Marjorie Turingan v2021-2
3 pages
Category-I: Northern Region (Critical & Non-Critical)
No ratings yet
Category-I: Northern Region (Critical & Non-Critical)
26 pages
Parameters For Selection of Pumps For Different Applications
67% (3)
Parameters For Selection of Pumps For Different Applications
10 pages
Columbia University Mention of Hedy Lamarr
No ratings yet
Columbia University Mention of Hedy Lamarr
44 pages
Big Java Solution Manual Ch17
100% (2)
Big Java Solution Manual Ch17
17 pages
9852 2878 01 Maintenance Instructions COP1132B
No ratings yet
9852 2878 01 Maintenance Instructions COP1132B
42 pages
Mars L2000 - Elevator Systems - Merih Elevator
No ratings yet
Mars L2000 - Elevator Systems - Merih Elevator
4 pages