13 Python Ch05 ORC

Regular expressions are a powerful tool for string manipulation that are present in most modern programming languages as a library. They allow users to describe search patterns to extract information from text. The re module in Python provides functions like match(), search(), sub(), findall(), and finditer() to work with regular expressions. match() checks if a pattern is at the start of a string while search() checks anywhere in the string. sub() replaces patterns in a string. findall() and finditer() return lists/iterators of all matches of a pattern in a string.

Uploaded by

WIIL WAAAL


Strings 1

5.1 REGULAR EXPRESSIONS

Regular expressions are a powerful tool for various kinds of string manipulation. A regular expression is a
special text string that describes a search pattern, used to extract information from text such as code, files,
logs, spreadsheets, or even documents.
Regular expressions are a domain-specific language (DSL) that is available as a library in most modern
programming languages, not just Python. A regular expression is a special sequence of characters that helps to
match or find strings in another string. In Python, regular expressions can be accessed using the re module,
which comes as a part of the Standard Library. In this section, we will discuss some important functions in the
re module.

Programming Tip: An exception re.error is raised if any error occurs while compiling or using regular
expressions.
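
As a minimal sketch of the re.error exception mentioned in the tip, the snippet below compiles one valid and one invalid pattern. The helper name try_compile is hypothetical and exists only for illustration; re.compile() and re.error are part of the re module.

```python
import re

def try_compile(pattern):
    """Return a compiled pattern, or None if the pattern is invalid.

    try_compile is a hypothetical helper used only for this illustration.
    """
    try:
        return re.compile(pattern)
    except re.error as exc:
        # re.error carries a readable message describing the problem
        print("Invalid pattern:", exc)
        return None

good = try_compile(r"\d+")   # valid: one or more digits
bad = try_compile(r"(abc")   # invalid: the parenthesis is never closed
```

Catching re.error is mainly useful when the pattern comes from user input or a configuration file rather than from the program itself.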

5.1.1 The match() Function

As the name suggests, the match() function matches a pattern to a string with optional flags. The syntax
of the match() function is,
re.match(pattern, string, flags=0)

The function tries to match the pattern (which specifies the regular expression to be matched) against the
beginning of string (the text to be searched). The flags argument is optional. Some values of flags are
specified in Table 6.4. To specify more than one flag, combine them with the bitwise OR operator, as
in re.I | re.M. If the re.match() function finds a match, it returns a match object; otherwise, it returns None.
Table 6.4 Different values of flags
Flag    Description
re.I    Case-insensitive matching
re.M    Makes ^ and $ match at the start and end of each line
re.X    Ignores whitespace and comments in the pattern (verbose mode)
re.U    Interprets letters according to the Unicode character set
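
The following sketch shows one way of combining two flags with the bitwise OR operator. The sample pattern is an assumption chosen for illustration: it is written with extra spaces, which are ignored only when re.X is given.

```python
import re

string = "She sells sea shells on the sea shore"

# Under re.X, plain spaces in the pattern are ignored, so \s is used
# to match the real space between the two words.
pattern = r"she \s sells"

plain = re.match(pattern, string)                 # no flags: 's' does not match 'S'
flagged = re.match(pattern, string, re.I | re.X)  # ignore case + verbose mode

print(plain)            # None
print(flagged.group())  # She sells
```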

Example 6.26 Program to demonstrate the use of match() function

import re
string = "She sells sea shells on the sea shore"
# "sells" occurs in the string, but not at the beginning
pattern1 = "sells"
if re.match(pattern1, string):
    print("Match Found")
else:
    print(pattern1, "is not present in the string")
# "She" occurs at the beginning of the string
pattern2 = "She"
if re.match(pattern2, string):
    print("Match Found")
else:
    print(pattern2, "is not present in the string")

OUTPUT
sells is not present in the string
Match Found

13_Python_Ch05_ORC.indd 1 27-05-2019 16:49:42

2 Problem Solving and Programming with Python

In the above program, ‘sells’ is present in the string, but the output still reports that it is not present. This is
because the re.match() function finds a match only at the beginning of the string. Since the word ‘sells’
occurs in the middle of the string, match() returns None.

Note On success, match() function returns an object representing the match, else returns None.
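
A short sketch of inspecting the returned match object is given below. It uses the group() and span() methods of match objects; the variable names are illustrative only.

```python
import re

string = "She sells sea shells on the sea shore"

result = re.match(r"She", string)
if result is not None:
    print(result.group())  # the matched text: She
    print(result.span())   # (start, end) of the match: (0, 3)

# "sells" is not at the beginning of the string, so match() returns None
missing = re.match(r"sells", string)
print(missing)             # None
```

Comparing the result with None before calling any method avoids an AttributeError when the pattern does not match.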

5.1.2 The search() Function

Programming Tip: While using regular expressions, always use raw strings.

In the previous function, we saw that even when the pattern was present in the string, None was returned
because the match was attempted only at the beginning of the string. So, we have another function, search(),
in the re module that searches for a pattern anywhere in the string. The syntax of the search() function can be
given as,

re.search(pattern, string, flags=0)

The syntax is similar to that of the match() function. The function searches for the first occurrence of pattern
within string, with optional flags. If the search is successful, a match object is returned; otherwise, None
is returned.

Example 6.1 Program to demonstrate the use of search() function

import re
string = "She sells sea shells on the sea shore"
pattern = "sells"
if re.search(pattern, string):
    print("Match Found")
else:
    print(pattern, "is not present in the string")

OUTPUT
Match Found

Note The re.search() function finds a match of a pattern anywhere in the string.
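
To illustrate why raw strings are recommended with search(), the sketch below uses the word-boundary sequence \b, which is chosen here only as an example. In a normal string, "\b" is converted to a backspace character before the re module ever sees it.

```python
import re

string = "She sells sea shells on the sea shore"

# Raw string: \b reaches the re module intact and means "word boundary"
raw = re.search(r"\bsea\b", string)

# Normal string: "\b" becomes a backspace character, which the text lacks
not_raw = re.search("\bsea\b", string)

print(raw.span())  # (10, 13), the first occurrence only
print(not_raw)     # None
```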

5.1.3 The sub() Function

The sub() function in the re module can be used to search for a pattern in a string and replace each match with
another string. The syntax of the sub() function can be given as,

re.sub(pattern, repl, string, count=0, flags=0)

According to the syntax, the sub() function replaces occurrences of the pattern in string with repl. All
occurrences are replaced unless a non-zero count is provided, in which case at most count replacements are
made. The function returns the modified string.

Example 6.2 Program to demonstrate the use of sub() function

import re
string = "She sells sea shells on the sea shore"
pattern = "sea"
repl = "ocean"
# count=1 limits the substitution to the first occurrence
new_string = re.sub(pattern, repl, string, count=1)
print(new_string)

OUTPUT
She sells ocean shells on the sea shore

In the above program, note that only one occurrence was replaced and not all, because we had provided 1
as the value of count.
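
For comparison, the sketch below runs the same substitution with and without a limit. In the re module the limiting argument is named count, and its default of 0 means that every occurrence is replaced.

```python
import re

string = "She sells sea shells on the sea shore"

all_replaced = re.sub(r"sea", "ocean", string)          # count defaults to 0
first_only = re.sub(r"sea", "ocean", string, count=1)   # at most one replacement

print(all_replaced)  # She sells ocean shells on the ocean shore
print(first_only)    # She sells ocean shells on the sea shore
```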

5.1.4 The findall() and finditer() Functions

The findall() function searches a string and returns a list of all matches of the pattern in the string. If
no match is found, then the returned list is empty. The syntax of the findall() function can be given as,

matchList = re.findall(pattern, input_str, flags=0)

Example 6.3 Program to demonstrate the use of findall() function

import re
# One or more letters, a space, then one or more digits
pattern = r"[a-zA-Z]+ \d+"
matches = re.findall(pattern, "LXI 2013, VXI 2015, VDI 20104, Maruti Suzuki Cars in India")
for match in matches:
    print(match, end = " ")

OUTPUT
LXI 2013 VXI 2015 VDI 20104

Note The re.findall() function returns a list of all substrings that match a pattern.

In the above code, the regular expression, pattern = r"[a-zA-Z]+ \d+", finds all substrings that consist of
one or more letters followed by a space and then one or more digits.
The finditer() function is similar to the findall() function, but instead of returning a list of matched strings,
it returns an iterator that yields a match object for each match. These match objects can be used to print the
index of each match in the given string.
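
The difference between the two return types can be sketched as follows: findall() gives a list of matched strings, while finditer() yields match objects. The shortened input text and the variable names are illustrative only.

```python
import re

pattern = r"[a-zA-Z]+ \d+"
text = "LXI 2013, VXI 2015, VDI 20104"

as_list = re.findall(pattern, text)    # list of matched strings
as_iter = re.finditer(pattern, text)   # iterator of match objects

# Each match object knows both its text and its position in the string
positions = [(m.group(), m.start()) for m in as_iter]

print(as_list)    # ['LXI 2013', 'VXI 2015', 'VDI 20104']
print(positions)  # [('LXI 2013', 0), ('VXI 2015', 10), ('VDI 20104', 20)]
```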

Example 6.4 Program to demonstrate the use of finditer() function

import re
pattern = r"[a-zA-Z]+ \d+"
matches = re.finditer(pattern, "LXI 2013, VXI 2015, VDI 20104, Maruti Suzuki Cars available with us")
# Each item produced by the iterator is a match object
for match in matches:
    print("Match found at starting index : ", match.start())
    print("Match found at ending index : ", match.end())
    print("Match found at starting and ending index : ", match.span())

OUTPUT
Match found at starting index : 0
Match found at ending index : 8
Match found at starting and ending index : (0, 8)
Match found at starting index : 10
Match found at ending index : 18
Match found at starting and ending index : (10, 18)
Match found at starting index : 20
Match found at ending index : 29
Match found at starting and ending index : (20, 29)

Note that the start() method returns the starting index of the current match in the given string. Similarly,
the end() method returns the ending index of that match. Another method, span(), returns the starting and
ending index of the match as a tuple.

Note The match objects returned by the search() and match() functions, and those yielded by finditer(),
have start() and end() methods that return the starting and ending index of the match. The findall()
function returns plain strings, which do not have these methods.

5.1.5 Flag Options

The search(), findall(), and match() functions of the re module take optional flags that modify the behavior
of the pattern match. Some of these flags are:
re.I or re.IGNORECASE: Ignores the case of characters, so "Match", "MATCH", "mAtCh", etc. are all treated as
the same.
re.S or re.DOTALL: Enables the dot (.) to match the newline character. By default, the dot matches any
character other than the newline character.
re.M or re.MULTILINE: Makes ^ and $ match the start and end of each line. That is, they also match just
after and just before line breaks in the string. By default, ^ and $ match the start and end of the whole
string.
re.L or re.LOCALE: Makes \w match all characters that are considered letters in the current locale
settings. In Python 3, this flag can be used only with bytes patterns.
re.U or re.UNICODE: Treats all letters from all scripts as word characters. In Python 3, this is already the
default for string patterns.
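
As a small sketch of how re.S and re.M change matching, the snippet below applies the same patterns to a two-line string with and without each flag. The sample text is an assumption made up for this illustration.

```python
import re

text = "sea shells\nsea shore"

# re.S (DOTALL): decides whether the dot may match the newline character
dot_default = re.search(r"shells.sea", text)
dot_all = re.search(r"shells.sea", text, re.S)

# re.M (MULTILINE): decides whether ^ matches at the start of every line
starts_default = re.findall(r"^sea", text)
starts_multi = re.findall(r"^sea", text, re.M)

print(dot_default)     # None
print(dot_all.span())  # (4, 14)
print(starts_default)  # ['sea']
print(starts_multi)    # ['sea', 'sea']
```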
