Regular Expression

This document discusses regular expressions (regex) in Python. It provides examples of common regex functions like re.findall(), re.split(), re.sub(), and re.search(). It also describes match objects that are returned when a regex pattern is found, including attributes like match.group(), match.start(), match.end(), match.span(), match.re, and match.string.

Uploaded by

HEART SPORTS


Regular Expressions
Match Objects and Substituting

By Muhammad Farsan

What Is a Regular Expression?

A Regular Expression (RegEx) is a sequence of characters that defines a search pattern. For example,

^a...s$

The above code defines a RegEx pattern. The pattern is: any five-character string starting with a and ending with s.

Python has a module named re to work with regular expressions. Here's an example:

import re

pattern = '^a...s$'
test_string = 'abyss'
result = re.match(pattern, test_string)

if result:
    print("Search successful.")
else:
    print("Search unsuccessful.")

Here, we used the re.match() function to look for pattern at the start of test_string. The function returns a match object if the search is successful. If not, it returns None.
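
As a small sketch of the behaviour described above, the same pattern can be tried against several words. The word list is hypothetical and chosen only for illustration.

```python
import re

pattern = r'^a...s$'

# Hypothetical sample words, chosen only for illustration.
candidates = ['abyss', 'alias', 'abs', 'an abacus']

# re.match() returns a match object on success and None otherwise,
# so the result can be used directly as a condition.
matched = [word for word in candidates if re.match(pattern, word)]
print(matched)  # ['abyss', 'alias']
```

Only the five-character words that start with a and end with s are kept.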
Specify Pattern Using RegEx:
To specify regular expressions, metacharacters are used. In the
above example, ^ and $ are metacharacters.

Metacharacters:
Metacharacters are characters that are interpreted in a special way by a RegEx engine. Here's a list of metacharacters:
[] . ^ $ * + ? {} () \ |

[] – Square brackets:
Square brackets specify a set of characters you wish to match. For example, [abc] will match if the string you are trying to match contains any of a, b or c.

You can also specify a range of characters using - inside square brackets.
o [a-e] is the same as [abcde].
o [1-4] is the same as [1234].
o [0-39] is the same as [01239].

You can complement (invert) the character set by using the caret ^ symbol at the start of a square bracket.
o [^abc] means any character except a or b or c.
o [^0-9] means any non-digit character.
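
The character sets listed above can be checked with re.findall(). This is a minimal sketch; the sample text and variable names are assumptions made for illustration.

```python
import re

text = 'cab 2039 fee'  # hypothetical sample text

set_abc = re.findall(r'[abc]', text)        # any of a, b or c
range_a_e = re.findall(r'[a-e]', text)      # same as [abcde]
digits_0_3_9 = re.findall(r'[0-39]', text)  # same as [01239]
non_digits = re.findall(r'[^0-9]', text)    # anything that is not a digit

print(set_abc)       # ['c', 'a', 'b']
print(range_a_e)     # ['c', 'a', 'b', 'e', 'e']
print(digits_0_3_9)  # ['2', '0', '3', '9']
print(non_digits)    # ['c', 'a', 'b', ' ', ' ', 'f', 'e', 'e']
```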

. – Period:
A period matches any single character (except newline '\n').

^ – Caret:
The caret symbol ^ is used to check if a string starts with a certain character.

$ – Dollar:
The dollar symbol $ is used to check if a string ends with a certain character.

+ – Plus:
The plus symbol + matches one or more occurrences of the pattern to its left.
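
A short sketch of the period, caret, dollar and plus metacharacters in action. The sample strings are hypothetical.

```python
import re

# ^ anchors the pattern to the start of the string.
starts_with_a = bool(re.search(r'^a', 'apple'))
not_at_start = bool(re.search(r'^a', 'banana'))

# $ anchors the pattern to the end of the string.
ends_with_s = bool(re.search(r's$', 'abyss'))

# . stands for exactly one character between a and c.
any_char = re.findall(r'a.c', 'abc a-c ac')

# + requires one or more of the preceding a.
one_or_more = re.findall(r'ma+n', 'mn man maaan')

print(starts_with_a, not_at_start, ends_with_s)  # True False True
print(any_char)     # ['abc', 'a-c']
print(one_or_more)  # ['man', 'maaan']
```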

Python RegEx

Python has a module named re to work with regular expressions. To use it, we need to import the module.

import re

The module defines several functions and constants to work with RegEx.

re.findall()

The re.findall() method returns a list of strings containing all matches.

Example:

# Program to extract numbers from a string
import re

string = 'hello 12 hi 89. Howdy 34'
pattern = r'\d+'

result = re.findall(pattern, string)
print(result)

# Output: ['12', '89', '34']

If the pattern is not found, re.findall() returns an empty list.
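
Because re.findall() returns strings, a conversion step is needed when numbers are wanted. The sketch below assumes that goal and also shows the empty-list case; the second input string is hypothetical.

```python
import re

# The matches are strings, so convert them explicitly if integers are needed.
numbers = [int(n) for n in re.findall(r'\d+', 'hello 12 hi 89. Howdy 34')]

# No digits in the input: the result is an empty list, not None.
no_numbers = re.findall(r'\d+', 'no digits here')

print(numbers)     # [12, 89, 34]
print(no_numbers)  # []
```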

re.split()

The re.split() method splits the string where there is a match and returns a list of the resulting substrings.

Example:

import re

string = 'Twelve:12 Eighty nine:89.'
pattern = r'\d+'

result = re.split(pattern, string)
print(result)

# Output: ['Twelve:', ' Eighty nine:', '.']

If the pattern is not found, re.split() returns a list containing the original string.
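
The sketch below illustrates the no-match case and, as an addition not covered on the slide, the optional maxsplit argument of re.split(), which limits the number of splits. The sample text is hypothetical.

```python
import re

text = 'Twelve:12 Eighty nine:89 Nine:9.'  # hypothetical sample text

# Split at every run of digits.
all_parts = re.split(r'\d+', text)

# maxsplit=1 stops after the first split; the rest stays in one piece.
first_split_only = re.split(r'\d+', text, maxsplit=1)

# No match: the list holds the original string unchanged.
unsplit = re.split(r'\d+', 'no digits here')

print(all_parts)         # ['Twelve:', ' Eighty nine:', ' Nine:', '.']
print(first_split_only)  # ['Twelve:', ' Eighty nine:89 Nine:9.']
print(unsplit)           # ['no digits here']
```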
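
re.sub()

The syntax of re.sub() is:

re.sub(pattern, replace, string)

The method returns a string in which every match of pattern is replaced with the content of replace.

Example:

# Program to remove all whitespace
import re

# multiline string
string = 'abc 12\
de 23 \n f45 6'

# matches all whitespace characters
pattern = r'\s+'

# empty string
replace = ''

new_string = re.sub(pattern, replace, string)
print(new_string)

# Output: abc12de23f456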
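
A minimal sketch of two further behaviours of re.sub(): the optional count argument, which is not covered on the slide, and the result when nothing matches. The sample strings are hypothetical.

```python
import re

string = 'abc 12 de 23'  # hypothetical sample text

# Replace every run of whitespace with the empty string.
no_spaces = re.sub(r'\s+', '', string)

# count=2 limits the work to the first two matches.
first_two = re.sub(r'\s+', '', string, count=2)

# No match: the original string comes back unchanged.
unchanged = re.sub(r'\d+', '#', 'no digits')

print(no_spaces)  # abc12de23
print(first_two)  # abc12de 23
print(unchanged)  # no digits
```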

re.subn()

The re.subn() method is similar to re.sub() except it returns a tuple of 2 items containing the new string and the number of substitutions made.

Example:

# Program to remove all whitespace
import re

# multiline string
string = 'abc 12\
de 23 \n f45 6'

# matches all whitespace characters
pattern = r'\s+'

# empty string
replace = ''

new_string = re.subn(pattern, replace, string)
print(new_string)

# Output: ('abc12de23f456', 4)
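
Since re.subn() returns a tuple, it is often convenient to unpack it into two names. A small sketch, with a hypothetical input string and variable names:

```python
import re

# Unpack the (new_string, number_of_substitutions) tuple directly.
cleaned, substitutions = re.subn(r'\s+', '', 'abc 12 de 23')

print(cleaned)        # abc12de23
print(substitutions)  # 3
```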

re.search()

The re.search() method takes two arguments: a pattern and a string. The method looks for the first location where the RegEx pattern produces a match with the string. If the search is successful, re.search() returns a match object; if not, it returns None.

match = re.search(pattern, string)

Example:

import re

string = "Python is fun"

# check if 'Python' is at the beginning
match = re.search(r'\APython', string)

if match:
    print("pattern found inside the string")
else:
    print("pattern not found")

# Output: pattern found inside the string

Here, match contains a match object.
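
To make the difference between re.match() and re.search() concrete, here is a sketch with a hypothetical sentence in which Python is not the first word.

```python
import re

text = 'I think Python is fun'  # hypothetical sample text

# re.match() only looks at the start of the string.
at_start = re.match(r'Python', text)

# re.search() scans the whole string for the first match.
anywhere = re.search(r'Python', text)

# \A pins the search to the start, so this behaves like re.match().
anchored = re.search(r'\APython', text)

print(at_start)          # None
print(anywhere.group())  # Python
print(anchored)          # None
```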


Match Object

You can get the methods and attributes of a match object using the dir() function. Some of the commonly used methods and attributes of match objects are:

match.group()

The group() method returns the part of the string where there is a match.

import re

string = '39801 356, 2102 1111'

# Three-digit number followed by a space followed by a two-digit number
pattern = r'(\d{3}) (\d{2})'

# match variable contains a Match object.
match = re.search(pattern, string)

if match:
    print(match.group())
else:
    print("pattern not found")

# Output: 801 35

Here, the match variable contains a match object.

Our pattern (\d{3}) (\d{2}) has two subgroups, (\d{3}) and (\d{2}). You can get the part of the string matched by each of these parenthesized subgroups. Here's how:

>>> match.group(1)
'801'

>>> match.group(2)
'35'

>>> match.group(1, 2)
('801', '35')

>>> match.groups()
('801', '35')
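
The group calls above can be combined in a short script. This is a sketch only; the names whole, first and second are hypothetical.

```python
import re

match = re.search(r'(\d{3}) (\d{2})', '39801 356, 2102 1111')

if match:
    # group(0) is the whole match, the same as group() with no argument.
    whole = match.group(0)
    # groups() returns all subgroups as a tuple, which can be unpacked.
    first, second = match.groups()
    print(whole)          # 801 35
    print(first, second)  # 801 35
```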

match.start(), match.end() and match.span()

The start() function returns the index of the start of the matched substring. Similarly, end() returns the end index of the matched substring.

>>> match.start()
2
>>> match.end()
8

The span() function returns a tuple containing the start and end index of the matched part.

>>> match.span()
(2, 8)
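
The indices returned by span() can be used to slice the original string, which gives back the matched text. The sketch below also passes a group number to start() and span(), an option not shown on the slide.

```python
import re

string = '39801 356, 2102 1111'
match = re.search(r'(\d{3}) (\d{2})', string)

# Slicing with the span reproduces the matched substring.
start, end = match.span()
sliced = string[start:end]
print(sliced)                   # 801 35
print(sliced == match.group())  # True

# A group number restricts the indices to that subgroup.
print(match.start(1))  # 2
print(match.span(2))   # (6, 8)
```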

match.re and match.string

The re attribute of a match object returns a regular expression object. Similarly, the string attribute returns the passed string.

>>> match.re
re.compile('(\\d{3}) (\\d{2})')

>>> match.string
'39801 356, 2102 1111'
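
One possible use of these two attributes, sketched under the assumption that the same pattern should be run again over the same input: the compiled pattern in match.re can be reused on match.string.

```python
import re

match = re.search(r'(\d{3}) (\d{2})', '39801 356, 2102 1111')

# The compiled pattern object exposes its source text as .pattern.
print(match.re.pattern)  # (\d{3}) (\d{2})

# Reuse the compiled pattern on the original string to find every match.
# With two subgroups, findall() returns a list of tuples.
all_pairs = match.re.findall(match.string)
print(all_pairs)  # [('801', '35'), ('102', '11')]
```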
