0% found this document useful (0 votes)

2 views

regular exp

Regular Expressions (RegEx) are sequences of characters that define search patterns for strings, allowing for matching, splitting, and replacing text. Python's built-in 're' module provides functions such as search, match, findall, split, and sub to work with RegEx. Meta characters and special sequences are used to enhance pattern matching capabilities in strings.

Uploaded by

sr0935364

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

regular exp

Uploaded by

sr0935364

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Regular Expressions

A Regular Expressions (RegEx) is a special sequence of characters that uses a search pattern to find a
string or set of strings. It can detect the presence or absence of a text by matching it with a particular
pattern, and also can split a pattern into one or more sub-patterns. Python provides a re module that
supports the use of regex in Python. Its primary function is to offer a search, where it takes a regular
expression and a string. Here, it either returns the first match or else none.

RegEx Module

Python has a built-in package called re, which can be used to work with Regular Expressions.

Import the re module:

import re

RegEx in Python

When you have imported the re module, you can start using regular expressions:

Example-1:

Search the string to see if it starts with "My"and ends with "nagendra":

import re

a = "My name is nagendra"

x = re.search("^My.*nagendra$",a)

if x:

print("YES! We have a match!")

else:

print("No match")

output:

YES! We have a match!

Example-2:

import re

s = 'my name is nagendra'

print(re.search('nagendra', s))

output:

<re.Match object; span=(11, 19), match='nagendra'>

RegEx Functions

The re module offers a set of functions that allows us to search a string for match:
Function Description

Findall Returns a list containing all matches

Search Returns a Match object, if there is a match anywhere in the string

Split Returns a list where the string has been split at each match

Sub Replaces one or many matches with a string

Match Returns a Match object if there is a match starting in the string.

The match() Function

The match() function searches the string for a match, and returns a Match object if there is a match at
the starting of my string.

Example-1:

import re

a = "my name is nagendra, my age is 24"

x = re.match("my", a)

print(x)

Output:

<re.Match object; span=(0, 2), match='My'>

Example-2:

import re

a = "my name is nagendra, my age is 24"

x = re.match("The", a)

print(x)

Output:

None

The search() Function

The search() function searches the string for a match, and returns a Match,if there is a match.

If there is more than one match, only the first occurrence of the match will be returned:

Match Object

A Match Object is an object containing information about the search and the result.

Note: If there is no match, the value None will be returned, instead of the Match Object.

Example-1:
import re

a = "my name is nagendra, my age is 24"

x = re.search("my", a) #here, my is match object

print(x)

Output:

<re.Match object; span=(0, 2), match='My'>

Example-2:

import re

a = "my name is nagendra, my age is 24"

x = re.search("24", a)

print(x)

Output:

<re.Match object; span=(31, 33), match='24'>

The findall() Function

The findall() function returns a list containing all matches.

Example-1:

import re

a = "my name is nagendra, my age is 24"

x = re.findall("My", a)

print(x)

Output:

['my', 'my']

The split() Function

The split() function returns a list where the string has been split at each match:

Example-1:

import re

a = "my name is nagendra, my age is 24"

x = re.split("nagendra", a)

print(x)
Output:

['my name is ', ', my age is 24']

The sub() Function

The sub() function replaces the matches with the text of your choice:

Example-1:

import re

a = "my name is nagendra, my age is 24"

x = re.sub("nagendra","rahul", a)

print(x)

Output:

my name is rahul, my age is 24

Meta Characters

To understand the RE analogy, Meta Characters are useful, important, and will be used in functions of
module re. Below is the list of meta characters.

Meta Characters Description

\ -Used to drop the special meaning of character following it

[] -Represent a character class

^ -Matches the beginning

$ -Matches the end

. -Matches any character except newline

| -Means OR (Matches with any of the characters separated by it.

? Matches zero or one occurrence

* -Any number of occurrences (including 0 occurrences)

+ -One or more occurrences

{} -Indicate the number of occurrences of a preceding regex to match.

() -Enclose a group of Regex

[ ]- (Square Brackets)

Square Brackets ([]) represent a character class consisting of a set of

characters that we wish to match. For example, the character class [abc]

will match any single a, b, or c.

We can also specify a range of characters using – inside the square brackets. For example,

• [0-3] is sample as [0123]

• [a-c] is same as [abc]

We can also invert the character class using the caret(^) symbol. For example,

• [^0-3] means any number except 0, 1, 2, or 3

• [^a-c] means any character except a, b, or c

example-1:

import re

a= "my name is nagendra"

print(re.findall("[a-m]”, a))

output:

['m', 'a', 'm', 'e', 'i', 'a', 'g', 'e', 'd', 'a']

example-2:

import re

a= "my name is nagendra, my age is 25"

print(re.findall("[0-9]”, a))

output:

['2', '5']

^ (Caret)

Caret (^) symbol matches the beginning of the string i.e. checks whether

the string starts with the given character(s) or not. For example –

• ^B will check if the string starts with g such as Btech, Ball, BOX etc.

• ^BTECH will check if the string starts with BTECH such as BTECH

HYDERABAD, BTECH AIML, BTECH CSE etc.

example-1

import re

a = 'Btech hyderabad'

result = re.match(‘^Btech’, a)

if result:

print("Search successful.")
else:

print("Search unsuccessful.")

output: Search successful.

$ (Dollar)

Dollar($) symbol matches the end of the string i.e checks whether the string ends with the given
character(s) or not. For example –

• s$ will check for the string that ends with a such as geeks, ends, setc.

• ks$ will check for the string that ends with ks such as marks, ks, etc.

example-1:

import re

a = 'Btech'

result = re.search(‘h$’, a)

if result:

print("Search successful.")

else:

print("Search unsuccessful.")

output: Search successful.

. (Dot)

Dot(.) symbol matches only a single character except for the newline character (\n). For example –

• a.b will check for the string that contains any character at the place of the dot such as acb, adb, arb,
a1b, etc...

• .. will check if the string contains at least 2 characters.for example

a..b will check for the string that contains any two character at the place of the dot such as acrb,
adhb, arfb, a12b, etc…

example-1:

import re

a= "hello hyderabad"

x = re.findall("he..o", a)

print(x)

output:['hello']
| (Or)

Or symbol works as the or operator meaning it checks whether the pattern before or after the or
symbol is present in the string or not. For example –

• btech|mtech will match any string that contains btech or mtech.

Example-1:

import re

a= "i am from btech and i am from mtech "

x = re.findall("btech|mtech", a)

print(x)

output:

['btech', 'mtech']

Example-2:

import re

a= "i am nagendra and i am from BTECH"

x = re.findall(" BTECH | MTECH ", a)

print(x)

output:

['btech']

? (Question Mark)

The question mark symbol ? matches zero or one occurrence of the pattern left to it.

Expression String Matched?

ma?n mn 1 match

man 1 match

maaan No match (more than one a character)

main No match (a is not followed by n)

woman 1 match

example-1:

import re
a= "i am a man"

x = re.findall("ma?n", a)

print(x)

output:

['man']

example-2:

import re

a= "i am a maaaan"

x = re.findall("ma?n", a)

print(x)

output:

[ ] ( output is empty because a repeated more than once. The question mark symbol ? matches zero or
one occurrence of the pattern left to it.)

* (Star)

The star symbol * matches zero or more occurrences of the pattern left to it.

Expression String Matched?

ma*n mn 1 match

man 1 match

maaan 1 match

main No match (a is not followed by n)

woman 1 match

example-1:

import re

a= "i am a maaan"

x = re.findall("ma*n", a)

print(x)

output: ['maaan']

+ (Plus)

The plus symbol + matches one or more occurrences of the pattern left to it.

Expression String Matched?

ma+n mn No match (no a character)

man 1 match

maaan 1 match

main No match (a is not followed by n)

woman 1 match

example-1:

import re

a= "i am a maaan"

x = re.findall("ma+n", a)

print(x)

output:['maaan']

{ } (Braces)

Consider this code: {n,m}. This means at least n, and at most m repetitions of the pattern left to it.

Example-1:

import re

a= "my name is nagendra, my age is 25"

x = re.findall("my{1,3}", a)

print(x)

output: ['my', 'my']

from above pattern my{1,3} mean --if “my” is present at least once and maximum three time then it
will print “my” from above example “my” is present twice, so it will print my twice

( ) -Group

Group symbol is used to group sub-patterns.

List of special sequences

Special sequences do not match for the actual character in the string ,instead it tells the specific
location in the search string where the match must occur. It makes it easier to write commonly used
patterns.

Special Sequence Description

\A Matches if the string begins with the given character

\b Matches if the word begins or ends with the given character.\b(string) will check for the
beginning of the word and (string)\b will check for the ending of the word.

\B It is the opposite of the \b i.e. the string should not start or end with the given regex.
\d Matches any decimal digit, this is equivalent to the set class [0-9]

\D Matches any non-digit character, this is equivalent to the set class [^0-9]

\s Matches any whitespace character.

\S Matches any non-whitespace character

\w Matches any alphanumeric character, this is equivalent to the class [a-zA-Z0-9_].

\W Matches any non-alphanumeric character.

\Z Matches if the string ends with the given regex

Program:

import re

a="my name is nagendra, my age is 25"

b=re.findall("\Amy",a) output of \A is ['my']

c=re.findall("\w",a) output of \w is ['m', 'y', 'n', 'a', 'm', 'e', 'i', 's', 'n', 'a', 'g', 'e', 'n', 'd', 'r', 'a',
'm', 'y', 'a', 'g', 'e', 'i', 's', '2', '5']

d=re.findall("\W",a) output of \W is ['', '', '', ',', '', '', '', '']

e=re.findall("\d",a) output of \d is ['2', '5']

f=re.findall("\D",a) output of \D is ['m', 'y', '', 'n', 'a', 'm', 'e', '', 'i', 's', '', 'n', 'a', 'g', 'e', 'n', 'd','r', 'a', ',', '',

'm', 'y', '', 'a', 'g', 'e', '', 'i', 's', '']

g=re.findall("\s",a) output of \s is ['', '', '', '', '', '', '']

h=re.findall("\S",a) output of \S is ['m', 'y', 'n', 'a', 'm', 'e', 'i', 's', 'n', 'a', 'g', 'e', 'n', 'd', 'r', 'a',

',', 'm', 'y', 'a', 'g', 'e', 'i', 's', '2', '5']

i=re.findall(r"\bna", a) output of \b is ['na’]

j=re.findall(r"ra\b",a) output of \b is ['ra’]

print("output of \A is ",b) , print("output of \w is ",c),

print("output of \W is ",d),print("output of \d is ",e)

print("output of \D is ",f),print("output of \s is ",g)

print("output of \S is ",h)

print("output of \b is ",i), print("output of \b is ",j)

Neetcode 150 Solution
No ratings yet
Neetcode 150 Solution
74 pages
Java For Everyone
No ratings yet
Java For Everyone
6 pages
MS Excel Trade Test Actual Part 1
No ratings yet
MS Excel Trade Test Actual Part 1
4 pages
Python unit 3
No ratings yet
Python unit 3
46 pages
Python Complete Unit 3
No ratings yet
Python Complete Unit 3
40 pages
Regular Expression
No ratings yet
Regular Expression
21 pages
Regular Expression l
No ratings yet
Regular Expression l
20 pages
Lecture 7 Re Part2 Split
No ratings yet
Lecture 7 Re Part2 Split
8 pages
unit 4 Regular expression
No ratings yet
unit 4 Regular expression
16 pages
Unit-3 Python
No ratings yet
Unit-3 Python
72 pages
Regular Expressions
No ratings yet
Regular Expressions
104 pages
Manipulating Text with Regular Expression in python
No ratings yet
Manipulating Text with Regular Expression in python
4 pages
Regexp in TCL
No ratings yet
Regexp in TCL
4 pages
9.RegEx (1)
No ratings yet
9.RegEx (1)
57 pages
Regular Expression
No ratings yet
Regular Expression
18 pages
RegEx 1
No ratings yet
RegEx 1
48 pages
Python Regex: Re - Match, Re - Search, Re - Findall With Example
No ratings yet
Python Regex: Re - Match, Re - Search, Re - Findall With Example
10 pages
Lecture 9 Python
No ratings yet
Lecture 9 Python
8 pages
Unit7_RegularExpressionpdf__2023_10_17_09_16_29
No ratings yet
Unit7_RegularExpressionpdf__2023_10_17_09_16_29
17 pages
RegEx in Python (4)
No ratings yet
RegEx in Python (4)
6 pages
Python Regular Expression
100% (1)
Python Regular Expression
31 pages
(Python) Regex Cheat Sheet
100% (1)
(Python) Regex Cheat Sheet
1 page
css unit 5 dev notes
No ratings yet
css unit 5 dev notes
13 pages
03 Lexical Analysis
No ratings yet
03 Lexical Analysis
86 pages
Python Regex
No ratings yet
Python Regex
8 pages
Chapter - 11 - Regular Expressions
100% (1)
Chapter - 11 - Regular Expressions
10 pages
Python Function
No ratings yet
Python Function
11 pages
Regular Expressions in Java
No ratings yet
Regular Expressions in Java
30 pages
Python Training Tutorials
No ratings yet
Python Training Tutorials
40 pages
week6
No ratings yet
week6
5 pages
Python Programming Exercise
No ratings yet
Python Programming Exercise
7 pages
Regex (1)
No ratings yet
Regex (1)
6 pages
Python Module-3 QB Solution (21EC643)
No ratings yet
Python Module-3 QB Solution (21EC643)
24 pages
Unit 4 - Regular Expressions
No ratings yet
Unit 4 - Regular Expressions
20 pages
Raunakmalkani 20BIT032 Assignment RegularExpressions
No ratings yet
Raunakmalkani 20BIT032 Assignment RegularExpressions
14 pages
Regular Exp
No ratings yet
Regular Exp
6 pages
Hi
No ratings yet
Hi
4 pages
Python Regular Expressions
No ratings yet
Python Regular Expressions
14 pages
Write A Python Function To Find The Max of Three Numbers
No ratings yet
Write A Python Function To Find The Max of Three Numbers
4 pages
Regex Notes
No ratings yet
Regex Notes
2 pages
2021 Uam 2107
No ratings yet
2021 Uam 2107
8 pages
Summary On Java Expression
No ratings yet
Summary On Java Expression
18 pages
UNIT - 4 REGEX
No ratings yet
UNIT - 4 REGEX
28 pages
Python Libraries
No ratings yet
Python Libraries
22 pages
17_Regular Expression
No ratings yet
17_Regular Expression
20 pages
Python Reg Expressions PDF
No ratings yet
Python Reg Expressions PDF
8 pages
Metacharacters in Python
No ratings yet
Metacharacters in Python
7 pages
Python Re
No ratings yet
Python Re
18 pages
Sequences, Strings and Sets
No ratings yet
Sequences, Strings and Sets
40 pages
STRING DATA TYPE
No ratings yet
STRING DATA TYPE
13 pages
Nuevo Documento de Texto
No ratings yet
Nuevo Documento de Texto
6 pages
Python Notes by MR Saem
No ratings yet
Python Notes by MR Saem
114 pages
Module 4 - Regular Expressions1
No ratings yet
Module 4 - Regular Expressions1
37 pages
New_python_programs
No ratings yet
New_python_programs
53 pages
Python Day- 10
No ratings yet
Python Day- 10
11 pages
CSCI Final Cheat Sheet
No ratings yet
CSCI Final Cheat Sheet
14 pages
Program Design Notes
No ratings yet
Program Design Notes
6 pages
String R
No ratings yet
String R
6 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Ian Talks Regex A-Z
From Everand
Ian Talks Regex A-Z
Ian Eress
No ratings yet
COA(4th)May2023
No ratings yet
COA(4th)May2023
2 pages
relational algebra
No ratings yet
relational algebra
3 pages
DE(3rd)Dec2023
No ratings yet
DE(3rd)Dec2023
2 pages
SCP(1st-2nd)Dec2020
No ratings yet
SCP(1st-2nd)Dec2020
2 pages
SCP(1st-2nd)May2019
No ratings yet
SCP(1st-2nd)May2019
2 pages
Topic - 6 (Logical Agents)
No ratings yet
Topic - 6 (Logical Agents)
32 pages
MP_unit 1_new
No ratings yet
MP_unit 1_new
84 pages
ccs372 Vir Manual
No ratings yet
ccs372 Vir Manual
120 pages
Foundation Course in English-2 (FEG-02) Assignment
No ratings yet
Foundation Course in English-2 (FEG-02) Assignment
2 pages
Function Introduction : Operation Instructions For The Cloning Function For 4 Generation Gearbox of Mercedes-Benz
No ratings yet
Function Introduction : Operation Instructions For The Cloning Function For 4 Generation Gearbox of Mercedes-Benz
12 pages
Assistive Technologies
No ratings yet
Assistive Technologies
4 pages
CAN_CANopen_MN67042_ENG
No ratings yet
CAN_CANopen_MN67042_ENG
40 pages
Question Bank Poc PDF
No ratings yet
Question Bank Poc PDF
3 pages
Switchboard Instruments Data Sheet 4921210012 UK
No ratings yet
Switchboard Instruments Data Sheet 4921210012 UK
8 pages
Abutment Lateral Movement and Pile Member Force Check
No ratings yet
Abutment Lateral Movement and Pile Member Force Check
17 pages
Concept Paper
No ratings yet
Concept Paper
5 pages
Chatbot An Education Support System
No ratings yet
Chatbot An Education Support System
12 pages
Right TO Privacy: S.S. Jain Subodh Law College
No ratings yet
Right TO Privacy: S.S. Jain Subodh Law College
15 pages
Sap PP Tcode 3
No ratings yet
Sap PP Tcode 3
6 pages
EDIFACT
No ratings yet
EDIFACT
71 pages
Json - JSON Encoder and Decoder - Python 3.7.1rc1 Documentation
No ratings yet
Json - JSON Encoder and Decoder - Python 3.7.1rc1 Documentation
12 pages
ATMdesk Field Setup Manual
No ratings yet
ATMdesk Field Setup Manual
29 pages
Lecture 3 Control Objectives (Cobit)
No ratings yet
Lecture 3 Control Objectives (Cobit)
23 pages
Bảng tra Mã Gạt máy in
No ratings yet
Bảng tra Mã Gạt máy in
9 pages
Whitepaper Navi Trans 50
No ratings yet
Whitepaper Navi Trans 50
8 pages
Lecture 5
No ratings yet
Lecture 5
66 pages
8.2.5.4 Lab - Identifying IPv6 Addresses
50% (2)
8.2.5.4 Lab - Identifying IPv6 Addresses
10 pages
Stratix 2000 Ethernet Unmanaged Switches: Installation Instructions
No ratings yet
Stratix 2000 Ethernet Unmanaged Switches: Installation Instructions
10 pages
Azure Reference Architectures - Microsoft Docs
No ratings yet
Azure Reference Architectures - Microsoft Docs
5 pages
Id:69123 SAT69 - MM Validate Run MPS Single Item
No ratings yet
Id:69123 SAT69 - MM Validate Run MPS Single Item
34 pages
Sally S. Smith: Customer Success Manager
No ratings yet
Sally S. Smith: Customer Success Manager
2 pages
Ground-Fault Protection
No ratings yet
Ground-Fault Protection
3 pages
WhitePaper - Omnichannel Retailing When It Becomes A Commodity What Then
No ratings yet
WhitePaper - Omnichannel Retailing When It Becomes A Commodity What Then
11 pages
Course Info Math 215/255
No ratings yet
Course Info Math 215/255
4 pages