0% found this document useful (0 votes)

431 views

Multithreading in Python

The document discusses multithreading in Python. It explains that multithreading allows a process to execute multiple threads concurrently by frequently switching between them. It provides an example to demonstrate creating and running threads concurrently in Python using the threading module. It also discusses potential issues like race conditions that can occur due to concurrent access to shared resources by multiple threads.

Uploaded by

angelfree68

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

431 views

Multithreading in Python

Uploaded by

angelfree68

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Multithreading in Python

13 Jul 2017

Note: This article has also featured on geeksforgeeks.org . This article covers the basics of multithreading in Python
programming language.

Just like multiprocessing, multithreading is a way of achieving multitasking. In multithreading, the concept of
threads is used.

Let us first understand the concept of thread in computer architecture.

Thread
In computing, a process is an instance of a computer program that is being executed. Any process has 3 basic
components:

 An executable program.
 The associated data needed by the program (variables, work space, buffers, etc.)
 The execution context of the program (State of process)

A thread is an entity within a process that can be scheduled for execution. Also, it is the smallest unit of processing
that can be performed in an OS (Operating System).

In simple words, a thread is a sequence of such instructions within a program that can be executed independently
of other code. For simplicity, you can assume that a thread is simply a subset of a process!

A thread contains all this information in a Thread Control Block (TCB):

 Thread Identifier: Unique id (TID) is

assigned to every new thread
 Stack pointer: Points to thread’s
stack in the process. Stack contains
the local variables under thread’s
scope.
 Program counter: a register which
stores the address of the instruction
currently being executed by thread.
 Thread state: can be running, ready,
waiting, start or done.
 Thread’s register set: registers
assigned to thread for computations.
 Parent process Pointer: A pointer to
the Process control block (PCB) of the
process that the thread lives on.

Consider the diagram to understand the

relation between process and its thread.
Multithreading
Multiple threads can exist within one process where:

 Each thread contains its own register set and local variables (stored in stack).
 All thread of a process share global variables (stored in heap) and the program code.

Consider the diagram below to understand how multiple threads exist in memory:

Multithreading is defined as the ability of a processor to execute multiple threads concurrently.

In a simple, single-core CPU, it is achieved using frequent switching between threads. This is termed as context
switching. In context switching, the state of a thread is saved and state of another thread is loaded whenever any
interrupt (due to I/O or manually set) takes place. Context switching takes place so frequently that all the threads
appear to be running parallely (this is termed as multitasking).

Consider the diagram below in which a process contains two active threads:
Multithreading in Python
In Python, the threading module provides a very simple and intuitive API for spawning multiple threads in a
program.

Let us consider a simple example using threading module:

1 # importing the threading module

2 import threading
3
4 def print_cube(num):
5 """
6 function to print cube of given num
7 """
8 print("Cube: {}".format(num * num * num))
9
10def print_square(num):
11 """
12 function to print square of given num
13 """
14 print("Square: {}".format(num * num))
15
16if __name__ == "__main__":
17 # creating thread
18 t1 = threading.Thread(target=print_square, args=(10,))
19 t2 = threading.Thread(target=print_cube, args=(10,))
20
21 # starting thread 1
22 t1.start()
23 # starting thread 2
24 t2.start()
25
26 # wait until thread 1 is completely executed
27 t1.join()
28 # wait until thread 2 is completely executed
29 t2.join()
30
31 # both threads completely executed
32 print("Done!")
Square: 100
Cube: 1000
Done!

Let us try to understand the above code:

 To import the threading module, we do:

 import threading
 To create a new thread, we create an object of Thread class. It takes following arguments:
o target: the function to be executed by thread
o args: the arguments to be passed to the target function

In above example, we created 2 threads with different target functions:

t1 = threading.Thread(target=print_square, args=(10,))
t2 = threading.Thread(target=print_cube, args=(10,))
 To start a thread, we use start method of Thread class.
 t1.start()
 t2.start()
 Once the threads start, the current program (you can
think of it like a main thread) also keeps on executing. In
order to stop execution of current program until a thread
is complete, we use join method.
 t1.join()
 t2.join()

As a result, the current program will first wait for the

completion of t1 and then t2. Once, they are finished, the
remaining statements of current program are executed.

Consider the diagram for a better understanding of how above

program works:

Now, consider the python program given below in which we print thread name and corresponding process for each
task:

import threading
1 import os
2
3 def task1():
4 print("Task 1 assigned to thread:
5 {}".format(threading.current_thread().name))
6 print("ID of process running task 1: {}".format(os.getpid()))
7
8 def task2():
9 print("Task 2 assigned to thread:
10{}".format(threading.current_thread().name))
11 print("ID of process running task 2: {}".format(os.getpid()))
12
13if __name__ == "__main__":
14
15 # print ID of current process
16 print("ID of process running main program: {}".format(os.getpid()))
17
18 # print name of main thread
19 print("Main thread name: {}".format(threading.main_thread().name))
20
21 # creating threads
22 t1 = threading.Thread(target=task1, name='t1')
23 t2 = threading.Thread(target=task2, name='t2')
24
25 # starting threads
26 t1.start()
27 t2.start()
28
29 # wait until all threads finish
30 t1.join()
t2.join()
ID of process running main program: 11758
Main thread name: MainThread
Task 1 assigned to thread: t1
ID of process running task 1: 11758
Task 2 assigned to thread: t2
ID of process running task 2: 11758

Let us try to understand the above code:

 We use os.getpid() function to get ID of current process.

 print("ID of process running main program: {}".format(os.getpid()))

As it is clear from the output, the process ID remains same for all threads.

 We use threading.main_thread() function to get the main thread object. In normal conditions, the main
thread is the thread from which the Python interpreter was started. name attribute of thread object is
used to get the name of thread.
 print("Main thread name: {}".format(threading.main_thread().name))
 We use the threading.current_thread() function to get the current thread object.
 print("Task 1 assigned to thread:
{}".format(threading.current_thread().name))

The diagram given below clears the above concept:

So, this was a brief introduction to multithreading in Python. The next article in this series covers synchronization
between multiple threads.

Synchronization between threads

Thread synchronization is defined as a mechanism which ensures that two or more concurrent threads do not
simultaneously execute some particular program segment known as critical section.

Critical section refers to the parts of the program where the shared resource is accessed. For example, in the
diagram below, 3 threads try to access shared resource or critical section at the same time.
Concurrent accesses to shared resource can lead to race condition.

A race condition occurs when two or more threads can access shared data and they try to change it at the same
time. As a result, the values of variables may be unpredictable and vary depending on the timings of context
switches of the processes.

Consider the program below to understand the concept of race condition:

1 import threading
2
3 # global variable x
4 x = 0
5
6 def increment():
7 """
8 function to increment global variable x
9 """
10global x
11x += 1
12
13def thread_task():
14"""
15task for thread
16calls increment function 100000 times.
17"""
18for _ in range(100000):
19increment()
20
21def main_task():
22global x
23# setting global variable x as 0
24x = 0
25
26# creating threads
27t1 = threading.Thread(target=thread_task)
28t2 = threading.Thread(target=thread_task)
29
30# start threads
31t1.start()
32t2.start()
33
34# wait until threads finish their job
35t1.join()
36t2.join()
37
38if __name__ == "__main__":
39for i in range(10):
40main_task()
41print("Iteration {0}: x = {1}".format(i,x))

Output:

Iteration 0: x = 175005
Iteration 1: x = 200000
Iteration 2: x = 200000
Iteration 3: x = 169432
Iteration 4: x = 153316
Iteration 5: x = 200000
Iteration 6: x = 167322
Iteration 7: x = 200000
Iteration 8: x = 169917
Iteration 9: x = 153589

In above program:

 Two threads t1 and t2 are created in main_task function and global variable x is set to 0.
 Each thread has a target function thread_task in which increment function is called 100000 times.
 increment function will increment the global variable x by 1 in each call.

The expected final value of x is 200000 but what we get in 10 iterations of main_task function is some different
values.

This happens due to concurrent access of threads to the shared variable x. This unpredictability in value of x is
nothing but race condition.

Given below is a diagram which shows how can race condition occur in above program:

Notice that expected value of x in above diagram is 12 but due to race condition, it turns out to be 11!

Hence, we need a tool for proper synchronization between multiple threads.

Using Locks
threading module provides a Lock class to deal with the race conditions. Lock is implemented using a Semaphore
object provided by the Operating System.

A semaphore is a synchronization object that controls access by multiple processes/threads to a common resource
in a parallel programming environment. It is simply a value in a designated place in operating system (or kernel)
storage that each process/thread can check and then change. Depending on the value that is found, the
process/thread can use the resource or will find that it is already in use and must wait for some period before
trying again. Semaphores can be binary (0 or 1) or can have additional values. Typically, a process/thread using
semaphores checks the value and then, if it using the resource, changes the value to reflect this so that subsequent
semaphore users will know to wait.

Lock class provides following methods:

 acquire([blocking]) : To acquire a lock. A lock can be blocking or non-blocking.

o When invoked with the blocking argument set to True (the default), thread execution is blocked
until the lock is unlocked, then lock is set to locked and return True.
o When invoked with the blocking argument set to False, thread execution is not blocked. If lock is
unlocked, then set it to locked and return True else return False immediately.
 release() : To release a lock.
o When the lock is locked, reset it to unlocked, and return. If any other threads are blocked waiting
for the lock to become unlocked, allow exactly one of them to proceed.
o If lock is already unlocked, a ThreadError is raised.

Consider the example given below:

1 import threading
2
3 # global variable x
4 x = 0
5
6 def increment():
7 """
8 function to increment global variable x
9 """
10global x
11x += 1
12
13def thread_task(lock):
14"""
15task for thread
16calls increment function 100000 times.
17"""
18for _ in range(100000):
19lock.acquire()
20increment()
21lock.release()
22
23def main_task():
24global x
25# setting global variable x as 0
26x = 0
27
28# creating a lock
29lock = threading.Lock()
30
31# creating threads
32t1 = threading.Thread(target=thread_task, args=(lock,))
33t2 = threading.Thread(target=thread_task, args=(lock,))
34
35# start threads
36t1.start()
37t2.start()
38
39# wait until threads finish their job
40t1.join()
41t2.join()
42
43if __name__ == "__main__":
44for i in range(10):
45main_task()
46print("Iteration {0}: x = {1}".format(i,x))

Output:
Iteration 0: x = 200000
Iteration 1: x = 200000
Iteration 2: x = 200000
Iteration 3: x = 200000
Iteration 4: x = 200000
Iteration 5: x = 200000
Iteration 6: x = 200000
Iteration 7: x = 200000
Iteration 8: x = 200000
Iteration 9: x = 200000

Let us try to understand the above code step by step:

 Firstly, a Lock object is created using:

 lock = threading.Lock()
 Then, lock is passed as target function argument:
 t1 = threading.Thread(target=thread_task, args=(lock,))
 t2 = threading.Thread(target=thread_task, args=(lock,))
 In the critical section of target function, we apply lock using lock.acquire() method. As soon as a lock is
acquired, no other thread can access the critical section (here, increment function) until the lock is
released using lock.release() method.
 lock.acquire()
 increment()
 lock.release()

As you can see in the results, the final value of x comes out to be 200000 every time (which is the expected final
result). Here is a diagram given below which depicts the implementation of locks in above program:
This brings us to the end of this tutorial series on Multithreading in Python.
Finally, here are are a few advantages and disadvantages of multithreading:

Advantages:

 It doesn’t block the user. This is because threads are independent of each other.
 Better use of system resources is possible since threads execute tasks parallely.
 Enhanced performance on multi-processor machines.
 Multi-threaded servers and interactive GUIs use multithreading exclusively.

Disadvantages:

 As number of threads increase, complexity increases.

 Synchronization of shared resources (objects, data) is necessary.
 It is difficult to debug, result is sometimes unpredictable.
 Potential deadlocks which leads to starvation, i.e. some threads may not be served with a bad design
 Constructing and synchronizing threads is CPU/memory intensive.

Natural language processing with TensorFlow Teach language to machines using Python s deep learning library 1st Edition Thushan Ganegedara 2024 scribd download
50% (2)
Natural language processing with TensorFlow Teach language to machines using Python s deep learning library 1st Edition Thushan Ganegedara 2024 scribd download
62 pages
Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet
POS Tagging
No ratings yet
POS Tagging
11 pages
Origami Piezas de Ajedrez
No ratings yet
Origami Piezas de Ajedrez
5 pages
Shuki Kato - Wolf
100% (6)
Shuki Kato - Wolf
8 pages
Programming Essentials in Python Introduction To Python
No ratings yet
Programming Essentials in Python Introduction To Python
33 pages
Xzno22222222222 PDF
No ratings yet
Xzno22222222222 PDF
278 pages
1 Python Multithreading and Multiprocessing Tutorial
No ratings yet
1 Python Multithreading and Multiprocessing Tutorial
14 pages
Multithreading and Multiprocessing
No ratings yet
Multithreading and Multiprocessing
3 pages
CS263 - Bayesian Decision Theory
No ratings yet
CS263 - Bayesian Decision Theory
16 pages
(SpringerBriefs in Mathematics) Qi He, Le Yi Wang, George G. Yin - System Identification Using Regular and Quantized Observations - Applications of Large Deviations Principles-Springer (2013)
No ratings yet
(SpringerBriefs in Mathematics) Qi He, Le Yi Wang, George G. Yin - System Identification Using Regular and Quantized Observations - Applications of Large Deviations Principles-Springer (2013)
108 pages
Data Fusion Methodology and Applications Marina Cocchi 2024 Scribd Download
100% (3)
Data Fusion Methodology and Applications Marina Cocchi 2024 Scribd Download
49 pages
Model Checking
No ratings yet
Model Checking
6 pages
Teaching Bayesian Method
No ratings yet
Teaching Bayesian Method
20 pages
DSP Filter Design With Sptool Matlab
No ratings yet
DSP Filter Design With Sptool Matlab
6 pages
Revision - Bayesian Inference
No ratings yet
Revision - Bayesian Inference
4 pages
Project and Process Metrices
No ratings yet
Project and Process Metrices
5 pages
Explain Machine Learning Model Using SHAP
No ratings yet
Explain Machine Learning Model Using SHAP
28 pages
AI Assignment
No ratings yet
AI Assignment
6 pages
Bayesian Model Updating
No ratings yet
Bayesian Model Updating
26 pages
Bayesian Inference
No ratings yet
Bayesian Inference
5 pages
A Recurrent Neural Network
No ratings yet
A Recurrent Neural Network
22 pages
IEEE 13 Node Test Feeder
No ratings yet
IEEE 13 Node Test Feeder
11 pages
OS Lecture3 - Inter Process Communication
No ratings yet
OS Lecture3 - Inter Process Communication
43 pages
Salaryconditional
No ratings yet
Salaryconditional
1 page
Evolutionary Programming
No ratings yet
Evolutionary Programming
19 pages
Parallela Cluster by Michael Johan Kruger
No ratings yet
Parallela Cluster by Michael Johan Kruger
56 pages
Convolutional Neural Network
100% (1)
Convolutional Neural Network
3 pages
ANN - Ch2-Adaline and Madaline
100% (1)
ANN - Ch2-Adaline and Madaline
29 pages
Computer Architecture
No ratings yet
Computer Architecture
12 pages
Bayesian Learning Methods
No ratings yet
Bayesian Learning Methods
57 pages
Chapter 3 - Solving Problems by Searching
No ratings yet
Chapter 3 - Solving Problems by Searching
71 pages
Genetic Algorithm
No ratings yet
Genetic Algorithm
29 pages
A Course in Advanced Signal Processing
No ratings yet
A Course in Advanced Signal Processing
16 pages
R Visualizations: Derive Meaning from Data 1st Edition David Gerbing - The latest ebook edition with all chapters is now available
100% (3)
R Visualizations: Derive Meaning from Data 1st Edition David Gerbing - The latest ebook edition with all chapters is now available
65 pages
Non Monotonic Reasoning
No ratings yet
Non Monotonic Reasoning
45 pages
Bandits
No ratings yet
Bandits
2 pages
Data Mining - Classification
No ratings yet
Data Mining - Classification
53 pages
DCT Report
No ratings yet
DCT Report
24 pages
An Introduction To PyCUDA Using Prefix Sum Algorithm PDF
No ratings yet
An Introduction To PyCUDA Using Prefix Sum Algorithm PDF
6 pages
Automatic Fault Detection System Using PLC
No ratings yet
Automatic Fault Detection System Using PLC
26 pages
The Rabin-Karp Algorithm: String Matching
No ratings yet
The Rabin-Karp Algorithm: String Matching
18 pages
Jupyter Installation
100% (1)
Jupyter Installation
19 pages
Full download Modern Statistics with R From Wrangling and Exploring Data to Inference and Predictive Modelling Second Edition Måns Thulin pdf docx
100% (2)
Full download Modern Statistics with R From Wrangling and Exploring Data to Inference and Predictive Modelling Second Edition Måns Thulin pdf docx
71 pages
Jerasure: A Library in C Facilitating Erasure Coding For Storage Applications
No ratings yet
Jerasure: A Library in C Facilitating Erasure Coding For Storage Applications
37 pages
Online Machine Learning Algorithms For Currency Exchange Prediction
No ratings yet
Online Machine Learning Algorithms For Currency Exchange Prediction
84 pages
API Reference - Scikit-Learn 0.19.2 Documentation
No ratings yet
API Reference - Scikit-Learn 0.19.2 Documentation
21 pages
Cs433 Fa12 Hw4 Sol Correct
No ratings yet
Cs433 Fa12 Hw4 Sol Correct
14 pages
Operations Research
25% (4)
Operations Research
2 pages
Flask Restplus
No ratings yet
Flask Restplus
86 pages
ML Lab Observation
100% (1)
ML Lab Observation
44 pages
ML - Chapter 6 - Model Evaluation
No ratings yet
ML - Chapter 6 - Model Evaluation
65 pages
Multi Gpu Programming With Mpi
No ratings yet
Multi Gpu Programming With Mpi
93 pages
Q1) Classify The Types of Operating Systems in Block Diagram Types of Operating System
No ratings yet
Q1) Classify The Types of Operating Systems in Block Diagram Types of Operating System
4 pages
UNIT 5 RISC Architecture
No ratings yet
UNIT 5 RISC Architecture
16 pages
Evaluation Metrics in Machine Learning
No ratings yet
Evaluation Metrics in Machine Learning
14 pages
Fundamentals of Multi Agent Systems
No ratings yet
Fundamentals of Multi Agent Systems
155 pages
Soft Computing Lab Manual
No ratings yet
Soft Computing Lab Manual
24 pages
Minimum Vertex Cover Problem
No ratings yet
Minimum Vertex Cover Problem
2 pages
Communication Operations
No ratings yet
Communication Operations
70 pages
Advanced Unix Programming
From Everand
Advanced Unix Programming
Prof. N. B Venkateswarlu
No ratings yet
Java servlet Second Edition
From Everand
Java servlet Second Edition
Gerardus Blokdyk
No ratings yet
Gianna Alice Modular Origami PDF
No ratings yet
Gianna Alice Modular Origami PDF
80 pages
82-Maine Lobster-Robert Lang PDF
100% (2)
82-Maine Lobster-Robert Lang PDF
18 pages
Gas Pump
No ratings yet
Gas Pump
4 pages
Yoshihide Momotani - Origami Ships
100% (3)
Yoshihide Momotani - Origami Ships
60 pages
Nuclear Crane
No ratings yet
Nuclear Crane
2 pages
Marc Vigo Anglada - Trays PDF
No ratings yet
Marc Vigo Anglada - Trays PDF
3 pages
Akira Yoshizawa Panda PDF
No ratings yet
Akira Yoshizawa Panda PDF
3 pages
Duy Nguyen Origami Holidays
0% (1)
Duy Nguyen Origami Holidays
49 pages
Spine Dragon: Daniel Brown
75% (4)
Spine Dragon: Daniel Brown
10 pages
Origami Tree (Audrey Ermakov)
100% (1)
Origami Tree (Audrey Ermakov)
6 pages
Origami Minion (GT-Liu)
No ratings yet
Origami Minion (GT-Liu)
7 pages
African Elephant
50% (2)
African Elephant
16 pages
Panther - Kunsulu Jilkishiyeva
No ratings yet
Panther - Kunsulu Jilkishiyeva
6 pages

Multithreading in Python

Uploaded by

Multithreading in Python

Uploaded by

Multithreading in Python

Let us first understand the concept of thread in computer architecture.

A thread contains all this information in a Thread Control Block (TCB):

 Thread Identifier: Unique id (TID) is

Consider the diagram to understand the

Multithreading is defined as the ability of a processor to execute multiple threads concurrently.

Let us consider a simple example using threading module:

1 # importing the threading module

Let us try to understand the above code:

 To import the threading module, we do:

In above example, we created 2 threads with different target functions:

As a result, the current program will first wait for the

Consider the diagram for a better understanding of how above

Let us try to understand the above code:

 We use os.getpid() function to get ID of current process.

The diagram given below clears the above concept:

Synchronization between threads

Consider the program below to understand the concept of race condition:

Hence, we need a tool for proper synchronization between multiple threads.

Lock class provides following methods:

 acquire([blocking]) : To acquire a lock. A lock can be blocking or non-blocking.

Consider the example given below:

Let us try to understand the above code step by step:

 Firstly, a Lock object is created using:

 As number of threads increase, complexity increases.

You might also like