Unit 4 - QueryProcessingandTransactionManagementSystem

This document discusses transaction processing and management in database management systems (DBMS), outlining the concept of transactions, their states, and the ACID properties that ensure reliability. It covers concurrency control methods, including locking protocols and timestamp ordering, as well as deadlock detection and prevention strategies. Additionally, it explains the importance of logging for recovery and the implications of different scheduling types on transaction execution.


Unit 4

Query Processing And Transaction Management

-SUSHMA VANKHEDE
Introduction to Transaction in DBMS
A transaction is a set of logically related operations performed as a single unit of work. For
example, when you transfer money from your bank account to your friend's account, the set of
operations looks like this:
Simple Transaction Example
1. Read your account balance
2. Deduct the amount from your balance
3. Write the remaining balance to your account
4. Read your friend’s account balance
5. Add the amount to his account balance
6. Write the new updated balance to his account
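Example
In DBMS notation, the above six-step transaction is written with read (R) and write (W)
operations. If your account is A and your friend's account is B, and you are transferring 10000
from A to B, the steps of the transaction are:
1. R(A);
2. A = A - 10000;
3. W(A);
4. R(B);
5. B = B + 10000;
6. W(B);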
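What are the problems associated with transactions?
A transaction can fail partway through its operations because of a power failure, system crash, etc.
This is a serious problem because it can leave the database in an inconsistent state: for example,
the amount has been deducted from A but has not yet been added to B.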

To solve this problem, we have the following two operations

Commit: If all the operations in a transaction complete successfully, then commit those
changes to the database permanently.
Rollback: If any of the operations fails, then roll back all the changes made by the previous operations.
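
Commit and rollback can be tried out with Python's built-in sqlite3 module. The following is only a minimal sketch: the table name `account`, the helper names `transfer` and `balances`, the starting balances and the missing account "Z" are all assumptions made for illustration.

```python
import sqlite3

def transfer(conn, src, dst, amount):
    """Move `amount` from src to dst as one transaction; return True on commit."""
    try:
        cur = conn.cursor()
        # Debit the source account (this implicitly opens a transaction).
        cur.execute("UPDATE account SET balance = balance - ? WHERE name = ?",
                    (amount, src))
        # Credit the destination account.
        cur.execute("UPDATE account SET balance = balance + ? WHERE name = ?",
                    (amount, dst))
        if cur.rowcount != 1:
            raise LookupError("destination account not found")
        conn.commit()        # both changes become permanent together
        return True
    except Exception:
        conn.rollback()      # undo the debit made earlier in this transaction
        return False

def balances(conn):
    return dict(conn.execute("SELECT name, balance FROM account"))

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE account (name TEXT PRIMARY KEY, balance INTEGER)")
conn.executemany("INSERT INTO account VALUES (?, ?)", [("A", 15000), ("B", 2000)])
conn.commit()

ok = transfer(conn, "A", "B", 10000)      # succeeds and commits
failed = transfer(conn, "A", "Z", 1000)   # "Z" does not exist: rolled back
print(ok, failed, balances(conn))
```

In the second call the debit from A has already been executed when the failure is noticed, so the rollback is what restores A's balance.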

But they are not sufficient in concurrent execution. To handle those problems we need to
understand database ACID properties.
ACID Properties

Atomicity: This property ensures that either all the operations of a transaction are reflected in
the database or none are.

Consistency: A transaction executed on its own must take the database from one consistent state
to another, so that the integrity constraints (for example, the total of A and B in a transfer) hold
before and after it. Executing each transaction in isolation, with no other transaction running
concurrently, is the simplest way to preserve consistency.

Isolation: Even when transactions run concurrently, each one must appear to execute alone. For
every pair of transactions Ti and Tj, it must appear to Ti that Tj either finished execution before Ti
started or started execution after Ti finished.

Durability: Once a transaction completes successfully, the changes it has made to the
database should be permanent even if there is a system failure. The recovery-management
component of the database system ensures the durability of transactions.
Transaction States
Active: The initial state. The transaction stays in this state while it is executing.
Partially Committed: The transaction has executed its final operation and has reached its
COMMIT POINT, but the values generated during the execution are still held in
volatile storage.
Failed: The transaction can no longer proceed normally for some reason. The temporary values are
no longer required, and the transaction is set to ROLLBACK. It means that any change made to the
database by this transaction up to the point of the failure must be undone. If the failed transaction has
withdrawn Rs. 100/- from account A, then the ROLLBACK operation should add Rs 100/- to
account A.
Aborted: When the ROLLBACK operation is over, the database is back at the BFIM (before image,
the state it was in before the transaction started). The transaction is now said to have been aborted.
Committed: If no failure occurs then the transaction passes the COMMIT POINT. All the
temporary values are written to stable storage and the transaction is said to have been
committed.
Terminated: Either committed or aborted, the transaction finally reaches this state.
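
The states above can be sketched as a small transition table. The state diagram itself did not survive in these notes, so the allowed transitions below follow the usual textbook diagram and should be read as an assumption; the names `State`, `ALLOWED` and `can_move` are illustrative only.

```python
from enum import Enum

class State(Enum):
    ACTIVE = "active"
    PARTIALLY_COMMITTED = "partially committed"
    FAILED = "failed"
    ABORTED = "aborted"
    COMMITTED = "committed"
    TERMINATED = "terminated"

# Assumed transitions, following the usual textbook state diagram.
ALLOWED = {
    State.ACTIVE: {State.PARTIALLY_COMMITTED, State.FAILED},
    State.PARTIALLY_COMMITTED: {State.COMMITTED, State.FAILED},
    State.FAILED: {State.ABORTED},
    State.ABORTED: {State.TERMINATED},
    State.COMMITTED: {State.TERMINATED},
    State.TERMINATED: set(),
}

def can_move(src, dst):
    """True if a transaction in state `src` may move directly to `dst`."""
    return dst in ALLOWED[src]

print(can_move(State.ACTIVE, State.FAILED))
print(can_move(State.COMMITTED, State.ABORTED))
```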
Concurrent Execution
Concurrent execution means running several transactions side by side (interleaved or in parallel).
Advantages of concurrent execution are:
Improved throughput and resource utilization: the number of transactions executed in a
given amount of time increases, and the processor is utilized properly.
Reduced waiting time: the unpredictable delays in running transactions, as well as the average
response time, are reduced.
Schedule
A schedule is an ordering of the operations of a set of transactions that are executed together as a
unit; the operations of each transaction keep their original order. Depending upon how these
transactions are arranged within a schedule, a schedule can be of two types:
Serial: The transactions are executed one after another, in a non-preemptive manner.
Concurrent: The operations of the transactions are interleaved in a preemptive, time-shared manner.
T1 is a transaction on two accounts A and B, each containing Rs 1000/-. It transfers Rs 100/-
from account A to account B.
T2 is a transaction which deposits to account C 10% of the amount in account A.
If we prepare a serial schedule, then either T1 will completely finish before T2 can begin, or T2
will completely finish before T1 can begin.
However, if we want to create a concurrent schedule, then some context switching needs to be
done, so that some portion of T1 will be executed, then some portion of T2 will be executed and
so on.
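
The two possible serial schedules of T1 and T2 can be worked through in a few lines. This is a sketch under two assumptions that the notes do not state: account C starts at 0, and T2 only reads A without debiting it.

```python
def t1(db):
    # T1: transfer Rs 100 from A to B
    db["A"] -= 100
    db["B"] += 100

def t2(db):
    # T2: deposit 10% of A into C (assumed: A itself is not debited)
    db["C"] += db["A"] * 10 // 100

def run_serial(order):
    db = {"A": 1000, "B": 1000, "C": 0}   # C = 0 is an assumed starting value
    for txn in order:
        txn(db)                            # each transaction runs to completion
    return db

t1_then_t2 = run_serial([t1, t2])   # T2 sees A = 900, so C receives 90
t2_then_t1 = run_serial([t2, t1])   # T2 sees A = 1000, so C receives 100
print(t1_then_t2, t2_then_t1)
```

Both outcomes are correct results. A concurrent schedule of T1 and T2 is acceptable if it ends in the same state as one of these two serial schedules.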
Concurrent Schedule
Serializability
To create error-free concurrent schedules we must follow some well-formed rules for arranging the
instructions of the transactions.

When several concurrent transactions try to access the same data item, the instructions
within these transactions must be ordered so that there is no problem
in accessing and releasing the shared data item. A concurrent schedule is serializable if its effect
is the same as that of some serial schedule of the same transactions.
There are two aspects of serializability:
1. Conflict Serializability
2. View Serializability
Conflict Serializability
Two instructions of two different transactions may want to access the same data item in order to
perform a read/write operation.
Conflict Serializability deals with detecting whether the instructions are conflicting in any way,
and specifying the order in which these two instructions will be executed in case there is any
conflict.
Two instructions conflict if they belong to different transactions, access the same data item, and
at least one of them is a write operation. A schedule is conflict serializable if it can be turned into
a serial schedule by swapping adjacent non-conflicting instructions. The following
rules are important in Conflict Serializability:
1. If two instructions of the two concurrent transactions are both read operations, then they
are not in conflict, and can be allowed to take place in any order.
2. If one of the instructions wants to perform a read operation and the other instruction wants
to perform a write operation, then they are in conflict, hence their ordering is important. If the
read instruction is performed first, then it reads the old value of the data item and after the
reading is over, the new value of the data item is written. If the write instruction is performed
first, then it updates the data item with the new value and the read instruction reads the newly
updated value.
3. If both instructions are write operations, then they are in conflict and their ordering is
important, even though neither transaction reads the value written by the other. The value that
persists in the data item after the schedule is over is the one written by the instruction that
performed the last write, and this is the value that any later read will see.
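
One common way to apply these rules is to build a precedence graph and check it for cycles: the schedule is conflict serializable exactly when the graph has no cycle. The sketch below assumes a schedule is given as a list of (transaction, operation, data item) tuples; the function names and the two sample schedules are illustrative, not taken from the slides.

```python
from itertools import combinations

def precedence_edges(schedule):
    """Edge (Ti, Tj): an operation of Ti conflicts with a later one of Tj."""
    edges = set()
    for (t1, op1, x1), (t2, op2, x2) in combinations(schedule, 2):
        # Conflict: different transactions, same item, at least one write.
        if t1 != t2 and x1 == x2 and "W" in (op1, op2):
            edges.add((t1, t2))
    return edges

def has_cycle(nodes, edges):
    """Repeatedly remove nodes without incoming edges; leftovers form a cycle."""
    remaining, edges = set(nodes), set(edges)
    while remaining:
        free = {n for n in remaining if not any(v == n for _, v in edges)}
        if not free:
            return True
        remaining -= free
        edges = {(u, v) for u, v in edges if u not in free}
    return False

def is_conflict_serializable(schedule):
    nodes = {txn for txn, _, _ in schedule}
    return not has_cycle(nodes, precedence_edges(schedule))

ok_schedule = [("T1", "R", "A"), ("T1", "W", "A"),
               ("T2", "R", "A"), ("T2", "W", "A")]
bad_schedule = [("T1", "R", "A"), ("T2", "W", "A"), ("T1", "W", "A")]
print(is_conflict_serializable(ok_schedule))
print(is_conflict_serializable(bad_schedule))
```

In `bad_schedule` the read by T1 precedes the write by T2 (edge T1 to T2), and the write by T2 precedes the write by T1 (edge T2 to T1), which gives a cycle.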
View Serializability
This is another type of serializability, defined by comparing a schedule with another schedule
that involves the same set of transactions. Two schedules are called View Equivalent if the
following rules hold for every data item, and a schedule is View Serializable if it is view
equivalent to some serial schedule.
Let us consider that the transactions T1 and T2 are arranged into two different
schedules S1 and S2 which we want to be View Equivalent, and that both T1 and T2 want to access
the same data item.
1. If in S1, T1 reads the initial value of the data item, then in S2 also, T1 should read the initial
value of that same data item.
2. If in S1, T1 writes a value in the data item which is read by T2, then in S2 also, T2 should read
the value written by T1.
3. If in S1, T1 performs the final write operation on that data item, then in S2 also, T1 should
perform the final write operation on that data item. Apart from these three rules, any reordering
is allowed while creating S2 from S1.
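
The three rules can be checked mechanically by recording, for each read, which transaction wrote the value that was read, and which transaction wrote each item last. This is a simplified sketch: schedules are assumed to be lists of (transaction, operation, item) tuples, repeated reads are compared as a multiset, and the third transaction T3 is added only to make the example interesting.

```python
def view_profile(schedule):
    """Return (reads_from, final_writer) for a schedule."""
    last_writer = {}    # item -> transaction that wrote it most recently
    reads_from = []     # (reader, item, writer); writer None = initial value
    for txn, op, item in schedule:
        if op == "R":
            reads_from.append((txn, item, last_writer.get(item)))
        else:
            last_writer[item] = txn
    # Sort by repr so that None and transaction names can be ordered together.
    return sorted(reads_from, key=repr), last_writer

def view_equivalent(s1, s2):
    same_operations = sorted(s1) == sorted(s2)
    return same_operations and view_profile(s1) == view_profile(s2)

s = [("T1", "R", "A"), ("T2", "W", "A"), ("T1", "W", "A"), ("T3", "W", "A")]
serial_123 = [("T1", "R", "A"), ("T1", "W", "A"),
              ("T2", "W", "A"), ("T3", "W", "A")]
serial_213 = [("T2", "W", "A"), ("T1", "R", "A"),
              ("T1", "W", "A"), ("T3", "W", "A")]
print(view_equivalent(s, serial_123))   # T1 reads the initial A, T3 writes last
print(view_equivalent(s, serial_213))   # here T1 would read A from T2 instead
```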
Concurrency-control Schemes
Concurrency-control schemes are also used to ensure serializability. All these schemes either
delay an operation or abort the transaction that issued the operation.
Most commonly used Concurrency-control schemes are:
locking protocols
timestamp based protocols
Lock based protocols
A locking protocol is a set of rules that state when a transaction may lock and unlock each of the
data items in the database.
Two-phase locking protocol: this protocol allows a transaction to lock a new data item only if
that transaction has not yet unlocked any data item. This protocol ensures serializability, but not
deadlock freedom.
Strict two-phase locking protocol: It permits release of exclusive locks only at the end of
transactions, in order to ensure recoverability and cascadelessness of the resulting schedules.
Rigorous two-phase locking protocol: This protocol releases all locks only at the end of the
transaction.
Lock-Based Protocol
In this type of protocol, a transaction cannot read or write a data item until it acquires an
appropriate lock on it. There are two types of lock:
1. Shared lock:
It is also known as a read-only lock. Under a shared lock, the data item can only be read by the
transaction. It can be shared between transactions because a transaction that holds only a shared
lock cannot update the data item.
2. Exclusive lock:
Under an exclusive lock, the data item can be both read and written by the transaction.
This lock is exclusive: while one transaction holds it, no other transaction can hold any lock on the
same data item, so multiple transactions cannot modify the same data simultaneously.
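
The two lock types lead to a simple compatibility rule: a requested lock is granted only if it is compatible with every lock other transactions already hold on the item. A minimal sketch, with the names `COMPATIBLE` and `can_grant` chosen here for illustration:

```python
# (lock already held, lock requested) -> may both be held at the same time?
COMPATIBLE = {
    ("S", "S"): True,    # several readers may share the item
    ("S", "X"): False,   # a writer must wait for the readers
    ("X", "S"): False,   # a reader must wait for the writer
    ("X", "X"): False,   # only one writer at a time
}

def can_grant(requested, held_by_others):
    """True if `requested` is compatible with every lock held by others."""
    return all(COMPATIBLE[(held, requested)] for held in held_by_others)

print(can_grant("S", ["S", "S"]))
print(can_grant("X", ["S"]))
print(can_grant("X", []))
```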
Two-phase locking (2PL)
The execution of a transaction under the two-phase locking protocol can be viewed in three parts.
In the first part, when the execution of the transaction starts, it requests the locks it
requires.
In the second part, the transaction holds all the locks it has acquired. The third part starts as
soon as the transaction releases its first lock.
In the third part, the transaction cannot request any new locks. It only releases the acquired
locks.
Two phases of 2PL:
Growing phase: In the growing phase, a new lock on a data item may be acquired by the
transaction, but none can be released.
Shrinking phase: In the shrinking phase, existing locks held by the transaction may be released,
but no new locks can be acquired.
If lock conversion is allowed, then the following rules apply:
Upgrading of a lock (from S(a) to X(a)) is allowed only in the growing phase.
Downgrading of a lock (from X(a) to S(a)) must be done in the shrinking phase.
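
Whether the lock actions of a single transaction respect the two phases can be checked with one pass over them. In this sketch the action names ("lock-S", "lock-X", "upgrade", "unlock", "downgrade") and the function name are assumptions made for illustration.

```python
GROWING_ACTIONS = {"lock-S", "lock-X", "upgrade"}
SHRINKING_ACTIONS = {"unlock", "downgrade"}

def follows_2pl(actions):
    """actions: list of (action, item) issued by ONE transaction, in order."""
    shrinking = False
    for action, _item in actions:
        if action in SHRINKING_ACTIONS:
            shrinking = True          # the first release starts the shrinking phase
        elif action in GROWING_ACTIONS and shrinking:
            return False              # acquiring or upgrading after a release
    return True

good = [("lock-S", "a"), ("lock-X", "b"), ("upgrade", "a"),
        ("unlock", "b"), ("downgrade", "a"), ("unlock", "a")]
bad = [("lock-S", "a"), ("unlock", "a"), ("lock-X", "b"), ("unlock", "b")]
print(follows_2pl(good), follows_2pl(bad))
```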
Strict Two-phase locking (Strict-2PL)
The first phase of Strict-2PL is the same as in 2PL: the transaction acquires the locks it needs and
continues to execute normally.
The only difference between 2PL and Strict-2PL is that Strict-2PL does not release an exclusive lock
as soon as it has finished using the data item.
Strict-2PL waits until the whole transaction commits or aborts, and then releases its exclusive locks.
When shared locks are also held until then and all locks are released at once, the protocol is the
rigorous variant.
Strict-2PL therefore has no gradual shrinking phase for exclusive locks.
Timestamp Ordering Protocol
The Timestamp Ordering Protocol orders transactions based on their timestamps.
The order of the transactions is simply the ascending order of their creation times.
The older transaction has the higher priority, which is why it executes first. To determine the
timestamp of a transaction, this protocol uses the system time or a logical counter.
Lock-based protocols manage the order between conflicting pairs of operations at execution
time, whereas timestamp-based protocols start working as soon as a transaction is created.
Let's assume there are two transactions T1 and T2. Suppose T1 entered the system at time 007
and T2 entered the system at time 009. T1 has the higher priority, so it executes first because it
entered the system first.
The timestamp ordering protocol also maintains the timestamps of the last 'read' and 'write'
operations on each data item.
Working of Timestamp ordering protocol
1. Check the following condition whenever a transaction Ti issues a Read(X) operation:
If W_TS(X) > TS(Ti), then the operation is rejected and Ti is rolled back.
If W_TS(X) <= TS(Ti), then the operation is executed and R_TS(X) is set to the larger of R_TS(X)
and TS(Ti).
2. Check the following condition whenever a transaction Ti issues a Write(X) operation:
If TS(Ti) < R_TS(X), then the operation is rejected and Ti is rolled back.
If TS(Ti) < W_TS(X), then the operation is rejected and Ti is rolled back.
Otherwise the operation is executed and W_TS(X) is set to TS(Ti).
Where,
TS(Ti) denotes the timestamp of the transaction Ti.
R_TS(X) denotes the Read time-stamp of data-item X.
W_TS(X) denotes the Write time-stamp of data-item X.
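
The read and write checks can be sketched as two small functions that either perform the operation or reject it. The class name `Item`, the function names and the initial timestamps of 0 are assumptions; the timestamps 7 and 9 are those of T1 and T2 in the example above. A rejected operation means the issuing transaction would be rolled back.

```python
class Item:
    """A data item X with its read and write timestamps."""
    def __init__(self):
        self.r_ts = 0   # R_TS(X): largest timestamp that has read X
        self.w_ts = 0   # W_TS(X): timestamp of the last write to X

def read(item, ts):
    """Ti with timestamp `ts` issues Read(X); False means rejected."""
    if item.w_ts > ts:
        return False                    # X was already written by a younger txn
    item.r_ts = max(item.r_ts, ts)
    return True

def write(item, ts):
    """Ti with timestamp `ts` issues Write(X); False means rejected."""
    if ts < item.r_ts or ts < item.w_ts:
        return False                    # a younger txn already read or wrote X
    item.w_ts = ts
    return True

x = Item()
results = [
    read(x, 9),    # T2 reads X: allowed, R_TS(X) becomes 9
    write(x, 7),   # T1 writes X: rejected, because 7 < R_TS(X)
    write(x, 9),   # T2 writes X: allowed, W_TS(X) becomes 9
    read(x, 7),    # T1 reads X: rejected, because W_TS(X) > 7
]
print(results)
```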
Precedence graph for TS ordering
Advantages and Disadvantages of TO protocol
The TO protocol ensures serializability: every edge in the precedence graph goes from an older
transaction to a younger one, so the graph has no cycle.
The TO protocol ensures freedom from deadlock, because no transaction ever waits.
But the schedule may not be recoverable and may not even be cascade-free.
Intent Locks
An intent lock is taken when SQL Server wants to acquire a shared (S) lock or an exclusive
(X) lock on some resource lower in the lock hierarchy. In practice, when SQL Server
acquires a lock on a page or row, an intent lock is first set on the table.
Recoverability of Schedule
Sometimes a transaction may not execute completely due to a software issue, system crash or
hardware failure. In that case, the failed transaction has to be rolled back. But some other
transaction may also have used a value produced by the failed transaction, so we also have to
roll back those transactions.
For this to be possible, transactions must commit in order: a transaction that has read a value
written by another transaction must not commit before that transaction does.
Irrecoverable Schedule:
The schedule will be irrecoverable if Tj reads the updated value of Ti and Tj commits before Ti
commits.
Recoverable with cascading rollback:
The schedule will be recoverable with cascading rollback if Tj reads the updated value of Ti before
Ti commits, and the commit of Tj is delayed till the commit of Ti. If Ti fails, Tj must be rolled back
as well.
Cascadeless recoverable schedule:
If transaction T1 reads and writes A and commits, and only then is that value read and written by
T2, the schedule is a cascadeless recoverable schedule.
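
The three cases can be told apart by tracking which transaction each read takes its value from and the order of the commits. This is a simplified sketch that ignores aborts: schedules are assumed to be lists of (transaction, operation, item) tuples with "C" standing for commit, and the function name `classify` is illustrative.

```python
def classify(schedule):
    last_writer, committed = {}, set()
    reads_from = set()     # (reader, writer) pairs
    dirty_read = False     # some value was read before its writer committed
    for txn, op, item in schedule:
        if op == "W":
            last_writer[item] = txn
        elif op == "R":
            writer = last_writer.get(item)
            if writer is not None and writer != txn:
                reads_from.add((txn, writer))
                if writer not in committed:
                    dirty_read = True
        elif op == "C":
            # Committing before a transaction we read from has committed.
            if any(r == txn and w not in committed for r, w in reads_from):
                return "irrecoverable"
            committed.add(txn)
    if dirty_read:
        return "recoverable with cascading rollback"
    return "cascadeless recoverable"

irrecoverable = [("T1", "W", "A"), ("T2", "R", "A"),
                 ("T2", "C", None), ("T1", "C", None)]
cascading = [("T1", "W", "A"), ("T2", "R", "A"),
             ("T1", "C", None), ("T2", "C", None)]
cascadeless = [("T1", "R", "A"), ("T1", "W", "A"), ("T1", "C", None),
               ("T2", "R", "A"), ("T2", "W", "A"), ("T2", "C", None)]
for s in (irrecoverable, cascading, cascadeless):
    print(classify(s))
```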
Log-Based Recovery
The log is a sequence of records. The log of each transaction is maintained in stable storage so
that if any failure occurs, the database can be recovered from it.
Every operation performed on the database is recorded in the log.
But the log record must be written to stable storage before the actual change is applied to the
database.
Example: A transaction modifies the City of a student. The following log records are written for this
transaction.
When the transaction is initiated, it writes a 'start' log record.
<Tn, Start>
When the transaction modifies the City from 'Noida' to 'Bangalore', another log record is written
to the file.
<Tn, City, 'Noida', 'Bangalore'>
When the transaction is finished, it writes another log record to indicate the end of the
transaction.
<Tn, Commit>
Two approaches to modifying the database:
1. Deferred database modification:
In the deferred modification technique, the transaction does not modify the database
until it has committed.
In this method, all the log records are created and stored in stable storage, and the database is
updated when the transaction commits.
2. Immediate database modification:
In the immediate modification technique, database modification occurs while the
transaction is still active.
In this technique, the database is modified immediately after every operation, once the
corresponding log record has been written.
Recovery using Log records
When the system crashes, the system consults the log to find which transactions need to
be undone and which need to be redone.
If the log contains both the record <Ti, Start> and the record <Ti, Commit>, then the transaction
Ti needs to be redone.
If the log contains the record <Ti, Start> but contains neither <Ti, Commit> nor <Ti,
Abort>, then the transaction Ti needs to be undone.
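
The redo and undo rule can be sketched as one scan over the log. Log records are modelled here as tuples such as ("T1", "Start") or ("T1", "City", "Noida", "Bangalore"); the function name `plan_recovery`, the transaction names and the second update record are hypothetical.

```python
def plan_recovery(log):
    """Return (redo, undo): transactions to redo and to undo after a crash."""
    started, committed, aborted = [], set(), set()
    for record in log:
        txn, kind = record[0], record[1].lower()
        if kind == "start":
            started.append(txn)
        elif kind == "commit":
            committed.add(txn)
        elif kind == "abort":
            aborted.add(txn)
        # any other record is an update: (txn, field, old value, new value)
    redo = [t for t in started if t in committed]
    undo = [t for t in started if t not in committed and t not in aborted]
    return redo, undo

log = [
    ("T1", "Start"),
    ("T1", "City", "Noida", "Bangalore"),
    ("T1", "Commit"),
    ("T2", "Start"),
    ("T2", "City", "Pune", "Delhi"),   # hypothetical update; crash before commit
]
redo, undo = plan_recovery(log)
print(redo, undo)
```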
Deadlock in DBMS
A deadlock is a condition where two or more transactions are waiting indefinitely for one
another to give up locks. Deadlock is one of the most feared complications in a DBMS, because
none of the transactions involved ever finishes; each stays in a waiting state forever.
Deadlock Avoidance
When a database could get stuck in a deadlock, it is better to avoid the deadlock than to abort or
restart transactions afterwards, which wastes time and resources.
A deadlock avoidance mechanism is used to detect a potential deadlock situation in advance. A
method like the "wait-for graph" is used for detecting the deadlock situation, but this method is
suitable only for smaller databases. For larger databases, a deadlock prevention method can be used.
Deadlock Detection
In a database, when a transaction waits indefinitely to obtain a lock, the DBMS should
detect whether the transaction is involved in a deadlock or not. The lock manager maintains a
wait-for graph to detect deadlock cycles in the database.
Wait-for Graph
This is a suitable method for deadlock detection. In this method, a graph is created based on
the transactions and their locks: there is an edge from Ti to Tj if Ti is waiting for a lock held by Tj.
If the graph has a cycle (closed loop), then there is a deadlock.
The wait-for graph is maintained by the system for every transaction which is waiting for
some data held by another. The system keeps checking whether there is any cycle in the
graph.
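
Checking the wait-for graph for a cycle is a standard depth-first search. In this sketch the graph is assumed to be a dictionary that maps each waiting transaction to the set of transactions it waits for; the function name `has_deadlock` is illustrative.

```python
def has_deadlock(wait_for):
    """wait_for: dict txn -> set of txns it is waiting for. True if a cycle exists."""
    WHITE, GREY, BLACK = 0, 1, 2     # unvisited, on current path, finished
    color = {}

    def visit(txn):
        color[txn] = GREY
        for other in wait_for.get(txn, ()):
            state = color.get(other, WHITE)
            if state == GREY:                     # back edge: cycle found
                return True
            if state == WHITE and visit(other):
                return True
        color[txn] = BLACK
        return False

    return any(color.get(t, WHITE) == WHITE and visit(t) for t in list(wait_for))

print(has_deadlock({"T1": {"T2"}, "T2": {"T1"}}))   # T1 and T2 wait for each other
print(has_deadlock({"T1": {"T2"}, "T2": {"T3"}}))   # a chain, no cycle
```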
Deadlock Prevention
A deadlock prevention method is suitable for a large database. If the resources are allocated in
such a way that a deadlock never occurs, then deadlock is prevented.
The database management system analyzes the operations of a transaction to see whether they can
create a deadlock situation. If they can, the DBMS does not allow that transaction to
be executed.
Wait-Die scheme
In this scheme, if a transaction requests a resource which is already held with a conflicting
lock by another transaction, then the DBMS simply compares the timestamps of both transactions. It
allows only the older transaction to wait until the resource is available.
Let's assume there are two transactions Ti and Tj, and let TS(T) be the timestamp of any transaction
T. If Tj holds a lock on a resource and Ti requests that resource, then
the following actions are performed by the DBMS:
Check if TS(Ti) < TS(Tj) - If so, Ti is the older transaction and Tj holds the resource, so Ti is
allowed to wait until the data item is available. That means if the older transaction
is waiting for a resource which is locked by the younger transaction, then the older transaction is
allowed to wait for the resource until it is available.
Check if TS(Ti) > TS(Tj) - If so, Ti is the younger transaction and the older transaction Tj holds
the resource, so Ti is killed ("dies") and restarted later after a random delay but with the same
timestamp.
Wound wait scheme
In the wound-wait scheme, if the older transaction requests a resource which is held by the
younger transaction, then the older transaction forces the younger one to abort ("wounds" it) and
release the resource. After a short delay, the younger transaction is restarted, but with the
same timestamp.
If the older transaction holds a resource which is requested by the younger transaction, then
the younger transaction is asked to wait until the older one releases it.
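
Both schemes reduce to one timestamp comparison between the requesting transaction and the transaction holding the lock, where a smaller timestamp means an older transaction. A minimal sketch, with hypothetical function names and return strings:

```python
def wait_die(ts_requester, ts_holder):
    """Wait-die: an older requester waits, a younger requester dies."""
    return "wait" if ts_requester < ts_holder else "die"

def wound_wait(ts_requester, ts_holder):
    """Wound-wait: an older requester wounds the holder, a younger one waits."""
    return "wound holder" if ts_requester < ts_holder else "wait"

# Older transaction (timestamp 7) and younger transaction (timestamp 9).
print(wait_die(7, 9), "|", wait_die(9, 7))
print(wound_wait(7, 9), "|", wound_wait(9, 7))
```

In both schemes it is always the younger transaction that is rolled back, and it keeps its original timestamp when restarted, so it eventually becomes the oldest transaction and cannot starve.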
END
Query Processing Optimization
Query processing includes the translation of high-level queries into low-level expressions that can
be used at the physical level of the file system, query optimization, and the actual execution of the
query to get the result.
Basic Steps in Query Processing
1. Parsing and translation
2. Optimization
3. Evaluation
Parsing and translation
Translate the query into its internal form. This is then translated into relational algebra.
The parser checks the syntax and verifies that the relations named in the query exist.
Optimization
Finding the cheapest evaluation plan for a query among the equivalent plans.
Evaluation
The query-execution engine takes a query-evaluation plan, executes that plan, and returns the
answers to the query.
END
