Chapter 2 Query processing and optimization [Autosaved]

software

Uploaded by

amentiabraham674

Advanced Database Systems

Chapter 2: Query Processing and Optimization

Overview of Query Processing

What is query processing?
• The activities involved in parsing, validating, optimizing, and executing a query.
• The aim of query processing is to transform a query written in a high-level language (typically SQL) into a correct and efficient execution strategy expressed in a low-level language (implementing the relational algebra), and to execute that strategy to retrieve the required data.

What is query optimization?
• The activity of choosing an efficient execution strategy for processing a query.
• It is an important aspect of query processing.
• Many equivalent transformations of the same high-level query usually exist; the aim of query optimization is to choose the one that minimizes resource usage.
• Generally, we try to reduce the total execution time of the query, which is the sum of the execution times of all the individual operations that make up the query.
Query optimization: Example

Comparison of different processing strategies

Find all Managers who work at a London branch.

We can write this query in SQL as:

SELECT *
FROM Staff s, Branch b
WHERE s.branchNo = b.branchNo AND
      (s.position = 'Manager' AND b.city = 'London');

Query optimization: Example cont’d…

Three equivalent relational algebra queries corresponding to this SQL statement are:

1. σ_{(position='Manager') ∧ (city='London') ∧ (Staff.branchNo=Branch.branchNo)}(Staff × Branch)
2. σ_{(position='Manager') ∧ (city='London')}(Staff ⋈_{Staff.branchNo=Branch.branchNo} Branch)
3. (σ_{position='Manager'}(Staff)) ⋈_{Staff.branchNo=Branch.branchNo} (σ_{city='London'}(Branch))

For this particular example, assume that:
• there are 1000 tuples in Staff and 50 tuples in Branch;
• there are 50 Managers (one for each branch) and 5 London branches;
• there are no indexes or sort keys on either relation;
• tuples are accessed one at a time, and the results of intermediate operations are stored on disk (the cost of writing the final result is ignored).

We compare these three queries based on the number of disk accesses required.

Query optimization: Example cont’d…

The first query calculates the Cartesian product of Staff and Branch:

σ_{(position='Manager') ∧ (city='London') ∧ (Staff.branchNo=Branch.branchNo)}(Staff × Branch)

• It requires (1000 + 50) disk accesses to read the relations.
• It creates an intermediate relation with (1000 × 50) tuples, which costs (1000 × 50) disk accesses to write.
• Each of these tuples must then be read again to test it against the selection predicate, at a cost of another (1000 × 50) disk accesses.

This gives a total cost of:

(1000 + 50) + 2 × (1000 × 50) = 101 050 disk accesses

Example…

The second query joins Staff and Branch on the branch number branchNo:

σ_{(position='Manager') ∧ (city='London')}(Staff ⋈_{Staff.branchNo=Branch.branchNo} Branch)

• It requires (1000 + 50) disk accesses to read each of the relations.
• The join of the two relations has 1000 tuples, one for each member of staff (a member of staff can only work at one branch), so writing the join result costs 1000 disk accesses.
• The Selection operation requires a further 1000 disk accesses to read the result of the join.

This gives a total cost of:

2 × 1000 + (1000 + 50) = 3050 disk accesses

Example…

The final query first reads each Staff tuple to determine the Manager tuples:

(σ_{position='Manager'}(Staff)) ⋈_{Staff.branchNo=Branch.branchNo} (σ_{city='London'}(Branch))

• The first Selection requires 1000 disk accesses to read Staff and produces a relation with 50 tuples (50 disk accesses to write).
• The second Selection reads each Branch tuple to determine the London branches, which requires 50 disk accesses and produces a relation with 5 tuples (5 disk accesses to write).
• The final operation is the join of the reduced Staff and Branch relations, which requires (50 + 5) disk accesses to read them.

This gives a total cost of:

1000 + 2 × 50 + 5 + (50 + 5) = 1160 disk accesses

Clearly the third option is the best in this case: it is cheaper than the first by a factor of about 87:1. If we increase the number of tuples in Staff to 10 000 and the number of branches to 500, the factor is approximately 870:1.
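The three cost formulas can be checked with a short calculation. The sketch below is only an illustration of the arithmetic on the slides; the function names are hypothetical, and the scaled-up case assumes that the number of London branches stays at 5 while Staff and Branch grow tenfold.

```python
def cost_product_then_select(staff, branch):
    """Strategy 1: Cartesian product followed by a single Selection."""
    read_inputs = staff + branch
    product_size = staff * branch
    # The product is written to disk and then read back for the Selection.
    return read_inputs + 2 * product_size


def cost_join_then_select(staff, branch):
    """Strategy 2: join on branchNo, then apply the Selection."""
    read_inputs = staff + branch
    join_size = staff  # each member of staff works at exactly one branch
    return read_inputs + 2 * join_size


def cost_select_then_join(staff, branch, managers, london):
    """Strategy 3: select Managers and London branches first, then join."""
    select_staff = staff + managers   # read Staff, write the Manager tuples
    select_branch = branch + london   # read Branch, write the London tuples
    join_inputs = managers + london   # read both reduced relations
    return select_staff + select_branch + join_inputs


c1 = cost_product_then_select(1000, 50)
c2 = cost_join_then_select(1000, 50)
c3 = cost_select_then_join(1000, 50, managers=50, london=5)
print(c1, c2, c3)          # 101050 3050 1160
print(round(c1 / c3))      # 87

# Assumption: Staff and Branch grow tenfold (one Manager per branch),
# while the number of London branches stays at 5.
big1 = cost_product_then_select(10_000, 500)
big3 = cost_select_then_join(10_000, 500, managers=500, london=5)
print(round(big1 / big3))  # 870
```

The gap widens with size because the first strategy's cost is dominated by the product term, which grows with staff × branch, whereas the third strategy's cost grows only linearly.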
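[Figure: Phases of query processing]

Dynamic versus static optimization
• Dynamic: decomposition and optimization are carried out every time the query is run. The information used to choose a strategy is always up to date, but the query pays the optimization cost on every execution.
• Static: the query is parsed, validated, and optimized once. The run-time overhead is removed, but the chosen strategy may no longer be optimal by the time the query is run.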

Query Decomposition
• Query decomposition is the first phase of query processing.
• The aims of query decomposition are to transform a high-level query into a relational algebra query, and to check that the query is syntactically and semantically correct.

Stages of query decomposition
1. Analysis
2. Normalization
3. Semantic analysis
4. Simplification
5. Query restructuring

Query Decomposition: stages cont’d…

1. Analysis
• The query is lexically and syntactically analyzed using the techniques of programming language compilers.
• The analysis also verifies that the relations and attributes specified in the query are defined in the system catalog, and that any operations applied to them are appropriate for their types.
• Example: assume we have a Staff table with a staffNo attribute and a position attribute that holds a variable-length character string. The following query is rejected because staffNumber is not defined for Staff, and because the comparison > 10 is incompatible with the data type of position.

SELECT staffNumber
FROM Staff
WHERE position > 10;
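The catalog check described above can be sketched in a few lines. This is a simplified illustration, not how any particular DBMS implements the analysis stage: the CATALOG dictionary and the analyze function are hypothetical, and the query is passed in already split into its parts rather than parsed from SQL text.

```python
# Hypothetical system catalog: relation name -> {attribute name: Python type}.
CATALOG = {"Staff": {"staffNo": str, "position": str}}


def analyze(relation, select_attrs, comparisons):
    """Return a list of error messages; an empty list means the query passes.

    comparisons is a list of (attribute, constant) pairs taken from WHERE.
    """
    attrs = CATALOG.get(relation)
    if attrs is None:
        return [f"relation {relation} is not defined"]
    errors = []
    for name in select_attrs:
        if name not in attrs:
            errors.append(f"attribute {name} is not defined in {relation}")
    for name, constant in comparisons:
        if name not in attrs:
            errors.append(f"attribute {name} is not defined in {relation}")
        elif not isinstance(constant, attrs[name]):
            errors.append(f"{name} cannot be compared with {constant!r}")
    return errors


# SELECT staffNumber FROM Staff WHERE position > 10;
for message in analyze("Staff", ["staffNumber"], [("position", 10)]):
    print(message)
```

Both problems named in the example are reported: the undefined attribute and the type mismatch.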
Query Decomposition: stages cont’d..

2. Normalization
• Converts the query into a normalized form that can be more easily manipulated.
• In SQL, the WHERE condition is converted into one of two forms by applying a few transformation rules.
• Conjunctive normal form: a sequence of conjuncts connected with the ∧ (AND) operator, where each conjunct contains one or more terms connected by the ∨ (OR) operator.
  e.g. (position = 'Manager' ∨ salary > 20000) ∧ branchNo = 'B003'
• Disjunctive normal form: a sequence of disjuncts connected with the ∨ (OR) operator, where each disjunct contains one or more terms connected by the ∧ (AND) operator.
  e.g. (position = 'Manager' ∧ branchNo = 'B003') ∨ (salary > 20000 ∧ branchNo = 'B003')
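The two example predicates above express the same condition, which follows from distributing ∧ over ∨. As a quick check, the sketch below compares them over every combination of truth values; the three Boolean parameter names are stand-ins introduced here for the three atomic comparisons.

```python
from itertools import product


def cnf(is_manager, high_salary, in_b003):
    # (position = 'Manager' OR salary > 20000) AND branchNo = 'B003'
    return (is_manager or high_salary) and in_b003


def dnf(is_manager, high_salary, in_b003):
    # (position = 'Manager' AND branchNo = 'B003')
    #   OR (salary > 20000 AND branchNo = 'B003')
    return (is_manager and in_b003) or (high_salary and in_b003)


# Compare the two forms on all 2**3 truth assignments.
equivalent = all(cnf(*v) == dnf(*v) for v in product([False, True], repeat=3))
print(equivalent)  # True
```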
Query Decomposition: stages cont’d..

3. Semantic analysis
• The objective of semantic analysis is to reject normalized queries that are incorrectly formulated or contradictory.
• A query is incorrectly formulated if some of its components do not contribute to the generation of the result, which may happen if some join specifications are missing.
• A query is contradictory if its predicate cannot be satisfied by any tuple. For example, the predicate (position = 'Manager' ∧ position = 'Assistant') on the Staff relation is contradictory, as a member of staff cannot be both a Manager and an Assistant simultaneously.
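A very restricted form of the contradiction check can be sketched as follows. It handles only a conjunction of equality predicates of the form attribute = constant, which is enough for the example above; the function name and the input representation are assumptions made for illustration, and a real optimizer would need to handle ranges, disjunctions, and joins as well.

```python
def is_contradictory(conjuncts):
    """conjuncts: list of (attribute, constant) equality predicates, ANDed.

    The conjunction is contradictory if one attribute is required to equal
    two different constants at the same time.
    """
    required = {}
    for attribute, constant in conjuncts:
        if attribute in required and required[attribute] != constant:
            return True
        required[attribute] = constant
    return False


print(is_contradictory([("position", "Manager"), ("position", "Assistant")]))  # True
print(is_contradictory([("position", "Manager"), ("branchNo", "B003")]))       # False
```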
Query Decomposition: stages cont’d..

4. Simplification
• The objectives of the simplification stage are to detect redundant qualifications, eliminate common subexpressions, and transform the query to a semantically equivalent but more easily and efficiently computed form.
• For example, the following rules of Boolean algebra can be applied:

p ∧ (p) ≡ p          p ∨ (p) ≡ p
p ∧ false ≡ false    p ∨ false ≡ p
p ∧ true ≡ p         p ∨ true ≡ true
p ∧ (~p) ≡ false     p ∨ (~p) ≡ true
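Each of the eight rules above can be confirmed by trying both truth values of p. The sketch below does exactly that; the dictionary of labelled checks is just a convenient way to list the rules and is not part of any optimizer API.

```python
# One check per rule; each lambda returns True when the rule holds for p.
identities = {
    "p AND p == p": lambda p: (p and p) == p,
    "p OR p == p": lambda p: (p or p) == p,
    "p AND false == false": lambda p: (p and False) == False,
    "p OR false == p": lambda p: (p or False) == p,
    "p AND true == p": lambda p: (p and True) == p,
    "p OR true == true": lambda p: (p or True) == True,
    "p AND NOT p == false": lambda p: (p and not p) == False,
    "p OR NOT p == true": lambda p: (p or not p) == True,
}

# A rule is valid if it holds for both possible values of p.
results = {name: all(check(p) for p in (False, True))
           for name, check in identities.items()}

for name, holds in results.items():
    print(f"{name}: {holds}")
```

Rules such as p ∧ false ≡ false are what let the simplifier discard a whole branch of a predicate once one term is known to be unsatisfiable.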
Query Decomposition: stages cont’d..

5. Query restructuring
• The query is restructured to provide a more efficient implementation.

Heuristical Approach to Query Optimization
• Uses transformation rules to convert one relational algebra expression into an equivalent form that is known to be more efficient.

Transformation Rules for the Relational Algebra Operations
• By applying transformation rules, the optimizer can transform one relational algebra expression into an equivalent expression.
• In listing these rules, we use three relations R, S, and T, with R defined over the attributes A = {A1, A2, . . . , An}, and S defined over B = {B1, B2, . . . , Bn}; p, q, and r denote predicates, and L, L1, L2, M, M1, M2, and N denote sets of attributes.
Heuristical Approach… cont’d…

1. Conjunctive Selection operations can cascade into individual Selection operations (and vice versa).
• This transformation is sometimes referred to as cascade of selection.
• σ_{p ∧ q ∧ r}(R) = σ_p(σ_q(σ_r(R)))

E.g.
• σ_{branchNo='B003' ∧ salary>15000}(Staff) = σ_{branchNo='B003'}(σ_{salary>15000}(Staff))

Heuristical Approach… cont’d…

2. Commutativity of Selection operations
• σ_p(σ_q(R)) = σ_q(σ_p(R))

E.g.
• σ_{branchNo='B003'}(σ_{salary>15000}(Staff)) = σ_{salary>15000}(σ_{branchNo='B003'}(Staff))
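The cascade and commutativity rules for Selection can be tried out on a small in-memory relation. In this sketch a relation is a list of dictionaries and select is a hypothetical helper; the three Staff rows are invented sample data.

```python
# Invented sample rows for the Staff relation.
staff = [
    {"staffNo": "S1", "branchNo": "B003", "salary": 18000},
    {"staffNo": "S2", "branchNo": "B003", "salary": 12000},
    {"staffNo": "S3", "branchNo": "B005", "salary": 30000},
]


def select(predicate, relation):
    """Selection: keep the tuples that satisfy the predicate."""
    return [t for t in relation if predicate(t)]


def in_b003(t):
    return t["branchNo"] == "B003"


def well_paid(t):
    return t["salary"] > 15000


# One conjunctive Selection ...
combined = select(lambda t: in_b003(t) and well_paid(t), staff)
# ... equals a cascade of two Selections (rule 1) ...
cascaded = select(in_b003, select(well_paid, staff))
# ... in either order (rule 2).
swapped = select(well_paid, select(in_b003, staff))

print(combined == cascaded == swapped)    # True
print([t["staffNo"] for t in combined])   # ['S1']
```

Because the order does not affect the result, an optimizer is free to apply the more restrictive predicate first.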

Heuristical Approach… cont’d…

3. In a sequence of Projection operations, only the last one applied (the outermost, Π_L) is required.
• Π_L(Π_M(...(Π_N(R)))) = Π_L(R)

E.g.
• Π_{lName}(Π_{branchNo, lName}(Staff)) = Π_{lName}(Staff)
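The projection rule can be illustrated on a small relation. The project helper and the sample rows below are assumptions made for this sketch; project removes duplicates, as Projection does in the relational algebra.

```python
# Invented sample rows for the Staff relation.
staff = [
    {"staffNo": "S1", "lName": "Beech", "branchNo": "B003"},
    {"staffNo": "S2", "lName": "Beech", "branchNo": "B005"},
    {"staffNo": "S3", "lName": "Ford", "branchNo": "B003"},
]


def project(attributes, relation):
    """Projection: keep the listed attributes and drop duplicate tuples."""
    seen, result = set(), []
    for t in relation:
        key = tuple(t[a] for a in attributes)
        if key not in seen:
            seen.add(key)
            result.append({a: t[a] for a in attributes})
    return result


nested = project(["lName"], project(["branchNo", "lName"], staff))
direct = project(["lName"], staff)

print(nested == direct)  # True
print(direct)            # [{'lName': 'Beech'}, {'lName': 'Ford'}]
```

The inner Projection does no useful work here, so dropping it saves one pass over the data.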

Heuristical Approach… cont’d…

4. Commutativity of Selection and Projection.
• If the predicate p involves only the attributes in the projection list, then the Selection and Projection operations commute:
• Π_{A1, . . . , Am}(σ_p(R)) = σ_p(Π_{A1, . . . , Am}(R)), where p involves only attributes in {A1, A2, . . . , Am}

E.g.
• Π_{fName, lName}(σ_{lName='Beech'}(Staff)) = σ_{lName='Beech'}(Π_{fName, lName}(Staff))
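The example above can be checked on sample data. In this sketch select and project are hypothetical helpers and the Staff rows are invented; the predicate uses only lName, which is in the projection list, so the rule applies.

```python
# Invented sample rows for the Staff relation.
staff = [
    {"fName": "Ann", "lName": "Beech", "branchNo": "B003"},
    {"fName": "Ann", "lName": "Beech", "branchNo": "B005"},
    {"fName": "John", "lName": "White", "branchNo": "B003"},
]


def select(predicate, relation):
    """Selection: keep the tuples that satisfy the predicate."""
    return [t for t in relation if predicate(t)]


def project(attributes, relation):
    """Projection: keep the listed attributes and drop duplicate tuples."""
    seen, result = set(), []
    for t in relation:
        key = tuple(t[a] for a in attributes)
        if key not in seen:
            seen.add(key)
            result.append({a: t[a] for a in attributes})
    return result


def is_beech(t):
    return t["lName"] == "Beech"


select_first = project(["fName", "lName"], select(is_beech, staff))
project_first = select(is_beech, project(["fName", "lName"], staff))

print(select_first == project_first)  # True
print(select_first)                   # [{'fName': 'Ann', 'lName': 'Beech'}]
```

If the predicate referred to branchNo instead, the right-hand side could not be evaluated, because the Projection would already have discarded that attribute.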

Heuristical Approach… cont’d…

5. Commutativity of Theta join (and Cartesian product).
• R ⋈_p S = S ⋈_p R
• R × S = S × R

E.g.
• Staff ⋈_{Staff.branchNo=Branch.branchNo} Branch = Branch ⋈_{Staff.branchNo=Branch.branchNo} Staff
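Commutativity of the join can be illustrated with a nested-loop join over two small relations. The theta_join and as_set helpers and the sample rows are assumptions of this sketch; attribute names are qualified with the relation name so that the two branchNo attributes stay distinct, and results are compared as sets because the order of tuples and of attributes is not significant in a relation.

```python
# Invented sample rows; attribute names are qualified to avoid clashes.
staff = [
    {"Staff.staffNo": "S1", "Staff.branchNo": "B003"},
    {"Staff.staffNo": "S2", "Staff.branchNo": "B005"},
]
branch = [
    {"Branch.branchNo": "B003", "Branch.city": "Glasgow"},
    {"Branch.branchNo": "B005", "Branch.city": "London"},
]


def theta_join(left, right, predicate):
    """Nested-loop Theta join: pair up tuples that satisfy the predicate."""
    return [{**a, **b} for a in left for b in right if predicate(a, b)]


def as_set(relation):
    """View a relation as a set of tuples, ignoring tuple and attribute order."""
    return {frozenset(t.items()) for t in relation}


staff_branch = theta_join(
    staff, branch, lambda s, b: s["Staff.branchNo"] == b["Branch.branchNo"])
branch_staff = theta_join(
    branch, staff, lambda b, s: s["Staff.branchNo"] == b["Branch.branchNo"])

print(as_set(staff_branch) == as_set(branch_staff))  # True
```

Although the results are equal, the two orders can differ in cost, which is why the optimizer is interested in being able to swap the operands.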

Heuristical Processing Strategies
• Perform Selection operations as early as possible.
• Combine the Cartesian product with a subsequent Selection operation whose predicate represents a join condition into a Join operation.
• Use associativity of binary operations to rearrange leaf nodes so that the leaf nodes with the most restrictive Selection operations are executed first.
• Perform Projection operations as early as possible.
• Compute common expressions once.
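The effect of performing Selection early can be seen by counting intermediate tuples. The sketch below reuses the figures from the Staff and Branch example earlier in the chapter (1000 staff, 50 branches, one Manager per branch, 5 London branches); the rows themselves are synthetic and the tuple layout is an assumption made for illustration.

```python
# Synthetic rows: (staffNo, branchNo, position) and (branchNo, city).
staff = [(i, f"B{i % 50:03d}", "Manager" if i < 50 else "Assistant")
         for i in range(1000)]
branch = [(f"B{j:03d}", "London" if j < 5 else "Elsewhere")
          for j in range(50)]

# Late selection: build the full Cartesian product, then filter it.
product = [(s, b) for s in staff for b in branch]
late = [(s, b) for s, b in product
        if s[1] == b[0] and s[2] == "Manager" and b[1] == "London"]

# Early selection: filter each relation first, then join the small results.
managers = [s for s in staff if s[2] == "Manager"]
london = [b for b in branch if b[1] == "London"]
early = [(s, b) for s in managers for b in london if s[1] == b[0]]

print(len(product))                 # 50000 intermediate tuples
print(len(managers), len(london))   # 50 5
print(late == early, len(early))    # True 5
```

Both plans return the same five tuples, but the early-selection plan never materializes more than 55 intermediate tuples.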

Heuristical Query optimization: Example
• Consider the following relations:

Employee (Fname, Mname, Lname, Ssn, Bdate, Address, Gender, Salary, Superssn, Dno)
Project (Pname, Pnumber, Plocation, Dnum)
Works_On (Essn, Pno, Hours)

Heuristical Query optimization: Example
• Query Q on these relations: find the last names of employees born after 1957 who work on a project named 'Aquarius'.
• This query can be specified in SQL as follows:

Q: SELECT Lname
   FROM EMPLOYEE, WORKS_ON, PROJECT
   WHERE Pname = 'Aquarius' AND Pnumber = Pno
     AND Essn = Ssn
     AND Bdate > '1957-12-31';
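Query Q can be run as written against a small in-memory database. The sketch below uses Python's built-in sqlite3 module; the tables are cut down to the columns that Q references, and all of the rows are invented sample data rather than anything from the slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Only the columns used by query Q are created here.
    CREATE TABLE EMPLOYEE (Lname TEXT, Ssn TEXT, Bdate TEXT);
    CREATE TABLE PROJECT  (Pname TEXT, Pnumber INTEGER);
    CREATE TABLE WORKS_ON (Essn TEXT, Pno INTEGER);

    -- Invented sample rows.
    INSERT INTO EMPLOYEE VALUES ('Smith',  '111', '1965-01-09');
    INSERT INTO EMPLOYEE VALUES ('Wong',   '222', '1955-12-08');
    INSERT INTO EMPLOYEE VALUES ('Zelaya', '333', '1968-07-19');
    INSERT INTO PROJECT  VALUES ('Aquarius', 10);
    INSERT INTO PROJECT  VALUES ('ProductX', 1);
    INSERT INTO WORKS_ON VALUES ('111', 10);
    INSERT INTO WORKS_ON VALUES ('222', 10);
    INSERT INTO WORKS_ON VALUES ('333', 1);
""")

rows = conn.execute("""
    SELECT Lname
    FROM EMPLOYEE, WORKS_ON, PROJECT
    WHERE Pname = 'Aquarius' AND Pnumber = Pno
      AND Essn = Ssn
      AND Bdate > '1957-12-31'
""").fetchall()
conn.close()

print(rows)  # [('Smith',)]
```

Dates are stored as ISO-8601 text here, so the string comparison in the WHERE clause orders them correctly. Wong works on Aquarius but was born before 1958, and Zelaya works on a different project, so only Smith qualifies.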

Heuristical Query optimization: Example

Simplified steps in converting a query tree during heuristic optimization:
1. Initial (canonical) query tree for SQL query Q.
2. Moving SELECT operations down the query tree.
3. Applying the more restrictive SELECT operation first.
4. Replacing CARTESIAN PRODUCT and SELECT with JOIN operations.
5. Moving PROJECT operations down the query tree.

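Heuristical Query optimization: Example

1. Initial (canonical) query tree for SQL query Q.

SELECT Lname
FROM EMPLOYEE, WORKS_ON, PROJECT
WHERE Pname = 'Aquarius' AND Pnumber = Pno AND Essn = Ssn
  AND Bdate > '1957-12-31';

The canonical tree applies the Cartesian product of the relations in the FROM clause first, then a single SELECT with the whole WHERE condition, then the PROJECT:

Π_{Lname}(σ_{Pname='Aquarius' ∧ Pnumber=Pno ∧ Essn=Ssn ∧ Bdate>'1957-12-31'}((EMPLOYEE × WORKS_ON) × PROJECT))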

Heuristical Query optimization: Example

2. Moving SELECT operations down the query tree.

3. Applying the more restrictive SELECT operation first.

4. Replacing CARTESIAN PRODUCT and SELECT with JOIN operations:

σ_{R.a=S.b}(R × S) = R ⋈_{R.a=S.b} S
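The identity in step 4 can be checked on two small relations. In the sketch below the left-hand side is computed literally as a product followed by a filter, and the right-hand side as a simple hash join; the relations R and S and their contents are invented, and the hash join is just one possible way to implement the Join operation.

```python
from itertools import product

# Invented sample relations R(a, x) and S(b, y).
R = [{"a": 1, "x": "r1"}, {"a": 2, "x": "r2"}, {"a": 2, "x": "r3"}]
S = [{"b": 2, "y": "s1"}, {"b": 3, "y": "s2"}]

# Left-hand side: Cartesian product, then Selection on R.a = S.b.
pairs_examined = len(R) * len(S)
product_then_select = [{**r, **s} for r, s in product(R, S) if r["a"] == s["b"]]

# Right-hand side: an equijoin implemented as a hash join.
index = {}
for s in S:
    index.setdefault(s["b"], []).append(s)   # build phase on S.b
joined = [{**r, **s} for r in R for s in index.get(r["a"], [])]  # probe phase

print(product_then_select == joined)  # True
print(len(joined), pairs_examined)    # 2 6
```

The join produces the same two tuples without enumerating all six pairs, which is the saving this rewriting step is after.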

Heuristical Query optimization: Example

5. Moving PROJECT operations down the query tree.

