DBMS Module 2.5 Query Processing

The document discusses the basic steps in query processing: parsing and translation, optimization, and evaluation. It describes how a query is parsed, translated to relational algebra, optimized to find the most efficient evaluation plan, and then evaluated by executing the optimized plan. The optimization step involves transforming the query using equivalence rules to find a logically equivalent plan with lower estimated cost based on statistics about the data. Some common equivalence rules allow projections, selections, and joins to be reordered and distributed in different ways during optimization.

Uploaded by

Shraddha Pattnaik


DBMS module 2.

Query Processing Strategy

Database Engineering 4th SEM CSE

Basic Steps in Query Processing
1. Parsing and translation
2. Optimization
3. Evaluation
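[Figure: query-processing pipeline. A query goes to the parser and translator, which produce a relational-algebra expression; the optimizer, consulting statistics about the data, turns it into an execution plan; the evaluation engine runs the plan against the data and produces the query output.]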
CIS552 Query Processing 2
Basic Steps in Query Processing (Cont.)

Parsing and translation
• Translate the query into its internal form, which is then
translated into relational algebra.
• The parser checks syntax and verifies that the relations named in
the query exist.
Evaluation
• The query-execution engine takes a query-evaluation
plan, executes that plan, and returns the answers to the
query.

Basic Steps in Query Processing
Optimization: finding the cheapest evaluation plan for a query.
• A given relational-algebra expression may have many equivalent
expressions.
E.g. σ_{balance<2500}(Π_{balance}(account)) is equivalent to
Π_{balance}(σ_{balance<2500}(account))
• Any relational-algebra expression can be evaluated in
many ways. An annotated expression specifying a detailed
evaluation strategy is called an evaluation plan.
E.g. can use an index on balance to find accounts with
balance < 2500, or can perform a complete relation scan and
discard accounts with balance ≥ 2500.
• Amongst all equivalent expressions, try to choose the one
with the cheapest possible evaluation plan. The cost estimate of a
plan is based on statistical information in the DBMS catalog.
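The two evaluation strategies named above (index lookup versus full relation scan) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the account tuples are made-up sample data, and a sorted Python list searched with bisect stands in for an ordered index on balance. It is not how any particular DBMS implements its access paths.

```python
from bisect import bisect_left

# Hypothetical sample data: (account_number, balance) tuples.
account = [("A-101", 500), ("A-102", 4000), ("A-215", 700),
           ("A-217", 2500), ("A-305", 2499), ("A-222", 9000)]

def full_scan(relation, limit):
    """Plan 1: read every tuple, discard those with balance >= limit."""
    return {t for t in relation if t[1] < limit}

def build_index(relation):
    """Assumption: a sorted (balance, account_number) list plays the
    role of an ordered index on balance."""
    return sorted((bal, acc) for acc, bal in relation)

def index_scan(index, limit):
    """Plan 2: binary-search for the first entry with balance >= limit
    and return only the entries before it."""
    cut = bisect_left(index, (limit,))
    return {(acc, bal) for bal, acc in index[:cut]}

index = build_index(account)
scan_result = full_scan(account, 2500)
index_result = index_scan(index, 2500)

# Both plans answer the same selection; only the work done differs.
print(scan_result == index_result)            # True
print(sorted(acc for acc, _ in index_result))  # ['A-101', 'A-215', 'A-305']
```

The full scan touches all six tuples, while the index plan touches only the three qualifying entries plus a logarithmic search; that difference is what a cost estimate tries to capture.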

Catalog Information for Cost Estimation
• n_r: number of tuples in relation r.
• b_r: number of blocks containing tuples of r.
• s_r: size of a tuple of r in bytes.
• f_r: blocking factor of r, i.e., the number of tuples of r that fit into one
block.
• V(A, r): number of distinct values that appear in r for attribute A;
same as the size of Π_A(r).
• SC(A, r): selection cardinality of attribute A of relation r; average
number of records that satisfy equality on A.
• If tuples of r are stored together physically in a file, then:
b_r = ⌈n_r / f_r⌉

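As a small worked example of the block-count formula b_r = ⌈n_r / f_r⌉, the sketch below computes it with integer arithmetic. The helper names, the 4096-byte block size and the 200-byte tuple size are illustrative assumptions, and the blocking-factor helper further assumes fixed-size tuples that do not span blocks.

```python
def blocking_factor(block_size: int, s_r: int) -> int:
    """f_r under the assumption of fixed-size, unspanned tuples."""
    return block_size // s_r

def blocks_needed(n_r: int, f_r: int) -> int:
    """b_r = ceil(n_r / f_r), computed without floating point."""
    if f_r <= 0:
        raise ValueError("blocking factor must be positive")
    return -(-n_r // f_r)

f_r = blocking_factor(4096, 200)   # 20 tuples fit in one block
print(f_r)                          # 20
print(blocks_needed(10_000, f_r))   # 500
print(blocks_needed(10_001, f_r))   # 501 (one extra tuple needs a new block)
```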

Evaluation of Expressions
• Materialization: evaluate one operation at a time, starting
at the lowest-level. Use intermediate results materialized
into temporary relations to evaluate next-level operations.
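• E.g., in the figure below, compute and store σ_{balance<2500}(account);
then compute and store its join with customer, and finally
compute the projection on customer-name.
[Figure: expression tree with Π_{customer-name} at the root, a join ⋈ below it, and σ_{balance<2500}(account) and customer as the two inputs of the join.]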
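A minimal sketch of materialized evaluation for this expression tree follows. The relation contents and the customer_id attribute that links account to customer are hypothetical (the slide does not give a schema); Python lists stand in for temporary relations written to disk.

```python
# Hypothetical toy relations; the schemas are assumptions for illustration.
account = [
    {"account_number": "A-101", "customer_id": 1, "balance": 500},
    {"account_number": "A-102", "customer_id": 2, "balance": 4000},
    {"account_number": "A-215", "customer_id": 3, "balance": 700},
]
customer = [
    {"customer_id": 1, "customer_name": "Hayes"},
    {"customer_id": 2, "customer_name": "Jones"},
    {"customer_id": 3, "customer_name": "Smith"},
]

def select(relation, predicate):
    return [t for t in relation if predicate(t)]

def join(left, right, key):
    return [{**l, **r} for l in left for r in right if l[key] == r[key]]

def project(relation, attrs):
    """Projection with duplicate elimination, preserving first-seen order."""
    seen, out = set(), []
    for t in relation:
        row = tuple(t[a] for a in attrs)
        if row not in seen:
            seen.add(row)
            out.append(row)
    return out

# Each operator finishes completely and its result is stored
# before the next-level operator starts.
temp1 = select(account, lambda t: t["balance"] < 2500)  # lowest level
temp2 = join(temp1, customer, "customer_id")            # next level
result = project(temp2, ["customer_name"])              # root

print(len(temp1), len(temp2), result)
# 2 2 [('Hayes',), ('Smith',)]
```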

Evaluation of Expressions (Cont.)
• Pipelining: evaluate several operations simultaneously,
passing the results of one operation on to the next.
• E.g., in the expression on the previous slide, don’t store the result of
σ_{balance<2500}(account); instead, pass tuples directly to
the join. Similarly, don’t store the result of the join; pass tuples
directly to the projection.
• Much cheaper than materialization: no need to store a
temporary relation to disk.
• For pipelining to be effective, use evaluation algorithms
that generate output tuples even as tuples are received
for inputs to the operation.
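Pipelining can be sketched with Python generators, where each operator yields an output tuple as soon as it has one. The data, the customer_id link attribute and the dictionary used as a lookup structure for customer are all assumptions made for the illustration; the scanned list exists only to show how much of account has been read at each point.

```python
from itertools import islice

# Hypothetical toy data; schemas are assumptions for illustration.
account = [
    {"account_number": "A-101", "customer_id": 1, "balance": 500},
    {"account_number": "A-102", "customer_id": 2, "balance": 4000},
    {"account_number": "A-215", "customer_id": 3, "balance": 700},
]
customer_by_id = {
    1: [{"customer_id": 1, "customer_name": "Hayes"}],
    2: [{"customer_id": 2, "customer_name": "Jones"}],
    3: [{"customer_id": 3, "customer_name": "Smith"}],
}

scanned = []  # account tuples read so far, for demonstration only

def scan(relation):
    for t in relation:
        scanned.append(t["account_number"])
        yield t

def select(tuples, predicate):
    return (t for t in tuples if predicate(t))

def join(tuples, lookup, key):
    # Emits a joined tuple as soon as an input tuple arrives.
    for t in tuples:
        for match in lookup.get(t[key], []):
            yield {**t, **match}

def project(tuples, attr):
    # Duplicate elimination is omitted to keep the sketch short.
    return (t[attr] for t in tuples)

pipeline = project(
    join(select(scan(account), lambda t: t["balance"] < 2500),
         customer_by_id, "customer_id"),
    "customer_name")

first = list(islice(pipeline, 1))
scanned_after_first = list(scanned)
rest = list(pipeline)

print(first, scanned_after_first)  # ['Hayes'] ['A-101']
print(rest, scanned)               # ['Smith'] ['A-101', 'A-102', 'A-215']
```

The first answer is produced after reading a single account tuple, and no intermediate relation is ever stored.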

Transformation of Relational Expressions

• Generation of query-evaluation plans for an expression
involves two steps:
1. generating logically equivalent expressions
2. annotating resultant expressions to get alternative
query plans
• Use equivalence rules to transform an expression into
an equivalent one.
• Based on estimated cost, the cheapest plan is selected.
The process is called cost-based optimization.

Equivalence of Expressions
• Relations generated by two equivalent expressions have the same
set of attributes and contain the same set of tuples, although their
attributes may be ordered differently.
[Figure: equivalent expressions. (a) Initial expression tree: Π_{customer-name} over σ_{branch-city = Brooklyn} over branch ⋈ (account ⋈ depositor). (b) Transformed expression tree: Π_{customer-name} over (σ_{branch-city = Brooklyn}(branch)) ⋈ (account ⋈ depositor).]
Equivalence Rules
1. Conjunctive selection operations can be deconstructed
into a sequence of individual selections.
σ_{θ1 ∧ θ2}(E) = σ_{θ1}(σ_{θ2}(E))
2. Selection operations are commutative.
σ_{θ1}(σ_{θ2}(E)) = σ_{θ2}(σ_{θ1}(E))
3. Only the last in a sequence of projection operations is
needed; the others can be omitted.
Π_{L1}(Π_{L2}(…(Π_{Ln}(E))…)) = Π_{L1}(E)
4. Selections can be combined with Cartesian products
and theta joins.
(a) σ_θ(E1 × E2) = E1 ⋈_θ E2
(b) σ_{θ1}(E1 ⋈_{θ2} E2) = E1 ⋈_{θ1 ∧ θ2} E2
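Rules 1, 2 and 4(a) can be spot-checked on small sample relations. The sketch below is a sanity check on made-up data, not a proof; the relations, predicates and helper names are all illustrative assumptions.

```python
from itertools import product

def select(rel, pred):
    return [t for t in rel if pred(t)]

def cartesian(left, right):
    return [{**l, **r} for l, r in product(left, right)]

def theta_join(left, right, pred):
    # Join defined directly: pair up tuples and keep the matching pairs.
    return [{**l, **r} for l in left for r in right if pred({**l, **r})]

def as_set(rel):
    """Order-independent view of a relation, for comparing results."""
    return {tuple(sorted(t.items())) for t in rel}

E = [{"a": a, "b": b} for a in range(4) for b in range(4)]
theta1 = lambda t: t["a"] > 1
theta2 = lambda t: t["b"] < 2

# Rule 1: a conjunctive selection equals a cascade of selections.
rule1 = as_set(select(E, lambda t: theta1(t) and theta2(t))) == \
        as_set(select(select(E, theta2), theta1))

# Rule 2: selections commute.
rule2 = as_set(select(select(E, theta2), theta1)) == \
        as_set(select(select(E, theta1), theta2))

# Rule 4(a): selection over a Cartesian product is a theta join.
E1 = [{"x": i} for i in range(3)]
E2 = [{"y": j} for j in range(3)]
theta = lambda t: t["x"] < t["y"]
rule4a = as_set(select(cartesian(E1, E2), theta)) == \
         as_set(theta_join(E1, E2, theta))

print(rule1, rule2, rule4a)  # True True True
```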

Equivalence Rules (Cont.)

5. Theta-join operations (and natural joins) are
commutative.
E1 ⋈_θ E2 = E2 ⋈_θ E1
6. (a) Natural join operations are associative:
(E1 ⋈ E2) ⋈ E3 = E1 ⋈ (E2 ⋈ E3)
(b) Theta joins are associative in the following manner:
(E1 ⋈_{θ1} E2) ⋈_{θ2 ∧ θ3} E3 = E1 ⋈_{θ1 ∧ θ3} (E2 ⋈_{θ2} E3)
where θ2 involves attributes from only E2 and E3.

Equivalence Rules (Cont.)
7. The selection operation distributes over the theta join
operation under the following two conditions:
(a) When all the attributes in θ0 involve only the attributes
of one of the expressions (E1) being joined.
σ_{θ0}(E1 ⋈_θ E2) = (σ_{θ0}(E1)) ⋈_θ E2
(b) When θ1 involves only the attributes of E1 and θ2
involves only the attributes of E2.
σ_{θ1 ∧ θ2}(E1 ⋈_θ E2) = (σ_{θ1}(E1)) ⋈_θ (σ_{θ2}(E2))

Equivalence Rules (Cont.)
8. The projection operation distributes over the theta join
operation as follows:
(a) if θ involves only attributes from L1 ∪ L2:
Π_{L1 ∪ L2}(E1 ⋈_θ E2) = (Π_{L1}(E1)) ⋈_θ (Π_{L2}(E2))
(b) Consider a join E1 ⋈_θ E2. Let L1 and L2 be sets of
attributes from E1 and E2, respectively. Let L3 be
attributes of E1 that are involved in join condition θ,
but are not in L1 ∪ L2, and let L4 be attributes of E2 that
are involved in join condition θ, but are not in L1 ∪ L2.
Π_{L1 ∪ L2}(E1 ⋈_θ E2) = Π_{L1 ∪ L2}((Π_{L1 ∪ L3}(E1)) ⋈_θ (Π_{L2 ∪ L4}(E2)))

Equivalence Rules (Cont.)
9. The set operations union and intersection are commutative (set
difference is not commutative).
E1 ∪ E2 = E2 ∪ E1
E1 ∩ E2 = E2 ∩ E1
10. Set union and intersection are associative.
11. The selection operation distributes over ∪, ∩ and −. E.g.:
σ_p(E1 − E2) = σ_p(E1) − σ_p(E2)
For intersection and union we also have:
σ_p(E1 ∩ E2) = σ_p(E1) ∩ σ_p(E2)
σ_p(E1 ∪ E2) = σ_p(E1) ∪ σ_p(E2)
12. The projection operation distributes over the union operation.
Π_L(E1 ∪ E2) = (Π_L(E1)) ∪ (Π_L(E2))
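The set-operation rules can be checked with Python sets on a small example. The two relations and the predicate below are made-up sample data. The last check is an addition that the slide does not state: projection does not in general distribute over set difference, which is why rule 12 is stated for union only.

```python
# Hypothetical union-compatible relations of (id, tag) tuples.
E1 = {(1, "x"), (2, "y"), (3, "z")}
E2 = {(2, "y"), (3, "w"), (4, "v")}

def select(rel, p):
    return {t for t in rel if p(t)}

def project_id(rel):
    """Projection onto the first attribute (sets drop duplicates)."""
    return {(t[0],) for t in rel}

p = lambda t: t[0] >= 2

# Rule 11: selection distributes over difference, intersection, union.
rule11_diff = select(E1 - E2, p) == select(E1, p) - select(E2, p)
rule11_inter = select(E1 & E2, p) == select(E1, p) & select(E2, p)
rule11_union = select(E1 | E2, p) == select(E1, p) | select(E2, p)

# Rule 12: projection distributes over union.
rule12 = project_id(E1 | E2) == project_id(E1) | project_id(E2)

# Not a rule: projection over difference gives a different result here.
proj_over_diff = project_id(E1 - E2) == project_id(E1) - project_id(E2)

print(rule11_diff, rule11_inter, rule11_union, rule12)  # True True True True
print(proj_over_diff)                                   # False
```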

Selection Operation Example
• Query: Find the names of all customers who have an
account at some branch located in Brooklyn.
Π_{customer-name}(σ_{branch-city = “Brooklyn”}
(branch ⋈ (account ⋈ depositor)))
• Transformation using rule 7a.
Π_{customer-name}
((σ_{branch-city = “Brooklyn”}(branch)) ⋈ (account ⋈ depositor))
• Performing the selection as early as possible reduces
the size of the relation to be joined.

Selection Operation Example (Cont.)
• Query: Find the names of all customers with an account at a
Brooklyn branch whose account balance is over $1000.
Π_{customer-name}(σ_{branch-city = “Brooklyn” ∧ balance > 1000}
(branch ⋈ (account ⋈ depositor)))
• Transformation using join associativity (Rule 6a):
Π_{customer-name}(σ_{branch-city = “Brooklyn” ∧ balance > 1000}
((branch ⋈ account) ⋈ depositor))
• The second form provides an opportunity to apply the “Perform
selections early” rule, resulting in the subexpression
σ_{branch-city = “Brooklyn”}(branch) ⋈ σ_{balance > 1000}(account)
• Thus a sequence of transformations can be useful.
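To make the benefit concrete, the sketch below evaluates the Brooklyn query in its initial form and in the transformed form and records the sizes of the intermediate relations. The tuples are invented sample data and the helper functions are illustrative; the attribute names follow the bank schema used on the slides, with underscores instead of hyphens.

```python
def natural_join(left, right):
    """Natural join on all attribute names the two inputs share."""
    out = []
    for l in left:
        for r in right:
            common = l.keys() & r.keys()
            if all(l[a] == r[a] for a in common):
                out.append({**l, **r})
    return out

def select(rel, pred):
    return [t for t in rel if pred(t)]

# Hypothetical sample data.
branch = [
    {"branch_name": "Brighton", "branch_city": "Brooklyn"},
    {"branch_name": "Downtown", "branch_city": "Brooklyn"},
    {"branch_name": "Perryridge", "branch_city": "Horseneck"},
    {"branch_name": "Redwood", "branch_city": "Palo Alto"},
]
account = [
    {"account_number": "A-101", "branch_name": "Downtown", "balance": 500},
    {"account_number": "A-201", "branch_name": "Brighton", "balance": 900},
    {"account_number": "A-217", "branch_name": "Brighton", "balance": 1500},
    {"account_number": "A-102", "branch_name": "Perryridge", "balance": 4000},
    {"account_number": "A-222", "branch_name": "Redwood", "balance": 7000},
]
depositor = [
    {"customer_name": "Johnson", "account_number": "A-101"},
    {"customer_name": "Smith", "account_number": "A-201"},
    {"customer_name": "Jones", "account_number": "A-217"},
    {"customer_name": "Hayes", "account_number": "A-102"},
    {"customer_name": "Lindsay", "account_number": "A-222"},
]

in_brooklyn = lambda t: t["branch_city"] == "Brooklyn"
over_1000 = lambda t: t["balance"] > 1000

# Initial form: join everything, then select.
inner = natural_join(account, depositor)
joined = natural_join(branch, inner)
names_late = {t["customer_name"]
              for t in select(joined, lambda t: in_brooklyn(t) and over_1000(t))}
late_sizes = [len(inner), len(joined)]

# Transformed form: select on branch and account first, then join.
b = select(branch, in_brooklyn)
a = select(account, over_1000)
ba = natural_join(b, a)
names_early = {t["customer_name"] for t in natural_join(ba, depositor)}
early_sizes = [len(b), len(a), len(ba)]

print(names_late == names_early, names_early)  # True {'Jones'}
print(late_sizes, early_sizes)                 # [5, 5] [2, 3, 1]
```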

Projection Operation Example

Π_{customer-name}(((σ_{branch-city = “Brooklyn”}(branch)) ⋈ account) ⋈ depositor)
• When we compute
(σ_{branch-city = “Brooklyn”}(branch) ⋈ account)
we obtain a relation whose schema is:
(branch-name, branch-city, assets, account-number, balance)
• Push projections using equivalence rules 8a and 8b; eliminate
unneeded attributes from intermediate results to get:
Π_{customer-name}((Π_{account-number}(
(σ_{branch-city = “Brooklyn”}(branch)) ⋈ account)) ⋈ depositor)

Join Ordering Example
• For all relations r1, r2 and r3,
(r1 ⋈ r2) ⋈ r3 = r1 ⋈ (r2 ⋈ r3)
• If r2 ⋈ r3 is quite large and r1 ⋈ r2 is small, we choose
(r1 ⋈ r2) ⋈ r3
so that we compute and store a smaller temporary
relation.
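The effect of join order on the temporary relation can be seen on a contrived example. The three relations below are made up so that r1 ⋈ r2 is tiny while r2 ⋈ r3 is large; the attribute names and the natural_join helper are assumptions for the illustration.

```python
def natural_join(left, right):
    """Natural join on all attribute names the two inputs share."""
    out = []
    for l in left:
        for r in right:
            common = l.keys() & r.keys()
            if all(l[a] == r[a] for a in common):
                out.append({**l, **r})
    return out

def as_set(rel):
    return {tuple(sorted(t.items())) for t in rel}

# Hypothetical relations r1(a, b), r2(b, c), r3(c, d).
r1 = [{"a": 1, "b": 1}]
r2 = [{"b": b, "c": 0} for b in range(1, 11)]
r3 = [{"c": 0, "d": d} for d in range(10)]

# Order A: (r1 join r2) join r3
r1_r2 = natural_join(r1, r2)
plan_a = natural_join(r1_r2, r3)

# Order B: r1 join (r2 join r3)
r2_r3 = natural_join(r2, r3)
plan_b = natural_join(r1, r2_r3)

print(as_set(plan_a) == as_set(plan_b))  # True
print(len(r1_r2), len(r2_r3))            # 1 100
```

Both orders return the same 10 tuples, but order A stores a 1-tuple temporary relation where order B stores 100 tuples.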

Heuristic Optimization
• Cost-based optimization is expensive, even with
dynamic programming.
• Systems may use heuristics to reduce the number of
choices that must be made in a cost-based fashion.
• Heuristic optimization transforms the query-tree by
using a set of rules that typically (but not in all cases)
improve execution performance:
– Perform selection early (reduces the number of tuples)
– Perform projection early (reduces the number of attributes)
– Perform most restrictive selection and join operations before
other similar operations.
• Some systems use only heuristics, others combine
heuristics with partial cost-based optimization.
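The "perform selection early" heuristic can be sketched as a rewrite over a small expression tree. Everything here is an illustrative assumption: the node classes, the restriction to predicates that mention a single attribute, and the conservative choice to leave a selection in place when its attribute appears on both sides of a join. It is a toy rewriter, not the algorithm of any particular optimizer.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Relation:
    name: str
    attrs: frozenset

@dataclass(frozen=True)
class Select:
    attr: str        # the single attribute the predicate mentions
    predicate: str   # predicate text, treated as opaque
    child: object

@dataclass(frozen=True)
class Join:
    left: object
    right: object

def attrs_of(node):
    if isinstance(node, Relation):
        return node.attrs
    if isinstance(node, Select):
        return attrs_of(node.child)
    return attrs_of(node.left) | attrs_of(node.right)

def push_selections(node):
    """Move each selection below a join when its attribute comes
    from exactly one of the join inputs."""
    if isinstance(node, Relation):
        return node
    if isinstance(node, Join):
        return Join(push_selections(node.left), push_selections(node.right))
    child = push_selections(node.child)
    if isinstance(child, Join):
        in_left = node.attr in attrs_of(child.left)
        in_right = node.attr in attrs_of(child.right)
        if in_left and not in_right:
            pushed = Select(node.attr, node.predicate, child.left)
            return Join(push_selections(pushed), child.right)
        if in_right and not in_left:
            pushed = Select(node.attr, node.predicate, child.right)
            return Join(child.left, push_selections(pushed))
    return Select(node.attr, node.predicate, child)

def show(node):
    if isinstance(node, Relation):
        return node.name
    if isinstance(node, Select):
        return f"σ[{node.predicate}]({show(node.child)})"
    return f"({show(node.left)} ⋈ {show(node.right)})"

branch = Relation("branch", frozenset({"branch_name", "branch_city", "assets"}))
account = Relation("account", frozenset({"account_number", "branch_name", "balance"}))
depositor = Relation("depositor", frozenset({"customer_name", "account_number"}))

# Two stacked selections above ((branch join account) join depositor).
tree = Select("balance", "balance > 1000",
              Select("branch_city", "branch_city = Brooklyn",
                     Join(Join(branch, account), depositor)))
rewritten = push_selections(tree)
print(show(rewritten))
```

On this input both selections end up directly above the base relations they refer to, giving (σ(branch) ⋈ σ(account)) ⋈ depositor.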