Introduction To Query Processing

This document provides an overview of query processing and optimization. It discusses the basic steps of query processing which are parsing, optimization, and evaluation. It also covers measures of query cost such as disk accesses and CPU time. Query optimization is the process of selecting the most efficient query evaluation plan by transforming the query expression and evaluating operations in the most cost-effective order. The goal is to minimize the number of disk accesses and reduce the response time for a query.

Uploaded by

Atharva Tadge

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views

Introduction To Query Processing

Uploaded by

Atharva Tadge

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

Introduction to Query Processing

and
Query Optimization
Outline

Overview
Measures of Query Cost
Query Optimization
What is Query Processing?

➢ Query processing: Activities involved in

extracting data from a database.
➢ Three basic steps:
1. Parsing and Translation
2. Optimization
3. Evaluation
Steps in Query Processing
Measures of Query Cost

➢ Cost is generally measured as total elapsed time for answering query

➢ Many factors contribute to time cost
1. Disk accesses (Time to process a data request and retrive data
from the storage device)
2. CPU (time to execute a query)
3. Network communication cost
➢ Disk access is the predominant cost, and is also relatively easy to
estimate.
➢ Cost to write a block is greater than cost to read a block
• data is read back after being written to ensure that the write
was successful
Measures of Query Cost

For simplicity we just use the number of block transfers from disk and
the number of seeks as the cost measures
tT – time to transfer one block
tS – time for one seek
Cost for b block transfers plus S seeks
b * tT + S * tS
We ignore CPU costs for simplicity
Real systems do take CPU cost into account
We do not include cost to writing output to disk in our cost
Select operation
➢ Symbol: 
➢ Notation:  condition (Relation)
➢ Operation : Select tuple from a relation that satisfy a given condition.

➢ Search algorithm
1. Linear search (A1)
2. Binary search (A2)
Linear search (A1)
It Scan each file block and test all records to see whether they satisfy the selection condition.

Cost estimate = br block transfers

•br denotes number of blocks containing records from relation r
If selection is on a key attribute (primary key), then system can
stop on finding record
• cost = (br /2) block transfers
Linear search can be applied regardless of
• Selection condition or
• Ordering of records in the file, or
• Availability of indices
This algorithm is slower than binary search algorithm.
Binary search (A2)
Is used when selection is an equality comparison on the primary key and
relation is sorted on primary key attribute.

Cost of binary search = [log2(br)]

br denotes number of blocks containing records from relation r

If the selection is on non primary attribute then multiple block may

contains required records , then the cost of scanning such block need to
be added to the cost estimate.

This algorithm is faster than linear search algorithm

Evaluation of expressions

➢ Method
1. Materialization
2. Pipelining
Materialization
➢ Materialized evaluation: evaluate one operation at a time, starting at the lowest-
level. (from bottom and perform the inner most operations first)
➢ The intermediate results of each operation is materialized (store in temporary
relation)and become input for subsequent( evaluate next-level operations).
➢ The cost of materialization is the sum of the individual operations plus the cost of
writing the intermediate results to disk.
The problem ;
1. Creates lots of temporary relation
2. Perform lots of I/O operation
Pipelining
It evaluate several operations simultaneously, passing the results of one
operation on to the next.
To reduce number of intermediate temporary relations , we pass results of one
operation to the next operation in the pipeline.
Combining operations into a pipeline eliminates the cost of reading and writing
temporary relations.
Much cheaper than materialization: no need to store a temporary relation to disk.
Pipelines can be executed in two ways:
Demand driven –system makes request for tuples from the operation at the top
of pipeline
Producer driven – Operation do not wait for request to produce tuple but
generate the tuples eagerly.
Query Optimization

Process of selecting the most efficient query evaluation plan

Query Optimization

Customer Account
Cid Ano C_name Ano Balance
A1 3000
101 A1 Ram
A2 1000
102 A2 Harsh
A3 2000
103 A3 Deepak
A4 4000
104 A4 Gopal

Efficient Plan 2 records 4 records

4 records 4 records
Transformation of relational Expression
Cascade of selection
Combined selection operation can be divided into sequence of individual selection.
Selection operation
Selection operation are commutative
Project opeartion
If more than one projection operation is used in expression then only the
outer projection operation is required.
Join
Natural join operations are associative
Example
Recap

Query processing
Measures of Query Cost
Evaluation of expressions
Query representation

M4 M5 M6 M7 Ta2 Panaligan
100% (1)
M4 M5 M6 M7 Ta2 Panaligan
16 pages
Unit 1 Exam Paper Jan 2022
50% (2)
Unit 1 Exam Paper Jan 2022
24 pages
Lect#2 DDBS (Characteristics and Layers of Query Processing)
78% (9)
Lect#2 DDBS (Characteristics and Layers of Query Processing)
20 pages
Mini Project Report-2
0% (2)
Mini Project Report-2
26 pages
DDS Unit - 2
No ratings yet
DDS Unit - 2
7 pages
Query Optimizattion
No ratings yet
Query Optimizattion
113 pages
Ivunit Query Processing
No ratings yet
Ivunit Query Processing
12 pages
Lesson 05
No ratings yet
Lesson 05
29 pages
3 - Query Tuning
No ratings yet
3 - Query Tuning
42 pages
Query Processing
No ratings yet
Query Processing
39 pages
06 Query Processing (2) - NDN
No ratings yet
06 Query Processing (2) - NDN
31 pages
7-Query Processing
No ratings yet
7-Query Processing
47 pages
DBMS
No ratings yet
DBMS
24 pages
Query Proc Notes
No ratings yet
Query Proc Notes
10 pages
Chapter 13: Query Processing
No ratings yet
Chapter 13: Query Processing
25 pages
Amdahl's Law: S (N) T (1) /T (N)
No ratings yet
Amdahl's Law: S (N) T (1) /T (N)
46 pages
Final DBMS Unit 7
No ratings yet
Final DBMS Unit 7
48 pages
Advanced Database Systems Lecture Notes
No ratings yet
Advanced Database Systems Lecture Notes
79 pages
Query Processing and Query Optimization Techniques
No ratings yet
Query Processing and Query Optimization Techniques
20 pages
DBMS R19 UNIT IV
No ratings yet
DBMS R19 UNIT IV
25 pages
Unit 4
No ratings yet
Unit 4
24 pages
UT 1 QB Solution
No ratings yet
UT 1 QB Solution
4 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
33 pages
A979968895 - 21482 - 28 - 2020 - Ds 1-Basic Data Structure
No ratings yet
A979968895 - 21482 - 28 - 2020 - Ds 1-Basic Data Structure
65 pages
Parallel Sorting Algorithms
100% (1)
Parallel Sorting Algorithms
7 pages
Unit-5 Query Processing and Optimization
No ratings yet
Unit-5 Query Processing and Optimization
40 pages
Digital Logic Design Week01Fall2022
No ratings yet
Digital Logic Design Week01Fall2022
22 pages
Lecture1 Comp 202 Dsa
No ratings yet
Lecture1 Comp 202 Dsa
32 pages
Dbms Chapter 5
No ratings yet
Dbms Chapter 5
54 pages
ACA2024
No ratings yet
ACA2024
44 pages
A979968895 - 21482 - 28 - 2020 - Ds 1-Basic Data Structure
No ratings yet
A979968895 - 21482 - 28 - 2020 - Ds 1-Basic Data Structure
65 pages
HPC Unit 456
No ratings yet
HPC Unit 456
25 pages
My Lecture5 Analysis
No ratings yet
My Lecture5 Analysis
18 pages
Analysis and Estimation: Hardware Software Codesign
No ratings yet
Analysis and Estimation: Hardware Software Codesign
18 pages
Pipelining Size and Depth
No ratings yet
Pipelining Size and Depth
19 pages
Dsa Basic Data Structure
No ratings yet
Dsa Basic Data Structure
72 pages
Relational Query Optimization: Warih Maharani, ST.,MT
No ratings yet
Relational Query Optimization: Warih Maharani, ST.,MT
39 pages
Shorting
No ratings yet
Shorting
27 pages
unit-2 Query processing and optimization,Query equivalence, Join strategies (1)
No ratings yet
unit-2 Query processing and optimization,Query equivalence, Join strategies (1)
38 pages
DBMS_Unit5_Lecture1
No ratings yet
DBMS_Unit5_Lecture1
22 pages
10
No ratings yet
10
76 pages
Sudhansu,DBMS-3rd
No ratings yet
Sudhansu,DBMS-3rd
6 pages
DBMS Unit-5
No ratings yet
DBMS Unit-5
42 pages
OOAD
No ratings yet
OOAD
67 pages
Issues For ESD Estimation
No ratings yet
Issues For ESD Estimation
23 pages
DAA Lecture-1
No ratings yet
DAA Lecture-1
34 pages
Implications of A Distributed Environment Part 2
No ratings yet
Implications of A Distributed Environment Part 2
38 pages
DBMS Unit - 7
No ratings yet
DBMS Unit - 7
34 pages
Complexity Analysis: Cs 101 - Data Structures and Algorithms Concepts Reviewer
No ratings yet
Complexity Analysis: Cs 101 - Data Structures and Algorithms Concepts Reviewer
12 pages
Overview of Query Processing
No ratings yet
Overview of Query Processing
35 pages
Advanced Database
No ratings yet
Advanced Database
47 pages
Ch 13 Updated
No ratings yet
Ch 13 Updated
30 pages
Analytical Modeling of Parallel Systems: Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar
No ratings yet
Analytical Modeling of Parallel Systems: Ananth Grama, Anshul Gupta, George Karypis, and Vipin Kumar
67 pages
Advanced Database Systems: Chapter 3:query Processing and Evaluation
100% (1)
Advanced Database Systems: Chapter 3:query Processing and Evaluation
36 pages
WINSEM2018-19 - CSE2003 - ETH - SJT311 - VL2018195002472 - Reference Material I - Overviewofalgorithm
No ratings yet
WINSEM2018-19 - CSE2003 - ETH - SJT311 - VL2018195002472 - Reference Material I - Overviewofalgorithm
12 pages
DBMS Unit - 7
No ratings yet
DBMS Unit - 7
33 pages
Assignment-2 Ami Pandat Parallel Processing: Time Complexity
No ratings yet
Assignment-2 Ami Pandat Parallel Processing: Time Complexity
12 pages
05 QueryProcessing LecW4 Feb7 22
No ratings yet
05 QueryProcessing LecW4 Feb7 22
55 pages
Distributed Querry Optimization
No ratings yet
Distributed Querry Optimization
4 pages
CD Unit 5
No ratings yet
CD Unit 5
49 pages
Practical Consideration of Internal Sorting and External
No ratings yet
Practical Consideration of Internal Sorting and External
20 pages
chap5-query-processing
No ratings yet
chap5-query-processing
17 pages
Advanced Backend Code Optimization
From Everand
Advanced Backend Code Optimization
Sid Touati
No ratings yet
002 Azure Intro Azure Architecture and Services
No ratings yet
002 Azure Intro Azure Architecture and Services
62 pages
Windows Server Administration Fundamentals
No ratings yet
Windows Server Administration Fundamentals
3 pages
Module 4: Connecting To Additional Resources: in This Module, You Will Learn
No ratings yet
Module 4: Connecting To Additional Resources: in This Module, You Will Learn
31 pages
09-2021-Revisedcost Laptopscheme.
No ratings yet
09-2021-Revisedcost Laptopscheme.
2 pages
A To Z List of All Windows CMD Commands - HELLPC
No ratings yet
A To Z List of All Windows CMD Commands - HELLPC
12 pages
ROCKY-3786EVGU2-RS-R41 ROCKY-4786EVG-RS-R40: Features Features
No ratings yet
ROCKY-3786EVGU2-RS-R41 ROCKY-4786EVG-RS-R40: Features Features
1 page
E Life EPC 415G2 PanelPC Datasheet Compressed 1
No ratings yet
E Life EPC 415G2 PanelPC Datasheet Compressed 1
2 pages
Fall 2024 - CS609 - 2
No ratings yet
Fall 2024 - CS609 - 2
3 pages
Module 2_Relevant Tools, Standards, and Engineering Constraints
No ratings yet
Module 2_Relevant Tools, Standards, and Engineering Constraints
55 pages
Ans-All True: Public Class Class Public Class Public Static Void Class, Class Class Class
No ratings yet
Ans-All True: Public Class Class Public Class Public Static Void Class, Class Class Class
14 pages
Fert POS
No ratings yet
Fert POS
2 pages
Aws Test
No ratings yet
Aws Test
7 pages
Dropbox
No ratings yet
Dropbox
5 pages
Man 8055 Ord Hand
No ratings yet
Man 8055 Ord Hand
10 pages
Lista de Precios YIM S.A.C. 15-03-22 (20%)
No ratings yet
Lista de Precios YIM S.A.C. 15-03-22 (20%)
22 pages
Separate Odd Even Numbers From Given Array
No ratings yet
Separate Odd Even Numbers From Given Array
26 pages
Zimbra CLI Commands
No ratings yet
Zimbra CLI Commands
5 pages
DS JetBox3300-w V1.0 PDF
No ratings yet
DS JetBox3300-w V1.0 PDF
3 pages
Sample For Solution Manual Starting Out With Python 4th Global Edition by Tony Gaddis
No ratings yet
Sample For Solution Manual Starting Out With Python 4th Global Edition by Tony Gaddis
32 pages
Tacl
0% (1)
Tacl
242 pages
RHEL 8.3 - Deploying Red Hat Enterprise Linux 8 On Public Cloud Platforms
No ratings yet
RHEL 8.3 - Deploying Red Hat Enterprise Linux 8 On Public Cloud Platforms
102 pages
Microprocessor Lab Manual
No ratings yet
Microprocessor Lab Manual
27 pages
Kunal Balwani Assgt 6 PDF
No ratings yet
Kunal Balwani Assgt 6 PDF
4 pages
Asus 1005ha r1.1 Schematics
No ratings yet
Asus 1005ha r1.1 Schematics
49 pages
C Questions 0
No ratings yet
C Questions 0
155 pages
1.) Turning On An LED With Your Raspberry Pi's GPIO Pins: The Breadboard
No ratings yet
1.) Turning On An LED With Your Raspberry Pi's GPIO Pins: The Breadboard
70 pages
Windows Lfi
No ratings yet
Windows Lfi
172 pages