08 File Handling

The document discusses file organization in databases, emphasizing the importance of efficient data storage on hard disks for quick access. It covers various types of record storage, including fixed and variable length records, as well as indexing methods like primary, secondary, and clustering indexing. The document also provides examples illustrating the impact of indexing on block access during data retrieval.

Uploaded by

akanksha.kumari2405

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

08 File Handling

Uploaded by

akanksha.kumari2405

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

Ashish Kumar

Dept. of CSE
Manipal University Jaipur

1
File Organization
Even though the database shows us data in the form of
relations we must understand that it is just a logical
representation.
Finally all the data needs to be stored on the hard disk as
files.
It is very important in performance point of view that this
organization should allow the database software to access
data quickly and in an efficient way.

2
Files on Disk
How to store the records on hard disk?
You can rely on operating systems. But, they way we access
normal data is totally different from the way we access
database data.
In database, we might not require the whole file, just one or
two records are required. So, we let DBMS use its very own
organization.
Database ====> Files / Block ====> Record ====> Fields
We divide each table in block & then store it in hard disk.
If the size of a block is 1024 byte and 1 record is of size 4
byte then how many records in one block you can save?
1024 / 4 =256 i.e; blocking factor is the average number of
record per block.
3
Fixed Length v/s Variable Length Records
Consider a list of mobile numbers where each number is
exactly 10 digits long.
Retrieving number will be simple here because we can read
10 characters at a time.
But consider a list of names. How do we know the size of
names? They can vary.
To make matters more complex we can have records of
certain number of fixed length entries and some variable
length entries.

4
Records on Disk in Blocks
Spanned: It allows partial part of record to be stored in a
block.
R1 R2 R3 R4 R4 R5 R6 R7

Advantages : no wastage of memory.

Disadvantage: No. of block access will increase to access a
record.
Unspanned: No record can be stored in more than one
block.
R1 R2 R3 R4 R5 R6 R7 ||||
Advantages : No. of block access will be less to access a
record.
Disadvantage: no wastage of memory.
5
Records on Disk in Files
Ordered File Organization: All records in a file are ordered on
some search key value.
Searching can be done in binary search mode.
Advantage: Searching can be efficient. Only when we search
on search key value. If we search on other attribute then no
advantage.
Disadvantage: Insertion will be expensive due to
reorganization of the entire file.
Un-Ordered File Organization: All records in a file are inserted
wherever the place is available (usually at the end of file).
Searching can be done in linear search mode.
Advantage: Insertion of a record is efficient.
Disadvantage: Searching is very inefficient. 6
Indexing
Indexing mechanisms used to speed up access to desired
data. E.g., author catalog in library
Search Key - attribute or set of attributes used to look up
records in a file.
An index file consists of records (called index entries) of
the form
search-key Block -pointer

Index files are typically much smaller than the original file
Two basic kinds of indices:
Ordered indices: search keys are stored in sorted order
Hash indices: search keys are distributed uniformly
across “buckets” using a “hash function”.
7
Classification of Indexing
Non-
Primary key + Key/Candidate
Ordered Key + Unordered

Non-Key
+ ordered

These all are single level indexing.

8
Primary Indexing
Data file is ordered on primary key & we will build index on
primary key.
A primary index is an ordered file whose records are of
fixed length with two fields. First field is same as primary
key of data file and second field is a pointer to the data
block where key is available.
Index is created for the first record of each block is known
as block anchors.

9
Dense Indexing
Dense index — Index record appears for every search-key
value in the file.

10
Sparse Indexing
Sparse Index: contains index records for only some search-
key values.
Applicable when records are sequentially ordered on
search-key

11
Secondary Indexing
Secondary Index provides a
secondary means of accessing
a file for which primary access
already exist.
It will be dense index. i.e.,
index will be created for every
record in a file.
Secondary Index does not
have any impact on how the
rows are actually organized in
data blocks.
They can be in any order. The
only ordering is w.r.t the
index key in index blocks. 12
Clustering Indexing
It is created on data file whose records are physically
ordered on a non-key attribute which does not have
distinct value for each record.

13
Primary Index Example
Suppose that we have an ordered file of 30,000 records on a
disk with block size of 1024 bytes. Records are fixed and are
unspanned of size 100 bytes. Suppose we have created
primary index on the key filed of the size 9 bytes and a
block pointer of size 6 bytes, then find the average number
of block access to search a record with and without index.
Without Indexing:
Record / block = 1024/ 100 = 10.24
Since it is unspanned, data / block = 10
Data block required to hold 30,000 = 30,000 / 10 = 3,000
Block access to search a record = log2 3000 = 11.55 =
Approx. 12
14
Primary Index Example
With Indexing:
Index record size = 9 + 6 = 15 bytes
Record / block = 1024/ 15 = 68.266
Since it is unspanned, data / block = 68
Since it is primary index, no. of index record = No. of data
block = 3,000 (Due to block anchors)
Block access to search a record = 3000 / 68 = 44.11 =
Approx. 45
No. of block access required = log2 45 + 1 = 6 + 1 = 7

15
Secondary Index Example
Same Ques as above.

Without Indexing:
Record / block = 1024/ 100 = 10.24
Since it is unspanned, data / block = 10
Data block required to hold 30,000 = 30,000 / 10 = 3,000
Since the data records are unsorted, block access to search
a record = 3000

16
Secondary Index Example
With Indexing:
Index record size = 9 + 6 = 15 bytes
Record / block = 1024/ 15 = 68.266
Since it is unspanned, data / block = 68
No. of index records = 30,000
No. of blocks required = 30,000 / 68 = 441.176 = Approx 442
No. of block access required = log2 442 + 1 = 9 + 1 = 10

17
Thank You

Indexing
No ratings yet
Indexing
8 pages
Chapter 5. Record Storage and Primary File Organization
No ratings yet
Chapter 5. Record Storage and Primary File Organization
18 pages
Mod4 Chap10 - 11 Indexing
No ratings yet
Mod4 Chap10 - 11 Indexing
77 pages
FP-Lecture-6 01
No ratings yet
FP-Lecture-6 01
33 pages
DBMS Unit-5
No ratings yet
DBMS Unit-5
33 pages
CIT 401 Lecture Note
No ratings yet
CIT 401 Lecture Note
46 pages
Weekly Exercises 01
No ratings yet
Weekly Exercises 01
16 pages
9 Files, Indices and Database Tuning
No ratings yet
9 Files, Indices and Database Tuning
17 pages
DBMS_UNIT_5_NOTES
No ratings yet
DBMS_UNIT_5_NOTES
28 pages
Single Level Indexing
No ratings yet
Single Level Indexing
9 pages
index1 (5)
No ratings yet
index1 (5)
25 pages
Indexing - II
No ratings yet
Indexing - II
57 pages
Unit 6 notes DBMS final
No ratings yet
Unit 6 notes DBMS final
14 pages
Primary Indexing
No ratings yet
Primary Indexing
7 pages
File Organization and Indexing
No ratings yet
File Organization and Indexing
13 pages
Indexing
No ratings yet
Indexing
2 pages
Dbms Unit III Notes
No ratings yet
Dbms Unit III Notes
27 pages
Dbms Mod3
No ratings yet
Dbms Mod3
54 pages
CO3 Notes Indexing
No ratings yet
CO3 Notes Indexing
11 pages
Self Unit 2
No ratings yet
Self Unit 2
18 pages
Assignment (DS)
No ratings yet
Assignment (DS)
8 pages
Indexing_complete note
No ratings yet
Indexing_complete note
49 pages
Indexing Dbms
No ratings yet
Indexing Dbms
22 pages
10 File Organization in DBMS
No ratings yet
10 File Organization in DBMS
15 pages
DBMS - R2017 - Anna University
No ratings yet
DBMS - R2017 - Anna University
20 pages
dbms 5
No ratings yet
dbms 5
38 pages
Memoryhierarchy Indexing
No ratings yet
Memoryhierarchy Indexing
9 pages
File Organizations and Indexes
No ratings yet
File Organizations and Indexes
51 pages
Indexing
No ratings yet
Indexing
27 pages
Introduction To: Information Retrieval
No ratings yet
Introduction To: Information Retrieval
50 pages
CS3492 DBMS Unit-4
No ratings yet
CS3492 DBMS Unit-4
24 pages
Index Structures
No ratings yet
Index Structures
34 pages
Chap. 2 File Organization and Indexing: Abel J.P. Gomes
No ratings yet
Chap. 2 File Organization and Indexing: Abel J.P. Gomes
20 pages
History of File Structures
No ratings yet
History of File Structures
26 pages
heap file org GROUP 7
No ratings yet
heap file org GROUP 7
34 pages
Disk Storage, Basic File Structures, and Hashing
No ratings yet
Disk Storage, Basic File Structures, and Hashing
18 pages
Assignment 3
No ratings yet
Assignment 3
4 pages
Chapter - 8 1 97
No ratings yet
Chapter - 8 1 97
97 pages
Unit Iv
No ratings yet
Unit Iv
6 pages
DSA Unit6 Theory
No ratings yet
DSA Unit6 Theory
23 pages
index2 (1)
No ratings yet
index2 (1)
24 pages
Inls 623 - Database Systems Ii - File Structures, Indexing, and Hashing
No ratings yet
Inls 623 - Database Systems Ii - File Structures, Indexing, and Hashing
41 pages
Types of Indexes
No ratings yet
Types of Indexes
9 pages
S - UNIT VII Indexing in Database
No ratings yet
S - UNIT VII Indexing in Database
9 pages
8 DataStorageIndexingStructures Updated
No ratings yet
8 DataStorageIndexingStructures Updated
57 pages
File Organization and Indexing (1)
No ratings yet
File Organization and Indexing (1)
38 pages
Chapter-1(OS)
No ratings yet
Chapter-1(OS)
14 pages
Lecture 4-Indexconstruction
No ratings yet
Lecture 4-Indexconstruction
45 pages
FS M1 Part1
No ratings yet
FS M1 Part1
151 pages
File Management15
No ratings yet
File Management15
48 pages
Fundamental File Structure Concepts
No ratings yet
Fundamental File Structure Concepts
17 pages
Indexing Lecture Nov 2023 Summary
No ratings yet
Indexing Lecture Nov 2023 Summary
41 pages
DBMS-UNIT 4
No ratings yet
DBMS-UNIT 4
26 pages
IT3031-L06-Indexing
No ratings yet
IT3031-L06-Indexing
45 pages
Chapter 12: Indexing and Hashing
No ratings yet
Chapter 12: Indexing and Hashing
31 pages
File Storage and Indexing: Lesson 13 Cs 3200 Kathleen Durant PHD
No ratings yet
File Storage and Indexing: Lesson 13 Cs 3200 Kathleen Durant PHD
46 pages
Indexing
No ratings yet
Indexing
6 pages
Indexing Structures For Files: Database Design Database Design
No ratings yet
Indexing Structures For Files: Database Design Database Design
9 pages
Unit 5
No ratings yet
Unit 5
185 pages
Search Tree: Fundamentals and Applications
From Everand
Search Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
Speccy Report - HOUSE
No ratings yet
Speccy Report - HOUSE
121 pages
Chapter 4-Java Language Fundamentals
No ratings yet
Chapter 4-Java Language Fundamentals
30 pages
DigiJED Students
No ratings yet
DigiJED Students
7 pages
Report Finalev7
No ratings yet
Report Finalev7
65 pages
Hybrid Models in Chemicals: Leveraging Industrial AI To Overcome Operational Challenges
No ratings yet
Hybrid Models in Chemicals: Leveraging Industrial AI To Overcome Operational Challenges
11 pages
Websphere MQ Runs Inside The QMQM Subsystem So This Needs To Be Running Before Anything Else Can Be Done
No ratings yet
Websphere MQ Runs Inside The QMQM Subsystem So This Needs To Be Running Before Anything Else Can Be Done
16 pages
Vishnu Mini Project
No ratings yet
Vishnu Mini Project
46 pages
PDF
No ratings yet
PDF
269 pages
Intellitc: Automating Type Changes in Intellij Idea: Oleg Smirnov Ameya Ketkar Timofey Bryksin
No ratings yet
Intellitc: Automating Type Changes in Intellij Idea: Oleg Smirnov Ameya Ketkar Timofey Bryksin
5 pages
SQL Server Security (Logins, Users - Fixed Roles)
No ratings yet
SQL Server Security (Logins, Users - Fixed Roles)
3 pages
Unit-2 Linear Data Structure-Stack
No ratings yet
Unit-2 Linear Data Structure-Stack
40 pages
Assignment 1: Computer Science
No ratings yet
Assignment 1: Computer Science
7 pages
ASSIGNMENT-3
No ratings yet
ASSIGNMENT-3
11 pages
Tuya Smarthome Goods
No ratings yet
Tuya Smarthome Goods
58 pages
Unit 5 Dpco
No ratings yet
Unit 5 Dpco
20 pages
CCNA 1 (v5.1 + v6.0) Chapter 1 Exam Answers Quiz#1
No ratings yet
CCNA 1 (v5.1 + v6.0) Chapter 1 Exam Answers Quiz#1
2 pages
LC4 Instructions
No ratings yet
LC4 Instructions
1 page
GC Ac ST
No ratings yet
GC Ac ST
1 page
The Role of Nursing Informatics On Promoting Quality of Health Care and The Need For Appropriate Education
No ratings yet
The Role of Nursing Informatics On Promoting Quality of Health Care and The Need For Appropriate Education
7 pages
BGP Soft Reconfiguration
No ratings yet
BGP Soft Reconfiguration
13 pages
LU14 Instruction Fetch and Execution Steps 1665227118233
No ratings yet
LU14 Instruction Fetch and Execution Steps 1665227118233
14 pages
Lab 03 Scanning Networks
No ratings yet
Lab 03 Scanning Networks
116 pages
W1-Module 001 Introduction To Network Security
No ratings yet
W1-Module 001 Introduction To Network Security
9 pages
Industry Oriented Software and Hardware Training For Biomedical - 2015
No ratings yet
Industry Oriented Software and Hardware Training For Biomedical - 2015
1 page
SP20-BCS-017 - SP20-BCS-080 Scope
No ratings yet
SP20-BCS-017 - SP20-BCS-080 Scope
14 pages
COMP1406-W23-T01 Specification
No ratings yet
COMP1406-W23-T01 Specification
12 pages
SP 10 - Control of Documents and Data 2
No ratings yet
SP 10 - Control of Documents and Data 2
4 pages
36-Multilevel A-Diakoptics For The Dynamic Power-Flow Simulation of Hybrid Power Distribution Systems
No ratings yet
36-Multilevel A-Diakoptics For The Dynamic Power-Flow Simulation of Hybrid Power Distribution Systems
10 pages
OptiCon SBG-1000 User Manual (IP-PBX)
No ratings yet
OptiCon SBG-1000 User Manual (IP-PBX)
184 pages
Bangladesh Rural Electrification Board Exam-Previous Year Questions
No ratings yet
Bangladesh Rural Electrification Board Exam-Previous Year Questions
2 pages