
Example Problems:

1. Here are a few example problems related to blocking factors, along with solutions to help you
understand the concept better:

### Problem 1: Basic Blocking Factor Calculation

*Problem:*

You have a file where each record is 200 bytes long. The storage block size is 1 KB (1024 bytes).
What is the blocking factor for this file?

*Solution:*

The blocking factor is calculated by dividing the block size by the record size.

\[
\text{Blocking Factor} = \frac{\text{Block Size}}{\text{Record Size}} = \frac{1024 \text{ bytes}}{200 \text{ bytes}} = 5.12
\]

Since only whole records can be stored in a block, the blocking factor is rounded down to 5. This
means that 5 records can fit into one block.
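
The same calculation can be expressed as a short sketch (in Python; the function name and values below are illustrative, not part of the problem statement):

```python
# Blocking factor = floor(block size / record size), since only whole records fit in a block.
def blocking_factor(block_size_bytes: int, record_size_bytes: int) -> int:
    return block_size_bytes // record_size_bytes

print(blocking_factor(1024, 200))  # -> 5 records per block
```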

---

### Problem 2: Identifying Records in a Specific Block

*Problem:*

Given the blocking factor calculated as 5 in Problem 1, determine which records would be stored
in Block 3.

*Solution:*

To find the records in Block 3:

1. The first block (Block 1) would contain records 1 to 5.

2. The second block (Block 2) would contain records 6 to 10.


3. The third block (Block 3) would therefore contain records 11 to 15.

So, records 11 through 15 would be stored in Block 3.
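
A small sketch of the same mapping, assuming records and blocks are numbered from 1 and the blocking factor is fixed (the helper name is illustrative):

```python
# Block k (1-based) holds records (k - 1) * bfr + 1 through k * bfr.
def records_in_block(block_number: int, bfr: int) -> range:
    first = (block_number - 1) * bfr + 1
    return range(first, first + bfr)

print(list(records_in_block(3, 5)))  # -> [11, 12, 13, 14, 15]
```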

---

### Problem 3: Adjusting Blocking Factor with Different Block Sizes

*Problem:*

If the block size is increased to 2 KB (2048 bytes) while keeping the record size the same at 200
bytes, what would be the new blocking factor?

*Solution:*

The new blocking factor is calculated similarly:

\[
\text{Blocking Factor} = \frac{\text{Block Size}}{\text{Record Size}} = \frac{2048 \text{ bytes}}{200 \text{ bytes}} = 10.24
\]

Rounding down, the new blocking factor is 10. This means that with a block size of 2 KB, each
block can store 10 records.

---

### Problem 4: Effect of Variable Record Sizes

*Problem:*

Assume you have a file with variable record sizes, ranging between 150 bytes and 250 bytes. If the
block size is 1.5 KB (1536 bytes), how would you calculate an approximate blocking factor?

*Solution:*

For variable record sizes, an average record size can be used to calculate an approximate blocking
factor.

1. First, calculate the average record size:

\[

\text{Average Record Size} = \frac{150 + 250}{2} = 200 \text{ bytes}

\]

2. Then, calculate the blocking factor using the average record size:

\[
\text{Blocking Factor} = \frac{\text{Block Size}}{\text{Average Record Size}} = \frac{1536 \text{ bytes}}{200 \text{ bytes}} = 7.68
\]

Rounding down, the approximate blocking factor would be 7.
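
Because the average only approximates real data, a quick simulation can show how close the estimate is in practice. The sketch below uses randomly generated record sizes purely for illustration; the data and the greedy packing strategy are assumptions, not part of the problem:

```python
import random

block_size = 1536
random.seed(0)
# Simulated variable-length records between 150 and 250 bytes (illustrative data only).
records = [random.randint(150, 250) for _ in range(1000)]

avg_size = sum(records) / len(records)
estimate = int(block_size // avg_size)   # average-based estimate, about 7 records per block

# Greedy packing: count how many whole records actually fit in each block.
per_block, used, count = [], 0, 0
for size in records:
    if used + size > block_size:
        per_block.append(count)
        used, count = 0, 0
    used += size
    count += 1
per_block.append(count)

print(estimate, round(sum(per_block) / len(per_block), 2))  # estimate vs. observed average
```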

---

### Problem 5: Impact of Changing Record Sizes on Storage Efficiency

*Problem:*

Suppose you decrease the record size from 200 bytes to 180 bytes, keeping the block size constant
at 1 KB (1024 bytes). What is the impact on the blocking factor, and how does this affect storage
efficiency?

*Solution:*

First, calculate the new blocking factor:

\[
\text{Blocking Factor} = \frac{\text{Block Size}}{\text{Record Size}} = \frac{1024 \text{ bytes}}{180 \text{ bytes}} \approx 5.69
\]

Rounding down, the new blocking factor is 5.


*Impact on Storage Efficiency:*

- *Before (with 200-byte records):* 5 records per block with 24 bytes left unused (1024 - (5 * 200)).

- *After (with 180-byte records):* 5 records per block with 124 bytes left unused (1024 - (5 * 180)).

Reducing the record size lowers the storage efficiency in this case: the blocking factor stays at 5,
while the unused space per block grows from 24 bytes to 124 bytes.
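
A short sketch of the efficiency comparison, assuming the same fixed-size records and 1 KB blocks (the helper name is illustrative):

```python
# Returns (blocking factor, unused bytes per block, fraction of the block used).
def block_utilization(block_size: int, record_size: int):
    bfr = block_size // record_size
    unused = block_size - bfr * record_size
    return bfr, unused, bfr * record_size / block_size

print(block_utilization(1024, 200))  # -> (5, 24, 0.9765625)   ~97.7% of each block used
print(block_utilization(1024, 180))  # -> (5, 124, 0.87890625) ~87.9% of each block used
```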

2. Assume an employee file with 60,000 records, each of size 200 bytes, ordered by its key field
employee_number (of length 18 bits). The employee file is stored in hard disk blocks of
size 2048 bytes. [5][CO01][BTL4]

(a) Calculate the average number of block accesses required to locate any employee record
directly in the file, for the given employee_number.

(b) If an index file is created for this employee file, how much improvement will be obtained in the
search for the same record (assume length of pointer information is 12 bits)?

Solution :

Let's break down the problem and calculate the required values step by step:

### Part (a): Average Number of Block Accesses to Locate a Record

1. *File Characteristics*:

- Number of records: \( n = 60,000 \)

- Size of each record: 200 bytes

- Block size: 2048 bytes

- Key field size (employee number): 18 bits (2.25 bytes)

2. *Number of records per block*:

\[
\text{Records per block} = \frac{\text{Block size}}{\text{Record size}} = \frac{2048 \text{ bytes}}{200 \text{ bytes}} = 10.24 \approx 10 \text{ records/block}
\]

Since only full records can be stored, each block will store 10 records.

3. *Total number of blocks required*:

\[
\text{Total blocks} = \frac{\text{Total records}}{\text{Records per block}} = \frac{60,000}{10} = 6,000 \text{ blocks}
\]

4. *Average number of block accesses*:

If the record is located by a linear scan of the blocks and any record is equally likely to be
requested, then on average half of the blocks must be accessed:

\[

\text{Average block accesses} = \frac{\text{Total blocks}}{2} = \frac{6,000}{2} = 3,000

\]

### Part (b): Improvement Using an Index File

1. *Index File Characteristics*:

- Pointer size: 12 bits (1.5 bytes)

- Key size: 18 bits (2.25 bytes)

- Total index entry size: \( 2.25 \text{ bytes} + 1.5 \text{ bytes} = 3.75 \text{ bytes} \)

2. *Number of index entries per block*:

\[
\text{Index entries per block} = \frac{\text{Block size}}{\text{Index entry size}} = \frac{2048 \text{ bytes}}{3.75 \text{ bytes}} \approx 546 \text{ entries/block}
\]

3. *Total number of index blocks*:

\[

\text{Total index entries} = \text{Total blocks} = 6,000 \text{ entries}

\]

\[
\text{Total index blocks} = \frac{6,000 \text{ entries}}{546 \text{ entries/block}} \approx 11 \text{ blocks}
\]

4. *Average number of block accesses using index*:

For a two-level index (the 11 first-level blocks are themselves indexed by a single top-level block), the search involves accessing:

- One block of the top-level index.

- One block of the first-level index.

- One block of the actual data file.

Therefore, the total number of block accesses is:

\[

\text{Total block accesses with index} = 1 + 1 + 1 = 3 \text{ block accesses}

\]

### Improvement Calculation:

The improvement in block accesses is the ratio of block accesses without the index to block
accesses with the index:

\[
\text{Improvement} = \frac{\text{Block accesses without index}}{\text{Block accesses with index}} = \frac{3,000}{3} = 1,000 \text{ times}
\]

Thus, using an index file provides a significant improvement, reducing the average number of
block accesses by a factor of 1,000.
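
The whole calculation for parts (a) and (b) can be checked with a short script. This is a sketch under the assumptions made above (linear scan without the index, and a search path costed at 3 block accesses with the two-level index); the variable names are illustrative:

```python
import math

block_size = 2048            # bytes
record_size = 200            # bytes
num_records = 60_000
key_size = 18 / 8            # 18 bits = 2.25 bytes
pointer_size = 12 / 8        # 12 bits = 1.5 bytes

# Part (a): linear scan of the data blocks, half of them on average.
records_per_block = block_size // record_size                 # 10
data_blocks = math.ceil(num_records / records_per_block)      # 6,000
avg_accesses_no_index = data_blocks / 2                       # 3,000

# Part (b): index with one entry per data block.
entry_size = key_size + pointer_size                          # 3.75 bytes
entries_per_block = int(block_size // entry_size)             # 546
index_blocks = math.ceil(data_blocks / entries_per_block)     # 11
accesses_with_index = 3                                       # two index levels + 1 data block

print(avg_accesses_no_index / accesses_with_index)            # -> 1000.0
```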

3. Assume a folding hash function defined as follows: “For the given key ‘X’, the hash value H(X) =
(a + b + c) mod M, where a, b and c are the three parts of given key ‘X’”.

Analyze the above hash function and infer how it folds the key 123456789 into a hash table
of ten spaces (0 to 9).

Solution:

To analyze the given folding hash function and apply it to the key 123456789, let's break it down
into steps:

### Step 1: Divide the Key into Parts

The key 123456789 needs to be divided into three parts. The key length is 9 digits, so we can
divide it evenly into three parts:

- Part 1 (a) = 123

- Part 2 (b) = 456

- Part 3 (c) = 789

### Step 2: Calculate the Hash Value

The hash value is computed as the sum of the three parts modulo the size of the hash table (which
is 10 in this case).

So, the formula is:

\[

H(X) = (a + b + c) \mod M

\]

Substituting the values:

\[

H(123456789) = (123 + 456 + 789) \mod 10

\]

### Step 3: Perform the Calculation

First, compute the sum of the parts:

\[

123 + 456 + 789 = 1368

\]

Next, find the modulo with the size of the hash table (10):
\[

1368 \mod 10 = 8

\]

### Conclusion

The hash value H(123456789) for the given key, when folded and mapped into a hash table of 10
spaces, is 8. This means the key 123456789 will be placed in position 8 of the hash table.
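
A minimal sketch of the folding hash described above, assuming the key is split into three equal-length groups of digits (the function name and the equal three-way split are assumptions for this example):

```python
# Folding hash: split the key's digits into three parts, sum them, take modulo M.
def folding_hash(key: int, m: int = 10) -> int:
    digits = str(key)
    part_len = len(digits) // 3
    a = int(digits[:part_len])
    b = int(digits[part_len:2 * part_len])
    c = int(digits[2 * part_len:])
    return (a + b + c) % m

print(folding_hash(123456789))  # -> (123 + 456 + 789) % 10 = 1368 % 10 = 8
```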

Indexing

4. Example 1. Suppose that we have an ordered file with r = 30,000 records stored on a disk with block size B = 1024 bytes. File records are of fixed size and are unspanned, with record length R = 100 bytes. The blocking factor for the file would be bfr = ⎣(B/R)⎦ = ⎣(1024/100)⎦ = 10 records per block. The number of blocks needed for the file is b = ⎡(r/bfr)⎤ = ⎡(30000/10)⎤ = 3000 blocks. A binary search on the data file would need approximately ⎡(log2 b)⎤ = ⎡(log2 3000)⎤ = 12 block accesses.

Now suppose that the ordering key field of the file is V = 9 bytes long, a block pointer is P = 6 bytes long, and we have constructed a primary index for the file. The size of each index entry is Ri = (9 + 6) = 15 bytes, so the blocking factor for the index is bfri = ⎣(B/Ri)⎦ = ⎣(1024/15)⎦ = 68 entries per block. The total number of index entries ri is equal to the number of blocks in the data file, which is 3000. The number of index blocks is hence bi = ⎡(ri/bfri)⎤ = ⎡(3000/68)⎤ = 45 blocks. A binary search on the index file would need ⎡(log2 bi)⎤ = ⎡(log2 45)⎤ = 6 block accesses. To search for a record using the index, we need one additional block access to the data file, for a total of 6 + 1 = 7 block accesses, an improvement over binary search on the data file, which required 12 disk block accesses.
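
The numbers in Example 1 can be reproduced with a short script; this is a sketch using the same symbols as the text (B, R, r, V, P), with the floor and ceiling written as integer division and math.ceil:

```python
import math

B, R, r = 1024, 100, 30_000         # block size, record size, number of records
V, P = 9, 6                         # ordering key size and block pointer size (bytes)

bfr = B // R                                     # 10 records per block
b = math.ceil(r / bfr)                           # 3,000 data blocks
binary_search_data = math.ceil(math.log2(b))     # 12 block accesses on the data file

Ri = V + P                                       # 15-byte index entry
bfri = B // Ri                                   # 68 entries per block
ri = b                                           # primary index: one entry per data block
bi = math.ceil(ri / bfri)                        # 45 index blocks
search_via_index = math.ceil(math.log2(bi)) + 1  # 6 + 1 = 7 block accesses

print(binary_search_data, search_via_index)      # -> 12 7
```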

A major problem with a primary index—as with any ordered file—is insertion and deletion of
records. With a primary index, the problem is compounded because if we attempt to insert a
record in its correct position in the data file, we must not only move records to make space for the
new record but also change some index entries, since moving records will change the anchor
records of some blocks. Using an unordered overflow file, as discussed in Section 17.7, can reduce
this problem. Another possibility is to use a linked list of overflow records for each block in the
data file. This is similar to the method of dealing with overflow records described with hashing in
Section 17.8.2. Records within each block and its overflow linked list can be sorted to improve
retrieval time. Record deletion is handled using deletion markers.

5. Example 2. Consider the file of Example 1 with r = 30,000 fixed-length records of size R = 100
bytes stored on a disk with block size B = 1024 bytes. The file has b = 3000 blocks, as calculated in
Example 1. Suppose we want to search for a record with a specific value for the secondary key—a
nonordering key field of the file that is V = 9 bytes long. Without the secondary index, to do a
linear search on the file would require b/2 = 3000/2 = 1500 block accesses on the average.
Suppose that we construct a secondary index on that nonordering key field of the file. As in Example 1, a block pointer is P = 6 bytes long, so each index entry is Ri = (9 + 6) = 15 bytes, and the blocking factor for the index is bfri = ⎣(B/Ri)⎦ = ⎣(1024/15)⎦ = 68 entries per block. In a dense secondary index such as this, the total number of index entries ri is equal to the number of records in the data file, which is 30,000. The number of blocks needed for the index is hence bi = ⎡(ri/bfri)⎤ = ⎡(30000/68)⎤ = 442 blocks.

A binary search on this secondary index needs ⎡(log2bi)⎤ = ⎡(log2442)⎤ = 9 block accesses. To
search for a record using the index, we need an additional block access to the data file for a total
of 9 + 1 = 10 block accesses—a vast improvement over the 1500 block accesses needed on the
average for a linear search, but slightly worse than the 7 block accesses required for the primary
index. This difference arises because the primary index is nondense and hence much smaller, only
45 blocks long.
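
The same style of check for the dense secondary index of Example 2 (a sketch with the same symbols as the text; the names are illustrative):

```python
import math

B, R, r = 1024, 100, 30_000
V, P = 9, 6

linear_search = math.ceil(r / (B // R)) / 2          # 3,000 / 2 = 1,500 block accesses

Ri = V + P                                           # 15-byte index entry
bfri = B // Ri                                       # 68 entries per block
ri = r                                               # dense index: one entry per record
bi = math.ceil(ri / bfri)                            # 442 index blocks
search_via_index = math.ceil(math.log2(bi)) + 1      # 9 + 1 = 10 block accesses

print(linear_search, bi, search_via_index)           # -> 1500.0 442 10
```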

We can also create a secondary index on a nonkey, nonordering field of a file. In this case,
numerous records in the data file can have the same value for the indexing field. There are several
options for implementing such an index:

■ Option 1 is to include duplicate index entries with the same K(i) value—one for each record. This
would be a dense index.

■ Option 2 is to have variable-length records for the index entries, with a repeating field for the
pointer. We keep a list of pointers in the index entry for K(i)—one pointer to each block that
contains a record whose indexing field value equals K(i). In either option 1 or option 2, the binary
search algorithm on the index must be modified appropriately to account for a variable number of
index entries per index key value.

■ Option 3, which is more commonly used, is to keep the index entries themselves at a fixed
length and have a single entry for each index field value, but to create an extra level of indirection
to handle the multiple pointers. In this nondense scheme, the pointer P(i) in the index entry points to
a disk block, which contains a set of record pointers; each record pointer in that disk block points
to one of the data file records with value K(i) for the indexing field. If some value K(i) occurs in too
many records, so that their record pointers cannot fit in a single disk block, a cluster or linked list
of blocks is used. This technique is illustrated in Figure 18.5. Retrieval via the index requires one or
more additional block accesses because of the extra level, but the algorithms for searching the
index and (more importantly) for inserting new records in the data file are straightforward. In
addition, retrievals on complex selection conditions may be handled by referring to the record
pointers, without having to retrieve many unnecessary records from the data file (see Exercise 18.23).

6. Example 3. Suppose that the dense secondary index of Example 2 is converted into a multilevel index. We calculated the index blocking factor bfri = 68 index entries per block, which is also the fan-out fo for the multilevel index; the number of first-level blocks b1 = 442 blocks was also calculated. The number of second-level blocks will be b2 = ⎡(b1/fo)⎤ = ⎡(442/68)⎤ = 7 blocks, and the number of third-level blocks will be b3 = ⎡(b2/fo)⎤ = ⎡(7/68)⎤ = 1 block. Hence, the third level is the top level of the index, and t = 3. To access a record by searching the multilevel index, we must access one block at each level plus one block from the data file, so we need t + 1 = 3 + 1 = 4 block accesses.

Compare this to Example 2, where 10 block accesses were needed when a single-level index and
binary search were used. Notice that we could also have a multilevel primary index, which would
be nondense. Exercise 18.18(c) illustrates this case, where we must access the data block from the
file before we can determine whether the record being searched for is in the file. For a dense
index, this can be determined by accessing the first index level (without having to access a data
block), since there is an index entry for every record in the file.
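
A short sketch of the multilevel calculation in Example 3, which keeps adding index levels until the top level fits in one block (the loop is an illustration of the repeated ceiling division in the text):

```python
import math

B, V, P = 1024, 9, 6
fo = B // (V + P)              # blocking factor = fan-out fo = 68
r = 30_000                     # dense secondary index: one entry per record

levels = [math.ceil(r / fo)]   # first level: 442 blocks
while levels[-1] > 1:          # add levels until the top level is a single block
    levels.append(math.ceil(levels[-1] / fo))

t = len(levels)
print(levels, t + 1)           # -> [442, 7, 1]  and 4 block accesses (t levels + 1 data block)
```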
