Distributed Data Systems: BITS Pilani
Distributed Data Systems: BITS Pilani
Distributed Data Systems: BITS Pilani
BITS Pilani
Pilani|Dubai|Goa|Hyderabad
Anil Kumar Ghadiyaram
Text Books
Note: In order to broaden understanding of concepts as applied to Indian IT industry, students are
advised to refer books of their choice and case-studies in their own organizations
SSZG554 - Distributed Data Systems 5th Aug 2018 2 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
BITS Pilani
Pilani|Dubai|Goa|Hyderabad
Source Courtesy: Some of the contents of this PPT are sourced from materials provided by Publishers of T1 & T2
PRESENTATION OVERVIEW
SSZG554 - Distributed Data Systems 5th Aug 2018 4 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
RAID 4 AND RAID 5
SSZG554 - Distributed Data Systems 5th Aug 2018 5 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
RAID 4 AND RAID 5
• RAID controller calculates a parity block for every four blocks and writes this onto
the fifth physical hard disk.
• For example, the RAID controller calculates the parity block PABCD for the blocks A,
B, C and D.
• If one of the four data disks fails, the RAID controller can reconstruct the data of the
defective disks using the three other data disks and the parity disk.
SSZG554 - Distributed Data Systems 5th Aug 2018 6 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
RAID 4 AND RAID 5
• From a mathematical point of view the parity block is calculated with the aid of the
logical XOR operator (Exclusive OR).
• For example, the equation PABCD = A XOR B XOR C XOR D applies.
• The space saving offered by RAID 4 and RAID 5, comes at a price in relation to RAID
10.
• Changing a data block changes the value of the associated parity block.
SSZG554 - Distributed Data Systems 5th Aug 2018 7 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
RAID 4 AND RAID 5
This means that each write operation to the virtual hard disk requires
(1) the physical writing of the data block,
(2) the recalculation of the parity block and
(3) the physical writing of the newly calculated parity block.
This extra cost for writeoperations in RAID 4 and RAID 5 is called the write penalty of
RAID 4 or the write penalty of RAID 5
SSZG554 - Distributed Data Systems 5th Aug 2018 8 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
RAID 4 AND RAID 5
• RAID 4 (parity disk) is designed to reduce the storage requirement of RAID 0+1 and
RAID 10.
• In the example, the data blocks are distributed over four physical hard disks by
means of RAID 0 (striping).
• Instead of mirroring all data once again, only a parity block is stored for each four
blocks.
SSZG554 - Distributed Data Systems 5th Aug 2018 9 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
SSZG554 - Distributed Data Systems 5th Aug 2018 10
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
RAID 5 (striped parity):
In RAID 4 each write
access by the server is
associated
with a write operation to
the parity disk for the
update of parity
information.
RAID 5 distributes
the load of the parity disk
over all physical hard
SSZG554 - Distributed Data Systems 5th Aug 2018 disks. 11
BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
RAID 6: DOUBLE PARITY
• RAID 6 offers a compromise between RAID 5 and RAID 10 by adding a second parity
hard disk to extend RAID 5, which then uses less storage capacity than RAID 10.
• There are different approaches available today for calculating the two parity blocks
of a parity group.
• However, none of these procedures has been adopted yet as an industry standard.
• Irrespective of an exact procedure, RAID 6 has a poor write performance because
the write penalty for RAID 5 strikes twice
SSZG554 - Distributed Data Systems 5th Aug 2018 12 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
RAID 2
SSZG554 - Distributed Data Systems 5th Aug 2018 13 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
RAID 3
SSZG554 - Distributed Data Systems 5th Aug 2018 14 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
COMPARISON OF THE RAID LEVELS
SSZG554 - Distributed Data Systems 5th Aug 2018 15 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
BASIC FORMS OF STORAGE
SSZG554 - Distributed Data Systems 5th Aug 2018 16 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
COMPARISON
SSZG554 - Distributed Data Systems 5th Aug 2018 17 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
COMPARISON
SSZG554 - Distributed Data Systems 5th Aug 2018 18 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956
THANK YOU
SSZG554 - Distributed Data Systems 5th Aug 2018 19 BITS Pilani, Deemed to be University under Section 3 of UGC Act, 1956