Data Representation
• This chapter describes the various ways in which computers can store and manipulate
numbers and characters.
• Bit: The most basic unit of information in a digital computer is called a bit, which is a
contraction of binary digit.
• Byte: In 1964, the designers of the IBM System/360 mainframe computer established
a convention of using groups of 8 bits as the basic unit of addressable computer
storage. They called this collection of 8 bits a byte.
• Word: Computer words consist of two or more adjacent bytes that are sometimes
addressed and almost always are manipulated collectively. The word size represents
the data size that is handled most efficiently by a particular architecture. Words are
commonly 16, 32, or 64 bits.
• Nibbles: Eight-bit bytes can be divided into two 4-bit halves called nibbles.
• Radix (or Base): The general idea behind positional numbering systems is that a
numeric value is represented through increasing powers of a radix (or base).
104₁₀ = 10212₃ (repeated division by 3; read the remainders from bottom to top)
3 | 104   remainder 2
3 |  34   remainder 1
3 |  11   remainder 2
3 |   3   remainder 0
3 |   1   remainder 1
      0
147₁₀ = 10010011₂ (repeated division by 2; read the remainders from bottom to top)
2 | 147   remainder 1
2 |  73   remainder 1
2 |  36   remainder 0
2 |  18   remainder 0
2 |   9   remainder 1
2 |   4   remainder 0
2 |   2   remainder 0
2 |   1   remainder 1
      0
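The repeated-division procedure above is easy to automate. A minimal Python sketch
(the function name to_base is our own, not from the text):

```python
def to_base(n: int, base: int) -> str:
    """Convert a non-negative integer to the given base by repeated division."""
    if n == 0:
        return "0"
    digits = "0123456789ABCDEF"
    out = []
    while n > 0:
        n, r = divmod(n, base)      # quotient feeds the next step; remainder is a digit
        out.append(digits[r])
    return "".join(reversed(out))   # remainders come out low-order first

print(to_base(104, 3))   # 10212
print(to_base(147, 2))   # 10010011
```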
0.4304₁₀ = 0.2034₅
• EXAMPLE 2.7 Convert 0.34375₁₀ to binary with 4 bits to the right of the binary point.
Multiply the fractional part by 2 repeatedly; the integer part produced at each step is
the next binary digit:
0.34375 × 2 = 0.68750   → 0
0.68750 × 2 = 1.37500   → 1
0.37500 × 2 = 0.75000   → 0
0.75000 × 2 = 1.50000   → 1
Reading the integer parts from top to bottom, 0.34375₁₀ = 0.0101₂ to four binary
places. We simply discard (or truncate) our answer when the desired accuracy has been
achieved.
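The same repeated-multiplication procedure as a Python sketch (frac_to_base is our own
name; Fraction keeps the arithmetic exact so truncation is the only source of error):

```python
from fractions import Fraction

def frac_to_base(frac: Fraction, base: int, places: int) -> str:
    """Convert a fraction in [0, 1) to `places` digits in the given base
    by repeated multiplication, truncating at the desired precision."""
    digits = []
    for _ in range(places):
        frac *= base
        digit = int(frac)        # the integer part is the next digit
        digits.append(str(digit))
        frac -= digit            # keep only the fractional part
    return "0." + "".join(digits)

print(frac_to_base(Fraction(11, 32), 2, 4))        # 0.0101  (11/32 = 0.34375)
print(frac_to_base(Fraction(4304, 10000), 5, 4))   # 0.2034
```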
110 010 011 101₂ = 6235₈   (separate into groups of 3 bits for octal conversion)
1100 1001 1101₂ = C9D₁₆    (separate into groups of 4 bits for hexadecimal conversion)
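Python's built-in formatting performs the same regrouping, which makes for a quick
sanity check:

```python
bits = 0b110010011101
print(f"{bits:o}")   # 6235  (octal: 3-bit groups)
print(f"{bits:X}")   # C9D   (hexadecimal: 4-bit groups)
```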
• A signed-magnitude number has a sign as its left-most bit (also referred to as the
high-order bit or the most significant bit) while the remaining bits represent the
magnitude (or absolute value) of the numeric value.
• N bits can represent −(2^(N−1) − 1) to 2^(N−1) − 1 in signed magnitude (for example,
8 bits represent −127 to +127).
• EXAMPLE 2.10 Add 010011112 to 001000112 using signed-magnitude arithmetic.
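Both operands are positive, so we add the magnitudes and keep the common sign:
01001111₂ (+79) + 00100011₂ (+35) = 01110010₂ (+114).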
• Signed magnitude has two representations for zero, 10000000 and 00000000
(and mathematically speaking, this simply shouldn’t happen!).
• One’s Complement
o The one’s complement of a number is formed by flipping every bit. This sort of
bit-flipping is very simple to implement in computer hardware.
o EXAMPLE 2.16 Express 2310 and -910 in 8-bit binary one’s complement form.
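  23₁₀ = 00010111₂; for −9₁₀, start with 9₁₀ = 00001001₂ and flip every bit: 11110110₂.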
• The primary disadvantage of one’s complement is that we still have two
representations for zero: 00000000 and 11111111
o EXAMPLE 2.21 Find the sum of 2310 and -910 in binary using two’s complement
arithmetic.
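  23₁₀ = 00010111₂ and −9₁₀ = 11110111₂ in two’s complement. 00010111 + 11110111 =
  1 00001110; discarding the carry out of the leftmost bit leaves 00001110₂ = 14₁₀.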
o A Simple Rule for Detecting an Overflow Condition: If the carry into the sign bit
equals the carry out of the sign bit, no overflow has occurred. If the carry into the
sign bit is different from the carry out of the sign bit, overflow (and thus an error)
has occurred.
o EXAMPLE 2.22 Find the sum of 12610 and 810 in binary using two’s complement
arithmetic.
A one is carried into the leftmost bit, but a zero is carried out. Because these
carries are not equal, an overflow has occurred.
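In binary: 01111110 (126) + 00001000 (8) = 10000110, which reads as −122 in two’s
complement even though the true sum is 134.
The carry-in/carry-out rule is easy to check in code. A minimal Python sketch
(add_twos_complement is our own name) of n-bit two’s complement addition with overflow
detection:

```python
def add_twos_complement(a: int, b: int, n: int = 8):
    """Add two n-bit two's complement values; report the result and overflow."""
    mask = (1 << n) - 1
    raw = (a & mask) + (b & mask)
    result = raw & mask
    carry_out = raw >> n                                # carry out of the sign bit
    low_mask = (1 << (n - 1)) - 1
    carry_in = ((a & low_mask) + (b & low_mask)) >> (n - 1)   # carry into the sign bit
    overflow = carry_in != carry_out
    signed = result - (1 << n) if result >> (n - 1) else result
    return signed, overflow

print(add_twos_complement(23, -9))    # (14, False)
print(add_twos_complement(126, 8))    # (-122, True): carries differ, so overflow
```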
o When the divisor is much smaller than the dividend, we get a condition known as
divide underflow, which the computer sees as the equivalent of division by zero.
o Computers make a distinction between integer division and floating-point division.
With integer division, the answer comes in two parts: a quotient and a
remainder.
Floating-point division results in a number that is expressed as a binary
fraction.
Floating-point calculations are carried out in dedicated circuits called floating-
point units, or FPUs.
• If the 4-bit binary value 1101 is unsigned, then it represents the decimal value 13, but
as a signed two’s complement number, it represents -3.
• The C programming language has int and unsigned int as possible types for integer
variables.
• If we are using 4-bit unsigned binary numbers and we add 1 to 1111, we get 0000
(“return to zero”).
• If we add 1 to the largest positive 4-bit two’s complement number 0111 (+7), we get
1000 (-8).
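Both wraparound behaviors can be reproduced by masking to 4 bits; a quick Python
illustration (to_signed is our own helper):

```python
N = 4
MASK = (1 << N) - 1                    # keep only the low 4 bits

def to_signed(v: int) -> int:
    """Interpret a 4-bit pattern as a two's complement value."""
    return v - (1 << N) if v & (1 << (N - 1)) else v

print((0b1111 + 1) & MASK)             # 0: unsigned "return to zero"
print(to_signed((0b0111 + 1) & MASK))  # -8: +7 + 1 wraps to the most negative value
```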
• Consider the following standard pencil-and-paper method applied to the two’s
complement numbers −5 × −4:
     1011        (−5)
   × 1100        (−4)
   + 0000        (0 in multiplier means simple shift)
   + 0000        (0 in multiplier means simple shift)
   + 1011        (1 in multiplier means add multiplicand and shift)
 + 1011          (1 in multiplier means add multiplicand and shift)
 10000100        (reads as −124, but −5 × −4 = 20!)
The naive method treats the operands as unsigned, so it fails on signed values; this
is one motivation for Booth’s algorithm, below.
• Research into finding better arithmetic algorithms has continued apace for over 50
years. One of the many interesting products of this work is Booth’s algorithm.
• In most cases, Booth’s algorithm carries out multiplication faster and more accurately
than naïve pencil-and-paper methods.
• The general idea of Booth’s algorithm is to increase the speed of a multiplication
when there are consecutive zeros or ones in the multiplier.
• Consider the following standard multiplication example (3 × 6):
     0011        (3)
   × 0110        (6)
   + 0000        (0 in multiplier means simple shift)
   + 0011        (1 in multiplier means add multiplicand and shift)
   + 0011        (1 in multiplier means add multiplicand and shift)
 + 0000          (0 in multiplier means simple shift)
   0010010       (3 × 6 = 18)
Note: ignore any extended sign bits that go beyond 2n bit positions (here, 2 × 4 = 8).
• Booth’s algorithm not only allows multiplication to be performed faster in most cases,
but it also has the added bonus that it works correctly on signed numbers, as the
sketch below shows.
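A minimal Python sketch of radix-2 Booth multiplication (booth_multiply is our own
name): scan adjacent multiplier bit pairs; the start of a run of 1s subtracts the
shifted multiplicand, and the end of a run adds it, so runs of identical bits cost
nothing:

```python
def booth_multiply(multiplicand: int, multiplier: int, n: int = 4) -> int:
    """Multiply two n-bit two's complement integers using Booth's algorithm."""
    m_bits = multiplier & ((1 << n) - 1)   # raw n-bit pattern of the multiplier
    product = 0
    prev = 0                               # implicit 0 to the right of bit 0
    for i in range(n):
        cur = (m_bits >> i) & 1
        if cur == 1 and prev == 0:         # start of a run of 1s: subtract
            product -= multiplicand << i
        elif cur == 0 and prev == 1:       # end of a run of 1s: add
            product += multiplicand << i
        prev = cur
    return product

print(booth_multiply(-5, -4))   # 20: correct even for signed operands
print(booth_multiply(3, 6))     # 18
```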
• For unsigned numbers, a carry (out of the leftmost bit) indicates the total number of
bits was not large enough to hold the resulting value, and overflow has occurred.
• For signed numbers, if the carry into the sign bit and the carry out of the sign bit
differ, then overflow has occurred.
• EXAMPLE 2.28: Divide the value 12 (expressed using 8-bit signed two’s
complement representation) by 2.
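12 = 00001100₂; dividing by 2 amounts to a one-bit arithmetic right shift, giving
00000110₂ = 6.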
• In scientific notation, numbers are expressed in two parts: a fractional part called a
mantissa, and an exponential part that indicates the power of ten by which the
mantissa should be multiplied to obtain the value we need.
• Unbiased Exponent: 17₁₀ = 0.10001₂ × 2⁵ is stored with sign 0, exponent 00101 (= 5),
and significand 10001000:
0 00101 10001000
• A normalized form is used for storing a floating-point number in memory. A
normalized form is a floating-point representation where the leftmost bit of the
significand is always a 1.
Example: the internal representation of 10.25₁₀ in a 14-bit format (1 sign bit,
5-bit excess-16 exponent, 8-bit significand) is 0 10100 10100100, since
10.25₁₀ = 1010.01₂ = 0.101001₂ × 2⁴ and the exponent field is 4 + 16 = 20 = 10100₂.
Example: add 0 10010 11001000 and 0 10000 10011010 in the same format. Writing out
each value with its binary point placed by its exponent and adding:
   11.001000
+   0.10011010
   11.10111010
Renormalizing, we retain the larger exponent and truncate the low-order bits:
0 10010 11101110
Multiplying the same two operands (add the exponents, multiply the significands)
and renormalizing gives instead:
0 10001 11110000
• The IEEE-754 single-precision floating-point standard uses a bias of 127 over its
8-bit exponent. An exponent of 255 indicates a special value.
• The double-precision standard has a bias of 1023 over its 11-bit exponent. The
“special” exponent value for a double-precision number is 2047, instead of the 255
used by the single-precision standard.
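The bias is easy to observe by unpacking a float’s raw bits. A small Python sketch
using the standard struct module (float_fields is our own name):

```python
import struct

def float_fields(x: float):
    """Unpack the sign, exponent, and fraction fields of an IEEE-754 single."""
    bits = struct.unpack('>I', struct.pack('>f', x))[0]
    sign = bits >> 31
    exponent = (bits >> 23) & 0xFF       # stored exponent, biased by 127
    fraction = bits & 0x7FFFFF           # 23-bit fraction field
    return sign, exponent, fraction

# 10.25 = 1.01001_2 x 2^3, so the stored exponent is 3 + 127 = 130
print(float_fields(10.25))               # (0, 130, 2359296)
```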
• The range of a numeric integer format is the difference between the largest and
smallest values that it can express.
• The precision of a number indicates how much information we have about a value.
• Accuracy refers to how closely a numeric representation approximates a true value.
• Because of truncated bits, you cannot always assume that a particular floating-point
operation is associative or distributive.
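A two-line Python demonstration of lost associativity (the values are arbitrary):

```python
print((0.1 + 0.2) + 0.3)   # 0.6000000000000001
print(0.1 + (0.2 + 0.3))   # 0.6
```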
• Both EBCDIC and ASCII were built around the Latin alphabet.
• In 1991, a new international information exchange code called Unicode was introduced.
• Unicode is a 16-bit alphabet that is downward compatible with the ASCII and Latin-1
character sets.
• Because the base coding of Unicode is 16 bits, it has the capacity to encode the
majority of characters used in every language of the world.
• Unicode is currently the default character set of the Java programming language.
• The Unicode codespace is divided into six parts. The first part is for Western
alphabet codes, including English, Greek, and Russian.
• The lowest-numbered Unicode characters comprise the ASCII code.
• The highest-numbered sections provide for user-defined codes.
• Error-control codes such as CRC rely on modulo-2 arithmetic, where addition is
carried out without carries:
0 + 0 = 0
0 + 1 = 1
1 + 0 = 1
1 + 1 = 0
• EXAMPLE 2.36 Find the quotient and remainder when 1001011₂ is divided by 1011₂
using modulo-2 division.
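Using modulo-2 long division (XOR at each step, no borrows), the quotient is 1010₂
and the remainder is 101₂. A short Python sketch (mod2_divide is our own name):

```python
def mod2_divide(dividend: int, divisor: int):
    """Modulo-2 (XOR) long division on bit patterns, as used by CRC."""
    dlen = divisor.bit_length()
    quotient = 0
    rem = dividend
    # Walk the quotient bit positions from high to low.
    for shift in range(dividend.bit_length() - dlen, -1, -1):
        quotient <<= 1
        if (rem >> (shift + dlen - 1)) & 1:   # leading bit set: divisor "goes in"
            rem ^= divisor << shift           # subtraction is XOR in modulo-2
            quotient |= 1
    return quotient, rem

q, r = mod2_divide(0b1001011, 0b1011)
print(bin(q), bin(r))   # 0b1010 0b101
```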
• We are focused on single-bit errors. An error could occur in any of the n bits, so
each code word can be associated with n erroneous words at a Hamming distance of 1.
• Therefore, we have n + 1 bit patterns for each code word: one valid code word and n
erroneous words. With n-bit code words, we have 2^n possible bit patterns, of which
2^m are valid code words built from m data bits and r check bits (where n = m + r).
This gives us the inequality:
(n + 1) × 2^m ≤ 2^n
(m + r + 1) × 2^m ≤ 2^(m + r), or
(m + r + 1) ≤ 2^r
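For example, to protect m = 8 data bits we need r = 4 check bits, since
8 + 4 + 1 = 13 ≤ 2⁴ = 16 (whereas r = 3 fails: 8 + 3 + 1 = 12 > 2³ = 8).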
Let’s introduce an error in bit position b9, resulting in the code word:
0 1 0 1 1 1 0 1 0 1 1 0
12 11 10 9 8 7 6 5 4 3 2 1
We find that parity bits 1 and 8 flag an error, and 1 + 8 = 9, which is exactly
where the error occurred.
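The parity-group check can be sketched in a few lines of Python (hamming_syndrome is
our own name); each parity bit p covers every position whose binary index has bit p
set, and the failing groups sum to the error position:

```python
def hamming_syndrome(bits):
    """bits[i] is the bit at position i + 1 (positions numbered from 1).
    Returns the position of a single-bit error (0 if none), assuming even parity."""
    syndrome = 0
    p = 1
    while p <= len(bits):
        parity = 0
        for pos in range(1, len(bits) + 1):
            if pos & p:                 # position belongs to parity group p
                parity ^= bits[pos - 1]
        if parity:                      # even parity violated in this group
            syndrome += p
        p <<= 1
    return syndrome

# Code word from the example, listed from position 1 up to position 12:
word = [0, 1, 1, 0, 1, 0, 1, 1, 1, 0, 1, 0]
print(hamming_syndrome(word))           # 9: the error is in bit position 9
```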
• If we expect errors to occur in blocks, it stands to reason that we should use an error-
correcting code that operates at a block level, as opposed to a Hamming code, which
operates at the bit level.
• A Reed-Solomon (RS) code can be thought of as a CRC that operates over entire
characters instead of only a few bits.
• RS codes, like CRCs, are systematic: the parity bytes are appended to a block of
information bytes.
• RS(n, k) codes are defined using the following parameters:
o s = The number of bits in a character (or “symbol”).
o k = The number of s-bit characters comprising the data block.
o n = The number of bits in the code word.
• RS (n, k) can correct (n-k)/2 errors in the k information bytes.
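For example, the widely used RS(255, 223) code with 8-bit symbols can correct up to
(255 − 223) / 2 = 16 symbol errors per block.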
• Reed-Solomon error-correction algorithms lend themselves well to implementation in
computer hardware.
• They are implemented in high-performance disk drives for mainframe computers as
well as compact disks used for music and data storage. These implementations will be
described in Chapter 7.
• Computers store data in the form of bits, bytes, and words using the binary
numbering system.
• Hexadecimal numbers are formed using four-bit groups called nibbles (or nybbles).
• Signed integers can be stored in one’s complement, two’s complement, or signed
magnitude representation.
• Floating-point numbers are usually coded using the IEEE 754 floating-point
standard.
• Character data is stored using ASCII, EBCDIC, or Unicode.
• Error detecting and correcting codes are necessary because we can expect no
transmission or storage medium to be perfect.
• CRC, Reed-Solomon, and Hamming codes are three important error-control codes.