Ec 1009 - Digital Image Processing
1. Define Image?
An image may be defined as a two-dimensional function f(x,y), where x and y are
spatial (plane) coordinates, and the amplitude of f at any pair of coordinates (x,y) is
called the intensity or gray level of the image at that point. When x, y, and the amplitude
values of f are all finite, discrete quantities, we call the image a digital image.
3. Define Quantization?
The range of values spanned by the gray scale is called the dynamic range of an
image. An image will have high contrast if the dynamic range is high, and a
dull, washed-out gray look if the dynamic range is low.
The spatial interaction of luminance from an object and its surround creates a
phenomenon called the Mach band effect.
6. Define Brightness?
If gray levels in a certain range occur frequently while others occur rarely, the
quantization levels are finely spaced in this range and coarsely spaced outside of it.
This method is sometimes called tapered quantization.
Gray level refers to a scalar measure of intensity that ranges from black to grays
and finally to white.
1. RGB model
2. CMY model
3. YIQ model
4. HSI model
Hue is a color attribute that describes a pure color, whereas saturation gives a
measure of the degree to which a pure color is diluted by white light.
1. RGB model - used for color monitors and color video cameras
2. CMY model - used for color printing
3. HSI model - used for color image processing
4. YIQ model - used for color picture transmission
The hue of a perceived color depends on the adaptation of the viewer. For
example, the American flag will not immediately appear red, white, and blue if the
viewer has been subjected to high-intensity red light before viewing the flag. The color
of the flag will appear to shift in hue toward red's complement, cyan.
16. Write the expression to find the number of bits to store a digital image?
Zooming may be viewed as oversampling. It involves the creation of new pixel
locations and the assignment of gray levels to those new locations.
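The two steps of zooming (creating new pixel locations, then assigning gray levels to them) can be sketched with pixel replication. This is only one simple assignment rule and is an assumption here, since the text does not name a specific method; the function name `zoom_replicate` is hypothetical.

```python
def zoom_replicate(image, factor):
    """Zoom a gray-level image (list of rows) by an integer factor.

    Each new pixel location takes the gray level of the original pixel
    it was created from (pixel replication, an assumed method).
    """
    return [
        [pixel for pixel in row for _ in range(factor)]  # repeat along columns
        for row in image
        for _ in range(factor)                           # repeat along rows
    ]


small = [[1, 2],
         [3, 4]]
zoomed = zoom_replicate(small, 2)
```

Smoother results would need an interpolating rule such as bilinear interpolation, which is not shown here.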
Radiance is the total amount of energy that flows from the light source, and it is
usually measured in watts (W).
An image can be expanded in terms of a discrete set of basis arrays called basis
images. These basis images can be generated by unitary matrices. Alternatively, a
given N x N image can be viewed as an N^2 x 1 vector. An image transform provides a
set of coordinates or basis vectors for the vector space.
1) The determinant and the eigenvalues of a unitary matrix have unity magnitude.
2) The entropy of a random vector is preserved under a unitary transformation.
3) Since entropy is a measure of average information, this means information
is preserved under a unitary transformation.
Forward transform
X(n) = (1/N) Σ_{k=0}^{N-1} x(k) exp(-j 2π nk/N) ; n = 0, 1, 2, …, N-1
Reverse transform
x(k) = Σ_{n=0}^{N-1} X(n) exp(j 2π nk/N) ; k = 0, 1, 2, …, N-1
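A minimal sketch of this transform pair, assuming the convention written above (the 1/N factor and the negative exponent on the forward side, the positive exponent and no 1/N factor on the reverse side). The function names `dft` and `idft` are illustrative only.

```python
import cmath


def dft(x):
    """Forward transform: X(n) = (1/N) * sum_k x(k) * exp(-j*2*pi*n*k/N)."""
    N = len(x)
    return [
        sum(x[k] * cmath.exp(-2j * cmath.pi * n * k / N) for k in range(N)) / N
        for n in range(N)
    ]


def idft(X):
    """Reverse transform: x(k) = sum_n X(n) * exp(+j*2*pi*n*k/N)."""
    N = len(X)
    return [
        sum(X[n] * cmath.exp(2j * cmath.pi * n * k / N) for n in range(N))
        for k in range(N)
    ]


signal = [1.0, 2.0, 3.0, 4.0]
spectrum = dft(signal)      # spectrum[0] is the average value of the signal
recovered = idft(spectrum)  # matches the signal up to rounding error
```

With this scaling the zero-frequency term equals the average of the samples. Other texts place the 1/N factor on the reverse side instead.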
1. Periodicity
W_N^(k+N) = W_N^k
2. Symmetry
W_N^(k+N/2) = -W_N^k
1. Symmetric
2. Periodic extensions
3. Sampled Fourier transform
4. Conjugate symmetry.
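The N x N cosine transform matrix C = {c(k,n)}, called the discrete cosine transform (DCT), is defined as
c(k,n) = 1/√N, for k = 0, 0 ≤ n ≤ N-1
c(k,n) = √(2/N) cos[π(2n+1)k / 2N], for 1 ≤ k ≤ N-1, 0 ≤ n ≤ N-1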
The N x N sine transform matrix Ψ = {Ψ(k,n)}, also called the discrete sine transform, is
defined as
Ψ(k,n) = √(2/(N+1)) sin[π(k+1)(n+1) / (N+1)], for 0 ≤ k, n ≤ N-1
The Haar functions are defined on the continuous interval x ∈ [0,1] for k = 0, 1, …, N-1,
where N = 2^n. The integer k can be uniquely decomposed as k = 2^p + q - 1.
H2 = [ 1  1 ]
     [ 1 -1 ]
36. What are the properties of Haar transform.
The KL transform is optimal in the sense that it minimizes the mean square error
between the vectors X and their approximations X^. Because it uses the eigenvectors
corresponding to the largest eigenvalues, it is also known as the principal component
transform.
The mean square error between the reconstructed image and the original image is minimum, and
the transformed image has zero mean and uncorrelated elements.
∇f = [∂f/∂x, ∂f/∂y]^T
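[Figure: block diagram of the degradation model. The input f(x,y) passes through H, and the noise η(x,y) is added to give g(x,y).]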
A system operator H, together with an additive white noise term η(x,y),
operates on an input image f(x,y) to produce a degraded image g(x,y).
57. What is homogeneity property and what is the significance of this property?
H [k1f1(x,y)] = k1H[f1(x,y)]
where H = operator,
k1 = constant,
f1(x,y) = input image.
It says that the response to a constant multiple of any input is equal to the response to
that input multiplied by the same constant.
65. When does the degradation model satisfy position invariant property?
An operator having the input-output relationship g(x,y) = H[f(x,y)] is said to be position
invariant if H[f(x-α,y-β)] = g(x-α,y-β) for any f(x,y) and any α and β.
This definition indicates that the response at any point in the image depends only on the
value of the input at that point, not on its position.
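The homogeneity and position-invariance properties can be checked numerically on a toy operator. The sketch below assumes a 1-D signal and a 3-point circular moving average standing in for H; both are illustrative choices and not part of the degradation model in the text, which is 2-D.

```python
def blur(f):
    """Toy operator H: 3-point circular moving average (assumed example)."""
    n = len(f)
    return [(f[(i - 1) % n] + f[i] + f[(i + 1) % n]) / 3 for i in range(n)]


def shift(f, a):
    """Circularly shift f by a samples, i.e. return f(x - a)."""
    n = len(f)
    return [f[(i - a) % n] for i in range(n)]


def close(u, v, tol=1e-12):
    """Compare two sequences element by element within a tolerance."""
    return all(abs(p - q) < tol for p, q in zip(u, v))


f = [1.0, 4.0, 2.0, 8.0, 5.0, 7.0]
k1 = 3.0

# Homogeneity: H[k1 * f] equals k1 * H[f]
homogeneous = close(blur([k1 * v for v in f]), [k1 * v for v in blur(f)])

# Position invariance: H[f(x - a)] equals g(x - a), where g = H[f]
position_invariant = close(blur(shift(f, 2)), shift(blur(f), 2))
```

Both flags come out true for this operator. An operator whose weights change with position would fail the second check.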
67. Which is the most frequent method to overcome the difficulty of formulating the
spatial relocation of pixels?
The use of tiepoints is the most frequent method. Tiepoints are subsets of pixels whose
locations in the input (distorted) and output (corrected) images are known precisely.
68. What are the three methods of estimating the degradation function?
1. Observation
2. Experimentation
3. Mathematical modeling.
(i) Compute the sum of the products of the coefficients with the gray levels contained in
the region encompassed by the mask.
(ii) The response of the mask at any point in the image is
R = w1z1 + w2z2 + w3z3 + … + w9z9
where zi = gray level of the pixel associated with mask coefficient wi.
(iii) The response of the mask is defined with respect to its center
location.
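The sum of products R = w1z1 + … + w9z9 can be sketched as follows. The 3x3 point-detection mask and the small test image are assumed examples chosen for illustration, and `mask_response` is a hypothetical helper name.

```python
def mask_response(image, mask, row, col):
    """Return R = w1*z1 + ... + w9*z9 for a 3x3 mask centred at (row, col).

    The caller must keep (row, col) at least one pixel away from the border.
    """
    total = 0
    for i in range(3):
        for j in range(3):
            total += mask[i][j] * image[row - 1 + i][col - 1 + j]
    return total


# Assumed example: one bright pixel on a flat background
image = [
    [1, 1, 1, 1],
    [1, 9, 1, 1],
    [1, 1, 1, 1],
    [1, 1, 1, 1],
]

# Assumed example mask: coefficients sum to zero, so flat regions give R = 0
point_mask = [
    [-1, -1, -1],
    [-1,  8, -1],
    [-1, -1, -1],
]

at_bright_pixel = mask_response(image, point_mask, 1, 1)
next_to_it = mask_response(image, point_mask, 2, 2)
```

The response is large where the center pixel differs from its neighbors, and it is attributed to the mask's center location, as item (iii) states.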
76. Why is edge detection the most common approach for detecting discontinuities?
The isolated points and thin lines are not frequent occurrences in most practical
applications, so edge detection is mostly preferred in detection of discontinuities.
77. How are the derivatives obtained in edge detection during formulation?
The first derivative at any point in an image is obtained by using the magnitude of
the gradient at that point. Similarly, the second derivatives are obtained by using the
Laplacian.
78. Write about linking edge points.
The approach for linking edge points is to analyse the characteristics of pixels in a
small neighborhood (3x3 or 5x5) about every point (x,y) in an image that has undergone
edge detection. All points that are similar are linked, forming a boundary of pixels that
share some common properties.
W1 W2 W3
W4 W5 W6
W7 W8 W9
79. What are the two properties used for establishing similarity of edge pixels?
(1) The strength of the response of the gradient operator used to produce the edge
pixel.
(2) The direction of the gradient.
- The gradient vector points in the direction of the maximum rate of change of f at (x,y).
- In edge detection, an important quantity is the magnitude of this vector (the gradient),
denoted ∇f:
∇f = mag(∇f) = [Gx^2 + Gy^2]^(1/2)
The direction of the gradient vector is also an important quantity:
α(x,y) = tan^-1(Gy/Gx)
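A small sketch of the magnitude and direction computation. The Sobel masks are assumed here as the way to estimate Gx and Gy, with Gx taken as the change across columns and Gy as the change across rows; the text does not fix either choice. `math.atan2` replaces a plain arctangent of Gy/Gx so that Gx = 0 does not cause a division by zero.

```python
import math

# Assumed masks (Sobel): SOBEL_X responds to change across columns,
# SOBEL_Y to change across rows.
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]


def gradient_at(image, row, col):
    """Return (magnitude, direction in radians) of the gradient at (row, col)."""
    gx = sum(SOBEL_X[i][j] * image[row - 1 + i][col - 1 + j]
             for i in range(3) for j in range(3))
    gy = sum(SOBEL_Y[i][j] * image[row - 1 + i][col - 1 + j]
             for i in range(3) for j in range(3))
    magnitude = math.hypot(gx, gy)   # [Gx^2 + Gy^2]^(1/2)
    direction = math.atan2(gy, gx)   # angle whose tangent is Gy/Gx
    return magnitude, direction


# Assumed example: a vertical step edge between columns 1 and 2
image = [[0, 0, 10, 10] for _ in range(4)]
magnitude, direction = gradient_at(image, 1, 1)
```

For this step edge the gradient has no component across rows, so the direction is zero and the magnitude equals |Gx|.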
A pattern class is a family of patterns that share some common properties. Pattern classes are
denoted as w1, w2, w3, …, wM, where M is the number of classes.
The three principal pattern arrangements are vectors, strings, and trees. Pattern
vectors are represented by bold lowercase letters such as x, y, and z, and take
the form x = [x1, x2, …, xn]^T, where each component xi is the i-th descriptor.
• Lossless compression can recover the exact original data after compression. It is
used mainly for compressing database records, spreadsheets or word
processing files, where exact replication of the original is essential.
• Lossy compression will result in a certain loss of accuracy in exchange for a
substantial increase in compression. Lossy compression is more effective when
used to compress graphic images and digitised voice where losses outside visual
or aural perception can be tolerated.
In terms of storage, the capacity of a storage device can be effectively increased with
methods that compress a body of data on its way to the storage device and
decompress it when it is retrieved.
At any given time, the ability of the Internet to transfer data is fixed. Thus, if data can
effectively be compressed wherever possible, significant improvements of data
throughput can be achieved. Many files can be combined into one compressed
document making sending easier.
Consider a character run of 15 'A' characters, which normally would require 15 bytes
to store:
AAAAAAAAAAAAAAA
With RLE, this would only require two bytes to store: the count (15) is stored as the first
byte and the symbol (A) as the second byte.
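The (count, symbol) scheme described above can be sketched as a byte-oriented encoder and decoder. Splitting runs longer than 255 into several pairs is an assumption made so that each count fits in one byte; the function names are illustrative.

```python
def rle_encode(data: bytes) -> bytes:
    """Encode data as (count, symbol) byte pairs.

    Runs longer than 255 are split so each count fits in one byte
    (an assumed convention, not stated in the text).
    """
    out = bytearray()
    i = 0
    while i < len(data):
        run = 1
        while i + run < len(data) and data[i + run] == data[i] and run < 255:
            run += 1
        out += bytes([run, data[i]])
        i += run
    return bytes(out)


def rle_decode(encoded: bytes) -> bytes:
    """Expand (count, symbol) pairs back into the original bytes."""
    out = bytearray()
    for k in range(0, len(encoded), 2):
        out += bytes([encoded[k + 1]]) * encoded[k]
    return bytes(out)


run_of_a = b"A" * 15
packed = rle_encode(run_of_a)   # two bytes: the count 15, then the symbol 'A'
```

Data with few repeated symbols grows under this scheme, because every isolated symbol costs two bytes.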
Huffman compression reduces the average code length used to represent the symbols
of an alphabet. Symbols of the source alphabet, which occur frequently, are assigned
with short length codes. The general strategy is to allow the code length to vary from
character to character and to ensure that the frequently occurring characters have
shorter codes.
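The strategy of giving frequent symbols shorter codes can be sketched with a small Huffman code builder. This is one straightforward construction using a priority queue. The tie-breaking order and the function name `huffman_codes` are assumptions, so the exact bit patterns may differ from a hand-worked example even though the code lengths are optimal.

```python
import heapq
from collections import Counter


def huffman_codes(text):
    """Return a dict mapping each symbol in text to its Huffman bit string."""
    freq = Counter(text)
    if len(freq) == 1:                       # degenerate one-symbol alphabet
        return {next(iter(freq)): "0"}
    # Heap entries: (weight, tie_breaker, {symbol: partial code})
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        w1, _, low = heapq.heappop(heap)     # two least probable groups
        w2, _, high = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in low.items()}
        merged.update({s: "1" + c for s, c in high.items()})
        heapq.heappush(heap, (w1 + w2, tie, merged))
        tie += 1
    return heap[0][2]


codes = huffman_codes("aaaabbc")             # 'a' is the most frequent symbol
total_bits = sum(len(codes[s]) for s in "aaaabbc")
```

For this input the most frequent symbol receives a one-bit code and the two rarer symbols receive two-bit codes.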
Transform coding is used to convert spatial image pixel values to transform coefficient
values. Since this is a linear process and no information is lost, the number of
coefficients produced is equal to the number of pixels transformed.
The desired effect is that most of the energy in the image will be contained in a few
large transform coefficients. If it is generally the same few coefficients that contain most
of the energy in most pictures, then the coefficients may be further coded by lossless
entropy coding. In addition, it is likely that the smaller coefficients can be coarsely
quantized or deleted (lossy coding) without doing visible damage to the reproduced
image.
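Energy compaction can be illustrated with a 1-D DCT on one row of pixels. The orthonormal DCT scaling used below and the eight sample values are assumptions chosen for illustration.

```python
import math


def dct_1d(x):
    """Orthonormal 1-D DCT (assumed scaling: sqrt(1/N) for k = 0, sqrt(2/N) otherwise)."""
    N = len(x)
    out = []
    for k in range(N):
        scale = math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)
        out.append(scale * sum(
            x[n] * math.cos(math.pi * (2 * n + 1) * k / (2 * N))
            for n in range(N)))
    return out


block = [52, 55, 61, 66, 70, 61, 64, 73]     # assumed sample pixel values
coeffs = dct_1d(block)                       # same count as the input pixels

pixel_energy = sum(v * v for v in block)
coeff_energy = sum(c * c for c in coeffs)    # equal to pixel_energy (linear, lossless)

# Fraction of the total energy held by the two largest coefficients
largest_two = sorted((c * c for c in coeffs), reverse=True)[:2]
fraction = sum(largest_two) / coeff_energy
```

Most of the energy of this slowly varying block sits in very few coefficients, which is what lets the small ones be coarsely quantized or dropped.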
99. What are the different transforms used in transform coding, and how do they
differ?
Many types of transforms are used for picture coding, including Fourier, Karhunen-Loeve,
Walsh-Hadamard, lapped orthogonal, discrete cosine (DCT), and, more recently, wavelets. The
various transforms differ among themselves in three basic ways that are of interest in
picture coding:
3) The appearance and visibility of coding noise due to coarse quantization of the
coefficients.
106. Find the number of bits to store a 128 x 128 image with 64 gray levels.
Given:
M = N = 128
L = 64 = 2^k
=> k = 6
No. of bits = M^2 x k
= 128^2 x 6
= 98304 bits
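The same calculation as a short sketch, generalized to an M x N image. The function name `bits_to_store` is illustrative, and the number of gray levels is assumed to be a power of two.

```python
def bits_to_store(M, N, levels):
    """Bits needed for an M x N image with `levels` gray levels (levels = 2^k)."""
    k = levels.bit_length() - 1              # k such that 2^k = levels
    if 2 ** k != levels:
        raise ValueError("number of gray levels must be a power of two")
    return M * N * k


bits = bits_to_store(128, 128, 64)           # k = 6 for 64 gray levels
size_in_bytes = bits // 8
```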
• Average value
• Convolution and Correlation
• Laplacian
110. Obtain the Hadamard transformation for N = 4
N = 4 = 2^n
=> n = 2
Sign of the kernel for each (u, x):
u \ x   0  1  2  3
0       +  +  +  +
1       +  -  +  -
2       +  +  -  -
3       +  -  -  +
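The sign pattern of the N = 4 Hadamard kernel can be cross-checked by building the matrix recursively from the 2 x 2 core. The sketch assumes natural (Kronecker) ordering of the rows and leaves out the 1/N or 1/√N scale factor; `hadamard` is an illustrative name.

```python
def hadamard(n):
    """Return the 2^n x 2^n Hadamard matrix with +1/-1 entries (unnormalized)."""
    H = [[1]]
    for _ in range(n):
        # H_2N = [[H, H], [H, -H]]
        H = ([row + row for row in H] +
             [row + [-v for v in row] for row in H])
    return H


H4 = hadamard(2)

# Rows of a Hadamard matrix are mutually orthogonal: H * H^T = N * I
gram = [[sum(a * b for a, b in zip(r1, r2)) for r2 in H4] for r1 in H4]
```

No two rows of the result are equal, which is required for the orthogonality check to hold.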
111. Write down the properties of the Haar transform.
• Real and orthogonal
• Very fast transform
• Basis vectors are sequentially ordered
• Has fair energy compaction for images
• Useful in feature extraction, image coding, and image analysis problems
112. What is enhancement?
Image enhancement is a technique to process an image so that the result is more suitable
than the original image for a specific application.
1 0
0 1
0 -1
-1 0
16 marks:-
mx = E{X}
Cx = E{(X - mx)(X - mx)^T}
mx = (1/M) Σ_{k=1}^{M} Xk
Cx = (1/M) Σ_{k=1}^{M} (Xk Xk^T - mx mx^T)
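The sample estimates of mx and Cx can be sketched directly from the two summation formulas. The four 3-D sample vectors are an assumed toy data set, and the helper names are illustrative. The eigenvector step of the Hotelling transform is not shown.

```python
def mean_vector(samples):
    """mx = (1/M) * sum over k of Xk."""
    M = len(samples)
    n = len(samples[0])
    return [sum(x[i] for x in samples) / M for i in range(n)]


def covariance_matrix(samples):
    """Cx = (1/M) * sum over k of (Xk Xk^T - mx mx^T)."""
    M = len(samples)
    n = len(samples[0])
    m = mean_vector(samples)
    return [
        [sum(x[i] * x[j] - m[i] * m[j] for x in samples) / M for j in range(n)]
        for i in range(n)
    ]


# Assumed toy data: M = 4 vectors of dimension 3
X = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (1, 0, 1)]
mx = mean_vector(X)
Cx = covariance_matrix(X)
```

Cx is symmetric. Its diagonal holds the variance of each component and its off-diagonal entries the covariances that the Hotelling transform removes.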
One example-8 marks
7. Explain the Hadamard transform matrices Hn and their properties.
Hint:
And properties
8. Explain DCT and its properties.
Hint:
Forward 1D DCT & 2D DCT, Inverse 1D DCT & 2D DCT
Properties
9. Define the Haar transform. Derive the same for N = 8. What are its properties?
Hint:
Based on the Haar functions hk(z) defined over z ∈ [0, 1], with
k = 2^p + q - 1
hk(z) = (1/√N) 2^(p/2), (q-1)/2^p ≤ z < (q-1/2)/2^p
= -(1/√N) 2^(p/2), (q-1/2)/2^p ≤ z < q/2^p
= 0, otherwise
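A sketch that builds the N x N Haar matrix by sampling the Haar functions at z = m/N for m = 0, …, N-1. Two points are assumptions: half-open intervals are used so that each sample falls in exactly one branch, and the k = 0 row is set to the constant 1/√N. The name `haar_matrix` is illustrative.

```python
import math


def haar_matrix(N):
    """Rows are h_k(z) sampled at z = m/N; N is assumed to be a power of two."""
    rows = [[1 / math.sqrt(N)] * N]          # k = 0: constant row (assumed)
    for k in range(1, N):
        p = k.bit_length() - 1               # largest p with 2^p <= k
        q = k - 2 ** p + 1                   # so that k = 2^p + q - 1
        amplitude = 2 ** (p / 2) / math.sqrt(N)
        row = []
        for m in range(N):
            z = m / N
            if (q - 1) / 2 ** p <= z < (q - 0.5) / 2 ** p:
                row.append(amplitude)
            elif (q - 0.5) / 2 ** p <= z < q / 2 ** p:
                row.append(-amplitude)
            else:
                row.append(0.0)
        rows.append(row)
    return rows


H8 = haar_matrix(8)

# Orthonormality check: every entry of H * H^T should match the identity
gram = [[sum(a * b for a, b in zip(r1, r2)) for r2 in H8] for r1 in H8]
```

The rows for larger k have fewer nonzero entries, which is why the Haar transform needs few operations.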
And properties
10. Discuss the properties and applications of 1) Hadamard transform 2) Hotelling transform
Hint:
Properties of Hadamard:
Real and orthogonal
Fast transform
Faster than sine transform
Good energy compaction for images
Appl:
Image data compression, filtering, and design of codes
Properties of Hotelling:
Real and orthogonal
Not a fast transform
Best energy compaction for images
Appl:
Useful in performance evaluation & for finding performance bounds
11. Discuss the image smoothing filter with its model in the spatial domain.
Hint:
LPF-blurring
Median filter – noise reduction & for sharpening image
12. What are image sharpening filters? Explain the various types.
Hint:
Used for highlighting fine details
HPF - output gets sharpened and background becomes darker
High boost - output gets sharpened but background remains unchanged
Appl:
Medical imaging, electronic printing, industrial inspection
13. Discuss in detail about homomorphic & derivative filters.
Hint:
Homomorphic: Improving the appearance of an image by simultaneous compression and
contrast enhancement.
f(x,y) = i(x,y) r(x,y)
block diagram
Derivative: To obtain a more sharpened image
Roberts cross-gradient operator, Prewitt operator, Sobel operator
14. Explain the Wiener smoothing filter and its relation to inverse filtering and
diffraction-limited systems.
Hint:
Wiener filter: mean square error σe^2 = E{[U(m,n) - Û(m,n)]^2}
Wiener filter equation: Û(m,n) = Σ_{k,l=-∞}^{∞} g(m,n;k,l) v(k,l)
Response: G(w1,w2) = Suv(w1,w2) Svv^-1(w1,w2)
σe^2 = (1/4π^2) ∫∫_{-∞}^{∞} Se(w1,w2) dw1 dw2
Inverse filter:
H^-1(w1,w2) = 1/H(w1,w2)
Hint:
Used for text compression
Calculate Lavg, entropy, efficiency, redundancy, variance
Minimum variance method, advantages
Entropy H = -Σ_{i=0}^{L-1} Pi log2 Pi
Efficiency = H / Lavg
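The quantities listed in this hint can be computed as follows. The probabilities and code lengths are an assumed example. Redundancy is taken as 1 minus efficiency and variance as the probability-weighted spread of the code lengths about Lavg; both are common definitions that the hint itself does not spell out.

```python
import math


def code_statistics(probs, lengths):
    """Return (Lavg, entropy, efficiency, redundancy, variance) for a code."""
    l_avg = sum(p * l for p, l in zip(probs, lengths))
    entropy = -sum(p * math.log2(p) for p in probs if p > 0)
    efficiency = entropy / l_avg
    redundancy = 1 - efficiency              # assumed definition
    variance = sum(p * (l - l_avg) ** 2 for p, l in zip(probs, lengths))
    return l_avg, entropy, efficiency, redundancy, variance


# Assumed example: four symbols with a matching variable-length code
probs = [0.5, 0.25, 0.125, 0.125]
lengths = [1, 2, 3, 3]
l_avg, entropy, efficiency, redundancy, variance = code_statistics(probs, lengths)
```

When every probability is a negative power of two, as here, the average length equals the entropy and the efficiency is 1.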
19. Explain the types of error-free compression techniques.
Hint:
Variable length coding
LZW coding
Bit plane coding
20. Explain how compression is achieved in transform coding and explain about DCT
Hint:
Block diagram of encoder, decoder, bit allocation, 1D transform coding, 2D transform
coding, application, and explain 1D, 2D DCT
21. Explain arithmetic coding
Hint:
Non-block code
Explain with one example
22. Explain the various functional blocks of the JPEG standard.
Hint:
Compression standard, 2 modes, 3 different coding systems - lossy baseline, extended,
lossless independent. JPEG baseline coding and decoding.
DC coefficient is
21. Explain in detail about the color model and color enhancement.
22. Explain how compression is achieved in transform coding and explain the DCT.
23. Discuss about the MPEG standard and compare with JPEG.
24. Explain arithmetic coding and Huffman coding.
25. Explain various functional block of Digital Image processing?