Degraded Document Image Enhancing in Spatial Domain Using Adaptive Contrasting and Thresholding
Degraded Document Image Enhancing in Spatial Domain Using Adaptive Contrasting and Thresholding
Degraded Document Image Enhancing in Spatial Domain Using Adaptive Contrasting and Thresholding
I.
Dr. P V Ramaraju
G.Nagaraju
Professor
Department of ECE
SRKR Engg. College
Bhimavaram, India.
Asst.Professor,
Department of ECE
SRKR Engg. College
Bhimavaram, India.
V.Rajasekhar
M.Tech Student
Department of ECE
SRKR Engg. College
Bhimavaram, India.
INTRODUCTION
Robust binarization gives the possibility of a correct extraction
of the sketched line drawing or text from its background. For the
binarization of images many algorithms have been implemented.
Thresholding is a sufficiently accurate and high processing speed
segmentation approach to monochrome image. This paper describes a
modified logical thresholding method for binarization of seriously
degraded and very poor quality gray-scale document images.
In general there are two types of image thresholding techniques
available: global and local. In the global thresholding technique a gray
level image is converted into a binary image based on an image
intensity value called global threshold which is fixed in the whole image
domain whereas in local thresholding technique, threshold value can
vary from one pixel location to next. Thus, global thresholding converts
an input image I to a binary image G as follows G(i, j) = 1 for I (i, j)
T, or, G(i, j) = 0 for I (i, j) < T, where T is the threshold, G (i, j) =
1 for foreground and G (i, j) = 0 for background.
Whereas, for a local threshold, the threshold T is a function over
the image domain, i.e.,T= T(x, y). In addition, if in constructing the
threshold value/surface the algorithm adapts itself to the image
intensity values, then it is called dynamic or adaptive threshold.
318
T=
well for many real world images where a significant overlap exists in
the gray level histogram between the pixel intensity values of the
objects and the background due to un-even and poor illumination.
As many degraded documents do not have a clear bimodal
pattern, global thresholding [4][7] is usually not a suitable approach
for the degraded document binarization. Adaptive thresholding [8]
[14], which estimates a local threshold for each document image pixel,
is often a better approach to deal with different variations within
degraded document images. For example, the early window-based
adaptive thresholding techniques [12], [13] estimate the local threshold
by using the mean and the standard variation of image pixels within a
local neighborhood window.
The local image contrast and the local image gradient are very useful
features for segmenting the text from the document background
because the document text usually has certain image contrast to the
neighboring document background.
The image gradient is defined as follows
1
G(x,y)=fmax(x, y) fmin(x,y )
The Local contrast is defined as follows
D ( x, y )
f
f
max
max
( x, y)
( x, y)
min
min
( x, y)
( x, y)
PROPOSED METHOD
based on the local threshold that is estimated from the detected text
stroke edge pixels. Some post processing is further applied to improve
the document binarization quality.
A. Contrast Image Construction
The image contrast in Equation 2 has one typical limitation that
it may not handle document images with the bright text properly. This
is because a weak contrast will be calculated for stroke edges of the
bright text where the denominator in Equation 2 will be large but the
numerator will be small. To overcome this over-normalization problem,
we combine the local image contrast with the local image gradient and
derive an adaptive local image contrast as follows
Da ( x, y) D(x, y) (1 )( fmax (x, y) fmin ( x, y)) 3
Where
D(x,
y)
Fig. 1
Fig. 2
(a)
(b)
322
(c)
Fig. 3. Contrast Images constructed using (a) local image gradient, (b)
local image contrast [15], and (c) our proposed method for the original
sample document images which are shown in Fig. 1 and 2, respectively.
Fig. 3 shows the contrast map of the sample document images in Fig. 1
and 2 that are created by using local image gradient, local image
contrast [15] and our proposed method in Equation 3, respectively.
B. Local Threshold Estimation
The text can then be extracted from the document background
pixels once the high contrast stroke edge pixels are detected properly.
Two characteristics can be observed from different kinds of document
images [15]: First, the text pixels are close to the detected text stroke
edge pixels. Second, there is a distinct intensity difference between the
high contrast stroke edge pixels and the surrounding background
pixels. The document image text can thus be extracted based on the
detected text stroke edge pixels as follows
Ne N min&& I ( x , y ) Emean Estd / 2
R ( x , y ) 1..
5
0..otherwise
Where Emean and Estd are the mean and the standard deviation of
the image intensity of the detected high contrast image pixels (within
the original document image) within the neighborhood window that can
be evaluated as follows
E mean
I ( x, y ) * (1 E ( x, y ))
neighbor
Ne
323
E std
(( I ( x , y ) Emean ) * (1 E ( x , y ))) 2
neighbor
6: if The pixels in the pairs belong to the same class (both text or
background) then
7: Classify the foreground and background pixels based on pixel values.
8: end if
9: end for
10: Remove single-pixel artifacts [16] along the text stroke boundaries
after the document thresholding.
11: Store the new binary result to Bf .
D. Parameter Selection
In the first experiment, we apply different to obtain different
power functions and test their performance. is close to 1 when is
small and the local image contrast Da dominates the adaptive image
contrast Da in Equation 3. On the other hand, Da is mainly influenced
by local image gradient when is large. At the same time, the variation
of for different document images increases when is close to 1. Under
such circumstance, the power function becomes more sensitive to the
global image intensity variation and appropriate weights can be
assigned to images with different characteristics.
The proposed method can assign more suitable to different images
when is closer to 1. Parameter should therefore be set around 1
when the adaptability of the proposed technique is maximized and
better and more robust binarization results can be derived from
different kinds of degraded document images.
III. RESULTS
This section evaluates the results for proposed document image
binarization techniques. Given a degraded document image, an
adaptive contrast map is first constructed. The text is then segmented
based on the local threshold that is estimated from the detected text
325
Example 2
327
Fig. 10. Binarization results of the sample document image in Fig. 1(b)
produced by different methods. (a) OTSU [2]. (b) SAUV [12]. (c) NIBL
328
[13]. (d) BERN [8]. (e) GATO [17]. (f) LMM [15]. (g) BE [16]. (h)
Proposed.
Example 2
Fig. 11. Binarization results of the sample document image (PR 06) in
DIBCO 2011 dataset produced by different methods. (a) Input Image.
(b) OTSU [2]. (c) SAUV [12]. (d) NIBL [13]. (e) BERN [8]. (f) GATO
[17]. (g) LMM [15]. (h) BE [16]. (i) LELO [18]. (j) SNUS. (k) HOWE
[19]. (l) Proposed.
V.
CONCLUSION
References
1. R. C. Gonzalez and R. E. Woods, Digital Image Processing,
Pearson prentice Hall, 2005.
2. N. Otsu, A threshold selection method from gray level
histogram, IEEE Transactions on System Man Cybernatics,
Vol. SMC-9, No.1, pp. 62-66, 1979.
3. P.K. Sahoo, S. Soltani, A.K.C. Wong, and Y. Chen, A survey of
thresholding techniques, Computer Vision Graphics Image
Processing, Vol. 41, 1988, pp. 233 260.
4. A. Brink, Thresholding of digital images using two-dimensional
entropies, Pattern Recognit., vol. 25, no. 8, pp. 803808, 1992.
5. J. Kittler and J. Illingworth, On threshold selection using
clustering criteria, IEEE Trans. Syst., Man, Cybern., vol. 15,
no. 5, pp. 652655, Sep.Oct. 1985.
6. N. Otsu, A threshold selection method from gray level
histogram, IEEE Trans. Syst., Man, Cybern., vol. 19, no. 1, pp.
6266, Jan. 1979.
7. N. Papamarkos and B. Gatos, A new approach for
multithreshold selection, Comput. Vis. Graph. Image Process.,
vol. 56, no. 5, pp. 357370, 1994.
8. J. Bernsen, Dynamic thresholding of gray-level images, in
Proc. Int. Conf. Pattern Recognit., Oct. 1986, pp. 12511255.
9. L. Eikvil, T. Taxt, and K. Moen, A fast adaptive method for
binarization of document images, in Proc. Int. Conf. Document
Anal. Recognit., Sep. 1991, pp. 435443.
10. I.-K. Kim, D.-W. Jung, and R.-H. Park, Document image
binarization based on topographic analysis using a water flow
model, Pattern Recognit., vol. 35, no. 1, pp. 265277, 2002.
330
331