Abstract
Document images often suffer from different types of degradation that renders the document image binarization a challenging task. This paper presents a document image binarization technique that segments the text from badly degraded document images accurately. The proposed technique is based on the observations that the text documents usually have a document background of the uniform color and texture and the document text within it has a different intensity level compared with the surrounding document background. Given a document image, the proposed technique first estimates a document background surface through an iterative polynomial smoothing procedure. Different types of document degradation are then compensated by using the estimated document background surface. The text stroke edge is further detected from the compensated document image by using L1-norm image gradient. Finally, the document text is segmented by a local threshold that is estimated based on the detected text stroke edges. The proposed technique was submitted to the recent document image binarization contest (DIBCO) held under the framework of ICDAR 2009 and has achieved the top performance among 43 algorithms that are submitted from 35 international research groups.
Similar content being viewed by others
References
Trier O., Taxt T.: Evaluation of binarization methods for document images. IEEE Trans. Pattern Anal. Mach. Intell 17, 312–315 (1995)
Leedham G., Yan C., Takru K., Tan J.H.N., Mian L.: Comparison of some thresholding algorithms for text/background segmentation in difficult document images. Int Conf Doc Anal. Recogn. 2, 859–864 (2003)
Otsu N.: A threshold selection method from gray level histogram. IEEE Trans. Syst. Man Cybern. 19, 62–66 (1978)
Brink A.: Thresholding of digital images using two-dimensional entropies. Pattern Recogn. 25(8), 803–808 (1992)
Kittler J., Illingworth J.: On threshold selection using clustering criteria. IEEE Trans. syst. Man Cybern. 15, 652–655 (1985)
Solihin Y., Leedham C.: Integral ratio: a new class of global thresholding techniques for handwriting images. IEEE Trans. Pattern Anal. Mach. Intell. 21, 761–768 (1999)
Kim I.-K., Jung D.-W., Park R.-H.: Document image binarization based on topographic analysis using a water flow model. Pattern Recogn. 35, 141–150 (2002)
Yang J., Chen Y., Hsu W.: Adaptive thresholding algorithm and its hardware implementation. Pattern Recogn. Lett. 15(2), 141–150 (1994)
Parker, J., Jennings, C., Salkauskas, A.: Thresholding using an illumination model. International Conference on Document Analysis and Recognition, pp. 270–273. September 1993
Eikvil, L., Taxt, T., Moen, K.: A fast adaptive method for binarization of document images. International Conference on Document Analysis and Recognition, pp. 435–443, September 1991
Niblack W.: An Introduction to Digital Image Processing. Prentice-Hall, Englewood Cliffs (1986)
Sauvola J., Pietikainen M.: Adaptive document image binarization. Pattern Recogn. 33, 225–236 (2000)
Gatos B., Pratikakis I., Perantonis S.: Adaptive degraded document image binarization. Pattern Recogn. 39, 317–327 (2006)
Moghaddam R.F., Cheriet M.: Rsldi: restoration of single-sided low-quality document images. Pattern Recogn. 42, 3355–3364 (2009)
Moghaddam, R.F., Cheriet, M.: Application of multi-level classifiers and clustering for automatic word-spotting in historical document images. International Conference on Document Analysis and Recognition, pp. 511–515. July 2009
Chen Q., Sun Q., Heng P.A., Xia D.: A double-threshold image binarization method based on edge detector. Pattern Recogn. 41, 1254–1267 (2008)
Su, B., Lu, S., Tan, C.L.: Binarization of historical handwritten document images using local maximum and minimum filter. International Workshop on Document Analysis Systems, pp. 159–165. June 2010
Dawoud A.: Iterative cross section sequence graph for handwritten character segmentation. IEEE Trans. Image Process. 16, 2150–2154 (2007)
Gatos, B., Ntirogiannis, K., Pratikakis, I.: Icdar 2009 document image binarization contest (dibco 2009). International Conference on Document Analysis and Recognition, pp. 1375–1382. July 2009
Lu S., Tan C.L.: Binarization of badly illuminated document images through shading estimation and compensation. Int. Conf. Doc. Anal. Recogn. 1, 312–316 (2007)
Hamming R.W.: Digital Filter. Prentice-Hall, Englewood Cliffs (1983)
Krzysztof, M.P.M., Axel, M.: Dynamic threshold using polynomial surface regression with application to the binarization of fingerprints. Proceedings of the SPIE, vol. 5779
Seeger, M., Dance, C.: Binarising camera images for ocr. Proceedings of International Conference on Document Analysis and Recognition, pp. 54–58 (2001)
Lu, S., Tan, C.L.: Thresholding of badly illuminated document images through photometric correction. ACM symposium on Document engineering pp. 3–8. 2007
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lu, S., Su, B. & Tan, C.L. Document image binarization using background estimation and stroke edges. IJDAR 13, 303–314 (2010). https://doi.org/10.1007/s10032-010-0130-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10032-010-0130-8