Abstract
We propose two texture-based approaches, one involving Gabor filters and the other employing log-polar wavelets, for separating text from non-text elements in a document image. Both the proposed algorithms compute local energy at some information-rich points, which are marked by Harris’ corner detector. The advantage of this approach is that the algorithm calculates the local energy at selected points and not throughout the image, thus saving a lot of computational time. The algorithm has been tested on a large set of scanned text pages and the results have been seen to be better than the results from the existing algorithms. Among the proposed schemes, the Gabor filter based scheme marginally outperforms the wavelet based scheme.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Fan, K.C., Wang, L.S., Wang, Y.K.: page segmentation and identification for intelligent signal processing. Signal Processing 45, 329–346 (1995)
Smith, M.A., Kanade, T.: Video skimming for quick browsing based on audio and image characterization, CMU-CS-95-186, Technical report, Carnegie Mellon University (1995)
Jung, K.: Neural network-based text location in color images. Pattern Recognition Letters 22, 1503–1515 (2001)
Wu, J., Qu, S.-L., Zhuo, Q., Wang, W.-Y.: Automatic text detection in complex color images. In: Proc. of Intl. Conf. on Machine Learning and Cybernetics (2002)
Yuan, Q., Tan, C.L.: Text Extraction from Gray Scale Document Images Using Edge Information. In: Proc. of Sixth Intl. Conf. on Document Analysis and Recognition (2001)
Messelodi, S., Modena, C.M.: Automatic identification and skew estimation of text lines in real scene images. Pattern Recognition 32, 791–810 (1999)
Jain, A.K., Yu, B.: Automatic text location in images and video frames. Pattern Recognition 31, 2055–2076 (1998)
Strouthpoulos, C., Papamarkos, N., Atsalakis, A.E.: Text extraction in complex color Document. Pattern Recognition 35, 1743–1758 (2002)
Sabari Raju, S., Pati, P.B., Ramakrishnan, A.G.: Text Localization and Extraction from Complex Color Images. In: Bebis, G., Boyle, R., Koracin, D., Parvin, B. (eds.) ISVC 2005. LNCS, vol. 3804, pp. 486–493. Springer, Heidelberg (2005)
Jain, R., Antani, S., Kasturi, R.: A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video. Pattern Recognition 35(4), 945–965 (2002)
Jung, K., Kim, K.I., Jain, A.K.: Text Information Extraction in Images and Video: A Survey. Pattern Recognition 37(5), 977–997 (2004)
Pun, C.M., Lee, M.C.: Log-polar wavelet energy signature for rotation and scale invariant texture classification. IEEE Trans. PAMI 25(5), 590–603 (2003)
Harris, C., Stephens, M.: A combined corner and edge detector. In: Proc. 4th Alvey Vision Conf., pp. 147–151 (1988)
Davoine, F., et al.: Fractal images compression based on Delaunay triangulation and vector quantization. IEEE Trans. on Image Processing 5(2), 338–346 (1996)
Xiao, Y., Yan, H.: Text region extraction in a document image based on the Delaunay tessellation. Pattern Recognition 36, 799–809 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nourbakhsh, F., Pati, P.B., Ramakrishnan, A.G. (2006). Text Localization and Extraction from Complex Gray Images. In: Kalra, P.K., Peleg, S. (eds) Computer Vision, Graphics and Image Processing. Lecture Notes in Computer Science, vol 4338. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11949619_69
Download citation
DOI: https://doi.org/10.1007/11949619_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68301-8
Online ISBN: 978-3-540-68302-5
eBook Packages: Computer ScienceComputer Science (R0)