Abstract
In this paper, we present a method of region analysis for business card images acquired in a PDA (personal digital assistant) using DCT and information pixel (IP) density. The proposed method consists of three parts: region segmentation, information region (IR) classification, and character region (CR) classification. In the region segmentation, an input business card image is partitioned into 8 × 8 blocks and the blocks are classified into information blocks (IBs) and background blocks (BBs) by a normalized DCT energy. The input image is then segmented into IRs and background regions (BRs) by region labeling on the classified blocks. In the IR classification, each IR is classified into CR or picture region (PR) by using a ratio of DCT energy of edges in horizontal and vertical directions to DCT energy of low frequency components and a density of IPs. In the CR classification, each CR is classified into large CR (LCR) or small CR (SCR) by using the density of IPs and an averaged run-length of IPs. Experimental results show that the proposed region analysis yields good performance for test images of several types of business cards acquired in a PDA under various surrounding conditions. In addition, error rates of the proposed method are shown to be 2.2–10.1% lower in region segmentation and 7.7% lower in IR classification than those of the conventional methods.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Drivas, D., Amin, A.: Page segmentation and classification utilising bottom-up approach. In: Proc. IEEE ICDAR 1995, pp. 610–614 (1995)
Sauvola, J., Pietikäinen, M.: Page segmentation and classification using fast feature extraction and connectivity analysis. In: Proc. IEEE ICDAR 1995, pp. 1127–1131 (1995)
Wang, H., Li, S.Z., Ragupathi, S.: Document segmentation and classification with top-down approach. In: Proc. IEEE 1st Int. Conf. Knowledge-Based Intelligent Electronic Systems, vol. 1, pp. 243–247 (1997)
Chen, C.T.: Transform coding of digital image using variable block DCT with adaptive thresholding and quantization. In: SPIE, vol. 1349, pp. 43–54 (1990)
Bones, P.J., Griffin, T.C., Carey-Smith, C.M.: Segmentation of document images. In: SPIE, vol. 1258, pp. 66–78 (1990)
Chaddha, N., Sharma, R., Agrawal, A., Gupta, A.: Text segmentation in mixed-mode images. In: Proc. IEEE Twenty-Eight Asilomar Conf. Signals, Systems and Computers, vol. 2, pp. 1356–1361 (1994)
O’Gorman, L.: The document spectrum for page layout analysis. IEEE Trans. Pattern Anal. Machine Intell. 15, 1162–1173 (1993)
Li, X., Oh, W.G., Ji, S.Y., Moon, K.A., Kim, H.J.: An efficient method for page segmentation. In: Proc. IEEE ICIPS 1997, vol. 2, pp. 957–961 (1997)
Lee, S.W., Ryu, D.S.: Parameter-free geometric document layout analysis. IEEE Trans. Pattern Anal. Machine Intell. 23, 1240–1256 (2001)
Yip, S.K., Chi, Z.: Page segmentation and content classification for automatic document image processing. In: Proc. IEEE Int. Symp. Intelligent Multimedia, Video and Speech Processing, pp. 279–282 (2001)
Pan, W., Jin, J., Shi, G., Wang, Q.R.: A system for automatic Chinese business card recognition. In: Proc. IEEE ICDAR 2001, pp. 577–581 (2001)
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst., Man, Cybern. SMC-9, 62–66 (1979)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jang, I.H., Kim, C.H., Kim, N.C. (2005). Region Analysis of Business Card Images Acquired in PDA Using DCT and Information Pixel Density. In: Blanc-Talon, J., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2005. Lecture Notes in Computer Science, vol 3708. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11558484_31
Download citation
DOI: https://doi.org/10.1007/11558484_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29032-2
Online ISBN: 978-3-540-32046-3
eBook Packages: Computer ScienceComputer Science (R0)