Abstract
A novel scheme based on gradient and wavelet analyses has been developed to extract table structures from skewed form document images. First, the skewed form image is rotated by the skew angle estimated with the gradient algorithm. The deskewed image is then decomposed into four sub-images by divisible Multiresolution Analysis (MRA) wavelets. Next, a modified wavelet reconstruction algorithm produces, from these sub-images, the table structure image, which represents the geometric structure of the form. In addition, a Minkowski operation produces a second document image without table lines, referred to as the table-free image. Experiments indicate that the scheme can process skewed form document images with promising results.
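The decomposition step can be illustrated with a minimal sketch. The abstract does not name the wavelet filter, so the code below assumes a one-level 2-D Haar transform in an unnormalized averaging form; the function name `haar_decompose` and the toy images are hypothetical and are not taken from the paper. It shows only how one image splits into four half-size sub-images and how horizontal and vertical lines fall into different detail bands.

```python
def haar_decompose(image):
    """One-level 2-D Haar decomposition (assumed filter, averaging form).

    `image` is a list of equal-length rows with even height and width.
    Returns four half-size sub-images: (ll, lh, hl, hh).
    """
    rows, cols = len(image), len(image[0])
    if rows % 2 or cols % 2:
        raise ValueError("image height and width must be even")

    ll, lh, hl, hh = [], [], [], []
    for r in range(0, rows, 2):
        ll_row, lh_row, hl_row, hh_row = [], [], [], []
        for c in range(0, cols, 2):
            # The 2x2 block of pixels handled in this step.
            a, b = image[r][c], image[r][c + 1]
            d, e = image[r + 1][c], image[r + 1][c + 1]
            ll_row.append((a + b + d + e) / 4)  # local average
            lh_row.append((a + b - d - e) / 4)  # top row minus bottom row
            hl_row.append((a - b + d - e) / 4)  # left column minus right column
            hh_row.append((a - b - d + e) / 4)  # diagonal difference
        ll.append(ll_row)
        lh.append(lh_row)
        hl.append(hl_row)
        hh.append(hh_row)
    return ll, lh, hl, hh


# Toy 4x4 images: one horizontal line and one vertical line, one pixel thick.
horizontal_line = [
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
vertical_line = [
    [0, 1, 0, 0],
    [0, 1, 0, 0],
    [0, 1, 0, 0],
    [0, 1, 0, 0],
]

h_ll, h_lh, h_hl, h_hh = haar_decompose(horizontal_line)
v_ll, v_lh, v_hl, v_hh = haar_decompose(vertical_line)
```

In this toy case the horizontal line appears only in the `lh` band and the vertical line only in the `hl` band, which is the kind of directional separation a table-line extractor can exploit. The modified reconstruction algorithm and the Minkowski operation mentioned in the abstract are not reproduced here.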
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xi, D., Lee, SW. (1999). Table Structure Extraction from Form Documents Based on Gradient-Wavelet Scheme. In: Lee, SW., Nakano, Y. (eds) Document Analysis Systems: Theory and Practice. DAS 1998. Lecture Notes in Computer Science, vol 1655. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48172-9_20
DOI: https://doi.org/10.1007/3-540-48172-9_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66507-6
Online ISBN: 978-3-540-48172-0
eBook Packages: Springer Book Archive