Abstract.
Techniques are presented to directly process JBIG-encoded document images. Two experimental processing pipelines are designed to evaluate the performance of the methods from the application perspective. They are document segmentation for obtaining the global layout and the form processing system for form type identification and the form dropout. The JBIG coding context is employed to perform horizontal smearing and connected-component detection concurrently in the course of decoding the base layer of the JBIG images. It is shown that, using a simple segmentation algorithm, the global layout is identified 50 times faster compared to the case of processing the full resolution images. In addition, an original solution is presented for form type identification by use of the Hough transform of the JBIG base layer images, thus expediting it by a factor of 16 in the designed form dropout system. Advantages of the compressed domain processing include fast procedures, reduced memory requirements, and the possibility of hardware implementation.
Similar content being viewed by others
References
CCITT Recommendation T.6 Facsimile Coding Schemes and Control Functions for Group IV Facsimile Apparatus (1989) Terminal equipment and protocols for the telematic services VII, Fascicle VII\.3
International Standard ISO/IEC 11544:1993 and ITU-T Recommendation T.82 Information Technology - Coded Representation of Picture and Audio Information - Progressive Bi-level Image Compression (1993)
Shima Y, Kashioka S, Higashino J (1989) A high-speed rotation method for binary images based on coordinate operation of run data. Syst Comput Jpn 20(6):91-102
Ronse C, Devijiver PA (1984) Connected components in binary images: the detection problem, research studies. Letchworth, Herts, UK
Shima Y, Murakami T, Koga M, Yashiro H, Fujisawa H (1990) A high speed algorithm for propagation type labeling based on block sorting of runs in binary images. In: Proceedings of the 10th international conference on pattern recognition, Atlantic City, NJ, 16-21 June 1990, pp 655-658
Hinds S, Fisher JL, D'Amato DP (1990) A document skew detection method using run-length encoding and the hough transform. In: Proceedings of the 10th international conference on pattern recognition, 1:464-468
Pavlidis T (1984) A hybrid vectorization algorithm. In: Proceedings of the 7th international conference on pattern recognition. Montreal, pp 490-492
Pavlidis T (1986) A vectorizer and feature extractor for document recognition. Comput Vis Graph Image Process 35:111-127
Nagy G, Seth SC, Stoddard SD (1988) Document analysis with an expert system. In: Proceedings of the ACM conference on document processing systems, pp 169-176
Wang D, Srihari SN (1989) Classification of newspaper image blocks using texture analysis. Comput Vis Graph Image Process 47:327-352
Spitz AL (1992) Skew determination in CCITT group IV compressed document images. In: Proceedings of the symposium on document analysis and information retrieval, Las Vegas, NV, pp 11-25
Spitz AL (1995) Logotype detection in compressed domain using alignment signatures. In: Proceedings of the symposium on document analysis and information retrieval Las Vegas, NV, pp 303-310
Hull JJ, Cullen JF (1997) Document image similarity and equivalence detection. In: Proceedings of the 4th international conference on document analysis and recognition. Ulm, Germany, pp 308-312
Regentova E, Latifi S, Deng S (2002) An algorithm with reduced operations for connected components detection in ITU-T 3/4 coded images. IEEE Trans Pattern Anal Mach Intell 24(8):1039-1047
Kanai J, Bagdanov AD (1998) Projection profile based skew estimation algorithm for JBIG compressed images. Int J Doc Anal Recog 1(1):43-51
Witten I, Moffat A, Bell T (1999) Managing gigabytes, compressing and indexing documents and images. Morgan Kauffman, San Francisco
Wahl FM, Wong KJ, Casey RG (1982) Block segmentation and text extraction in mixed text/image documents. Comput Graph Image Process 20:375-390
Akiyama T, Hagita N (1990) Automated entry system for printed documents. Pattern Recog 23(11):1141-1154
Toyoda J, Noguchi Y, Nishimura Y (1982) Study of extracting Japanese newspaper article. In: Proceedings of the 6th international conference on pattern recognition, pp 1113-1115
O'Gorman L (1993) The document spectrum for structural page layout analysis. IEEE Trans Pattern Anal Mach Intell 15(11):1162-1173
Kida H, Iwaki O, Kawada K (1986) Document recognition system for office automation. In: Proccedings of the 8th international conference on pattern recognition, Paris, pp 446-448
Mao J, Abayan M, Mohiuddin K (1996) A model-based form processing sub-system. In: Proceedings of the international conference on pattern recognition, pp 691-695
Yu B, Jain A (1996) A generic system for form dropout. IEEE Trans Pattern Anal Mach Intell 18:1127-1134
Taylor S, Fritzon R, Pastor J (1992) Extraction data from preprinted forms. Mach Vis Appl 5:211-222
Liu J, Jain AK (2000) Image based form document retrieval. Pattern Recog 33:503-513
Hough PCV (1962) Methods and means for recognizing complex patterns. US Patent 3,069,654
Author information
Authors and Affiliations
Corresponding author
Additional information
Received: 31 January 2004, Accepted: 4 January 2005, Published online: 5 April 2005
Rights and permissions
About this article
Cite this article
Regentova, E.E., Latifi, S., Chen, D. et al. Document analysis by processing JBIG-encoded images. IJDAR 7, 260–272 (2005). https://doi.org/10.1007/s10032-005-0141-z
Issue Date:
DOI: https://doi.org/10.1007/s10032-005-0141-z