Document analysis by processing JBIG-encoded images

Regentova, E. E.; Latifi, S.; Chen, D.; Taghva, K.; Yao, D.

doi:10.1007/s10032-005-0141-z

E. E. Regentova¹,
S. Latifi¹,
D. Chen¹,
K. Taghva¹ &
…
D. Yao¹

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract.

Techniques are presented to directly process JBIG-encoded document images. Two experimental processing pipelines are designed to evaluate the performance of the methods from the application perspective. They are document segmentation for obtaining the global layout and the form processing system for form type identification and the form dropout. The JBIG coding context is employed to perform horizontal smearing and connected-component detection concurrently in the course of decoding the base layer of the JBIG images. It is shown that, using a simple segmentation algorithm, the global layout is identified 50 times faster compared to the case of processing the full resolution images. In addition, an original solution is presented for form type identification by use of the Hough transform of the JBIG base layer images, thus expediting it by a factor of 16 in the designed form dropout system. Advantages of the compressed domain processing include fast procedures, reduced memory requirements, and the possibility of hardware implementation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

CCITT Recommendation T.6 Facsimile Coding Schemes and Control Functions for Group IV Facsimile Apparatus (1989) Terminal equipment and protocols for the telematic services VII, Fascicle VII\.3
International Standard ISO/IEC 11544:1993 and ITU-T Recommendation T.82 Information Technology - Coded Representation of Picture and Audio Information - Progressive Bi-level Image Compression (1993)
Shima Y, Kashioka S, Higashino J (1989) A high-speed rotation method for binary images based on coordinate operation of run data. Syst Comput Jpn 20(6):91-102
Google Scholar
Ronse C, Devijiver PA (1984) Connected components in binary images: the detection problem, research studies. Letchworth, Herts, UK
Shima Y, Murakami T, Koga M, Yashiro H, Fujisawa H (1990) A high speed algorithm for propagation type labeling based on block sorting of runs in binary images. In: Proceedings of the 10th international conference on pattern recognition, Atlantic City, NJ, 16-21 June 1990, pp 655-658
Hinds S, Fisher JL, D'Amato DP (1990) A document skew detection method using run-length encoding and the hough transform. In: Proceedings of the 10th international conference on pattern recognition, 1:464-468
Pavlidis T (1984) A hybrid vectorization algorithm. In: Proceedings of the 7th international conference on pattern recognition. Montreal, pp 490-492
Pavlidis T (1986) A vectorizer and feature extractor for document recognition. Comput Vis Graph Image Process 35:111-127
Google Scholar
Nagy G, Seth SC, Stoddard SD (1988) Document analysis with an expert system. In: Proceedings of the ACM conference on document processing systems, pp 169-176
Wang D, Srihari SN (1989) Classification of newspaper image blocks using texture analysis. Comput Vis Graph Image Process 47:327-352
Google Scholar
Spitz AL (1992) Skew determination in CCITT group IV compressed document images. In: Proceedings of the symposium on document analysis and information retrieval, Las Vegas, NV, pp 11-25
Spitz AL (1995) Logotype detection in compressed domain using alignment signatures. In: Proceedings of the symposium on document analysis and information retrieval Las Vegas, NV, pp 303-310
Hull JJ, Cullen JF (1997) Document image similarity and equivalence detection. In: Proceedings of the 4th international conference on document analysis and recognition. Ulm, Germany, pp 308-312
Regentova E, Latifi S, Deng S (2002) An algorithm with reduced operations for connected components detection in ITU-T 3/4 coded images. IEEE Trans Pattern Anal Mach Intell 24(8):1039-1047
Google Scholar
Kanai J, Bagdanov AD (1998) Projection profile based skew estimation algorithm for JBIG compressed images. Int J Doc Anal Recog 1(1):43-51
Google Scholar
Witten I, Moffat A, Bell T (1999) Managing gigabytes, compressing and indexing documents and images. Morgan Kauffman, San Francisco
Wahl FM, Wong KJ, Casey RG (1982) Block segmentation and text extraction in mixed text/image documents. Comput Graph Image Process 20:375-390
Akiyama T, Hagita N (1990) Automated entry system for printed documents. Pattern Recog 23(11):1141-1154
Google Scholar
Toyoda J, Noguchi Y, Nishimura Y (1982) Study of extracting Japanese newspaper article. In: Proceedings of the 6th international conference on pattern recognition, pp 1113-1115
O'Gorman L (1993) The document spectrum for structural page layout analysis. IEEE Trans Pattern Anal Mach Intell 15(11):1162-1173
Google Scholar
Kida H, Iwaki O, Kawada K (1986) Document recognition system for office automation. In: Proccedings of the 8th international conference on pattern recognition, Paris, pp 446-448
Mao J, Abayan M, Mohiuddin K (1996) A model-based form processing sub-system. In: Proceedings of the international conference on pattern recognition, pp 691-695
Yu B, Jain A (1996) A generic system for form dropout. IEEE Trans Pattern Anal Mach Intell 18:1127-1134
Google Scholar
Taylor S, Fritzon R, Pastor J (1992) Extraction data from preprinted forms. Mach Vis Appl 5:211-222
Google Scholar
Liu J, Jain AK (2000) Image based form document retrieval. Pattern Recog 33:503-513
Google Scholar
Hough PCV (1962) Methods and means for recognizing complex patterns. US Patent 3,069,654

Download references

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, Department of Computer Science, University of Nevada Las Vegas, 4505 Maryland Parkway, Box 454026, NV 89154-4026, Las Vegas, USA
E. E. Regentova, S. Latifi, D. Chen, K. Taghva & D. Yao

Authors

E. E. Regentova
View author publications
You can also search for this author in PubMed Google Scholar
S. Latifi
View author publications
You can also search for this author in PubMed Google Scholar
D. Chen
View author publications
You can also search for this author in PubMed Google Scholar
K. Taghva
View author publications
You can also search for this author in PubMed Google Scholar
D. Yao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to E. E. Regentova.

Additional information

Received: 31 January 2004, Accepted: 4 January 2005, Published online: 5 April 2005

Rights and permissions

Reprints and permissions

About this article

Cite this article

Regentova, E.E., Latifi, S., Chen, D. et al. Document analysis by processing JBIG-encoded images. IJDAR 7, 260–272 (2005). https://doi.org/10.1007/s10032-005-0141-z

Download citation

Issue Date: September 2005
DOI: https://doi.org/10.1007/s10032-005-0141-z

Keywords:

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Document analysis by processing JBIG-encoded images

Abstract.

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Study on Size Optimization of Scanned Textual Documents

Post-processing coding artefacts for JPEG documents

FastSS: Fast and smooth segmentation of JPEG compressed printed text documents using DC and AC signal analysis

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords:

Subscribe and save

Buy Now

Navigation

Document analysis by processing JBIG-encoded images

Abstract.

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Study on Size Optimization of Scanned Textual Documents

Post-processing coding artefacts for JPEG documents

FastSS: Fast and smooth segmentation of JPEG compressed printed text documents using DC and AC signal analysis

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords:

Subscribe and save

Buy Now

Search

Navigation