
Document recognition: concepts and implementations

Published: 01 December 1992

Abstract

Document recognition is a task in which a document in its physical presentation format is transformed into a structured, author-oriented model of the document. The presentation format can be bitmaps of document pages, a description of the document in a Page Description Language (PDL), or an encoding of the document in a printer or graphics language. The structured model is a format that allows additions to the document, manipulation of the document, and reformatting of its layout and output appearance.

Fully automatic document recognition is not possible, in general, for the same reason that it is not possible to de-translate computer programs automatically. However, it is possible to develop a man-assisted, semi-automatic document recognition method. This method uses two passes. The first pass is completely automatic; it produces a document format called the Interactive Document Model. The Interactive Document Model comprises the recognized typesetting and descriptive structures together with the derived ODA logical and layout structures for the document. The model generated in the first pass is sufficient for most purposes and applications. If it is not acceptable, however, the user can enter the second pass and interactively edit the logical structure.

This paper has three objectives. The first is to formalize the concept of document recognition. The second is to subdivide the problem of document recognition into a number of subproblems, each dealing with a different aspect of the problem. The third is to introduce a problem which we wish to solve, and then to present a High Level Document Recognition method together with the experience gained in developing and using a number of implementations of the method.
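The two-pass workflow described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the names `LogicalNode`, `InteractiveDocumentModel`, `first_pass`, `second_pass` and `recognize` are hypothetical, and the "first line is the title" heuristic is a toy stand-in, not the recognition method of the paper. The sketch only shows the control flow: an automatic pass builds a model, and an optional second pass applies user corrections to the logical structure.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class LogicalNode:
    """One node of a hypothetical logical structure (title, paragraph, ...)."""
    kind: str
    text: str = ""
    children: List["LogicalNode"] = field(default_factory=list)


@dataclass
class InteractiveDocumentModel:
    """Illustrative stand-in for the first-pass output: logical tree plus layout hints."""
    logical_root: LogicalNode
    layout_hints: List[str] = field(default_factory=list)


# A user edit is modelled as a function that mutates the model in place.
Edit = Callable[[InteractiveDocumentModel], None]


def first_pass(lines: List[str]) -> InteractiveDocumentModel:
    """Fully automatic pass.

    Toy heuristic (an assumption, not the paper's method): the first
    non-empty line is the title, every other non-empty line a paragraph.
    """
    root = LogicalNode(kind="document")
    content = [ln.strip() for ln in lines if ln.strip()]
    for i, text in enumerate(content):
        kind = "title" if i == 0 else "paragraph"
        root.children.append(LogicalNode(kind=kind, text=text))
    hints = [f"{node.kind}:block{i}" for i, node in enumerate(root.children)]
    return InteractiveDocumentModel(logical_root=root, layout_hints=hints)


def second_pass(model: InteractiveDocumentModel,
                edits: List[Edit]) -> InteractiveDocumentModel:
    """Optional interactive pass: apply user corrections to the logical structure."""
    for edit in edits:
        edit(model)
    return model


def recognize(lines: List[str],
              edits: Optional[List[Edit]] = None) -> InteractiveDocumentModel:
    """Run the first pass, then the second pass only if the user supplies edits."""
    model = first_pass(lines)
    if edits:
        model = second_pass(model, edits)
    return model


def promote_second_block_to_heading(model: InteractiveDocumentModel) -> None:
    """Example user correction: relabel the second block as a heading."""
    model.logical_root.children[1].kind = "heading"


if __name__ == "__main__":
    page = ["Report", "", "Introduction", "Body text."]
    model = recognize(page, [promote_second_block_to_heading])
    print([node.kind for node in model.logical_root.children])
    # prints ['title', 'heading', 'paragraph']
```

Modelling each user edit as a function keeps the second pass optional: when the first-pass model is acceptable, `recognize` is called without edits and returns the automatic result unchanged.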


Published In

ACM SIGOIS Bulletin  Volume 13, Issue 3
Dec. 1992
43 pages
ISSN:0894-0819
DOI:10.1145/152683

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. ODA
  2. documents
  3. interactive model
  4. recognition
  5. reconnaissance
  6. structures

Qualifiers

  • Article
