Abstract
Recommendations
Document Title Patterns in Information Retrieval
TSD '99: Proceedings of the Second International Workshop on Text, Speech and DialogueThe document titles give an important information about documents. This is why they are frequently used to obtain document keywords. We use them to determine document intentions. To obtain some textual details, we use special information extraction ...
Multi-page document analysis based on format consistency and clustering
In multi-page documents, document elements belonging to the same component usually share format regularity. We call this regularity 'document component intrinsic format consistency' (DCIFC). We present a new document analysis method based on DCIFC, ...
Document cleanup using page frame detection
When a page of a book is scanned or photocopied, textual noise (extraneous symbols from the neighboring page) and/or non-textual noise (black borders, speckles, ...) appear along the border of the document. Existing document analysis methods can handle ...
Comments
Information & Contributors
Information
Published In
Publisher
IEEE Computer Society
United States
Publication History
Qualifiers
- Article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 0Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0