Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1577802.1577807acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmocrConference Proceedingsconference-collections
research-article

A stroke regeneration method for cleaning rule-lines in handwritten document images

Published: 25 July 2009 Publication History
  • Get Citation Alerts
  • Abstract

    We describe a rule-line removal algorithm for handwritten document images in this paper. Compared to the existing approaches, our algorithm obtains more scalability to higher-resolution images and thicker rule-lines. Derived from the simple gap-filling methods using line-drawing algorithms, we present a novel approach to regenerating the missing portions of text strokes. Using this approach, the deformed text can be restored to its original shape. We also explore the noise filtering method for binarized document images, in particular by choosing the morphological operator in accordance with the noise power of the input image. Our approach has proven to be effective by experiments on both real and synthetic handwritten document images.

    References

    [1]
    K. R. Arvind, J. Kumar, and A. G. Ramakrishnan. Line removal and restoration of handwritten strokes. In ICCIMA '07: Proceedings of the International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007), pages 208--214, Washington, DC, USA, 2007. IEEE Computer Society.
    [2]
    H. Cao and V. Govindaraju. Handwritten carbon form preprocessing based on Markov random field. In IEEE Conference on Computer Vision and Pattern Recognition, 2007.
    [3]
    R. Cao and C. L. Tan. Separation of overlapping text from graphics. In International Conference on Document Analysis and Recognition, pages 44--48, 2001.
    [4]
    P. Natarajan, S. Saleem, R. Prasad, E. MacRostie, and K. Subramanian. Multi-lingual offline handwriting recognition using hidden Markov models: A script-independent approach. Springer Book Chapter on Arabic and Chinese Handwriting Recognition, 4768:231--250, 2008.
    [5]
    J. Said, M. Cheriet, and C. Suen. Dynamical morphological processing: a fast method for base line extraction. In Proceedings of the 13th International Conference on Pattern Recognition, volume 2, pages 8--12, 1996.
    [6]
    J. Wang and H. Yan. Mending broken handwriting with a macrostructure analysis method to improve recognition. Pattern Recognition Letter, 20(8):855--864, 1999.
    [7]
    X. Ye, M. Cheriet, and C. Y. Suen. A generic method of cleaning and enhancing handwritten data from business forms. Intarl J. on Document Analysis and Recognition, 4:2001.
    [8]
    J. Yoo, M. Kim, S. Y. Han, and Y. B. Kwon. Line removal and restoration of handwritten characters on the form documents. In International Conference on Document Analysis and Recognition, pages 128--131, 1997.

    Cited By

    View all
    • (2022)Dynamic hidden feature space detection of noisy image set by weight binarizationSignal, Image and Video Processing10.1007/s11760-022-02284-217:3(761-768)Online publication date: 8-Aug-2022
    • (2022)CNN-Based Ruled Line Removal in Handwritten DocumentsFrontiers in Handwriting Recognition10.1007/978-3-031-21648-0_36(530-544)Online publication date: 25-Nov-2022
    • (2019)Digitization and Parameter Extraction of Preserved Paper Electrocardiogram RecordsImmunological Tolerance10.1007/978-981-13-3600-3_46(487-495)Online publication date: 17-Jan-2019
    • Show More Cited By

    Index Terms

    1. A stroke regeneration method for cleaning rule-lines in handwritten document images

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        MOCR '09: Proceedings of the International Workshop on Multilingual OCR
        July 2009
        139 pages
        ISBN:9781605586984
        DOI:10.1145/1577802
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 25 July 2009

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tag

        1. OCR

        Qualifiers

        • Research-article

        Conference

        MOCR '09

        Acceptance Rates

        Overall Acceptance Rate 17 of 34 submissions, 50%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)2
        • Downloads (Last 6 weeks)2
        Reflects downloads up to 27 Jul 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2022)Dynamic hidden feature space detection of noisy image set by weight binarizationSignal, Image and Video Processing10.1007/s11760-022-02284-217:3(761-768)Online publication date: 8-Aug-2022
        • (2022)CNN-Based Ruled Line Removal in Handwritten DocumentsFrontiers in Handwriting Recognition10.1007/978-3-031-21648-0_36(530-544)Online publication date: 25-Nov-2022
        • (2019)Digitization and Parameter Extraction of Preserved Paper Electrocardiogram RecordsImmunological Tolerance10.1007/978-981-13-3600-3_46(487-495)Online publication date: 17-Jan-2019
        • (2016)Conservative preprocessing of document imagesInternational Journal on Document Analysis and Recognition10.1007/s10032-016-0273-319:4(321-333)Online publication date: 1-Dec-2016
        • (2014)Rule Line Detection and Removal in Handwritten Text ImagesProceedings of the 2014 Fifth International Conference on Signal and Image Processing10.1109/ICSIP.2014.55(310-315)Online publication date: 8-Jan-2014
        • (2014)Progress in the Raytheon BBN Arabic Offline Handwriting Recognition System2014 14th International Conference on Frontiers in Handwriting Recognition10.1109/ICFHR.2014.99(555-560)Online publication date: Sep-2014
        • (2013)An image processing self-training system for ruling line removal algorithms2013 18th International Conference on Digital Signal Processing (DSP)10.1109/ICDSP.2013.6622767(1-6)Online publication date: Jul-2013
        • (2013)Alternatives for Page Skew Compensation in Writer IdentificationProceedings of the 2013 12th International Conference on Document Analysis and Recognition10.1109/ICDAR.2013.189(927-931)Online publication date: 25-Aug-2013
        • (2012)Applying Discriminatively Optimized Feature Transform for HMM-based Off-Line Handwriting RecognitionProceedings of the 2012 International Conference on Frontiers in Handwriting Recognition10.1109/ICFHR.2012.182(219-224)Online publication date: 18-Sep-2012
        • (2011)A real-world noisy unstructured handwritten notebook corpus for document image analysis researchProceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data10.1145/2034617.2034620(1-8)Online publication date: 17-Sep-2011
        • Show More Cited By

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media