Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2809544.2809546acmotherconferencesArticle/Chapter ViewAbstractPublication PageshipConference Proceedingsconference-collections
research-article

Space Displacement Localization Neural Networks to locate origin points of handwritten text lines in historical documents

Published: 22 August 2015 Publication History
  • Get Citation Alerts
  • Abstract

    We describe a new method for detecting and localizing multiple objects in an image using context aware deep neural networks. Common architectures either proceed locally per pixel-wise sliding-windows, or globally by predicting object localizations for a full image. We improve on this by training a semi-local model to detect and localize objects inside a large image region, which covers an object or a part of it. Context knowledge is integrated, combining multiple predictions for different regions through a spatial context layer modeled as an LSTM network.
    The proposed method is applied to a complex problem in historical document image analysis, where we show that is capable of robustly detecting text lines in the images from the ANDAR-TL competition. Experiments indicate that the model can cope with difficult situations and reach the state of the art in Vision such as other deep models.

    References

    [1]
    D. S. Bloomberg, G. E. Kopec, and L. Dasari. Measuring document image skew and orientation. In Proc. Document Recognition II, 1995.
    [2]
    K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman. Return of the devil in the details: Delving deep into convolutional nets. CoRR, abs/1405.3531, 2014.
    [3]
    M. Delakis and C. Garcia. text detection with convolutional neural networks. In VISAPP (2), pages 290--294, 2008.
    [4]
    X. Du, W. Pan, and T. D. Bui. Text line segmentation in handwritten documents using mumford--shah model. Pattern Recognition, 42(12):3136--3145, 2009.
    [5]
    D. Erhan, C. Szegedy, A. Toshev, and D. Anguelov. Scalable Object Detection using Deep Neural Networks. In Proc. CVPR, 2014.
    [6]
    P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ramanan. Object detection with discriminatively trained part based models. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 32(9):1627--1645, 2010.
    [7]
    P. Felzenszwalb and D. Huttenlocher. Pictorial Structures for Object Recognition. International Journal of Computer Vision, 61(1):55--79, 2005.
    [8]
    C. Garcia and M. Delakis. Convolutional face finder: a neural architecture for fast and robust face detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26:1408--1423, 2004.
    [9]
    R. Girshick, J. Donahue, T. Darrell, and J. Malik. Region-based convolutional networks for accurate object detection and semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, to appear.
    [10]
    A. Graves, S. Fernández, and J. Schmidhuber. Multi-dimensional recurrent neural networks. 2007.
    [11]
    G. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv: ..., pages 1--18, 2012.
    [12]
    G. Louloudis, B. Gatos, I. Pratikakis, and C. Halatsis. Text line and word segmentation of handwritten documents. Pattern Recognition, 42(12):3169--3183, Dec. 2009.
    [13]
    B. Moysset, T. Bluche, M. Knibbe, M. F. Benzeghiba, R. Messina, J. Louradour, and C. Kermorvant. The a2ia multi-lingual text recognition system at the maurdor evaluation. In Proc. International Conference on Frontiers in Handwriting Recognition (ICFHR), 2014.
    [14]
    A. Nicolaou and B. Gatos. Handwritten Text Line Segmentation by Shredding Text into its Lines. International Conference on Document Analysis and Recognition, 2009.
    [15]
    V. Papavassiliou, V. Katsouros, and G. Carayannis. A Morphological Approach for Text-Line Segmentation in Handwritten Documents. In International Conference on Frontiers in Handwriting Recognition, 2010.
    [16]
    I. Rabaev, O. Biller, J. El-Sana, K. Kedem, and I. Dinstein. Text line detection in corrupted and damaged historical manuscripts. In Document Analysis and Recognition (ICDAR), 2013 12th International Conference on, pages 812--816. IEEE, 2013.
    [17]
    O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision (IJCV), 2015.
    [18]
    P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229, 2013.
    [19]
    Z. Shi, S. Setlur, and V. Govindaraju. A Steerable Directional Local Profile Technique for Extraction of Handwritten Arabic Text Lines. In Proc. International Conference of Document Analysis and Recognition (ICDAR), 2009.
    [20]
    J. R. Uijlings, K. E. van de Sande, T. Gevers, and A. W. Smeulders. Selective search for object recognition. International journal of computer vision, 104(2):154--171, 2013.
    [21]
    A. Vinciarelli and J. Luettin. A new normalisation technique for cursive handwritten words. Pattern Recognition Letters, 22:1043--1050, 2001.
    [22]
    A. Zahour, L. Likforman-Sulem, W. Boussellaa, and B. Taconet. Text Line Segmentation of Historical Arabic Documents. In International Conference on Document Analysis and Recognition, 2007.
    [23]
    K. Zuiderveld. Graphics gems iv. chapter Contrast Limited Adaptive Histogram Equalization, pages 474--485. Academic Press Professional, Inc., San Diego, CA, USA, 1994.

    Cited By

    View all
    • (2023)Instance Segmentation of Handwritten Text on Historical Document Images Using Deep Learning Approaches4th International Conference on Artificial Intelligence and Applied Mathematics in Engineering10.1007/978-3-031-31956-3_20(244-253)Online publication date: 27-May-2023
    • (2021)Segmentation of text lines using multi-scale CNN from warped printed and handwritten document imagesInternational Journal on Document Analysis and Recognition (IJDAR)10.1007/s10032-021-00370-8Online publication date: 21-May-2021
    • (2020)A Novel Semantic Segmentation Model for Chinese CharactersIEEE Access10.1109/ACCESS.2020.30270198(179083-179093)Online publication date: 2020
    • Show More Cited By
    1. Space Displacement Localization Neural Networks to locate origin points of handwritten text lines in historical documents

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        HIP '15: Proceedings of the 3rd International Workshop on Historical Document Imaging and Processing
        August 2015
        155 pages
        ISBN:9781450336024
        DOI:10.1145/2809544
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        In-Cooperation

        • FamilySearch: FamilySearch

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 22 August 2015

        Permissions

        Request permissions for this article.

        Check for updates

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Conference

        HIP '15

        Acceptance Rates

        Overall Acceptance Rate 52 of 90 submissions, 58%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)7
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 11 Aug 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)Instance Segmentation of Handwritten Text on Historical Document Images Using Deep Learning Approaches4th International Conference on Artificial Intelligence and Applied Mathematics in Engineering10.1007/978-3-031-31956-3_20(244-253)Online publication date: 27-May-2023
        • (2021)Segmentation of text lines using multi-scale CNN from warped printed and handwritten document imagesInternational Journal on Document Analysis and Recognition (IJDAR)10.1007/s10032-021-00370-8Online publication date: 21-May-2021
        • (2020)A Novel Semantic Segmentation Model for Chinese CharactersIEEE Access10.1109/ACCESS.2020.30270198(179083-179093)Online publication date: 2020
        • (2019)Robust Keypoint Detection2019 International Conference on Document Analysis and Recognition Workshops (ICDARW)10.1109/ICDARW.2019.40072(1-7)Online publication date: Sep-2019
        • (2018)Driver information systemMultimedia Tools and Applications10.1007/s11042-017-5054-677:12(14673-14703)Online publication date: 1-Jun-2018
        • (2018)Fully convolutional network with dilated convolutions for handwritten text line segmentationInternational Journal on Document Analysis and Recognition10.1007/s10032-018-0304-321:3(177-186)Online publication date: 1-Sep-2018
        • (2017)Handwritten Text Line Segmentation Using Fully Convolutional Network2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2017.321(5-9)Online publication date: Nov-2017
        • (2016)Joint line segmentation and transcription for end-to-end handwritten paragraph recognitionProceedings of the 30th International Conference on Neural Information Processing Systems10.5555/3157096.3157190(838-846)Online publication date: 5-Dec-2016
        • (2016)Learning Text-Line Localization with Shared and Local Regression Neural Networks2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR)10.1109/ICFHR.2016.0014(1-6)Online publication date: Oct-2016

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media