Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1109/ICCV.2013.157guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Text Localization in Natural Images Using Stroke Feature Transform and Text Covariance Descriptors

Published: 01 December 2013 Publication History
  • Get Citation Alerts
  • Abstract

    In this paper, we present a new approach for text localization in natural images, by discriminating text and non-text regions at three levels: pixel, component and text line levels. Firstly, a powerful low-level filter called the Stroke Feature Transform (SFT) is proposed, which extends the widely-used Stroke Width Transform (SWT) by incorporating color cues of text pixels, leading to significantly enhanced performance on inter-component separation and intra-component connection. Secondly, based on the output of SFT, we apply two classifiers, a text component classifier and a text-line classifier, sequentially to extract text regions, eliminating the heuristic procedures that are commonly used in previous approaches. The two classifiers are built upon two novel Text Covariance Descriptors (TCDs) that encode both the heuristic properties and the statistical characteristics of text stokes. Finally, text regions are located by simply thresholding the text-line confident map. Our method was evaluated on two benchmark datasets: ICDAR 2005 and ICDAR 2011, and the corresponding Fmeasure values are 0.72 and 0.73, respectively, surpassing previous methods in accuracy by a large margin.

    Cited By

    View all
    • (2023)Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/352461719:1s(1-24)Online publication date: 3-Feb-2023
    • (2022)Accelerating OCR-Based Widget Localization for Test Automation of GUI ApplicationsProceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering10.1145/3551349.3556966(1-13)Online publication date: 10-Oct-2022
    • (2022)Data-efficient image captioning of fine art paintings via virtual-real semantic alignment trainingNeurocomputing10.1016/j.neucom.2022.01.068490:C(163-180)Online publication date: 14-Jun-2022
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    ICCV '13: Proceedings of the 2013 IEEE International Conference on Computer Vision
    December 2013
    3650 pages
    ISBN:9781479928408

    Publisher

    IEEE Computer Society

    United States

    Publication History

    Published: 01 December 2013

    Author Tags

    1. Low-level filter
    2. stroke width transform
    3. text component
    4. text covariance descriptors

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/352461719:1s(1-24)Online publication date: 3-Feb-2023
    • (2022)Accelerating OCR-Based Widget Localization for Test Automation of GUI ApplicationsProceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering10.1145/3551349.3556966(1-13)Online publication date: 10-Oct-2022
    • (2022)Data-efficient image captioning of fine art paintings via virtual-real semantic alignment trainingNeurocomputing10.1016/j.neucom.2022.01.068490:C(163-180)Online publication date: 14-Jun-2022
    • (2022)A decadeComputer Science Review10.1016/j.cosrev.2021.10043442:COnline publication date: 9-Apr-2022
    • (2020)Automatic detonator code recognition via deep neural networkExpert Systems with Applications: An International Journal10.1016/j.eswa.2019.113121145:COnline publication date: 1-May-2020
    • (2019)Oracle bone inscription detectionProceedings of the International Conference on Artificial Intelligence, Information Processing and Cloud Computing10.1145/3371425.3371434(1-8)Online publication date: 19-Dec-2019
    • (2019)AB-LSTMACM Transactions on Multimedia Computing, Communications, and Applications10.1145/335672815:4(1-23)Online publication date: 16-Dec-2019
    • (2019)A decisive content based image retrieval approach for feature fusion in visual and textual imagesKnowledge-Based Systems10.1016/j.knosys.2019.05.001179:C(8-20)Online publication date: 1-Sep-2019
    • (2019)Single Shot Text Detector with Rotational Prior BoxesNeural Processing Letters10.1007/s11063-018-9810-z49:3(863-877)Online publication date: 1-Jun-2019
    • (2019)Scene text detection with fully convolutional neural networksMultimedia Tools and Applications10.1007/s11042-019-7177-478:13(18205-18227)Online publication date: 1-Jul-2019
    • Show More Cited By

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media