Article

Text Localization in Natural Images Using Stroke Feature Transform and Text Covariance Descriptors

Authors:

Weilin Huang,

Zhe Lin,

Jianchao Yang, and

Jue WangAuthors Info & Claims

ICCV '13: Proceedings of the 2013 IEEE International Conference on Computer Vision

December 2013

Pages 1241 - 1248

https://doi.org/10.1109/ICCV.2013.157

Published: 01 December 2013 Publication History

Abstract

In this paper, we present a new approach for text localization in natural images, by discriminating text and non-text regions at three levels: pixel, component and text line levels. Firstly, a powerful low-level filter called the Stroke Feature Transform (SFT) is proposed, which extends the widely-used Stroke Width Transform (SWT) by incorporating color cues of text pixels, leading to significantly enhanced performance on inter-component separation and intra-component connection. Secondly, based on the output of SFT, we apply two classifiers, a text component classifier and a text-line classifier, sequentially to extract text regions, eliminating the heuristic procedures that are commonly used in previous approaches. The two classifiers are built upon two novel Text Covariance Descriptors (TCDs) that encode both the heuristic properties and the statistical characteristics of text stokes. Finally, text regions are located by simply thresholding the text-line confident map. Our method was evaluated on two benchmark datasets: ICDAR 2005 and ICDAR 2011, and the corresponding Fmeasure values are 0.72 and 0.73, respectively, surpassing previous methods in accuracy by a large margin.

Cited By

View all

Fu ZXie HFang SWang YXing MZhang Y(2023)Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/352461719:1s(1-24)Online publication date: 3-Feb-2023
https://dl.acm.org/doi/10.1145/3524617
Qian JMa YLin CChen L(2022)Accelerating OCR-Based Widget Localization for Test Automation of GUI ApplicationsProceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering10.1145/3551349.3556966(1-13)Online publication date: 10-Oct-2022
https://dl.acm.org/doi/10.1145/3551349.3556966
Lu YGuo CDai XWang F(2022)Data-efficient image captioning of fine art paintings via virtual-real semantic alignment trainingNeurocomputing10.1016/j.neucom.2022.01.068490:C(163-180)Online publication date: 14-Jun-2022
https://dl.acm.org/doi/10.1016/j.neucom.2022.01.068
Show More Cited By

Recommendations

Text Detection in Natural Scene Images by Stroke Gabor Words
ICDAR '11: Proceedings of the 2011 International Conference on Document Analysis and Recognition

In this paper, we propose a novel algorithm, based on stroke components and descriptive Gabor filters, to detect text regions in natural scene images. Text characters and strings are constructed by stroke components as basic units. Gabor filters are ...
Read More
Recognition of handwritten characters using local gradient feature descriptors

In this paper we propose to use local gradient feature descriptors, namely the scale invariant feature transform keypoint descriptor and the histogram of oriented gradients, for handwritten character recognition. The local gradient feature descriptors ...
Read More
Scene Text Detection Using Superpixel-Based Stroke Feature Transform and Deep Learning Based Region Classification

Scene text detection is a crucial step in end-to-end scene text recognition, a greatly challenging problem in computer vision. This paper proposes a novel scene text detection method that involves superpixel-based stroke feature transform (SSFT) and ...
Read More

Comments

Information & Contributors

Information

Published In

ICCV '13: Proceedings of the 2013 IEEE International Conference on Computer Vision

December 2013

3650 pages

ISBN:9781479928408

Publisher

IEEE Computer Society

United States

Publication History

Published: 01 December 2013

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

28
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Other Metrics

View Author Metrics

Citations

Cited By

View all

Fu ZXie HFang SWang YXing MZhang Y(2023)Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text DetectionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/352461719:1s(1-24)Online publication date: 3-Feb-2023
https://dl.acm.org/doi/10.1145/3524617
Qian JMa YLin CChen L(2022)Accelerating OCR-Based Widget Localization for Test Automation of GUI ApplicationsProceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering10.1145/3551349.3556966(1-13)Online publication date: 10-Oct-2022
https://dl.acm.org/doi/10.1145/3551349.3556966
Lu YGuo CDai XWang F(2022)Data-efficient image captioning of fine art paintings via virtual-real semantic alignment trainingNeurocomputing10.1016/j.neucom.2022.01.068490:C(163-180)Online publication date: 14-Jun-2022
https://dl.acm.org/doi/10.1016/j.neucom.2022.01.068
Rainarli ESuprapto Wahyono (2022)A decadeComputer Science Review10.1016/j.cosrev.2021.10043442:COnline publication date: 9-Apr-2022
https://dl.acm.org/doi/10.1016/j.cosrev.2021.100434
Wu JCai NLi FJiang HWang H(2020)Automatic detonator code recognition via deep neural networkExpert Systems with Applications: An International Journal10.1016/j.eswa.2019.113121145:COnline publication date: 1-May-2020
https://dl.acm.org/doi/10.1016/j.eswa.2019.113121
Xing JLiu GXiong JTavares JXu Z(2019)Oracle bone inscription detectionProceedings of the International Conference on Artificial Intelligence, Information Processing and Cloud Computing10.1145/3371425.3371434(1-8)Online publication date: 19-Dec-2019
https://dl.acm.org/doi/10.1145/3371425.3371434
Liu ZZhou WLi H(2019)AB-LSTMACM Transactions on Multimedia Computing, Communications, and Applications10.1145/335672815:4(1-23)Online publication date: 16-Dec-2019
https://dl.acm.org/doi/10.1145/3356728
Unar SWang XWang CWang Y(2019)A decisive content based image retrieval approach for feature fusion in visual and textual imagesKnowledge-Based Systems10.1016/j.knosys.2019.05.001179:C(8-20)Online publication date: 1-Sep-2019
https://dl.acm.org/doi/10.1016/j.knosys.2019.05.001
Zhu WLou JXia QRen M(2019)Single Shot Text Detector with Rotational Prior BoxesNeural Processing Letters10.1007/s11063-018-9810-z49:3(863-877)Online publication date: 1-Jun-2019
https://dl.acm.org/doi/10.1007/s11063-018-9810-z
Liu ZZhou WLi H(2019)Scene text detection with fully convolutional neural networksMultimedia Tools and Applications10.1007/s11042-019-7177-478:13(18205-18227)Online publication date: 1-Jul-2019
https://dl.acm.org/doi/10.1007/s11042-019-7177-4
Show More Cited By

View Options

View options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Abstract

Cited By

Recommendations

Text Detection in Natural Scene Images by Stroke Gabor Words

Recognition of handwritten characters using local gradient feature descriptors

Scene Text Detection Using Superpixel-Based Stroke Feature Transform and Deep Learning Based Region Classification

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Get Access

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations