Deep Neural Network for Semantic-based Text Recognition in Images

Zheng, Yi; Wang, Qitong; Betke, Margrit

Computer Science > Computer Vision and Pattern Recognition

arXiv:1908.01403 (cs)

[Submitted on 4 Aug 2019 (v1), last revised 9 Dec 2019 (this version, v3)]

Title:Deep Neural Network for Semantic-based Text Recognition in Images

Authors:Yi Zheng, Qitong Wang, Margrit Betke

View PDF

Abstract:State-of-the-art text spotting systems typically aim to detect isolated words or word-by-word text in images of natural scenes and ignore the semantic coherence within a region of text. However, when interpreted together, seemingly isolated words may be easier to recognize. On this basis, we propose a novel "semantic-based text recognition" (STR) deep learning model that reads text in images with the help of understanding context. STR consists of several modules. We introduce the Text Grouping and Arranging (TGA) algorithm to connect and order isolated text regions. A text-recognition network interprets isolated words. Benefiting from semantic information, a sequenceto-sequence network model efficiently corrects inaccurate and uncertain phrases produced earlier in the STR pipeline. We present experiments on two new distinct datasets that contain scanned catalog images of interior designs and photographs of protesters with hand-written signs, respectively. Our results show that our STR model outperforms a baseline method that uses state-of-the-art single-wordrecognition techniques on both datasets. STR yields a high accuracy rate of 90% on the catalog images and 71% on the more difficult protest images, suggesting its generality in recognizing text.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1908.01403 [cs.CV]
	(or arXiv:1908.01403v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1908.01403

Submission history

From: Yi Zheng [view email]
[v1] Sun, 4 Aug 2019 21:32:31 UTC (9,674 KB)
[v2] Thu, 15 Aug 2019 20:43:44 UTC (9,674 KB)
[v3] Mon, 9 Dec 2019 19:46:11 UTC (9,674 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Neural Network for Semantic-based Text Recognition in Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Neural Network for Semantic-based Text Recognition in Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators