FOTS: Fast Oriented Text Spotting with a Unified Network

Liu, Xuebo; Liang, Ding; Yan, Shi; Chen, Dagui; Qiao, Yu; Yan, Junjie

arXiv:1801.01671v1 (cs)

[Submitted on 5 Jan 2018 (this version), latest version 15 Jan 2018 (v2)]

Abstract: Incidental scene text spotting is considered one of the most difficult and valuable challenges in the document analysis community. Most existing methods treat text detection and recognition as separate tasks. In this work, we propose a unified, end-to-end trainable Fast Oriented Text Spotting (FOTS) network for simultaneous detection and recognition, sharing computation and visual information between the two complementary tasks. Specifically, RoIRotate is introduced to share convolutional features between detection and recognition. Benefiting from the convolution-sharing strategy, FOTS adds little computational overhead compared to a baseline text detection network, and the joint training method learns more generic features, making our method perform better than two-stage methods. Experiments on the ICDAR 2015, ICDAR 2017 MLT, and ICDAR 2013 datasets demonstrate that the proposed method significantly outperforms state-of-the-art methods. This further allows us to develop the first real-time oriented text spotting system, which surpasses all previous state-of-the-art results by more than 5% on the ICDAR 2015 text spotting task while running at 22.6 fps.

Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1801.01671 [cs.CV] (or arXiv:1801.01671v1 [cs.CV] for this version)
DOI: https://doi.org/10.48550/arXiv.1801.01671

Submission history

From: Xuebo Liu
[v1] Fri, 5 Jan 2018 08:41:57 UTC (5,736 KB)
[v2] Mon, 15 Jan 2018 11:30:21 UTC (5,737 KB)
