Joint Energy-based Detection and Classificationon of Multilingual Text Lines

Milevskiy, Igor; Boykov, Yuri

Computer Science > Computer Vision and Pattern Recognition

arXiv:1407.6082 (cs)

[Submitted on 23 Jul 2014]

Title:Joint Energy-based Detection and Classificationon of Multilingual Text Lines

Authors:Igor Milevskiy, Yuri Boykov

View PDF

Abstract:This paper proposes a new hierarchical MDL-based model for a joint detection and classification of multilingual text lines in im- ages taken by hand-held cameras. The majority of related text detec- tion methods assume alphabet-based writing in a single language, e.g. in Latin. They use simple clustering heuristics specific to such texts: prox- imity between letters within one line, larger distance between separate lines, etc. We are interested in a significantly more ambiguous problem where images combine alphabet and logographic characters from multiple languages and typographic rules vary a lot (e.g. English, Korean, and Chinese). Complexity of detecting and classifying text lines in multiple languages calls for a more principled approach based on information- theoretic principles. Our new MDL model includes data costs combining geometric errors with classification likelihoods and a hierarchical sparsity term based on label costs. This energy model can be efficiently minimized by fusion moves. We demonstrate robustness of the proposed algorithm on a large new database of multilingual text images collected in the pub- lic transit system of Seoul.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1407.6082 [cs.CV]
	(or arXiv:1407.6082v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1407.6082

Submission history

From: Igor Milevskiy [view email]
[v1] Wed, 23 Jul 2014 01:14:01 UTC (5,106 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2014-07

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Igor Milevskiy
Yuri Boykov

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Joint Energy-based Detection and Classificationon of Multilingual Text Lines

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Joint Energy-based Detection and Classificationon of Multilingual Text Lines

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators