Neural-Based Hit-Count Feature Extraction Method for Telugu Script Optical Character Recognition

Swamy Das, M.; Rao, Kovvur Ram Mohan; Balaji, P.

doi:10.1007/978-981-10-8204-7_48

M. Swamy Das⁷,
Kovvur Ram Mohan Rao⁸ &
P. Balaji⁷

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 33))

633 Accesses
1 Citations

Abstract

The recognition accuracy and efficiency of any OCR system greatly depend on the feature extraction methods. There are several feature extraction methods each has its own characteristics. These methods differ in terms of the number features that they extract and the complexity. With less number of features, the recognition accuracy may be low, and with more number of features, the recognize time may be more. The features are to be selected in such a way that they could distinguish one character from other with minimum comparisons and gives less false positives and false negatives. The accuracy of an OCR can be improved by changing the feature extraction methods. Telugu is called Italian of the east. But it is surprising that there are not many OCRs that could detect Telugu characters with fairly good accuracy. The accuracy of OCRs available in the market are either highly objectionable or the price is very high. To address this issue, we took up this project. Other problems include the segmentation of overlapped characters and right feature extraction. We tried to solve these issues, by taking a segmented character from a word and check to find a correct match for it or tell that the character does not exist so that the particular character can be re segmented. In this work, a hit-count-based feature extraction method with neural networks is used for the fast recognition even though the training time is more. The experimental results show that the proposed hit-count-based feature method greatly reduces the time by maintaining the recognition accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

EUR 32.99 /Month

Get 10 units per month
Download Article/Chapter or Ebook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Hybrid Approach Optical Character Recognition for Mizo Using Artificial Neural Network

A Survey on Devanagari Character Recognition

Recognition of Odia Conjunct Characters Using a Hybrid ANN-DE Classification Technique

References

Krishnan P, Jawahar C et al (2014) Towards a robust OCR system for indic scripts. DAS
Google Scholar
Singh A, Bacchuwar K et al (2012) A survey of OCR applications. IJMLC 2(3)
Google Scholar
Sankaran N, Jawahar C (2013) Devanagari text recognition: a transcription based formulation. ICDAR
Google Scholar
Gonzalez RC, Woods RE (2001) Digital image processing. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA
Google Scholar
Borovikov E (2014) A survey of modern optical character recognition techniques. AMS 2004, arXiv:1412.4183v1 [cs.CV] Dec 2014
Varalaxmi A, Negi A et al (2012) DataSet generation and feature extraction for Telugu hand-written recognition. IJCST 3(2):57–59
Google Scholar
http://dli.iiit.ac.in
http://www.archive.org/details/millionbooks
http://www.tesseract.org/
http://www.wikipedia.org/OCR

Download references

Author information

Authors and Affiliations

CSE Department, CBIT, Hyderabad, India
M. Swamy Das & P. Balaji
IT Department, Vasavi College of Engineering, Hyderabad, India
Kovvur Ram Mohan Rao

Authors

M. Swamy Das
View author publications
You can also search for this author in PubMed Google Scholar
Kovvur Ram Mohan Rao
View author publications
You can also search for this author in PubMed Google Scholar
P. Balaji
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M. Swamy Das .

Editor information

Editors and Affiliations

Guru Nanak Institutions, Ibrahimpatnam, Telangana, India
H. S. Saini
Guru Nanak Institutions Technical Campus, Ibrahimpatnam, Telangana, India
Ravi Kishore Singh
Department of Electrical and Computer Engineering, Rutgers University, New Brunswick, NJ, USA
Vishal M. Patel
Department of Electronics and Communication Engineering, Guru Nanak Institutions Technical Campus, Ibrahimpatnam, Telangana, India
K. Santhi
Research and Development, Guru Nanak Institutions Technical Campus, Ibrahimpatnam, Telangana, India
S.V. Ranganayakulu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Swamy Das, M., Rao, K.R.M., Balaji, P. (2019). Neural-Based Hit-Count Feature Extraction Method for Telugu Script Optical Character Recognition. In: Saini, H., Singh, R., Patel, V., Santhi, K., Ranganayakulu, S. (eds) Innovations in Electronics and Communication Engineering. Lecture Notes in Networks and Systems, vol 33. Springer, Singapore. https://doi.org/10.1007/978-981-10-8204-7_48

Download citation

DOI: https://doi.org/10.1007/978-981-10-8204-7_48
Published: 29 August 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8203-0
Online ISBN: 978-981-10-8204-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Neural-Based Hit-Count Feature Extraction Method for Telugu Script Optical Character Recognition

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Hybrid Approach Optical Character Recognition for Mizo Using Artificial Neural Network

A Survey on Devanagari Character Recognition

Recognition of Odia Conjunct Characters Using a Hybrid ANN-DE Classification Technique

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Neural-Based Hit-Count Feature Extraction Method for Telugu Script Optical Character Recognition

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Hybrid Approach Optical Character Recognition for Mizo Using Artificial Neural Network

A Survey on Devanagari Character Recognition

Recognition of Odia Conjunct Characters Using a Hybrid ANN-DE Classification Technique

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation