Design of An OCR System and Its Hardware Implementation
https://doi.org/10.22214/ijraset.2021.39217
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.429
Volume 9 Issue XII Dec 2021- Available at www.ijraset.com
Abstract: In this paper, we present a scheme for developing a complete OCR system for printed uppercase English alphabet text
in different fonts and sizes, so that the system can be used in banking, corporate, legal and other industries. The OCR
system consists of several modules: preprocessing, segmentation, feature extraction and recognition. The preprocessing step
includes gray-level conversion, binary conversion, etc. After the features of the segmented characters have been extracted,
an artificial neural network is used for character recognition. Efforts have been made to improve the
performance of character recognition using artificial neural network techniques. The proposed OCR system accepts
printed document images from a file and is implemented in MATLAB R2014a.
Key words: OCR, Printed text, Barcode recognition
I. INTRODUCTION
In today's world of information, countless forms, reports, letters and contracts are generated every day; hence,
the need to retrieve, archive, update and distribute printed documents has become increasingly important [2, 4]. An available
technology that automates these tasks on computer media is optical character recognition (OCR), which transforms printed
documents into ASCII characters that a computer can recognize, enabling compact storage, editing, fast retrieval and other
file manipulations. Optical character recognition systems are useful for automatically reading the
contents of a document for storage in computer memory. Such a system is the basis for many different types of applications in
various fields, many of which we use in our daily lives. Because it is cost-effective and saves time, businesses, post offices, banks,
security systems and even the field of robotics employ it as the basis of their operations. The document image itself can
be either a scanned machine-printed image or an image captured by a camera or a mobile phone. A computer system equipped with
such an OCR system can speed up input operations and reduce possible human errors. Recognition of printed
characters is itself a challenging problem, since the same character varies with changes of font and with the introduction of
different types of noise. Differences in image shape and size make the recognition task difficult if preprocessing, feature extraction
and recognition are not robust. Noise pixels may also be introduced when the image is scanned. Therefore, a good
character recognition approach must eliminate the noise after reading the binary image data, smooth the image for better recognition,
extract features efficiently, train the system and classify patterns.
In [4], a method for image segmentation of printed documents is presented. Segmentation is typically used to trace objects and
boundaries, such as lines and curves, in an image. Reliable segmentation of the text is necessary for classification
and recognition. The main aim of segmentation is to partition the document image into homogeneous regions such as
text blocks, image blocks, lines and words. That paper introduces a clustering-based neighbor method and a direction-based
line segmentation method for image segmentation, and reports the segmentation of document images using various
algorithms.
In [5], the authors present a simple method using a self-organizing map neural network (SOM NN) for
character recognition tasks, and describe the results of training the SOM NN to perform optical character recognition on images of
printed characters. 49 features are used to distinguish between 62 characters (the uppercase and lowercase letters of the
English language and the numerals). The implemented program recognizes text by analyzing an image file. The text to be recognized is
currently limited to characters typed in bold Verdana at a font size of 18. The program can handle
non-ideal images (noisy, colored text, rotated image). Recognition accuracy is consistently 100% for ideal images,
but ranges between 80% and 100% for non-ideal images.
In [6], neural networks are used for character recognition. The paper describes the creation of a character recognition system,
in which creating a character matrix and a corresponding suitable network structure is key. The feed-forward algorithm gives
insight into the inner workings of a neural network, followed by the backpropagation algorithm, which comprises training,
calculating the error and modifying the weights. The paper attempts to recognize handwritten English characters using a
multilayer perceptron with one hidden layer; the network consisted of three layers with 680 inputs and 26 outputs.
A. Experimental Database
1) Creation of templates: templates are created for the uppercase letters 'A' to 'Z'.
2) These databases are created for 10 different fonts and 5 different sizes.
3) Total number of samples: 26 × 5 × 10 = 1300 character samples.
B. Database Templates
1) Database 1 to Database 10 (template images).
2) Network Training Phase: The training phase consists of computing the 5-element feature vector from each of the 260 images
of the training set. The network is trained using scaled conjugate gradient backpropagation. Training
stops automatically when generalization stops improving.
3) Classification: This is done using an MLP (multi-layer perceptron) [5]. The MLP has 5 inputs, which take the 5-element
feature vector of each character, and 26 outputs, which discriminate between the characters. The activation transfer
functions are of log-sigmoid type. The performance, error histogram and ROC (Receiver Operating Characteristic) plots obtained
are shown in the figures.
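The classification step described above can be sketched as a forward pass through a small MLP. This is only a minimal illustration, assuming a single hidden layer of 10 units and random placeholder weights (neither is stated in the text); the names `logsig`, `init_layer`, `layer_forward` and `classify` are hypothetical, and the sample feature vector is purely illustrative.

```python
import math
import random

def logsig(x):
    """Log-sigmoid activation: 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))

def init_layer(n_in, n_out, rng):
    """Random weights and zero biases; placeholders for trained values."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases

def layer_forward(inputs, weights, biases):
    """One fully connected layer followed by the log-sigmoid activation."""
    return [logsig(sum(w * x for w, x in zip(row, inputs)) + b)
            for row, b in zip(weights, biases)]

def classify(features, hidden, output):
    """Map a 5-element feature vector to one of 26 uppercase letters."""
    h = layer_forward(features, *hidden)
    scores = layer_forward(h, *output)
    best = max(range(len(scores)), key=scores.__getitem__)
    return chr(ord("A") + best), scores

# 5 inputs and 26 outputs follow the text; N_HIDDEN = 10 is an assumption.
N_FEATURES, N_HIDDEN, N_CLASSES = 5, 10, 26
rng = random.Random(0)
hidden = init_layer(N_FEATURES, N_HIDDEN, rng)
output = init_layer(N_HIDDEN, N_CLASSES, rng)

letter, scores = classify([0.9, 0.57, 0.85, 0.74, 0.59], hidden, output)
print(letter, len(scores))
```

With untrained weights the predicted letter is arbitrary; the sketch only shows how a 5-element vector flows through log-sigmoid layers to 26 output scores, the largest of which selects the character.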
2) Training State Plot and Error Histogram Plot: The Training State plot obtained with nprtool in MATLAB shows the progress of
other training variables, such as the gradient magnitude and the number of validation checks. The Error Histogram plot shown in
the figure shows the distribution of the network error.
3) ROC Plot: An ROC (Receiver Operating Characteristic) curve is a plot of operating points showing the trade-off between a
classifier's TP (True Positive) rate and FP (False Positive) rate. It is used to check the quality of the classifier. The True
Positive Rate is the percentage of target samples that are correctly classified, while the False Positive Rate is the percentage
of non-target samples that are incorrectly classified. The ROC plot obtained shows the accuracy of the ANN classifier, as the
curve mostly hugs the left and top edges. A few errors remain because the training dataset is small; a larger dataset should
make the output more accurate. The figure below shows the ROC plots for the Training, Validation and Testing data sets.
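The TP-rate and FP-rate definitions above can be made concrete with a short sketch that sweeps a decision threshold over classifier scores. It assumes a binary (target versus non-target) view of one output class; the function names `roc_points` and `auc` and the sample scores and labels are hypothetical, not data from the paper.

```python
def roc_points(scores, labels):
    """Return (FP rate, TP rate) pairs, one per decision threshold.

    scores: classifier outputs, higher means more likely a target sample
    labels: 1 for target samples, 0 for non-target samples
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    points = [(0.0, 0.0)]  # threshold above every score: nothing accepted
    for thr in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= thr and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= thr and y == 0)
        points.append((fp / n_neg, tp / n_pos))
    return points

def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2.0
               for (x0, y0), (x1, y1) in zip(points, points[1:]))

# Illustrative scores and labels only.
scores = [0.95, 0.90, 0.80, 0.60, 0.40, 0.30, 0.20, 0.10]
labels = [1, 1, 1, 0, 1, 0, 0, 0]
pts = roc_points(scores, labels)
print(pts)
print("AUC =", auc(pts))  # 0.9375 for this toy data
```

A curve that hugs the left and top edges corresponds to an area close to 1; a classifier that guesses at random would trace the diagonal, with an area close to 0.5.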
b) Snapshot of Simulation Diagram Obtained Using MATLAB: With Input1 = [0.9;0.57;0.85;0.74;0.59] and a sampling time of
'1', the output y1 is shown in the simulation diagram. Likewise, by changing the input we can obtain the desired output.
9x9-bit multiplier :1
# Adders/Subtractors :1
9-bit adder :1
# Latches :1
9-bit latch :1
# Comparators :2
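The text does not state which sigmoid approximation was implemented in hardware. As a hedged illustration only, the sketch below assumes a piecewise second-order approximation, a common choice for FPGA designs because it needs only one multiplication, shifts, an addition and two range comparisons. The function name `sigmoid_pw2` and the saturation limits of ±4 are assumptions.

```python
import math

def sigmoid_pw2(x):
    """Piecewise second-order approximation of the sigmoid (assumed scheme)."""
    if x <= -4.0:              # first range comparison: saturate low
        return 0.0
    if x >= 4.0:               # second range comparison: saturate high
        return 1.0
    t = 1.0 - abs(x) / 4.0     # divide by 4 is a 2-bit right shift in hardware
    half_sq = 0.5 * t * t      # one multiplication (a square) and a 1-bit shift
    return half_sq if x < 0 else 1.0 - half_sq

def sigmoid(x):
    """Reference log-sigmoid for comparison."""
    return 1.0 / (1.0 + math.exp(-x))

# Compare against the exact function on a grid over [-8, 8].
max_err = max(abs(sigmoid_pw2(i / 100.0) - sigmoid(i / 100.0))
              for i in range(-800, 801))
print("max abs error on [-8, 8]: %.4f" % max_err)
```

The comparison is done in floating point for readability; a hardware version would use fixed-point arithmetic, which adds quantization error on top of the approximation error measured here.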
IX. CONCLUSION
Of the main phases of the work, preprocessing of the image was done using common preprocessing methods. In the
second phase, segmentation, we used the Connected Component Algorithm (CCA). This approach promotes speed, accuracy and simplicity.
We have shown code and examples for 8-connectivity CCA only. In the feature extraction phase we used the Quin Tree method,
which is very simple and has high accuracy. The most important part, recognition of characters, is done using an ANN in
MATLAB. The second part of the project covers the hardware implementation of the ANN activation function, i.e. the sigmoid function.
Based on the results obtained, the proposed method produced the best accuracy rate. The paper also
presents a comparison of the Simulink behavior of the software and of the hardware implementation on an FPGA. Thus this paper
presents an overview of the design phases of an OCR system, the methodologies used for best performance and its applications in different fields.
REFERENCES
[1] "Historical Review of OCR Research and Development" by S. Mori, Member IEEE, Ching Y. Suen, Fellow IEEE, Proceeding IEEE, 80, No 7, July 1992.
[2] "Geometrical-matrix feature extraction for on-line handwritten characters recognition" by Saad M. Ismail, Siti Norul Huda Sheikh Abdullah, Journal of
Theoretical and Applied Information Technology, 10th March 2013, Vol. 49, No. 1.
[3] "Optical Character Recognition" by Ravina Mithe, Supriya Indalkar, Nilam Divekar, International Journal of Recent Technology and Engineering (IJRTE),
ISSN: 2277-3878, Volume-2, Issue-1, March 2013.
[4] "Text Detection From Documented Image Using Image Segmentation" by Santosh, Dr. Jenila Livingston L.M., Research Scholar at VIT, Chennai.,
International Journal of Technology Enhancements and emerging engineering research, vol 1, issue 4 ISSN 2347-4289 , 2013.
[5] "Optical Character Recognition Program for Images of Printed Text using a Neural Network" by Velappa Ganapathy, Charles C. H. Lean ,School of
Engineering, Monash University Malaysia.
[6] "Handwritten English Character Recognition using Neural Network" by Vijay Patil and Sanjay Shimpi, Department of Computer Engineering, Vidyalankar
Institute of Technology, Wadala, Mumbai, International Journal of Computer Science & Communication, Vol. 1, No. 2, July-December 2010.
[7] "Document Analysis and Recognition (ICDAR)" by Peng Ye, Language & Media Process Lab, Univ. of Maryland, USA. 12th International Conference on,
2013.
[8] "A Detailed Review of Feature Extraction in Image Processing Systems " by Kumar, G. Bhatia, P.K., 2014. Fourth International Conference. Publication Year:
2014.
[9] "Transactions on Pattern Analysis and Machine Intelligence" by Venu Govindaraju, Senior Member, IEEE, and Sargur N. Srihari, Fellow( IEEE), IEEE , Vol.
22, No.4, April 2000.
[10] "Segmentation of Touching Character in Printed Devnagari and Bangla Script Using Fuzzy Multi factorial Analysis" by Utpal Garain and Bidyut B.
Chaudhary, IEEE Transaction on System, Man and Cybernetics- Part C: Applications and Reviews, 32, November 2002.
[11] "Object Recognition System using Template Matching Based on Signature and Principal Component Analysis" by Inad A. Aljarrah, Ahmed S Goraib &
Ismail M. Akhter, IJDIWC, 2012.
[12] "OCR Error Detection and Correction of an Inflectional Indian Language Script" by B. B. Chaudhary and U. Pal , IEEE Proceeding of 13th International
Conference on 25-29 Aug., 3, 1996.
[13] "A brief review and survey of feature extraction methods for Devnagari OCR" by Holambe A.N. , Thool, R.C. Jagade, S.M. ,ICT and Knowledge
Engineering (ICT & Knowledge Engineering), 2011 9th International Conference, Digital Object Identifier: 10.1109/ICTKE.2012.6152421 Publication
Year: 2012 .
[14] "Script Identification from Indian Documents" by G.D. Joshi, S. Garg and J. Sivaswamy Proc. IAPR Workshop Document Analysis Systems, Feb. 2006.
[15] "A Devnagari OCR and A Brief Overview of OCR for Indian Script" by Veena Bansal and R.M.K. Sinha, PROC Symposium on Transaction support System
(STRANS 2001), Feb. 15-17, 2001, Kanpur, India.
[16] "Digital Image Processing using MATLAB" by Rafael C. Gonzalez, Richard E. Woods, Steven L. Eddins.
[17] "Optical Character Recognition" Line Eikvil , December 1993.
[18] "Sigmoid Function Approximation for ANN Implementation in FPGA Devices" by Djalal Eddine Khodja, Aissa Kheldoun, and Larbi Refoufi.
[19] "Character Recognition Systems" by Mohamed Cheriet, Nawwaf Kharma, Cheng-Lin Liu and Ching Y. Suen, John Wiley & Sons, Inc., Hoboken, New Jersey, ISBN
978-0-471-41570-1, 2007.
[20] "Neural Network Implementation Using FPGAs" by Dhirajkumar S. Jinde et al., (IJCSIT) International Journal of Computer Science and Information
Tech., 2015.