Text Recognization of Product For Blind Person Using MATLAB
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 1048
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 12 | Dec 2018 www.irjet.net p-ISSN: 2395-0072
i) Identify the common deficiency in most character recognition software/tools by calculating the recognition rate of each character and digit, and find the characters and digits whose recognition rate is very low.

ii) Design and develop a model to eliminate the common deficiency identified.

iii) Develop the algorithm to implement the above model.

iv) Test and evaluate performance by analyzing the results of the model.

The algorithms used previously cannot handle complex backgrounds and multiple patterns, and cannot extract text information from hand-held objects. In assistive reading systems for blind persons, it is very challenging for users to position the region of interest within the center of the camera's view. As of now, there are still no acceptable solutions.

2.2 Conversion of RGB to Gray Scale

To make the system more robust, i.e. able to work under noisy conditions or against a complicated background, image pre-processing methods such as noise filtering are applied. The processing time of the overall pipeline is long, so to reduce it the input image is converted from RGB to gray scale. Pre-processing in this paper is a technique to improve the quality of images; its main purpose is to enhance the image and extract useful information from it. Two pre-processing tasks, thresholding and noise removal, are performed here.

There are a number of binarization methods in document analysis but few in text analysis. In this paper, we reviewed the related binarization methods for text analysis. Thresholding techniques are quite popular in document analysis. Several improvements over thresholding techniques have also been proposed recently in document analysis, and the same methods are being extended to scene text binarization.
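The pre-processing steps of Section 2.2 (gray-scale conversion, noise removal, thresholding) can be sketched as follows. This is a minimal illustration in plain Python rather than the MATLAB implementation used in the project. The luminance weights, the 3x3 median filter and Otsu's method for choosing the threshold are assumptions, since the paper does not name a specific filter or thresholding rule, and all function names are hypothetical.

```python
def rgb_to_gray(image):
    """Convert rows of (r, g, b) tuples into rows of 0-255 gray levels."""
    # Assumed weights: the common ITU-R BT.601 luminance coefficients.
    return [[int(round(0.299 * r + 0.587 * g + 0.114 * b)) for r, g, b in row]
            for row in image]


def median_filter3(gray):
    """3x3 median filter; pixels at the border use the neighbours that exist."""
    h, w = len(gray), len(gray[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            window = sorted(gray[j][i]
                            for j in range(max(0, y - 1), min(h, y + 2))
                            for i in range(max(0, x - 1), min(w, x + 2)))
            row.append(window[len(window) // 2])
        out.append(row)
    return out


def otsu_threshold(gray):
    """Gray level that maximizes the between-class variance (Otsu's method)."""
    hist = [0] * 256
    for row in gray:
        for v in row:
            hist[v] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = sum0 = 0
    for t in range(256):
        w0 += hist[t]            # pixels at or below level t
        sum0 += t * hist[t]
        w1 = total - w0          # pixels above level t
        if w0 == 0 or w1 == 0:
            continue
        m0, m1 = sum0 / w0, (sum_all - sum0) / w1
        between = w0 * w1 * (m0 - m1) ** 2
        if between > best_var:
            best_t, best_var = t, between
    return best_t


def binarize_dark_text(gray, threshold):
    """Mark pixels at or below the threshold as text (1), the rest as background (0)."""
    return [[1 if v <= threshold else 0 for v in row] for row in gray]


# Tiny synthetic "label": a dark two-pixel-wide stroke on a bright background.
dark, bright = (20, 20, 20), (230, 230, 230)
image = [[bright, bright, dark, dark, bright, bright] for _ in range(5)]

gray = median_filter3(rgb_to_gray(image))
threshold = otsu_threshold(gray)
mask = binarize_dark_text(gray, threshold)
for row in mask:
    print(row)
```

Working on a single gray channel instead of three color channels is what reduces the processing time, and the binary mask is the form in which text regions would be passed on to the later stages.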
The image, i.e. the printed text or label, is captured by the camera used in the project. Initially this image contains noise in the background. This complicated background can be removed by the stroke width transform algorithm, which helps to recognize characters by their shape and width by computing, for each pixel, the width of the stroke from its start point to its end point.

Text recognition is performed by off-the-shelf OCR prior to the output of informative words from the localized text regions. A text region labels the minimum rectangular area that accommodates the characters inside it, so the border of the text region touches the edge boundary of the text characters. However, OCR performs better if text regions are first assigned proper margin areas and binarized to segment text characters from the background. We propose to use
a template matching algorithm for OCR. The output of the OCR is a text file containing the product label (its name) in textual form. The audio output component informs the blind user of the recognized text in the form of speech or audio.

3. Algorithm used for Image to Audio Conversion

Fig. Flow chart of the Proposed Method

3.3 Stroke Width Transform

The algorithm receives the RGB image and converts it into a gray image of the same size, after which text can be marked from the region of interest. It has three important stages: first, and most important, the stroke width transform; then the grouping of image pixels by their stroke width; and finally the grouping of letter candidates into regions of text. In the Stroke Width Transform, a stroke in the image is treated as a continuous band of constant width. The figure shows an example of a stroke in an image. The Stroke Width Transform is an operation that calculates, for each pixel, the width of the stroke containing it.

4. Result and Discussion

Fig. Region and stroke width image

As shown in the result figure, toggle button 1 proceeds to character recognition using OCR. The result image shows the regions that remain after removing non-text regions based on geometric properties. OCR stands for optical character recognition, a field of computer science that recognizes image-based text in photos and transforms it into digital characters. OCR works like the human brain's ability to recognize letters, numbers and symbols. OCR can read both handwritten and printed text. The performance of OCR is directly related to the quality of the input documents and pictures.
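The stroke width idea of Section 3.3 can be illustrated with a much simpler stand-in for the full transform. The sketch below, in plain Python rather than the project's MATLAB code, takes a binary text mask and assigns each foreground pixel the shorter of its horizontal and vertical run lengths; the full Stroke Width Transform instead follows gradient directions between opposite stroke edges. The function names and the 0.35 variation threshold are hypothetical, chosen only for illustration.

```python
from statistics import mean, pstdev


def run_length(mask, y, x, dy, dx):
    """Length of the foreground run through (y, x) along direction (dy, dx)."""
    h, w = len(mask), len(mask[0])
    length = 1
    for sign in (1, -1):
        j, i = y + sign * dy, x + sign * dx
        while 0 <= j < h and 0 <= i < w and mask[j][i]:
            length += 1
            j += sign * dy
            i += sign * dx
    return length


def stroke_width_map(mask):
    """Approximate per-pixel stroke width; background pixels get 0."""
    # Simplification: only horizontal and vertical runs are measured.
    return [[min(run_length(mask, y, x, 0, 1), run_length(mask, y, x, 1, 0))
             if mask[y][x] else 0
             for x in range(len(mask[0]))]
            for y in range(len(mask))]


def stroke_width_variation(mask):
    """Standard deviation of the stroke widths divided by their mean."""
    widths = [v for row in stroke_width_map(mask) for v in row if v > 0]
    if not widths:
        return None
    return pstdev(widths) / mean(widths)


def looks_like_text(mask, max_variation=0.35):
    """Hypothetical rule: keep regions whose stroke width is nearly constant."""
    variation = stroke_width_variation(mask)
    return variation is not None and variation <= max_variation


# A bar of constant width (stroke-like) versus a filled triangle (blob-like).
bar = [[0, 0, 1, 1, 0, 0] for _ in range(7)]
triangle = [[1 if x <= y else 0 for x in range(5)] for y in range(5)]

print(stroke_width_variation(bar), looks_like_text(bar))
print(stroke_width_variation(triangle), looks_like_text(triangle))
```

Strokes of nearly constant width give a low variation, so a candidate region with a high variation is more likely to be background clutter than a character.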
6. Future scope