Assignment 2 MLDS Lab

The document discusses using artificial neural networks for optical character recognition (OCR). It defines OCR as the conversion of scanned text images into machine-encoded text. The document then outlines the typical steps an OCR system takes: preprocessing the image, recognizing text using pattern matching or feature extraction, and post processing. It also introduces keras-ocr, an open-source library that provides pre-trained OCR models and tools for training new models using neural networks. The objective is to recognize optical characters from images using these artificial neural network techniques.

Uploaded by

Amruta More

Assignment -2

Title: Recognizing Optical Characters Using an ANN

Objective:
To recognize optical characters using an artificial neural network (ANN).
Theory:
What is Optical Character Recognition?
Optical Character Recognition (OCR) is the conversion of two-dimensional text data into
machine-encoded text by an electronic or mechanical device. The two-dimensional text data
can come from many sources: scanned documents such as PDF files, images containing text in
formats such as .png or .jpeg, signs such as traffic signs, or any other image that contains
text in some form. OCR has a wide range of applications.
In other words, OCR converts an image of text into a machine-readable text format. For
example, if you scan a form or a receipt, your computer saves the scan as an image file. You
cannot use a text editor to edit, search, or count the words in the image file. However, you
can use OCR to convert the image into a text document with its contents stored as text data.

Why is OCR important?
Most business workflows involve receiving information from print media. Paper forms,
invoices, scanned legal documents, and printed contracts are all part of business processes.
These large volumes of paperwork take a lot of time and space to store and manage. Though
paperless document management is the way to go, scanning the document into an image creates
challenges. The process requires manual intervention and can be tedious and slow.

Moreover, digitizing this document content creates image files with the text hidden within them.
Text in images cannot be processed by word processing software in the same way as text
documents. OCR technology solves the problem by converting text images into text data that
can be analyzed by other business software. You can then use the data to conduct analytics,
streamline operations, automate processes, and improve productivity.

How does OCR work?
The OCR engine or OCR software works through the following steps:
• Image acquisition
A scanner reads documents and converts them to binary data. The OCR software analyzes the
scanned image and classifies the light areas as background and the dark areas as text.
• Preprocessing
The OCR software first cleans the image and removes errors to prepare it for reading. These
are some of its cleaning techniques:
  • Deskewing: tilting the scanned document slightly to fix alignment issues introduced
    during the scan.
  • Despeckling: removing digital image spots and smoothing the edges of text images.
  • Cleaning up boxes and lines in the image.
  • Script recognition for multi-language OCR technology.
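The first two steps can be illustrated with a minimal sketch. It assumes a grayscale image stored as rows of 0-255 integers; the function names, the threshold of 128, and the 8-neighbour rule for despeckling are illustrative assumptions, not the behavior of any particular OCR product.

```python
def binarize(gray, threshold=128):
    """Classify pixels: 1 = text (dark), 0 = background (light).

    The threshold value is an assumption; real systems often pick it per image.
    """
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def despeckle(binary):
    """Clear text pixels that have no text pixel among their 8 neighbours."""
    height = len(binary)
    width = len(binary[0]) if height else 0
    cleaned = [row[:] for row in binary]
    for y in range(height):
        for x in range(width):
            if binary[y][x] != 1:
                continue
            has_neighbour = any(
                binary[ny][nx] == 1
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
                if (ny, nx) != (y, x)
            )
            if not has_neighbour:
                cleaned[y][x] = 0  # isolated dark pixel, treated as a speck
    return cleaned


# A tiny made-up scan: a small dark stroke plus one isolated dark pixel.
gray = [
    [250, 250, 250, 250, 250],
    [250,  20,  30, 250, 250],
    [250,  25, 250, 250, 250],
    [250, 250, 250, 250,  10],
]
binary = binarize(gray)
clean = despeckle(binary)
```

In this toy input the stroke in the upper left survives, while the lone dark pixel in the bottom-right corner is removed as a speck.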
• Text recognition
OCR software uses two main types of algorithm for text recognition: pattern matching and
feature extraction.
  • Pattern matching
Pattern matching works by isolating a character image, called a glyph, and comparing it with a
similarly stored glyph. Pattern matching works only if the stored glyph has a similar font and
scale to the input glyph. This method works well with scanned images of documents that have
been typed in a known font.
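As a rough illustration of pattern matching, the sketch below compares an input glyph with stored glyphs pixel by pixel and picks the best-scoring one. The 3x3 templates, the agreement score, and the function names are assumptions made up for this example; they only work when the input has the same size as the stored glyphs, which mirrors the font and scale limitation described above.

```python
# Hypothetical stored glyphs: '1' = ink, '0' = background.
TEMPLATES = {
    "I": ["010", "010", "010"],
    "L": ["100", "100", "111"],
    "T": ["111", "010", "010"],
}


def match_score(glyph, template):
    """Return the fraction of pixels on which two same-sized bitmaps agree."""
    total = 0
    same = 0
    for glyph_row, template_row in zip(glyph, template):
        for g, t in zip(glyph_row, template_row):
            total += 1
            same += g == t
    return same / total


def match_glyph(glyph, templates=TEMPLATES):
    """Return the label of the stored glyph that agrees best with the input."""
    return max(templates, key=lambda label: match_score(glyph, templates[label]))


# An "L" with one missing pixel still matches the stored "L" best.
noisy_l = ["100", "100", "110"]
best = match_glyph(noisy_l)
```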
  • Feature extraction
Feature extraction breaks down or decomposes the glyphs into features such as lines, closed
loops, line direction, and line intersections. It then uses these features to find the best match or
the nearest neighbor among its various stored glyphs.
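A minimal sketch of this idea follows. It assumes just two hand-picked features, the number of closed loops and the fraction of ink pixels, and a Euclidean nearest-neighbor lookup; the 5x5 glyphs and all names are hypothetical, and a real system would use many more features.

```python
def count_loops(bitmap):
    """Count closed loops: background regions that do not touch the border."""
    height, width = len(bitmap), len(bitmap[0])
    # Ink pixels are marked as seen so the flood fill only walks background.
    seen = [[cell == "#" for cell in row] for row in bitmap]

    def flood(y, x):
        stack = [(y, x)]
        seen[y][x] = True
        while stack:
            cy, cx = stack.pop()
            for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                if 0 <= ny < height and 0 <= nx < width and not seen[ny][nx]:
                    seen[ny][nx] = True
                    stack.append((ny, nx))

    # First remove the outer background, starting from every border pixel.
    for y in range(height):
        for x in range(width):
            on_border = y in (0, height - 1) or x in (0, width - 1)
            if on_border and not seen[y][x]:
                flood(y, x)

    # Every background region still unseen is enclosed by ink.
    loops = 0
    for y in range(height):
        for x in range(width):
            if not seen[y][x]:
                loops += 1
                flood(y, x)
    return loops


def extract_features(bitmap):
    """Return (closed loops, fraction of ink pixels) for a glyph."""
    ink = sum(row.count("#") for row in bitmap)
    return (count_loops(bitmap), ink / (len(bitmap) * len(bitmap[0])))


def nearest_glyph(bitmap, stored):
    """Return the label whose stored feature vector is closest to the input's."""
    features = extract_features(bitmap)

    def squared_distance(label):
        return sum((a - b) ** 2 for a, b in zip(features, stored[label]))

    return min(stored, key=squared_distance)


GLYPHS = {
    "O": [".###.", "#...#", "#...#", "#...#", ".###."],
    "8": [".###.", "#...#", ".###.", "#...#", ".###."],
    "L": ["#....", "#....", "#....", "#....", "#####"],
}
STORED = {label: extract_features(bitmap) for label, bitmap in GLYPHS.items()}

# An "O" with one extra ink pixel keeps its single loop and still maps to "O".
noisy_o = [".###.", "#...#", "#...#", "#..##", ".###."]
label = nearest_glyph(noisy_o, STORED)
```

Unlike pixel-by-pixel matching, the loop count does not change when the glyph is scaled or drawn in a slightly different font, which is why feature-based methods tend to tolerate more variation.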
• Post-processing
After analysis, the system converts the extracted text data into a computerized file. Some OCR
systems can create annotated PDF files that include both the before and after versions of the
scanned document.

What is keras-ocr?
keras-ocr provides out-of-the-box OCR models and an end-to-end training pipeline to build
new OCR models. It ships with pre-trained weights, which saves the time and effort of
training a model from scratch.
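A short usage sketch is given below. It follows the basic pattern from the keras-ocr README (`keras_ocr.pipeline.Pipeline`, `keras_ocr.tools.read`, and `pipeline.recognize`); the wrapper functions `recognize_words` and `words_only` and the file name in the usage note are hypothetical helpers added for illustration. Running the pipeline requires keras-ocr and TensorFlow to be installed, and the first call downloads the pre-trained weights.

```python
def recognize_words(image_paths):
    """Run the pre-trained keras-ocr pipeline on a list of image files."""
    # Imported lazily so that words_only() below can be used without keras-ocr.
    import keras_ocr

    pipeline = keras_ocr.pipeline.Pipeline()  # loads detector and recognizer
    images = [keras_ocr.tools.read(path) for path in image_paths]
    # One list per image; each entry is a (word, box) tuple.
    return pipeline.recognize(images)


def words_only(prediction_groups):
    """Drop the bounding boxes and keep only the recognized words per image."""
    return [[word for word, _box in group] for group in prediction_groups]
```

A typical call would look like `words_only(recognize_words(["receipt.png"]))`, where `receipt.png` stands for any local image that contains text.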
Artificial Neural Networks
• basic concepts of artificial neural networks
• building a perceptron classifier
• building a single layer neural network
• building a multilayer neural network
• analyzing sequential data with RNNs (possible)
• constructing an OCR engine
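To make the perceptron classifier topic concrete, here is a minimal sketch that trains a single perceptron to tell two glyphs apart. The 3x3 bitmaps, the labels (0 for "I", 1 for "L"), and the training settings are assumptions chosen for illustration; the assignment does not specify a dataset or architecture.

```python
def flatten(bitmap):
    """Turn rows of '0'/'1' characters into a flat list of ints."""
    return [int(ch) for row in bitmap for ch in row]


def predict(weights, bias, x):
    """Step activation: output 1 if the weighted sum is positive, else 0."""
    activation = bias + sum(w * xi for w, xi in zip(weights, x))
    return 1 if activation > 0 else 0


def train_perceptron(samples, labels, epochs=20, learning_rate=1.0):
    """Perceptron learning rule: adjust the weights only on misclassified samples."""
    weights = [0.0] * len(samples[0])
    bias = 0.0
    for _ in range(epochs):
        for x, target in zip(samples, labels):
            error = target - predict(weights, bias, x)
            if error:
                weights = [w + learning_rate * error * xi for w, xi in zip(weights, x)]
                bias += learning_rate * error
    return weights, bias


# Training data: label 0 for "I", label 1 for "L".
samples = [flatten(["010", "010", "010"]), flatten(["100", "100", "111"])]
labels = [0, 1]
weights, bias = train_perceptron(samples, labels)

# Glyphs with one flipped pixel are still classified like their clean versions.
noisy_l = flatten(["100", "100", "110"])
noisy_i = flatten(["010", "010", "011"])
predictions = (predict(weights, bias, noisy_l), predict(weights, bias, noisy_i))
```

A single perceptron can only separate two classes with a linear boundary. Recognizing a full character set needs one output per character and usually hidden layers, which is where the single layer and multilayer networks in the list above come in.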

Conclusion:
In this assignment, we studied how optical characters can be recognized using an ANN.
