Chapter 1

The document discusses a proposed universal real-time language independent optical character recognition (OCR) system. It defines OCR and its evolution from optical to digital techniques using pattern recognition and feature extraction. The system aims to recognize handwritten text in multiple languages in real-time as it is entered. It will use image processing techniques like thresholding and morphological operations to preprocess images for feature extraction and pattern recognition. The goals of the system are to support multiple languages, reduce manual data entry work, improve data quality, understand varied handwriting, and introduce a dynamic OCR system to users.

Uploaded by

Er Mukesh Mistry

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

81 views

Chapter 1

Uploaded by

Er Mukesh Mistry

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 4

VACOE, AHMEDNAGAR Chapter 1

INTRODUCTION

1.1 PROBLEM DEFINITION & RELEVENT THEORY:Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic translation of images of handwritten or typewritten text (usually captured by a scanner) into machine-editable text. OCR is a field of research in pattern recognition, artificial intelligence and machine vision. hough academic research in the field continues, the focus on OCR has shifted to implementation of proven techni!ues. Optical character recognition (using optical techni!ues such as mirrors and lenses) and digital character recognition (using scanners and computer algorithms) were originally considered separate fields. "ecause very few applications survive that use true optical techni!ues, the OCR term has now been broadened to include digital image processing as well. #arly systems re!uired training (the provision of $nown samples of each character) to read a specific font. %&ntelligent% systems with a high degree of recognition accuracy for most fonts are now common. 'ome systems are even capable of reproducing formatted output that closely approximates the original scanned page including images, columns and other non-textual components. (owever such techni!ues are limited to static scanned images, and not real time ac!uisi)ed handwritten characters. *ith advent of digital input devices li$e 'tylus, etc. the user can enter handwritten text as input to the OCR system. he system will have the capability of recogni)ing and updating the entered characters on-the-fly. his enables the user to rectify the errors at the time of entering the text and not when he+s done entering all the characters. he system will also host a universal language recognition capability. his can be achieved using pattern rec !n"t" n and #eat$re e%tract" n. ,attern recognition is a subtopic of machine learning. &t can be defined as %the act of ta$ing in raw data and ta$ing an action based on the category of the data%. -ost research in pattern recognition is about methods for supervised learning and unsupervised learning. ,attern recognition aims to classify data (patterns) based on either a priori $nowledge or on statistical information extracted from the patterns. he patterns to be classified are usually groups of measurements or observations, defining points in an appropriate multidimensional space.

Universal Real-Time Language Inde enden! OCR"

VACOE, AHMEDNAGAR

Feat$re E%tract" n &n pattern recognition and in image processing, /eature extraction is a special form of dimensionality reduction. *hen the input data to an algorithm is too large to be processed and it is suspected to be notoriously redundant (much data, but not much information) then the input data will be transformed into a reduced representation set of features (also named features vector). ransforming the input data into the set of features is called features extraction. &f the features extracted are carefully chosen it is expected that the features set will extract the relevant information from the input data in order to perform the desired tas$ using this reduced representation instead of the full si)e input. he system also employs some of the best image processing algorithms for the input to be suitable for feature extraction and pattern recognition. /ollowing are some of the image processing algorithms that will be used0 o o o o o 1rayscale Conversion hresholding #rosion 2 3ilation 'harpen 2 "lur &mage 'egmentation 4

Universal Real-Time Language Inde enden! OCR"

VACOE, AHMEDNAGAR

Universal Real-Time Language Inde enden! OCR"

VACOE, AHMEDNAGAR

1.& 'COPE:he proposed OCR system covers the range of topics, many shared with mainstream image processing. 'ome of the most important are0o 6nderstanding image user+s needs and information see$ing behavior. o &dentification of suitable ways of describing image content. o #fficient any number of feature extraction techni!ues. o ,roviding usable human interface to OCR within the vicinity of system.

1.( OB)ECTIVE:o o o o o o support more than one languages. o reduce total manpower re!uired for manual data entry wor$. o improve the !uality of the degraded data. o understand the different users handwritten scripts. o introduce the dynamic OCR system to the users.

Universal Real-Time Language Inde enden! OCR"

Fichas de Aprendizaje IBM AI - Quizlet
No ratings yet
Fichas de Aprendizaje IBM AI - Quizlet
26 pages
My Courses 2022 Second Summer CSC 7333 For Jianhua Chen Final Exam Final Exam
No ratings yet
My Courses 2022 Second Summer CSC 7333 For Jianhua Chen Final Exam Final Exam
16 pages
CSE Final Year Project Proposal
No ratings yet
CSE Final Year Project Proposal
10 pages
Character Recoganization
No ratings yet
Character Recoganization
6 pages
EEL6825-Character Recognition Algorithm Using Correlation.
No ratings yet
EEL6825-Character Recognition Algorithm Using Correlation.
8 pages
Optical Character Recognition (OCR) System
No ratings yet
Optical Character Recognition (OCR) System
5 pages
Optical Character Recognition Project Report
No ratings yet
Optical Character Recognition Project Report
71 pages
Urdu Optical Character Recognition OCR Thesis Zaheer Ahmad Peshawar Its Soruce Code Is Available On MATLAB Site 21-01-09
100% (1)
Urdu Optical Character Recognition OCR Thesis Zaheer Ahmad Peshawar Its Soruce Code Is Available On MATLAB Site 21-01-09
61 pages
OCR Using Image Processing
No ratings yet
OCR Using Image Processing
8 pages
Optical Character Recognition
100% (1)
Optical Character Recognition
17 pages
Confluence 2018 8442875
No ratings yet
Confluence 2018 8442875
4 pages
Build Your Own Optical Character Recognition (Ocr) System Using Google'S Tesseract and Opencv
No ratings yet
Build Your Own Optical Character Recognition (Ocr) System Using Google'S Tesseract and Opencv
10 pages
SL NO. Name Usn Number Roll No
No ratings yet
SL NO. Name Usn Number Roll No
10 pages
IJNRD2304119
No ratings yet
IJNRD2304119
5 pages
FFGB
No ratings yet
FFGB
12 pages
Optical Character Recognition Using MATLAB: Sandeep Tiwari, Shivangi Mishra, Priyank Bhatia, Praveen Km. Yadav
No ratings yet
Optical Character Recognition Using MATLAB: Sandeep Tiwari, Shivangi Mishra, Priyank Bhatia, Praveen Km. Yadav
4 pages
10 1109@icirca48905 2020 9183326
No ratings yet
10 1109@icirca48905 2020 9183326
6 pages
Conf Paper
No ratings yet
Conf Paper
7 pages
Cse Final Year Project Proposal PDF Free
No ratings yet
Cse Final Year Project Proposal PDF Free
10 pages
Ocr & Cbir
No ratings yet
Ocr & Cbir
13 pages
Applying AI To Biometric Identification For Recognizing Text Using One-Hot Encoding and CNN
No ratings yet
Applying AI To Biometric Identification For Recognizing Text Using One-Hot Encoding and CNN
9 pages
Artificial Intelligence by Mehedi
100% (1)
Artificial Intelligence by Mehedi
40 pages
Optical Character Recognition
No ratings yet
Optical Character Recognition
27 pages
Plagiarism Checker X Originality Report: Similarity Found: 26%
No ratings yet
Plagiarism Checker X Originality Report: Similarity Found: 26%
29 pages
Analogic Preprocessing and Segmentation Algorithms For Off-Line Handwriting Recognition
No ratings yet
Analogic Preprocessing and Segmentation Algorithms For Off-Line Handwriting Recognition
20 pages
Optical Character Recognition by Open Source OCR Tool Tesseract: A Case Study
No ratings yet
Optical Character Recognition by Open Source OCR Tool Tesseract: A Case Study
8 pages
Optical Character Recognition: Unlocking the Power of Computer Vision for Optical Character Recognition
From Everand
Optical Character Recognition: Unlocking the Power of Computer Vision for Optical Character Recognition
Fouad Sabry
No ratings yet
Optical Character Recognizer: Team Member
No ratings yet
Optical Character Recognizer: Team Member
7 pages
Multimedia and WS-CS 550-Content Analysis v1
No ratings yet
Multimedia and WS-CS 550-Content Analysis v1
27 pages
Optical_character_recognition_system_using_artific
No ratings yet
Optical_character_recognition_system_using_artific
7 pages
Optical Character Recognition: Fundamentals and Applications
From Everand
Optical Character Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
Optical_Character_Recognition_Techniques
No ratings yet
Optical_Character_Recognition_Techniques
6 pages
Research - Paper-2 (Img To Text)
No ratings yet
Research - Paper-2 (Img To Text)
159 pages
Practical Assignment 01: OCR - Optical Character Recognition
No ratings yet
Practical Assignment 01: OCR - Optical Character Recognition
16 pages
Optical Character Recognition (Ocr) : Karan Panjwani T.E - B, 68 Guided By: Prof. Shalini Wankhade
No ratings yet
Optical Character Recognition (Ocr) : Karan Panjwani T.E - B, 68 Guided By: Prof. Shalini Wankhade
24 pages
Face Detection and Recognition Using Opencv and Python
No ratings yet
Face Detection and Recognition Using Opencv and Python
3 pages
Applying AI To Biometric Identification For Recognizing Text Using One-Hot Encoding and CNN
No ratings yet
Applying AI To Biometric Identification For Recognizing Text Using One-Hot Encoding and CNN
10 pages
A (6)
No ratings yet
A (6)
4 pages
CPP Synopsis
No ratings yet
CPP Synopsis
6 pages
digit main
No ratings yet
digit main
30 pages
Vechicle number plate detection using python and cv ppt
No ratings yet
Vechicle number plate detection using python and cv ppt
13 pages
A Review of Handwritten Text Recognition Using Machine Learning and Deep Learning Techniques
No ratings yet
A Review of Handwritten Text Recognition Using Machine Learning and Deep Learning Techniques
6 pages
10 1109@icacccn 2018 8748287
No ratings yet
10 1109@icacccn 2018 8748287
6 pages
Biometric Identification For Recognizing Text Usingvarious AI Algorithm
No ratings yet
Biometric Identification For Recognizing Text Usingvarious AI Algorithm
6 pages
Optical Character Recognition by Open Source OCR Tool Tesseract A Case Study
No ratings yet
Optical Character Recognition by Open Source OCR Tool Tesseract A Case Study
7 pages
CNN Based Digital Alphanumeric Archaeolinguistics Apprehension For Ancient Script Detection
No ratings yet
CNN Based Digital Alphanumeric Archaeolinguistics Apprehension For Ancient Script Detection
7 pages
Ocr Nanonets Tesseract
No ratings yet
Ocr Nanonets Tesseract
39 pages
Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects For Blind Persons
No ratings yet
Portable Camera-Based Assistive Text and Product Label Reading From Hand-Held Objects For Blind Persons
6 pages
Ocr Gtts
No ratings yet
Ocr Gtts
49 pages
Design of An OCR System and Its Hardware Implementation
No ratings yet
Design of An OCR System and Its Hardware Implementation
18 pages
Raspberry Pi Based Smart Reader For Visually Impaired People
50% (2)
Raspberry Pi Based Smart Reader For Visually Impaired People
12 pages
Final
No ratings yet
Final
28 pages
Handwritten Text Recgnition Final
No ratings yet
Handwritten Text Recgnition Final
5 pages
ANN_UNIT-4_IMP
No ratings yet
ANN_UNIT-4_IMP
6 pages
Assignment 2 MLDS Lab
No ratings yet
Assignment 2 MLDS Lab
3 pages
Optical Character Recognition System
No ratings yet
Optical Character Recognition System
41 pages
Final Project Thesis
No ratings yet
Final Project Thesis
7 pages
Review Paper On Raspberry Pi Based Ocr With Tts (19104013, 18004037, 18004051)
No ratings yet
Review Paper On Raspberry Pi Based Ocr With Tts (19104013, 18004037, 18004051)
5 pages
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
No ratings yet
Ijcet: International Journal of Computer Engineering & Technology (Ijcet)
14 pages
Text Detection in Natural Scene Images Using Ocr Algorithm
No ratings yet
Text Detection in Natural Scene Images Using Ocr Algorithm
3 pages
toll
No ratings yet
toll
6 pages
Development of Text Extraction Technique 3acb33e9
No ratings yet
Development of Text Extraction Technique 3acb33e9
8 pages
Certificate Acknowledgement List of Figures List of Tables Abbreviation 1
No ratings yet
Certificate Acknowledgement List of Figures List of Tables Abbreviation 1
3 pages
Question "Unit - 1": Q1 - Identify The Architecture and Explain It ?
No ratings yet
Question "Unit - 1": Q1 - Identify The Architecture and Explain It ?
3 pages
List of Figures I List of Tables II 1 1-2
No ratings yet
List of Figures I List of Tables II 1 1-2
2 pages
College of Engineering: Submitted by
No ratings yet
College of Engineering: Submitted by
5 pages
Analysis: 4.1 Project Scheduling and Tracking
No ratings yet
Analysis: 4.1 Project Scheduling and Tracking
16 pages
Index: Vacoe, Ahmednagar
No ratings yet
Index: Vacoe, Ahmednagar
2 pages
Chapter 7
No ratings yet
Chapter 7
1 page
System Design: 3.1 Process Model
No ratings yet
System Design: 3.1 Process Model
4 pages
1.1 Problem Definition & Relevent Theory
No ratings yet
1.1 Problem Definition & Relevent Theory
31 pages
Brain Intelligence: Go Beyond Artificial Intelligence
No ratings yet
Brain Intelligence: Go Beyond Artificial Intelligence
8 pages
Lung Cancer Detection Using Machine Learning Algorithms and Neural Network On A Conducted Survey Dataset Lung Cancer Detection
No ratings yet
Lung Cancer Detection Using Machine Learning Algorithms and Neural Network On A Conducted Survey Dataset Lung Cancer Detection
4 pages
Technologies-11-00091_Implementation of Deep Learning Models on an SoC-FPGA Device for Real-Time Music Genre Classification
No ratings yet
Technologies-11-00091_Implementation of Deep Learning Models on an SoC-FPGA Device for Real-Time Music Genre Classification
18 pages
Web Traffic Time Series Forecasting: Kaggle Competition Review Bill Tubbs July 26, 2018
No ratings yet
Web Traffic Time Series Forecasting: Kaggle Competition Review Bill Tubbs July 26, 2018
39 pages
ANN & FUZZY Techniques (MEPS303B) : Q1. Short Questions (Attempt Any 5)
No ratings yet
ANN & FUZZY Techniques (MEPS303B) : Q1. Short Questions (Attempt Any 5)
5 pages
DL Unit 3
No ratings yet
DL Unit 3
59 pages
Lec03 NeuralNetwork
No ratings yet
Lec03 NeuralNetwork
39 pages
LSTM
No ratings yet
LSTM
123 pages
Neural Networks and Its Applications: Nishant Dr. Sumeet Gill Hemant Pawar
No ratings yet
Neural Networks and Its Applications: Nishant Dr. Sumeet Gill Hemant Pawar
7 pages
Datasheet - Artificial Intelligence Engineer
No ratings yet
Datasheet - Artificial Intelligence Engineer
2 pages
Lecture 1 - Introduction To ML
No ratings yet
Lecture 1 - Introduction To ML
25 pages
DSI Guide - AI vs ML vs DL vs DS
No ratings yet
DSI Guide - AI vs ML vs DL vs DS
9 pages
AI010 804L01 Neural Networks
No ratings yet
AI010 804L01 Neural Networks
41 pages
LLM Training - A simple visual guide beginners
No ratings yet
LLM Training - A simple visual guide beginners
10 pages
Practical Natural Language Processing: A Comprehensive Guide To Building Real-World NLP Systems
No ratings yet
Practical Natural Language Processing: A Comprehensive Guide To Building Real-World NLP Systems
8 pages
Lecture7 8 - Diffusion - Model 1 78 1 66
No ratings yet
Lecture7 8 - Diffusion - Model 1 78 1 66
66 pages
Will Human Beings Be Superseded by Generative Pre-Trained Transformer 3 (GPT-3) in Programming?
No ratings yet
Will Human Beings Be Superseded by Generative Pre-Trained Transformer 3 (GPT-3) in Programming?
3 pages
Ch04-ANN-Dr Amin ML
No ratings yet
Ch04-ANN-Dr Amin ML
57 pages
Artificial Intelligence (Ai)
No ratings yet
Artificial Intelligence (Ai)
10 pages
Artificial Intelligence Doc - Docx-1
No ratings yet
Artificial Intelligence Doc - Docx-1
26 pages
Data Analytics With Cognos Questions
No ratings yet
Data Analytics With Cognos Questions
15 pages
Lecture 13 - Supervised Learning - Bidirectional Associative Memory (BAM)
No ratings yet
Lecture 13 - Supervised Learning - Bidirectional Associative Memory (BAM)
4 pages
Fuzzy Logic & Pattern Recognition
No ratings yet
Fuzzy Logic & Pattern Recognition
20 pages
Arooba Kanwal Assignment No.1
No ratings yet
Arooba Kanwal Assignment No.1
3 pages
2023 07 28 Evolution of Language Models
No ratings yet
2023 07 28 Evolution of Language Models
73 pages
CMPE 256- MIDTERM_REPORT
No ratings yet
CMPE 256- MIDTERM_REPORT
3 pages
AI Strategy Flow Chart Share by WorldLine Technology
No ratings yet
AI Strategy Flow Chart Share by WorldLine Technology
1 page
2023 ArXiv Unleash LLM For Offline RL
No ratings yet
2023 ArXiv Unleash LLM For Offline RL
19 pages