Automatic identification of a script in a given document image facilitates many important applications such as automatic archiving of multilingual documents, searching online archives of document images and for the selection of script... more
Automatic identification of a script in a given document image facilitates many important applications such as automatic archiving of multilingual documents, searching online archives of document images and for the selection of script specific OCR in a multilingual environment. In this paper, we present a scheme to identify different Indian scripts from a document image. This scheme employs hierarchical classification which uses features consistent with human perception. Such features are extracted from the responses of a multi-channel log-Gabor filter bank, designed at an optimal scale and multiple orientations. In the first stage, the classifier groups the scripts into five major classes using global features. At the next stage, a sub-classification is performed based on script-specific features. All features are extracted globally from a given text block which does not require any complex and reliable segmentation of the document image into lines and characters. Thus the proposed scheme is efficient and can be used for many practical applications which require processing large volumes of data. The scheme has been tested on 10 Indian scripts and found to be robust to skew generated in the process of scanning and relatively insensitive to change in font size. This proposed system achieves an overall classification accuracy of 97.11% on a large testing data set. These results serve to establish the utility of global approach to classification of scripts.
In this work, we propose an online handwriting solution, where the data is captured with the help of depth sensors. Users may write in the air and our method recognizes it in real time using the proposed feature representation. Our method... more
In this work, we propose an online handwriting solution, where the data is captured with the help of depth sensors. Users may write in the air and our method recognizes it in real time using the proposed feature representation. Our method uses an efficient fingertip tracking approach and reduces the necessity of pen-up/pen-down switching. We validate our method on two depth sensors, Kinect and Leap Motion Controller. On a dataset collected from 20 users, we achieve a recognition accuracy of 97.59% for character recognition. We also demonstrate how this system can be extended for lexicon recognition with reliable performance. We have also prepared a dataset containing 1,560 characters and 400 words with the intention of providing common benchmark for handwritten character recognition using depth sensors and related research.
For databases of images new sorts of query will have to be developed to make access to content efficient and convenient. Generally these queries will be based upon a number of broad classes which can be ex-tracted from the image. This... more
For databases of images new sorts of query will have to be developed to make access to content efficient and convenient. Generally these queries will be based upon a number of broad classes which can be ex-tracted from the image. This work looks at one such class, shape, and ...
Abstract: We present a computationally simple and general purpose scheme for the detection of all salient object con-tours in real images. The scheme is inspired by the mechanism of surround influence that is exhibited in 80% of neurons... more
Abstract: We present a computationally simple and general purpose scheme for the detection of all salient object con-tours in real images. The scheme is inspired by the mechanism of surround influence that is exhibited in 80% of neurons in the primary visual cortex of primates. It is ...
The importance of a companies logo as a uniquely identifying feature need not be stressed. When a new company enters the market or an existing company tries to re-brand itself it is required to come up with a new logo. Currently, the... more
The importance of a companies logo as a uniquely identifying feature need not be stressed. When a new company enters the market or an existing company tries to re-brand itself it is required to come up with a new logo. Currently, the patent office serves the purpose of making ...
Krishna Palem#1, Al Barr%2, Avinash Lingamneni^3, Vincent Mooney*4, Rajeswari Pingali&5, Harini Sampath$6 and Jayanthi Sivaswamy$7 #Department of CS, Department of ECE and Department of Statistics, Rice University ^Department of ECE, Rice... more
Krishna Palem#1, Al Barr%2, Avinash Lingamneni^3, Vincent Mooney*4, Rajeswari Pingali&5, Harini Sampath$6 and Jayanthi Sivaswamy$7 #Department of CS, Department of ECE and Department of Statistics, Rice University ^Department of ECE, Rice University ...
Contour detection is an important and difficult task for object segmentation in computer vision mainly because contours often occur in the presence of background texture. A previously reported scheme, based on the model of cortical cells... more
Contour detection is an important and difficult task for object segmentation in computer vision mainly because contours often occur in the presence of background texture. A previously reported scheme, based on the model of cortical cells in primates, analyses the local energy output ...
AbstractIn this paper we present a method for fovea localiza-tion which does not use organization information of other retinal structures like optic disk and arcades. The main advantage of this method is that it does not require... more
AbstractIn this paper we present a method for fovea localiza-tion which does not use organization information of other retinal structures like optic disk and arcades. The main advantage of this method is that it does not require segmentation/localization of other retinal structures ...
Many computer vision applications rely on the analysis of curvature within images. The use of hexagonal lattices can make this analysis easier. When coupled with the notion of attention driven search this can yield an efficient image... more
Many computer vision applications rely on the analysis of curvature within images. The use of hexagonal lattices can make this analysis easier. When coupled with the notion of attention driven search this can yield an efficient image analysis tool. As well, such ...