Abstract. It is a fact that current methodologies for automatic translation cannot be expected to... more Abstract. It is a fact that current methodologies for automatic translation cannot be expected to produce high quality translations. An alternative approach is to use them as an aid to manual translation. We focus on a possible way to help human translators: to interactively provide ...
State is a flexible system for document processing. It comprises a graphical front-end that can b... more State is a flexible system for document processing. It comprises a graphical front-end that can be easily connected to different text recognition back-ends. We comment here the front-end and two back-ends: one based on nearest neighbors and one based on Hidden ...
ABSTRACT Matrics is a system for recognition of car license plates. It works on standard PC equip... more ABSTRACT Matrics is a system for recognition of car license plates. It works on standard PC equipment with low-priced capture devices and achieves real-time performance (10 frames per second) with state of the art accuracy: the character error rate is below 1% and the plate error rate is below 3%. The recognition process is divided in two phases: plate localization and plate decoding. The system finds the plate analyzing the connected components of the image after binarization. The decoding algorithm is a Two Level process which uses fast template-based classification techniques in its first stage and optimal segmentation in the second stage. On the whole, the system represents a significant improvement over a previous version which was based on HMM.
2008 The Eighth IAPR International Workshop on Document Analysis Systems, 2008
ABSTRACT We present a complete assisted transcription system for ancient documents: State. The sy... more ABSTRACT We present a complete assisted transcription system for ancient documents: State. The system consists of two applications: a pen-based, interactive application to assist humans in transcribing ancient documents and a recognition engine which offers automatic transcriptions via a web service. The interaction model and the recognition algorithm employed in the current version of State are presented. Some preliminary experiments show the productivity gains obtained with the system when transcribing a document and the error rate of the current recognition engine.
... To evaluate the performance of the ROIs detection method, we have measured the number of time... more ... To evaluate the performance of the ROIs detection method, we have measured the number of times ... goes up to 92%, which corresponds to the 98% of times that plate region was ... be also employed to estimate the size of structural elements for applying morphological operators [1 ...
Proceedings of the 2009 international conference on Multimodal interfaces - ICMI-MLMI '09, 2009
... [3] Albert Gordo, David Llorens, Andrés Marzal, Federico Prat, and Juan Miguel Vilar. STATE: ... more ... [3] Albert Gordo, David Llorens, Andrés Marzal, Federico Prat, and Juan Miguel Vilar. STATE: A multimodal assisted text-transcription system for ancient documents. In The Eigth IAPR Workshop on Document Analysis Systems, Nara (Japan), September 2008. [4] METAe. ...
Computer-Assisted Translation (CAT) is an alternative approach to Machine Translation, that integ... more Computer-Assisted Translation (CAT) is an alternative approach to Machine Translation, that integrates human expertise into the automatic translation process. In this framework, a human translator interacts with a translation system that dynamically offers a list of translations that best completes the part of the sentence already translated. Stochastic finite-state transducer technology is proposed to support this CAT system. The system
The EuTRANS project aims at using Example-Based approaches for the automatic developmentof Machin... more The EuTRANS project aims at using Example-Based approaches for the automatic developmentof Machine Translation systems --accepting text and speech input-- for limited domain applications.During the first phase of the project, a speech translation system that is based on the use of automaticallylearnt Subsequential Transducers has been built. This paper contains a detailed and to a long extentself-contained overview of the
ABSTRACT Shape descriptions and the corresponding matching techniques must be robust to noise and... more ABSTRACT Shape descriptions and the corresponding matching techniques must be robust to noise and invariant to transformations for their use in recognition tasks. Most transformations are relatively easy to handle when contours are represented by strings. However, starting point invariance is difficult to achieve. One interesting possibility is the use of cyclic strings, which are strings that have no starting and final points. We propose new methodologies to use Hidden Markov Models to classify contours represented by cyclic strings. Experimental results show that our proposals outperform other methods in the literature.
Abstract. It is a fact that current methodologies for automatic translation cannot be expected to... more Abstract. It is a fact that current methodologies for automatic translation cannot be expected to produce high quality translations. An alternative approach is to use them as an aid to manual translation. We focus on a possible way to help human translators: to interactively provide ...
State is a flexible system for document processing. It comprises a graphical front-end that can b... more State is a flexible system for document processing. It comprises a graphical front-end that can be easily connected to different text recognition back-ends. We comment here the front-end and two back-ends: one based on nearest neighbors and one based on Hidden ...
ABSTRACT Matrics is a system for recognition of car license plates. It works on standard PC equip... more ABSTRACT Matrics is a system for recognition of car license plates. It works on standard PC equipment with low-priced capture devices and achieves real-time performance (10 frames per second) with state of the art accuracy: the character error rate is below 1% and the plate error rate is below 3%. The recognition process is divided in two phases: plate localization and plate decoding. The system finds the plate analyzing the connected components of the image after binarization. The decoding algorithm is a Two Level process which uses fast template-based classification techniques in its first stage and optimal segmentation in the second stage. On the whole, the system represents a significant improvement over a previous version which was based on HMM.
2008 The Eighth IAPR International Workshop on Document Analysis Systems, 2008
ABSTRACT We present a complete assisted transcription system for ancient documents: State. The sy... more ABSTRACT We present a complete assisted transcription system for ancient documents: State. The system consists of two applications: a pen-based, interactive application to assist humans in transcribing ancient documents and a recognition engine which offers automatic transcriptions via a web service. The interaction model and the recognition algorithm employed in the current version of State are presented. Some preliminary experiments show the productivity gains obtained with the system when transcribing a document and the error rate of the current recognition engine.
... To evaluate the performance of the ROIs detection method, we have measured the number of time... more ... To evaluate the performance of the ROIs detection method, we have measured the number of times ... goes up to 92%, which corresponds to the 98% of times that plate region was ... be also employed to estimate the size of structural elements for applying morphological operators [1 ...
Proceedings of the 2009 international conference on Multimodal interfaces - ICMI-MLMI '09, 2009
... [3] Albert Gordo, David Llorens, Andrés Marzal, Federico Prat, and Juan Miguel Vilar. STATE: ... more ... [3] Albert Gordo, David Llorens, Andrés Marzal, Federico Prat, and Juan Miguel Vilar. STATE: A multimodal assisted text-transcription system for ancient documents. In The Eigth IAPR Workshop on Document Analysis Systems, Nara (Japan), September 2008. [4] METAe. ...
Computer-Assisted Translation (CAT) is an alternative approach to Machine Translation, that integ... more Computer-Assisted Translation (CAT) is an alternative approach to Machine Translation, that integrates human expertise into the automatic translation process. In this framework, a human translator interacts with a translation system that dynamically offers a list of translations that best completes the part of the sentence already translated. Stochastic finite-state transducer technology is proposed to support this CAT system. The system
The EuTRANS project aims at using Example-Based approaches for the automatic developmentof Machin... more The EuTRANS project aims at using Example-Based approaches for the automatic developmentof Machine Translation systems --accepting text and speech input-- for limited domain applications.During the first phase of the project, a speech translation system that is based on the use of automaticallylearnt Subsequential Transducers has been built. This paper contains a detailed and to a long extentself-contained overview of the
ABSTRACT Shape descriptions and the corresponding matching techniques must be robust to noise and... more ABSTRACT Shape descriptions and the corresponding matching techniques must be robust to noise and invariant to transformations for their use in recognition tasks. Most transformations are relatively easy to handle when contours are represented by strings. However, starting point invariance is difficult to achieve. One interesting possibility is the use of cyclic strings, which are strings that have no starting and final points. We propose new methodologies to use Hidden Markov Models to classify contours represented by cyclic strings. Experimental results show that our proposals outperform other methods in the literature.
Uploads
Papers by Juan Vilar