Thomas Deselaers

Google, Research, Department Member

Papers by Thomas Deselaers

Using Heterogeneous Annotation and Visual Information for the Benchmarking of Image Retrieval Systems

Many image retrieval systems, and the evaluation methodologies of these systems, make use of either visual or textual information only. Only a few combine textual and visual features for retrieval and evaluation. If text is used, it often relies upon having a standardised and complete annotation schema for the entire collection. This, in combination with high-level semantic queries, makes visual/textual

Patch-based Object Recognition Using Discriminatively Trained Gaussian Mixtures

British Machine Vision Conference, 2006

We present an approach using Gaussian mixture models for part-based object recognition where spatial relationships of the parts are explicitly modeled and parameters of the generative model are tuned discriminatively. These extensions lead to great improvements of the classification accuracy. Furthermore, we evaluate several improvements over our baseline system which incrementally improve the obtained results which compare favorably well

Clustering visually similar images to improve image search engines

At the moment Google image search is probably the only widely known way to search the world wide web for images. Google's search engine works based on text retrieval: The images are not indexed by their appearance but by text which can be found in the context of the image. To achieve enhancements for the user we propose to reorder

Overview of the ImageCLEFmed 2006 medical retrieval and annotation tasks

Cross-Language Evaluation Forum, 2000

This paper describes the medical image retrieval and the medical annotation tasks of ImageCLEF 2006. These tasks are described in a separate paper from the other tasks to reduce the size of the overview paper. These two medical tasks are described separately with respect to the goals, databases used, topics created and distributed among participants, results and techniques used. The

Geometric Features for Improving Continuous Appearance-based Sign Language Recognition

Proceedings of the British Machine Vision Conference 2006, 2006

Tracking Using Dynamic Programming for Appearance-Based Sign Language Recognition

7th International Conference on Automatic Face and Gesture Recognition (FGR06), 2006

Patch-based Object Recognition Using Discriminatively Trained Gaussian Mixtures

Proceedings of the British Machine Vision Conference 2006, 2006

Global and efficient self-similarity for object classification and detection

2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010

Visual and semantic similarity in ImageNet

CVPR 2011, 2011

Computer Vision Laboratory, ETH Zurich, Switzerland; {deselaers,ferrari}@vision.ee.ethz.ch

Face-based Image Retrieval - One Step Toward Object-based Image Retrieval

In this paper we propose a method to retrieve images based on the persons shown. The method aims at retrieving from images showing groups of people those in which the same persons are depicted as in the query image. It is experimentally shown that this aim is achieved for rather simple tasks and that improvements over baseline methods are possible

Modeling Image Variability in Appearance-Based Gesture Recognition

European Conference on Computer Vision, 2000

We introduce the use of appearance-based features in hidden Markov model emission probabilities to recognize dynamic gestures. Tangent distance and the image distortion model are used to directly model image variability in videos. No explicit hand model and no segmentation of the hand is necessary. Different appearance-based features are investigated and the invariant distance measures are systematically evaluated.

Continuous Sign Language Recognition - Approaches from Speech Recognition and Available Data Resources

In this paper we describe our current work on automatic continuous sign language recognition. We present an automatic sign language recognition system that is based on a large-vocabulary speech recognition system and adopts many of the approaches that are conventionally applied in the recognition of spoken language. Furthermore, we present a set of freely available databases that can

Spoken Language Processing Techniques for Sign Language Recognition and Translation

We present an approach to automatically recognize sign language and translate it into a spoken language. A system to address these tasks is created based on state-of-the-art techniques from statistical machine translation, speech recognition, and image processing research. Such a system is necessary for communication between deaf and hearing people. The communication is otherwise nearly impossible due to missing

The Visual Concept Detection Task in ImageCLEF 2008

Lecture Notes in Computer Science, 2009

Incorporating On-demand Stereo for Real Time Recognition

2007 IEEE Conference on Computer Vision and Pattern Recognition, 2007

Smoothed Disparity Maps for Continuous American Sign Language Recognition

Lecture Notes in Computer Science, 2009

Gesture Recognition Using Image Comparison Methods

Lecture Notes in Computer Science, 2006

FIRE in ImageCLEF 2005: Combining Content-Based Image Retrieval with Textual Information Retrieval

Lecture Notes in Computer Science, 2006

FIRE in ImageCLEF 2007: Support Vector Machines and Logistic Models to Fuse Image Descriptors for Photo Retrieval

by Tobias Gass and Thomas Deselaers

Lecture Notes in Computer Science, 2008

FIRE – Flexible Image Retrieval Engine: ImageCLEF 2004 Evaluation

Lecture Notes in Computer Science, 2005
