Yong Rui

... The bottom architecture is an example of how the model is used to describe an image object. ... Various features, such as color, texture, shape, layout, motion parameters, etc., are extracted to make the MIR system flexible enough to support different information need of different ...

Publication Date: 1997

Publication Name: 1997 Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries

Research Interests:
Information Retrieval, Human Computer Interaction, User Interface, Multimedia information retrieval, Relevance Feedback, User interfaces, Performance Improvement, Query processing, Information Retrieval System, Information Retrieval systems, Visual Features, and Multimedia Data

Introduction: With the advances in storage technology and the advent of the World Wide Web, there has been an explosion in the amount and complexity of digital information being generated, analyzed, stored, accessed and transmitted. Most of this data is multimedia in nature, including digital images, video, audio and simple text data. To make use of this vast amount of multimedia data, we need ...

Publication Date: 1997

Publication Name: Workshop on Multimedia Information Systems

Research Interests:
World Wide Web, Digital Image, and Multimedia Data

Person-based indices and timelines can enable fast and non-linear access to recorded meetings. This paper focuses on how to automatically construct those indices and timelines by using face recognition techniques. While there exists extensive research in generic face recognition, recognizing faces in recorded meetings is still an understudied area. Real-world meeting videos impose several interesting and unique challenges, including complex lighting, low imaging quality, and large variations in head pose and size. In this paper, a promising approach based on MRC-Boosting is presented to address these challenges. It achieves encouraging performance on real-world meeting videos and shows superior accuracy and robustness compared to two popular existing approaches.

Research Interests:
Face Recognition

... free paper. Printed in the United States of America. To my parents, Wei, and Michael. -Sean To my parents, Dongqin, and Olivia. -Yong To Margaret; Caroline, Marjorie, Thomas, Gregory. -Tom Contents 1 ...

Publication Date: 2003

Machine-aided retrieval of multimedia information (image [44], video [170], audio [195], etc.) is achieved based on representations in the form of descriptors (or feature vectors). Two issues arise. One is the effectiveness of the representation, i.e., to what extent can the meaningful contents of the media be represented in these vectors? The other is the selection of a similarity metric during the retrieval process. This is an important issue because the similarity metric dynamically depends upon the user and the user-defined query class, ...

Publication Date: 2003

Research Interests:
On line learning and Similarity Metric
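
The entry above notes that the similarity metric depends on the user and the query class. As a minimal sketch of that idea, and not the method from the publication itself, the snippet below ranks items by a weighted Euclidean distance over feature vectors, so that changing the weights changes the ranking. The function names, the two-dimensional feature vectors, and the weight values are all hypothetical and chosen only for illustration.

```python
import math


def weighted_distance(query, item, weights):
    """Weighted Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(
        sum(w * (q - x) ** 2 for q, x, w in zip(query, item, weights))
    )


def rank(query, items, weights):
    """Return item names ordered from most to least similar to the query."""
    return sorted(
        items,
        key=lambda name: weighted_distance(query, items[name], weights),
    )


# Hypothetical two-feature descriptors (say, one color and one texture value).
items = {"a": [1.0, 0.0], "b": [0.0, 1.0]}
query = [0.0, 0.0]

# A user who cares mostly about the first feature sees "b" as closer ...
color_focused = rank(query, items, weights=[1.0, 0.1])
# ... while a user who cares mostly about the second feature sees "a" as closer.
texture_focused = rank(query, items, weights=[0.1, 1.0])
```

The same query and the same descriptors produce different orderings under different weights, which is the sense in which the metric, rather than the representation, carries the user's notion of similarity.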

Publication Date: 2007

Research Interests:
Engineering

Abstract: Unlike existing work that focuses on emotion type detection, the approach proposed in this paper lets users pick their favorite affective content by choosing either emotion intensity levels or emotion types. Specifically, we propose a hierarchical structure for movie emotions and analyze emotion intensity and emotion type hierarchically, using arousal- and valence-related features. First, three emotion intensity levels are detected by applying fuzzy c-means clustering to arousal features. Fuzzy clustering provides a mathematical model for representing vagueness, which is close to human perception. Then, valence-related features are used to detect five emotion types. Because video is continuous time-series data and the occurrence of a given emotion is affected by recent emotional history, conditional random fields (CRFs) are used to capture the context information. Outperforming the hidden Markov model (HMM), the CRF relaxes the independence assumption on states that the HMM requires and avoids the bias problem. Experimental results show that the CRF-based hierarchical method outperforms the one-step method on emotion type detection. A user study shows that the majority of viewers prefer to have the option of accessing movie content by emotion intensity level, and that the majority of users are satisfied with the proposed emotion detection.

Publication Date: 2012

Research Interests:
Engineering, Technology, and Signal Processing
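
The abstract above mentions fuzzy c-means clustering of arousal features into three intensity levels. The following is a minimal sketch of textbook fuzzy c-means on one-dimensional values, assuming a fuzzifier of m = 2 and an evenly spread deterministic initialization. It is not the paper's implementation: the function name, the parameters, and the sample "arousal" values are hypothetical.

```python
def fuzzy_c_means_1d(values, n_clusters=3, m=2.0, n_iter=100, tol=1e-6):
    """Textbook fuzzy c-means on scalar data.

    Returns (centers, memberships); memberships[k][i] is the degree to which
    values[k] belongs to cluster i, and each row sums to 1.
    """
    lo, hi = min(values), max(values)
    # Deterministic start (an assumption): centers spread across the data range.
    centers = [lo + (hi - lo) * (i + 0.5) / n_clusters for i in range(n_clusters)]
    power = 2.0 / (m - 1.0)
    memberships = []
    for _ in range(n_iter):
        # Membership step: u_i is proportional to distance ** (-2 / (m - 1)).
        memberships = []
        for x in values:
            dists = [abs(x - c) for c in centers]
            if min(dists) == 0.0:
                # The point coincides with a center: full membership there.
                row = [1.0 if d == 0.0 else 0.0 for d in dists]
            else:
                row = [d ** -power for d in dists]
            total = sum(row)
            memberships.append([u / total for u in row])
        # Center step: mean of the data weighted by membership ** m.
        new_centers = []
        for i in range(n_clusters):
            weights = [row[i] ** m for row in memberships]
            new_centers.append(
                sum(w * x for w, x in zip(weights, values)) / sum(weights)
            )
        shift = max(abs(a - b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    return centers, memberships


# Hypothetical arousal scores forming low, medium, and high groups.
arousal = [0.10, 0.12, 0.15, 0.50, 0.52, 0.55, 0.90, 0.92, 0.95]
centers, memberships = fuzzy_c_means_1d(arousal)
```

Unlike hard clustering, every sample keeps a graded membership in each of the three levels, which is the property the abstract relates to the vagueness of human perception.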

Publication Date: 2008

Research Interests:
Information Retrieval, Biomedical Engineering, Knowledge Transfer, Multimedia information retrieval, Public Safety, Consumer Electronics, Multimedia Systems, Web Information Retrieval, Surveillance System, Electrical And Electronic Engineering, and Video Recording

... Furthermore, the HSV, CIE-LAB, and Munsell color spaces also attempt to make the color space perceptually uniform. ... We chose the HSV (hue, saturation, and value) color space for simplicity. Spatial space is just the 2-D Cartesian space spanned ...

Publication Date: 1996

Research Interests:
Image Processing, Feature Extraction, Content based Retrieval, Region Segmentation, and Image Features
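
The entry above refers to the HSV (hue, saturation, value) color space. As a small illustration of what that representation looks like, and not the feature extraction used in the paper, the sketch below converts 8-bit RGB pixels to HSV with Python's standard `colorsys` module. The helper name `rgb8_to_hsv` and the choice to report hue in degrees are assumptions made for readability.

```python
import colorsys


def rgb8_to_hsv(r, g, b):
    """Convert 8-bit RGB to (hue in degrees, saturation 0-1, value 0-1)."""
    h, s, v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
    return h * 360.0, s, v


# Pure red, a half-brightness red, and a mid gray.
red = rgb8_to_hsv(255, 0, 0)
dark_red = rgb8_to_hsv(128, 0, 0)
gray = rgb8_to_hsv(128, 128, 128)
```

Red and its darker variant share the same hue and saturation and differ only in value, while gray has zero saturation. Separating chromatic content from brightness in this way is one reason HSV is a convenient choice for color features.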

Publication Date: 2008

Research Interests:
Engineering, Computer Vision, Probability, Video Analysis, Event Detection, Sport, Text Analysis, Semantic Information, Structure Analysis, Random Processes, Domain Specificity, Conditional Random Field, Ground Truth, Internet, and Sports video analysis

Publication Date: 2001

Research Interests:
Software Architecture, Operating System, Sound Source Localization, Video Editing, Audio Visual, User Study, Management System, System performance, and Streaming Media

Combining learning with vision techniques in interactive image retrieval has been an active research topic during the past few years. However, existing learning techniques either are based on heuristics or fail to analyze the working conditions. Furthermore, there is almost no in-depth study ...

Publication Date: 2000

Research Interests:
Image Retrieval, Computer Vision and Pattern Recognition, Learning Process, Lagrange Multiplier, CVPR, Visual Features, and Evaluation Criteria

Publication Date: 2006

Research Interests:
Information Systems, Algorithms, Artificial Intelligence, Markov chains, Computer Simulation, Computer Systems, Motion, Real Time, Information Storage and Retrieval, Visual Cues, Image Enhancement, Nonlinear system, and Electrical And Electronic Engineering

Supporting multimedia search has emerged as an important research topic. There are three paradigms on the research spectrum, which ranges from the least automatic to the most automatic. On the far left end, there is the pure manual labeling paradigm, which labels multimedia content, e.g., images and video clips, manually with text labels and then uses text search to search ...

Publication Date: 2007

Research Interests:
Video Annotation, Image Annotation, and Spectrum

Abstract: Content-Based Image Retrieval (CBIR) has become one of the most active research areas in the past few years. Many visual feature representations have been explored and many systems built. While these research efforts establish the basis of CBIR, the usefulness of ...

Publication Date: 1998

Research Interests:
Content based image retrieval, Image Retrieval, Relevance Feedback, Information Need, Human Perception, and Visual Features

We propose a new multiple instance learning (MIL) algorithm to learn image categories. Unlike existing MIL algorithms, in which the individual instances in a bag are assumed to be independent of each other, we develop concurrent tensors to explicitly model the inter-dependency ...

Publication Date: 2007

Research Interests:
Computer Vision, Image Processing, Machine Learning, Digital Photography, Multiple Instance Learning, Automation, Labeling, Computer Vision and Pattern Recognition, Kernel, Layout, Matrix Decomposition, Tensors, Reproducing Kernel Hilbert Space, Hilbert Space, Tensile Stress, and Feature Space

This paper provides a comprehensive survey of the technical achievements in the research area of image retrieval, especially content-based image retrieval, an area that has been so active and prosperous in the past few years. The survey includes 100+ papers covering the ...

Publisher: Elsevier

Publication Date: 1999

Publication Name: Journal of visual communication and image …

Research Interests:
System Design, Content based image retrieval, Image Retrieval, Real-world Application, and Image Features

Publication Date: 2013

Decrypting the secret of beauty or attractiveness has been the pursuit of artists and philosophers for centuries. To date, computational models for attractiveness estimation have been actively explored in the computer vision and multimedia communities, yet with the focus mainly on facial features. In this article, we conduct a comprehensive study on female attractiveness conveyed by single or multiple modalities of cues, that is, face, dressing and/or voice, and aim to discover how different modalities individually and collectively affect the human sense of beauty. To investigate the problem extensively, we collect the Multi-Modality Beauty (M2B) dataset, which is annotated with attractiveness levels converted from manual k-wise ratings and with semantic attributes of the different modalities. Inspired by the common consensus that middle-level attribute prediction can assist higher-level computer vision tasks, we manually labeled many attributes for each modality. Next, a tri-layer Dual-supervised Feature-Attribute-Task (DFAT) network is proposed to jointly learn the attribute model and the attractiveness model of single or multiple modalities. To remedy the possible loss of information caused by incomplete manual attributes, we also propose a novel Latent Dual-supervised Feature-Attribute-Task (LDFAT) network, in which latent attributes are combined with manual attributes to contribute to the final attractiveness estimation. Extensive experimental evaluations on the collected M2B dataset demonstrate the effectiveness of the proposed DFAT and LDFAT networks for female attractiveness prediction.

Publication Date: 2006

Publication Name: 2006 IEEE International Conference on Multimedia and Expo

Research Interests:
System Design
