Keyword: Audiovisual fusion : Search

Applied Filters

People

Publications

Publication Date

4 Results for: Keyword: Audiovisual fusionEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,842,514 records)|Limit your search to The ACM Full-Text Collection (774,577 records)

Showing 1 - 4of4 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

article
February 2014
Audiovisual diarization of people in video content
Multimedia Tools and Applications (MTAA), Volume 68, Issue 3Pages 747–775https://doi.org/10.1007/s11042-012-1080-6

Audio-Visual People Diarization (AVPD) is an original framework that simultaneously improves audio, video, and audiovisual diarization results. Following a literature review of people diarization for both audio and video content and their limitations, ...
10
Metrics
Total Citations10
article
October 2011
Dynamical information fusion of heterogeneous sensors for 3D tracking using particle swarm optimization
Information Fusion (INFU), Volume 12, Issue 4Pages 275–283https://doi.org/10.1016/j.inffus.2010.06.005

This paper presents a new method for three dimensional object tracking by fusing information from stereo vision and stereo audio. From the audio data, directional information about an object is extracted by the Generalized Cross Correlation (GCC) and ...
4
Metrics
Total Citations4
article
May 2010
A New Learning Algorithm for the Fusion of Adaptive Audio---Visual Features for the Retrieval and Classification of Movie Clips
Journal of Signal Processing Systems (JSPS), Volume 59, Issue 2Pages 177–188https://doi.org/10.1007/s11265-008-0290-7

This paper presents a new learning algorithm for audiovisual fusion and demonstrates its application to video classification for film database. The proposed system utilized perceptual features for content characterization of movie clips. These features ...
2
Metrics
Total Citations2
research-article
December 2008
Boosting-Based Multimodal Speaker Detection for Distributed Meeting Videos
- Cha Zhang,
- Pei Yin,
- Yong Rui,
- R. Cutler,
- P. Viola,
- Xinding Sun,
- N. Pinto,
- Zhengyou Zhang
IEEE Transactions on Multimedia (TOM), Volume 10, Issue 8Pages 1541–1552https://doi.org/10.1109/TMM.2008.2007344

Identifying the active speaker in a video of a distributed meeting can be very helpful for remote participants to understand the dynamics of the meeting. A straightforward application of such analysis is to stream a high resolution video of the speaker ...
8
Metrics
Total Citations8

Search Results

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

All Publications

Content Type

Publisher

Publication Date

Results

Audiovisual diarization of people in video content

Dynamical information fusion of heterogeneous sensors for 3D tracking using particle swarm optimization

A New Learning Algorithm for the Fusion of Adaptive Audio---Visual Features for the Retrieval and Classification of Movie Clips

Boosting-Based Multimodal Speaker Detection for Distributed Meeting Videos