Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- articleFebruary 2014
Audiovisual diarization of people in video content
Multimedia Tools and Applications (MTAA), Volume 68, Issue 3Pages 747–775https://doi.org/10.1007/s11042-012-1080-6Audio-Visual People Diarization (AVPD) is an original framework that simultaneously improves audio, video, and audiovisual diarization results. Following a literature review of people diarization for both audio and video content and their limitations, ...
- articleOctober 2011
Dynamical information fusion of heterogeneous sensors for 3D tracking using particle swarm optimization
Information Fusion (INFU), Volume 12, Issue 4Pages 275–283https://doi.org/10.1016/j.inffus.2010.06.005This paper presents a new method for three dimensional object tracking by fusing information from stereo vision and stereo audio. From the audio data, directional information about an object is extracted by the Generalized Cross Correlation (GCC) and ...
- articleMay 2010
A New Learning Algorithm for the Fusion of Adaptive Audio---Visual Features for the Retrieval and Classification of Movie Clips
Journal of Signal Processing Systems (JSPS), Volume 59, Issue 2Pages 177–188https://doi.org/10.1007/s11265-008-0290-7This paper presents a new learning algorithm for audiovisual fusion and demonstrates its application to video classification for film database. The proposed system utilized perceptual features for content characterization of movie clips. These features ...
- research-articleDecember 2008
Boosting-Based Multimodal Speaker Detection for Distributed Meeting Videos
IEEE Transactions on Multimedia (TOM), Volume 10, Issue 8Pages 1541–1552https://doi.org/10.1109/TMM.2008.2007344Identifying the active speaker in a video of a distributed meeting can be very helpful for remote participants to understand the dynamics of the meeting. A straightforward application of such analysis is to stream a high resolution video of the speaker ...