Towards Long Form Audio-visual Video Understanding
Abstract
References
Index Terms
- Towards Long Form Audio-visual Video Understanding
Recommendations
Harmony across Music, Visuals and Movement in a New Audio-visual Gestural Performance
TEI '20: Proceedings of the Fourteenth International Conference on Tangible, Embedded, and Embodied InteractionThis paper describes the technology, concepts and development of Computer Storm, a live audio-visual piece created for a gestural instrument, the 'AirSticks'. The AirSticks allow the composition, performance and improvisation of live electronic music ...
The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events
ICMI '08: Proceedings of the 10th international conference on Multimodal interfacesIt is of prime importance in everyday human life to cope with and respond appropriately to events that are not foreseen by prior experience. Machines to a large extent lack the ability to respond appropriately to such inputs. An important class of ...
Event-centric multi-modal fusion method for dense video captioning
AbstractDense video captioning aims to automatically describe several events that occur in a given video, which most state-of-the-art models accomplish by locating and describing multiple events in an untrimmed video. Despite much progress in ...
Comments
Information & Contributors
Information
Published In
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
Check for updates
Author Tags
Qualifiers
- Research-article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 208Total Downloads
- Downloads (Last 12 months)208
- Downloads (Last 6 weeks)80
Other Metrics
Citations
View Options
Get Access
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in