Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
×
Nov 5, 2023 · In this paper, we propose a multimodal conditional variational auto-encoder (MC-VAE) in two branches to achieve a unified real-world event ...
Dec 31, 2022 · In this paper, we propose a multimodal conditional variational auto-encoder (MC-VAE) in two branches to achieve a unified real-world event ...
Nov 7, 2023 · In this paper, we propose a multimodal conditional variational auto-encoder (MC-VAE) in two branches to achieve a unified real-world event ...
Multimodal Conditional VAE for Zero-Shot Real-World Event Discovery. https ... zero-shot event detection and event captioning. In: KDD, pp. 297–305 ...
Multimodal Conditional VAE for Zero-Shot Real-World Event Discovery. Z. Yang, D. Luo, J. You, Z. Guo, and Z. Yang. ADMA (2), volume 14177 of Lecture Notes ...
Accordingly, in this paper, we propose a method of grounding visual concepts for large-scale Multimedia Event Detection (MED) and Multimedia Event Captioning ( ...
Jan 20, 2023 · Nus-wide: A real-world web image database from ... A generative model for zero shot learning using conditional variational autoencoders.
Jun 26, 2021 · To overcome this problem, we propose a Multimodal Variational Auto-Encoder (M-VAE) which can learn the shared latent space of image features and ...
Missing: Conditional Real- World Event Discovery.
We focus on detecting complex events in uncon- strained Internet videos. While most existing works rely on the abundance of labeled training data, we.
Multi-modal ZSL combines information from multiple data modalities, such as text, images, videos, and audio, to predict unknown classes. By training a model ...