Diverse Image Captioning via Conditional Variational Autoencoder and Dual Contrastive Learning
Abstract
References
Index Terms
- Diverse Image Captioning via Conditional Variational Autoencoder and Dual Contrastive Learning
Recommendations
CONICA: A Contrastive Image Captioning Framework with Robust Similarity Learning
MM '23: Proceedings of the 31st ACM International Conference on MultimediaContrastive Language Image Pre-training (CLIP) has recently made significant advancements in image captioning by providing effective multi-modal representation learning capabilities. However, previous studies primarily rely on the language-aligned visual ...
Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation
WWW '21: Proceedings of the Web Conference 2021Sequential recommendation as an emerging topic has attracted increasing attention due to its important practical significance. Models based on deep learning and attention mechanism have achieved good performance in sequential recommendation. Recently, ...
Diverse Image Captioning with Grounded Style
Pattern RecognitionAbstractStylized image captioning as presented in prior work aims to generate captions that reflect characteristics beyond a factual description of the scene composition, such as sentiments. Such prior work relies on given sentiment identifiers, which are ...
Comments
Information & Contributors
Information
Published In
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
Check for updates
Author Tags
Qualifiers
- Research-article
Funding Sources
- National Natural Science Foundation of China
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 424Total Downloads
- Downloads (Last 12 months)340
- Downloads (Last 6 weeks)23
Other Metrics
Citations
Cited By
View allView Options
Get Access
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in