Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
×
Aug 1, 2023 · In this paper, a new transformer-based image captioning structure without recurrence and convolution is proposed to address these issues. To ...
Mar 12, 2024 · The generator uses a pretrained CNN and LSTM to predict the next word, while the selector evaluates words with fitness scores for partial ...
People also ask
Transformer-based local-global guidance for image captioning‏. H Parvin, AR Naghsh-Nilchi, HM Mohammadi‏. Expert Systems with Applications 223, 119774, 2023 ...
Oct 1, 2023 · In this paper, a new double-attention framework is presented, which improves the encoder–decoder structure to consider image captioning problems ...
Mar 10, 2024 · This work introduces a novel multitask learning framework that combines image captioning and object detection into a joint model. We propose ...
Jun 2, 2024 · ... image captioning approaches treat local feature and global feature in the image ... Transformer-Based Local-Global Guidance for Image Captioning.
(arXiv 2022.03) End-to-End Transformer Based Model for Image Captioning, [Paper] ... (arXiv 2021.09) Hybrid Local-Global Transformer for Image Dehazing, [Paper].
Jun 28, 2022 · guidance between them, which realizes the complementary advantages between local and global attention. We also fuse the refined global ...
In this paper, we presented PoS-Transformer, a novel transformer-based framework for image captioning, to separate the grammatical structures and word semantics ...