This report presents our system developed for the Ad-hoc Video Search (AVS) task in TRECVID 2019 as Team ATL. In this AVS task, we apply a hybrid sequential ...
This report presents a hybrid sequential encoder which makes use of not only the multi-modal sources but also the feature extractors such as ...
In this report, we present our method for the CVPR 2020 Video Pentathlon challenge. In the challenge, we use a hybrid sequence encoder to extract information ...
Sep 10, 2020 · Dual encoding is conceptually simple, practically effective and end-to-end trained with hybrid space learning. Extensive experiments on four ...
Hybrid Sequence Encoder for Text Based Video Retrieval. Record by Mingli Song • Xiang Wu, Da Chen ...
This report presents our model developed for the Video Pentathlon challenge at CVPR 2020. The goal of this challenge is to build a system for five video ...
A curated list of deep learning resources for video-text retrieval. - danieljf24/awesome-video-text-retrieval.
Dual encoding, trained by hybrid space learning, is a new state-of-the-art for video retrieval by text. The architecture of our model is based on existing.
In this paper, we present a multi-modal transformer to jointly encode the different modalities in video, which allows each of them to attend to the others.