Cited By
View all- Qian TCui RChen JPeng PGuo XJiang Y(2024)Locate Before Answering: Answer Guided Question Localization for Video Question AnsweringIEEE Transactions on Multimedia10.1109/TMM.2023.332387826(4554-4563)Online publication date: 1-Jan-2024
- Yu TFu KZhang JHuang QYu J(2024)Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question AnsweringIEEE Transactions on Image Processing10.1109/TIP.2024.339098433(3115-3129)Online publication date: 24-Apr-2024
- Luo YWang RZhang FZhou FLiu MFeng J(2024)Video Q &A based on two-stage deep exploration of temporally-evolving features with enhanced cross-modal attention mechanismNeural Computing and Applications10.1007/s00521-024-09482-836:14(8055-8071)Online publication date: 1-May-2024
- Show More Cited By