Cited By
- Luo J, Li Y, Pan Y, Yao T, Feng J, Chao H, Mei T (2025). Exploring Vision-Language Foundation Model for Novel Object Captioning. IEEE Transactions on Circuits and Systems for Video Technology 35(1):91-102. doi:10.1109/TCSVT.2024.3452437. Online publication date: Jan-2025.
- Wang L, Li H, Zhang M, Qiu H, Meng F, Wu Q, Xu L (2024). CrowdCaption++: Collective-Guided Crowd Scenes Captioning. IEEE Transactions on Multimedia 26:4974-4986. doi:10.1109/TMM.2023.3328189. Online publication date: 2024.
- Wang L, Qiu H, Qiu B, Meng F, Wu Q, Li H (2024). TridentCap: Image-Fact-Style Trident Semantic Framework for Stylized Image Captioning. IEEE Transactions on Circuits and Systems for Video Technology 34(5):3563-3575. doi:10.1109/TCSVT.2023.3315133. Online publication date: May-2024.