Cited By
- Fu X, Cai J, Kankanhalli M, Prabhakaran B, Boll S, Subramanian R, Zheng L, Singh V, Cesar P, Xie L, Xu D (2024). Enhancing Multimodal Large Language Models on Demonstrative Multi-Image Instructions. Proceedings of the 32nd ACM International Conference on Multimedia, pp. 11429-11434. doi:10.1145/3664647.3688994. Online publication date: 28-Oct-2024.
- Ge Z, Li J, Yu Q, Zhou W, Tang S, Zhuang Y, Cai J, Kankanhalli M, Prabhakaran B, Boll S, Subramanian R, Zheng L, Singh V, Cesar P, Xie L, Xu D (2024). DEMON24: ACM MM24 Demonstrative Instruction Following Challenge. Proceedings of the 32nd ACM International Conference on Multimedia, pp. 11426-11428. doi:10.1145/3664647.3688993. Online publication date: 28-Oct-2024.
- Tao M, Bao B, Tang H, Wang Y, Xu C, Cai J, Kankanhalli M, Prabhakaran B, Boll S, Subramanian R, Zheng L, Singh V, Cesar P, Xie L, Xu D (2024). CoIn: A Lightweight and Effective Framework for Story Visualization and Continuation. Proceedings of the 32nd ACM International Conference on Multimedia, pp. 10659-10668. doi:10.1145/3664647.3680873. Online publication date: 28-Oct-2024.