- research-article, October 2023
Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts
MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 4389–4400, https://doi.org/10.1145/3581783.3612389
Zero-shot Visual Question Answering (VQA) is a prominent vision-language task that examines both the visual and textual understanding capability of systems in the absence of training data. Recently, by converting the images into captions, information ...
- research-article, August 2023
A Study of Situational Reasoning for Traffic Understanding
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Pages 3262–3272, https://doi.org/10.1145/3580305.3599246
Intelligent Traffic Monitoring (ITMo) technologies hold the potential for improving road safety/security and for enabling smart city infrastructure. Understanding traffic situations requires a complex fusion of perceptual information with domain-...