Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJanuary 2024
Generic Attention-model Explainability by Weighted Relevance Accumulation
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaArticle No.: 63, Pages 1–7https://doi.org/10.1145/3595916.3626437Attention-based Transformer models have achieved remarkable progress in multi-modal tasks, such as visual question answering. The explainability of attention-based methods has recently attracted wide interest as it can explain the inner changes of ...
- research-articleJanuary 2024
Research on Multi-Person Pose Estimation Based on YOLO and Decoupled Multi-Level Feature Layers Fusion
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaArticle No.: 58, Pages 1–7https://doi.org/10.1145/3595916.3626432Multi-person pose estimation is fundamental research in the fields of AIGC, multimedia understanding,virtual reality, human-computer interaction, etc. Existing algorithms have problems such as large computational complexity, low accuracy, and an ...
- research-articleJanuary 2024
Image Cropping under Design Constraints
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaArticle No.: 40, Pages 1–7https://doi.org/10.1145/3595916.3626412Image cropping is essential in image editing for obtaining a compositionally enhanced image. In display media, image cropping is a prospective technique for automatically creating media content. However, image cropping for media contents is often ...