


default search action
18th ECCV 2024: Milan, Italy - Part LXXVII
- Ales Leonardis
, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXXVII. Lecture Notes in Computer Science 15135, Springer 2024, ISBN 978-3-031-72979-9 - Wendi Zheng, Jiayan Teng, Zhuoyi Yang, Weihan Wang, Jidong Chen, Xiaotao Gu, Yuxiao Dong, Ming Ding, Jie Tang:
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion. 1-22 - Nanye Ma, Mark Goldstein, Michael S. Albergo, Nicholas M. Boffi, Eric Vanden-Eijnden, Saining Xie:
SiT: Exploring Flow and Diffusion-Based Generative Models with Scalable Interpolant Transformers. 23-40 - Baicheng Li, Zike Yan, Dong Wu, Hanqing Jiang, Hongbin Zha:
Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM. 41-57 - Sudhir Yarram
, Junsong Yuan
:
Forecasting Future Videos from Novel Views via Disentangled 3D Scene Representation. 58-76 - Emanuele Santellani
, Martin Zach
, Christian Sormann
, Mattia Rossi
, Andreas Kuhn
, Friedrich Fraundorfer
:
GMM-IKRS: Gaussian Mixture Models for Interpretable Keypoint Refinement and Scoring. 77-93 - Sizhuo Li
, Dimitri Gominski
, Martin Brandt
, Xiaoye Tong
, Philippe Ciais
:
Get Your Embedding Space in Order: Domain-Adaptive Regression for Forest Monitoring. 94-111 - Daniel Winter, Matan Cohen, Shlomi Fruchter, Yael Pritch, Alex Rav-Acha, Yedid Hoshen:
ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion. 112-129 - Ziyang Gong
, Fuhao Li
, Yupeng Deng
, Deblina Bhattacharjee
, Xianzheng Ma, Xiangwei Zhu
, Zhenming Ji
:
CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning. 130-148 - Andrey Voynov
, Amir Hertz
, Moab Arar
, Shlomi Fruchter, Daniel Cohen-Or
:
Curved Diffusion: A Generative Model with Optical Geometry Control. 149-164 - Guangchi Fang
, Bing Wang
:
Mini-Splatting: Representing Scenes with a Constrained Number of Gaussians. 165-181 - Ziming Zhong, Yanyu Xu, Jing Li, Jiale Xu, Zhengxin Li, Chaohui Yu, Shenghua Gao:
MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis. 182-199 - Kwanyoung Kim
, Yujin Oh
, Jong Chul Ye
:
OTSeg: Multi-Prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation. 200-217 - Yannick Kirchhoff
, Maximilian Rokuss, Saikat Roy
, Balint Kovacs
, Constantin Ulrich
, Tassilo Wald, Maximilian Zenk
, Philipp Vollmuth
, Jens Kleesiek
, Fabian Isensee
, Klaus H. Maier-Hein
:
Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures. 218-234 - Yi Zhang
, Ke Yu
, Siqi Wu, Zhihai He
:
Conceptual Codebook Learning for Vision-Language Models. 235-251 - Ana-Maria Marcu
, Long Chen
, Jan Hünermann, Alice Karnsund, Benoît Hanotte, Prajwal Chidananda, Saurabh Nair, Vijay Badrinarayanan, Alex Kendall, Jamie Shotton, Elahe Arani, Oleg Sinavski:
LingoQA: Visual Question Answering for Autonomous Driving. 252-269 - Dimitrios Gerogiannis
, Foivos Paraperas Papantoniou
, Rolandos Alexandros Potamias
, Alexandros Lattas
, Stylianos Moschoglou
, Stylianos Ploumpis
, Stefanos Zafeiriou
:
AnimateMe: 4D Facial Expressions via Diffusion Models. 270-287 - Zhecan Wang, Garrett Bingham, Adams Wei Yu, Quoc V. Le, Thang Luong, Golnaz Ghiasi:
HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning. 288-304 - Kevin Xie, Jonathan Lorraine
, Tianshi Cao
, Jun Gao
, James Lucas, Antonio Torralba
, Sanja Fidler
, Xiaohui Zeng
:
LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis. 305-322 - Tianyuan Yuan, Yucheng Mao, Jiawei Yang, Yicheng Liu, Yue Wang, Hang Zhao:
PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors. 323-339 - Jie Ren
, Yaxin Li
, Shenglai Zeng, Han Xu
, Lingjuan Lyu, Yue Xing
, Jiliang Tang:
Unveiling and Mitigating Memorization in Text-to-Image Diffusion Models Through Cross Attention. 340-356 - Tom Fischer
, Yaoyao Liu
, Artur Jesslen
, Noor Ahmed, Prakhar Kaushik
, Angtian Wang, Alan L. Yuille
, Adam Kortylewski
, Eddy Ilg
:
iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning. 357-374 - Ivona Najdenkoska, Animesh Sinha, Abhimanyu Dubey, Dhruv Mahajan, Vignesh Ramanathan, Filip Radenovic:
Context Diffusion: In-Context Aware Image Generation. 375-391 - Tongkai Shi, Lianyu Hu, Fanhua Shang, Jichao Feng, Peidong Liu, Wei Feng:
Pose-Guided Fine-Grained Sign Language Video Generation. 392-409 - Ali Zare, Yulei Niu, Hammad A. Ayyubi, Shih-Fu Chang:
RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos. 410-426 - Zhengyuan Jiang, Moyang Guo, Yuepeng Hu, Jinyuan Jia, Neil Zhenqiang Gong:
Certifiably Robust Image Watermark. 427-443 - Sukrut Rao
, Sweta Mahajan, Moritz Böhle
, Bernt Schiele
:
Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. 444-461 - Qi Qian
, Juhua Hu
:
Online Zero-Shot Classification with CLIP. 462-477

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.