Sergey Tulyakov’s Post

Director of Research, leading the Creative Vision team

Interested in pushing the performance of large text-to-image models to the edge? Join me at the Efficient Large Vision Models workshop at #CVPR2024 in Seattle on Jun 17, 11am. In my talk, titled "Edge of Efficiency: Speed and Size of Diffusion Models on the Edge", I'll share the key ideas behind SnapFusion -- the fastest on-device model. We'll see how to make the U-Net efficient, how to drastically reduce the number of steps, and much more. I'll also discuss BitsFusion -- our latest foundational model, quantized to 1.99 bits! While using only 1/8 of the SD v1.5 size, it actually shows higher image fidelity. Finally, we'll see a demo of a high-quality image-to-image model running on-device at a stunning 10 FPS and offering an interactive experience! Here is my talk in 10 seconds.

Li-Yun (James) Wang

Actively looking for machine learning/deep learning R&D engineer, machine/deep learning scientist, computer vision R&D engineer, and applied scientist | Ex-Apple Inc., HP Inc., and Samsung Research America

1mo

Hi Sergey Tulyakov, awesome work! I am attending CVPR in person this year and will definitely stop by the Efficient Large Vision Models workshop.
