Sergey Tulyakov’s Post

Name: Sergey Tulyakov on LinkedIn: #cvpr2024
Uploaded: 2024-06-15T18:07:33.959Z
Channel: Sergey Tulyakov

Director of Research, leading the Creative Vision team

1mo

Interested in pushing the performance of large text-to-image models to the edge? Join me at Efficient Large Vision Models workshop at #CVPR2024 in Seattle on Jun 17, 11am. In my talk titled "Edge of Efficiency: Speed and Size of Diffusion Models on the Edge" I'll share the key ideas behind SnapFusion -- the fastest on-device model. We'll see how to make U-Net efficient, how to drastically reduce the number of steps and many more details. I'll also discuss BitsFusion -- our latest foundational models quantized to 1.99 bits! While using only 1/8 of the SD v1.5 size, it actually shows higher image fidelity. Finally, we'll see a demo of a high-quality image-to-image model running on-device at stunning 10FPS and offering interactive experience! Here is my talk in 10s

5 Comments

Li-Yun (James) Wang

Actively looking for machine learning/deep learning R&D engineer, machine/deep learning scientist, computer vision R&D engineer, and applied scientist | Ex-Apple Inc., HP Inc., and Samsung Research America

1mo

Hi Sergey Tulyakov, awesome work! I am attending CVPR in-person this year and stopping by the Efficient Large Vision Models workshop definitely.

1 Reaction

To view or add a comment, sign in

More Relevant Posts

Mpho Mokomiri Ephraim Shiang ll

Data Science Intern at Old Mutual | Building Intelligent Systems For Tomorrow |
1mo Edited
Report this post
Hi, Using a camera to control your laptop with machine learning – no mouse needed! Control it with your face and hands. Building Intelligent Systems for Tomorrow #MachineLearning #ComputerVision #CNN

528 Comments
Like Comment
To view or add a comment, sign in
Modelbit

762 followers
10mo
Report this post
Have you ever wanted to interact with a state of the art compute vision model? 🔎 Check out our interactive demo of Google's OWL-ViT object detection model! #machinelearning #computervision

OWL-ViT Interactive Demo - Modelbit

modelbit.com
Like Comment
To view or add a comment, sign in
Aram Vartanyan

Applying MathWorks solutions to customer technical projects.
3mo
Report this post
Step up your computer vision game by learning to automate the labeling process for object re-identification and tracking. Check out our comprehensive guide for all the details. #ComputerVision #AutomationGuide #ObjectReID

Automate Ground Truth Labeling for Object Tracking and Re-Identification - MATLAB & Simulink
Like Comment
To view or add a comment, sign in
Bronia Badubi

Hr projects/ ERM/IOP candidate/ Gallup Certified Strength coach/ Trainer by BQA/ Positive institutions consultant
1mo
Report this post
Organisations and businesses, it’s time we embrace machine learning for our talent acquisition strategy, especially for recruitment and selection and learning and development, gen Z skills are more diverse. It’s critical that we embrace these diverse skills and use to retain our gen Z …gone are the days where organisations use classroom training with trainers, gamification and machine learning are good ways of embracing the change in learning and development. #learninganddevelopment #talentaquisition #inclusion #generationZ #embracediversity

Mpho Mokomiri Ephraim Shiang ll

Data Science Intern at Old Mutual | Building Intelligent Systems For Tomorrow |
1mo Edited

Hi, Using a camera to control your laptop with machine learning – no mouse needed! Control it with your face and hands. Building Intelligent Systems for Tomorrow #MachineLearning #ComputerVision #CNN
Like Comment
To view or add a comment, sign in
Wesley Mmadike

FrontEnd software developer | Telecommunication analytics
1mo
Report this post
this is actually very impressive

Mpho Mokomiri Ephraim Shiang ll

Data Science Intern at Old Mutual | Building Intelligent Systems For Tomorrow |
1mo Edited

Hi, Using a camera to control your laptop with machine learning – no mouse needed! Control it with your face and hands. Building Intelligent Systems for Tomorrow #MachineLearning #ComputerVision #CNN

1 Comment
Like Comment
To view or add a comment, sign in
Luis Velasco

Solving problems with data, and solving data-problems
7mo
Report this post
Happy new year everyone, holiday season project is finally here! I've decided to put a few technologies together this time (embeddings, Vision API, LLMs, Autoencoders, vector databases and of course my shiny RTX4090) to build a "embedding set reconstructor": given a set of N "valid" embeddings, I've trained a model to generate the "missing" embedding given N-1 embeddings. Then applied this concept to the Fashion world using a LARGE research dataset (Chictopia10k), results are surprisingly good! https://lnkd.in/dD5wMXyQ
9 Comments
Like Comment
To view or add a comment, sign in
Bhunesh Sen

Electronics and Communication Engineering | CPP | DSA | OOP | Python | Student at Sagar Group of Institutions - Sagar Institute of Science and Technology
3mo Edited
Report this post
Working on Image Processing and Bit part of Machine learning... We are Working on Face Detection Attendance System using ESP32-CAM. Here ESP32-Cam don't have USB input, So we need a Interface to program it, However it is not as easy as it looks. It's really hard to Implement. New Challenge - #machinelearning #imageprocessing #FaceDetectionAttendanceSystem
Like Comment
To view or add a comment, sign in
Steven McCraw

Sr. Manager, Client Solution Architects at Hewlett Packard Enterprise - West Enterprise and Commercial
10mo
Report this post
Simple. 'HPE's Cray Supercomputing XD665 enables the addition of Large Language Model integration, Model Training and Fine-Tuning for workloads across industries including Healthcare & Life Science, Financial Services and Manufacturing'

Ron Javor on LinkedIn: #sustainability #hewlettpackardenterprise #cray #supercomputing #nvidia…

linkedin.com
Like Comment
To view or add a comment, sign in
Lindsay Millard

Hydrologist MEngSc RPEQ CPEng FIEAust
10mo
Report this post
here's another AI weather-model - Google DeepMind's GraphCast model. https://lnkd.in/gxnpP7sJ It's already challenging conventional weather models like the ECMWF for various parameters. In fact, it takes less than 60 seconds to generate a full 10-day forecast using a suitable graphics processing unit that's found in retail computers! there are other AI weather models like Pangu-Weather and FourCastNet with similar features. I'd imagine that these are being rolled out to enhance dam safety and water security across Australia.

GitHub - google-deepmind/graphcast

github.com
Like Comment
To view or add a comment, sign in

4,396 followers

72 Posts

View Profile Follow

Sergey Tulyakov’s Post

More Relevant Posts

Ron Javor on LinkedIn: #sustainability #hewlettpackardenterprise #cray #supercomputing #nvidia…

linkedin.com

Explore topics