Upon starting the inference server with Qwen2VL via:
./koboldcpp --model "Qwen2-VL-2B-Instruct-Q6_K_L.gguf" --mmproj "mmproj-Qwen2-VL-2B-Instruct-f16.gguf"
output-wise it seems to be working fine, but on every turn of a multi-turn chat it appears to re-encode the image from scratch, which is inefficient in resource-constrained environments. I'm not sure whether this is the intended behaviour; from my limited understanding, it should be able to reuse the image embeddings originally produced via the projector and only append to them, rather than regenerating them each turn (a rough sketch of what I mean is below).
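To illustrate the kind of reuse I have in mind, here is a minimal sketch (not koboldcpp's actual API; `encode_image_with_mmproj` and the cache are hypothetical) of caching the projector output keyed by a hash of the image bytes, so later turns with the same image skip the vision encoder:

import hashlib

# Hypothetical stand-in for the expensive vision-tower + projector pass that
# the mmproj model performs; here it just returns a dummy embedding vector.
def encode_image_with_mmproj(image_bytes: bytes) -> list[float]:
    return [0.0] * 1536  # placeholder for the real image embedding tokens

_embedding_cache: dict[str, list[float]] = {}

def get_image_embeddings(image_bytes: bytes) -> list[float]:
    """Reuse cached embeddings when the same image appears in a later turn."""
    key = hashlib.sha256(image_bytes).hexdigest()
    if key not in _embedding_cache:
        # Pay the encoding cost only once per unique image.
        _embedding_cache[key] = encode_image_with_mmproj(image_bytes)
    return _embedding_cache[key]

With something like this, the second and later turns of the chat would only need to process the new text, not re-run the image encoder.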
Release: v1.82.4
Windows Subsystem for Linux (WSL) with the CPU backend.