There is also (or at least used to be?) an OpenAI-compatible API layer for Ollama, so that may be an option as well, though my understanding is that there are some downsides to using it.
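For anyone who wants to try it, here is a minimal sketch against Ollama's OpenAI-compatible endpoint, assuming Ollama is running on its default port (11434) and the model name below is just an example of something you have pulled locally:

```python
from openai import OpenAI

# Point the stock OpenAI client at Ollama's OpenAI-compatible layer.
client = OpenAI(
    base_url="http://localhost:11434/v1",  # /v1 is the compatibility endpoint
    api_key="ollama",  # the client requires a key, but Ollama ignores it
)

response = client.chat.completions.create(
    model="llama3",  # example only; use whatever model you have pulled
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
```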
Note: This comment and the link are just meant as references/conveniences, not intended as a request for free labor. Thanks for opening up the code!
Forget Ollama; just changing the URL from OpenAI's to your local server is enough, since llama.cpp has a compatible endpoint. Most people just don't bother exposing the option because you get a CORS error if the local server doesn't have a valid cert.
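To make the URL swap concrete, a sketch assuming llama.cpp's `llama-server` is running on its default port (8080, started e.g. with `llama-server -m model.gguf`); the client code is the same as for any OpenAI backend, only the base URL differs:

```python
from openai import OpenAI

# Same stock client, different base URL: llama-server exposes an
# OpenAI-compatible API under /v1.
client = OpenAI(
    base_url="http://localhost:8080/v1",
    api_key="sk-no-key-required",  # llama.cpp's server ignores the key
)

reply = client.chat.completions.create(
    model="local",  # llama-server serves whichever model it was started with
    messages=[{"role": "user", "content": "Say hi."}],
)
print(reply.choices[0].message.content)
```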