Your AI Model Gateway - Smoothly Manage Multiple LLMs and Image Models, Speed Up Responses, and Ensure Non-Stop Reliability.
At Writesonic, after three years of navigating the world of large language models, we identified key challenges and built GPTRouter to solve them.
- Model Independence: Don't put all your eggs in one basket. GPTRouter frees you from relying on a single AI provider such as OpenAI. If one model is down, GPTRouter keeps you up and running by switching to another.
- Beat the Latency: GPTRouter is designed to tackle slow response times, especially with hefty models like GPT-4, so users get a smoother, faster interaction.
- Diverse Model Integration: Why settle for one when you can have more? GPTRouter supports multiple language and image generation models, providing fallback options so your system remains robust and versatile.
- Universal API: One API to connect them all. Easily switch between models like OpenAI, Azure OpenAI, Anthropic, Replicate, Stable Diffusion, Cohere, and more.
- Smart Fallbacks: Keep your services uninterrupted. GPTRouter automatically switches to alternative models if your primary choice is unavailable.
- Automatic Retries: GPTRouter intelligently retries failed requests, reducing manual effort and improving reliability.
- Fast and Responsive: Designed to reduce latency, GPTRouter ensures your interactions with AI models are quick and efficient.
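
To make the fallback and retry behaviour described above concrete, here is a minimal, self-contained sketch of the general pattern. It is not GPTRouter's implementation: `call_with_fallback`, `flaky_primary`, and `healthy_backup` are hypothetical names invented for illustration, and the providers are plain Python functions rather than real model clients.

```python
import time
from typing import Callable, Optional, Sequence


def call_with_fallback(
    providers: Sequence[Callable[[str], str]],
    prompt: str,
    max_retries: int = 2,
    backoff_seconds: float = 0.0,
) -> str:
    """Try each provider in order, retrying each one before moving to the next."""
    last_error: Optional[Exception] = None
    for provider in providers:
        for attempt in range(max_retries + 1):
            try:
                return provider(prompt)
            except Exception as exc:  # a real router would catch narrower errors
                last_error = exc
                if backoff_seconds > 0:
                    # Exponential backoff between attempts (assumed policy).
                    time.sleep(backoff_seconds * (2 ** attempt))
    raise RuntimeError("all providers failed") from last_error


# Hypothetical stand-ins for a primary model that is down and a working backup.
primary_calls = []


def flaky_primary(prompt: str) -> str:
    primary_calls.append(prompt)
    raise ConnectionError("primary is down")


def healthy_backup(prompt: str) -> str:
    return f"backup answered: {prompt}"


result = call_with_fallback([flaky_primary, healthy_backup], "Write me a short poem")
print(result)
```

In this sketch the primary is attempted `max_retries + 1` times before the backup is used, so the caller only sees an error if every provider in the list fails.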
Supported Models | Completion | Streaming | Async Completion | Async Streaming |
---|---|---|---|---|
OpenAI | ✅ | ✅ | ✅ | ✅ |
Azure OpenAI | ✅ | ✅ | ✅ | ✅ |
Anthropic | ✅ | ✅ | ✅ | ✅ |
Replicate | ✅ | ✅ | ✅ | ✅ |
Stable Diffusion | ✅ | ❌ | ✅ | ❌ |
Dalle-3 | ✅ | ❌ | ✅ | ❌ |
Cohere | ✅ | ✅ | ✅ | ✅ |
More to come | Coming soon | Coming soon | Coming soon | Coming soon |

❌ Streaming is not applicable to image models.
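
For readers unfamiliar with the four call modes named in the table columns, the following toy sketch shows how they differ in plain Python. It does not use the GPTRouter SDK: `complete`, `stream`, `acomplete`, and `astream` are hypothetical functions backed by a fixed token list, chosen only to illustrate blocking versus chunked and synchronous versus asynchronous calls.

```python
import asyncio
from typing import AsyncIterator, Iterator, List

# A fixed "model output" so the example is deterministic.
TOKENS = ["Roses ", "are ", "red"]


def complete(prompt: str) -> str:
    """Completion: return the whole response at once."""
    return "".join(TOKENS)


def stream(prompt: str) -> Iterator[str]:
    """Streaming: yield the response chunk by chunk."""
    yield from TOKENS


async def acomplete(prompt: str) -> str:
    """Async completion: awaitable that resolves to the whole response."""
    await asyncio.sleep(0)  # stands in for non-blocking network I/O
    return "".join(TOKENS)


async def astream(prompt: str) -> AsyncIterator[str]:
    """Async streaming: async generator that yields chunks."""
    for token in TOKENS:
        await asyncio.sleep(0)
        yield token


async def main() -> List[str]:
    full = await acomplete("poem")
    chunks = [chunk async for chunk in astream("poem")]
    return [full, "".join(chunks)]


sync_full = complete("poem")
sync_chunks = list(stream("poem"))
async_results = asyncio.run(main())
print(sync_full, sync_chunks, async_results)
```

All four modes produce the same text here; what changes is whether the caller blocks, and whether it receives the output in one piece or incrementally.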
✨ Contributors Welcome! ✨
Ready to get started? Here's how:
Getting The Server Running
- To run the GPTRouter server locally, follow the steps here
- Alternatively, use our Preview Deployment with the baseURL https://gpt-router-preview.writesonic.com/. Get your API key by filling out the form here.
Once the server is running, you can integrate GPTRouter into your application using our Python SDK or via the API Docs. We are also working on JS and other clients and are looking for contributors to help out.
Install GPTRouter using pip:
pip install gptrouter
Or with conda:
conda install gptrouter -c conda-forge
Usage Example
from gpt_router.client import GPTRouterClient
from gpt_router.models import ModelGenerationRequest, GenerationParams
from gpt_router.enums import ModelsEnum, ProvidersEnum

# Point the client at your own server or the preview deployment.
client = GPTRouterClient(base_url='your_base_url', api_key='your_api_key')

# Chat-style prompt: a list of role/content messages.
messages = [
    {"role": "user", "content": "Write me a short poem"},
]
prompt_params = GenerationParams(messages=messages)

# One generation request; order=1 places it first in the ordered list.
claude2_request = ModelGenerationRequest(
    model_name=ModelsEnum.CLAUDE_INSTANT_12,
    provider_name=ProvidersEnum.ANTHROPIC.value,
    order=1,
    prompt_params=prompt_params,
)

response = client.generate(ordered_generation_requests=[claude2_request])
print(response.choices[0].text)
Discover More: Explore streaming and other examples here.
- Integrations with Langchain and LlamaIndex, expanding your options even further.
For comprehensive documentation, visit: GPTRouter Documentation
Detailed installation instructions and setup guidance can be found in our Getting Started Guide.
We welcome contributions from the community! If you're interested in improving GPTRouter, see our Contribution Guidelines.