You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
MindTrial: Evaluate and compare AI language models (LLMs) on text-based tasks with optional file/image attachments and tool use. Supports multiple providers (OpenAI, Google, Anthropic, DeepSeek, Mistral AI, xAI, Alibaba, Moonshot AI, OpenRouter), custom tasks in YAML, and HTML/CSV reports.
Multi-LLM Chat Playground is an interactive tool designed to compare responses from multiple Large Language Models (LLMs) in real time. Built using LiteLLM, this app provides a unified interface to interact with different AI models, making it easy to experiment with various prompts and understand model behaviors.