petmal/MindTrial
MindTrial: Evaluate and compare AI language models (LLMs) on text-based tasks with optional file/image attachments and tool use. Supports multiple providers (OpenAI, Google, Anthropic, DeepSeek, Mistral AI, xAI, Alibaba, Moonshot AI, OpenRouter), custom tasks in YAML, and HTML/CSV reports.
Platform-specific configuration:
{
"mcpServers": {
"MindTrial": {
"command": "npx",
"args": [
"-y",
"MindTrial"
]
}
}
}Add the config above to .claude/settings.json under the mcpServers key.
Loading reviews...