mcp-server

MCP Tool

iris-eval/mcp-server

MCP-native agent evaluation and observability server

Install

$ npx loaditout add iris-eval/mcp-server

About

Iris — MCP-Native Agent Eval & Observability

[](https://github.com/iris-eval/mcp-server) [](https://npmjs.com/package/@iris-eval/mcp-server) [](https://npmjs.com/package/@iris-eval/mcp-server) [](https://github.com/iris-eval/mcp-server/actions/workflows/ci.yml) [](LICENSE)

See what your AI agents are actually doing. Iris is an open-source MCP server that logs every trace, evaluates output quality, and tracks costs across all your agents. Any MCP-compatible agent discovers and uses it automatically — no SDK, no code changes.

The Problem

Your agents are running in production. Traditional monitoring sees 200 OK and moves on. It has no idea the agent just:

Leaked a social security number in its response
Hallucinated an answer with zero factual grounding
Burned $0.47 on a single query — 4.7x your budget threshold
Made 6 tool calls when 2 would have sufficed

Iris sees all of it.

What You Get

| | | |---|---| | Trace Logging | Hierarchical span trees with per-tool-call latency, token usage, and cost in USD. Stored in SQLite, queryable instantly. | | Output Evaluation | 12 built-in rules across 4 categories: completeness, relevance, safety, cost. PII detection, prompt injection patterns, hallucination markers. Add custom rules with Zod schemas. | | Cost Visibility | Aggregate cost across all agents over any time window. Set budget thresholds. Get flagged when agents overspend. | | Web Dashboard | Real-time dark-mode UI with trace visualization, eval result

Reviews

Loading reviews...

Quality Signals

Quality Score4600

Stars

Installs

Last updatedtoday

Security: AREADME

New

mcp-server

Install

About

Tags

Reviews

Quality Signals

Safety

Details

Embed Badge