agent-vox

MCP Tool

webkubor/agent-vox

Agent Voice Bridge: gives AI agents ears and a mouth (MCP Server)

Install

$ npx loaditout add webkubor/agent-vox

Platform-specific configuration:

.claude/settings.json

{
  "mcpServers": {
    "agent-vox": {
      "command": "npx",
      "args": [
        "-y",
        "agent-vox"
      ]
    }
  }
}

Add the config above to .claude/settings.json under the mcpServers key.
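As a minimal sketch of that step, the shell function below writes the configuration shown above to a settings file, but only when the file does not exist yet, so an existing mcpServers section is never overwritten. The function name `write_vox_config` is hypothetical and is not part of agent-vox or loaditout.

```shell
# Hypothetical helper (not part of agent-vox): create a settings file that
# registers agent-vox, but refuse to touch a file that already exists.
write_vox_config() {
  target="$1"
  if [ -e "$target" ]; then
    echo "$target already exists; add the agent-vox entry under mcpServers by hand" >&2
    return 1
  fi
  mkdir -p "$(dirname "$target")"
  cat > "$target" <<'EOF'
{
  "mcpServers": {
    "agent-vox": {
      "command": "npx",
      "args": ["-y", "agent-vox"]
    }
  }
}
EOF
}
```

Call it as `write_vox_config .claude/settings.json` from the project root. If the file already exists, the function exits with an error and leaves the merge to you.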

About

agent-vox

Agent Voice Bridge: gives AI agents ears and a mouth

Lets Claude Code, Codex, Gemini CLI, and similar agents understand what you say and speak their answers back to you.


---

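Architecture

> A lightweight voice service that runs locally. It contains no LLM: the agent itself is the brain, and agent-vox only handles voice input and output.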

---

Three integration modes

| Mode | Command | Best for |
|------|---------|----------|
| MCP (recommended) | agent-vox mcp | Native tool calls from Claude Code / Codex / Gemini |
| WebSocket | agent-vox serve | Front-end or custom application integration |
| CLI | agent-vox cli speak/listen | Scripts and agents without MCP support |
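The choice between the three modes can be sketched as a small lookup. The function name `vox_command_for` and the client labels (`claude-code`, `frontend`, `script`, and so on) are assumptions made for illustration; only the three commands come from the table.

```shell
# Hypothetical helper: map a kind of client to the agent-vox command from the table.
vox_command_for() {
  case "$1" in
    claude-code|codex|gemini) echo "agent-vox mcp" ;;    # native MCP tool calls
    frontend|app)             echo "agent-vox serve" ;;  # WebSocket integration
    script)                   echo "agent-vox cli" ;;    # followed by speak or listen
    *) echo "unknown client: $1" >&2; return 1 ;;
  esac
}
```

For example, `vox_command_for codex` prints `agent-vox mcp`, while an unrecognized label returns a non-zero status.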

---

Quick start

Step 1: Install dependencies

# Bun runtime
curl -fsSL https://bun.sh/install | bash

# ASR (speech recognition)
brew install whisper-cpp

# TTS (speech synthesis)
pipx install edge-tts

Step 2: Download the Whisper model

mkdir -p ~/.agent-vox/models
curl -L -o ~/.agent-vox/models/ggml-base.bin \
  "https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.bin"

> The base model is about 141MB and is adequate for everyday Chinese. For higher accuracy, download ggml-small (466MB) or ggml-medium (1.5GB) and change the model path accordingly.
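Switching model sizes can be sketched with two small helpers that derive the download URL and the local file path. The names `vox_model_url` and `vox_model_path` are hypothetical, and the sketch assumes that ggml-small.bin and ggml-medium.bin sit next to ggml-base.bin in the same Hugging Face repository. How agent-vox is told which model file to load is not covered here.

```shell
# Hypothetical helpers: derive the download URL and local path for a model size.
# Assumes the small and medium files follow the same naming as ggml-base.bin.
vox_model_url() {
  case "$1" in
    base|small|medium)
      echo "https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-$1.bin" ;;
    *) echo "unsupported model size: $1" >&2; return 1 ;;
  esac
}

vox_model_path() {
  echo "$HOME/.agent-vox/models/ggml-$1.bin"
}

# Example (left commented out because it downloads several hundred MB):
#   curl -L -o "$(vox_model_path small)" "$(vox_model_url small)"
```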

Step 3: Install agent-vox

git clone https://github.com/webkubor/agent-vox.git
cd agent-vox
bun install

Step 4: Verify

bun run info

You should see (the first line reads "Checking dependencies..."):

🔍 检查依赖...
  ✅ whisper-cli
  ✅ edge-tts
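If you want to run a similar check by hand, the sketch below reports whether each required command is on your PATH. It is an illustration only, not the code behind `bun run info`, and the function name `vox_check_deps` is hypothetical.

```shell
# Hypothetical check, not the code behind `bun run info`:
# report whether each named command can be found on PATH.
vox_check_deps() {
  missing=0
  for cmd in "$@"; do
    if command -v "$cmd" >/dev/null 2>&1; then
      echo "ok: $cmd"
    else
      echo "missing: $cmd"
      missing=1
    fi
  done
  # Non-zero exit status when at least one command is missing.
  return "$missing"
}
```

Run it as `vox_check_deps bun whisper-cli edge-tts`. A non-zero exit status means at least one of the tools from step 1 still needs to be installed.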

---

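Usage

🧠 Mode 1: MCP mode (recommended)

The agent uses the voice capabilities directly as tools, with no connection code to write.

Claude Code:

claude mcp add agent-vox -- bun run /your/path/agent-vox/src/index.js mcp

Then, in Claude Code, just say:

> Use the listen tool to hear my voice input, then answer my question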
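The registration command needs the absolute path of your checkout. As a small sketch, the helper below resolves a directory to an absolute path and prints the matching command without running it. The name `vox_mcp_add_command` is hypothetical, and the sketch assumes the path contains no spaces.

```shell
# Hypothetical helper: print the Claude Code registration command for a checkout.
# Assumes the checkout path contains no spaces.
vox_mcp_add_command() {
  repo_dir="$(cd "$1" && pwd)" || return 1   # resolve to an absolute path
  echo "claude mcp add agent-vox -- bun run $repo_dir/src/index.js mcp"
}
```

From inside the cloned repository, `vox_mcp_add_command .` prints a command you can review and then paste into your terminal.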

Codex:


Quality Signals

Last updated: today

Security: B
