GongRzhe/Audio-MCP-Server
Built by Metorial, the integration platform for agentic AI.
GongRzhe/Audio-MCP-Server
Server Summary
List audio devices
Record audio
Playback recordings
Play audio files
Text-to-Speech (future implementation)
An MCP (Model Context Protocol) server that provides audio input/output capabilities for AI assistants like Claude. This server enables Claude to interact with your computer's audio system, including recording from microphones and playing audio through speakers.
To install Audio Interface Server for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @GongRzhe/Audio-MCP-Server --client claude
git clone https://github.com/GongRzhe/Audio-MCP-Server.git
cd Audio-MCP-Server
# Windows
python -m venv .venv
.venv\Scripts\activate
pip install -r requirements.txt
# macOS/Linux
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
python setup_mcp.py
To use this server with Claude Desktop, add the following to your Claude Desktop configuration file:
~/Library/Application Support/Claude/claude_desktop_config.json
%APPDATA%\Claude\claude_desktop_config.json
{
"mcpServers": {
"audio-interface": {
"command": "/path/to/your/.venv/bin/python",
"args": [
"/path/to/your/audio_server.py"
],
"env": {
"PYTHONPATH": "/path/to/your/audio-mcp-server"
}
}
}
}
Replace the paths with the actual paths on your system. The setup script will generate this configuration for you.
After setting up the server, restart Claude Desktop. You should see a hammer icon in the input box, indicating that tools are available.
Try asking Claude:
Lists all available audio input and output devices on your system.
Records audio from your microphone.
Parameters:
duration
: Recording duration in seconds (default: 5)sample_rate
: Sample rate in Hz (default: 44100)channels
: Number of audio channels (default: 1)device_index
: Specific input device index to use (default: system default)Plays back the most recently recorded audio.
Placeholder for text-to-speech functionality.
Parameters:
text
: The text to convert to speechvoice
: The voice to use (default: "default")Plays an audio file through your speakers.
Parameters:
file_path
: Path to the audio filedevice_index
: Specific output device index to use (default: system default)If no audio devices are found, check:
If audio playback isn't working:
If Claude can't connect to the server:
MIT
Note: This server provides tools that can access your microphone and speakers. Always review and approve tool actions before they execute.