Built by Metorial, the integration platform for agentic AI.

Explore thousands of MCP servers and connect to them in a single function call.

georgi.io

Jessica TTS

Integrates ElevenLabs Text-to-Speech capabilities for seamless text conversion to speech, offering voice selection and management through a modern interface. Supports real-time communication with a FastAPI backend and a React frontend.

rsagacom

Intelligent Dialogue Robot

A multi-platform intelligent dialogue service that supports text, voice, and image interactions. It can connect to various AI models and allows for custom enterprise AI applications through plugin extensions.

Al Amin Md Rafiul Hossain

Vapi Voice AI Tools

Integrate voice AI capabilities into applications for managing voice assistants and conducting outbound calls. Provides advanced features for enhancing user interactions through voice conversations.

giannisan

Kokoro TTS Server

Integrates text-to-speech capabilities using the Kokoro TTS engine, enabling conversion of written content into spoken audio with customizable voices and adjustable speed. Supports saving audio files and cross-platform playback.

Mamerto Fabian Jr

ElevenLabs MCP Server

Integrates with ElevenLabs text-to-speech API to generate audio from text input, manage voice generation tasks, and store history using an SQLite database. Includes a sample SvelteKit client for performing text-to-speech conversions and managing script parts.

NakamuraYuichi

Text-to-Speech MCP Server

Integrates high-quality text-to-speech capabilities into applications, converting text to audio with customizable voice options and output formats. Provides a command-line tool for quick conversions and supports various parameters for audio customization.

Matthew Dailey

Rime Text-to-Speech Server

Convert text to speech and play it through the system's audio with high-quality voice synthesis. Customize speech behavior using environment variables for tailored interactions.

CengSin

Fish Audio Text-to-Speech Service

Converts text into natural human speech with customizable audio formats and bitrates, while integrating seamlessly with MCP-compatible applications.

Abhi

Audio Transcriber Server

Transcribes audio files using OpenAI's speech-to-text capabilities, enabling accurate audio transcriptions and the option to save them directly to files.

GongRzhe

Audio Interface Server

Enables interaction with a computer's audio system by listing audio devices, recording audio from microphones, and playing back recordings or audio files. Facilitates audio management and integrates audio input and output control for AI assistants.

Yeo

Voice Recognition Service

Provides voice recognition and text extraction capabilities, supporting both file input and base64-encoded data, processed in structured formats. Operates in stdio and MCP modes for flexible integration with various systems.

DefiBax

Voice Recorder

Record audio and transcribe it using advanced AI models like OpenAI's Whisper. Supports integration with AI agents for enhanced interactivity and includes prompts for common recording scenarios.

David Hamme

Speech MCP Server

Provides text-to-speech capabilities using the Kokoro TTS model, converting text into natural-sounding speech with customizable options and multiple voice choices.

Mu7

AI-StoryLab

AI-StoryLab generates interactive stories with accompanying audio effects and provides illustration prompts. It leverages AI services for story creation, voice synthesis, and sound effect generation, and suggests relevant audio placements.

yuiseki

Edge-TTS Voice Synthesis Server

Provide natural text-to-speech conversion using Microsoft Edge's speech synthesis capabilities, enabling customizable voice output in multiple languages with adjustable speed and pitch.

ScarletLabs.ai

Votars MCP

Integrate advanced AI functionalities for processing complex tasks through robust APIs. Supports voice recording, transcription, and intelligent AI processing for meetings.

MiniMax

MiniMax MCP JS

Integrates with MiniMax's AI capabilities to facilitate interaction with multimedia generation tools, including image generation, video generation, text-to-speech, and voice cloning. Supports a flexible and configurable JavaScript/TypeScript framework for versatile deployment scenarios.

ImOrenge

VoiceMacro

VoiceMacro enables executing keyboard shortcuts and macros through voice commands on Windows. It supports custom voice command configurations and manages presets for frequent macro operations while running in the background.

Barton Rhodes

Say Server

Provides text-to-speech functionality using macOS's built-in say command, allowing the generation of spoken output from text input.
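As a rough illustration of what wrapping the `say` command can look like, here is a minimal Python sketch. It is not taken from the Say Server's code: the function names `build_say_command` and `speak` are hypothetical, and the only parts assumed from macOS itself are the `say` options `-v` (voice) and `-r` (rate in words per minute) and the fact that `say` reads its text from standard input when no message argument is given.

```python
import shutil
import subprocess
from typing import List, Optional


def build_say_command(voice: Optional[str] = None,
                      rate_wpm: Optional[int] = None) -> List[str]:
    """Build the argument list for macOS `say` (hypothetical helper).

    The text itself is not placed on the command line; it is sent on
    stdin so that text beginning with "-" is never parsed as an option.
    """
    cmd = ["say"]
    if voice:
        cmd += ["-v", voice]          # e.g. a voice installed on the machine
    if rate_wpm is not None:
        if rate_wpm <= 0:
            raise ValueError("rate_wpm must be positive")
        cmd += ["-r", str(rate_wpm)]  # speech rate in words per minute
    return cmd


def speak(text: str,
          voice: Optional[str] = None,
          rate_wpm: Optional[int] = None) -> None:
    """Speak `text` aloud through `say` (hypothetical helper, macOS only)."""
    if not text.strip():
        raise ValueError("text must not be empty")
    if shutil.which("say") is None:
        raise RuntimeError("the `say` command was not found (macOS only)")
    # `say` reads from standard input when no message argument is given.
    subprocess.run(build_say_command(voice, rate_wpm),
                   input=text, text=True, check=True)
```

On a Mac, calling `speak("Hello from an MCP tool", rate_wpm=180)` would speak the sentence with the default system voice. Keeping the command construction separate from the subprocess call makes the argument handling easy to check on machines where `say` is not installed.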

Anıl

High-performance Whisper ASR

Transcribes and translates audio files using a lightweight implementation of OpenAI's Whisper model, optimized for speed and low memory usage across various platforms.

Abhay Babbar

RetellAI Voice Service Integration

Manage and interact with RetellAI's voice services, facilitating call management, voice agent creation, phone number provisioning, and voice option access through a unified interface.

Omado

Voicevox Speech Synthesis Server

Provides voice synthesis capabilities compatible with VOICEVOX and similar engines through the Model Context Protocol. Facilitates speech audio generation using AI agents compatible with MCP clients.

Kentaro Kuribayashi

AivisSpeech

Integrate with the AivisSpeech Engine to provide high-quality speech synthesis capabilities for applications, facilitating the conversion of text to natural-sounding speech. The server offers a type-safe API compliant with the Model Context Protocol, ensuring easy configuration and extensibility.

Pink Pixel

MCPollinations Multimodal Server

Generates images, text, and audio from prompts using the Pollinations APIs. It supports returning images as base64-encoded data and allows listing available models for image and text generation.

Toshitaka komori

BouyomiChan Text-to-Speech Server

Provides text-to-speech capabilities using BouyomiChan's Yukkuri voice, enabling voice output from text commands with customizable options for voice type, volume, speed, and pitch. Integrates seamlessly with Claude for Desktop for enhanced user interaction.