design_voice
Design Voice
Create a new voice from a text description. Returns voice previews with audio samples and generated voice IDs that can be used for text-to-speech.
delete_voice
Permanently delete a voice from your library. Only works on voices you own (cloned or designed). This action cannot be undone.
edit_voice
Update a voice's name, description, or labels. Only works on voices you own.
get_voice
Get detailed metadata for a specific voice, including its settings, category, labels, fine-tuning status, and preview URL.
list_models
List all available ElevenLabs models with their capabilities. Useful for discovering which models support text-to-speech, voice conversion, and other features.
create_dubbing
Dub audio or video content into another language. Provide a source URL, target language, and optional configuration. Returns a dubbing project ID that can be polled for completion.
list_voices
Search and list available voices. Supports filtering by name, category, and voice type. Use this to find voice IDs for text-to-speech generation.
generate_sound_effect
Generate sound effects from text descriptions. Returns base64-encoded audio. Useful for creating cinematic sound effects for videos, voice-overs, or games.
get_dubbing
Get the status and details of a dubbing project. Use this to check if dubbing is complete and retrieve metadata about the project.
text_to_speech
Convert text into lifelike audio using AI voices. Returns base64-encoded audio data. Supports multiple models (Flash, Turbo, Multilingual v2, v3), various output formats (MP3, PCM, Opus, μ-law), and fine-grained voice settings for stability, similarity, style, and speed.
list_history
List previously generated audio items from your ElevenLabs history. Supports filtering by voice and search text. Returns metadata for each generation including text, voice, model, and timestamps.
speech_to_text
Transcribe spoken audio into text. Supports speaker diarization, word-level timestamps, and language detection. Provide audio as base64-encoded data or via a cloud storage URL.
get_user_info
Get your ElevenLabs account information including subscription tier, character usage and limits, voice slots, and billing details.
list_pronunciation_dictionaries
List available pronunciation dictionaries. Pronunciation dictionaries let you customize how specific words or phrases are spoken during text-to-speech generation.
isolate_audio
Separate vocal tracks from background noise in an audio file. Accepts base64-encoded audio and returns the isolated vocals as base64-encoded audio.
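Several of the tools above (text_to_speech, generate_sound_effect, isolate_audio) return audio as base64, and speech_to_text and isolate_audio accept it as input. The following is a minimal sketch of moving between base64 strings and audio files. The helper names `save_base64_audio` and `load_audio_as_base64` are hypothetical, and the name of the response field that carries the base64 string is not given here, so the helpers work on the bare string only.

```python
import base64
from pathlib import Path


def save_base64_audio(audio_b64: str, out_path: str) -> int:
    """Decode a base64 audio payload, write it to out_path, return bytes written.

    Hypothetical helper: audio_b64 is assumed to be the raw base64 string
    taken from a tool response such as text_to_speech.
    """
    # validate=True raises on characters outside the base64 alphabet
    # instead of silently discarding them.
    audio_bytes = base64.b64decode(audio_b64, validate=True)
    Path(out_path).write_bytes(audio_bytes)
    return len(audio_bytes)


def load_audio_as_base64(in_path: str) -> str:
    """Read an audio file and return its contents as a base64 string.

    Hypothetical helper for tools that accept base64-encoded audio input.
    """
    return base64.b64encode(Path(in_path).read_bytes()).decode("ascii")
```

The file extension passed as `out_path` should match the output format requested from the tool; these helpers do not inspect or convert the audio itself.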
Convert text to lifelike speech audio with customizable AI voices, models, languages, and output formats. Transcribe speech to text in real time or in batch mode. Manage, clone, and design voices from audio samples or text prompts. Generate music and sound effects from text descriptions. Dub and translate audio and video content into other languages. Isolate vocals from background noise, change voices in audio, and generate multi-speaker dialogue. Create and manage conversational AI agents with knowledge bases and tool integrations. Manage studio projects for long-form audio productions such as audiobooks. Configure pronunciation dictionaries and administer workspace settings, users, and usage analytics.
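The create_dubbing tool returns a project ID that is polled for completion through get_dubbing. A polling loop for that pattern might look like the sketch below. Everything in it is an assumption made for illustration: `wait_for_dubbing` is a hypothetical helper, `fetch_status` stands in for whatever call invokes get_dubbing and extracts a status string, and the default status values "dubbed" and "failed" should be checked against the actual tool response.

```python
import time
from typing import Callable, Iterable


def wait_for_dubbing(
    fetch_status: Callable[[], str],
    done_statuses: Iterable[str] = ("dubbed",),    # assumed success value
    failed_statuses: Iterable[str] = ("failed",),  # assumed failure value
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch_status until it reports a terminal status or time runs out."""
    done = set(done_statuses)
    failed = set(failed_statuses)
    deadline = time.monotonic() + timeout_s
    while True:
        status = fetch_status()
        if status in done:
            return status
        if status in failed:
            raise RuntimeError(f"dubbing ended with status {status!r}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"dubbing still {status!r} after {timeout_s} seconds")
        # Wait before the next check to avoid hammering the API.
        sleep(interval_s)
```

Passing `sleep` as a parameter keeps the loop testable without real delays; in normal use the default `time.sleep` applies.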
Common questions about connecting Elevenreader to AI agents with Metorial.