Connect Aiml API to AI agents

Connect Aiml API to Claude, Codex, Cursor, or other AI agents for your entire team. Metorial security, governance, observability, and gives your team a unified Magic MCP url to connect.

Supported Tools

list_models

List Models

Retrieve the full list of available AI/ML models across all categories (text, image, video, speech, embeddings, moderation, etc.). Use this to discover available model IDs before making generation requests.

generate_embeddings

Generate Embeddings

Generate vector embeddings from text for semantic search, similarity analysis, clustering, and classification. Supports models like text-embedding-3-small (1536 dimensions), text-embedding-3-large (3072 dimensions), and multilingual models. Can embed single strings or batches of text in a single request.

speech_to_text

Speech to Text

Transcribe audio from a URL into text using speech-to-text models from OpenAI (Whisper), Deepgram (Nova-2), and Assembly AI. Submits the audio for asynchronous processing and polls until the transcription is ready. Generated transcriptions are stored on the server for 1 hour.

generate_video

Generate Video

Generate videos from text prompts or reference images using video generation models like MiniMax and Kling AI. Video generation is asynchronous — the tool submits the request and polls for results. Supports text-to-video and image-to-video workflows.

moderate_content

Moderate Content

Classify text or image content as safe or unsafe using Meta's Llama Guard content moderation models. Analyzes input for harmful content and returns a safety classification with hazard categories when unsafe. Supports text, image URLs, and base64-encoded images.

chat_completion

Chat Completion

Generate text responses using 400+ LLM models including GPT, Claude, Gemini, DeepSeek, Llama, and Qwen. Supports system prompts, multi-turn conversations, temperature control, JSON mode, and web search. Use this for text generation, code generation, reasoning, question answering, and conversational AI.

text_to_speech

Text to Speech

Convert text into natural-sounding speech audio using models from OpenAI, ElevenLabs, Deepgram, and Microsoft. Supports 120+ languages, multiple voices, adjustable speed, and various audio formats (mp3, opus, aac, flac, wav, pcm). Returns a URL to the generated audio file.

generate_image

Generate Image

Generate images from text prompts using 70+ image models including Flux, DALL-E, Stable Diffusion, Imagen, and more. Supports configurable resolution, aspect ratio, negative prompts, guidance scale, and seed for reproducibility. Returns URLs to the generated images.

More integrations teams use with Aiml API

GitHub

Manage repositories, issues, and pull requests. Create and configure branches, star repositories, review code, and merge changes. Automate CI/CD workflows with GitHub Actions, manage workflow runs, secrets, and artifacts. Track issues with labels, milestones, and assignees. Search across code, repositories, issues, and users. Manage organizations, teams, and memberships. Create and manage projects, gists, packages, deployments, and environments. Access security alerts including code scanning, secret scanning, and Dependabot alerts. Read and write file contents in repositories. Manage webhooks, notifications, and codespaces.

Sharepoint

Manage SharePoint sites, document libraries, lists, and files. Create, read, update, and delete lists and list items with custom columns. Upload, download, move, copy, and version files in document libraries. Search across sites, files, folders, lists, and list items using Microsoft Search. Manage permissions at site, list, and item levels with granular access control. Define and manage content types and site columns. Subscribe to webhooks for list and library change notifications. Retrieve site properties and search for sites across Microsoft 365.

Salesforce

Manage CRM data including Accounts, Contacts, Leads, Opportunities, Cases, and custom objects. Create, read, update, and delete records. Query data using SOQL and search across objects using SOSL. Perform bulk data operations for large-scale imports, exports, and migrations. Execute composite requests to batch multiple operations in a single API call. Access analytics, reports, and dashboards. Manage files and attachments associated with records. Interact with Chatter feeds, posts, and groups for social collaboration. Subscribe to real-time change events via Change Data Capture and Platform Events. Manage org metadata including custom objects, fields, layouts, and workflows. Query data using GraphQL for precise data retrieval across related objects.

Airtable

Create, read, update, and delete records in Airtable bases and tables. Manage base schemas including creating tables and fields. Filter records using formulas, sort by fields, and scope queries to specific views. Upsert records to find, create, or update in a single call. Upload attachments to records, read and write record comments, list accessible bases, and receive real-time base change events through webhooks.

Bitbucket

Manage Git repositories, pull requests, and CI/CD pipelines on Bitbucket Cloud. Create, fork, and configure repositories within workspaces and projects. Create, review, approve, merge, and decline pull requests with inline code comments. Browse source code, list commits, and manage branches and tags. Track issues with the built-in issue tracker. Trigger, monitor, and manage Bitbucket Pipelines. List workspace members, configure repository default reviewers and branch restrictions, create and manage repository webhooks, and search code across repositories.

Heroku

Deploy, manage, and scale applications on Heroku's cloud platform. Create and configure apps, scale dynos, provision add-ons (databases, caching, etc.), manage configuration variables, build and release code, add custom domains and SSL certificates, manage collaborators and team permissions, configure pipelines for continuous delivery, set up log drains, and sync data with Salesforce via Heroku Connect. Subscribe to webhooks for real-time notifications on app changes, builds, releases, dyno lifecycle events, and more.

Technical notes for Aiml API

Unified gateway to 400+ AI/ML models for text generation, image generation, video generation, music generation, speech-to-text, text-to-speech, content moderation, 3D model generation, vision/OCR, embeddings, and AI-powered web search. Generate chat completions and reasoning with models like GPT, Claude, Gemini, DeepSeek, and Llama. Create images from text prompts using Flux, Stable Diffusion, and DALL-E. Generate videos from text or images asynchronously. Convert speech to text and text to speech in 120+ languages. Moderate content for safety classification. Generate 3D objects from text or images. Extract text and structured data from images via OCR. Produce text embeddings for semantic search. Search the web for real-time information. Create AI Assistants for customer support and data analysis. Interact in real time via WebSocket for voice and text. Receive webhook notifications for async operation completion.

Connect Aiml API to production AI agents

See how Metorial gives Aiml API access the governance, tracing, and security controls teams need.

Frequently asked questions

Common questions about connecting Aiml API to AI agents with Metorial.

  1. Can Metorial connect Aiml API to AI agents?
    Yes. Metorial connects AI agents to Aiml API through a governed integration layer, so teams can use the provider while keeping access controlled and observable.
  2. Metorial is MCP compatible and lets teams expose approved provider tools to MCP-capable agents and clients through a controlled access layer.
  3. Metorial applies policies across users, groups, providers, agents, and individual tools, then records the context around every agent interaction.
  4. Yes. Metorial records provider activity so teams can inspect tool calls, troubleshoot integrations, and give security teams the visibility they need.