Openai
Generate text, images, audio, and video using large language models and multimodal AI. Create chat completions, generate and edit images from text prompts, convert text to speech, transcribe and translate audio, generate video, and create text embeddings for search and retrieval. Fine-tune models on custom training data, run evaluations to measure model performance, and moderate content against policy categories. Manage vector stores for semantic file search, upload and organize files, and submit batch processing jobs for asynchronous bulk requests. Conduct real-time speech-to-speech conversations via WebRTC or SIP. Administer organizations, projects, users, API keys, and audit logs programmatically. Receive webhook notifications for background responses, batch jobs, fine-tuning jobs, eval runs, and incoming realtime calls.
Audioscrape
Telnyx
proto.rostro.dev
Turn any LLM multimodal; generate images, voices, videos, 3D models, music, and more.
Eigi.ai Voice Agents
Create and manage AI voice agents, real-time conversations, and analytics with eigi.ai
xSkill AI
AI content generation with 50+ models: image, video, TTS, voice cloning, and more.
mcp.nymbo.net
Remote MCP server: fetch, search, Python, TTS, memory, image, video.
ElevenLabs MCP Server
A Model Context Protocol (MCP) server that integrates with ElevenLabs text-to-speech API, featuring both a server component and a sample web-based MCP Client (SvelteKit) for managing voice generation tasks.
elevenlabs
Official ElevenLabs Model Context Protocol (MCP) server that enables interaction with powerful Te...
omi
A Model Context Protocol server for Omi interaction and automation. This server provides tools to...
Carbon Voice
Listenetic
OpenAI Tools MCP Server
Focused MCP server for OpenAI image/audio generation (v2.0.0). Wraps endpoints via HAPI CLI.