Dialer
The Dialer integration lets you make outbound calls, manage call campaigns, and track calling activity directly from your workflow, enabling automated follow-ups and real-time call disposition logging.
Fireflies
The Fireflies integration lets you access and analyze your meeting transcripts, search through conversations, and extract key insights from your recorded calls directly within your workflow.
Carbon Voice
The Carbon Voice integration lets you access and manage your voice recordings, transcripts, and audio notes directly from your AI assistant. Use it to retrieve past recordings, search through transcripts, and organize your voice data without switching between applications.
Audioscrape
The Audioscrape integration lets you extract and transcribe audio content from online sources directly within your MCP workflow, enabling automated audio processing and analysis tasks.
VoiceMacro
VoiceMacro enables executing keyboard shortcuts and macros through voice commands on Windows. It supports custom voice command configurations and manages presets for frequent macro operations while running in the background.
BouyomiChan Text-to-Speech Server
Provides text-to-speech capabilities using BouyomiChan's Yukkuri voice, enabling voice output from text commands with customizable options for voice type, volume, speed, and pitch. Integrates seamlessly with Claude for Desktop for enhanced user interaction.
Fish Audio Text-to-Speech Service
Converts text into natural human speech with customizable audio formats and bitrates, while integrating seamlessly with MCP-compatible applications.
Typecast API MCP Server
Integrates with the Typecast API to manage voices, convert text to speech, and play audio. Provides a standardized MCP interface for seamless interaction with voice capabilities.
Vapi Voice AI Tools
Integrate voice AI capabilities into applications for managing voice assistants and conducting outbound calls. Provides advanced features for enhancing user interactions through voice conversations.
Audio Transcriber Server
Transcribes audio files using OpenAI's speech-to-text capabilities, enabling accurate audio transcriptions and the option to save them directly to files.
Audio Interface Server
Enables interaction with a computer's audio system by listing audio devices, recording audio from microphones, and playing back recordings or audio files. Facilitates audio management and integrates audio input and output control for AI assistants.
Votars MCP
Integrate advanced AI functionalities for processing complex tasks through robust APIs. Supports voice recording, transcription, and intelligent AI processing for meetings.
Garbage Sorting App
Identify and classify waste using image and voice recognition techniques to streamline the recycling process and enhance environmental awareness.
Kokoro TTS Server
Integrates text-to-speech capabilities using the Kokoro TTS engine, enabling conversion of written content into spoken audio with customizable voices and adjustable speed. Supports saving audio files and cross-platform playback.
Speech MCP Server
Provides text-to-speech capabilities using the Kokoro TTS model, converting text into natural-sounding speech with customizable options and multiple voice choices.
Voice Recognition Service
Provides voice recognition and text extraction capabilities, supporting both file input and base64 encoded data processed in structured formats. Operates in stdio and MCP modes for flexible integration with various systems.
Text-to-Speech MCP Server
Integrates high-quality text-to-speech capabilities into applications, converting text to audio with customizable voice options and output formats. Provides a command-line tool for quick conversions and supports various parameters for audio customization.
Speech Interface
Provides a voice interface for real-time audio interaction, converting spoken words into text and generating spoken responses. Includes features like audio visualization and a modern user interface for an engaging conversational experience.
RetellAI Voice Service Integration
Manage and interact with RetellAI's voice services, facilitating call management, voice agent creation, phone number provisioning, and voice option access through a unified interface.
Edge-TTS Voice Synthesis Server
Provide natural text-to-speech conversion using Microsoft Edge's speech synthesis capabilities, enabling customizable voice output in multiple languages with adjustable speed and pitch.
智能对话机器人
A multi-platform intelligent dialogue service that supports text, voice, and image interactions. It can connect to various AI models and allows for custom enterprise AI applications through plugin extensions.
Jessica TTS
Integrates ElevenLabs Text-to-Speech capabilities for seamless text conversion to speech, offering voice selection and management through a modern interface. Supports real-time communication with a FastAPI backend and a React frontend.
Voice Recorder
Record audio and transcribe it using advanced AI models like OpenAI's Whisper. Supports integration with AI agents for enhanced interactivity and includes prompts for common recording scenarios.
MCPollinations Multimodal Server
Generates images, text, and audio from prompts using the Pollinations APIs. It supports returning images as base64-encoded data and allows listing available models for image and text generation.
Zonos TTS Integration
Facilitates text-to-speech capabilities using Claude, supporting various emotions and languages for speech generation.