

ElevenLabs
AI voice platform — agents can speak with lifelike voices, clone voices, transcribe audio, and make phone calls.
About
ElevenLabs gives AI agents the ability to speak. The platform's text-to-speech API produces some of the most lifelike synthetic voices available, with nuanced intonation, emotional awareness, and 75ms latency on the Flash model for real-time applications. Agents can generate speech in over 70 languages, clone voices from short audio samples, transcribe audio to text, and even build full conversational AI voice agents that can make and receive phone calls.
The platform has seen massive adoption — 47 million monthly users, over 250,000 conversational AI agents built on the platform, and employees at 60% of Fortune 500 companies using the tools. ElevenLabs raised $500M at an $11B valuation in February 2026, reflecting the scale of investment in voice AI. Their official MCP server supports text-to-speech, voice cloning, transcription, and outbound voice calls directly from MCP-compatible agents.
The free tier includes 10,000 credits per month (roughly 10 minutes of generated speech) for non-commercial use. Paid plans start at $5/month with commercial licensing, scaling to enterprise tiers with SLAs and HIPAA compliance. The voice library includes over 10,000 pre-built voices.