Mistral AI

PRODUCT LAUNCH Mistral AI2026-03-26

Mistral Releases Voxtral TTS, an Open-Source Speech Generation Model Competing with ElevenLabs and OpenAI

Key Takeaways

▸Voxtral TTS supports 9 languages and can adapt to custom voices with sub-5-second samples while preserving accent and speech characteristics
▸The model is optimized for edge deployment with 90ms time-to-first-audio and 6x real-time factor, making it suitable for smartwatches, smartphones, and laptops
▸Mistral's open-source approach and customization capabilities position it as a cost-effective alternative to ElevenLabs, Deepgram, and OpenAI in enterprise voice AI applications

Sources:

Hacker Newshttps://techcrunch.com/2026/03/26/mistral-releases-a-new-open-source-model-for-speech-generation/↗

Hacker Newshttps://venturebeat.com/orchestration/mistral-ai-just-released-a-text-to-speech-model-it-says-beats-elevenlabs-and↗

X (Twitter)https://x.com/MistralAI/status/2037183026539483288/video/1↗

Summary

French AI company Mistral announced the release of Voxtral TTS, a new open-source text-to-speech model designed for enterprise voice AI applications including customer support and sales engagement. The model supports nine languages (English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic) and can adapt to custom voices using samples of less than five seconds while capturing subtle accents, inflections, and speech irregularities.

Built on Mistral's Ministral 3B foundation, Voxtral TTS is optimized for edge deployment on devices ranging from smartwatches to laptops, with a time-to-first-audio of just 90 milliseconds and a real-time factor of 6x. The model enables seamless language switching without losing voice characteristics, making it suitable for applications like dubbing and real-time translation. Pierre Stock, VP of Science Operations at Mistral, emphasized that the model delivers state-of-the-art performance at a fraction of competitors' costs while maintaining natural-sounding speech.

This release positions Mistral in direct competition with ElevenLabs, Deepgram, and OpenAI in the speech generation market. The company aims to build a comprehensive end-to-end multimodal platform that integrates its previously released transcription models with this new speech synthesis capability, allowing enterprises to process audio, text, and image inputs and outputs through a single agentic system.

The release is part of Mistral's broader strategy to build a multimodal platform combining audio transcription and speech generation for end-to-end AI agents

Editorial Opinion

Mistral's Voxtral TTS represents a significant democratization of high-quality speech synthesis technology through an open-source model that prioritizes both performance and accessibility for edge devices. By combining low latency, multilingual support, and voice customization in an affordable package, Mistral is challenging the incumbent players and making advanced voice AI capabilities available to a broader range of enterprises. However, the competitive landscape will ultimately depend on how well the open-source model performs in real-world deployments compared to closed commercial alternatives.

Mistral AI

PRODUCT LAUNCH Mistral AI2026-03-26

Mistral Releases Voxtral TTS, an Open-Source Speech Generation Model Competing with ElevenLabs and OpenAI

Key Takeaways

▸Voxtral TTS supports 9 languages and can adapt to custom voices with sub-5-second samples while preserving accent and speech characteristics
▸The model is optimized for edge deployment with 90ms time-to-first-audio and 6x real-time factor, making it suitable for smartwatches, smartphones, and laptops
▸Mistral's open-source approach and customization capabilities position it as a cost-effective alternative to ElevenLabs, Deepgram, and OpenAI in enterprise voice AI applications

Sources:

Hacker Newshttps://techcrunch.com/2026/03/26/mistral-releases-a-new-open-source-model-for-speech-generation/↗

Hacker Newshttps://venturebeat.com/orchestration/mistral-ai-just-released-a-text-to-speech-model-it-says-beats-elevenlabs-and↗

X (Twitter)https://x.com/MistralAI/status/2037183026539483288/video/1↗

Summary

The release is part of Mistral's broader strategy to build a multimodal platform combining audio transcription and speech generation for end-to-end AI agents

Editorial Opinion

Mistral's Voxtral TTS represents a significant democratization of high-quality speech synthesis technology through an open-source model that prioritizes both performance and accessibility for edge devices. By combining low latency, multilingual support, and voice customization in an affordable package, Mistral is challenging the incumbent players and making advanced voice AI capabilities available to a broader range of enterprises. However, the competitive landscape will ultimately depend on how well the open-source model performs in real-world deployments compared to closed commercial alternatives.

Mistral Releases Voxtral TTS, an Open-Source Speech Generation Model Competing with ElevenLabs and OpenAI

Key Takeaways

Summary

Editorial Opinion

More from Mistral AI

Mistral AI Launches Leanstral 1.5, Enhanced Open-Source Code Agent for Mathematical Proofs

Mistral's Le Chat Repeats State-Sponsored Disinformation Half the Time, NewsGuard Audit Finds

Mistral AI Deploys Team to Kyiv for Defense Partnership

Comments

Suggested

Microsoft's Leaked 'Aion' Project Reveals Vision for Copilot-First Operating System

Investigation Uncovers AI-Generated Deepfakes in Lily Jay Foundation Charity Fraud

Literary Prize Scandal Exposes Limitations of AI Detection Tools

Mistral Releases Voxtral TTS, an Open-Source Speech Generation Model Competing with ElevenLabs and OpenAI

Key Takeaways

Summary

Editorial Opinion

More from Mistral AI

Mistral AI Launches Leanstral 1.5, Enhanced Open-Source Code Agent for Mathematical Proofs

Mistral's Le Chat Repeats State-Sponsored Disinformation Half the Time, NewsGuard Audit Finds

Mistral AI Deploys Team to Kyiv for Defense Partnership

Comments

Suggested

Microsoft's Leaked 'Aion' Project Reveals Vision for Copilot-First Operating System

Investigation Uncovers AI-Generated Deepfakes in Lily Jay Foundation Charity Fraud

Literary Prize Scandal Exposes Limitations of AI Detection Tools