BotBeat
...
← Back

> ▌

Mistral AIMistral AI
PRODUCT LAUNCHMistral AI2026-03-26

Mistral Releases Voxtral TTS, an Open-Source Speech Generation Model Competing with ElevenLabs and OpenAI

Key Takeaways

  • ▸Voxtral TTS supports 9 languages and can adapt to custom voices with sub-5-second samples while preserving accent and speech characteristics
  • ▸The model is optimized for edge deployment with 90ms time-to-first-audio and 6x real-time factor, making it suitable for smartwatches, smartphones, and laptops
  • ▸Mistral's open-source approach and customization capabilities position it as a cost-effective alternative to ElevenLabs, Deepgram, and OpenAI in enterprise voice AI applications
Sources:
Hacker Newshttps://techcrunch.com/2026/03/26/mistral-releases-a-new-open-source-model-for-speech-generation/↗
Hacker Newshttps://venturebeat.com/orchestration/mistral-ai-just-released-a-text-to-speech-model-it-says-beats-elevenlabs-and↗
X (Twitter)https://x.com/MistralAI/status/2037183026539483288/video/1↗

Summary

French AI company Mistral announced the release of Voxtral TTS, a new open-source text-to-speech model designed for enterprise voice AI applications including customer support and sales engagement. The model supports nine languages (English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic) and can adapt to custom voices using samples of less than five seconds while capturing subtle accents, inflections, and speech irregularities.

Built on Mistral's Ministral 3B foundation, Voxtral TTS is optimized for edge deployment on devices ranging from smartwatches to laptops, with a time-to-first-audio of just 90 milliseconds and a real-time factor of 6x. The model enables seamless language switching without losing voice characteristics, making it suitable for applications like dubbing and real-time translation. Pierre Stock, VP of Science Operations at Mistral, emphasized that the model delivers state-of-the-art performance at a fraction of competitors' costs while maintaining natural-sounding speech.

This release positions Mistral in direct competition with ElevenLabs, Deepgram, and OpenAI in the speech generation market. The company aims to build a comprehensive end-to-end multimodal platform that integrates its previously released transcription models with this new speech synthesis capability, allowing enterprises to process audio, text, and image inputs and outputs through a single agentic system.

  • The release is part of Mistral's broader strategy to build a multimodal platform combining audio transcription and speech generation for end-to-end AI agents

Editorial Opinion

Mistral's Voxtral TTS represents a significant democratization of high-quality speech synthesis technology through an open-source model that prioritizes both performance and accessibility for edge devices. By combining low latency, multilingual support, and voice customization in an affordable package, Mistral is challenging the incumbent players and making advanced voice AI capabilities available to a broader range of enterprises. However, the competitive landscape will ultimately depend on how well the open-source model performs in real-world deployments compared to closed commercial alternatives.

Generative AIMultimodal AISpeech & AudioProduct LaunchOpen Source

More from Mistral AI

Mistral AIMistral AI
UPDATE

Mistral AI Launches Leanstral 1.5, Enhanced Open-Source Code Agent for Mathematical Proofs

2026-07-03
Mistral AIMistral AI
RESEARCH

Mistral's Le Chat Repeats State-Sponsored Disinformation Half the Time, NewsGuard Audit Finds

2026-06-16
Mistral AIMistral AI
PARTNERSHIP

Mistral AI Deploys Team to Kyiv for Defense Partnership

2026-06-16

Comments

Suggested

MicrosoftMicrosoft
RESEARCH

Microsoft's Leaked 'Aion' Project Reveals Vision for Copilot-First Operating System

2026-07-04
OpenAIOpenAI
INDUSTRY REPORT

Investigation Uncovers AI-Generated Deepfakes in Lily Jay Foundation Charity Fraud

2026-07-04
PangramPangram
INDUSTRY REPORT

Literary Prize Scandal Exposes Limitations of AI Detection Tools

2026-07-04
← Back to news
© 2026 BotBeat
AboutPrivacy PolicyTerms of ServiceContact Us