Both new models support 13 languages, including German, English, and Chinese. New features include speaker diarization, word-level timestamps, and support for recordings of up to three hours. Voxtral Realtime is available as open weights under the Apache 2.0 license on Hugging Face as well as via API, while Voxtral Mini Transcribe V2 is accessible only through Le Chat, the Mistral API, and a playground. Mistral introduced the first generation of Voxtral in July 2025