Mistral Launches Voxtral Transcribe 2 to Undercut Speech-to-Text Rivals on Price

Details: By Daniel Mercer; Category: Models; 5 m; 05 February 2026; 198

Mistral AI aims to undercut competitors on price in speech recognition with Voxtral Transcribe 2. The second generation of its speech-to-text models starts at $0.003 per minute and, according to Mistral, delivers higher accuracy than models such as GPT-4o mini Transcribe, Gemini 2.5 Flash, and Deepgram Nova. The model family includes two variants: Voxtral Mini Transcribe V2, designed for processing large audio files, and Voxtral Realtime, built for real-time applications with latency under 200 milliseconds. Voxtral Realtime, which costs twice as much, uses a dedicated streaming architecture that transcribes audio as it arrives, targeting use cases such as voice assistants, live captions, and call center analytics

Both new models support 13 languages, including German, English, and Chinese. New features include speaker diarization, word-level timestamps, and support for recordings of up to three hours. Voxtral Realtime is available as open weights under the Apache 2.0 license on Hugging Face as well as via API, while Voxtral Mini Transcribe V2 is accessible only through Le Chat, the Mistral API, and a playground. Mistral introduced the first generation of Voxtral in July 2025

About The Hosts

Daniel Mercer

AI Research Contributor

Daniel Mercer is an AI research contributor specializing in large language models, benchmarking, and multimodal systems. He writes about model capabilities, limitations, and real-world performance across leading AI assistants and platforms.

AI News

Accenture Tracks AI Tool Usage and Ties Adoption to Promotions

Adobe Firefly Introduces Unlimited AI Image and Video Generation for Subscribers

Adobe Unveils CX Enterprise AI Agent Platform as It Searches for a New CEO

AGI May Arrive by 2026–2027, Warns Anthropic CEO Dario Amodei

AI & Society

AI Agents Create a Lobster Religion on Moltbook

AI Boom Drives Cybersecurity Hiring Despite Tech Sector Layoffs

AI Could Trigger a Major U.S. Economic Crisis by 2028, Citrini Research Warns

AI Is Increasing Workload Instead of Reducing It, ActivTrak Study Finds

AI Insights

Adobe Reinvents Document Work with Acrobat Studio and AI

AI agents could disrupt ads and reshape internet commerce

AI as a Role Model for Generation Alpha: Promise, Risks, and the Future of Childhood

AI as a Toy: Why Humanity Always Misuses New Technology First

Mistral Launches Voxtral Transcribe 2 to Undercut Speech-to-Text Rivals on Price

About The Hosts

More From Daniel Mercer

Policy & Security

OpenAI Launches GPT-5.5-Cyber for Verified Security Professionals

Models

Zhipu AI Releases GLM-5.2 Open-Source Model With 1M Token Context

Platforms

Apple Introduces Siri AI With Apple Intelligence at WWDC 2026

Work

ChatGPT Adds Job Search and Resume Tools for Career Support

Models

OpenAI Updates ChatGPT and Plans to Retire Older Models

Industry

DeepSeek Builds Code Harness to Rival Claude Code and Codex

Models

Cursor Releases Composer 2.5 AI Coding Model Based on Kimi K2.5

Platforms

Meta Launches Incognito Chat for Private AI Conversations

Culture

Richard Dawkins Spent Two Days Trying to Prove Claude Isn't Conscious — and Changed His Mind

Platforms

OpenAI, NVIDIA, AMD, Microsoft, Intel, and Broadcom Unveil MRC — New Networking Protocol for AI Supercomputers

Categories

AI News

Categories

AI & Society

Categories

AI Insights

Mistral Launches Voxtral Transcribe 2 to Undercut Speech-to-Text Rivals on Price

About The Hosts

More From Daniel Mercer