- Best-in-class reasoning and writing
- Strong ecosystem and integrations
- Advanced multimodal capabilities
Google is introducing an AI music generator called Lyria 3 to the Gemini app. The model can generate 30-second music tracks with vocals.
Anthropic has introduced Claude Opus 4.6, a new flagship model. For the first time, it features a one-million-token context window and is designed to locate relevant information in very large documents far more reliably than previous models.
OpenAI has released its latest coding model, GPT-5.3-Codex. According to the company, it combines the coding capabilities of GPT-5.2-Codex with the reasoning and knowledge strengths of GPT-5.2, while being 25% faster than its predecessor.
Mistral AI aims to undercut competitors on price in speech recognition with Voxtral Transcribe 2. The second generation of its speech-to-text models starts at $0.003 per minute and, according to Mistral, delivers higher accuracy than models such as GPT-4o mini Transcribe, Gemini 2.5 Flash, and Deepgram Nova. The model family includes two variants: Voxtral Mini Transcribe V2, designed for processing large audio files, and Voxtral Realtime, built for real-time applications with latency under 200 milliseconds. Voxtral Realtime, which costs twice as much, uses a dedicated streaming architecture that transcribes audio as it arrives, targeting use cases such as voice assistants, live captions, and call center analytics.
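To put the Voxtral pricing in perspective, here is a minimal cost sketch. It assumes the quoted $0.003 per minute applies as a flat rate and that "twice as much" puts Voxtral Realtime at $0.006 per minute. The function and constant names are hypothetical and not part of any Mistral API, and actual billing may differ.

```python
from decimal import Decimal

# Per-minute rates in USD. BASE_RATE is the "starts at" price quoted above.
# REALTIME_RATE is an assumption derived from "costs twice as much".
BASE_RATE = Decimal("0.003")
REALTIME_RATE = BASE_RATE * 2


def transcription_cost(minutes, realtime=False):
    """Estimate the cost in USD of transcribing the given number of audio minutes."""
    rate = REALTIME_RATE if realtime else BASE_RATE
    return Decimal(minutes) * rate


batch = transcription_cost(1000)
live = transcription_cost(1000, realtime=True)
print(f"1,000 minutes, batch: ${batch:.2f}")
print(f"1,000 minutes, realtime: ${live:.2f}")
```

Under these assumptions, 1,000 minutes of audio would cost about $3 in batch mode and about $6 with the realtime variant.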
Meta has completed the pretraining of its new AI model, codenamed “Avocado,” according to an internal memo.
Chinese developer Kuaishou has unveiled the third version of its video generation model, Kling AI.
Kling, the video generation platform from Chinese company Kuaishou, has released Kling Video Model 3.0. The new model is described as an “all-in-one creative engine” for multimodal content creation. Key features include improved consistency of characters and visual elements, generation of 15-second video clips with enhanced control, and customizable multi-shot sequences.
Alibaba has unveiled Qwen3-Coder-Next, a new open-weight AI model designed for coding agents and local development. The model was trained on 800,000 verifiable tasks executed in runnable environments. Despite its relatively compact design—80 billion parameters in total, with only 3 billion active parameters—it delivers strong results on SWE-Bench Pro, a benchmark for coding agents.
China’s AI labs are racing to roll out new models ahead of the Lunar New Year. According to the South China Morning Post, Zhipu AI and Minimax—both of which recently listed on the Hong Kong Stock Exchange—plan to update their flagship models within the next two weeks.
Google DeepMind has equipped its Gemini 3 Flash model with a new capability called “Agentic Vision.” The model is designed to actively investigate images rather than merely observe them passively, although the feature does not yet work fully automatically in all cases.