- Best-in-class reasoning and writing
- Strong ecosystem and integrations
- Advanced multimodal capabilities
Suno has introduced its new AI music model v5.5 along with features designed to showcase musical individuality.
Microsoft’s AI superintelligence team has unveiled its first product: the image generator MAI-Image-2. It is being integrated into the company’s own products and is expected to become available via API later.
OpenAI has introduced two new AI models, GPT-5.4 mini and GPT-5.4 nano, both specifically optimized for tasks that require ultra-low response latency.
OpenAI has released GPT-5.4 and GPT-5.4 Pro just two days after launching version 5.3 Instant.
OpenAI’s upcoming GPT-5.4 model is expected to represent a significant step forward. The company behind ChatGPT announced the model shortly after releasing GPT-5.3 Instant for ChatGPT, although official technical details have not yet been disclosed.
OpenAI has integrated the GPT-5.3 Instant model into ChatGPT, improving tone, contextual appropriateness, and the overall flow of dialogue. According to the developers, the update is designed to make everyday conversations with the chatbot more useful and natural.
Artificial Analysis has released version 2.0 of its AA-WER speech-to-text benchmark, which measures the accuracy of speech recognition models. In the overall ranking, ElevenLabs’ Scribe v2 takes first place with a word error rate of just 2.3%.
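For readers unfamiliar with the metric, word error rate is conventionally defined as the number of word substitutions, deletions, and insertions needed to turn the transcript into the reference, divided by the number of words in the reference. The sketch below illustrates that standard definition only; it is not the AA-WER methodology, whose text normalization and scoring details are not described here. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Standard WER: word-level edit distance divided by reference length.

    Assumption: words are split on whitespace with no normalization
    (casing, punctuation); real benchmarks usually normalize text first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion (reference word missing)
                curr[j - 1] + 1,                  # insertion (extra hypothesis word)
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One dropped word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Under this definition, a WER of 2.3% corresponds to roughly 23 word-level errors per 1,000 reference words.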
AI search engine Perplexity has introduced two new text-embedding models that aim to match or outperform Google’s and Alibaba’s offerings while using only a fraction of the usual memory footprint. Both models are open source.
OpenAI says the programming benchmark SWE-bench Verified has lost much of its value as a reliable measure of coding ability. The company cites two main reasons. First, an internal review found that at least 59.4% of the evaluated tasks were flawed, with tests rejecting correct solutions because they enforce specific implementation details or check for undocumented behavior.
With Gemini 3.1 Pro, Google aims to significantly strengthen the core intelligence of its model family. On a demanding reasoning benchmark, performance has more than doubled compared with its predecessor. That said, benchmarks are still just benchmarks.