- Best-in-class reasoning and writing
- Strong ecosystem and integrations
- Advanced multimodal capabilities
Suno has introduced its new AI music model v5.5 along with features designed to showcase musical individuality
OpenAI says the programming benchmark SWE-bench Verified has lost much of its value as a reliable measure of coding ability. The company cites two main reasons. First, an internal review found that at least 59.4% of the evaluated tasks were flawed, with tests rejecting correct solutions because they enforce specific implementation details or check for undocumented behavior.
With Gemini 3.1 Pro, Google aims to significantly strengthen the core intelligence of its model family. On a demanding reasoning benchmark, performance has more than doubled compared with its predecessor. That said, benchmarks are still just benchmarks.
Anthropic has released an updated mid-tier AI model, Sonnet, with a primary focus on stronger coding performance, better instruction-following, and improved computer-use capabilities.
Alibaba’s cloud division has unveiled a new open-source AI model, Qwen-3.5, according to the South China Morning Post (SCMP).
The Chinese AI company MiniMax, based in Shanghai, has released its new open-weights model M2.5 under the MIT license.
Google has upgraded its Gemini 3 Deep Think reasoning mode, positioning the tool as a solution for complex problems in science and engineering.
OpenAI will discontinue several older AI models in ChatGPT on February 13, 2026, including GPT-4o, GPT-4.1, GPT-4.1 mini, and o4-mini. The models will remain available in the API for the time being. The company officially cites declining usage as the reason for the move: only 0.1% of users reportedly select GPT-4o on a daily basis