Researchers at Georgetown University have analyzed thousands of procurement requests issued by China’s People’s Liberation Army (PLA). The documents reveal how broadly Beijing is already testing artificial intelligence for military use—from drone swarms and deepfake tools to autonomous decision-making systems.
Artificial Analysis has released version 2.0 of its AA-WER speech-to-text benchmark, which measures the accuracy of speech recognition models. In the overall ranking, ElevenLabs’ Scribe v2 takes first place with a word error rate of just 2.3%.
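For readers unfamiliar with the metric, word error rate is conventionally the number of word-level substitutions, deletions, and insertions needed to turn a transcript into the reference text, divided by the number of words in the reference. The following is a minimal sketch of that standard definition, not Artificial Analysis' implementation: the function name `word_error_rate` and the simple lowercase-and-split normalization are assumptions made for illustration, and the benchmark's own text normalization is not described here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / reference word count.

    Assumption: text is normalized only by lowercasing and splitting on
    whitespace. Real benchmarks typically apply more elaborate normalization.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words (word-level Levenshtein).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word is missing
                curr[j - 1] + 1,     # insertion: extra hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on a mat"))
```

Under this definition, a word error rate of 2.3% corresponds to roughly 23 word-level errors per 1,000 reference words.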
AI search engine Perplexity has introduced two new text-embedding models that aim to match or outperform Google’s and Alibaba’s offerings while using only a fraction of the usual memory footprint. Both models are open source.
Anthropic is expanding Claude’s agentic office capabilities, allowing the model to switch autonomously between Excel and PowerPoint — for example, running an analysis in a spreadsheet and directly turning the results into a presentation. At the same time, Anthropic is extending Cowork for Enterprise customers with private plugin marketplaces, enabling administrators to curate and distribute custom plugin collections to specific teams. These plugins turn Claude into specialized AI agents for different departments, with new templates covering HR, design, engineering, finance, and wealth management.
Chinese AI startup DeepSeek has reportedly trained its latest AI model on Nvidia’s most powerful Blackwell chips, despite U.S. export restrictions. Reuters cites a senior Trump administration official, who said the model is expected to be released as early as next week. Rumors of chip smuggling involving DeepSeek had already surfaced late last year.
OpenAI has announced two API updates aimed at developers. The new model gpt-realtime-1.5 for the Realtime API is designed to handle voice commands more reliably. According to OpenAI’s internal tests, transcription accuracy for numbers and letters improved by more than 10%, performance on logical audio tasks increased by 5%, and instruction-following accuracy rose by 7%. The underlying audio model has also been updated to version 1.5.