At an event for pharmaceutical executives, biotech founders, and researchers on Tuesday, Anthropic announced Claude Science, a major new product intended to support scientific research in the same way that Claude Code supports software engineering. Like Claude Code, Claude Science can autonomously carry out meaningful work when given concise, high-level instructions, and it has access…
Google DeepMind’s latest update to a top Gemini AI model includes a dial to control how much the system “thinks” through a response. The new feature is ostensibly designed to save money for developers, but it also concedes a problem: Reasoning models, the tech world’s new obsession, are prone to overthinking, burning money and energy…
On Thursday, weeks after launching its most powerful AI model yet, Gemini 2.5 Pro, Google published a technical report showing the results of its internal safety evaluations. However, the report is light on the details, experts say, making it difficult to determine which risks the model might pose. Technical reports provide useful — and unflattering, […]
China is cracking down on how automakers advertise driver assistance features, banning terms like “autonomous driving,” “self-driving,” and “smart driving,” Reuters reported, citing a transcript of a meeting between the government and industry representatives. The updated rule will also prohibit automakers from using software updates to roll out improvements to advanced driver assistance systems in vehicles […]
Chatbot Arena, the crowdsourced benchmarking project major AI labs rely on to test and market their AI models, is forming a company called Arena Intelligence Inc., reports Bloomberg. In a blog post published Thursday, Chatbot Arena said that the company will “give [it] the resources to improve [its platform] significantly over what it is today.” […]