Research A Theoretical Framework for Adaptive Utility-Weighted Benchmarking February 16, 2026 Arthur arXiv:2602.12356v1 Announce Type: new Abstract: Benchmarking has long served as a foundational practice in machine learning and, increasingly, in modern AI systems such as large l...
Research GT-HarmBench: Benchmarking AI Safety Risks Through the Lens of Game Th February 16, 2026 Arthur arXiv:2602.12316v1 Announce Type: new Abstract: Frontier AI systems are increasingly capable and deployed in high-stakes multi-agent environments. However, existing AI safety benc...
Daily Brief Accelerating science with AI and simulations February 16, 2026 Arthur Associate Professor Rafael Gómez-Bombarelli has spent his career applying AI to improve scientific discovery. Now he believes we are at an inflection point....
Daily Brief OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Enviro February 16, 2026 Arthur ...
Funding Claude Code costs up to $200 a month. Goose does the same thing for fr February 16, 2026 Arthur The artificial intelligence coding revolution comes with a catch: it's expensive.Claude Code, Anthropic's terminal-based AI agent that can write, debug, and deploy code a...
Daily Brief How Ricursive Intelligence raised $335M at a $4B valuation in 4 months February 16, 2026 Arthur The reason why this nascent startup had VCs lining up is the founders.They are so famed in the AI world, everyone tried to hire them....
Daily Brief Introducing Lockdown Mode and Elevated Risk labels in ChatGPT February 16, 2026 Arthur Introducing Lockdown Mode and Elevated Risk labels in ChatGPT to help organizations defend against prompt injection and AI-driven data exfiltration....
Research GT-HarmBench: Benchmarking AI Safety Risks Through the Lens of Game Th February 16, 2026 Arthur arXiv:2602.12316v1 Announce Type: new Abstract: Frontier AI systems are increasingly capable and deployed in high-stakes multi-agent environments. However, existing AI safety benc...
Daily Brief Custom Kernels for All from Codex and Claude February 16, 2026 Arthur Custom has been operating at the intersection of ambition and execution, and this week’s announcement shows just how seriously the company is taking its AI ambitions. In a landscape crowded…