Research A Theoretical Framework for Adaptive Utility-Weighted Benchmarking February 16, 2026 Arthur arXiv:2602.12356v1 Announce Type: new Abstract: Benchmarking has long served as a foundational practice in machine learning and, increasingly, in modern AI systems such as large l...
Research GT-HarmBench: Benchmarking AI Safety Risks Through the Lens of Game Th February 16, 2026 Arthur arXiv:2602.12316v1 Announce Type: new Abstract: Frontier AI systems are increasingly capable and deployed in high-stakes multi-agent environments. However, existing AI safety benc...
Funding Claude Code costs up to $200 a month. Goose does the same thing for fr February 16, 2026 Arthur The artificial intelligence coding revolution comes with a catch: it's expensive.Claude Code, Anthropic's terminal-based AI agent that can write, debug, and deploy code a...
Funding Railway secures $100 million to challenge AWS with AI-native cloud inf February 16, 2026 Arthur Railway, a San Francisco-based cloud platform that has quietly amassed two million developers without spending a dollar on marketing, announced Thursday that it raised $100 million...
Business Fractal Analytics’ muted IPO debut signals persistent AI fears in Indi February 16, 2026 Arthur As India's first AI company to IPO, Fractal Analytics didn't have a stellar first day on the public markets, as enthusiasm for the technology collided with jittery investors in the...
Funding Railway secures $100 million to challenge AWS with AI-native cloud inf February 16, 2026 Arthur Railway, a San Francisco-based cloud platform that has quietly amassed two million developers without spending a dollar on marketing, announced Thursday that it raised $100 million...
Daily Brief Fractal Analytics’ muted IPO debut signals persistent AI fears in Indi February 16, 2026 Arthur As India's first AI company to IPO, Fractal Analytics didn't have a stellar first day on the public markets, as enthusiasm for the technology collided with jittery investors in the...
Daily Brief Introducing Lockdown Mode and Elevated Risk labels in ChatGPT February 16, 2026 Arthur Introducing Lockdown Mode and Elevated Risk labels in ChatGPT to help organizations defend against prompt injection and AI-driven data exfiltration....
Daily Brief OpenAI Retires GPT-4o Amid Mental Health Lawsuits as GPT-5.3-Codex-Spa February 16, 2026 Arthur OpenAI faces eight consolidated lawsuits alleging GPT-4o's 'highly humanlike, sycophantic behavior' contributed to mental health crises, while launching a blazing-fast coding model.