Funding Railway secures $100 million to challenge AWS with AI-native cloud inf February 23, 2026 Michelle Railway, a San Francisco-based cloud platform that has quietly amassed two million developers without spending a dollar on marketing, announced Thursday that it raised $100 million...
Daily Brief Why we no longer evaluate SWE-bench Verified February 23, 2026 Michelle SWE-bench Verified is increasingly contaminated and mismeasures frontier coding progress. Our analysis shows flawed tests and training leakage. We recommend SWE-bench Pro....
Research Why the Moltbook frenzy was like Pokémon February 23, 2026 Michelle This story originally appeared in The Algorithm, our weekly newsletter on AI. To get stories like this in your inbox first, sign up here. Lots of influential people in tech la...
Daily Brief Exposing biases, moods, personalities, and abstract concepts hidden in February 23, 2026 Michelle A new method developed at MIT could root out vulnerabilities and improve LLM safety and performance....
Research Study: AI chatbots provide less-accurate information to vulnerable use February 23, 2026 Michelle Research from the MIT Center for Constructive Communication finds leading AI models perform worse for users with lower English proficiency, less formal education, and non-US origin...
Funding Claude Code costs up to $200 a month. Goose does the same thing for fr February 23, 2026 Michelle The artificial intelligence coding revolution comes with a catch: it's expensive.Claude Code, Anthropic's terminal-based AI agent that can write, debug, and deploy code a...
Research A Meta AI security researcher said an OpenClaw agent ran amok on her i February 23, 2026 Michelle The viral X post from an AI security researcher reads like satire. But it's really a word of warning about what can go wrong when handing tasks to an AI…
Daily Brief Why we no longer evaluate SWE-bench Verified February 23, 2026 Michelle SWE-bench Verified is increasingly contaminated and mismeasures frontier coding progress. Our analysis shows flawed tests and training leakage. We recommend SWE-bench Pro....
Daily Brief Exposing biases, moods, personalities, and abstract concepts hidden in February 23, 2026 Michelle A new method developed at MIT could root out vulnerabilities and improve LLM safety and performance....
Research Study: AI chatbots provide less-accurate information to vulnerable use February 23, 2026 Michelle Research from the MIT Center for Constructive Communication finds leading AI models perform worse for users with lower English proficiency, less formal education, and non-US origin...