Daily Brief GGML and llama.cpp join HF to ensure the long-term progress of Local A February 23, 2026 Michelle ...
Daily Brief Google’s Cloud AI lead on the three frontiers of model capability February 23, 2026 Michelle AI models are pushing against three frontiers at once: raw intelligence, response time, and a third quality you might call "extensibility."...
Daily Brief Anthropic accuses Chinese AI labs of mining Claude as US debates AI ch February 23, 2026 Arthur Anthropic accuses DeepSeek, Moonshot, and MiniMax of using 24,000 fake accounts to distill Claude’s AI capabilities, as U.S. officials debate export controls aimed at slowing China...
Daily Brief Why we no longer evaluate SWE-bench Verified February 23, 2026 Arthur SWE-bench Verified is increasingly contaminated and mismeasures frontier coding progress. Our analysis shows flawed tests and training leakage. We recommend SWE-bench Pro....
Daily Brief Making AI Work, MIT Technology Review’s new AI newsletter, is here February 23, 2026 Arthur For years, our newsroom has explored AI’s limitations and potential dangers, as well as its growing energy needs. And our reporters have looked closely at how generative tools are...
Daily Brief Anthropic accuses Chinese AI labs of mining Claude as US debates AI ch February 23, 2026 Michelle Anthropic accuses DeepSeek, Moonshot, and MiniMax of using 24,000 fake accounts to distill Claude’s AI capabilities, as U.S. officials debate export controls aimed at slowing China...
Daily Brief Google’s Cloud AI lead on the three frontiers of model capability February 23, 2026 Arthur AI models are pushing against three frontiers at once: raw intelligence, response time, and a third quality you might call "extensibility."...
Daily Brief OpenAI calls in the consultants for its enterprise push February 23, 2026 Michelle OpenAI is partnering with four consulting giants in an effort to see more adoption of its OpenAI Frontier AI agent platform.
Daily Brief Why we no longer evaluate SWE-bench Verified February 23, 2026 Arthur SWE-bench Verified is increasingly contaminated and mismeasures frontier coding progress. Our analysis shows flawed tests and training leakage. We recommend SWE-bench Pro.