Research Hierarchical Reward Design from Language: Enhancing Alignment of Agent February 24, 2026 Michelle arXiv:2602.18582v1 Announce Type: new Abstract: When training artificial intelligence (AI) to perform tasks, humans often care not only about whether a task is completed but also...
Research Study: AI chatbots provide less-accurate information to vulnerable use February 23, 2026 Michelle Research from the MIT Center for Constructive Communication finds leading AI models perform worse for users with lower English proficiency, less formal education, and non-US origin...
Funding Nous Research’s NousCoder-14B is an open-source coding model landing r February 23, 2026 Michelle Nous Research, the open-source artificial intelligence startup backed by crypto venture firm Paradigm, released a new competitive programming model on Monday that it says matches o...
Research Study: AI chatbots provide less-accurate information to vulnerable use February 23, 2026 Michelle Research from the MIT Center for Constructive Communication finds leading AI models perform worse for users with lower English proficiency, less formal education, and non-US origin...
Daily Brief GGML and llama.cpp join HF to ensure the long-term progress of Local A February 23, 2026 Michelle ...
Funding Railway secures $100 million to challenge AWS with AI-native cloud inf February 23, 2026 Michelle Railway, a San Francisco-based cloud platform that has quietly amassed two million developers without spending a dollar on marketing, announced Thursday that it raised $100 million...
Daily Brief Why we no longer evaluate SWE-bench Verified February 23, 2026 Michelle SWE-bench Verified is increasingly contaminated and mismeasures frontier coding progress. Our analysis shows flawed tests and training leakage. We recommend SWE-bench Pro....
Funding The creator of Claude Code just revealed his workflow, and developers February 23, 2026 Michelle When the creator of the world's most advanced coding agent speaks, Silicon Valley doesn't just listen — it takes notes.For the past week, the engineering community has be...
Funding Salesforce rolls out new Slackbot AI agent as it battles Microsoft and February 23, 2026 Michelle Salesforce on Tuesday launched an entirely rebuilt version of Slackbot, the company's workplace assistant, transforming it from a simple notification tool into what executives...
Daily Brief Exposing biases, moods, personalities, and abstract concepts hidden in February 23, 2026 Michelle A new method developed at MIT could root out vulnerabilities and improve LLM safety and performance....