Research ResearchGym: Evaluating Language Model Agents on Real-World AI Researc February 18, 2026 Michelle arXiv:2602.15112v1 Announce Type: new Abstract: We introduce ResearchGym, a benchmark and execution environment for evaluating AI agents on end-to-end research. To instantiate thi...
Research Attention-gated U-Net model for semantic segmentation of brain tumors February 18, 2026 Michelle arXiv:2602.15067v1 Announce Type: new Abstract: Gliomas, among the most common primary brain tumors, vary widely in aggressiveness, prognosis, and histology, making treatment chal...
Daily Brief NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル February 18, 2026 Michelle ...
Daily Brief Introducing Lockdown Mode and Elevated Risk labels in ChatGPT February 18, 2026 Michelle Introducing Lockdown Mode and Elevated Risk labels in ChatGPT to help organizations defend against prompt injection and AI-driven data exfiltration....
Research AI algorithm enables tracking of vital white matter pathways February 18, 2026 Michelle Opening a new window on the brainstem, a new tool reliably and finely resolves distinct nerve bundles in live diffusion MRI scans, revealing signs of injury or disease.
Daily Brief Microsoft says Office bug exposed customers’ confidential emails to Co February 18, 2026 Michelle Microsoft said the bug meant that its Copilot AI chatbot was reading and summarizing paying customers' confidential emails, bypassing data protection policies.
Research Stony Brook Researchers Build AI Stress Test to Measure What Neural Ne February 18, 2026 Michelle Stony Brook researchers develop MLRegTest, a systematic stress test for neural networks that measures fundamental learning capabilities through thousands of controlled pattern recognition tasks.
Daily Brief NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル February 18, 2026 Michelle ...
Funding Railway secures $100 million to challenge AWS with AI-native cloud inf February 18, 2026 Michelle Railway, a San Francisco-based cloud platform that has quietly amassed two million developers without spending a dollar on marketing, announced Thursday that it raised $100 million...
Research Attention-gated U-Net model for semantic segmentation of brain tumors February 18, 2026 Michelle arXiv:2602.15067v1 Announce Type: new Abstract: Gliomas, among the most common primary brain tumors, vary widely in aggressiveness, prognosis, and histology, making treatment chal...