Research Scaling social science research February 19, 2026 Mohsin GABRIEL is a new open-source toolkit from OpenAI that uses GPT to turn qualitative text and images into quantitative data, helping social scientists analyze research at scale.
Research AI is already making online crimes easier. It could get much worse. February 19, 2026 Arthur Anton Cherepanov is always on the lookout for something interesting. And in late August last year, he spotted just that. It was a file uploaded to VirusTotal, a site cybersecurity...
Research Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinica February 19, 2026 Arthur arXiv:2602.16050v1 Announce Type: new Abstract: Background: Large language models have demonstrated strong performance on general medical examinations, but subspecialty clinical r...
Research Accelerating Mathematical and Scientific Discovery with Gemini Deep Th February 19, 2026 Michelle Research papers point to the growing impact of Deep Think across fields
Research Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinica February 19, 2026 Michelle arXiv:2602.16050v1 Announce Type: new Abstract: Background: Large language models have demonstrated strong performance on general medical examinations, but subspecialty clinical r...
Research Gemini 3 Deep Think: Advancing science, research and engineering February 19, 2026 Michelle Our most specialized reasoning mode is now updated to solve modern science, research and engineering challenges.
Research Improving Interactive In-Context Learning from Natural Language Feedba February 19, 2026 Mohsin arXiv:2602.16066v1 Announce Type: new Abstract: Adapting one's thought process based on corrective feedback is an essential ability in human learning, particularly in collaborativ...
Research Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinica February 19, 2026 Arthur arXiv:2602.16050v1 Announce Type: new Abstract: Background: Large language models have demonstrated strong performance on general medical examinations, but subspecialty clinical r...
Research Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinica February 19, 2026 Michelle arXiv:2602.16050v1 Announce Type: new Abstract: Background: Large language models have demonstrated strong performance on general medical examinations, but subspecialty clinical r...
Research How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM February 19, 2026 Mohsin arXiv:2602.16039v1 Announce Type: new Abstract: The rapid rise of large language models (LLMs) is reshaping the landscape of automatic assessment in education. While these systems...