AI-360

Latest posts
New SimpleQA Benchmark Aims to Test Language Models' Factual Accuracy
New SimpleQA Benchmark Aims to Test Language Models' Factual Accuracy

OpenAI's SimpleQA tests 4,326 factual questions with 3% error rate. GPT-4o scores under 40%, showing larger models excel while deeper thinking ones opt to decline.

by AI-360
MIT Launches $20 Million AI Fellowship Programme for Cross-Disciplinary Research
MIT Launches $20 Million AI Fellowship Programme for Cross-Disciplinary Research

New MIT postdoc programme pairs AI experts with mentors in biology, physics, music & more, funded by $20M Tayebati gift to advance cross-disciplinary AI applications.

by AI-360
Google Announces New AI Processor Trillium Available in Preview
Google Announces New AI Processor Trillium Available in Preview

Google's Trillium TPU achieves 4.7x peak compute vs prior gen with 67% better energy efficiency, now powering Search, YouTube & DeepMind's LLMs in preview.

by AI-360
Upgraded Claude 3.5 Sonnet Achieves New High Score on Software Engineering Benchmark
Upgraded Claude 3.5 Sonnet Achieves New High Score on Software Engineering Benchmark

Using Edit & Bash tools, newest Claude 3.5 Sonnet hits 49% on SWE-bench's GitHub issue tests, despite challenges like hidden tests & high token costs.

by AI-360
NVIDIA Sets Third-Quarter Financial Results Call for November 20
NVIDIA Sets Third-Quarter Financial Results Call for November 20

NVIDIA's Q3 fiscal 2025 results to be discussed Nov 20, with CFO Colette Kress releasing commentary prior. Q&A limited to analysts and institutional investors.

by AI-360
ChatGPT Outperforms Physicians in Medical Diagnostic Reasoning Steps, Stanford Study Shows
ChatGPT Outperforms Physicians in Medical Diagnostic Reasoning Steps, Stanford Study Shows

"ChatGPT-4 scored 92 in clinical reasoning vs physicians' 74-76. AI-assisted doctors completed diagnoses 1+ minute faster but showed no accuracy gains"

by AI-360
Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Great! You've successfully signed up.
Great! You've successfully signed up.
Welcome back! You've successfully signed in.
Success! You now have access to additional content.