AI-360
The system automates converting meeting transcripts to PRDs and development tickets using specialized agents powered by Mistral Large 2, eliminating manual effort.
Med-Gemini, a multimodal model fine-tuned on de-identified medical data, achieves 91.1% accuracy on medical licensing exams and interprets 3D scans.
Anthropic's Claude 3.7 Sonnet sets new benchmarks in coding abilities. Novo Nordisk uses it to reduce clinical report writing from 12 weeks to 10 minutes.
Claude 3.7 Sonnet, the first hybrid reasoning model, will be tested by scientists to compress decades of scientific progress into shorter timeframes.
Research shows AI can accelerate education innovation by simulating interventions before human evaluation—potentially reducing timelines from decades to years.
GPT-4.5 achieves 62.5% accuracy on SimpleQA evaluation and reduces hallucination rates to 37.1%, compared to GPT-4o's 61.8% hallucination rate.