15 posts

Large Language Models

Latest posts
Stanford Study Reveals AI-Language Models Show Varying Consistency on Controversial Topics
Stanford Study Reveals AI-Language Models Show Varying Consistency on Controversial Topics

LLMs show high consistency on neutral topics like Thanksgiving but become variable on controversial issues. Larger models outperform smaller ones in reliability.

by AI-360
Mistral AI Introduces Batch API, Cuts Processing Costs by Half
Mistral AI Introduces Batch API, Cuts Processing Costs by Half

Users can process high-volume requests at half the cost of regular API calls, with applications in sentiment analysis, translation, and vector embedding.

by AI-360
xAI Launches Grok API Public Beta, Offering $25 Monthly Free Credits to Developers
xAI Launches Grok API Public Beta, Offering $25 Monthly Free Credits to Developers

"New Grok-beta model features 128k token context and function calling, with REST API compatibility with OpenAI/Anthropic. Multi-modal version coming soon."

by AI-360
OpenAI Launches Enhanced ChatGPT Web Search with Real-Time Information and Source Links
OpenAI Launches Enhanced ChatGPT Web Search with Real-Time Information and Source Links

"The search uses fine-tuned GPT-4 with novel synthetic data generation. Users can trigger web searches automatically or manually, with linked source citations."

by AI-360
Anthropic Urges Swift Government Action on AI Regulation, Citing Rising Risks
Anthropic Urges Swift Government Action on AI Regulation, Citing Rising Risks

"AI systems went from solving 2% of real coding problems to 49% in one year. Anthropic warns the window for safe regulation is closing fast."

by AI-360
New SimpleQA Benchmark Aims to Test Language Models' Factual Accuracy
New SimpleQA Benchmark Aims to Test Language Models' Factual Accuracy

OpenAI's SimpleQA tests 4,326 factual questions with 3% error rate. GPT-4o scores under 40%, showing larger models excel while deeper thinking ones opt to decline.

by AI-360
Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Great! You've successfully signed up.
Great! You've successfully signed up.
Welcome back! You've successfully signed in.
Success! You now have access to additional content.