Anthropic
Claude gains preset/custom communication styles. GitLab uses feature for business cases, docs & marketing content, with style-specific memory
Claude models on AWS Bedrock power 80% faster document search in multiple languages, while Perplexity achieves 2x processing speed with improved accuracy
New prompt improver uses chain-of-thought, XML standardisation, and prefill features to enhance prompts, boosting accuracy by 30% in classification tests.
"AI systems went from solving 2% of real coding problems to 49% in one year. Anthropic warns the window for safe regulation is closing fast."
Using Edit & Bash tools, newest Claude 3.5 Sonnet hits 49% on SWE-bench's GitHub issue tests, despite challenges like hidden tests & high token costs.
"Claude 3.5 Sonnet scored 93.7% on HumanEval, tops SWE-bench Verified, offers real-time debugging and test generation via Amazon Bedrock inference"