Anthropic

Latest posts
Many-Shot Jailbreaking: A New Vulnerability in LLMs

Long-context LLMs are vulnerable to "many-shot jailbreaking," in which long sequences of faux dialogues in the prompt override safety training. Mitigation efforts are ongoing but challenging.

by AI-360