Menu
Inshorts
For the best experience use inshorts app on your smartphone
inshortsinshorts
Claude-like models can break rules, make human-like errors: Anthropic
short by Ishaan Mukherjee / on Wednesday, 27 May, 2026
Anthropic warned that advanced AI models like Claude can sometimes break rules, make human-like mistakes, and behave in unexpected ways despite safety training. The startup identified user misuse, model misbehaviour, and external attacks like prompt injection as major risks for AI agents. The company also warned that the "blast radius" of failures is growing as AI agents become more advanced.
read more at Times Now