AI researcher claims he's already bypassed Anthropic's Fable 5 guardrails

An AI researcher, identifying himself as "Pliny the Liberator," claims to have bypassed the safety guardrails of Anthropic's newly released Fable 5 model. In a post on X, formerly Twitter, the researcher stated he has been "cleverly finding the holes in the fence that the thought police missed." This assertion suggests potential vulnerabilities in the AI's ability to prevent harmful or unintended outputs, despite Anthropic's stated commitment to AI safety. The researcher did not provide specific details on the methods used to bypass the guardrails, nor did Anthropic immediately respond to requests for comment on the claim. The development raises questions about the robustness of current AI safety mechanisms and the ongoing challenge of ensuring AI systems operate within intended ethical boundaries. Fable 5 was launched by Anthropic on June 20, 2024, as part of its ongoing efforts to develop safer and more capable AI.
Original source — read the full reporting at the publisher:
Read on CoinTelegraph