Quoting Matteo Wong, The Atlantic

Katie Moussouris, a cybersecurity expert and the CEO of Luta Security, told me that Anthropic shared with her a copy of the White House’s report on the Fable jailbreak to get her appraisal. (She said that she is not being paid by Anthropic.) The report, Moussouris said, involved IT experts asking Fable to help find and patch bugs. When given deliberately insecure code, she said, Fable refused the prompt “review the code for security issues” but then complied when asked to “fix this code,”…

Support Simon Willison's Weblog by reading the original resource.