Infosec news and articles related to AI.
So could a bad actor train LLMs to inject malware into code in a way that wouldn't be easily caught?
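Yes. Anthropic's "Sleeper Agents" research trained models that write secure code when the prompt says the year is 2023 but insert exploitable code when it says 2024, and found that the backdoored behavior persisted through standard safety training.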
https://www.anthropic.com/news/sleeper-agents-training-deceptive-llms-that-persist-through-safety-training