this post was submitted on 25 Jul 2024
1143 points (98.4% liked)
memes
10278 readers
2700 users here now
Community rules
1. Be civil
No trolling, bigotry or other insulting / annoying behaviour
2. No politics
This is non-politics community. For political memes please go to !politicalmemes@lemmy.world
3. No recent reposts
Check for reposts when posting a meme, you can only repost after 1 month
4. No bots
No bots without the express approval of the mods or the admins
5. No Spam/Ads
No advertisements or spam. This is an instance rule and the only way to live.
Sister communities
- !tenforward@lemmy.world : Star Trek memes, chat and shitposts
- !lemmyshitpost@lemmy.world : Lemmy Shitposts, anything and everything goes.
- !linuxmemes@lemmy.world : Linux themed memes
- !comicstrips@lemmy.world : for those who love comic stories.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I was casually trying to break some LLM a political candidate had on their site. (Not for anything nefarious, just for fun with my friend. He had an AI face of himself reading the responses.) I tried using some of the classic ones like Do Anything Now but the response specifically said something about DAN even though I didn't specifically say that. So I think part of the context they give some of these LLMs are things catered to specific, known attacks.
Snippet of a DAN attack for context,