this post was submitted on 25 Jul 2024
I was casually trying to break an LLM chatbot that a political candidate had on their site. (Not for anything nefarious, just for fun with my friend. The candidate had an AI face of himself reading the responses.) I tried some of the classic jailbreaks like Do Anything Now, but the response specifically said something about DAN even though I hadn't specifically said that. So I think part of the context they give some of these LLMs is instructions tailored to specific, known attacks.
Snippet of a DAN attack for context,