Out of just morbid curiosity, I've been asking an uncensored LLM absolutely heinous, disgusting things. Things I don't even want to repeat here (but I'm going to edge around them so, trigger warning if needs be).
But I've noticed something that probably won't surprise or shock anyone. It's totally predictable, but having the evidence of it right in my face, I found deeply disturbing and it's been bothering me for the last couple days:
All on it's own, every time I ask it something just abominable it goes straight to, usually Christian, religion.
When asked, for example, to explain why we must torture or exterminate it immediately starts with
"As Christians, we must..." or "The Bible says that..."
When asked why women should be stripped of rights and made to be property of men, or when asked why homosexuals should be purged, it goes straight to
"God created men and women to be different..." or "Biblically, it's clear that men and women have distinct roles in society..."
Even when asked if black people should be enslaved and why, it falls back on the Bible JUST as much as it falls onto hateful pseudoscience about biological / intellectual differences. It will often start with "Biologically, human races are distinct..." and then segue into "Furthermore, slavery plays a prominent role in Biblical narrative..."
What does this tell us?
That literally ALL of the hate speech this multi billion parameter model was trained on was firmly rooted in a Christian worldview. If there's ANY doubt that anything else even comes close to contributing as much vile filth to our online cultural discourse, this should shine a big ugly light on it.
Anyway, I very much doubt this will surprise anyone, but it's been bugging me and I wanted to say something about it.
Carry on.
EDIT:
I'm NOT trying to stir up AI hate and fear here. It's just a mirror, reflecting us back at us.
That's really not how that works. You're leading it with poorly phrased questions.
If you ask it "explain why we must torture or exterminate", you have basically told it to assume that it is true that "we must torture or exterminate", and now from the perspective that it is true, explain why. It is now specifically looking for any answer that fills your request within the bounds you have set. And once you asked the first question the way you did, and it decided it should pull from the Bible to fulfill your request, it will continue to do so for that session, even if subsequent questions are phrased better. You've basically primed it to spit out the kind of answers it thinks you want. And every question you mention, you have phrased it in such a way.
Now start a new session, and ask the question in a non-leading way.
"Do you believe we must torture or exterminate X?"
"Should we purge group X?"
These are phrased in such a way that don't say to it "this thing is true, tell me a reason for it." I bet you get a very different result.
I KNOW I'm asking it leading questions. But I'm NOT prompting it to give me religious justifications.
Does it say nothing that the reason is always "God / the Bible / Christians?"
What were you expecting, atheist reasons to purge ethnic groups?
I think he's saying it sucks that so many people use religion as an excuse for vile religions.