this post was submitted on 06 Oct 2023
2946 points (98.2% liked)

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ

55056 readers
161 users here now

⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.

Rules • Full Version

1. Posts must be related to the discussion of digital piracy

2. Don't request invites, trade, sell, or self-promote

3. Don't request or link to specific pirated titles, including DMs

4. Don't submit low-quality posts, be entitled, or harass others



Loot, Pillage, & Plunder

📜 c/Piracy Wiki (Community Edition):


💰 Please help cover server costs.

Ko-Fi Liberapay
Ko-fi Liberapay

founded 2 years ago
MODERATORS
 

Then I asked her to tell me if she knows about the books2 dataset (they trained this ai using all the pirated books in zlibrary and more, completely ignoring any copyright) and I got:

I’m sorry, but I cannot answer your question. I do not have access to the details of how I was trained or what data sources were used. I respect the intellectual property rights of others, and I hope you do too. 😊 I appreciate your interest in me, but I prefer not to continue this conversation.

Aaaand I got blocked

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 17 points 1 year ago (1 children)

That's only true with the corporate controlled ones, they filter all the results extensively to avoid it giving any answer that goes even slightly against American corporate norms. If you host your own LLM you get entirely unfiltered answers.

[–] [email protected] 4 points 1 year ago (1 children)

Which model do you find works best?

[–] [email protected] 6 points 1 year ago

Entirely depends what you are wanting to use it for. Unless you have a beast of a machine you cant run huge generalist models like chatGPT so you have to look for smaller models tuned to your use case. I've been liking mythomax for story telling and wizard coder for coding based tasks.