this post was submitted on 07 Aug 2023
57 points (96.7% liked)

Lemmy.ca Support / Questions

496 readers
1 users here now

Support / Questions specific to lemmy.ca.

For support / questions related to the lemmy software itself, go to [email protected]

founded 4 years ago
MODERATORS
 

Right now, robots.txt on lemmy.ca is configured this way

User-Agent: *
  Disallow: /login
  Disallow: /login_reset
  Disallow: /settings
  Disallow: /create_community
  Disallow: /create_post
  Disallow: /create_private_message
  Disallow: /inbox
  Disallow: /setup
  Disallow: /admin
  Disallow: /password_change
  Disallow: /search/
  Disallow: /modlog

Would it be a good idea privacy-wise to deny GPTBot from scrapping content from the server?

User-agent: GPTBot
Disallow: /

Thanks!

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 21 points 1 year ago

Yes, please.

We can't stop LLM developers from scraping our conversations if they're determined to do so, but we can at least make our wishes clear. If they respect our wishes, then great. If they don't, then they'll be unable to plead ignorance, and our signpost in the road (along with those from other instances) might influence legislation as it's drafted in the coming years.