this post was submitted on 19 Jun 2023
85 points (97.8% liked)

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ

54627 readers
413 users here now

⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.

Rules • Full Version

1. Posts must be related to the discussion of digital piracy

2. Don't request invites, trade, sell, or self-promote

3. Don't request or link to specific pirated titles, including DMs

4. Don't submit low-quality posts, be entitled, or harass others



Loot, Pillage, & Plunder

📜 c/Piracy Wiki (Community Edition):


💰 Please help cover server costs.

Ko-Fi Liberapay
Ko-fi Liberapay

founded 1 year ago
MODERATORS
 

(Feel free to remove this as off-topic, but this relates to the post about the r/Piracy poll regarding what content will be permitted upon reopening. The body of this post wouldn't get the same reach as a comment on that post.)

Ahoy hearties! Here what I be thinkin'. Reddit be chargin' tens of millions of doubloons for third-mates to access the API, aye? They be claimin' to deserve a share of the booty for providin' trainin' data for AI (and obviously to kill competition with third-mate apps to boot).

Methinks if yee MUST chatter with those landlubbers (such as for the purpose of recruitin' new mates or cussing out mutinous scabs), then yee ought to make any text data yee provide unappealing and unusable to potential AI-training-customers.

Paintings of (Sexy) Captain John Oliver will only sully the attention of the human users. But (pirate) coded language mayhaps be an obstruction for bots? For those who find pirate speak to be too much effort, an alternative be to speak "sdrawkcaB".

I can no longer cast my bottled messages to Reddit's shore, so any of you seadogs are free to pass it along.

top 22 comments
sorted by: hot top controversial new old
[–] [email protected] 18 points 1 year ago (1 children)

moreover, if machine learning is occurring on reddit now, it'd be absolutely hilarious if it developed a wee bit o' the speech impediment.

[–] [email protected] 14 points 1 year ago (1 children)

Either matey or uwufication sound like fun malicious compliance.

[–] [email protected] 7 points 1 year ago (1 children)

Uwuify thuwwwwungs. Dsmvwl thngs. Jubmle up splleing so thet ist still reedabel to poeple.

Jmp btwene thuwum in de sme puwust.

Make your data unclassifiable.

[–] [email protected] 6 points 1 year ago (1 children)

Yar, this one right here, Captain. Make them walk the plank! /j

[–] [email protected] 2 points 1 year ago (1 children)
[–] [email protected] 2 points 1 year ago

Naw matey, we were agreein' wi ye! Am I doing teh jumbel thing corretlcy?

[–] [email protected] 7 points 1 year ago (1 children)

If ye find writin' in Corsair speak too difficult, probably maybe also fer non native speakers, then ye can use online tools t' convert yer text fer ye!

https://pirate-speech-translator.netlify.app/
https://pirate.monkeyness.com/translate
https://funtranslations.com/pirate
https://lingojam.com/PirateSpeak

etc.

[–] [email protected] 3 points 1 year ago

The insult generator on monkeyness is hilarious

https://pirate.monkeyness.com/insult

[–] [email protected] 7 points 1 year ago (1 children)

Yarr, ye should also use homoglyph attacks, mayhaps with this confangled tool https://onlinetools.com/unicode/spoof-unicode-text

The scurvy dogs will have a mighty hard time parsing the heavy sea o' randomized data obfuscation

[–] [email protected] 5 points 1 year ago (1 children)

Any way to integrate this with shreddit to replace and save comments instead of deleting?

[–] [email protected] 2 points 1 year ago

Shouldn't be too hard to implement https://github.com/picatz/homoglyphr in the shreddit code, I don't know how it compares to the web tool I linked but it should be enough to make data cleaning a real pain in the arse for reddit

[–] [email protected] 6 points 1 year ago* (last edited 1 year ago)

Maybe also a couple of key phrases?

That one thing has certainly 'worked', FWIW as of now.

Be creative! What if AI got trained to always answer with subtle innuendo... the thought makes me all shivery.

[–] [email protected] 6 points 1 year ago* (last edited 1 year ago) (1 children)

Well, me matey, ChatGPT be speakin' like a seadog already. All ye must do is ask of it to be speakin' this way, and it will. Arrgh. Ye could write a fancy userscript to interface wit ChatGPT and be speakin' like a seadog without an ounce of effort!

[–] [email protected] 4 points 1 year ago* (last edited 1 year ago) (1 children)

True, but it might consider it "regular talk". I don't really believe that (realistically) a handful of users speaking pirate would taint chat GPT. It's more-so for anyone who wants to (temporarily) contribute to the discussion about how their favorite protesting subreddits should maliciously comply with forced reopening. I personally feel that I would not like Reddit to sell access to my comments to be used for AI training, so if I hadn't already deleted my accounts, I would taint them knowing that I'm not providing Reddit anything of value.

Ideally users would leave Reddit ASAP, but in the interim, while promoting the fediverse alternatives on Reddit, I think coded language would be the most consistent with the maliciously complaint John Oliver pictures posted to various forced-to-open subreddits.

Edit: I might have misunderstood what you were trying to say. I thought you meant that pirate speak would still be useful for training AI models. My bad! I blame the pirate speak!

[–] [email protected] 4 points 1 year ago (1 children)

I was saying that: A) ChatGPT is already fluent in good-enough pirate speak, and B) It would be possible to have ChatGPT convert modern English speech into said good-enough pirate speak using a userscript. Even if it didn't affect their data collection or AI models trained on the text, it would still be distracting and annoying for users, which might push some people away from Reddit.

[–] [email protected] 3 points 1 year ago (1 children)

That makes sense, I didn't even think about that aspect of it. Obviously users seeing posts of (Sexy Captain) John Oliver will know that those users are protesting, but seeing pirate-speak comments in non-protesting posts would also cause users to be reminded of the protest (if pirate-speak caught on large-scale and was associated with the protest).

I was thinking more small-scale individual-level and not the large-scale that (Sexy Captain) John Oliver posts have become. It would be nice if it caught on to a large scale like you suggest. My biggest gripe with these protest discussions being on Reddit is that Reddit is still benefiting from that activity and this would be my way of mitigating that.

[–] [email protected] 3 points 1 year ago

Honestly, anything to fuck with Reddit is a W in my book.

[–] [email protected] 5 points 1 year ago

Yarr matey, I already done mutiny on that foul vessel, else I'd take part in showing these scurvy dogs what fer

[–] [email protected] 5 points 1 year ago

Just fill the sub with magnet links to torrents of pics of john oliver

[–] [email protected] 3 points 1 year ago* (last edited 1 year ago)

atwhay ifyay allyay o'yay usyay alktay inyay orsaircay igpay atinlay

Translation: What if we all speak in pirate pig latin

[–] [email protected] 2 points 1 year ago (1 children)

What about those who use screenreaders? It would be unfair to them

[–] [email protected] 2 points 1 year ago

Ultimately, the goal of the protest should be to get as many users off of Reddit as possible.

It's all about harm reduction (or maximization, in this case) and minimizing the amount of traffic and useful data to Reddit. There are going to be situations where giving screenreader users the information about Lemmy/kbin will transition users off of Reddit. In that case, the amount of users leaving Reddit probably outweighs the cost of the minuscule amount data provided to Reddit in the couple of comments it takes to advertise transitioning to Lemmy/kbin to such users.

It's up to the individual to make that evaluation for themselves. If you want to propose a Lemmy/kbin alternative to Redditors on r/screenreader, then yeah, probably don't use encoded text.

load more comments
view more: next ›