this post was submitted on 11 Aug 2024
265 points (95.5% liked)

Technology

58303 readers
11 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Am I missing something? The article seems to suggest it works via hidden text characters. Has OpenAI never heard of pasting text into a utf8 notepad before?

you are viewing a single comment's thread
view the rest of the comments
[โ€“] [email protected] 1 points 3 months ago (1 children)

It wouldn't be surprising to me if they've had this implemented for awhile.

There's still some question about why their 3.5 model had an apparent sudden drop-off in quality about a year ago, and among the plausible explanations for it could be that they were fucking with their weights in order to watermark the outputs in exactly the way you're mentioning. They were also fighting against prompt-injection methods and censor disapproved uses at the time, so who the fuck knows.

[โ€“] [email protected] 2 points 3 months ago

This doesn't touch the weights at all, it's just a change to the sampler.

What lobotomizes their models is cost cutting and trying to make them "safe," or at least thats what I suspect.