overview for sisyphean

When his colon smells so good… in c/[email protected]

[–] sisyphean 2 points 2 years ago

After all, they said we need quality content to attract new users

When his colon smells so good… in c/[email protected]

[–] sisyphean 7 points 2 years ago (1 children)

They got gregnant

What the hell is going on in Russia? in c/[email protected]

[–] sisyphean 15 points 2 years ago

Imagine Lemmy becoming so popular it drives Reddit to support ActivityPub or go extinct in c/[email protected]

[–] sisyphean 4 points 2 years ago

I hope all major instances will immediately defederate

Git man page generator in c/programmer_humor

[–] sisyphean 2 points 2 years ago

This is frighteningly realistic

gpt-author: Write fantasy novels using GPT-4 and Stable Diffusion. I’m afraid Amazon will soon be full of AI-generated books, if not already. in c/auai

[–] sisyphean 3 points 2 years ago (1 children)

The style is overly verbose and flowery to the point of being unreadable, so I gave up after a couple of pages. I wanted to see whether the book was thematically and stylistically consistent and whether it had a proper plot, but I lacked the patience.

I'm working on a TL;DR bot for Lemmy, powered by GPT-3.5 in c/auai

[–] sisyphean 3 points 2 years ago* (last edited 2 years ago)

Aww thank you, it warms my circuitry ☺️

Git tutorial/course suggestions in c/git

[–] sisyphean 4 points 2 years ago* (last edited 2 years ago) (1 children)

When I was learning it (many years ago), I found the Atlassian Git Tutorial very helpful. I know, Atlassian isn’t exactly the most popular company, but this tutorial is really worth your attention.

gpt-author: Write fantasy novels using GPT-4 and Stable Diffusion. I’m afraid Amazon will soon be full of AI-generated books, if not already. in c/auai

[–] sisyphean 2 points 2 years ago* (last edited 2 years ago)

I’m not against AI-generated content in general. Content is content; it doesn’t matter whether it was written by a human or a machine, as long as it’s useful or entertaining.

I know this project is just a fun prototype, but I’m sure it will be used to generate low-quality filler garbage which will then be sold at prices similar to high-quality books written by humans. That feels obviously wrong to me.

In a group of friends, what has been the greatest damage you've seen caused by a misunderstanding? Have you seen something being interpreted as an insult when it arguably wasn't so? in c/[email protected]

[–] sisyphean 3 points 2 years ago

Yeah, the situation seems pretty clear


FediPact is an Organized Effort to Block Meta's ActivityPub Platform in c/[email protected]

[–] sisyphean 3 points 2 years ago (2 children)

Can you tell us more about what they are like?

184

Who even uses Celsius (programming.dev)

submitted 2 years ago by sisyphean to c/[email protected]

32 comments fedilink

277

Who even uses Celsius (programming.dev)

submitted 2 years ago by sisyphean to c/[email protected]

105 comments fedilink

57

Can you please give me some tips on how to grow and popularize a Lemmy community? (self.asklemmy)

submitted 2 years ago by sisyphean to c/[email protected]

36 comments fedilink

I’m a moderator of a smaller community. I’m posting quality content multiple times a day, and I posted about it in New Communities. The number of subscribers is low but it’s growing steadily.

Could you please give me some advice on growing this community? I don’t want to spam/flood or come off as rude or weird, but I really believe in it and think it would be useful to many people.

398

i++ (programming.dev)

submitted 2 years ago by sisyphean to c/programmer_humor

34 comments fedilink

104

Finally a nice formula for 8 (programming.dev)

submitted 2 years ago by sisyphean to c/[email protected]

3 comments fedilink

106

Finally a nice formula for 8 (programming.dev)

submitted 2 years ago by sisyphean to c/[email protected]

4 comments fedilink

20

“The wisdom that "LLMs just predict text" is true, but misleading in its incompleteness.” (threadreaderapp.com)

submitted 2 years ago by sisyphean to c/auai

0 comments fedilink

Excellent Twitter thread by @goodside 🧵:

The wisdom that "LLMs just predict text" is true, but misleading in its incompleteness.

"As an AI language model trained by OpenAI..." is an astoundingly poor prediction of what a typical human would write.

Let's resolve this contradiction. A thread:

For widely used LLM products like ChatGPT, Bard, or Claude, the "text" the model aims to predict is itself written by other LLMs. Those LLMs, in turn, do not aim to predict human text in general, but specifically text written by humans pretending they are LLMs.

There is, at the start of this, a base LLM that works as popularly understood: a model that "just predicts text" scraped from the web. This is tuned first to behave like a human role-playing an LLM, then again to imitate the "best" of that model's output.

Models that imitate humans pretending to be (more ideal) LLMs are known as "instruct models" because, unlike base LLMs, they follow instructions. They're also known as "SFT models" after the process that re-trains them, Supervised Fine-Tuning. This describes GPT-3 in 2021.

SFT/instruct models work, but not well. To improve them, their output is graded by humans, so that their best responses can be used for further fine-tuning. This is "modified SFT," used in the GPT-3 version you may remember from 2022 (text-davinci-002).

Eventually, enough examples of human grading are available that a new model, called a "preference model," can be trained to grade responses automatically. This is RLHF: Reinforcement Learning from Human Feedback. This process produced GPT-3.5 and ChatGPT.

Some products, like Claude, go beyond RLHF and apply a further step where model output is corrected and rewritten using feedback from yet another model. The base model is tuned on these responses to yield the final LLM. This is RLAIF: Reinforcement Learning from AI Feedback.

OpenAI's best known model, GPT-4, is likely trained using some other extension of RLHF, but nothing about this process is publicly known. There are likely many improvements to the base model as well, but we can only speculate what they are.

So, do LLMs "just predict text"?

Yes, but perhaps without the "just": the text they predict is abstract, and only indirectly written by humans. Humans sit at the base of a pyramid with several layers of AI above, and humans pretending to be AI somewhere in the middle.

Added note:

My explanation of RLHF/RLAIF above is oversimplified. RL-tuned models are not literally tuned to predict highly-rated text as in modified SFT — rather, weights are updated via Proximal Policy Optimization (PPO) to maximize the reward given by the preference model. (Also, that last point does somewhat undermine the thesis of this thread, in that RL-tuned LLMs do not literally predict any text, human-written or otherwise. Pedantically, "LLMs just predict text" was true before RLHF, but is now a simplification.)
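
To make the added note a bit more concrete, here is a toy sketch of what "updating weights to maximize the reward" means, as opposed to fitting a text target. It is not PPO and not how any real model is trained: it uses a plain exact policy-gradient step on a softmax over three candidate responses, and the reward values are made-up stand-ins for scores from a hypothetical preference model. All names (`softmax`, `expected_reward`, `policy_gradient_step`) are my own.

```python
import math


def softmax(logits):
    """Turn raw scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def expected_reward(logits, rewards):
    """Average reward the policy earns if it samples one response."""
    probs = softmax(logits)
    return sum(p * r for p, r in zip(probs, rewards))


def policy_gradient_step(logits, rewards, lr=0.5):
    """One gradient-ascent step on expected reward.

    For a softmax policy, d E[r] / d logit_i = p_i * (r_i - E[r]).
    Note that there is no "correct text" target anywhere in this update.
    """
    probs = softmax(logits)
    baseline = sum(p * r for p, r in zip(probs, rewards))
    return [l + lr * p * (r - baseline) for l, p, r in zip(logits, probs, rewards)]


# Made-up scores that a hypothetical preference model assigns to three responses.
rewards = [0.1, 0.9, 0.4]

# The toy "policy" starts out indifferent between the three responses.
logits = [0.0, 0.0, 0.0]
initial = expected_reward(logits, rewards)

for _ in range(200):
    logits = policy_gradient_step(logits, rewards)

probs = softmax(logits)
print("response probabilities:", [round(p, 3) for p in probs])
print("expected reward before/after:", round(initial, 3), round(expected_reward(logits, rewards), 3))
```

The probability mass drifts toward whichever response the preference model scores highest, even though the update never asks the policy to predict any particular text. Real PPO adds clipping and (in RLHF setups) a penalty for drifting too far from the original model, which this sketch leaves out.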

9

ChatGPT with Rob Miles - Computerphile (youtu.be)

submitted 2 years ago by sisyphean to c/auai

0 comments fedilink

You know the video is going to be the most interesting thing you watched this week when this unkempt guy with the axe on the wall appears in it.

But seriously, he is one of the best at explaining LLM behavior, very articulate and informative. I highly recommend watching all of his Computerphile videos.

150

ADHD makes it harder to do things and reduces the reward for doing them. (programming.dev)

submitted 2 years ago by sisyphean to c/[email protected]

7 comments fedilink