Nice try :)
This long, image-heavy article will pose a challenge to @AutoTLDR. Let’s see how it fares.
I hope it will be useful, thanks for testing it :)
AFAIK the way it works is that the more frequent a long sequence is, the more likely it is to get its own single token. I'm not sure whether English is tokenized most efficiently because they explicitly made the tokenizer prefer English, or whether it's just a result of the training corpus being mostly English text.
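You can see this directly with OpenAI's tiktoken library (a minimal sketch, not from the article; the word list is just an illustration, and exact token counts depend on which encoding you pick):

```python
import tiktoken

# cl100k_base is the encoding used by the GPT-3.5/GPT-4 chat models
enc = tiktoken.get_encoding("cl100k_base")

for text in ["the", "hello", "tokenization", "antidisestablishmentarianism"]:
    tokens = enc.encode(text)
    print(f"{text!r}: {len(tokens)} token(s) -> {tokens}")

# Very frequent English words typically come back as a single token,
# while longer or rarer strings get split into several sub-word pieces.
```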
What post? Where?
Here people actually react to what I post and write. And they react to the best possible interpretation of what I wrote, not the worst. And even if we disagree, we can still have a nice conversation.
Does anyone have a good theory about why the threadiverse is so much friendlier? Is it only because it's smaller? Is it because of the kind of people a new platform like this attracts? Because there is no karma? Maybe something else?
Did I miss something? Or is this still about Beehaw?
Here is an example of tokenization being biased toward English (using the author's Observable notebook):
This is the same sentence in English and in my native Hungarian. I understand that this comes down to how much text in each language was available in the training corpus, but it's still a bit annoying that using the API for Hungarian text is more expensive :)
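If you want to reproduce the comparison outside the Observable notebook, a rough sketch with tiktoken (my assumption, not the author's tool; the sentence pair below is just a placeholder, swap in your own English/Hungarian pair):

```python
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

# Placeholder sentence pair -- replace with any English sentence
# and its Hungarian translation.
english = "The cat is sleeping on the warm windowsill."
hungarian = "A macska a meleg ablakpárkányon alszik."

for label, text in [("English", english), ("Hungarian", hungarian)]:
    print(f"{label}: {len(enc.encode(text))} tokens")

# API usage is billed per token, so the language that needs more tokens
# for the same content ends up costing more.
```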
The best hacker is of course the one who can guess the password the fastest (all-lowercase, dictionary word).
Let’s try it this way:
https://rentry.co/evuft
(The bot probably couldn’t extract the text from that JS-heavy site you linked to.)