The model should be split between VRAM and regular RAM, at least if it's a GGUF model. Maybe it isn't being split, and that's what's wrong?
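For what it's worth, here's a minimal sketch of what that split looks like, assuming a llama.cpp-based setup driven through llama-cpp-python (the model file name and layer count are placeholders, not your actual config):

```python
# Hedged sketch: assumes llama-cpp-python as the runtime.
# n_gpu_layers controls how many transformer layers get offloaded to VRAM;
# whatever doesn't fit stays in regular RAM.
from llama_cpp import Llama

llm = Llama(
    model_path="llama-3-70b-instruct.Q4_K_M.gguf",  # placeholder file name
    n_gpu_layers=40,  # offload as many layers as fit in VRAM; -1 attempts to offload all
    n_ctx=4096,       # context window size
)

result = llm("Q: Why is the sky blue? A:", max_tokens=64)
print(result["choices"][0]["text"])
```

If n_gpu_layers is 0 (or the build wasn't compiled with GPU support), everything runs on the CPU, which would explain the slow responses.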
Ok, so using my "older" 2070 Super, I was able to get a response from a 70B parameter model in 9-12 minutes. (Llama 3 in this case.)
I'm fairly certain you're either running on the CPU or hitting another issue. Would you like to try to debug your configuration together?
Unfortunately, I don't expect it to remain free forever.
No offense intended, but are you sure it's using your GPU? Twenty minutes is about how long my CPU-only instance takes to run some 70B parameter models.
On my RTX 3060, I generally get responses in seconds.
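One quick way to check, assuming an NVIDIA card with nvidia-smi on the PATH, is to poll GPU utilization and VRAM use while a response is generating. A small sketch:

```python
# Sanity check that inference is actually hitting the GPU.
# Run this while a generation is in progress; near-zero utilization and
# low memory use would suggest everything is running on the CPU.
import subprocess

out = subprocess.run(
    [
        "nvidia-smi",
        "--query-gpu=utilization.gpu,memory.used,memory.total",
        "--format=csv,noheader",
    ],
    capture_output=True,
    text=True,
    check=True,
)
print(out.stdout.strip())  # e.g. "87 %, 7630 MiB, 8192 MiB" during generation
```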
It's a W3C-managed standard, but there's plenty of behavior the specification doesn't spell out, and platforms can choose to impose their own rules there.
The standard doesn't impose a 500-character limit, but nothing in it says there can't be one.
Or maybe just let me focus on who I choose to follow? I'm not there for content discovery, though I know that's why most people are.
I was reflecting on this myself the other day. For all my criticisms of Zuckerberg/Meta (which are very valid), they really didn't have to release anything around LLaMA. They're practically the only reason we have viable open-source weights/models and an engine.
That's the funny thing about UI/UX - sometimes changing purely cosmetic colors can still hurt usability.
At some point you just lose productivity, and reduced work weeks have shown that productivity gains are possible.
My go-to solution for this is the Android FolderSync app with an SFTP connection.
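FolderSync itself is a GUI app, but the SFTP side of that sync boils down to something like the sketch below, using paramiko (the host, credentials, and paths are placeholders):

```python
# Hedged sketch of a one-file SFTP upload, roughly what a sync pass does.
import paramiko

client = paramiko.SSHClient()
client.set_missing_host_key_policy(paramiko.AutoAddPolicy())  # fine for a sketch; pin host keys in practice
client.connect("nas.local", username="sync", password="example")  # placeholder credentials

sftp = client.open_sftp()
sftp.put("/sdcard/DCIM/Camera/photo.jpg", "/srv/backups/photo.jpg")  # placeholder paths
sftp.close()
client.close()
```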
Good luck! I'm definitely willing to spend a few minutes offering advice or double-checking some configuration settings if things go awry again. Let me know how it goes. :-)