this post was submitted on 22 Sep 2023
109 points (92.9% liked)

Linux

47969 readers
1016 users here now

From Wikipedia, the free encyclopedia

Linux is a family of open source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991 by Linus Torvalds. Linux is typically packaged in a Linux distribution (or distro for short).

Distributions include the Linux kernel and supporting system software and libraries, many of which are provided by the GNU Project. Many Linux distributions use the word "Linux" in their name, but the Free Software Foundation uses the name GNU/Linux to emphasize the importance of GNU software, causing some controversy.

Rules

Related Communities

Community icon by Alpár-Etele Méder, licensed under CC BY 3.0

founded 5 years ago
MODERATORS
 

There was a post asking people their opinions about Edge and many people seemed to liked the idea of Edge and seemed to be ok having it on Linux (Blasphemy)

Also, can we all agree how fast Edge went from joke to a threat? I mean, it's good now alright! It was good back then, but it's better now. Money man!!! Money! Personally I hate MS, but I can't help but see the fact that there is no alternative to Bing GPT and many features Bing offers on Linux.

If there is an open source ChatGPT how would it look? Who would bear the costs? How would we solve the server problem? i.e., it would take a ton of server space and bandwidth. Just wondering.

I am pretty sure MS products will improve greatly due to their integration with GPT what do us poor folks on Linux do?

Just want to know the answers, I don't want to discuss (aka can't comment, I need to study), but just curious!

all 36 comments
sorted by: hot top controversial new old
[–] [email protected] 32 points 1 year ago* (last edited 1 year ago) (2 children)

what do us poor folks on Linux do?

Run llama.cpp and any of the models listed here, that stuff has been around for months.

TheBloke has a lot of models converted to GGUF format which you need for llama.cpp.

Quick Start Guide (requires Nix, otherwise compile llama.cpp manually):

$ GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/TheBloke/guanaco-7B-GGUF
$ cd guanaco-7B-GGUF
$ git lfs pull --include=Guanaco-7B.Q4_0.gguf
$ nix run github:ggerganov/llama.cpp -- -m Guanaco-7B.Q4_0.gguf --instruct
> Write haiku about a penguin
 A penguin walks on ice,
 Takes a plunge in the sea,
 Hides his feet from me!
[–] [email protected] 4 points 1 year ago

a package manager that can pull, build, and run from git with one command is pretty neat

[–] [email protected] 1 points 1 year ago (1 children)

I ran it on my pc with a gtx 1070 with cuda enabled and compiled with the cuda compile hint but it ran really slowly how do you get it to run fast?

[–] [email protected] 1 points 1 year ago (1 children)

To make use of GPU acceleration you have to compile it with the proper support (CUDA, OpenCL, ROCM) and add --gpu-layers 16 (or a larger number, however much your VRAM can handle). If that's not enough, than the GPU/CPU is probably to slow.

You can try a smaller model, those run faster, but give worse results.

[–] [email protected] 1 points 1 year ago

Thanks I might try that out later.

[–] [email protected] 30 points 1 year ago* (last edited 1 year ago)

KoboldAI has an "KoboldGPT" mode, which is for answering questions. You can try it at https://lite.koboldai.net/ or run it on your own.

Here's an example conversation I just had with it (emoji icons added):


🤖 Hello, I am KoboldGPT, your personal AI assistant. What would you like to know?

🧑 What is the name of the original version of the game Monopoly?

🤖 The original version of the game Monopoly is called "The Landlord's Game." It was created by Elizabeth Magie in the early 20th century. However, the most well-known version of Monopoly today is based on a variant called "Monopoly" that was designed by Charles Darrow in 1935.

🧑 Did "The Landlord's Game" have any purpose besides being a board game?

🤖 Yes, "The Landlord's Game" had a specific purpose beyond being a board game. Elizabeth Magie created it as a way to illustrate the negative aspects of concentrating land in private monopolies. She believed that it could be used as a tool for teaching economic concepts such as rent, taxes, and the distribution of wealth.

🧑 What economic ideology was associated with this game?

🤖 Elizabeth Magie was influenced by Henry George's philosophy of Georgism. Georgists believe that land should belong to all people equally, and that taxation should be based on the value of land rather than on income or production.

[–] [email protected] 22 points 1 year ago
[–] [email protected] 21 points 1 year ago* (last edited 1 year ago)

As others said you can host Koboldai on your own, and if you don't have a powerful GPU, anyone can use powerful llm models via the AI horde

[–] [email protected] 20 points 1 year ago

i don't know a single thing about whatever these things your talking about are, but i just want to say, you're the first person i've ever seen suggest that edge isn't a joke.

[–] [email protected] 19 points 1 year ago

There are quite a few of them out there. Just a few from my notes (in reverse chronological order)

The self-hosted AI/ML system has been here for a while. Granted, the vast majority of them require downloading pre-trained models before they can be used due to how much it costs to build a system of weights from scratch.

I sometimes wonder if it would be possible to build and train a truly open source model with BOINC or something. The last 30 years of history show that it's entirely feasible to build a massively distributed computing cluster, why not leverage this to build a model? I know how naive that sounds immediately after writing it, mostly because of the difficulty of getting a large enough training data set, which unfortunately has risk written all over it (read: people poisoning the model, ala Microsoft's experiment with Tay on birbsite some years back).

[–] [email protected] 13 points 1 year ago* (last edited 1 year ago) (1 children)

petals.dev It's basically bittorrent for AI. A public swarm for running and training LLMs.

[–] [email protected] 3 points 1 year ago

Wao didn't know this one, impressive thx for sharing !

[–] [email protected] 13 points 1 year ago (1 children)
[–] [email protected] 2 points 1 year ago

That's just garden-variety intelligence, nothing artificial about it.

[–] [email protected] 9 points 1 year ago

OpenAssistant

[–] [email protected] 9 points 1 year ago

I think it's far more likely that easier solutions to hosting your own LLM/ChatGPT/etc will be pushed, rather than someone (or a group of someones) offering and maintaining it - as you mentioned, that takes a lot of resources. Think about the issues that various Fediverse software has had in terms of keeping up with the load sometimes, and LLMs use way more than most Fediverse applications. Especially if you want it to be as snappy as the current ones already out there.

[–] [email protected] 9 points 1 year ago

oogabooga is an option for self-hosted LLMs

https://github.com/oobabooga/text-generation-webui

[–] [email protected] 7 points 1 year ago* (last edited 1 year ago)

Chatbot arena ranks LLM, including some truly open source ones (apache license)

Regarding UI, maybe look here?

[–] [email protected] 5 points 1 year ago

For images there are foss options that you can run locally, like stable diffusion which is so good that it rivals its proprietary counterparts.
But for text it's a horror, there are some you can try (see gpt4all), but in general chatgpt has no real competition; the foss options are currently very bad, and even the proprietary options from big corporations like bard or llama are pitiful.

but I can’t help but see the fact that there is no alternative to Bing GPT

Regarding this, there are other services that also use GPT for search, here a couple:

https://www.perplexity.ai/

https://www.phind.com/

[–] [email protected] 5 points 1 year ago* (last edited 1 year ago) (2 children)

Maybe something like this? https://huggingface.co/ https://huggingface.co/chat/

The model seems to be based on the Llama model (but not only, there seem to be multiple models). Tho I didn't really look into it on how they get their database. It also has more hallucinations than chatgpt.

[–] [email protected] 2 points 1 year ago

Huggingchat is pretty good. And the model is Open source. No need for a phone number in your account and not by a company whose name is hypocritical even.

[–] [email protected] 1 points 1 year ago (1 children)

"What's the date?"

I'm not able to provide you with the current date as I am a text-based AI language model and do not have access to real-time information. However, I can tell you the date based on my knowledge cutoff, which is December 2022. Please note that this information may not be up-to-date or accurate. Is there anything else I can help you with?

[–] [email protected] 3 points 1 year ago* (last edited 1 year ago) (1 children)

You won't get the date.

The date mentioned in the answer may also be an halucination from the bot, depending on the dataset.

And an answer to Bing chat / bard is pretty hard to do if something like this even exists, as the bot has to be able to search the Web.

[–] [email protected] 1 points 1 year ago (1 children)

Interesting. I was able to pull the date from https://www.perplexity.ai however

[–] [email protected] 2 points 1 year ago

Because it's a search engine using gpt3. Nothing special/Foss about it. Or at least I don't see any Foss sources.

[–] [email protected] 4 points 1 year ago (1 children)

I heard about LocalAI, but I never tried it.

[–] [email protected] 3 points 1 year ago

I've tried it, very practical. It's a drop in replacement for OpenAI APIs, so you can work with other tools that use their models and API as back end.

[–] [email protected] 1 points 1 year ago (1 children)

At work today, I just heard the term, "Federated Machine Learning" - where individual corporations keep their proprietary learning data, but make the models shareable. I wonder if something similar could be done with activitypub?

(Feel free to steal this idea if it's worth stealing)

[–] [email protected] 6 points 1 year ago

This doesn't work when an important part of the process is making sure your data is actually good. If the data is proprietary, there is no way to make sure it is usable.

[–] [email protected] -2 points 1 year ago (2 children)

I installed Edge on Linux because it's fun watching Linux people get angry about it

[–] [email protected] 4 points 1 year ago

You're a monster

[–] [email protected] 0 points 1 year ago

I admire your trolling power