what do us poor folks on Linux do?
Run llama.cpp and any of the models listed here, that stuff has been around for months.
TheBloke has a lot of models converted to GGUF format which you need for llama.cpp
.
Quick Start Guide (requires Nix, otherwise compile llama.cpp manually):
$ GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/TheBloke/guanaco-7B-GGUF
$ cd guanaco-7B-GGUF
$ git lfs pull --include=Guanaco-7B.Q4_0.gguf
$ nix run github:ggerganov/llama.cpp -- -m Guanaco-7B.Q4_0.gguf --instruct
> Write haiku about a penguin
A penguin walks on ice,
Takes a plunge in the sea,
Hides his feet from me!