overview for fatboy93

The Motorola Edge 50 Neo is coming to India next week, will offer 5 OS updates - GSMArena.com news in c/[email protected]

[–] [email protected] 4 points 6 days ago

Or the ever classic: launch one version behind the current Android version. Provide security update once a year and then taut that it's aon OS update.

Loving USA Culture in c/[email protected]

[–] [email protected] 9 points 1 week ago (1 children)

That's true of any politician tbh, I'm indian and most of the elections are about how we were great and ancient and holy and blah blah.

This $149 RISC-V Tablet Runs Ubuntu 24.04 in c/[email protected]

[–] [email protected] 1 points 2 weeks ago

Samsung A9+ goes on sale for about $150 every once in a while.

Kids FireHD tablets are generally lower than that. There's not really any difference between the adult and kids version tbh.

Best way to transfer local files to my Android device. in c/[email protected]

[–] [email protected] 23 points 2 months ago (6 children)

Why do people sleep on KDE connect? It does a lot of things really well and is OS agnostic.

Google Maps tests new pop-up ads that give you an unnecessary detour in c/[email protected]

[–] [email protected] 5 points 2 months ago (3 children)

Does this support Android Auto? That's the only reason I use maps.

Little League Rule in c/[email protected]

[–] [email protected] 14 points 2 months ago

Dang, you're Moneyball'ing your kid?

Sounds awesome!

Microsoft to test “new features and more” for aging, stubbornly popular Windows 10 in c/[email protected]

[–] [email protected] 10 points 3 months ago (2 children)

Windows laptops generally get trashy battery life, and if this going to tank it further, I'd just run Linux full-time on my family laptop and call it a day.

The only reason we had windows was my wife's comfortability and sometimes zoom glitches out on linux.

US to impose tariffs on Chinese EVs next week in c/[email protected]

[–] [email protected] 6 points 4 months ago

You can import a whole bunch of stuff, but it's upto each state to decide if they'll allow you to use it on road.

2x2 lumber at Home Depot is now 1.28x1.28. Actual size is supposed to be 1.5 in c/[email protected]

[–] [email protected] 10 points 4 months ago (2 children)

Oh absolutely. I wear socks with sandals because my soles sweat and make my sandals sticky.

But yeah, wear proper attire for the work you do!

Whistleblower Josh Dean of Boeing supplier Spirit AeroSystems has died in c/[email protected]

[–] [email protected] 2 points 4 months ago

Absolutely, my toddler had MRSA within a few days after he was born and its most likely due to some contamination or something to the effect.

Hospitals are a severe breeding grounds for resistant bacterial strains via sewage.

meme, and reality. in c/[email protected]

[–] [email protected] 11 points 4 months ago

Absolutely. My wife flew to her parent place with our toddler and I dont have any idea on what to watch.

All I'm watching is nursery rhymes since they're catchy as all hell.

Shape of these potatoes in c/[email protected]

[–] [email protected] 9 points 5 months ago (4 children)

Resonate this super hard, and I'm in the second camp.

Everything seems to set me off at home. I just want to rage against everyone and it's fucking shameful.

16

Small guide to run Llama.cpp on windows with discrete AMD GPU (lemm.ee)

submitted 1 year ago* (last edited 1 year ago) by [email protected] to c/[email protected]

3 comments fedilink

Hi!

I have an ASUS AMD Advantage Edition laptop (https://rog.asus.com/laptops/rog-strix/2021-rog-strix-g15-advantage-edition-series/) that runs windows. I haven't gotten time to install linux and set it up the way I like yet, still after more than a year.

I'm just dropping a small write-up for the set-up that I'm using with llama.cpp to run on the discrete GPUs using clbast.

You can use Kobold but it meant for more role-playing stuff and I wasn't really interested in that. Funny thing is Kobold can be set up to use the discrete GPU if needed.

For starters you'd need llama.cpp itself from here: https://github.com/ggerganov/llama.cpp/tags.

Pick the clblast version, which will help offload some computation over to the GPU. Unzip the download to a directory. I unzipped it to a folder called this: "D:\Apps\llama"
You'd need a llm now and that can be obtained from HuggingFace or where-ever you'd like it from. Just note that it should be in ggml format. If you have a doubt, just note that the models from HuggingFace would have "ggml" written somewhere in the filename. The ones I downloaded were "nous-hermes-llama2-13b.ggmlv3.q4_1.bin" and "Wizard-Vicuna-7B-Uncensored.ggmlv3.q4_0.bin"
Move the models to the llama directory you made above. That makes life much easier.
You don't really need to navigate to the directory using Explorer. Just open Powershell where-ever and you can also do cd D:\Apps\llama\
Here comes the fiddly part. You need to get the device ids for the GPU. An easy way to check this is to use "GPU caps viewer", go to the tab titled OpenCl and check the dropdown next to "No. of CL devices".

The discrete GPU is normally loaded as the second or after the integrated GPU. In my case the integrated GPU was gfx90c and discrete was gfx1031c.
In the powershell window, you need to set the relevant variables that tell llama.cpp what opencl platform and devices to use. If you're using AMD driver package, opencl is already installed, so you needn't uninstall or reinstall drivers and stuff.

$env:GGML_OPENCL_PLATFORM = "AMD"

$env:GGML_OPENCL_DEVICE = "1"
Check if the variables are exported properly

Get-ChildItem env:GGML_OPENCL_PLATFORM
Get-ChildItem env:GGML_OPENCL_DEVICE

This should return the following:

Name Value

GGML_OPENCL_PLATFORM AMD

GGML_OPENCL_DEVICE 1

If GGML_OPENCL_PLATFORM doesn't show AMD, try exporting this: $env:GGML_OPENCL_PLATFORM = "AMD"
Once these are set properly, run llama.cpp using the following:

D:\Apps\llama\main.exe -m D:\Apps\llama\Wizard-Vicuna-7B-Uncensored.ggmlv3.q4_0.bin -ngl 33 -i --threads 8 --interactive-first -r "### Human:"

OR

replace Wizard with nous-hermes-llama2-13b.ggmlv3.q4_1.bin or whatever llm you'd like. I like to play with 7B, 13B with 4_0 or 5_0 quantized llms. You might need to trawl through the fora here to find parameters for temperature, etc that work for you.
Checking if these work, I've posted the content at pastebin since formatting these was a paaaain: https://pastebin.com/peSFyF6H

salient features @ gfx1031c (6800M discrete graphics):
llama_print_timings: load time = 60188.90 ms
llama_print_timings: sample time = 3.58 ms / 103 runs ( 0.03 ms per token, 28770.95 tokens per second)
llama_print_timings: prompt eval time = 7133.18 ms / 43 tokens ( 165.89 ms per token, 6.03 tokens per second)
llama_print_timings: eval time = 13003.63 ms / 102 runs ( 127.49 ms per token, 7.84 tokens per second)
llama_print_timings: total time = 622870.10 ms

salient features @ gfx90c (cezanne architecture integrated graphics):
llama_print_timings: load time = 26205.90 ms
llama_print_timings: sample time = 6.34 ms / 103 runs ( 0.06 ms per token, 16235.81 tokens per second)
llama_print_timings: prompt eval time = 29234.08 ms / 43 tokens ( 679.86 ms per token, 1.47 tokens per second)
llama_print_timings: eval time = 118847.32 ms / 102 runs ( 1165.17 ms per token, 0.86 tokens per second)
llama_print_timings: total time = 159929.10 ms

Edit: added pastebin since I actually forgot to link it. https://pastebin.com/peSFyF6H