Privacy

33250 readers

487 users here now

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

Posting a link to a website containing tracking isn't great, if contents of the website are behind a paywall maybe copy them into the post
Don't promote proprietary software
Try to keep things on topic
If you have a question, please try searching for previous discussions, maybe it has already been answered
Reposts are fine, but should have at least a couple of weeks in between so that the post can reach a new audience
Be nice :)

Related communities

much thanks to @gary_host_laptop for the logo design :)

founded 5 years ago

MODERATORS

[email protected]

How to run LLaMA (and other LLMs) on Android. (lemmy.dbzer0.com)

submitted 2 days ago* (last edited 2 days ago) by [email protected] to c/[email protected]

18 comments fedilink hide all child comments

cross-posted from: https://lemmy.dbzer0.com/post/36841328

Hello, everyone! I wanted to share my experience of successfully running LLaMA on an Android device. The model that performed the best for me was llama3.2:1b on a mid-range phone with around 8 GB of RAM. I was also able to get it up and running on a lower-end phone with 4 GB RAM. However, I also tested several other models that worked quite well, including qwen2.5:0.5b , qwen2.5:1.5b , qwen2.5:3b , smallthinker , tinyllama , deepseek-r1:1.5b , and gemma2:2b. I hope this helps anyone looking to experiment with these models on mobile devices!

Step 1: Install Termux

Download and install Termux from the Google Play Store or F-Droid

Step 2: Set Up proot-distro and Install Debian
Open Termux and update the package list:
pkg update && pkg upgrade
Install proot-distro
pkg install proot-distro
Install Debian using proot-distro:
proot-distro install debian
Log in to the Debian environment:
proot-distro login debian
You will need to log-in every time you want to run Ollama. You will need to repeat this step and all the steps below every time you want to run a model (excluding step 3 and the first half of step 4).

Step 3: Install Dependencies
Update the package list in Debian:
apt update && apt upgrade
Install curl:
apt install curl

Step 4: Install Ollama
Run the following command to download and install Ollama:
curl -fsSL https://ollama.com/install.sh | sh
Start the Ollama server:
ollama serve &
After you run this command, do ctrl + c and the server will continue to run in the background.

Step 5: Download and run the Llama3.2:1B Model
Use the following command to download the Llama3.2:1B model:
ollama run llama3.2:1b
This step fetches and runs the lightweight 1-billion-parameter version of the Llama 3.2 model .

Running LLaMA and other similar models on Android devices is definitely achievable, even with mid-range hardware. The performance varies depending on the model size and your device's specifications, but with some experimentation, you can find a setup that works well for your needs. I’ll make sure to keep this post updated if there are any new developments or additional tips that could help improve the experience. If you have any questions or suggestions, feel free to share them below!

– llama

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 2 points 1 day ago

The biggest problem:

I don't have enough RAM/GPU to run it on a server

But it looks interesting