Running a local LLM on Android with Termux and llama.cpp
What I used:
- Samsung Galaxy S21 Ultra
- Termux
- llama-cpp-cli and llama-cpp-server
- Qwen3.5-0.8B with Q5_K_M quantization, from Hugging Face

I also tried Bonsai-8B-GGUF-1bit from Hugging Face. This is a newer model and required a different setup, which I…
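For reference, a minimal Termux setup along these lines might look like the following. This is a sketch, not my exact commands: the model path is a placeholder, and building from source is one option (Termux also ships a prebuilt llama.cpp package in some repos).

```shell
# Inside Termux: install a toolchain and build llama.cpp from source.
pkg update
pkg install -y git cmake clang

git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
cmake -B build
cmake --build build --config Release -j

# Run a one-off prompt against a GGUF model.
# The model path below is a placeholder; put your downloaded
# Q5_K_M .gguf file wherever Termux can read it.
./build/bin/llama-cli -m /sdcard/models/model-Q5_K_M.gguf \
    -p "Hello from my phone" -n 64

# Or serve an OpenAI-compatible HTTP API on localhost instead:
./build/bin/llama-server -m /sdcard/models/model-Q5_K_M.gguf --port 8080
```

Building with clang under Termux works because llama.cpp has no hard dependency on a desktop toolchain; the same CMake invocation used on Linux applies.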