Gemma 4 actually running usably on an Android phone (not llama.cpp)

I wanted a real local assistant on my phone, not a demo.

First I tried the usual llama.cpp in Termux: Gemma 4 managed only 2–3 tok/s and the phone was on fire. Then I switched to Google’s LiteRT setup, got Gemma 4 running smoothly, and wired it into an agent stack running in Termux.

Now one Android phone is:

  • running the LLM locally
  • automating its own apps via ADB
  • staying offline if I want
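
For anyone wondering what the ADB part can look like, here is a minimal Python sketch. It is not the actual code from this setup. It assumes the model replies with a small JSON action (the schema with "action", "x", "y", "text", "code" and "package" is made up for illustration) and that `adb` in Termux is already connected to the phone itself. The only device commands used are the standard `input tap`, `input text`, `input keyevent` and `monkey -p` shell commands.

```python
import json
import subprocess

# Assumption: `adb` is on PATH in Termux and already connected to this phone.
ADB_SHELL = ["adb", "shell"]


def action_to_argv(action: dict) -> list:
    """Translate one (hypothetical) JSON action into an adb command line."""
    kind = action["action"]
    if kind == "tap":
        return ADB_SHELL + ["input", "tap", str(int(action["x"])), str(int(action["y"]))]
    if kind == "text":
        # `input text` expects spaces encoded as %s; other special
        # characters are not handled in this sketch.
        return ADB_SHELL + ["input", "text", action["text"].replace(" ", "%s")]
    if kind == "key":
        return ADB_SHELL + ["input", "keyevent", action["code"]]
    if kind == "launch":
        # Starts the app's launcher activity via the monkey tool.
        return ADB_SHELL + [
            "monkey", "-p", action["package"],
            "-c", "android.intent.category.LAUNCHER", "1",
        ]
    raise ValueError(f"unsupported action: {kind!r}")


def run_model_reply(reply: str, dry_run: bool = True) -> list:
    """Parse the model's JSON reply and (optionally) execute it over ADB."""
    argv = action_to_argv(json.loads(reply))
    if not dry_run:
        subprocess.run(argv, check=True, timeout=10)
    return argv


if __name__ == "__main__":
    # Dry run: only prints the command that would be sent to the device.
    print(run_model_reply('{"action": "tap", "x": 540, "y": 1200}'))
```

Keeping the translation step separate from execution means the model's output can be checked (or rejected) before anything touches the device, and unknown actions fail loudly instead of being passed to the shell.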

Happy to share details + code and hear what else you’d build on top of this.

https://preview.redd.it/7vkbrlzfryvg1.jpg?width=3024&format=pjpg&auto=webp&s=25455827ddf9715b4159ce64a18deba812cf0f5f

submitted by /u/GeeekyMD