Hardware requirements

Match Gemma 4 to the hardware you actually have.

The most useful hardware question is not “what is the biggest model?” It is “what is the biggest model that still feels sane on this device?”

Hardware class
Start model
Best path
Watch for

Android phones and tablets

Android guide
E2B first, then E4B
AI Edge Gallery
CPU fallback, weak acceleration, and slower large-model tests.
E2B first
AI Edge Gallery
App-level support gaps and heavier models that load poorly.

16GB Apple silicon Mac

16GB Mac guide
E4B first, 26B A4B only if setup is disciplined
LM Studio first, llama.cpp later
Swap pressure, long-load pain, and context settings that erase headroom.

RTX desktop with 16GB VRAM

Memory guide
26B A4B as a tuned target
LM Studio baseline or llama.cpp
KV cache growth, vision overhead, and aggressive context length.

High-end desktop or workstation

Model picker
31B when convenience matters less than quality
LM Studio, llama.cpp, or an API-first stack
Over-downloading before you confirm the runtime and workload.

What moves the fit

Hardware fit is not just RAM or VRAM on paper.

Runtime overhead, context length, KV cache growth, and vision layers all change whether a setup stays usable.

That is why the same machine can feel fine in a short chat and bad in a long or tool-heavy session.

Next choices

Once the hardware class is clear, the next click should be specific.