Gemma 4 on Your Phone? Google AI Edge Gallery
A short phone-first walkthrough showing AI Edge Gallery, offline Gemma chat, and the practical mobile path people can try first.
Practical setup guides for Android phones, iPhone and iPad, Macs, Windows laptops, and local runtimes like AI Edge Gallery, LM Studio, Ollama, and llama.cpp. No benchmark theater. Just what works.
Best for phones
E2B / E4B
Start here when your goal is offline chat, translation, notes, or on-device workflows.
Best value
26B A4B
Usually the sweet spot when you have enough RAM and want better answers without maxing out hardware.
Heavy setup
31B
Great when quality matters more than convenience and your local hardware is serious.
Seen on X
These are the X posts we point people to when they ask whether mobile Gemma 4 installs are real. We prioritize first-person demos, attached video, and install notes over launch-day hype.
KellyV shared a hands-on iPhone clip showing Gemma 4 E4B and E2B running locally, fully offline.
A launch demo showing Gemma 4 E2B reasoning and using imported skills on-device, with the Android app available from the Play Store.
Install Google AI Edge Gallery, grab the ~3 GB Gemma 4 model, and run it fully offline — Chinese supported, no API fees, no data leaves the device.
Collected from X posts about on-phone Gemma 4 installs and demos as of Apr 5, 2026. This section is intentionally biased toward clips, live demos, and hands-on install notes.
On YouTube
This block is for people who would rather watch a real builder explain Gemma 4 before opening docs. We bias toward creators who show a mobile path, teach local setup clearly, or give a sharp first-pass overview.
Try it now
If you want to feel the model before you set up anything locally, use this embedded demo first. Then come back to the device guides when you are ready to run Gemma 4 offline on your own hardware.
Device guides
This site is organized around real devices first, then runtimes. That is the fastest way to get Gemma 4 running locally without wasting downloads and setup time.
Android phones
Use AI Edge Gallery first. Then fall back to PocketPal or llama.cpp-style apps when you need more control.
Open guide →
iPhone and iPad
Find out what works today, what is still gated, and which Gemma 4 size is realistic on Apple mobile hardware.
Open guide →
Mac
Compare MLX, LM Studio, Ollama, and llama.cpp based on unified memory, speed, and ease of setup.
Open guide →
Windows
Pick the right route for RTX cards, CPU-only boxes, and laptops with limited VRAM.
Open guide →
Runtimes
Model picker
E2B
Phones, thin laptops, small RAM budgets
Start here when the device is your hard limit.
E4B
Better phones, tablets, lightweight desktops
The default recommendation when you want noticeably better quality without jumping to workstation hardware.
26B A4B
Macs with healthy unified memory, higher-end desktops
Often the best quality/speed tradeoff when you can afford more RAM.
31B
Workstations and serious local setups
Use it when quality matters more than convenience and your hardware can take the hit.
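The tiers above map roughly to memory budgets. As a back-of-envelope check before downloading, a 4-bit quantized model needs about half a byte per parameter for weights, plus runtime overhead for the KV cache and buffers. A minimal sketch of that arithmetic — the parameter counts and the 20% overhead factor are illustrative assumptions, not official Gemma 4 figures:

```python
# Rough fit check: estimated memory for a quantized model.
# Sizes and the overhead factor below are assumptions for
# illustration, not official Gemma 4 numbers.

def est_gb(params_b: float, bits: int = 4, overhead: float = 1.2) -> float:
    """Weights take params * bits/8 bytes; add ~20% for the
    KV cache and runtime buffers (rough rule of thumb)."""
    return params_b * (bits / 8) * overhead

MODELS = {"E2B": 2, "E4B": 4, "26B A4B": 26, "31B": 31}

for name, size_b in MODELS.items():
    print(f"{name}: ~{est_gb(size_b):.1f} GB at 4-bit")
```

If the estimate is close to your total RAM (or VRAM), step down a tier or a quantization level: the OS, the app, and your context window all need headroom too.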
Search-driven articles
These blog posts target high-intent demand around model size, Android and iPhone support, runtime choice, and memory failures.
7 min read
Choose the right Gemma 4 model size for Android, iPhone, Mac, and Windows based on RAM, VRAM, speed, and real local usability.
Read article
8 min read
If Gemma 4 will not load or crashes with out-of-memory errors, use these practical fixes for VRAM, RAM, context length, and runtime settings.
Read article
7 min read
Want Gemma 4 on Android? This guide covers the easiest working path, realistic model sizes, and what to expect from AI Edge Gallery and other Android options.
Read article
Why this site exists
Official pages tell you what Gemma 4 is. Reddit tells you what broke. This site sits in the middle: device-by-device paths to run Gemma 4 locally, model-fit guidance, and the fastest known fixes for the failures most people hit when setting up Gemma 4 for the first time.
Best first click
New to local AI? Start at Getting Started.
Most useful troubleshooting page
If the model does not fit, open Out of Memory before changing runtimes blindly.