Skip to content

Model Picker

ModelBest forStart here ifAvoid it if
E2BPhones and low-memory devicesYour main goal is “make it run locally today”You expect frontier-level depth
E4BBetter phones, tablets, light desktopsYou want better quality than E2B without a workstationYour device already struggles with small models
26B A4BMacs and stronger desktopsYou have real memory headroom and want the best tradeoffYou are memory constrained
31BWorkstationsQuality matters most and your hardware is seriousYou want fast setup or broad device compatibility
  • Offline chat, translation, note drafting, quick Q&A on mobile: E2B first, E4B if the device can handle it.
  • General desktop use and experimentation: E4B or 26B A4B.
  • Local research assistant, stronger writing, higher-quality reasoning: 26B A4B.
  • “I am okay trading convenience for quality”: 31B.
  • If you are not sure the model will fit, choose the smaller option.
  • If load time already feels painful, the next larger model is usually the wrong move.
  • If you need a dependable phone demo, choose stability over benchmark wins.

The most common failure pattern is:

  1. See a benchmark for 31B.
  2. Download 31B onto the wrong machine.
  3. Blame the runtime.

Usually the problem is not the runtime. It is the fit between model size and available memory.