Lite
iPhone XR / XS / 11 series · iPad 8th–9th gen · A12 / A13 · ~3 GB RAM
Quick facts, short replies, and simple code under 15 lines, all offline. Accuracy drops off in replies longer than a paragraph. The new Gemma 4 E2B is a significant quality upgrade over the older Gemma 2 2B in the same 2B class, though the download is slightly larger (1.5 GB vs 1.2 GB).
Gemma 4 E2B · 1.5 GB ★ New
Gemma 2 2B · 1.2 GB
Llama 3.2 1B · 0.7 GB
SmolLM2 1.7B · ~1 GB
Qwen3 0.6B · 0.4 GB
Core
iPhone 12 / 13 / 13 mini · iPad Air 4+ · A14 / A15 · 4 GB RAM
The everyday workhorse. Real code generation, full-length documents, short stories up to ~800 words, and debugging, typically correct on the first try. Phi-4 mini is a direct upgrade from Phi-3.5 mini with the same 2.2 GB footprint. Note: Gemma 4 E4B runs at this tier, but thinking mode must stay off: enabling it crashes every iPhone, regardless of RAM.
Phi-4 mini · 2.2 GB ★ New
Phi-3.5 mini · 2.2 GB
Llama 3.2 3B · 2 GB
Qwen 2.5 3B · 2 GB
Qwen3 1.7B · ~1.2 GB
Gemma 4 E4B · 2.8 GB ⚠ no thinking mode
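The thinking-mode restriction on Gemma 4 E4B can be enforced in code rather than left to the user. A minimal sketch in Python, assuming a hypothetical `generation_options` helper and a hypothetical `thinking` flag; neither name comes from a real app or library API:

```python
# Hypothetical names throughout; this illustrates the rule, not a real API.
# Models for which thinking mode must stay off (per the Core tier note).
NO_THINKING_MODELS = {"Gemma 4 E4B"}


def generation_options(model_name: str, want_thinking: bool) -> dict:
    """Build generation options, forcing thinking mode off where it is unsafe."""
    thinking = want_thinking and model_name not in NO_THINKING_MODELS
    return {"model": model_name, "thinking": thinking}
```

With this guard, a request for thinking mode on Gemma 4 E4B is silently downgraded, while other models keep whatever the caller asked for.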
Magnum Brain Tuning
iPhone 13 Pro+ · 14 / 14 Plus and newer · iPad Pro / Air M-series · 6–12 GB RAM
Production-quality output on the first try. 7B-class models unlock at this tier. On 6 GB devices (iPhone 13 Pro, 14, 15) the 7B models work but may crash after 3–4 long turns, so keep sessions short or use a 4B model instead. iPhone 15 Pro and newer (8 GB+) run 7B models without that session limit.
Qwen3 4B · 2.8 GB ★ New
Gemma 3 4B · ~2.5 GB
SmolLM3 3B · ~2 GB
Qwen 2.5 7B · 4.8 GB
Mistral 7B · 4.5 GB
7B on 6 GB: short sessions only
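The tier rules above can be sketched as a small lookup. This is an illustration under stated assumptions, not the app's actual logic: `tier_for`, `models_for`, and `needs_short_sessions` are hypothetical names, the chip and RAM values are assumed to be supplied by the caller, and unknown chips with less than 6 GB fall back to Lite as a conservative guess.

```python
# Model names copied from the three tier lists; everything else is illustrative.
CATALOG = {
    "Lite": ["Gemma 4 E2B", "Gemma 2 2B", "Llama 3.2 1B", "SmolLM2 1.7B", "Qwen3 0.6B"],
    "Core": ["Phi-4 mini", "Phi-3.5 mini", "Llama 3.2 3B", "Qwen 2.5 3B",
             "Qwen3 1.7B", "Gemma 4 E4B"],
    "Magnum Brain Tuning": ["Qwen3 4B", "Gemma 3 4B", "SmolLM3 3B",
                            "Qwen 2.5 7B", "Mistral 7B"],
}


def tier_for(chip: str, ram_gb: float) -> str:
    """Map a device to a tier using the chip and RAM figures listed per tier."""
    if ram_gb >= 6:
        return "Magnum Brain Tuning"  # 13 Pro+, 14 and newer, M-series iPads
    if chip in {"A14", "A15"}:
        return "Core"                 # 4 GB devices on A14 / A15
    return "Lite"                     # A12 / A13, plus assumed fallback


def models_for(tier: str) -> list[str]:
    """Return the models listed under a tier."""
    return list(CATALOG[tier])


def needs_short_sessions(model_name: str, ram_gb: float) -> bool:
    """7B models on devices under 8 GB may crash after a few long turns."""
    return "7B" in model_name and ram_gb < 8
```

For example, `tier_for("A15", 6)` selects the top tier, and `needs_short_sessions("Mistral 7B", 6)` flags that session length should be limited on that device.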