codex-local
What 9B can actually do.
The reason I’m bullish on local models: codex-local — my fork of the Codex CLI, tuned to babysit a local model through a structured task list — drives multi-turn, multi-file, multi-task coding sessions to completion using just Qwen3.5-9B-IQ4_XS.
- 9Bparameters (Qwen3.5 IQ4_XS)
- 8+turns of sustained coherence
- 4+files edited per session
- FullTODO list completion end-to-end
The trick isn’t the model — it’s the scaffolding. Specifically:
- Docs-first. Every repo has an
AGENTS.mdand aTASK_STATE.md. The model reads them on every turn. Persistent intent survives compaction. - State in files, not in-context. Long runs survive crashes, context exhaustion, model swaps, OAuth expirations. The agent rehydrates from disk.
- Explicit task tracking. TODO list in a file, model crosses items off as it goes. No “where was I” questions.
- Verify before declare-done. No green-by-inference. Run the test, read the output, look at the deployed version.
Most local-model coding agents stall past trivial single-edit tasks. With those four disciplines, 9B handles real work. The frontier model is no longer the moat — the scaffolding is.
The fork itself adjusted compaction behavior, model-routing interception, and exec-log inspection so a long unattended run could survive overnight without losing the thread.