2. gpt-5.3-codex-medium for genrally most of the other coding tasks.
3. deepseek-v4-flash for heavy agentic research/loops (non-coding related).
4. mimo-v2.5-pro for crunching/summarizing large texts.
5. gemini-3.1-flash-lite for image understanding.
6. opus-4.7 very occasionally when gpt-5.5 fails, or vice-versa, and sonnet-4.6 when codex-5.3 fails.
7. deepseek-v4-pro when I need to do a long agentic session, and want higher quality, for cheap (non-coding).
8. perplexity/pplx-embed-v1-0.6b for embeddings, via openrouter.
At work I use 4.6 because we don't have 4.7 yet...zzz...
I also do a LOT of personal/portfolio work with self-hosted models.
Ministral-3-14B-Reasoning for validating concepts, MVPs, etc and some prod systems (punches above its weight class). Qwen3.6-35B-A3B for self-hosted coding (custom harness). GPT-OSS-120B for self-hosted coding or more reasoning-intensive agentic flows. Qwen3.5-122B-A10B currently in evals for agentic coding.
fyi: I didn't have much lock on Deepseek v4 pro, with opencode + openrouter it's incredibly slow. How did op did it?