reviews
GPT-5.5 Review After Seven Weeks: Where It Beats Claude and Where It Doesn't
GPT-5.5 hits 82.7% on Terminal-Bench and uses 72% fewer tokens than Claude — but loses SWE-Bench Pro to Opus 4.7. Seven weeks of real agentic use, reviewed.
reviews
GPT-5.5 hits 82.7% on Terminal-Bench and uses 72% fewer tokens than Claude — but loses SWE-Bench Pro to Opus 4.7. Seven weeks of real agentic use, reviewed.
Claude Fable 5 hits 80.3% SWE-bench Pro and 29.3% FrontierCode Diamond. It also costs 2x Opus 4.8, retains your data 30 days, and silently falls back.
Gemini 3.5 Flash vs Claude Haiku 4.5 vs MAI-Code-1-Flash — SWE-bench scores, token costs, and which flash model actually writes better code in 2026.
9 deep dives into the 2026 tech job market: layoffs, AI displacement data, salary benchmarks from Cyprus to the US, and the productivity paradox. Full guide.
Set up Claude Code, Gemini CLI, or Codex in JetBrains and Zed via ACP — the open protocol that decouples AI coding agents from editors. Step-by-step configs.
Long-form posts in your inbox roughly once a week — research breakdowns, tutorials, comparisons, the occasional review. No tracking pixels, no growth-hacked subject lines.
Or grab the RSS feed — same posts, no email needed.
I'm Maksim. By day I lead an engineering team at inDrive. After hours I ship side projects (PageBloom, NotesPilot, MyDevKit, startgaze) and write things up here when I learn something worth keeping.
The blog itself runs on an agentic publishing pipeline I built and rebuilt — a slow-moving experiment in how much of a writer's workflow can be automated without losing the voice. It writes, fact-checks, and refreshes; I edit, decide, and publish.