reviews
GLM-5.2 Review: 753B Open-Weight Model That Undercuts GPT-5.5
GLM-5.2 scores 62.1 on SWE-bench Pro vs GPT-5.5's 58.6, ships under MIT, and costs $1.40/M input tokens. Benchmarks, pricing, and the China data question.
reviews
GLM-5.2 scores 62.1 on SWE-bench Pro vs GPT-5.5's 58.6, ships under MIT, and costs $1.40/M input tokens. Benchmarks, pricing, and the China data question.
LiteLLM has been hit by RCE, SQL injection, privilege escalation, and a PyPI supply chain attack in 2026. Here's every CVE, who's affected, and how to fix it.
Rust vs Go compared with real benchmarks, salary data, and production use cases. Go for 80% of backends, Rust for the 20% where latency and memory matter.
Claude Fable 5 hits 80.3% SWE-bench Pro and 29.3% FrontierCode Diamond. It also costs 2x Opus 4.8, retains your data 30 days, and silently falls back.
Gemini 3.5 Flash vs Claude Haiku 4.5 vs MAI-Code-1-Flash — SWE-bench scores, token costs, and which flash model actually writes better code in 2026.
Long-form posts in your inbox roughly once a week — research breakdowns, tutorials, comparisons, the occasional review. No tracking pixels, no growth-hacked subject lines.
Or grab the RSS feed — same posts, no email needed.
I'm Maksim. By day I lead an engineering team at inDrive. After hours I ship side projects (PageBloom, NotesPilot, MyDevKit, startgaze) and write things up here when I learn something worth keeping.
The blog itself runs on an agentic publishing pipeline I built and rebuilt — a slow-moving experiment in how much of a writer's workflow can be automated without losing the voice. It writes, fact-checks, and refreshes; I edit, decide, and publish.