Long-Context on danilchenko.dev

Long-Context on danilchenko.devhttps://danilchenko.dev/tags/long-context/Recent content in Long-Context on danilchenko.devHugoen-usSat, 11 Apr 2026 06:00:00 +0000TriAttention Compresses KV Cache 10.7x — How Trigonometry Fixed Long-Context Reasoninghttps://danilchenko.dev/posts/2026-04-11-triattention-kv-cache-compression-long-reasoning/Sat, 11 Apr 2026 06:00:00 +0000https://danilchenko.dev/posts/2026-04-11-triattention-kv-cache-compression-long-reasoning/TriAttention uses pre-RoPE vector concentration and trigonometric scoring to compress KV cache 10.7x while matching full attention accuracy on reasoning tasks.