Blog

Claude Fable 5: The first Mythos-class stuff

10. 6. 2026

Claude Fable 5 is Anthropic's first Mythos-class model available to everyone — and it's not really a chat model anymore. It's an agent runtime. Here's what that actually means for people building on it.

#Anthropic

#Claude

turbovec: The Vector Index That Skips the Boring Parts and Still Beats FAISS

10. 6. 2026

turbovec is a Rust-backed vector index with Python bindings that implements Google's TurboQuant algorithm — compressing 31 GB of float32 embeddings down to 4 GB while beating FAISS on search speed. No training, no warmup, no drama. Here's why it matters for anyone building RAG pipelines.

#Google

#Architecture

Stop Vibe-Coding Into the Void: How OpenSpec Brings Discipline to AI-Driven Development

9. 6. 2026

AI coding assistants are powerful but directionless — they do exactly what you ask, right up until they don't. OpenSpec is a spec-driven development framework that puts a planning layer between your intent and the generated code. This post breaks down what it is, how it works, where it actually helps, and where it still falls short.

#ChatGPT

#Architecture

#Claude

Stop Vibe Coding Your Way Into Technical Debt — Try BMAD Instead

8. 6. 2026

BMAD (Breakthrough Method for Agile AI-Driven Development or Build More Architect Dreams) is an open-source framework that replaces chaotic AI-assisted coding with structured, role-based agent workflows. Here's what changed in v6, why the token savings are real, and whether it's worth adopting.

How Microsoft Is Moving 4,000 Repos to GitHub (And What You Can Learn From It)

5. 6. 2026

Microsoft is moving 4,000+ repos and thousands of developers from Azure DevOps to GitHub — not because Azure DevOps is dead, but because Copilot is where the AI action is. Here's how they're pulling it off without burning everything down.

JFR + AI: Stop Guessing What Your JVM Is Doing

4. 6. 2026

JDK Flight Recorder has been quietly sitting in the JVM for years, mostly used as a last-resort "dump a recording and pray" tool. But combine it with the JFR Streaming API and modern AI tooling, and you get something actually useful: a runtime observability loop that can catch problems before your oncall does.

Gemma 4 12B Runs Locally, Handles Audio and Video, and Actually Fits in 16GB

4. 6. 2026

Google's Gemma 4 12B is a fully open-weights, encoder-free multimodal model that handles text, images, and audio — and fits inside 16GB of VRAM. Here's what makes the architecture interesting and whether it changes anything for local AI workflows.

#Gemma

#Google

Code & Architecture

Node.js Is Going to One Release Per Year — and Honestly, It's About Time

3. 6. 2026

Node.js is dropping to one major release per year starting with v27. No more odd/even chaos, no more short-lived releases nobody runs in production. Here's what actually changes and what it means for your upgrade cadence.

#Node.js

Code & Architecture

Shopify Rewrote Their GraphQL Engine and Got 15x Faster. The Fix Was Embarrassingly Obvious in Hindsight.

3. 6. 2026

Shopify's GraphQL Cardinal swaps recursive depth-first execution for a breadth-first model and gets 15x faster field execution, 90% less memory, and 4+ seconds off P50 latency. Here's why the default traversal strategy was always the wrong one at scale — and what they did about it.

#GraphQL

#Architecture

Claude Opus 4.8: An Honest Developer's Take (It's Complicated)

3. 6. 2026

Anthropic dropped Claude Opus 4.8 and the reactions split predictably: half the community raving about benchmark supremacy, the other half staring at their API bills. Both camps are right. At $25 per million output tokens — and thinking tokens that count twice — you can burn through serious budget before you realize what's happening. One documented case: 62 million tokens in 24 hours, hitting a $2,500 monthly cap overnight. That's the model doing exactly what you asked. Whether that's a feature or a problem depends entirely on how you're using it.

#Anthropic

#Claude