IT-in-Git

Blog

Claude Fable 5 is Anthropic's first Mythos-class model available to everyone — and it's not really a chat model anymore. It's an agent runtime. Here's what that actually means for people building on it.

turbovec is a Rust-backed vector index with Python bindings that implements Google's TurboQuant algorithm — compressing 31 GB of float32 embeddings down to 4 GB while beating FAISS on search speed. No training, no warmup, no drama. Here's why it matters for anyone building RAG pipelines.

AI coding assistants are powerful but directionless — they do exactly what you ask, right up until they don't. OpenSpec is a spec-driven development framework that puts a planning layer between your intent and the generated code. This post breaks down what it is, how it works, where it actually helps, and where it still falls short.

BMAD (Breakthrough Method for Agile AI-Driven Development or Build More Architect Dreams) is an open-source framework that replaces chaotic AI-assisted coding with structured, role-based agent workflows. Here's what changed in v6, why the token savings are real, and whether it's worth adopting.

Microsoft is moving 4,000+ repos and thousands of developers from Azure DevOps to GitHub — not because Azure DevOps is dead, but because Copilot is where the AI action is. Here's how they're pulling it off without burning everything down.

JDK Flight Recorder has been quietly sitting in the JVM for years, mostly used as a last-resort "dump a recording and pray" tool. But combine it with the JFR Streaming API and modern AI tooling, and you get something actually useful: a runtime observability loop that can catch problems before your oncall does.

Google's Gemma 4 12B is a fully open-weights, encoder-free multimodal model that handles text, images, and audio — and fits inside 16GB of VRAM. Here's what makes the architecture interesting and whether it changes anything for local AI workflows.

Node.js is dropping to one major release per year starting with v27. No more odd/even chaos, no more short-lived releases nobody runs in production. Here's what actually changes and what it means for your upgrade cadence.

Shopify's GraphQL Cardinal swaps recursive depth-first execution for a breadth-first model and gets 15x faster field execution, 90% less memory, and 4+ seconds off P50 latency. Here's why the default traversal strategy was always the wrong one at scale — and what they did about it.

Anthropic dropped Claude Opus 4.8 and the reactions split predictably: half the community raving about benchmark supremacy, the other half staring at their API bills. Both camps are right. At $25 per million output tokens — and thinking tokens that count twice — you can burn through serious budget before you realize what's happening. One documented case: 62 million tokens in 24 hours, hitting a $2,500 monthly cap overnight. That's the model doing exactly what you asked. Whether that's a feature or a problem depends entirely on how you're using it.