Engineering Reality Checks

Platform development musings, AI tool reviews, and the occasional rant about why everything is broken. Written by someone who's debugging this stuff daily.

All Posts

Everything else worth reading, in reverse chronological order because that's how time works.

Agent Frameworks Comparison

Agent Frameworks: A Brutally Honest Comparison

Every week there's a new agent framework. Every week someone on Twitter declares it "the future of AI development." Here's the state of play in early 2026, stripped of the marketing: LangGraph, CrewAI, AutoGen, Claude, OpenAI Assistants API — which should you actually use?

Read more →
RAG System State 2026

RAG in 2026: Still Duct-Taping PDFs to Chatbots

Everyone said 2025 was the year RAG would "just work." Spoiler: it didn't. The Retrieval-Augmented Generation landscape looks like a building site after the scaffolding fell down. Here's what actually changed, what didn't, and where it's going next.

Read more →
LLM Landscape Analysis

The LLM Landscape in 2026: Who's Actually Winning

I've been testing every major LLM in production for the past six months, running them through real workloads, real codebases, and real edge cases that make marketing demos cry. The results might surprise you — spoiler alert: it's not who the tech Twitter hivemind thinks it is.

Read more →
RAG Implementation

RAG Isn't Dead, You're Just Doing It Wrong

Everyone's declaring RAG obsolete in favour of massive context windows, but they're missing the point entirely. Here's why RAG still matters, how to actually implement it properly, and when those shiny new 2M context windows will actually hurt more than help.

Read more →
AI Tools Worth Paying For

AI Tools Actually Worth Paying For in 2026

I've burned through more AI tool subscriptions than I care to admit, so you don't have to. Here's what's actually worth your money, what's just repackaged OpenAI with a fancy UI, and why most "AI-powered" development tools are solving problems nobody asked for.

Read more →
AI Agents in Development

I Let AI Agents Run My Dev Workflow for a Month

What happens when you let AI agents handle your deployment pipeline, code reviews, and incident response? Spoiler: it's not the productivity paradise the demos promise, but it's not complete chaos either. Here's what actually worked and what nearly brought down production.

Read more →