Articles

All Writing

Everything I've written — deep dives, project breakdowns, and the thinking behind the work.

/12 min read

The Forge: Knowledge That Compounds

I got tired of re-teaching AI agents the same lessons every session. So I built a system where every correction, pattern, and hard-won fix compounds instead of evaporating.

toolingarchitectureAI agentsknowledge graphs
Read article
/6 min read

The Prompt Is Not the Product

After 25 years of shipping products, I can tell you: the prompt is the easy part. The hard part is everything you build around it when it inevitably fails.

promptsarchitecturelessons
Read article
/7 min read

Evals Are Unit Tests for Vibes

You can't assert on 'sounds helpful but not condescending.' After thousands of evals, here's what actually works — and the trap that wastes everyone's time.

evalstestingquality
Read article
/7 min read

Context Windows Are a Lie (Sort Of)

128k tokens is a marketing number. The usable window is a fraction of that — and I learned the hard way that how you fill it matters more than how much you have.

contextarchitectureRAG
Read article
/7 min read

Ship the Feature Without the AI First

Counterintuitive? Maybe. But after 25 years of shipping products, I know that the non-AI version teaches you everything the AI version needs to know.

productarchitecturestrategy
Read article
/7 min read

The Temperature Misconception

Most developers treat temperature as a creativity dial. Twenty-five years of debugging has taught me to distrust any parameter I can't fully explain — and temperature is a perfect example.

modelsparametersdebugging
Read article
/10 min read

RAG Is Not a Search Engine

Every tutorial makes RAG look like three steps. After building retrieval pipelines that actually work in production, I can tell you it's more like thirty.

RAGarchitectureretrieval
Read article
/7 min read

Why Fraud Detection Must Ask Questions, Not Give Answers

I built a fraud detection system that replaces confidence scores with structured doubt. Here's why single-number risk scores are the most dangerous thing in compliance — and the architecture that fixes it.

AI ethicsarchitecturefraud detectionproduct design
Read article