Writing

Essays on agentic systems in production, written from the inside.

The honest default is no
Four systems that do nothing alike share their most important line of code, the one that runs when the system has no idea, and in all four it refuses.
Jun 2026

Inducing the schema instead of supplying it
The unsolved part of document AI is discovering the columns, not filling them, and a column should only exist if it pays for itself in bits.
Jun 2026

I built it one validation too early
The abstractions I have had to delete were not wrong, they were premature, built to answer a question the work had not yet asked.
Jun 2026

A good average hides the column that matters
A synthetic dataset that scores well on average can still be useless and unsafe, so the only scores worth trusting gate on the worst part and fail closed.
Jun 2026

Point, don't drag
Drag-and-drop is the wrong handle for steering a model that builds your interface, because the thing you grab and the thing it wrote stopped being one-to-one the moment a list appeared.
Jun 2026

The harness remembers what the agent forgets
The coding agent starts every build with no memory of the last one, so the system that runs it got better by writing down where the agent kept failing.
Jun 2026

Letting an agent merge to main
Autonomy is not something the model has. It is the set of gates you are willing to put around it, and an audit trail of everything it did inside them.
Jun 2026

Watching an agent work
The interesting part of an agent is not that it acts, it is the shared room where you can see what it sees, point at things, and take the controls back.
Jun 2026

A turn is not a request
An agent's unit of work is a long-lived process, and the moment you treat it like an HTTP request you lose the run every time a laptop sleeps.
Jun 2026

Managing engineers who never sleep
A year of running AI coding agents turned into an accidental management apparatus, written one rule at a time.
Jun 2026

What it takes to let agents touch enterprise data
Trust in agentic systems is an architecture property, not a model property.
Jun 2026

Your gold set is lying to you
Reference answers are artifacts with bugs, and they are the only software in the stack that nobody code-reviews.
May 2026

An answer you cannot audit is worth nothing
Provenance is a data-model property, not a UI feature, and it cannot be retrofitted.
May 2026

Authorization for answers
We have learned to secure what agents do. The harder question is what they are allowed to know.
Apr 2026

Evaluating systems that answer in sentences
Evaluation infrastructure is the difference between a demo and a product.
Apr 2026

LLM spend is an attribution problem
A month of anonymous model spend taught me that cost governance for AI is identity infrastructure, and the discipline that fixes it already has a name.
Apr 2026

Data infrastructure and AI infrastructure are different disciplines
They optimize for different things, fail in different ways, and the hard problems live at the seam.
Mar 2026

Why your knowledge graph isn't helping your RAG
We nearly published the fashionable verdict that GraphRAG is hype. The audit found something better, the structural reasons the published architectures break on enterprise data.
Mar 2026

The physical world does not retry idempotently
What burger-making robots taught me about agents acting on production data.
Mar 2026

The model designs, the code enforces
In an LLM ingestion pipeline, the model gets the judgment that requires reading the document and nothing else. The surprise is how badly the prompt wants to violate that in both directions.
Feb 2026

Database lessons for the agent era
Agent platforms are rediscovering, one incident at a time, what database engines settled decades ago.
Feb 2026

Tables are images
Meaning in a table lives in its geometry, and the architecture that survives a real corpus is a per-table router, not an ideology.
Jan 2026

What regulation does to architecture
Years of healthcare engineering taught me that compliance, taken seriously, is a design input that produces better systems.
Jan 2026

When documents become databases
Schema discovery is a corpus-statistics problem before it is a modeling problem.
Dec 2025

Free like a puppy, times fifteen
A product assembled from open source is not the sum of its components. It is the resolution of their disagreements.
Nov 2025

Forcing the race
Race conditions are not hard to test. They are untested, because teams accept probabilistic reproduction.
Oct 2025