AI Letters #09 - Ship It Without Breaking Things: Evals, Observability, and Production-Safe Agents
The vendor API renamed a field. The agent kept running. No error. No alert. Six days later, someone noticed the revenue reports were wrong. The agent had been silently writing null to a field that used to be called something else.
No crash. No exception. Just quiet, compounding damage.
