Thoughts on reliability, infrastructure, and production engineering.
Jan 24, 2026
Most teams assume production is "under control", until it suddenly isn’t. What if the real problem isn’t your tools, your cloud, or your engineers, but the fact that nobody truly owns how production behaves over time?
This article explores what Production Stability as a Service actually means, why it exists, and why boring production might be the most valuable outcome of all.
Read more →Jan 24, 2026
The Difference Between Running Software and Trusting It in Production
Most teams are running software in production, but very few actually trust it. The difference isn’t uptime, tools, or cloud providers; it’s how well teams understand failure, behavior, and risk. This article explores why running feels anxious, trusting feels calm, and how most teams quietly drift into the gap between the two.
Read more →