If you learned your prod was down from a customer - you don’t have prod. You have an MVP.
You shipped a product. Customers are using it. That’s already something. But production without monitoring means your customer is your on-call team.
At DevOps interviews, candidates get asked: “what do you do when prod goes down?” The right answer: “that call shouldn’t happen.”
Not because DevOps is that good. Because the system should know about the problem before the customer does. Always.
What normal looks like:
CTO gets a message in the morning: “X went down overnight. 5 minutes downtime. Root cause identified. Fixed. Postmortem by end of day.”
Not a customer call. A team report.
If that’s not how it works at your company - the question isn’t for your DevOps. It’s for you: how did your customer find out first?
Let’s talk.
#DevOps #SRE #CloudInfrastructure #TechLeadership #ITAudit