# Automating Kubernetes Observability: Scaling Your Metrics with Dynamic Discovery
Let’s say you have a Kubernetes cluster with Prometheus and multiple workloads running on it. You want to monitor the health of both the cluster and the workloads.
Picture this: I’m sitting in a room packed with the infrastructure team, the vendor, and our developers. Tension is high. We had just gone through a platform bridge change that caused IPs to cycle.…
There is a stark difference between working at a company that builds technology and a company that merely uses it. In organizations where technology is not the core business, IT and engineering…
Once upon a time, my engineering team was stuck in a vicious cycle.
I recently dealt with an incident that took a development environment down for an entire week.
We have all seen it happen. An API endpoint gets slow. The database CPU starts spiking during peak hours. The immediate, knee-jerk reaction from the dev team?