Monitoring only sucks when the cost of maintenance scales proportionally with the size of the system being monitored. Recently, tools like Riemann and Prometheus have emerged that address this problem by letting monitoring configuration grow sub-linearly with the size of the system. In this talk, we'll discuss the concepts of timeseries-based alerting and give practical examples that can be employed in your environment today.
Jamie's work passions are automation and monitoring. He leads a team responsible for Google's largest globally replicated eventually consistent highly available buzzword compliant key value store.