Infrastructure health is a prerequisite for service health, but it is not the same thing. We can do better than kernel counters and log scrapers. In this talk, I’ll discuss application-level telemetry best practices and the tools that turn this data into a automated management dream.