The BOSH Team Europe at SAP operates BOSH as a service for operators and the Cloud Foundry on-demand service brokers. Like every production service, the expectations to BOSH's performance and availability have increased dramatically.
Motivated by concrete issues in our production BOSH services, we have continuously worked on identifying the useful metrics and making them available to monitoring tools. No one likes to be paged and that is why it was a crucial task for us to improve the BOSH monitoring.
In this talk we share some of our operation war stories and the metrics which we identified to prevent these situations as well as some ways to expose those metrics to monitoring tools. Moreover, we will give an update on all the improvements implemented for BOSH to make a more sophisticated monitoring possible.
Beyhan is a software engineer in the BOSH team at SAP. He is a committer on the BOSH project in the last 4 year. He was working on different topics of the SAP Cloud Platform prior that.