Video Tolerates No Errors: Why 'Bare Metal' Hits Its Limits in Live Streaming
Compared to classic web applications, video is a completely different type of workload. While a web …

Monitoring data often has a short half-life: An alert pops up, the issue is resolved, and the alert disappears. However, for a managed hosting provider or a critical infrastructure operator, these data hold much more potential. They are the objective proof of the performance delivered.
The challenge lies in processing the enormous amounts of metrics produced by global endpoint monitoring every second in a way that is understandable to both technicians and customers. The solution is seamless integration into the existing observability stack using Prometheus and Grafana.
Without central integration, two separate worlds often emerge within a company:
Instead of operating endpoint monitoring as an isolated island, all results—from response time in milliseconds to TLS status—flow directly into the central time-series database (e.g., Prometheus or VictoriaMetrics).
Every check of the global PoPs is exported as a Prometheus metric. This has significant advantages:
Grafana is the window to the data. Here, we create different views for various target audiences:
The greatest operational leverage is the automation of reporting. Since the data is structured, reports can be generated at the push of a button or on a scheduled basis:
By freeing monitoring data from their silos and transforming them into professional dashboards and reports, technology becomes tangible for all involved. For the customer, it is the reassuring feeling that the promised quality is measurably maintained. For the provider, it is the efficient way to demonstrate professionalism without additional manual effort. Monitoring is ultimately not just a technical warning system but a central tool for customer retention.
Can we give the customer access to our Grafana? Yes, Grafana supports multi-tenancy. Customer accounts can be configured to see only the data of their own endpoints. This is a massive vote of confidence in one’s own service.
How do we handle maintenance windows in SLA reports? In Prometheus, maintenance times can be marked or excluded from calculations via specific metrics. This way, availability in the report is not distorted by planned work.
Is Prometheus suitable for long-term storage of SLA data? Prometheus itself is optimized for short- to medium-term data. For true SLA histories over years, connecting to a long-term storage like VictoriaMetrics or Thanos is recommended.
Can we also track error rates (Error Budgets)? Absolutely. In line with Google’s SRE principles, “Error Budgets” can be defined. The dashboard then shows not only if there is currently an issue but also how much “downtime” is left in the month before the SLA is violated.
Compared to classic web applications, video is a completely different type of workload. While a web …
In IT procurement, monitoring is often viewed as a commodity—a standard product that should cost as …
Europe is Working on Its Own Digital Payment Infrastructure The European payment landscape has long …