Observability in Detail: VictoriaMetrics, VictoriaLogs, Grafana
TL;DR Observability is based on three pillars – metrics, logs, and traces – and is translated into a …
Diese Serie erklärt systematisch, wie moderne Software compliant entwickelt und betrieben wird – von EU-Regulierungen bis zur technischen Umsetzung.
Observability is more than “Monitoring 2.0.” It is the ability to understand the state of complex distributed systems from their behavior—even when specific error patterns were previously unknown. For regulated companies, this ability has become a core component of governance.
The three classic pillars:
Metrics
Time series like latencies, error rates, resource utilization. Ideal for trends, SLOs, and proactive alerting.
Logs
Event-oriented data—from API requests to authentication attempts to database errors. Basis for forensic analysis and audit trails.
Traces
Track individual transactions through microservices, queues, and databases. Indispensable for complex ICT incidents and performance bottlenecks.
Observability becomes regulatory relevant when structured processes emerge from this data:
Instead of viewing compliance as a burden, a change in perspective is worthwhile: A good observability stack reduces downtime, accelerates incident analysis, and creates exactly the transparency that laws demand today—and that is technically sensible anyway.
The NIS-2 Directive came into force on January 16, 2023. By October 17, 2024, EU member states must transpose it into national law; from October 18, 2024, the national implementations apply.
Key points related to observability:
The DORA Regulation (Digital Operational Resilience Act) also came into force on January 16, 2023, and applies directly to financial companies and certain service providers from January 17, 2025.
DORA requires, among other things:
Without comprehensive observability, this ICT incident management is practically unfeasible: You must be able to detect, classify, prioritize incidents, and later prove which measures were effective.
The General Data Protection Regulation (GDPR / GDPR) came into force on May 25, 2018. It requires, among other things:
This is where structured, immutable logs and long-term stable metrics come into play. A well-constructed observability stack is thus a central pillar of your GDPR compliance.
VictoriaMetrics is a high-performance, Prometheus-compatible time series database optimized for large data volumes and long retention periods. This is particularly relevant for regulated environments because incidents and trends often need to be analyzed over months or years.
Many organizations start with Prometheus—and rightly so. But as soon as:
A dedicated metric engine like VictoriaMetrics comes into play. It understands the Prometheus ecosystem (scraping, remote write, PromQL) and can therefore be integrated into existing instrumentation without adjusting application code.
Technically, VictoriaMetrics relies on:
For you, this translates into:
In the context of NIS-2 and DORA, this means: You can not only use SLOs and availability metrics operationally but also retrospectively demonstrate how your systems have developed—including peak loads, resource bottlenecks, and times when measures were effective.
While metrics represent the “vital signs” of your platform, logs provide the detail for audits and forensics. VictoriaLogs is a log database in the VictoriaMetrics ecosystem designed specifically for this requirement.
Unlike traditional file logs or simple log aggregators, VictoriaLogs relies on:
The effect: You can efficiently answer specific questions, such as:
For compliance, the content of logs is not the only important factor, but also their integrity. Tamper-proof logs are relevant in several dimensions:
VictoriaLogs supports this through an append-only design, clear separation of write and read paths, and integrity mechanisms at the segment level. In practice, many organizations combine this with:
Compared to alternatives like Loki, VictoriaLogs focuses more on high throughput, structured queries, and scalable long-term retention—exactly what you need for DORA-compliant ICT incident reports and comprehensive audit trails.
Data alone does not create resilience. What matters is how teams work with this data. This is where Grafana comes into play as a visualization and analysis frontend.
A typical dashboard in a VictoriaMetrics/VictoriaLogs stack could include:
This creates a workflow that directly supports regulatory requirements:
Even though this article focuses on VictoriaMetrics (metrics) and VictoriaLogs (logs), the third pillar—traces—should not be underestimated. Whether you use OpenTelemetry, Jaeger, or another tracing stack:
A well-thought-out observability stack therefore integrates all three perspectives into a unified operating concept.
A classic monitoring setup that only includes availability checks and a few host metrics will generally not be sufficient for NIS-2 or DORA-compliant environments.
Regulations require not only the detection of disruptions but also:
For this, you need:
Monitoring is one component—observability is the complete picture.
Many traditional log platforms have historically grown from file-based logging and scale only with significant effort in truly large data volumes. They often focus on full-text search, which is only partially ideal for compliance requirements.
VictoriaLogs, on the other hand, focuses on:
TL;DR Observability is based on three pillars – metrics, logs, and traces – and is translated into a …
TL;DR GitOps with ArgoCD anchors the desired state of your applications and infrastructure in Git, …
When running applications in production, you don’t need pretty dashboards, but hard data. …