Cloud Cost Hygiene: Why Unused GPUs Are Draining Your Budget
In the realm of IT infrastructure, few things are as costly as a modern NVIDIA GPU doing nothing. …

When companies move their IT infrastructure to the cloud, they usually do so with a clear economic expectation: flexibility and full cost transparency. The principle of “Pay-as-you-go” is intended to transform unpredictable capital expenditures (CapEx) into predictable operational expenses (OpEx). However, the deeper companies are drawn into the ecosystems of the major US hyperscalers, the more complex and opaque the monthly billing becomes.
A prime example of this architectural and financial opacity can be found at the network boundary: the billing of load balancer capacities through artificial, combined metrics like the Load Balancer Capacity Unit (LCU). What initially sounds like a fair, usage-based model often turns out to be an unpredictable cost trap for growing medium-sized enterprises during peak loads or in IoT infrastructures. True economic sustainability therefore also requires a return to transparent, comprehensible pricing structures in network design.
To understand why the costs for cloud load balancers can unpredictably skyrocket, one must decipher the mathematical mechanics behind an LCU (as used, for example, by AWS Route 53 or the Application/Network Load Balancers).
An LCU is not calculated based on a single, tangible size (like pure data volume). Instead, the cloud provider continuously measures four completely different dimensions of network traffic:
The crucial catch in this model is the billing logic: At the end of the month, the dimension that caused the highest LCU value is always billed. So if your application works extremely efficiently and consumes hardly any data volume (Dimension 3 low), but due to thousands of IoT sensors must keep many long-lasting connections open (Dimension 2 extremely high), the LCU count skyrockets. You pay for the absolute maximum, even if the other three dimensions were idling.
This nested billing model may be manageable for global tech companies with their own cloud financial management departments (FinOps). For the classic SME, however, it leads to three noticeable disadvantages:
As network traffic dynamically changes due to marketing campaigns, seasonal peaks, or automated system updates, LCU utilization can hardly be precisely calculated in advance. IT management cannot predict whether the load balancer bill will be 50 euros or suddenly 2,500 euros next month. This complicates any reliable budget planning.
Companies developing smart products or networking machinery naturally operate stateful workloads. IoT devices often send only a few bytes every few minutes but keep the TCP connection permanently open to be immediately alert. In the LCU model, the dimension of “active connections” strikes mercilessly here. The company pays astronomical sums for dormant connections that generate hardly any infrastructure load.
Instead of focusing on the further development of the core application, platform engineers in LCU environments are constantly busy artificially optimizing network traffic. Complex architectures are built to aggressively sever connections or save rules – just to circumvent the next LCU cost threshold. This is wasted working time.
That cloud routing can also be economically sustainable and absolutely understandable is proven by sovereign European edge platforms. They do not impose artificial mathematical dimensions on companies but rely on a two-tier, transparent pricing model:
| Billing Feature | US Hyperscaler (LCU Model) | Sovereign Edge Platform (ayedo) |
|---|---|---|
| Price Components | 4 dynamic dimensions (maximum wins) | Fixed base price + linear traffic |
| Planability | Hardly possible (dependent on connection metrics) | Very high (fixed costs plus predictable data consumption) |
| IoT / Stateful Suitability | Poor (high costs for active persistent connections) | Excellent (unlimited number of connections included) |
| Hidden Fees | Yes (rule evaluations, separate monitoring) | No (all-in-one including Prometheus export) |
Digital sovereignty encompasses not only the protection of data from foreign states and compliance with legal frameworks such as GDPR, NIS-2, or DORA. It also means economic self-determination. Those who tie their IT infrastructure to opaque, monopolistic pricing structures give up a piece of this self-determination.
A simple, transparent, and purely volume-based billing model at the edge protects growing companies from unpredictable cost explosions. It gives SMEs back the commercial control over their IT budgets and ensures that cloud infrastructure is once again what it was meant to be from the start: a calculable, reliable, and fair engine for digital innovation.
LCU stands for Load Balancer Capacity Unit. It is an abstract calculation unit introduced by US hyperscalers (primarily Amazon Web Services / AWS) to bill the use of elastic load balancers. Instead of a simple price for data throughput, the system measures four different dimensions (new connections, active connections, processed bytes, and routing rules). The dimension with the highest consumption determines the total cost at the end of the billing period.
The trap lies in the so-called “maximum logic.” If an application is extremely economical in three out of four dimensions but has a peak in a single dimension (e.g., due to many permanently open TCP sessions from IoT devices), the entire month is billed based on this peak value. As user behavior and network streams on the internet dynamically change, the resulting LCU fees are hardly calculable in advance. This regularly leads to unpredictable cost explosions on the monthly cloud bill.
Yes, but these are usually associated with significant architectural effort. Developers must, for example, configure aggressive timeouts to immediately disconnect unused TCP connections or shorten HTTP keep-alive times. While this reduces the number of active connections, it forces clients to constantly establish new connections – which in turn drives up the “new connections” dimension. The problem is often only shifted. The more sustainable solution is to switch the network architecture to transparent, volume-based providers.
In the realm of IT infrastructure, few things are as costly as a modern NVIDIA GPU doing nothing. …
A nightmare for any IT decision-maker is the phenomenon of vendor lock-in—the technological and …
🧠Editorial Europe has been discussing digital sovereignty for about as long as companies have been …