Six Nines (99.9999%) SLA Explained
Explore the extreme availability requirement of 99.9999% (six nines), allowing only 31.5 seconds of annual downtime. Reserved for life-safety and financial core systems.
Detailed Explanation
What Does 99.9999% Uptime Mean?
A 99.9999% SLA, known as six nines, allows only approximately 31.5 seconds of downtime per year. This is an almost theoretical availability level that very few systems in the world actually achieve.
Downtime Breakdown
| Period | Allowed Downtime |
|---|---|
| Per year | 31.5 seconds |
| Per month | 2.6 seconds |
| Per week | 0.6 seconds |
| Per day | 0.086 seconds (86 milliseconds) |
The Reality of Six Nines
At 86 milliseconds of daily allowed downtime, six nines is virtually indistinguishable from "always on." To put this in perspective:
- A single TCP connection timeout (typically 3 seconds) would consume an entire month's budget
- A garbage collection pause in Java can exceed the daily budget
- DNS resolution alone can take 50-200ms, nearly consuming the daily allowance
How Is It Possible?
Systems that approach six nines use fundamentally different architectures:
- Hardware redundancy at every level — redundant power supplies, network cards, storage controllers
- Active-active across multiple regions with sub-second failover
- No single dependency — every external service has a fallback
- Custom-built infrastructure — off-the-shelf solutions cannot provide this level
- Formal verification of critical code paths
Real-World Examples
- Telephone switching systems — traditional PSTN was engineered for six nines
- Nuclear plant control systems — safety-critical systems with triple redundancy
- Air traffic control — zero tolerance for system failure
- Pacemakers and medical devices — hardware-level redundancy
The cost of six nines can be astronomical. It is justified only when human life or financial system integrity is at stake.
Use Case
Six nines is reserved for life-safety systems (medical devices, aviation), core financial infrastructure (stock exchange matching engines), and telecommunications backbone systems where failure has catastrophic consequences.