Edge architecture · a comparative study · 2026

Five ways to put Kubernetes at the edge.

A quantitative comparison of edge Kubernetes deployment topologies for large distributed infrastructures — built around a simulator you can drive with your own parameters.

§ 01 · Context

The decision that cascades.

A distributed edge deployment — thousands of small sites, each with a handful of servers, coordinated by a central management cluster — forces one question first: where does the Kubernetes control plane live? Five topologies are credible, each distributing control-plane components differently between the edge sites and the central cluster.

The trade-offs between robustness, latency, cost, and operational complexity aren't reducible to a single number — but they can be made quantitative. This page formalises the parameters, defines the metrics, and provides a live simulator. The simulator is the centerpiece: drive the parameters, watch the metrics shift.

§ 02 · The five models

Each topology places control-plane components differently.

01

Fully autonomous

Each edge site is a self-contained Kubernetes cluster with its own redundant control plane on dedicated master servers, plus workers. Maximum robustness, maximum overhead.

local control plane · dedicated masters · high overhead

02

Shared autonomous

Each site is self-contained, but master and worker components are co-located on the same servers. Same robustness, less hardware waste, identical operational complexity.

local control plane · co-located · medium overhead

03

Hosted control plane (Kamaji)

Each tenant control plane runs as pods inside a central management cluster. Edge sites contain workers only. The hyperscaler-style hosted control plane.

hosted CP · no edge masters · low overhead

04

Distributed autonomous

A hybrid: one master runs locally as a fallback while the rest of the control plane runs remotely. Mixes patterns in a way that fails to maintain control-plane quorum during a partition.

hybrid · no quorum on partition · questionable

05

Gigantic cluster

A single Kubernetes cluster with masters in the cloud and every edge server as a worker. Operationally simple but hits scalability ceilings and exposes the maximum blast radius.

single cluster · hits K8s limits · huge blast radius

§ 03 · Simulator

Drive the parameters.

Sliders update the metrics in real time. Cells turn warning-coloured or danger-coloured when a model crosses a meaningful threshold for the chosen deployment scale.

Parameters

N = 200 · S = 8 · mc = 3 · k = 3 · d = 10 · α = 0.70 · Ll = 0.5 ms · Lw = 20.0 ms · Cops = €10k/y · Pmaster = 200 W

Metrics

Model                      | CP % | Fail-ok | WAN-ok | Lat ms | Blast | Max/cl | OpEx/y | Energy/y
(1) Fully autonomous       | 37.6 | 1       | yes    | 0.5    | 0     | 8      | €3.0M  | 1.1 GWh
(2) Shared autonomous      |  7.7 | 1       | yes    | 0.5    | 0     | 8      | €2.6M  | 215 MWh
(3) Hosted CP (Kamaji)     |  3.8 | 1       | no     | 20.5   | 200   | 8      | €1.0M  | 110 MWh
(4) Distributed autonomous | 15.8 | 0       | no     | 6.5    | 200   | 8      | €1.2M  | 461 MWh
(5) Gigantic cluster       |  0.2 | 1       | no     | 20.5   | 200   | 1,600  | €30k   | 5 MWh

Legend: good · warning · danger. Ops scales: M1=1.5 · M2=1.3 · M3=0.5 · M4=0.6 · M5=3.0
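To make the pipeline concrete, here is a minimal Python sketch that reproduces the table above from the § 04 defaults, assuming the formulas of § 05. It is illustrative, not the page's actual simulator code: the names (Params, FCP, metrics, specs) and the per-model placement encoding are assumptions.

```python
# Minimal sketch of the metric pipeline, assuming the § 05 formulas.
from dataclasses import dataclass
from math import ceil, floor

@dataclass
class Params:
    N: int = 200          # edge sites
    S: int = 8            # servers per site
    mc: int = 3           # central masters
    k: int = 3            # replicas per hosted tenant control plane
    d: int = 10           # tenant control planes per central server
    alpha: float = 0.70   # local cache hit fraction
    Ll: float = 0.5       # intra-site latency, ms
    Lw: float = 20.0      # WAN latency, ms
    Cops: float = 10_000  # ops cost per cluster per year, EUR
    Pmaster: float = 200  # master power draw, W

FCP = 0.2  # a co-located master counts as a fifth of a dedicated server

def metrics(p: Params) -> dict:
    hosting = ceil(p.N * p.k / p.d)  # central servers hosting tenant CPs
    # (name, dedicated local masters, co-located local masters,
    #  uses hosted CP, latency mode, ops scale, cluster count)
    specs = [
        ("(1) Fully autonomous",       3, 0, False, "local",  1.5, p.N),
        ("(2) Shared autonomous",      0, 3, False, "local",  1.3, p.N),
        ("(3) Hosted CP (Kamaji)",     0, 0, True,  "remote", 0.5, p.N),
        ("(4) Distributed autonomous", 1, 0, True,  "hybrid", 0.6, p.N),
        ("(5) Gigantic cluster",       0, 0, False, "remote", 3.0, 1),
    ]
    rows = {}
    for name, mded, mshared, hosted, mode, ops_scale, clusters in specs:
        mhost = hosting if hosted else 0
        meq = (mded + FCP * mshared) * p.N + p.mc + mhost   # § 05 · 01
        overhead = 100 * meq / (p.N * p.S + p.mc + mhost)   # § 05 · 02
        local = mded + mshared                              # local masters
        fail_ok = floor(((local or p.mc) - 1) / 2)          # § 05 · 03
        wan_ok = local > 0 and not hosted                   # § 05 · 04
        lat = {"local": p.Ll,                               # § 05 · 05
               "remote": p.Ll + p.Lw,
               "hybrid": p.Ll + (1 - p.alpha) * p.Lw}[mode]
        blast = 0 if wan_ok else p.N                        # § 05 · 06
        max_nodes = p.N * p.S if clusters == 1 else p.S     # § 05 · 07
        opex = clusters * ops_scale * p.Cops                # § 05 · 08
        energy_kwh = meq * p.Pmaster * 24 * 365 / 1000      # § 05 · 09
        rows[name] = (round(overhead, 1), fail_ok, wan_ok, round(lat, 1),
                      blast, max_nodes, opex, round(energy_kwh))
    return rows

for name, row in metrics(Params()).items():
    print(name, row)
```

Run with the defaults, this reproduces the table's values, from 37.6 % overhead and €3.0M/y for model 1 down to 0.2 % and €30k/y for the gigantic cluster.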

Columns

CP %
Control-plane resources as a percentage of total compute (control overhead). Dedicated master servers count fully; co-located ones count at fcp=0.2.
Fail-ok
Local server failures the site can absorb before its control plane loses write availability.
WAN-ok
Whether the edge site keeps a writable control plane during a WAN partition with the central cluster.
Lat ms
Effective round-trip latency for an API call from the site to a control-plane server.
Blast
Number of edge sites disrupted by a failure of the central management cluster.
Max/cl
Maximum nodes joined to a single Kubernetes cluster. The practical scalability ceiling sits around 5,000.
OpEx/y
Annual operations cost = clusters × ops-scale × Cops. Ops-scale reflects per-cluster operational difficulty.
Energy/y
Annual energy of the master infrastructure = master-nodes-equivalent × Pmaster × 24 h × 365.
§ 04 · Parameters

The ten knobs.

Symbol  | Name             | Range         | Default | Description
N       | Edge sites       | 1 – 5,000     | 200     | Number of edge sites in the deployment
S       | Servers per site | 2 – 30        | 8       | Servers at each edge site (assumed uniform)
mc      | Central masters  | 1 – 7         | 3       | Master nodes in the central management cluster
k       | CP replicas      | 1 – 5         | 3       | Replication factor of each hosted tenant control plane
d       | CP density       | 1 – 20        | 10      | Tenant control planes packed per central server (Kamaji)
α       | Local cache hit  | 0 – 1         | 0.70    | Fraction of API operations servable within a local site
Ll      | Local latency    | 0.1 – 5 ms    | 0.5 ms  | Intra-site latency to nearest server
Lw      | WAN latency      | 1 – 100 ms    | 20 ms   | Edge-to-central WAN latency
Cops    | Ops cost         | €1k – 50k / y | €10k    | Operational cost per cluster per year
Pmaster | Master power     | 100 – 400 W   | 200 W   | Average power draw of a master server
§ 05 · Metrics

Nine derived quantities.

01

Total master footprint

Meq = (mded + fcp·mshared)·N + mc + ⌈(N·k) ÷ d⌉
    = Me·N + mc + Mhosting

Total master footprint across the deployment.
mded is the number of servers per site dedicated to master nodes, while mshared is the number of servers acting as both master and worker. Me is the master footprint per edge site (co-located masters count at fcp = 0.2). Mhosting is the number of central servers needed to host tenant control planes.
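With the defaults (N = 200, k = 3, d = 10, mc = 3): model 1 gives Meq = 3·200 + 3 = 603; model 2 gives 0.2·3·200 + 3 = 123; model 3 gives 3 + ⌈(200·3) ÷ 10⌉ = 63; model 4 gives 200 + 3 + 60 = 263; model 5 is just mc = 3.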

02

Control plane overhead

O = Meq ÷ (N·S + mc + Mhosting) × 100

Master server-equivalents as a percentage of total compute servers.
Note: the model assumes the central cluster contains only its master nodes plus the workers needed to run tenant control planes, with no additional static workers.
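Worked example at the defaults: model 1 has O = 603 ÷ (200·8 + 3) × 100 ≈ 37.6 %, and model 3 has O = 63 ÷ (1,600 + 3 + 60) × 100 ≈ 3.8 %, matching the table in § 03.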

03

Local failures tolerated

⌊(mded + mshared − 1) ÷ 2⌋ for local quorum ; ⌊(mc − 1) ÷ 2⌋ for central

Server failures the site can absorb before its control plane loses write availability.
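For example, three local masters tolerate ⌊(3 − 1) ÷ 2⌋ = 1 failure, while model 4's single local master tolerates ⌊(1 − 1) ÷ 2⌋ = 0.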

04

WAN survival

survives ⇔ local control plane has quorum

Whether the edge site keeps a writable control plane during a WAN partition.

05

Effective CP latency

Leff = Ll (local) ; Ll + Lw (remote) ; Ll + (1 − α)·Lw (hybrid)

Round-trip to a control-plane API server, accounting for an optional local cache.
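At the defaults, the hybrid case (model 4) gives Leff = 0.5 + (1 − 0.70)·20 = 6.5 ms, against 0.5 ms for a fully local control plane and 20.5 ms for a purely remote one.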

06

Blast radius

sites disrupted by central failure : 0 (autonomous) or N (hosted)

How many edge sites lose control-plane writes if the central cluster fails.
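With the default parameters, a central-cluster failure disrupts all N = 200 sites under models 3–5 and none under models 1 and 2.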

07

Max nodes per cluster

S (distributed) ; N·S (gigantic)

Hits Kubernetes' practical scalability ceiling around 5,000 nodes.
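At the defaults, the gigantic cluster joins 200 × 8 = 1,600 nodes, still under the ceiling; with S = 8, it crosses 5,000 nodes once N exceeds 625.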

08

OpEx per year

OpEx = clusters × ops-scale × Cops

Ops-scale factors encode per-cluster operational difficulty: Kamaji-managed clusters are homogeneous and cheap to run (0.5–0.6), bespoke per-site clusters are heavy (1.3–1.5), and the single gigantic cluster is the heaviest of all (3.0).
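Using the defaults, model 1 costs 200 × 1.5 × €10k = €3.0M per year, while the single gigantic cluster costs 1 × 3.0 × €10k = €30k.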

09

Energy per year

E = Meq × Pmaster × 24 h × 365

Energy consumed by control-plane infrastructure alone (excludes worker load).
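Using the defaults, model 1's 603 master-equivalents draw 603 × 200 W × 8,760 h ≈ 1.06 GWh per year; model 3's 63 server-equivalents draw ≈ 110 MWh.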

§ 06 · Limitations

What this model deliberately doesn't capture.

The model assumes uniform edge sites (real deployments rarely are), ignores correlated failures (a single fibre cut can take down many sites at once), does not model storage, and has no Availability Zone concept, which typically applies only to cloud deployments.