Ten diagrams that make the technical map and the beginner's companion legible at a glance. Each plate is a concept graph — nodes are things, edges are dependencies. Read them slowly; the industry is a graph, not a list.
Every AI-compute story is one of three constraints in disguise. They are not independent — relieving one usually loads another.
The spine of the industry, read upward from atoms to geopolitics. Everything in the technical map lives at one of these layers — and every layer constrains the one above it.
A cross-section of a modern leading-edge die. Signals come in from the top; power increasingly comes in from the back. Nine-tenths of the height is wire, not transistor.
A modern AI accelerator is not a chip — it is a package: compute dies, HBM stacks, and a silicon interposer all co-fabricated as one object. This is CoWoS, TSMC's engine of the AI buildout.
The atomic-scale story of how the industry kept shrinking past the limits of each previous design — by redesigning the switch itself.
No country makes a modern AI chip alone. Each step is controlled by one-to-three suppliers, usually in a single country, and disrupting any of them cascades through the whole industry.
A single GPU's journey through the industry, with the cumulative lead-time clock running in the margin. It takes the better part of a year to go from polished wafer to a running rack.
Post-Blackwell the rack is the product. Seventy-two GPUs stitched into one scale-up domain by a copper NVLink backplane, drawing ~120 kW and cooled by liquid. Rubin's Kyber rack will push this to 144 and then 576 GPUs — and go optical.
An AI cluster is not one network, it is three — at very different scales, speeds, and costs. The back-end alone accounts for ~85% of cluster networking spend.
From high-voltage utility feeder to transistor: roughly a dozen conversion stages, each shedding heat, each with its own supply chain. This is why "build a 1 GW datacenter" is a three-year project, not a three-month one.