§ 01Executive View
NVIDIA's competitive position in May 2026 is best read as wide-moat, structurally long, with the moat changing shape rather than narrowing in absolute terms. The merchant-GPU share base (~80% of all AI accelerator silicon, down from ~86% in 2024) is being chipped at by hyperscaler ASICs — TPUv7 Ironwood at Anthropic/Meta, Trainium3, Maia 2, and OpenAI's Broadcom-co-designed chip — but the locus of the moat has migrated up the stack from "CUDA-only kernels" to integrated rack-scale systems (NVLink 6 + Spectrum-X + Dynamo + TensorRT-LLM + Kyber 800V), where competitors are 12–18 months behind on every dimension simultaneously. The read is long-supportive: pricing power on hyperscaler frontier-training buys is capped and gross margins have a ceiling near mid-70s, but the rack-as-product transition expands NVIDIA's BOM-per-deployment by ~3× and the alternative stacks (AMD Helios, UALink, ROCm, Huawei CloudMatrix) all remain "good enough to discipline price, not good enough to displace") in 2026 (Klover.ai, Silicon Analysts NVDA share).
§ 02Competitive Set
Direct (merchant AI accelerator silicon and rack systems)
- AMD (AMD) — MI450X / MI455X on TSMC N2-class, up to 432 GB HBM4, 19.6 TB/s, 20 PF FP8 / 40 PF FP4. Helios rack at 72 GPUs / 1.4 EFLOPS FP8 / 2.9 EFLOPS FP4, co-developed with Meta via OCP, HPE first major OEM. OpenAI committed up to 6 GW of MI450 starting H2 2026 (1 GW initial); Meta announced a separate $60B/6 GW partnership Feb 2026. The single most important head-to-head competitor on training-class workloads (AMD Helios blog, Tom's Hardware HPE adoption, AMD-Meta press release).
- Google TPU (GOOGL, via Broadcom-built silicon) — TPUv7 Ironwood "the 900-pound gorilla." Anthropic 1M-unit deal: 400k Ironwoods sold direct as finished racks (~$10B Broadcom revenue) plus 600k rented through GCP (~$42B RPO). Google explicitly targeting "10% of NVIDIA's data-center revenue within a few years" via direct external sales. SemiAnalysis estimates ~44% TCO advantage internally and ~30% to external customers vs GB200 (SemiAnalysis TPUv7, Datacenter Knowledge Anthropic deal).
- AWS Trainium (AMZN, Annapurna/Marvell-built) — Trainium3 + Project Rainier; reportedly >50% of Bedrock token throughput already runs on Trainium/Inferentia. Captive workload mostly, but increasingly a non-trivial share-eraser on inference (CNBC custom silicon).
- Microsoft Maia (MSFT, Broadcom-co-designed) — Maia 2 in 2026; smaller scale than TPU/Trainium but anchors MSFT's diversification away from sole-source NVDA buys.
- Meta MTIA (META, Broadcom-co-designed) — production for ranking/ad workloads; the AMD/Meta 6 GW deal is essentially Meta hedging two ways at once.
- OpenAI custom chip (private, Broadcom partner) — 2027 ramp. Notable because it's the lab most reliant on NVDA and most aggressive about diversification.
- Huawei Ascend 910C / 910D + CloudMatrix 384 (private) — system-level claim of ~300 PF dense BF16 (~2× GB200 NVL72), 3.6× memory capacity, 2.1× bandwidth — but at 4.1× the power draw. Effectively walled into China but it removes NVDA's TAM there structurally (SemiAnalysis CloudMatrix).
Adjacent / Substitute
- Broadcom (AVGO) — not a GPU competitor; the enabler of nearly every hyperscaler ASIC challenger. The structurally important "picks-and-shovels short of NVDA": every TPU, MTIA, Maia, and OpenAI chip pays Broadcom margin. The cohort synthesis frames Broadcom as the cleanest hedge against NVDA's high-end pricing-power compression.
- Inference-only / specialized silicon — Groq (LPU, decode-shape inference winner), Cerebras (wafer-scale), Etched Sohu (transformer-only ASIC), Tenstorrent (RISC-V), SambaNova (reconfigurable dataflow). Individually small; collectively they've taken meaningful share of the "decode" portion of inference where memory bandwidth and not FLOPs is the binding constraint.
- UALink fabric ecosystem — AMD/Cisco/HPE/Marvell/Astera-Labs founded 2024, ratified spec in 2026; Upscale AI's "SkyHammer" scale-up switch targeting Q4 2026. Spec supports 1,024 accelerators in a single scale-up domain vs NVLink 6's 576. This is the most credible architectural challenger to the NVLink moat rather than the CUDA moat (SDxCentral UALink 2.0, HPCwire Upscale AI).
- Software floor: PyTorch 2.x + Triton + MLIR — OpenAI Triton has demonstrated near-parity GPU code across CUDA/ROCm/TPU at the compiler layer. ROCm 7 has reached "production" for PyTorch + vLLM + SGLang workloads in 2026, but TensorRT-LLM and FlashAttention-3 remain CUDA-only (Spheron ROCm vs CUDA 2026).
Emergent
- Vertically integrating customers — every hyperscaler is now a chip designer. The "1 customer = 1 ASIC program" pattern is the structural change of 2025–2026.
- NVLink Fusion as defense — NVDA opened NVLink to Arm, Fujitsu, Qualcomm, Marvell ($2B Marvell deal Mar 2026), explicitly to keep partner CPUs/accelerators terminating on NVDA's fabric. Read as a moat-extension move, not a moat-erosion: NVDA chose to be the rail before someone else built a parallel rail (NextPlatform Marvell deal).
- Photonic / co-packaged optics — Quantum-X / Spectrum-X Photonics shipping 2026; Lightmatter Passage / Ayar Labs / Celestial AI on the merchant side. Not yet a competitive threat — NVDA is currently leading the integration.
- Chinese parallel stack at scale — beyond Huawei: Biren, Moore Threads, Cambricon. Each individually is sub-scale; collectively they're being pushed by domestic policy. The HBM rule is the binding wall.
§ 03Moat Assessment
| Moat type | Score (1–5) | Why |
|---|---|---|
| Cost advantage (scale, process, learning curve) | 3 | Not a structural cost-leader at the silicon level (TSMC node access is non-exclusive; Broadcom-fabbed ASICs hit similar nodes). However, scale of ecosystem investment (CUDA, libraries, NIM, Dynamo) gives a learning-curve advantage that competitors must replicate or substitute. SemiAnalysis estimates TPUv7 internal TCO is ~44% lower than GB200, which says NVDA does NOT have a TCO cost advantage on hyperscaler captive workloads. |
| Switching costs | 5 | The deepest moat. CUDA + cuDNN + TensorRT-LLM + NCCL + Triton-Inference-Server + Dynamo + the developer talent pool is genuinely sticky. Mid-sized ML teams default to CUDA because re-porting and re-tuning is multi-quarter work. The BUT: hyperscalers internalize this cost and write the bill. So switching costs bind enterprises and labs much harder than they bind hyperscalers. |
| Network effects | 4 | Two-sided developer/hardware network (every framework optimization lands on CUDA first → every researcher learns CUDA → every framework optimization lands on CUDA first). Plus NVLink Fusion turning the interconnect itself into a network: opening the fabric makes more partners terminate on NVDA. Strong but not winner-take-all — PyTorch + Triton create a hardware-agnostic compiler floor. |
| Intangible assets (brand, IP, regulatory) | 4 | Reference architecture status is itself an intangible — "Nvidia jumps and every other manufacturer says how high." Patent portfolio on NVLink, NVSwitch, Mellanox InfiniBand, GPU-direct RDMA. The "DGX = the standard" brand effect with research labs. |
| Efficient scale (natural monopoly geometry) | 2 | The merchant AI accelerator market is large enough to support multiple vendors profitably (AMD running ~$7–8B on MI3xx in 2025 demonstrates this). Not a natural-monopoly market; closer to a Coke/Pepsi structure with NVDA as Coke. |
Verdict: wide · Trend: stable (with sub-trends: training-on-merchant-silicon = strengthening; hyperscaler-captive-inference = eroding; rack-scale-systems = strengthening; pure CUDA software lock-in = mildly eroding).
The substantive synthesis: NVDA's moat in 2024 was "CUDA software." That moat is genuinely narrowing — ROCm/Triton/MLIR closed the 80% case in 2024 and the 90% case in 2026, OpenAI Triton ports proved compiler-level abstraction works, AMD's MI300X demonstrated that for many enterprise inference workloads CUDA-only is no longer a hard requirement. However, NVDA spent 2023–2026 explicitly migrating the moat upward from "kernel-level software" to "rack-as-product" — GB200 NVL72, Rubin Vera, NVLink 6, NVSwitch, Spectrum-X, Quantum-X CPO, integrated liquid cooling, Dynamo orchestration, NIM, the Kyber 800V rack platform. At the rack level the integration depth is currently ~12–18 months ahead of AMD Helios and ~24+ months ahead of UALink+merchant scale-up switches. The user's "CUDA moat changing shape" framing is exactly right: the moat depth is preserved, the moat location migrated. For an investment thesis, this is what matters: a moat that has demonstrated the ability to relocate when attacked is itself a meta-moat.
Pricing power on hyperscaler frontier-training contracts is the one place where the moat is measurably eroding. TPUv7 + MI450 + Trainium3 functioning as credible second/third sources gives Anthropic, Meta, OpenAI, Microsoft, Amazon, and Google their first real BATNA in negotiation. This shows up as ~71% gross margin in FY26 vs the ~75–76% the bull case wanted, with mid-70s being the management-guided ceiling rather than a launching pad.
§ 04Share Trajectory
| Metric | 2023 | 2024 | 2025 | 2026E |
|---|---|---|---|---|
| AI accelerator silicon share (merchant + captive) | ~92% | ~86% | ~83% | ~78–80% (consensus); some analysts ~75% |
| AI accelerator silicon share (merchant only) | ~95% | ~92% | ~90% | ~88–90% |
| Custom ASIC shipment growth (YoY) | — | ~30% | ~40% | ~44.6% (forecast) |
| GPU shipment growth (YoY) | — | ~80%+ | ~50%+ | ~16.1% (forecast) |
Source: composite of Silicon Analysts and Introl 2026 estimates (Silicon Analysts, Introl). These are bottoms-up unit estimates from third-party analysts; NVDA does not disclose share. Treat the trend (shrinking share inside a rapidly-expanding TAM) as more reliable than the absolute level.
The honest read: NVDA share is declining in percentage terms while NVDA revenue is growing in dollar terms because the AI accelerator market is growing 30–50% per year. Share at 80% of a $400B+ market in 2026 is dollar-larger than share at 92% of an $80B market in 2023.
§ 05Pricing Power
1. Has NVDA raised prices in the last 24 months? Yes, in mix terms. List ASPs on Blackwell B200 at $35–40k vs Hopper H100 at $25–35k represent a meaningful uplift, and the rack-level GB200 NVL72 at ~$3M list bundles power/cooling/switching/cabling that customers used to source separately, so per-customer revenue has stepped up sharply. Nominal headline pricing is also higher for Rubin per Jensen's GTC commentary (Tech Insider Blackwell pricing).
2. Did volumes hold or grow at the new price? Yes — Q3 FY26 set another record on data-center revenue, and Jensen quoted "$1 trillion through 2027" in Blackwell+Rubin order visibility on the May 2026 commentary (Futurum Q3 FY26, 24/7 Wall St). Pure unit growth has slowed to ~16% (per Silicon Analysts), but dollar growth is still very strong because of mix and rack-level bundling.
3. What does customer concentration tell you about who has leverage? The top 4–5 hyperscalers represent a majority of NVDA data-center revenue (NVDA does not disclose, but consensus puts it at >50% concentrated in <10 customers). These customers have all built captive silicon and use it as negotiation leverage. Hyperscaler discounts are reportedly 15–25% off list. Smaller customers (neoclouds, enterprises, sovereigns) have far less leverage and pay closer to list. Net: NVDA has structural pricing power on the bottom 80% of customers; capped pricing power on the top 20% of customers, where the dollars actually live. This is the single most important nuance — and the customer-analyst is downstream of this point.
§ 06Bull Points
- Rack-as-product expands per-customer BOM ~3×. GB200 NVL72 captures ~$3M of revenue that used to be split across 5+ vendors. Kyber 1 MW racks (2027) extend this further. Even with declining unit share, dollar share at the rack level is growing.
- NVLink Fusion is offense, not defense. Rather than waiting for UALink to ratify, NVDA opened NVLink to partner CPUs/accelerators (Arm, Qualcomm, Fujitsu, Marvell). The fabric becomes the standard before the open standard does. (NVIDIA NVLink Fusion)
- Software moat migrated, not eroded. TensorRT-LLM, Dynamo, NIM, Run.ai (acquired) are 2024-2025 vintage, not 2018. NVDA is still extending the stack faster than competitors are catching the old layer.
- Reference-architecture network effect. ODMs, OEMs, box-builders, power-vendors, cooling vendors all design to NVDA's reference. SuperMicro DCBBS, Vertiv 800V, Foxconn racks — all anchor on NVDA spec first. This is a pull-through multiplier on every adjacent layer in the stack.
- HBM4 + CoWoS-L allocation. NVDA continues to receive priority allocation on the binding-constraint inputs (TSMC CoWoS-L wafers, SK Hynix HBM4 stacks). Competitors compete for the residual.
§ 07Bear Points
- Hyperscaler captive silicon is a ~44.6% YoY shipment growth segment vs ~16.1% for GPU shipments. This is mathematically the share-shift that every bear case relies on, and the data supports it (Introl).
- TPUv7 TCO is ~30% lower than GB200 for external customers. Anthropic chose to go heavy on TPU not for ideological reasons but for unit economics. If TPUv7 + Trainium3 prove out at the workload-quality bar Anthropic and Amazon need, more labs will follow (SemiAnalysis).
- Gross-margin ceiling is now visible. Mid-70s management-guided is the realistic top. The bull case of 80%+ gross margin (the "monopoly margin" upside) is off the table once captive silicon at hyperscalers is real.
- AMD Helios + OpenAI 6 GW + Meta 6 GW removes 2 of the top 5 buyers from a sole-source-NVDA path. That's a structurally different demand picture than 2023.
- China TAM is structurally walled off. H20-class workarounds are the ceiling; CloudMatrix 384 is now domestically credible enough that the China datacenter buildout doesn't need NVDA. This is permanent.
§ 08Conviction (1–5): 4
A 4 rather than a 5 because the moat is changing shape rather than deepening, and at least one of the key sub-moats (CUDA pure software lock-in) is genuinely eroding. A 4 rather than a 3 because the rack-scale and reference-architecture moats are widening at a pace that more than offsets, the financial leverage of the rack-as-product transition is structural, and the alternative ecosystems (UALink, ROCm, captive ASIC) all remain "good enough to discipline" rather than "good enough to displace" through 2026 at minimum. The trend is stable-to-mildly-strengthening on net.
§ 09Key Risks to This Read
- AMD MI450/Helios performance and ROCm parity could close faster than the 12–18-month gap I'm assuming, especially given OpenAI's incentive to make MI450 work. If Helios benchmarks ship at parity for frontier training in late 2026, the gap collapses faster than expected.
- TPUv7 external-sales volume could ramp past the 10% NVDA-revenue target Google has stated, especially if Meta's TPU rental deal expands and other labs follow Anthropic. This is a tail-risk to the merchant-share path more than to the dollar-revenue path.
- UALink 2.0 + merchant scale-up switches (Upscale AI SkyHammer, Astera Labs) shipping in late 2026 could compress NVLink's premium. NVLink 6 vs UALink 2 is the live bandwidth-and-latency comparison to watch in 1H 2027.
- The thesis assumes the rack-scale moat is real and durable, not just a 1-cycle integration lead. If hyperscalers and AMD/Meta/HPE prove that an OCP-spec rack with Helios delivers 90%+ of NVL72 utility for 70% of cost, the moat thesis weakens materially in Rubin Ultra timeframe (2027–2028).
- I am NOT analyzing financial valuation, supply chain, or regulatory risks — those are sibling-analyst dimensions. A "wide moat, stable trend" competitive read is consistent with both a bull and a bear stock view depending on entry multiple and supply-chain assumptions.
Works cited
- NVIDIA 10-Q for quarter ended October 26, 2025
- Recent purchase commitment / inventory disclosures
- NVIDIA FY26 quarterly earnings call transcripts
- Pull-through demand commentary from frontier labs and hyperscalers
- Sovereign AI customer set commentary (UAE, Saudi, Japan, Korea, France, India)
- Neocloud demand layer commentary
- Bloomberg Intelligence - AI Accelerator Market to Exceed $600B by 2033
- Accelerator TAM $604B by 2033 at 16% CAGR - most credible figure
- ASIC TAM $118B by 2033
- Hyperscaler-driven dual GPU+ASIC framing
- Cignal AI - Optical Component Startup Tracker
- Lightmatter $4.4B valuation, $850M raised, L200 CPO 2026
- Marvell acquired Celestial AI Dec 2025 for $5.5B
- Ayar Labs $1B+ valuation, 100 Tbps demonstrators
- Contrary Research - Ayar Labs Business Breakdown
- Optical I/O chiplets sit on processor substrate
- Backed by AMD, Intel, NVIDIA
- Counterpoint - AI Server Compute ASIC Shipments to Triple by 2027
- ASIC growth +44.6% in 2026 vs GPU +16.1%
- Broadcom ~60% of custom ASIC market by 2027
- Marvell ~25%
- Custom Silicon Inflection 2026 — Hyperscaler ASICs vs NVIDIA GPU
- Custom ASIC shipment growth ~44.6% in 2026
- Hyperscaler captive silicon as the dominant share-shift mechanism
- Deloitte - 2026 Semiconductor Industry Outlook
- ~$500B of 2026 semi revenue from AI chips (>50% of industry)
- Concentration in <0.2% of unit volume
- Epoch AI - NVIDIA B200 Production Cost
- B200 manufacturing cost ~$6,400
- Memory ~half of cost
- Fortune Business Insights - AI Accelerator Market Forecast 2034
- AI accelerator $43.75B in 2026 to $309.23B by 2034 at 30.7% CAGR
- Future Markets Inc - Co-Packaged Optics Market 2026-2036
- CPO market sizing horizon
- Spectrum-X / Quantum-X / Bailly platform benchmarking
- Google TPUv7: The 900lb Gorilla In the Room
- TPUv7 internal TCO ~44% lower than GB200 Blackwell
- External Anthropic TCO ~30% lower than NVDA equivalent
- Google targeting 10% of NVDA data-center revenue
- Huawei AI CloudMatrix 384 — China's Answer to Nvidia GB200 NVL72
- CloudMatrix 384: ~300 PF dense BF16 (~2× GB200 NVL72), 3.6× memory capacity, 2.1× bandwidth, 4.1× power
- Architecture-substitutes-for-process strategy
- IDC - 2026 Semiconductor Market: AI Supercycle Arrives
- AI accelerator no overshipment in 2026
- Legacy semis in inventory digestion phase
- Memory prices elevated through 2027+
- IoT Analytics - Data Center Infrastructure Toward $1T by 2030
- DC infrastructure spending $290B in 2024 to $1T+ annual by 2030
- Hyperscaler capex +40% in 2025
- JPMorgan Asset Management - AI Market View
- Hyperscalers cited Jevons Paradox in Q1 2026 earnings
- Demand backlog exceeds capacity
- McKinsey - AI Power: Expanding Data Center Capacity
- 156 GW of AI data center capacity demand by 2030
- 125 incremental GW added 2025-2030
- 70% of new DC demand from AI workloads
- McKinsey - The Cost of Compute: $7T Race to Scale Data Centers
- $5.2T AI-specific data center capex through 2030
- $6.7T total data center capex through 2030
- Full-stack envelope sizing
- Mordor Intelligence - AI Accelerators Market 2030
- AI accelerator market $140.55B in 2025 to $440.30B by 2030 at 25% CAGR
- NVIDIA AI GPU Market Share 2026: ~80% of AI Accelerators
- NVDA AI accelerator share trajectory: ~92% (2023) → ~86% (2024) → ~80% (2026E)
- GPU shipment growth ~16.1% YoY in 2026 vs custom ASIC ~44.6%
- NVIDIA AI Strategy: Analysis of Sustained Dominance
- NVDA's full-stack AI infrastructure positioning
- Reference-architecture network effects
- Philipp Dubach - AI Capex 2026: $690B Arms Race
- ~$725B hyperscaler AI capex confirmed Q1 2026
- Up from $660-690B baseline
- Precedence Research - AI Data Center GPU Market to $77.15B by 2035
- Narrow data center GPU TAM $12.83B (2026) to $77.15B (2035) at 22.06% CAGR
- ROCm vs CUDA for GPU Cloud — Performance, Cost, Compatibility (2026)
- ROCm 7 production-ready for PyTorch + vLLM + SGLang in 2026
- TensorRT-LLM and FlashAttention-3 remain CUDA-only
- Silicon Analysts - NVIDIA B200 Cost Breakdown
- B200 ~84% gross margin at $40K ASP
- Manufacturing cost ~$6,400
- HBM = 45% of COGS
- T. Rowe Price - Why the AI Capex Cycle Is Built to Persist
- Capex financed by hyperscaler operating cash flow
- Cycle structurally different from prior semi cycles
- Yole Group - Silicon Photonics & Co-Packaged Optics in AI
- Copper Wall reached at million-GPU clusters
- CPO as primary disruption vector
- Anthropic Secures Multi-Gigawatt TPU Deal With Google, Broadcom
- Anthropic 1M TPUv7 chip access
- 400k Ironwoods sold direct (~$10B Broadcom rev) + 600k via GCP (~$42B RPO)
- Carbon Credits - NVIDIA 92% GPU Share 2025
- 92% discrete GPU share end-2025
- 97% data center GPU accelerator share 2026
- HPE adopts AMD's Helios rack architecture for 2026 AI systems
- HPE first major OEM adopting Helios
- AMD opening rack architecture to OEM/ODM partners
- NVIDIA Price Target Raised to $325 — $1T Blackwell Revenue
- Jensen quoted $1T Blackwell+Rubin orders through 2027
- NVIDIA Q3 FY 2026 Earnings: Record Data Center Revenue
- Q3 FY26 record data-center revenue
- Higher Q4 guide implies sustained pricing+volume
- Nvidia sales 'off the charts,' but Google, Amazon make custom AI chips
- Google >75% of Gemini on TPUs
- AWS Trainium >50% of Bedrock token throughput
- Hyperscaler dual-sourcing pattern
- The $2 Billion Nvidia Deal With Marvell Is About More Than NVLink Fusion
- NVDA opening NVLink to partner CPUs/accelerators via NVLink Fusion
- Marvell, Arm, Fujitsu, Qualcomm as early adopters
- Tom's Hardware - Blackwell AI Superchip Pricing
- Blackwell superchips up to $70K
- GB200 NVL72 list ~$3M
- Tom's Hardware - Semiconductor Industry Enters Giga Cycle
- Cycle phase characterization
- AI rewriting compute/memory/networking economics simultaneously
- Tom's Hardware - Vera Rubin NVL72 Rack Pricing $8.8M
- Vera Rubin VR200 NVL72 quoted $5-7M with high-end up to $8.8M
- Rack-as-product ASP escalation
- UALink Consortium 2.0 spec takes another swing at NVLink supremacy
- UALink 2.0 ratified as industry standard in 2026
- Spec supports 1,024 accelerators in single scale-up domain vs NVLink 6's 576
- Upscale AI Eyes Late 2026 for Scale-Up UALink Switch
- First commercial UALink switch (SkyHammer) targeting Q4 2026
- AMD and Meta Announce Expanded Strategic Partnership — 6 GW
- Meta committing 6 GW of AMD GPUs through 2030
- Major hyperscaler diversifying away from sole-source NVDA
- AMD Helios — AI Rack Built on Meta's 2025 OCP Design
- Helios rack: 72 MI450 GPUs, 1.4 EFLOPS FP8, 2.9 EFLOPS FP4
- Co-developed with Meta via OCP
- NVIDIA Blackwell GPU Pricing: B200, B300, DGX Cost
- B200 list price $35–40k
- Hyperscaler discounts 15–25% off list
- NVLink Fusion product page — NVIDIA
- NVLink Fusion: semi-custom AI infrastructure terminating on NVDA fabric
- AICerts News: HBM Supply Crunch — AI Memory Shortage Through 2027
- HBM tightness extends through 2027
- ~20% HBM ASP rise expected 2026
- AMD valuation statistics
- AMD market cap $588B, forward P/E 53.4x, EV/Sales 16.8x, EV/EBITDA 86.2x
- Astute Group: Advanced Packaging Demand Soars — Nvidia Secures 60% of CoWoS Capacity
- NVIDIA captures ~60% of TSMC CoWoS through 2027
- Morgan Stanley CoWoS allocation forecast
- BIS — Export Controls on Advanced Computing and Semiconductor Manufacturing Items, including HBM (Dec 2, 2024)
- 89 FR 96790; HBM rule with FDPR de minimis coverage; binds Hynix/Samsung/Micron HBM exports to China-headquartered entities
- BIS — Export Controls on Semiconductor Manufacturing Items (Oct 17, 2023 update)
- 88 FR 73424; A800/H800 capture; FDPR extension; H20 origination pathway; removal of performance density safe harbor
- BIS — Framework for Artificial Intelligence Diffusion (AI Diffusion IFR, Jan 13, 2025)
- 90 FR 4544; Tier 1/2/3 country framework; VEU/NVEU pathways; country compute caps over Tier 2 sovereign-AI markets
- BIS — Implementation of Additional Export Controls: Certain Advanced Computing and Semiconductor Manufacturing Items (Oct 7, 2022 IFR)
- 87 FR 62186; original advanced-computing and semiconductor manufacturing controls; A100/H100 capture; basis for the H800/A800/H20 SKU lineage
- Broadcom (AVGO) valuation statistics
- AVGO market cap $1.99T, forward P/E 31.3x, EV/Sales 30.0x, EV/EBITDA 55.0x
- China SAMR — investigation into NVIDIA (Mellanox conditional approval)
- Dec 2024 SAMR public notice opening investigation into NVIDIA's compliance with Mellanox approval conditions; widely read as retaliation tooling
- CHIPS and Science Act of 2022 (P.L. 117-167) and CHIPS Program Office disbursement announcements
- TSMC Arizona ~$6.6B + $5B loan; Intel ~$8.5B grant + $11B loan; Samsung Austin/Taylor ~$6.4B; Micron NY/ID ~$6.1B; 10-year guardrails on advanced fabs in restricted countries
- Cohort companies data — NVIDIA entry
- NVIDIA risk taxonomy (custom silicon, CoWoS, AMD MI450X)
- Catalyst list (Rubin/Kyber/800V/Dynamo)
- Reference-architecture positioning quotes from corpus notes
- Cohort companies.json — NVDA entry (customer dimension use)
- NVDA sentiment +2, mentionCount 95
- Catalysts: Rubin/Rubin Ultra, Kyber 600 kW / 1 MW rack, 800V HVDC, CPO, Dynamo
- Risks: custom silicon pricing-power cap, CoWoS / power bottlenecks, MI450X frontier-workload competition
- Cohort synthesis — semiconductor-industry
- Three-bottleneck frame (logic/memory/power)
- Unit-cost-of-intelligence as denominator for structural demand
- Power as ultimate constraint
- + 3 more
- Cohort synthesis.md (used for customer / buyer-set framing)
- Value-chain map L13 buyer set (hyperscalers, neoclouds, frontier labs)
- Rack-as-product framing: per-rack BOM ~$3M+, per-deployment NVDA capture ~3x prior model
- Unit-cost-of-intelligence Jevons demand framing
- + 4 more
- CRS R48642: U.S. Export Controls and China — Advanced Semiconductors
- Export control framework
- HBM rule (Dec 2024)
- China gallium reciprocity
- Crucible Capital — 'Building a Datacenter Part II' (cohort corpus Note)
- OEM/ODM channel structure: board → Supermicro/Quanta/Foxconn → hyperscaler datacenter
- Reference-architecture moat-deepening framing
- Rack-as-product capture economics tripling per-deployment NVDA share
- Crucible Capital — 'The AI Power Crisis Part 1 & 2' (cohort corpus Notes)
- Vertiv 4Q'25 +152% organic order growth as marker of pull-through demand
- Stargate Texas 2.3 GW onsite gas plant — largest single onsite gas order ever
- xAI Colossus 1+2 buildout pace (>1 GW)
- + 2 more
- Crucible Capital — 'The Semiconductor Industry: A Beginner's Companion' (cohort corpus Note)
- Three-bottleneck frame (logic / memory / power)
- Custom silicon mapping (TPUv7 / Trainium / MTIA / Maia / OpenAI 2027 chip)
- Anthropic 400k-unit / ~$10B TPUv7 deal at Google
- + 2 more
- CSIS: Understanding the Biden Administration's Updated Export Controls
- Dec 2024 HBM rule context
- Country-wide HBM controls precedent
- Digitimes: Advanced packaging drives ABF substrate expansion (Dec 2025)
- Ibiden capacity expansion
- ABF supplier landscape — Ibiden, Unimicron, Kinsus, Shinko, Nan Ya
- Digitimes: AI chip rivalry escalates — ABF substrate sells out at Unimicron, Kinsus, Nan Ya PCB
- ABF substrate undersupply 2026
- Unimicron, Kinsus, Nan Ya PCB allocations
- Digitimes: TSMC expands CoWoS capacity with Nvidia booking over half for 2026-27
- NVIDIA majority allocation 2026-27
- TSMC equipment ramp
- DOJ Antitrust Division — public statements on AI compute review
- Preliminary inquiry into CUDA bundling; Run.ai vertical review (cleared without divestiture late 2024); ongoing monitoring of AI compute concentration
- Epoch AI: NVIDIA's B200 costs around $6,400 to produce
- B200 chip-level cost ~$5,700-7,300
- Implied chip-level gross margin ~82%
- EU AI Act — Regulation 2024/1689
- General-purpose AI obligations on model developers; indirect demand-side impact only for NVIDIA
- EU Dual-Use Regulation 2021/821 (recast)
- Legal vehicle for any future EU export controls on AI compute or harmonization with US BIS rules
- European Commission DG COMP — communications on AI foundation models / AI compute review (2024-2025)
- Preliminary review of AI compute markets; pre-Statement-of-Objections; conduct remedies on access/interoperability are most plausible outcome
- FinancialContent: TSMC Targets 150,000 CoWoS Wafers to Fuel NVIDIA's Rubin Revolution
- TSMC ~150k CoWoS wafers/month target by late 2026
- NVIDIA ~595k 2026 wafer booking
- FTC — Generative AI and Cloud Computing 6(b) Study
- 6(b) order to AI compute / cloud providers; baseline for any future enforcement on AI compute concentration
- FusionWW: Inside the AI Bottleneck — CoWoS, HBM, 2-3nm Capacity Through 2027
- Three-bottleneck framing
- Capacity constraint timelines
- Hyperscaler FY25/FY26 capex disclosures (MSFT, META, GOOGL, AMZN, ORCL)
- Aggregate 2026 hyperscaler capex ~$600B with majority AI infrastructure
- Mapping of NVDA's >10% indirect end-customers to hyperscaler base
- Oracle Stargate Texas commitment (2.3 GW gas plant, OpenAI/Oracle/Crusoe)
- + 1 more
- In re NVIDIA Securities Litigation — SCOTUS No. 23-970 (June 2024) and N.D. Cal. remanded proceedings
- Crypto-mining disclosure case; 9th Cir reversal of dismissal vacated by SCOTUS June 2024; remanded for further proceedings
- Introl Blog: Trump Opens H200 Exports to China with 25% Surcharge (Dec 2025)
- H200 China export policy update
- Surcharge mechanism on China-bound product
- IntuitionLabs: NVIDIA GB200 Supply Chain — The Global Ecosystem Explained
- End-to-end GB200 supplier mapping
- Geographic concentration of Asian suppliers
- Japan METI — Foreign Exchange and Foreign Trade Act amendments on semiconductor manufacturing equipment (May 2023)
- 23-category semicap export restrictions
- KED Global: Samsung, SK Hynix win Vera Rubin HBM4 slots, widening lead over Micron
- HBM4 vendor allocation for Vera Rubin
- Korea Herald: Nvidia's 16-layer HBM push raises stakes for memory chip-makers
- HBM4E 16-Hi roadmap pressure
- Hybrid bonding tooling chokepoint
- Lane coordination — financial-analyst and competitor-analyst
- Customer dimension owns volume durability and buyer-set composition
- Pricing power / gross-margin sensitivity to hyperscaler counter-leverage owned by financial-analyst
- Competitive share-shift mechanics (TPU/Trainium/Maia/MI450X) owned by competitor-analyst
- Lane coordination — regulatory analyst
- Macro owns trade-flow direction and FX consequences
- Regulatory owns specific BIS rules, H20-class spec ceilings, active legal matters
- Coordination prevents double-counting of tariff/export-control exposure
- Marvell Technology (MRVL) valuation statistics
- MRVL forward P/E 41.4x, EV/EBITDA 51.1x
- Netherlands — expanded export control measures on advanced semiconductor manufacturing equipment (Dec 2024)
- ASML EUV/NXT:2000i restrictions; tightens China parallel-stack ecosystem indirectly supporting NVIDIA franchise
- NVIDIA Announces Financial Results for Fourth Quarter and Fiscal 2024
- FY24 revenue $60.9B (+126% YoY)
- FY24 segment: Data Center $47.5B (78%), Gaming $10.4B (17%), ProVis $1.55B, Auto $1.09B, OEM $306M
- NVIDIA Announces Financial Results for Fourth Quarter and Fiscal 2026
- FY26 revenue $215.9B (+65% YoY)
- FY26 GAAP operating income $130.4B, net income $120.1B, diluted EPS $4.90
- FY26 OCF $102.7B, FCF $96.6B (calc), capex $6.0B
- + 6 more
- NVIDIA balance sheet history (StockAnalysis)
- AR/inventory/goodwill/debt trajectory FY24-FY26
- NVIDIA cash flow statement, 5-year history (StockAnalysis)
- FY24 OCF $28.1B, FCF $27.0B, SBC $3.5B, buybacks $9.5B
- FY25 OCF $64.1B, FCF $60.9B, SBC $4.7B, buybacks $33.7B
- FY26 OCF $102.7B, FCF $96.7B, SBC $6.4B, buybacks $40.1B
- NVIDIA CFO Commentary on Fourth Quarter Fiscal 2025 Results
- FY25 segment breakdown: Data Center $115.2B, Gaming $11.4B, ProVis $1.9B, Auto $1.7B, OEM $0.4B
- Balance sheet at Jan 26 2025: AR $23.1B, Inventory $10.1B, Goodwill $5.2B, Total debt ~$11.4B
- NVIDIA Corporation FY25 Form 10-K (annual report) — customer concentration disclosure
- FY25 customer-concentration language: multiple direct customers each >10% of revenue, disclosed alphabetically (Customer A/B/C/D)
- Top single direct customer rose from ~13% in FY24 to ~19-22% range in FY25 disclosure window
- Purchase obligations to suppliers >$30B (TSMC CoWoS, SK Hynix HBM, OEM/ODM)
- + 1 more
- NVIDIA Corporation FY26 interim 10-Q filings
- FY26 customer-concentration trend: top-2 customers each >10% of revenue
- FY26 segment mix run-rate: Data Center ~88% (Compute + Networking)
- Supply-constrained vs demand-constrained framing; CoWoS / HBM allocation as binding constraint
- NVIDIA Corporation — Form 10-K filings FY 2025 and FY 2026 (risk factors / contingencies)
- Segment disclosures, risk factors on export controls and litigation, China revenue impact disclosures
- NVIDIA current valuation statistics (StockAnalysis)
- Market cap $4.82T, EV $4.77T
- Forward P/E 23.8x, trailing P/E 40.5x
- EV/Sales 22.1x, EV/EBITDA 35.8x
- + 2 more
- NVIDIA Form 10-K, fiscal year ended January 25, 2026
- Annual filing covering FY26 (year ended Jan 25, 2026), filed Feb 25, 2026
- NVIDIA FY24/FY25 10-K disclosures — geographic and FX framing
- Geographic revenue mix (US ~45-50%, Singapore booking ~15-20%, China ~10-15%, Taiwan ~5-8%)
- USD invoicing convention
- Minimal net debt
- + 2 more
- NVIDIA historical P/E ratio (Macrotrends)
- NVDA 5-year average P/E ~68x; current ~40x trailing is ~40% below 5y average and 26% below 10y mean of 54x
- NVIDIA income statement multi-year (StockAnalysis)
- FY24 revenue $60.9B, GM 72.7%, OM 54.1%, NM 48.9%, EPS $1.19, diluted shares 24,940M
- FY25 revenue $130.5B, GM 75.0%, OM 62.4%, NM 55.9%, EPS $2.94, diluted shares 24,804M
- FY26 revenue $215.9B, GM 71.1%, OM 60.4%, NM 55.6%, EPS $4.90, diluted shares 24,514M
- NVIDIA Investor Relations — SEC filings (10-K / 10-Q portal)
- 10-K Risk Factors
- Sources & availability of materials disclosure
- Purchase commitments and prepaid supply
- Packnode: The Compute Packaging Bottleneck — CoWoS Capacity Reshaping Chip Industry
- CoWoS-L bottleneck dynamics
- Pricing Power in the Agentic Era: How Blackwell Ultra Secures Nvidia's 75% Gross Margins
- Pass-through power
- Blackwell Ultra 35% generational premium
- GAAP gross margin >73-75%
- SEC — The Enhancement and Standardization of Climate-Related Disclosures for Investors (Final Rule, March 2024; stayed)
- Scope 1/2/3 disclosure obligations subject to Eighth Circuit consolidated litigation outcome
- TrendForce: Samsung, SK hynix Tapped as NVIDIA Rubin HBM4 Suppliers (Mar 2026)
- Samsung HBM4 qualification at NVIDIA cleared March 2026
- Dual-source path for Rubin
- TrendForce: SK hynix to Supply ~2/3 of NVIDIA HBM4 (Jan 2026)
- SK Hynix ~70% of NVIDIA HBM4 allocation
- Samsung ~28%, Micron ~18% HBM4 share
- UK CMA — AI Foundation Models Update Paper (2024)
- Market study identifying AI compute access concentration concerns
- US Department of Commerce — Section 232 investigation on semiconductors (initiation)
- 2025 Section 232 investigation on semiconductors; potential Taiwan-origin tariff exposure of 200-400 bps gross margin before pass-through
- User cohort-level customer context (provided in analyst brief)
- Hyperscaler dual-buyer counter-leverage: hyperscalers buy NVIDIA AND build custom silicon to keep negotiating power
- Neocloud demand layer (Crusoe, Fluidstack, Lambda, CoreWeave) as new buyer class with own dynamics
- Sovereign AI / state-level buyers as structural new customer set
- + 2 more
- User-documented cohort macro lens
- Taiwan loss treated as existential
- US-China decoupling structurally suppresses China revenue
- Datacenter capex framed as structural this cycle
- + 2 more
- Uyghur Forced Labor Prevention Act (UFLPA, P.L. 117-78); CBP enforcement guidance
- Rebuttable presumption against Xinjiang-nexus goods; supply chain risk on gallium/3TG/polysilicon upstream