info@castlerockdigital.com · LinkedIn: /castle-rock-digital-llc · castlerockdigital.com
01 · Storage
02 · Systems
03 · Facilities
04 · Quantum
05 · Processing
06 · Interconnects
07 · Memory
08 · TCO & Procurement
TCO &
Procurement
Hardware specs fill seven modules of this series. This one answers the question every CFO and investment committee ultimately asks: what does it actually cost? A 100,000-GPU cluster carries a $3.8B hardware price tag — but that represents less than 52% of true 5-year total cost of ownership once power, cooling, networking, facilities, staffing, software, and maintenance are modeled in full. For every dollar spent on GPUs, buyers must budget $0.93 in ancillary costs over the ownership period.
◆ Ownership Reality
100K-GPU Cluster$3.8B Hardware CapEx$5.8B True 5-Year TCOHardware = <52% of Cost
Global AI Infra CapEx 2026
$312B
+41% YoY · Hyperscaler + Enterprise + HPC
Cost / TFLOPS FP16 (B200)
$0.09/TF
Down from $3.40 (V100 2017) — 38× improvement
Power % of 5-Year TCO
22%
100K GPU cluster at $0.05/kWh · +3pp vs. prior gen
GPU Refresh Cycle
2.8yr
AI accelerators — down from 5.2yr in 2019
CapEx by Cluster Size — Initial Build Cost ($M)
Annual OpEx Breakdown — 100K GPU Cluster (~$1.06B/yr)
$131M/yr
Power cost for 300 MW cluster at $0.05/kWh — rises to $472M/yr at $0.18/kWh (EU colocation rates)
200–400 FTE
Staffing required for 100K GPU cluster — $36–88M/yr at $180–220K fully-loaded; talent scarcity is the binding constraint
$50–120M
Annual enterprise software licenses at 100K GPU scale — NVIDIA AI Enterprise, workload managers, storage SW, security tooling
💡
Energy Arbitrage is a $585M Competitive Moat: The spread between cheapest (Iceland hydro ~$0.03/kWh) and most expensive (UK colocation ~$0.18/kWh) electricity translates to a $585M 5-year power cost differential for a 300MW cluster. Google, Microsoft, and Amazon have all signed multi-gigawatt nuclear PPAs specifically to lock in competitive OpEx — power location is not a facilities decision, it is a 9-figure financial decision.
01 · Storage
02 · Systems
03 · Facilities
04 · Quantum
05 · Processing
06 · Interconnects
07 · Memory
08 · TCO & Procurement
GPU Procurement Channels — Discount, Lead Time & Access · Q1 2026
Channel Discount vs. List Lead Time Access Threshold Best For
NVIDIA Direct (Preferred Partner) 5–15% 6–18 months $100M+ annual commitment Hyperscalers / Top CSPs
OEM (Dell / HPE / Lenovo / Supermicro) 3–10% 8–16 months Qualified enterprise buyer Enterprise / National Labs
ODM Direct (Quanta / Wiwynn / Foxconn) 8–18% 10–20 months Volume >$50M + technical team Hyperscalers / AI Companies
Cloud Reserved Instances (1–3yr) 30–60% vs. on-demand Immediate – 4 weeks 1–3 year commitment Startups / Variable demand
Cloud Spot / On-Demand 0% (premium pricing) Minutes Credit card / account Prototyping / burst compute
Secondary Market / Brokers −50% to +150% 1–8 weeks Cash buyers · caveat emptor Emergency / speculative only
Government Programs (DOE / DOD / NSF) 15–30% effective 12–36 months Approved research institution National Labs / Universities
H100 broker peak: $68K/unit (Q3 2023)
H100 normalized: $29K/unit (Q1 2026)
B200 allocations: 9–18 month wait (non-preferred)
AI GPU depreciation: 2–3yr accelerated
Cloud break-even vs. owned: ~14 months (1K GPU)
Global AI Infra CapEx — $B by Buyer Segment · 2026–2031F
5-Yr TCO per PFLOPS FP8 — Efficiency Trend ($M)
▲ Bull Case
$1.02T
26.8% CAGR · 2026–2031
AGI investment supercycle — scaling compute remains the decisive variable, sovereign AI adds $100B+ cumulatively, power and land become the binding constraints, not capital
● Base Case
$780B
20.1% CAGR · 2026–2031
20% CAGR sustained by GPU generational upgrades (B200 → Rubin), enterprise AI adoption broadening the buyer base, and sovereign programs from EU, UAE, India, Japan, and KSA
▼ Bear Case
$520B
10.8% CAGR · 2026–2031
Algorithmic efficiency gains (MoE, INT4 quantization) reduce hardware demand per capability; AI capex cycle cools post-2027; export controls restrict GPU shipments to key markets
Engage the Full Intelligence Report
The complete TCO & Procurement module delivers 60+ pages of financial models, build-vs-cloud break-even analyses, procurement playbooks, lease/purchase decision frameworks, and ROI benchmarks by workload — built for CFOs, infrastructure leads, and investment committees.
5-Yr TCO Model Templates
Build vs. Cloud Break-Even
Procurement Timing Playbook
ROI by Workload Benchmarks
Financing Structure Decision Guide
Power Cost Sensitivity Models
Research Inquiries
info@castlerockdigital.com
Web
castlerockdigital.com
LinkedIn