GPU cloud costs vary wildly — the same H100 can cost 2-3× more depending on which provider you choose. We've been collecting spot prices every 30 minutes from seven providers since March 2026. This is what real 30-day data shows about who's actually cheapest.
TL;DR: Spot prices vary significantly — check the live comparison table for current rates.
The table below shows 30-day average spot prices per GPU model across all seven providers. ★ marks the cheapest provider for each GPU. Stability rating reflects price volatility (coefficient of variation — lower means more predictable pricing).
| Provider | H100 Spot (avg) | A100 Spot (avg) | L40S Spot (avg) | Stability |
|---|
Source: RoofRun price_snapshots, 30-day window ending 2026-06-03. ★ = cheapest for that GPU model. Raw data available via API.
On-demand pricing tells a different story — spot prices can be 40-70% cheaper than on-demand rates but come with interruption risk. For fault-tolerant workloads (training with checkpointing, batch inference, data preprocessing), spot instances offer substantial savings. For latency-sensitive or stateful workloads, on-demand is safer.
The GPU-specialist providers — RunPod, Vast.ai, and Lambda Cloud — consistently undercut the hyperscalers on spot pricing. They run leaner operations focused exclusively on GPU compute, which shows up in the numbers.
The hyperscalers (AWS, GCP, Azure) charge a premium but offer integrations, compliance, and SLA guarantees that matter for enterprise workloads. CoreWeave sits between these categories — GPU-native like the specialists, but with enterprise-grade infrastructure and network.
Spot pricing introduces interruption risk — your instance can be terminated with little notice when the provider needs capacity. The economics only make sense when your workload can handle it:
A provider with highly variable spot pricing creates planning challenges — budgets blow out when prices spike, and automation built around a "typical" price breaks. Our stability rating (coefficient of variation) measures this: a score under 10% means pricing is predictable; above 25% means expect surprises.
Check the 30-day trends page for per-provider, per-GPU volatility charts and stability ratings across all GPU models we track.
Not every workload needs an H100. The A100 and L40S often deliver 80-90% of H100 performance for inference and fine-tuning at significantly lower cost. The A10G and T4 are worth considering for smaller models or high-throughput inference where batch sizes fit.
See our H100 pricing page and A100 pricing page for current live prices and historical trends per GPU model.
Set alerts, compare live spot prices, and watch 30-day trends across all 7 providers — free.
Get Started Free →
All pricing data in this article comes from RoofRun's own polling infrastructure.
We query provider APIs and pricing pages every 30 minutes and store results in
price_snapshots. Averages reflect the trailing 30 days ending 2026-06-03.
Providers with no spot pricing for a GPU model show — in the table.
Access raw data via the public JSON API.