GUIDE · 2026

Cloud GPU Provider Comparison 2026: AWS vs GCP vs Azure vs Lambda vs CoreWeave vs RunPod vs Vast.ai

Not all GPU clouds are equal. AWS charges enterprise rates for enterprise problems you may not have. Lambda Cloud offers the simplest pricing but no spot market. CoreWeave runs rings around hyperscalers for pure GPU throughput. RunPod and Vast.ai push prices to near-hardware cost. This guide breaks down each provider with live pricing so you can pick the right one for your workload — not just the default.

📅 Updated April 28, 2026 ⏱ 12 min read Live pricing data 7 providers compared
Table of Contents
  1. Summary Comparison Table
  2. 1. Amazon Web Services (AWS)
  3. 2. Google Cloud Platform (GCP)
  4. 3. Microsoft Azure
  5. 4. Lambda Cloud
  6. 5. CoreWeave
  7. 6. RunPod
  8. 7. Vast.ai
  9. How to Choose
7
Providers Compared
49+
Live Price Points
30min
Data Refresh Rate

Summary Comparison Table

All pricing pulled live from provider APIs every 30 minutes. Price ranges reflect current spot and on-demand availability.

Provider GPUs Available Price Range Spot Support Best For
AWS H100, A100, V100, T4 $0.13–$98.32/hr ✓ (EC2 Spot) Enterprise & compliance
GCP H100, A100, L4, T4, V100 $0.16–$97.34/hr ✓ (Preemptible) Vertex AI & TPU workloads
Azure H100, A100, V100 $0.11–$29.92/hr ✓ (Spot VMs) Microsoft-stack enterprises
Lambda Cloud H100, A100, A10G $0.60–$2.50/hr ✗ (On-demand only) Researchers & flat pricing
CoreWeave H100, A100, A40, RTX 4090 $1.23–$4.76/hr ✓ Yes Inference at scale
RunPod H100, A100, RTX 4090, L40S $0.19–$3.79/hr ✓ (Community) Price-sensitive experiments
Vast.ai H100, A100, RTX 3090/4090 $0.02–$4.11/hr ✓ (Bid) Lowest cost batch jobs
Live data · Updated Jun 3, 2026, 07:03 AM UTC Full comparison table →

Amazon Web Services (AWS)

AWS is the default choice for enterprises — and the most expensive for pure GPU throughput. GPU instances run on EC2 under the p5 (H100), p4d/p4de (A100), and p3 (V100) families. SageMaker wraps these with managed ML infrastructure at an additional premium.

AWS GPU spot instances (EC2 Spot) offer meaningful discounts but require interruption handling. For production inference, most teams end up on On-Demand or Savings Plans — which narrows the price advantage significantly.

1
AWS
Amazon Web Services · EC2 GPU · SageMaker
⭐ Best for enterprise & compliance

Live Pricing

GPUSpot PriceOn-DemandSavings
NVIDIA H100 SXM80GB$31.30/hr$98.32/hrSave 68%
NVIDIA A100 40GB40GB$8.10/hr$19.22/hrSave 58%
NVIDIA A100 SXM80GB$13.07/hr$32.77/hrSave 60%
NVIDIA L40S48GB$1.82/hr$3.22/hrSave 43%
NVIDIA A10G24GB$0.68/hr$1.62/hrSave 58%
NVIDIA L424GB$0.13/hr$0.98/hrSave 87%
NVIDIA T416GB$0.67/hr$4.35/hrSave 85%
NVIDIA V10016GB$4.95/hr$12.24/hrSave 60%
Pros
  • Largest global region coverage
  • Best enterprise compliance (HIPAA, SOC 2, FedRAMP)
  • Deep ecosystem integration (S3, EKS, SageMaker)
  • Spot instances with up to 90% off on-demand
  • Reserved instances for predictable long-term cost
Cons
  • Most expensive on-demand GPU pricing
  • Complex pricing model (egress, storage, support tiers)
  • GPU availability tighter than specialized providers
  • SageMaker adds 20-30% overhead vs. raw EC2
  • Requires AWS expertise to operate efficiently
When to choose AWS

Your team already runs on AWS infrastructure, you need compliance certifications, or your SLAs require the hyperscaler-grade reliability guarantee. If you're greenfield, there are cheaper options.

Google Cloud Platform (GCP)

GCP is the strongest hyperscaler for ML workloads due to its native integration with Vertex AI, TensorFlow ecosystem, and proprietary TPU v5 access. For GPU specifically, GCP offers A100s and H100s via Compute Engine, and preemptible (spot) instances at meaningful discounts.

GCP's managed ML pipeline (Vertex AI Pipelines, Model Registry, Experiments) makes it the hyperscaler choice for teams standardizing their MLOps stack. Pure compute price-per-GPU is competitive with AWS but still trails specialized GPU clouds.

2
GCP
Google Cloud Platform · Compute Engine · Vertex AI
⭐ Best for Vertex AI & TPU workloads

Live Pricing

GPUSpot PriceOn-DemandSavings
NVIDIA H100 SXM80GB$28.96/hr$97.34/hrSave 70%
NVIDIA A100 40GB40GB$4.49/hr$14.69/hrSave 69%
NVIDIA A100 SXM80GB$1.52/hr$5.01/hrSave 70%
NVIDIA L424GB$0.28/hr$0.71/hrSave 61%
NVIDIA T416GB$0.16/hr$0.54/hrSave 71%
NVIDIA V10016GB$0.89/hr$2.95/hrSave 70%
Pros
  • Exclusive TPU v5 access (unmatched for JAX/TF)
  • Vertex AI MLOps platform (pipelines, registry, experiments)
  • Preemptible instances with solid availability
  • Strong BigQuery / data warehouse integration
  • Committed use discounts up to 57%
Cons
  • H100 availability can be constrained
  • On-demand GPU pricing similar to AWS — not cheap
  • Vertex AI adds complexity vs. raw Compute Engine
  • Egress costs add up for large data pipelines
  • Less GPU variety than CoreWeave or RunPod
When to choose GCP

You need TPUs, you're building on Vertex AI, or your data stack lives in BigQuery. For pure GPU training with no Google ecosystem lock-in, specialized providers are cheaper.

Microsoft Azure

Azure's GPU offering targets the enterprise market with NDv4 (A100) and NDv5 (H100) series. Azure Machine Learning (AML) provides end-to-end MLOps tooling, and Azure OpenAI Service gives enterprise access to GPT-4 models — making Azure the default for Microsoft-stack organizations.

Azure Spot VMs offer GPU discounts, though availability in premium GPU SKUs is often constrained. For organizations already on M365, Azure AD, and Microsoft security stack, Azure reduces operational friction even if raw GPU pricing isn't the most competitive.

3
Azure
Microsoft Azure · Azure Machine Learning · NDv4/v5
⭐ Best for Microsoft-stack enterprises

Live Pricing

GPUSpot PriceOn-DemandSavings
NVIDIA H100 SXM80GB$4.40/hr$13.96/hrSave 68%
NVIDIA A100 SXM80GB$7.29/hr$29.92/hrSave 76%
NVIDIA T416GB$0.11/hr$0.53/hrSave 80%
NVIDIA V10016GB$0.70/hr$3.37/hrSave 79%
Pros
  • Best Microsoft ecosystem integration (Teams, M365, Entra)
  • Strong compliance (FedRAMP High, HIPAA, ITAR)
  • Azure OpenAI for enterprise GPT-4 access
  • Azure ML for end-to-end MLOps
  • Hybrid connectivity with on-prem via ExpressRoute
Cons
  • Most complex billing of the three hyperscalers
  • H100 Spot VM availability often limited
  • Generally highest on-demand GPU prices
  • Azure portal UX is notoriously dense
  • Quota approval process slow for new GPU capacity
When to choose Azure

Your org runs on Microsoft infrastructure and Azure AD SSO matters more than GPU price. If you're greenfield and just need cheap GPUs, Azure is rarely the answer.

Lambda Cloud

Lambda Cloud is purpose-built for ML teams that want GPU access with zero overhead. No spot market, no complex pricing tiers — just flat on-demand rates that are substantially cheaper than AWS/GCP/Azure for equivalent hardware.

Lambda's H100 and A100 pricing regularly undercuts the hyperscalers by 40-60% on on-demand rates. No egress fees, no storage markups, no support tier upsell. This simplicity attracts researchers and ML startups that find hyperscaler complexity a tax on their time.

4
Lambda Cloud
GPU-native cloud · Flat pricing · No egress fees
⭐ Best for researchers & flat pricing

Live Pricing

GPUSpot PriceOn-DemandSavings
NVIDIA H100 SXM80GB$2.50/hr$2.49/hr
NVIDIA A100 SXM80GB$1.29/hr$1.29/hr
NVIDIA A10G24GB$0.60/hr$0.60/hr
Pros
  • Flat on-demand pricing — no pricing complexity
  • No egress fees (rare among cloud providers)
  • 40-60% cheaper than AWS/GCP for H100 on-demand
  • Pre-installed ML stack (PyTorch, CUDA, cuDNN)
  • Simple API and clean UX
Cons
  • No spot instances — on-demand only
  • Limited global region coverage
  • No managed ML services (just raw compute)
  • GPU availability can be constrained in peak periods
  • Less ecosystem than hyperscalers
When to choose Lambda Cloud

You want cheap, reliable H100/A100 access without AWS/GCP complexity. No spot tolerance required. Lambda is the "just give me a GPU" answer.

CoreWeave

CoreWeave is the GPU cloud designed by ML engineers for ML engineers. NVIDIA-partnered, Kubernetes-native, and built exclusively around GPU workloads since 2019. CoreWeave offers the widest selection of current-generation NVIDIA hardware — H100, H200, A100, A40, RTX 4090 — with both spot and reserved options.

At scale, CoreWeave competes directly with hyperscalers on raw GPU throughput while offering better performance-per-dollar. Their interconnect and NVLink topology is optimized for large model training. Best-in-class for companies running inference at serious scale.

5
CoreWeave
GPU-specialized · NVIDIA partner · Kubernetes-native
⭐ Best for inference at scale

Live Pricing

GPUSpot PriceOn-DemandSavings
NVIDIA H100 SXM80GB$2.08/hr$4.76/hrSave 56%
NVIDIA A100 SXM80GB$1.23/hr$2.21/hrSave 44%
NVIDIA L40S48GB$1.85/hr$1.84/hrSave -1%
NVIDIA A4048GB$1.28/hr$1.28/hr
Pros
  • Widest GPU selection of any provider
  • NVIDIA partnership — first access to new hardware
  • Kubernetes-native with GPU operator support
  • InfiniBand interconnect for multi-node training
  • Competitive spot pricing, strong reserved rates
Cons
  • Best rates require commitment contracts
  • Less beginner-friendly than Lambda Cloud
  • No managed ML services — you bring your own stack
  • Enterprise sales process for large allocations
  • Smaller community vs. AWS/GCP
When to choose CoreWeave

You're running production inference or large-scale training and need the best GPU-per-dollar with reliability guarantees. CoreWeave is where serious ML infrastructure teams land.

RunPod

RunPod operates a two-tier model: Secure Cloud (their own datacenter infrastructure) and Community Cloud (idle GPUs from independent hosts). Community Cloud offers the lowest spot prices in the market but without SLA guarantees. Secure Cloud provides more reliability at a modest premium.

RunPod's developer experience is excellent — one-click container deployments, serverless GPU endpoints, and a template library covering popular models (Stable Diffusion, LLaMA, Whisper). The tradeoff: community pods can disappear, and uptime SLAs aren't contractual.

6
RunPod
Secure Cloud + Community GPU marketplace
⭐ Best for price-sensitive experiments

Live Pricing

GPUSpot PriceOn-DemandSavings
NVIDIA H200141GB$0.50/hr$3.79/hrSave 87%
NVIDIA H100 PCIe80GB$1.99/hr$2.89/hrSave 31%
NVIDIA H100 SXM80GB$2.59/hr$3.19/hrSave 19%
NVIDIA A100 PCIe80GB$1.19/hr$1.39/hrSave 14%
NVIDIA A100 SXM80GB$1.00/hr$1.35/hrSave 26%
NVIDIA L40S48GB$0.79/hr$0.86/hrSave 8%
NVIDIA L4048GB$0.69/hr$0.82/hrSave 16%
NVIDIA A10G24GB$0.28/hr$0.37/hrSave 24%
NVIDIA L424GB$0.44/hr$0.39/hrSave -13%
NVIDIA V10016GB$0.19/hr$0.26/hrSave 26%
NVIDIA RTX 409024GB$0.34/hr$0.69/hrSave 51%
NVIDIA RTX 309024GB$0.22/hr$0.46/hrSave 52%
Pros
  • Lowest spot prices among reliable providers
  • Instant GPU availability (community cloud)
  • Serverless GPU endpoints for inference APIs
  • One-click templates for popular models
  • Per-second billing — no minimum commitment
Cons
  • Community pods: no SLA, variable reliability
  • Not suitable for production SLA-bound workloads
  • Smaller GPU configurations vs. hyperscalers
  • Support response slower than enterprise providers
  • Community host quality varies
When to choose RunPod

You're experimenting, fine-tuning, or running batch jobs where interruption is tolerable. RunPod's Secure Cloud works for light production use; Community Cloud is for dev/test and cost maximization.

Vast.ai

Vast.ai is a decentralized GPU marketplace where independent hosts rent out idle compute. This model pushes prices to near-hardware cost — you'll routinely find RTX 4090 and A100 instances at prices that don't exist anywhere else. The tradeoff is that you're renting from individuals, not datacenters.

Vast.ai's bidding system lets you set your price target and receive instances when matching supply appears. For overnight batch jobs, exploratory training runs, and cost-maximizing inference, Vast.ai is in a different price tier from every other option.

7
Vast.ai
Decentralized GPU marketplace · Bid system
⭐ Best for lowest-cost batch jobs

Live Pricing

GPUSpot PriceOn-DemandSavings
NVIDIA H200141GB$1.52/hr$2.13/hrSave 29%
NVIDIA H100 PCIe80GB$2.13/hr$2.99/hrSave 29%
NVIDIA H100 SXM80GB$2.93/hr$4.11/hrSave 29%
NVIDIA A100 40GB40GB$0.68/hr$1.10/hrSave 38%
NVIDIA A100 PCIe80GB$0.76/hr$1.06/hrSave 29%
NVIDIA A100 SXM80GB$0.43/hr$0.60/hrSave 29%
NVIDIA L40S48GB$0.85/hr$1.19/hrSave 29%
NVIDIA L424GB$0.18/hr$0.30/hrSave 40%
NVIDIA T416GB$0.07/hr$0.15/hrSave 53%
NVIDIA V10016GB$0.02/hr$0.03/hrSave 28%
NVIDIA RTX 409024GB$0.13/hr$0.19/hrSave 29%
NVIDIA RTX 309024GB$0.05/hr$0.07/hrSave 28%
Pros
  • Absolute lowest GPU prices available anywhere
  • Consumer GPU access (RTX 4090, RTX 3090) at scale
  • Bid system — set your price, get matched when available
  • Huge selection: hundreds of active offers simultaneously
  • Ideal for long overnight training runs
Cons
  • No uptime SLA — individual hosts, no datacenter guarantee
  • Variable network performance between hosts
  • Consumer GPUs slower per-flop than data center cards
  • Security model differs from enterprise providers
  • Not suitable for latency-sensitive production inference
When to choose Vast.ai

You want the absolute lowest price and can tolerate variable reliability. Fine-tuning, exploratory runs, and non-critical batch jobs are the sweet spot. For production inference, look at CoreWeave or RunPod Secure.

How to Choose the Right Provider

The right provider depends on your workload type, reliability requirements, and existing infrastructure. Here's the decision tree most ML teams end up using:

Enterprise with compliance requirements?

→ AWS, Azure, or GCP. Only hyperscalers have HIPAA, FedRAMP, and SOC 2 Type II. AWS if you're already in AWS. Azure if you're Microsoft-stack. GCP if you're using Vertex AI or TPUs.

ML startup or research team, no compliance needs?

→ Lambda Cloud or CoreWeave. Lambda for simplicity and flat pricing. CoreWeave for scale, GPU variety, and best inference performance. Both beat hyperscalers by 40-60% on on-demand rates.

Cost-first, interruption-tolerant workloads?

→ RunPod or Vast.ai. RunPod Secure Cloud for moderate reliability. Vast.ai for maximum savings on batch jobs where an interruption just means restarting. Expect 50-80% savings vs. on-demand hyperscalers.

Whichever provider you choose, monitor actual spot prices before you commit. Provider pricing changes frequently, and the difference between the cheapest and most expensive option for the same GPU can be 5-10× on any given day. RoofRun tracks all 7 providers every 30 minutes so you always have current data.

Compare real-time prices →

RoofRun monitors all 7 providers every 30 minutes. See live H100, A100, and L40S prices side-by-side, filtered by provider and GPU type.