GPU pricing

GPU Cloud Pricing Decisions

Use these pages when the GPU hourly rate is visible but the workload placement decision is still muddy. The useful question is usually about useful GPU-hours, data movement, provider variance, commitment, and operations.

By Andrew Cooper, Founder of RunPlacement Updated May 2026 Provider-neutral, estimate-labeled guidance Verify current provider pricing

Direct answer

GPU cloud pricing should be evaluated by useful GPU-hours, completed job cost, data movement, and operations, not by H100 hourly rate alone.

Start with the most specific page in this cluster, then move to the checklist or quiz once you have a bill line, quote, or workload constraint.

Best starting page: Useful GPU-Hour Examples

Start here if

Route yourself to the right page

This hub is a cluster map, not just a list. Use the matching page first, then follow the internal links.

Use this cluster when

  • You have two GPU quotes that do not include the same storage, transfer, support, or commitment terms.
  • The cheapest listed H100 rate looks good but the workload may queue, retry, idle, or stage data.
  • You need a buyer-friendly way to explain GPU cost beyond list price.

Common mistakes

  • Ranking providers by listed GPU rate only.
  • Ignoring data movement and persistent storage.
  • Treating managed inference and self-service GPU as operationally equivalent.

Questions answered

Top questions in this cluster

These are the concrete questions the pages below are built to answer.

Why can the cheapest H100 hourly rate be more expensive?
What should I ask before accepting a GPU cloud quote?
How do idle time and retries change GPU cost?
When does specialized GPU cloud beat default cloud?

Start here

Use the cluster in this order

  1. Open the GPU quote checklist.
  2. Compare useful GPU-hours, not just hourly price.
  3. Run the placement intake once you have one quote or bill line.

Common confusion

The listed H100 price does not explain queue time, failed jobs, storage, or egress.
Specialized GPU clouds can be cheaper, but only if the workload tolerates their ops model.
Utilization matters more than hourly price for steady inference and training.

Decision pages

Start with the closest problem

These pages answer the specific questions inside this topic cluster.

Frameworks

Define the concepts behind the answers

These pages give readers the definitions, formulas, and decision tables behind the cluster.

Provider comparisonsCloud GPU Quote Comparison: The Questions To Ask Every Providercost_breakdown

A practical checklist for comparing cloud GPU quotes across hourly rate, billing unit, storage, bandwidth, availability, support, and commitments.

Capacity decisionsGPU Utilization Break-Even: When A Cheap GPU Cloud Actually Saves Moneycost_breakdown

A practical GPU utilization break-even page for deciding when lower hourly rates outweigh idle time, retries, and operational overhead.

Capacity decisionsGPU Training Cost Breakdown: Before You Rent The Biggest GPUcost_breakdown

A practical breakdown of GPU training cost drivers, including runtime, checkpointing, failed runs, storage, data movement, and capacity planning.

Provider comparisonsCheapest H100 Cloud: Why The Lowest Price Can Be The Wrong Answerdecision

A practical decision page explaining why the cheapest H100 cloud listing may not be the cheapest workload placement.

Provider comparisonsVast.ai vs Managed GPU Cloud: When Marketplace Pricing Is Worth Itcomparison

A practical decision page for comparing Vast.ai marketplace GPU pricing with managed GPU clouds for experiments, inference, and training.

Cost breakdownsGPU Cloud Hidden Fees: The Costs Missing From The Hourly GPU Ratecost_breakdown

A checklist of GPU cloud costs that are easy to miss, including storage, bandwidth, idle time, retries, support, and commitment waste.

Provider comparisonsH100 Cloud Pricing Comparison: What To Compare Before The Hourly Ratecomparison

A practical H100 cloud pricing comparison checklist focused on useful GPU-hours, availability, storage, bandwidth, and operational tradeoffs.

Cost breakdownsGPU Cloud Pricing Checklist: What the Hourly Rate Leaves Outcost_breakdown

A checklist for comparing GPU cloud quotes beyond the hourly GPU price, including storage, bandwidth, idle time, availability, and ops.

Provider comparisonsRunPod vs Lambda vs Vast.ai: Which GPU Cloud Fits Your Workload?comparison

Compare RunPod, Lambda, and Vast.ai by workload shape, reliability needs, pricing model, and operational tolerance.

Provider comparisonsHow to Systematically Compare Cloud GPU Prices Across 20+ Providerscomparison

The real approach to comparing GPU prices on AWS, Google, Oracle, and 20+ providers—when spot/on-demand, regions, and volatility can drive 2x–8x price swings monthly. Shortcuts, tradeoffs, and decision tools.

Resources

Useful checklists

Worksheets that make this topic easier to compare with real request volume, bill lines, quotes, or workload notes.

RunPlacement quiz

Pressure-test this workload

Start with useful GPU-hours and workload tolerance before trusting a cheaper GPU listing.

Uses workload type, budget, GPU need, data movement, priority, and ops tolerance.
Use the quiz