Resource library
Checklists and worksheets for messy compute decisions.
Use these when you need to turn a vague cloud, GPU, inference, or migration question into fields a team can compare.
AI inference cost
AI inference cost assets
API, managed inference, self-hosted GPU, batch, realtime, and hybrid serving cost decisions.
GPU pricing
GPU pricing assets
Quote review, useful GPU-hours, data movement, utilization, and provider tradeoffs.
AWS bill shock
AWS bill shock assets
Line-item triage before assuming the whole cloud placement is wrong.
Cloud migration
Cloud migration assets
Exit costs, payback windows, portability, and partial move decisions.
Workload placement
Workload placement assets
The baseline worksheet for choosing a placement category before comparing vendors.
Frameworks
Definitions and formulas behind the worksheets
These framework pages support the worksheets with concise terms, examples, and decision tables.
Choose workload placement by matching the workload's cost driver, data movement, performance needs, operational tolerance, and commitment horizon to the right infrastructure category.
GPU pricing
Useful GPU-Hour Framework
Term: useful GPU-hour
Useful GPU-hour cost is the better comparison unit when GPU providers differ in utilization, queueing, reliability, storage behavior, or operational model.
AWS bill shock
Cloud Bill Shock Taxonomy
Term: cloud bill shock
Classify bill shock by driver class first: compute, network, storage, observability, managed services, support, marketplace, or commitment mismatch.
Cloud migration
Cloud Exit Payback Framework
Term: cloud exit payback
Cloud exit is financially serious only when steady savings repay migration work, data movement, service replacement, downtime risk, rollback planning, and new operations inside an acceptable window.
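The payback idea above can be sketched as a small calculation. This is a minimal illustration, not the framework's own formula: it assumes the one-time items (migration work, data movement, service replacement, a downtime risk allowance, rollback planning) are summed, and that new operations are a recurring monthly cost subtracted from savings. The function name, field names, and every figure are hypothetical.

```python
import math


def payback_months(one_time_costs, current_monthly_cost,
                   new_monthly_cost, new_monthly_ops_cost=0.0):
    """Months of net savings needed to repay the one-time exit costs.

    Assumption: payback = sum(one-time costs) / net monthly savings.
    Returns math.inf when the move never pays back.
    """
    one_time_total = sum(one_time_costs.values())
    net_monthly_savings = (current_monthly_cost
                           - new_monthly_cost
                           - new_monthly_ops_cost)
    if net_monthly_savings <= 0:
        return math.inf
    return one_time_total / net_monthly_savings


# Illustrative figures only (USD); none come from a real quote or bill.
exit_costs = {
    "migration_work": 60_000,
    "data_movement": 10_000,
    "service_replacement": 20_000,
    "downtime_risk_allowance": 5_000,
    "rollback_planning": 5_000,
}

months = payback_months(
    exit_costs,
    current_monthly_cost=50_000,
    new_monthly_cost=30_000,
    new_monthly_ops_cost=10_000,
)

acceptable_window_months = 12  # hypothetical threshold set by the team
worth_a_closer_look = months <= acceptable_window_months
```

With these made-up inputs the one-time costs total 100,000 and net savings are 10,000 per month, so payback lands at 10 months, inside the assumed 12-month window. Treating new operations as recurring rather than one-time is a modeling choice; a team that expects a large up-front hiring or tooling cost may want to add it to the one-time side as well.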
Workload placement
Managed Platform vs Infrastructure Control Framework
Term: infrastructure control premium
A managed platform is better when it removes operational work the team should not own; direct infrastructure is better when control creates enough performance, cost, or compliance value to justify the burden.
AI inference cost
AI Inference Cost Model
Term: AI inference cost
AI inference cost should be compared as effective cost per successful request and monthly serving cost, not just token price or GPU hourly rate.
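To make the AI inference cost comparison concrete, here is a rough sketch that puts an API option and a self-hosted GPU option on the same two units: monthly serving cost and effective cost per successful request. It assumes effective cost is monthly serving cost divided by the number of requests that succeed. All function names, parameters, prices, and success rates are hypothetical placeholders, not figures from any provider.

```python
def api_monthly_cost(requests, avg_tokens_per_request, price_per_1k_tokens):
    """Monthly cost for a token-priced API (assumed flat per-token pricing)."""
    return requests * avg_tokens_per_request / 1000 * price_per_1k_tokens


def self_hosted_monthly_cost(gpu_hourly_rate, gpu_count, hours, ops_cost):
    """Monthly cost for self-hosted GPUs plus an operations allowance."""
    return gpu_hourly_rate * gpu_count * hours + ops_cost


def cost_per_successful_request(monthly_cost, requests, success_rate):
    """Monthly serving cost spread over only the requests that succeed."""
    if not 0 < success_rate <= 1:
        raise ValueError("success_rate must be in (0, 1]")
    return monthly_cost / (requests * success_rate)


# Illustrative workload: one million requests per month.
requests = 1_000_000

api_cost = api_monthly_cost(requests, avg_tokens_per_request=1000,
                            price_per_1k_tokens=0.002)
hosted_cost = self_hosted_monthly_cost(gpu_hourly_rate=2.50, gpu_count=2,
                                       hours=720, ops_cost=1000)

api_effective = cost_per_successful_request(api_cost, requests, 0.98)
hosted_effective = cost_per_successful_request(hosted_cost, requests, 0.995)
```

In this made-up case the API option costs about 2,000 per month and the self-hosted option about 4,600, so the API stays cheaper per successful request even with its lower assumed success rate. Changing the request volume or the success rates can flip the result, which is the reason to compare on these units rather than on token price or GPU hourly rate alone.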
Worked examples
Useful GPU-hour math
Use this proof asset when a quote comparison needs concrete examples rather than definitions.
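As a flavor of the kind of math involved, the sketch below compares two quotes on cost per useful GPU-hour. It assumes useful GPU-hours are billed hours multiplied by the fraction of time spent on productive work (after queueing, failures, and idle time), and that storage and data movement are added to the bill for the same period. The class, field names, and all numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class GpuQuote:
    """Hypothetical quote fields; names and values are illustrative only."""
    name: str
    hourly_rate: float        # USD per billed GPU-hour
    billed_hours: float       # GPU-hours billed in the period
    useful_fraction: float    # share of billed hours doing productive work
    extra_costs: float = 0.0  # storage, data movement, etc. for the period


def cost_per_useful_gpu_hour(quote: GpuQuote) -> float:
    """Total period cost divided by the GPU-hours that did useful work."""
    if not 0 < quote.useful_fraction <= 1:
        raise ValueError("useful_fraction must be in (0, 1]")
    total_cost = quote.hourly_rate * quote.billed_hours + quote.extra_costs
    useful_hours = quote.billed_hours * quote.useful_fraction
    return total_cost / useful_hours


# Two made-up quotes for the same 1,000 billed GPU-hours.
low_rate = GpuQuote("provider_a", hourly_rate=2.00, billed_hours=1000,
                    useful_fraction=0.50, extra_costs=200.0)
high_rate = GpuQuote("provider_b", hourly_rate=3.00, billed_hours=1000,
                     useful_fraction=0.90)

low_rate_useful = cost_per_useful_gpu_hour(low_rate)
high_rate_useful = cost_per_useful_gpu_hour(high_rate)
```

Under these assumptions the quote with the lower hourly rate works out to about 4.40 per useful GPU-hour, while the higher-rate quote works out to about 3.33, so the ranking by sticker price reverses once utilization and extra costs are counted.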