Pricing — Volt

Pricing that scales from token to bare metal

Three ways to run frontier open-weights models in your customer's metro. Zero egress on every tier.

Save up to 60% on 36-month reserved capacity

Volt Spark

Tokens-as-a-service. OpenAI-compatible. Llama 70B standard rate.

$0.95/M

Standard rate · committed volume

Volt Forge

GPU-as-a-service. Dedicated leases in your namespace.

$2.36/GPU/hr

NVIDIA B200 · 36-mo reserved

Volt Vault

Dedicated bare-metal. Sovereign by default.

$85,000/mo

8-GPU B200 rack · 36-mo

Answers on zero egress, model catalog, pricing, and the sovereign tier behind Volt's inference cloud.

Can't find what you're looking for? Reach our team

Zero egress, in-metro serving, at Bedrock-beating prices. Your data never leaves the city.