Pricing and Billing
Inference usage is billed in a different unit depending on the model type: tokens for language models, images for image models, and minutes of audio for speech models. See the tables below for the price of each model.
We are working hard to improve metrics and payment systems for our inference services. Please bear with us while we grow!
A monthly invoice for your inference usage is sent to your account email address. Feel free to contact us with questions.
| Model | Price |
| --- | --- |
| Llama-2-13B-chat | $0.24 / 1M tokens |
| Llama-2-70B-chat | $0.99 / 1M tokens |
| Falcon-40B-instruct | $0.66 / 1M tokens |
| Model | Price |
| --- | --- |
| Stable Diffusion XL (all resolutions) | $0.0024 / image |
| Real-ESRGAN | $0.003 / image |
| GFPGAN | $0.004 / image |
| Model | Task | Price |
| --- | --- | --- |
| Whisper-large-v2 or v3 | transcription or translation | $0.0010 / minute of audio |
| Whisper-large-v2 or v3 | transcription or translation + alignment and/or diarization | $0.0020 / minute of audio |
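
To get a rough idea of what a monthly invoice might look like, the listed prices can be multiplied by your expected usage. The sketch below is only an illustration: the function name `estimate_cost`, the dictionary keys (including the two Whisper labels), and the usage figures are hypothetical, and it assumes cost scales linearly with usage with no minimum charge or rounding rule, which this page does not specify. It also does not say whether token counts cover input, output, or both.

```python
from decimal import Decimal

# Prices copied from this page, expressed in USD per single billing unit.
# The dictionary keys are illustrative labels, not official API model IDs.
PRICE_PER_UNIT = {
    "Llama-2-13B-chat": Decimal("0.24") / 1_000_000,     # per token
    "Llama-2-70B-chat": Decimal("0.99") / 1_000_000,     # per token
    "Falcon-40B-instruct": Decimal("0.66") / 1_000_000,  # per token
    "Stable Diffusion XL": Decimal("0.0024"),            # per image
    "Real-ESRGAN": Decimal("0.003"),                     # per image
    "GFPGAN": Decimal("0.004"),                          # per image
    "Whisper": Decimal("0.0010"),                        # per minute of audio
    "Whisper + alignment/diarization": Decimal("0.0020"),  # per minute of audio
}


def estimate_cost(usage):
    """Return the estimated cost in USD for a mapping of label -> units used.

    Units are tokens, images, or minutes of audio, depending on the model.
    Assumes simple linear pricing (an assumption, not a documented rule).
    """
    unknown = set(usage) - set(PRICE_PER_UNIT)
    if unknown:
        raise KeyError(f"No price listed for: {sorted(unknown)}")
    return sum(
        (PRICE_PER_UNIT[name] * Decimal(str(units)) for name, units in usage.items()),
        Decimal("0"),
    )


if __name__ == "__main__":
    # Hypothetical month of usage.
    example_usage = {
        "Llama-2-70B-chat": 5_000_000,  # tokens
        "Stable Diffusion XL": 1_000,   # images
        "Whisper": 600,                 # minutes of audio
    }
    print(f"{estimate_cost(example_usage):.2f}")  # prints 7.95
```

`Decimal` is used instead of floats so that small per-unit prices add up without binary rounding noise. Treat the result as an estimate; the invoice you receive is the authoritative figure.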