Pricing and Billing
Inference Pricing
Inference usage is billed in units that depend on the model type: tokens for language models, images for image and upscaling models, and minutes of audio for audio models. See the tables below for the price of each model.
Language Models
Model | Price |
---|---|
Llama-2-13B-chat | $0.24 / 1M tokens |
Llama-2-70B-chat | $0.99 / 1M tokens |
Falcon-40B-instruct | $0.66 / 1M tokens |
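
As a rough illustration of how the per-token prices above translate into a bill, here is a minimal sketch in Python. The function name `estimate_token_cost` is hypothetical and not part of any official SDK. The sketch assumes cost scales linearly with the token count. This page does not say whether prompt and completion tokens are both counted or how invoices are rounded, so treat the result as an estimate only.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the Language Models table.
PRICE_PER_MILLION_TOKENS = {
    "Llama-2-13B-chat": Decimal("0.24"),
    "Llama-2-70B-chat": Decimal("0.99"),
    "Falcon-40B-instruct": Decimal("0.66"),
}


def estimate_token_cost(model: str, tokens: int) -> Decimal:
    """Estimate the USD cost of `tokens` tokens on `model`.

    Assumption: cost is strictly proportional to the token count,
    with no minimum charge and no rounding applied.
    """
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    # Raises KeyError for a model that is not in the table.
    return PRICE_PER_MILLION_TOKENS[model] * tokens / Decimal(1_000_000)
```

Under these assumptions, 2,500,000 tokens on Llama-2-70B-chat come to about $2.475, and the same volume on Llama-2-13B-chat to about $0.60.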
Image Models
Model | Resolution | Price |
---|---|---|
Stable Diffusion XL | all resolutions | $0.0024 / image |
Upscaling Models
Model | Price |
---|---|
Real-ESRGAN | $0.003 / image |
GFPGAN | $0.004 / image |
Audio Models
Model | Mode | Price |
---|---|---|
Whisper-large-v2 or v3 | transcription or translation | $0.0010 / minute of audio |
Whisper-large-v2 or v3 | transcription or translation + alignment and/or diarization | $0.0020 / minute of audio |
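
The two Whisper rates differ only in whether alignment and/or diarization is requested. The sketch below shows the arithmetic under that reading. The function name `estimate_whisper_cost` and its parameters are hypothetical. The sketch also assumes that fractional minutes are billed proportionally, which this page does not confirm.

```python
from decimal import Decimal

# Prices in USD per minute of audio, copied from the Audio Models table.
WHISPER_BASE_PRICE = Decimal("0.0010")   # transcription or translation
WHISPER_EXTRA_PRICE = Decimal("0.0020")  # plus alignment and/or diarization


def estimate_whisper_cost(audio_minutes, alignment_or_diarization=False) -> Decimal:
    """Estimate the USD cost of processing `audio_minutes` of audio.

    Assumption: fractional minutes are billed proportionally and no
    rounding or minimum charge applies.
    """
    # Go through str() so that float inputs such as 1.5 convert cleanly.
    minutes = Decimal(str(audio_minutes))
    if minutes < 0:
        raise ValueError("audio_minutes must be non-negative")
    rate = WHISPER_EXTRA_PRICE if alignment_or_diarization else WHISPER_BASE_PRICE
    return rate * minutes
```

For example, a 90-minute recording would cost about $0.09 for plain transcription and about $0.18 with diarization enabled.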
Making Payments
A monthly invoice for your inference usage is sent to your account email address. Contact support if you have any questions.
We are working hard to improve usage metrics and payment systems for our inference services. Please bear with us while we grow!