Back to providers
Deep Infra
HealthyTelemetry updated 37m agoServerless inference platform offering fast and cost-effective access to popular open-weight models. OpenAI-compatible API with pay-per-token pricing and no minimum commitments.
API Base URL
https://api.deepinfra.com/v1/openai
Authentication
bearer
Uptime (24h)
100.0%
Uptime (7d)
100.0%
Supported Regions
us-east-1eu-west-1
Latency (TTFT)
Time to first token percentiles
No latency data available
Health History
Uptime over the last 7 days
7-Day Uptime100.00% — Excellent
24-Hour Uptime100.00% — Excellent
Current Status
healthy
Last Checked
37m ago
Supported Models (6)
Models available through this provider. Click a model to view details.
| Model | Pricing (per 1M) | Rate Limits | Regions |
|---|---|---|---|
DeepSeek R1 deepseek-r1 | In: $0.40 Out: $1.60 | 600 RPM / 1.0M TPM | us-east-1eu-west-1 |
DeepSeek V4 Flash deepseek-v4-flash | In: $0.07 Out: $0.14 | 600 RPM / 1.0M TPM | us-east-1eu-west-1 |
Gemma 4 31B gemma-4 | In: $0.10 Out: $0.20 | 600 RPM / 1.0M TPM | us-east-1eu-west-1 |
Llama 4 Maverick llama-4-maverick | In: $0.20 Out: $0.60 | 600 RPM / 1.0M TPM | us-east-1eu-west-1 |
Llama 4 Scout llama-4-scout | In: $0.06 Out: $0.18 | 600 RPM / 1.0M TPM | us-east-1eu-west-1 |
Mistral Medium 3.5 mistral-medium-3-5 | In: $0.40 Out: $1.20 | 600 RPM / 1.0M TPM | us-east-1eu-west-1 |