Back to providers

Deep Infra

HealthyTelemetry updated 37m ago

Serverless inference platform offering fast and cost-effective access to popular open-weight models. OpenAI-compatible API with pay-per-token pricing and no minimum commitments.

API Base URL

https://api.deepinfra.com/v1/openai

Authentication

bearer

Uptime (24h)

100.0%

Uptime (7d)

100.0%

Supported Regions

us-east-1eu-west-1

Latency (TTFT)

Time to first token percentiles

No latency data available

Health History

Uptime over the last 7 days

7-Day Uptime100.00% — Excellent
24-Hour Uptime100.00% — Excellent

Current Status

healthy

Last Checked

37m ago

Supported Models (6)

Models available through this provider. Click a model to view details.

ModelPricing (per 1M)Rate LimitsRegions

DeepSeek R1

deepseek-r1

In: $0.40
Out: $1.60
600 RPM / 1.0M TPM
us-east-1eu-west-1

DeepSeek V4 Flash

deepseek-v4-flash

In: $0.07
Out: $0.14
600 RPM / 1.0M TPM
us-east-1eu-west-1

Gemma 4 31B

gemma-4

In: $0.10
Out: $0.20
600 RPM / 1.0M TPM
us-east-1eu-west-1

Llama 4 Maverick

llama-4-maverick

In: $0.20
Out: $0.60
600 RPM / 1.0M TPM
us-east-1eu-west-1

Llama 4 Scout

llama-4-scout

In: $0.06
Out: $0.18
600 RPM / 1.0M TPM
us-east-1eu-west-1

Mistral Medium 3.5

mistral-medium-3-5

In: $0.40
Out: $1.20
600 RPM / 1.0M TPM
us-east-1eu-west-1