Back to providers

Inference.net

HealthyTelemetry updated 38m ago

Distributed AI inference network providing affordable access to open-source language models through a decentralized GPU marketplace, offering OpenAI-compatible API endpoints with competitive per-token pricing.

API Base URL

https://api.inference.net/v1

Authentication

api-key

Uptime (24h)

100.0%

Uptime (7d)

100.0%

Supported Regions

us-east-1eu-west-1

Latency (TTFT)

Time to first token percentiles

No latency data available

Health History

Uptime over the last 7 days

7-Day Uptime100.00% — Excellent
24-Hour Uptime100.00% — Excellent

Current Status

healthy

Last Checked

38m ago

Supported Models (1)

Models available through this provider. Click a model to view details.

ModelPricing (per 1M)Rate LimitsRegions

Llama 3.3 70B Instruct

llama-3-3-70b

In: $0.30
Out: $0.30
60 RPM / 200K TPM
us-east-1eu-west-1