Back to providers

Replicate

HealthyTelemetry updated 36m ago

Cloud platform for running open-source AI models with a simple API. Hosts over 1000 community models with serverless GPU inference, pay-per-second pricing, and no infrastructure management required.

API Base URL

https://api.replicate.com/v1

Authentication

bearer

Uptime (24h)

100.0%

Uptime (7d)

100.0%

Supported Regions

us-east-1us-west-2

Latency (TTFT)

Time to first token percentiles

No latency data available

Health History

Uptime over the last 7 days

7-Day Uptime100.00% — Excellent
24-Hour Uptime100.00% — Excellent

Current Status

healthy

Last Checked

36m ago

Supported Models (2)

Models available through this provider. Click a model to view details.

ModelPricing (per 1M)Rate LimitsRegions

Llama 4 Maverick

llama-4-maverick

In: $0.30
Out: $0.95
300 RPM / 500K TPM
us-east-1us-west-2

Llama 4 Scout

llama-4-scout

In: $0.15
Out: $0.40
300 RPM / 500K TPM
us-east-1us-west-2