Replicate
Healthy
Telemetry updated 36m ago
Cloud platform for running open-source AI models with a simple API. Hosts over 1,000 community models with serverless GPU inference, pay-per-second pricing, and no infrastructure management required.
API Base URL
https://api.replicate.com/v1
Authentication
bearer
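
As a minimal sketch of what bearer authentication against the base URL above could look like, the snippet below builds (but does not send) a request with an `Authorization: Bearer <token>` header using only the Python standard library. The `build_request` helper, the `models` path, and the `REPLICATE_API_TOKEN` environment variable name are assumptions made for illustration; this listing does not specify them, so check the provider's API reference for the actual endpoints.

```python
import os
import urllib.request

# Base URL as shown in this listing.
API_BASE_URL = "https://api.replicate.com/v1"


def build_request(path: str, token: str) -> urllib.request.Request:
    """Build a GET request that carries a bearer token.

    `path` is a hypothetical resource path relative to the base URL.
    """
    url = f"{API_BASE_URL}/{path.lstrip('/')}"
    headers = {"Authorization": f"Bearer {token}"}
    return urllib.request.Request(url, headers=headers)


# Assumption: the token is kept in an environment variable of this name.
token = os.environ.get("REPLICATE_API_TOKEN", "example-token")
request = build_request("models", token)
```

Passing `request` to `urllib.request.urlopen` would send it; that step is left out here so the sketch runs without network access or a real token.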
Uptime (24h)
100.0%
Uptime (7d)
100.0%
Supported Regions
us-east-1, us-west-2
Latency (TTFT)
Time to first token percentiles
No latency data available
Health History
Uptime over the last 7 days
7-Day Uptime: 100.00% (Excellent)
24-Hour Uptime: 100.00% (Excellent)
Current Status
healthy
Last Checked
36m ago
Supported Models (2)
Models available through this provider.
| Model | Model ID | Input price (per 1M tokens) | Output price (per 1M tokens) | Rate Limits | Regions |
|---|---|---|---|---|---|
| Llama 4 Maverick | llama-4-maverick | $0.30 | $0.95 | 300 RPM / 500K TPM | us-east-1, us-west-2 |
| Llama 4 Scout | llama-4-scout | $0.15 | $0.40 | 300 RPM / 500K TPM | us-east-1, us-west-2 |
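
To make the pricing and rate-limit figures in the table concrete, here is a rough sketch that estimates the cost of a workload and checks it against the listed limits. The prices and limits are copied from the table; the function names are hypothetical, and the sketch assumes prices apply per 1M tokens and that the caller supplies the per-minute token count in whatever way the provider measures it, which this listing does not state.

```python
# Prices in USD per 1M tokens, as (input, output), taken from the table above.
PRICING_PER_1M = {
    "llama-4-maverick": (0.30, 0.95),
    "llama-4-scout": (0.15, 0.40),
}

# Rate limits listed for both models.
RPM_LIMIT = 300        # requests per minute
TPM_LIMIT = 500_000    # tokens per minute


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    in_price, out_price = PRICING_PER_1M[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


def within_rate_limits(requests_per_minute: int, tokens_per_minute: int) -> bool:
    """Check a planned load against the listed RPM and TPM limits."""
    return requests_per_minute <= RPM_LIMIT and tokens_per_minute <= TPM_LIMIT
```

For example, 2,000,000 input tokens and 500,000 output tokens on `llama-4-maverick` come to 2 × $0.30 + 0.5 × $0.95, or about $1.075, under these assumptions.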