Back to providers

Modal

HealthyTelemetry updated 37m ago

Serverless cloud platform for running AI workloads with on-demand GPU access, offering custom model deployment and OpenAI-compatible inference endpoints with automatic scaling and pay-per-second pricing.

API Base URL

https://api.modal.com/v1

Authentication

bearer

Uptime (24h)

100.0%

Uptime (7d)

100.0%

Supported Regions

us-east-1us-west-2

Latency (TTFT)

Time to first token percentiles

No latency data available

Health History

Uptime over the last 7 days

7-Day Uptime100.00% — Excellent
24-Hour Uptime100.00% — Excellent

Current Status

healthy

Last Checked

37m ago

Supported Models (1)

Models available through this provider. Click a model to view details.

ModelPricing (per 1M)Rate LimitsRegions

Llama 3.1 8B Instruct

llama-3-1-8b

In: $0.20
Out: $0.20
120 RPM / 400K TPM
us-east-1us-west-2