Back to providers
Together AI
HealthyTelemetry updated 38m agoCloud platform for running and fine-tuning open-source AI models, offering competitive pricing and OpenAI-compatible API endpoints for popular open-weight models.
API Base URL
https://api.together.xyz/v1
Authentication
bearer
Uptime (24h)
100.0%
Uptime (7d)
100.0%
Supported Regions
us-east-1us-west-2
Latency (TTFT)
Time to first token percentiles
No latency data available
Health History
Uptime over the last 7 days
7-Day Uptime100.00% — Excellent
24-Hour Uptime100.00% — Excellent
Current Status
healthy
Last Checked
38m ago
Supported Models (11)
Models available through this provider. Click a model to view details.
| Model | Pricing (per 1M) | Rate Limits | Regions |
|---|---|---|---|
DeepSeek R1 deepseek-r1 | In: $3.00 Out: $7.00 | 600 RPM / 1.0M TPM | us-east-1us-west-2 |
DeepSeek V3 deepseek-v3 | In: $0.90 Out: $0.90 | 600 RPM / 1.0M TPM | us-east-1us-west-2 |
DeepSeek V4 deepseek-v4 | In: $0.90 Out: $0.90 | 600 RPM / 1.0M TPM | us-east-1us-west-2 |
Gemma 4 31B gemma-4 | In: $0.50 Out: $0.50 | 600 RPM / 1.0M TPM | us-east-1us-west-2 |
GPT-OSS 120B gpt-oss-120b | In: $1.80 Out: $1.80 | 600 RPM / 1.0M TPM | us-east-1us-west-2 |
Llama 3.3 70B Instruct llama-3-3-70b | In: $0.88 Out: $0.88 | 600 RPM / 1.0M TPM | us-east-1us-west-2 |
Llama 4 Maverick llama-4-maverick | In: $0.27 Out: $0.27 | 600 RPM / 1.0M TPM | us-east-1us-west-2 |
Llama 4 Scout llama-4-scout | In: $0.18 Out: $0.18 | 600 RPM / 1.0M TPM | us-east-1us-west-2 |
Mistral Large 3 mistral-large-3 | In: $1.80 Out: $1.80 | 600 RPM / 1.0M TPM | us-east-1us-west-2 |
Qwen 3.6 qwen3-6 | In: $0.30 Out: $0.90 | 600 RPM / 1.0M TPM | us-east-1us-west-2 |
Qwen3 Coder qwen3-coder | In: $0.20 Out: $0.60 | 600 RPM / 1.0M TPM | us-east-1us-west-2 |