Back to providers

Together AI

HealthyTelemetry updated 38m ago

Cloud platform for running and fine-tuning open-source AI models, offering competitive pricing and OpenAI-compatible API endpoints for popular open-weight models.

API Base URL

https://api.together.xyz/v1

Authentication

bearer

Uptime (24h)

100.0%

Uptime (7d)

100.0%

Supported Regions

us-east-1us-west-2

Latency (TTFT)

Time to first token percentiles

No latency data available

Health History

Uptime over the last 7 days

7-Day Uptime100.00% — Excellent
24-Hour Uptime100.00% — Excellent

Current Status

healthy

Last Checked

38m ago

Supported Models (11)

Models available through this provider. Click a model to view details.

ModelPricing (per 1M)Rate LimitsRegions

DeepSeek R1

deepseek-r1

In: $3.00
Out: $7.00
600 RPM / 1.0M TPM
us-east-1us-west-2

DeepSeek V3

deepseek-v3

In: $0.90
Out: $0.90
600 RPM / 1.0M TPM
us-east-1us-west-2

DeepSeek V4

deepseek-v4

In: $0.90
Out: $0.90
600 RPM / 1.0M TPM
us-east-1us-west-2

Gemma 4 31B

gemma-4

In: $0.50
Out: $0.50
600 RPM / 1.0M TPM
us-east-1us-west-2

GPT-OSS 120B

gpt-oss-120b

In: $1.80
Out: $1.80
600 RPM / 1.0M TPM
us-east-1us-west-2

Llama 3.3 70B Instruct

llama-3-3-70b

In: $0.88
Out: $0.88
600 RPM / 1.0M TPM
us-east-1us-west-2

Llama 4 Maverick

llama-4-maverick

In: $0.27
Out: $0.27
600 RPM / 1.0M TPM
us-east-1us-west-2

Llama 4 Scout

llama-4-scout

In: $0.18
Out: $0.18
600 RPM / 1.0M TPM
us-east-1us-west-2

Mistral Large 3

mistral-large-3

In: $1.80
Out: $1.80
600 RPM / 1.0M TPM
us-east-1us-west-2

Qwen 3.6

qwen3-6

In: $0.30
Out: $0.90
600 RPM / 1.0M TPM
us-east-1us-west-2

Qwen3 Coder

qwen3-coder

In: $0.20
Out: $0.60
600 RPM / 1.0M TPM
us-east-1us-west-2