Back to providers

NVIDIA NIM

HealthyTelemetry updated 36m ago

NVIDIA's inference microservice platform providing optimized deployment of LLMs on GPU infrastructure. Offers free endpoints for select models and partner endpoints through Deep Infra, Together AI, Bitdeer, GMI Cloud, and CoreWeave.

API Base URL

https://integrate.api.nvidia.com/v1

Authentication

api-key

Uptime (24h)

100.0%

Uptime (7d)

100.0%

Supported Regions

us-east-1us-west-2global

Latency (TTFT)

Time to first token percentiles

No latency data available

Health History

Uptime over the last 7 days

7-Day Uptime100.00% — Excellent
24-Hour Uptime100.00% — Excellent

Current Status

healthy

Last Checked

36m ago

Supported Models (11)

Models available through this provider. Click a model to view details.

ModelPricing (per 1M)Rate LimitsRegions

DeepSeek V4 Flash

deepseek-v4-flash

In: $0.00
Out: $0.00
100 RPM / 500K TPM
us-east-1us-west-2global

DeepSeek V4 Pro

deepseek-v4-pro

In: $0.00
Out: $0.00
100 RPM / 500K TPM
us-east-1us-west-2global

Gemma 4 31B

gemma-4

In: $0.00
Out: $0.00
100 RPM / 500K TPM
us-east-1us-west-2global

GLM-4.7

glm-4-7

In: $0.00
Out: $0.00
100 RPM / 500K TPM
us-east-1us-west-2global

GLM-5.1

glm-5-1

In: $0.00
Out: $0.00
100 RPM / 500K TPM
us-east-1us-west-2global

Kimi K2.6

kimi-k2-6

In: $0.00
Out: $0.00
100 RPM / 500K TPM
us-east-1us-west-2global

MiniMax M2.7

minimax-m2-7

In: $0.00
Out: $0.00
100 RPM / 500K TPM
us-east-1us-west-2global

Mistral Medium 3.5

mistral-medium-3-5

In: $0.00
Out: $0.00
100 RPM / 500K TPM
us-east-1us-west-2global

Mistral Small 4

mistral-small-4

In: $0.00
Out: $0.00
100 RPM / 500K TPM
us-east-1us-west-2global

Nemotron 3 Super 120B

nemotron-3-super-120b

In: $0.00
Out: $0.00
100 RPM / 500K TPM
us-east-1us-west-2global

Nemotron Nano 9B v2

nemotron-nano-9b

In: $0.00
Out: $0.00
100 RPM / 500K TPM
us-east-1us-west-2global