Back to providers
NVIDIA NIM
HealthyTelemetry updated 36m agoNVIDIA's inference microservice platform providing optimized deployment of LLMs on GPU infrastructure. Offers free endpoints for select models and partner endpoints through Deep Infra, Together AI, Bitdeer, GMI Cloud, and CoreWeave.
API Base URL
https://integrate.api.nvidia.com/v1
Authentication
api-key
Uptime (24h)
100.0%
Uptime (7d)
100.0%
Supported Regions
us-east-1us-west-2global
Latency (TTFT)
Time to first token percentiles
No latency data available
Health History
Uptime over the last 7 days
7-Day Uptime100.00% — Excellent
24-Hour Uptime100.00% — Excellent
Current Status
healthy
Last Checked
36m ago
Supported Models (11)
Models available through this provider. Click a model to view details.
| Model | Pricing (per 1M) | Rate Limits | Regions |
|---|---|---|---|
DeepSeek V4 Flash deepseek-v4-flash | In: $0.00 Out: $0.00 | 100 RPM / 500K TPM | us-east-1us-west-2global |
DeepSeek V4 Pro deepseek-v4-pro | In: $0.00 Out: $0.00 | 100 RPM / 500K TPM | us-east-1us-west-2global |
Gemma 4 31B gemma-4 | In: $0.00 Out: $0.00 | 100 RPM / 500K TPM | us-east-1us-west-2global |
GLM-4.7 glm-4-7 | In: $0.00 Out: $0.00 | 100 RPM / 500K TPM | us-east-1us-west-2global |
GLM-5.1 glm-5-1 | In: $0.00 Out: $0.00 | 100 RPM / 500K TPM | us-east-1us-west-2global |
Kimi K2.6 kimi-k2-6 | In: $0.00 Out: $0.00 | 100 RPM / 500K TPM | us-east-1us-west-2global |
MiniMax M2.7 minimax-m2-7 | In: $0.00 Out: $0.00 | 100 RPM / 500K TPM | us-east-1us-west-2global |
Mistral Medium 3.5 mistral-medium-3-5 | In: $0.00 Out: $0.00 | 100 RPM / 500K TPM | us-east-1us-west-2global |
Mistral Small 4 mistral-small-4 | In: $0.00 Out: $0.00 | 100 RPM / 500K TPM | us-east-1us-west-2global |
Nemotron 3 Super 120B nemotron-3-super-120b | In: $0.00 Out: $0.00 | 100 RPM / 500K TPM | us-east-1us-west-2global |
Nemotron Nano 9B v2 nemotron-nano-9b | In: $0.00 Out: $0.00 | 100 RPM / 500K TPM | us-east-1us-west-2global |