115 models · 50 providers · 183 mappings

Open Registry & Telemetry
for AI Infrastructure

Discover, validate and compare LLM models, inference providers, MCP servers, and agent skills using open data and real-time telemetry. Track latency, uptime, pricing, capabilities, and provider mappings in one place.

Tracking providers across the ecosystem

OpenAI
Anthropic
Google
AWS Bedrock
Meta
Groq
Mistral
DeepSeek
xAI
IBM
Azure AI
Cohere
NVIDIA
Alibaba
Xiaomi
Hugging Face
Together AI
Fireworks
Replicate
SambaNova
Scaleway
Nebius
OpenAI
Anthropic
Google
AWS Bedrock
Meta
Groq
Mistral
DeepSeek
xAI
IBM
Azure AI
Cohere
NVIDIA
Alibaba
Xiaomi
Hugging Face
Together AI
Fireworks
Replicate
SambaNova
Scaleway
Nebius

115

Models

50

Providers

183

Provider Mappings

$5.41

Avg $/1M Tokens

Popular Models

Top-ranked models by relevance and provider availability

View all →

DeepSeek R1

131K ctx

DeepSeek's reasoning-focused model trained with reinforcement learning for complex multi-step reasoning. Excels at math, science, and coding problems requiring chain-of-thought reasoning.

GPT-5.5 Pro

256K ctx

OpenAI's premium tier model with extended reasoning capabilities, higher accuracy on complex tasks, and priority access. Optimized for professional and enterprise workloads requiring maximum quality.

Gemini 3.1 Pro

2.0M ctx

Google's latest flagship multimodal model with state-of-the-art performance on reasoning, coding, and multimodal understanding. Features native tool use, grounding, and million-token context window.

Claude Opus 4.8

300K ctx

Anthropic's most advanced model, building on Opus 4.7 with improvements across benchmarks in coding, agentic skills, reasoning, and knowledge work. Features enhanced honesty, better tool use efficiency, dynamic workflows support, and improved alignment.

DeepSeek V4

256K ctx

DeepSeek's fourth-generation model with improved mixture-of-experts architecture, enhanced reasoning and coding capabilities, and stronger multilingual performance. Competitive with frontier proprietary models.

Gemma 4 31B

262K ctx

Google's flagship open-weight dense model with 31B parameters. All parameters active per forward pass. Ranks among top open models with strong performance on AIME 2026 (89.2%) and MMLU Pro (85.2%). Supports vision and extended context.

Nemotron 3 Ultra

1.0M ctx

NVIDIA's flagship open 550B-parameter Mixture-of-Experts model with 55B active parameters, built for frontier reasoning and orchestration in long-running agentic systems. Features hybrid Mamba-Transformer architecture, LatentMoE routing, multi-token prediction, and NVFP4 precision for 5x higher throughput. Achieves 30% lower cost-to-task-completion on agentic benchmarks. Supports 1M+ token context window with 95% accuracy on Ruler@1M.

Gemma 4 12B

262K ctx

Google's medium-size open-weight model with 12 billion parameters from the Gemma 4 family. Encoder-free unified multimodal architecture that natively processes text, image, audio, and video inputs without dedicated encoders. Features a 256K context window and supports 140+ languages. First medium-sized model capable of natively ingesting audio. Suitable for local deployment on GPUs with 16GB VRAM.

Kimi K2.7 Code

1.0M ctx

Moonshot AI's latest open-source, coding-focused model in the Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. A 1-trillion-parameter model that cuts reasoning token usage by roughly 30% versus K2.6 while improving coding and agent performance — +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite for multi-language support. Released under a Modified MIT License and available via Kimi APIs and Hugging Face.

MiniMax M3

1.0M ctx

MiniMax's frontier open-weight model with 1M-token context window, native multimodality (text, image, video), and strong coding capabilities. Built on MiniMax Sparse Attention (MSA) architecture, achieving 59% on SWE-Bench Pro with significantly improved efficiency at long context.

Mistral Medium 3.5

128K ctx

Mistral AI's balanced model offering strong multilingual performance with excellent price-performance ratio. Optimized for production workloads requiring reliable quality across European and global languages.

DeepSeek V4 Flash

1.0M ctx

DeepSeek's efficient V4 model with 284B total parameters (13B activated). Optimized for speed and cost-efficiency while maintaining strong performance. Supports 1M token context window.

OpenModels CLI

Browse models, compare providers, and check telemetry directly from your terminal. JSON and YAML output for scripting and CI/CD.

npm install -g openmodels-cli
terminal