115 models · 50 providers · 183 mappings

Open Registry & Telemetry
for AI Infrastructure

Name: OpenModels
Creator: OpenModels
License: https://github.com/openmodelsrun/openmodels

Discover, validate and compare LLM models, inference providers, MCP servers, and agent skills using open data and real-time telemetry. Track latency, uptime, pricing, capabilities, and provider mappings in one place.

Explore Models GitHub Docs

Tracking providers across the ecosystem

OpenAI

Anthropic

Google

AWS Bedrock

Popular Models

Top-ranked models by relevance and provider availability

View all →

DeepSeek R1

131K ctx

DeepSeek's reasoning-focused model trained with reinforcement learning for complex multi-step reasoning. Excels at math, science, and coding problems requiring chain-of-thought reasoning.

GPT-5.5 Pro

256K ctx

OpenAI's premium tier model with extended reasoning capabilities, higher accuracy on complex tasks, and priority access. Optimized for professional and enterprise workloads requiring maximum quality.

Gemini 3.1 Pro

2.0M ctx

Google's latest flagship multimodal model with state-of-the-art performance on reasoning, coding, and multimodal understanding. Features native tool use, grounding, and million-token context window.

Claude Opus 4.8

300K ctx

Anthropic's most advanced model, building on Opus 4.7 with improvements across benchmarks in coding, agentic skills, reasoning, and knowledge work. Features enhanced honesty, better tool use efficiency, dynamic workflows support, and improved alignment.

DeepSeek V4

256K ctx

DeepSeek's fourth-generation model with improved mixture-of-experts architecture, enhanced reasoning and coding capabilities, and stronger multilingual performance. Competitive with frontier proprietary models.

Gemma 4 31B

262K ctx

Google's flagship open-weight dense model with 31B parameters. All parameters active per forward pass. Ranks among top open models with strong performance on AIME 2026 (89.2%) and MMLU Pro (85.2%). Supports vision and extended context.

Nemotron 3 Ultra

1.0M ctx

NVIDIA's flagship open 550B-parameter Mixture-of-Experts model with 55B active parameters, built for frontier reasoning and orchestration in long-running agentic systems. Features hybrid Mamba-Transformer architecture, LatentMoE routing, multi-token prediction, and NVFP4 precision for 5x higher throughput. Achieves 30% lower cost-to-task-completion on agentic benchmarks. Supports 1M+ token context window with 95% accuracy on Ruler@1M.

Gemma 4 12B

262K ctx

Google's medium-size open-weight model with 12 billion parameters from the Gemma 4 family. Encoder-free unified multimodal architecture that natively processes text, image, audio, and video inputs without dedicated encoders. Features a 256K context window and supports 140+ languages. First medium-sized model capable of natively ingesting audio. Suitable for local deployment on GPUs with 16GB VRAM.

Kimi K2.7 Code

1.0M ctx

Moonshot AI's latest open-source, coding-focused model in the Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. A 1-trillion-parameter model that cuts reasoning token usage by roughly 30% versus K2.6 while improving coding and agent performance — +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite for multi-language support. Released under a Modified MIT License and available via Kimi APIs and Hugging Face.

MiniMax M3

1.0M ctx

MiniMax's frontier open-weight model with 1M-token context window, native multimodality (text, image, video), and strong coding capabilities. Built on MiniMax Sparse Attention (MSA) architecture, achieving 59% on SWE-Bench Pro with significantly improved efficiency at long context.

Mistral Medium 3.5

128K ctx

Mistral AI's balanced model offering strong multilingual performance with excellent price-performance ratio. Optimized for production workloads requiring reliable quality across European and global languages.

DeepSeek V4 Flash

1.0M ctx

DeepSeek's efficient V4 model with 284B total parameters (13B activated). Optimized for speed and cost-efficiency while maintaining strong performance. Supports 1M token context window.

Latest Insights

Analysis, benchmarks and comparisons across the LLM ecosystem

View all →

May 27, 2026·4 min read

The AI Race Is Shifting From IQ to Agentic Economics

The AI race is shifting from benchmark scores to agentic economics. Why inference costs, latency, and open-weight models are reshaping the industry in 2026.

May 15, 2026·3 min read

Stanford AI Index 2026: AI Is Scaling Faster Than Society Can Adapt

The release of the 2026 AI Index Report by Stanford HAI paints a very clear picture: artificial intelligence is no longer an emerging technology — it has become global infrastructure.

May 15, 2026·5 min read

Claude Mythos Preview: The First AI System Card That Feels Like a Warning

Anthropic’s release of the Claude Mythos Preview System Card may become one of the most important AI safety documents published so far — not because of hype, benchmark scores, or product launches, but because of its tone.

OpenModels CLI

Browse models, compare providers, and check telemetry directly from your terminal. JSON and YAML output for scripting and CI/CD.

npm install -g openmodels-cli

Learn more

npm GitHub Docs

terminal

Recently Added

Latest models added to the registry

View all →

Open Registry & Telemetryfor AI Infrastructure

Popular Models

DeepSeek R1

GPT-5.5 Pro

Gemini 3.1 Pro

Claude Opus 4.8

DeepSeek V4

Gemma 4 31B

Nemotron 3 Ultra

Gemma 4 12B

Kimi K2.7 Code

MiniMax M3

Mistral Medium 3.5

DeepSeek V4 Flash

Latest Insights

The AI Race Is Shifting From IQ to Agentic Economics

Stanford AI Index 2026: AI Is Scaling Faster Than Society Can Adapt

Claude Mythos Preview: The First AI System Card That Feels Like a Warning

OpenModels CLI

Recently Added

GLM-5.2

Sarvam-M

Sarvam-105B

Sarvam-1

Sarvam-30B

Recently Added

GLM-5.2

Sarvam-M

Sarvam-105B

Sarvam-1

Sarvam-30B

Open Registry & Telemetry
for AI Infrastructure