Open Registry & Telemetry
for AI Infrastructure
Discover, validate and compare LLM models, inference providers, MCP servers, and agent skills using open data and real-time telemetry. Track latency, uptime, pricing, capabilities, and provider mappings in one place.
Tracking providers across the ecosystem
115
Models
50
Providers
183
Provider Mappings
$5.41
Avg $/1M Tokens
Popular Models
Top-ranked models by relevance and provider availability
DeepSeek R1
DeepSeek's reasoning-focused model trained with reinforcement learning for complex multi-step reasoning. Excels at math, science, and coding problems requiring chain-of-thought reasoning.
GPT-5.5 Pro
OpenAI's premium tier model with extended reasoning capabilities, higher accuracy on complex tasks, and priority access. Optimized for professional and enterprise workloads requiring maximum quality.
Gemini 3.1 Pro
Google's latest flagship multimodal model with state-of-the-art performance on reasoning, coding, and multimodal understanding. Features native tool use, grounding, and million-token context window.
Claude Opus 4.8
Anthropic's most advanced model, building on Opus 4.7 with improvements across benchmarks in coding, agentic skills, reasoning, and knowledge work. Features enhanced honesty, better tool use efficiency, dynamic workflows support, and improved alignment.
DeepSeek V4
DeepSeek's fourth-generation model with improved mixture-of-experts architecture, enhanced reasoning and coding capabilities, and stronger multilingual performance. Competitive with frontier proprietary models.
Gemma 4 31B
Google's flagship open-weight dense model with 31B parameters. All parameters active per forward pass. Ranks among top open models with strong performance on AIME 2026 (89.2%) and MMLU Pro (85.2%). Supports vision and extended context.
Nemotron 3 Ultra
NVIDIA's flagship open 550B-parameter Mixture-of-Experts model with 55B active parameters, built for frontier reasoning and orchestration in long-running agentic systems. Features hybrid Mamba-Transformer architecture, LatentMoE routing, multi-token prediction, and NVFP4 precision for 5x higher throughput. Achieves 30% lower cost-to-task-completion on agentic benchmarks. Supports 1M+ token context window with 95% accuracy on Ruler@1M.
Gemma 4 12B
Google's medium-size open-weight model with 12 billion parameters from the Gemma 4 family. Encoder-free unified multimodal architecture that natively processes text, image, audio, and video inputs without dedicated encoders. Features a 256K context window and supports 140+ languages. First medium-sized model capable of natively ingesting audio. Suitable for local deployment on GPUs with 16GB VRAM.
Kimi K2.7 Code
Moonshot AI's latest open-source, coding-focused model in the Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. A 1-trillion-parameter model that cuts reasoning token usage by roughly 30% versus K2.6 while improving coding and agent performance — +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite for multi-language support. Released under a Modified MIT License and available via Kimi APIs and Hugging Face.
MiniMax M3
MiniMax's frontier open-weight model with 1M-token context window, native multimodality (text, image, video), and strong coding capabilities. Built on MiniMax Sparse Attention (MSA) architecture, achieving 59% on SWE-Bench Pro with significantly improved efficiency at long context.
Latest Insights
Analysis, benchmarks and comparisons across the LLM ecosystem

The AI Race Is Shifting From IQ to Agentic Economics
The AI race is shifting from benchmark scores to agentic economics. Why inference costs, latency, and open-weight models are reshaping the industry in 2026.

Stanford AI Index 2026: AI Is Scaling Faster Than Society Can Adapt
The release of the 2026 AI Index Report by Stanford HAI paints a very clear picture: artificial intelligence is no longer an emerging technology — it has become global infrastructure.

Claude Mythos Preview: The First AI System Card That Feels Like a Warning
Anthropic’s release of the Claude Mythos Preview System Card may become one of the most important AI safety documents published so far — not because of hype, benchmark scores, or product launches, but because of its tone.
OpenModels CLI
Browse models, compare providers, and check telemetry directly from your terminal. JSON and YAML output for scripting and CI/CD.
npm install -g openmodels-cli