Models

Browse 24 canonical LLMs across all providers

Claude Opus 4.7

300K ctx

Anthropic's latest and most advanced model with state-of-the-art reasoning, coding, and analysis capabilities.

Claude Sonnet 4.6

200K ctx

Anthropic's balanced model offering strong performance with efficient cost for everyday tasks.

Claude Opus 4.6

200K ctx

Anthropic's previous flagship model with excellent reasoning and coding capabilities.

GPT-5.5 Pro

256K ctx

OpenAI's most capable model with advanced reasoning, multimodal understanding, and tool use.

GPT-5

128K ctx

OpenAI's fifth-generation model with strong general intelligence and tool use.

Gemini 3.1 Pro

2.0M ctx

Google's most advanced multimodal model with native audio, video, and code understanding.

Gemini 2.5 Pro

1.0M ctx

Google's powerful multimodal model with long context and strong reasoning.

Grok 4

256K ctx

xAI's flagship model with real-time knowledge, strong reasoning, and humor.

Grok 4.1 Fast

128K ctx

xAI's speed-optimized model for low-latency inference with strong capabilities.

Kimi K2.6

1.0M ctx

Moonshot AI's advanced model with strong multilingual and long-context capabilities.

Llama 4 Maverick

1.0M ctx

Meta's largest open-weight model, with 400B total parameters and state-of-the-art open performance.

Llama 4 Scout

1.0M ctx

Meta's efficient open-weight model optimized for deployment with strong capabilities.

Llama 3.3 70B

128K ctx

Meta's efficient 70B-parameter model with strong instruction following and coding.

Mistral Large 3

128K ctx

Mistral AI's flagship model with strong multilingual, reasoning, and coding capabilities.

Mistral Medium 3.5

128K ctx

Mistral AI's balanced model offering strong performance at moderate cost.

Nemotron 3 Super 120B

128K ctx

NVIDIA's large-scale model optimized for enterprise AI workloads and reasoning.

DeepSeek V4 Pro

128K ctx

DeepSeek's most capable model with advanced reasoning and coding at competitive pricing.

DeepSeek V3

128K ctx

DeepSeek's advanced reasoning model with strong coding capabilities at low cost.

Qwen3 Coder

131K ctx

Alibaba's specialized coding model with strong code generation and understanding.

Devstral 2

128K ctx

Mistral's specialized coding model built for software development workflows.

GPT-5.5

1.0M ctx

OpenAI's most capable model for complex real-world work including coding, research, and document creation.

Muse Spark

256K ctx

Meta Superintelligence Labs' first model with advanced reasoning, multimodal understanding, and agentic capabilities.

Qwen 3.6 35B-A3B

131K ctx

Alibaba's efficient MoE model with 35B total / 3B active parameters and frontier-level agentic coding performance.

GigaChat 3.1 Ultra

32K ctx

Sber's flagship MoE model with 702B total / 36B active parameters for multilingual workloads and reasoning.