Models

Browse 22 canonical LLMs across all providers


Qwen 3.7 Plus

131K ctx

Alibaba's multimodal variant in the Qwen 3.7 family, optimized for vision understanding and multimodal tasks. Ranked

$0.80 | $2.40 / 1M tokens

chat, completion, function-calling, vision, +2
text, image, code

Qwen 3.7 Max

131K ctx

Alibaba's flagship proprietary model engineered for advanced agentic coding, complex reasoning, and long-horizon task execution. Ranked

$1.30 | $7.80 / 1M tokens

chat, completion, function-calling, code-generation, +1
text, code

MiniCPM-V 4.6

256K ctx

Ultra-efficient multimodal language model from OpenBMB built on SigLIP2-400M and Qwen3.5-0.8B (~1B parameters). Supports single-image, multi-image, and video understanding with mixed 4x/16x visual token compression. Designed for edge deployment on iOS, Android, and HarmonyOS.

chat, completion, vision
text, image, video

DeepSeek V4 Pro

2 providers | 1.0M ctx

DeepSeek's flagship V4 model with 1.6T total parameters (49B activated). MoE architecture supporting 1M token context. Closes the gap with frontier proprietary models on reasoning and coding benchmarks.

$0.00 | $2.19 / 1M tokens

chat, completion, function-calling, code-generation, +1
text, code

DeepSeek V4 Flash

3 providers | 1.0M ctx

DeepSeek's efficient V4 model with 284B total parameters (13B activated). Optimized for speed and cost-efficiency while maintaining strong performance. Supports 1M token context window.

$0.00 | $0.40 / 1M tokens

chat, completion, function-calling, code-generation
text, code

Qwen 3.6 27B

131K ctx

Alibaba's dense 27B parameter model that outperforms its own 397B MoE predecessor on agentic coding benchmarks. Strong multilingual and reasoning capabilities released under Apache 2.0.

$0.20 | $0.60 / 1M tokens

chat, completion, function-calling, vision, +2
text, image, code

Hy3 Preview

2 providers | 256K ctx

Tencent's flagship open-weight Mixture-of-Experts model from the Hunyuan family with 295B total parameters and 21B active. Integrates fast and slow thinking modes with configurable reasoning effort. Designed for agentic workflows, cross-file code refactoring, long-document analysis, and multi-step tool use.

$0.00 | $0.28 / 1M tokens

chat, completion, function-calling, code-generation, +1
text, code

MiMo-V2.5-Pro

2 providers | 1.0M ctx

Xiaomi's flagship 1.02T-parameter Mixture-of-Experts model with 42B active parameters, built on a hybrid-attention architecture with 3-layer Multi-Token Prediction. Designed for complex agentic tasks, software engineering, and long-horizon instruction following with a 1M-token context window.

$1.00 | $3.00 / 1M tokens

chat, completion, function-calling, code-generation, +1
text, code

Qwen 3.6 35B-A3B

131K ctx

Alibaba's efficient Mixture-of-Experts model with 35B total parameters and 3B active per token. Frontier-level agentic coding performance with 73.4% on SWE-bench Verified and 92.7 on AIME 2026. Released under Apache 2.0.

$0.14 | $0.42 / 1M tokens

chat, completion, function-calling, vision, +2
text, image, code

Qwen 3.6 Plus

131K ctx

Alibaba's proprietary flagship model in the Qwen 3.6 family, targeting enterprise AI workflows with stronger agentic coding capability, visual coding support, and end-to-end enterprise engineering features.

$0.80 | $2.40 / 1M tokens

chat, completion, function-calling, vision, +2
text, image, code

MiniMax M2.7

2 providers | 200K ctx

MiniMax's latest large language model with strong multilingual and multimodal capabilities. Competitive pricing with high-quality text generation and improved reasoning performance.

$0.00 | $1.50 / 1M tokens

chat, completion, function-calling, code-generation, +1
text, code

Kimi K2.6

2 providers | 1.0M ctx

Moonshot AI's latest model with ultra-long context window support, strong reasoning capabilities, and excellent performance on complex multi-step tasks. Known for reliable long-document understanding.

$0.00 | $3.00 / 1M tokens

chat, completion, function-calling, code-generation, +1
text, code

Qwen 3.6

131K ctx

Alibaba's latest Qwen model with enhanced reasoning, multilingual capabilities, and improved instruction following. Features strong performance on coding, math, and general knowledge benchmarks.

$0.30 | $0.90 / 1M tokens

chat, completion, function-calling, code-generation, +1
text, code

GLM-5.1

2 providers | 131K ctx

Zhipu AI's latest bilingual model with strong Chinese and English capabilities. Features improved reasoning, coding, and tool use with competitive performance on academic benchmarks.

$0.00 | $3.00 / 1M tokens

chat, completion, function-calling, code-generation, +1
text, code

DeepSeek V4

6 providers | 256K ctx

DeepSeek's fourth-generation model with improved mixture-of-experts architecture, enhanced reasoning and coding capabilities, and stronger multilingual performance. Competitive with frontier proprietary models.

$0.14 | $1.10 / 1M tokens

chat, completion, function-calling, code-generation, +1
text, code

GLM-4.7

2 providers | 131K ctx

Zhipu AI's multilingual agentic coding model with strong reasoning, tool use, and UI generation capabilities. Predecessor to GLM-5.1 with competitive performance on coding benchmarks.

$0.00 | $1.50 / 1M tokens

chat, completion, function-calling, code-generation, +1
text, code

Qwen3 235B

131K ctx

Alibaba's Qwen3 235B mixture-of-experts model delivering frontier-level performance with advanced reasoning, function calling, and code generation capabilities at massive scale.

chat, completion, function-calling, reasoning, +1
text, code

Qwen3 32B

5 providers | 131K ctx

Alibaba's Qwen3 32B dense language model with strong reasoning and multilingual capabilities, supporting function calling and code generation across diverse tasks.

$0.16 | $2.24 / 1M tokens

chat, completion, function-calling, reasoning, +1
text, code

Qwen3 Coder

131K ctx

Alibaba's Qwen3 Coder model optimized for software development tasks including code generation, debugging, code review, and technical documentation with strong multilingual programming support.

$0.20 | $0.60 / 1M tokens

chat, completion, function-calling, code-generation
text, code

QwQ 32B

131K ctx

Alibaba's QwQ 32B reasoning-focused model designed for complex problem solving, mathematical reasoning, and step-by-step logical analysis with strong chain-of-thought capabilities.

chat, completion, reasoning
text

DeepSeek R1

5 providers | 131K ctx

DeepSeek's reasoning-focused model trained with reinforcement learning for complex multi-step reasoning. Excels at math, science, and coding problems requiring chain-of-thought reasoning.

$0.40 | $7.00 / 1M tokens

chat, completion, code-generation, reasoning
text, code

DeepSeek V3

2 providers | 128K ctx

DeepSeek's third-generation large language model featuring mixture-of-experts architecture, strong multilingual capabilities, and competitive performance on reasoning and coding benchmarks.

$0.27 | $1.10 / 1M tokens

chat, completion, function-calling, code-generation, +1
text, code
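
Several of the Mixture-of-Experts entries above quote both a total and an active parameter count. As a rough illustration only, the sketch below computes the share of parameters active per token from those quoted figures. The dictionary name `MOE_MODELS` and the helper functions are hypothetical, introduced here for the example; the numbers are copied from the descriptions in this listing and are not independently verified.

```python
# Total and active parameter counts in billions, copied from the model
# descriptions in this listing (1.6T -> 1600, 1.02T -> 1020).
# MOE_MODELS and both helpers are hypothetical names used for illustration.
MOE_MODELS = {
    "DeepSeek V4 Pro": (1600.0, 49.0),
    "DeepSeek V4 Flash": (284.0, 13.0),
    "Hy3 Preview": (295.0, 21.0),
    "MiMo-V2.5-Pro": (1020.0, 42.0),
    "Qwen 3.6 35B-A3B": (35.0, 3.0),
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Return the fraction of parameters that are active per token."""
    if total_b <= 0:
        raise ValueError("total parameter count must be positive")
    return active_b / total_b


def active_share_table(models: dict) -> list:
    """Return (name, fraction) pairs sorted from sparsest to densest."""
    rows = [(name, active_fraction(t, a)) for name, (t, a) in models.items()]
    return sorted(rows, key=lambda row: row[1])


if __name__ == "__main__":
    for name, frac in active_share_table(MOE_MODELS):
        print(f"{name}: {frac:.1%} of parameters active per token")
```

The active share is only a coarse proxy for per-token compute; it says nothing about memory footprint, which still scales with the total parameter count.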